Overview
The average time between system failures, measuring reliability and availability.
More in DevOps & Infrastructure
Chef
Infrastructure as CodeA configuration management tool using Ruby-based scripts to automate infrastructure setup and maintenance.
Horizontal Scaling
CI/CDAdding more machines or nodes to a system to handle increased load.
Rollback
CI/CDThe process of reverting a system to a previous version or state after a failed deployment or update.
Site Reliability Engineering
Site ReliabilityA discipline applying software engineering principles to infrastructure and operations to create scalable, reliable systems.
Blue-Green Infrastructure
CI/CDMaintaining two identical production environments to enable instant switching between versions.
Graceful Degradation
CI/CDA design approach where a system continues to operate with reduced functionality when components fail.
Metrics
ObservabilityQuantitative measurements collected over time to track system performance, health, and business outcomes.
Ansible
Infrastructure as CodeAn open-source automation tool for configuration management, application deployment, and task automation.