Overview
The processes and tools for detecting, responding to, resolving, and learning from service disruptions.
More in DevOps & Infrastructure
Secret Management
CI/CDThe practice of securely storing, accessing, and managing sensitive credentials, API keys, and certificates.
Elasticity
CI/CDThe ability of a system to automatically scale resources up or down based on current demand.
Error Budget
ObservabilityThe maximum amount of time a service can be unavailable within a given period based on its SLO.
Ansible
Infrastructure as CodeAn open-source automation tool for configuration management, application deployment, and task automation.
Monitoring
ObservabilityThe continuous observation of system performance, availability, and health using automated tools and dashboards.
Playbook
CI/CDA comprehensive guide containing strategies, procedures, and best practices for managing specific operational scenarios.
Post-Mortem Analysis
CI/CDA structured review conducted after an incident to identify root causes and prevent recurrence.
Blue-Green Infrastructure
CI/CDMaintaining two identical production environments to enable instant switching between versions.