Overview
A comprehensive guide containing strategies, procedures, and best practices for managing specific operational scenarios.
More in DevOps & Infrastructure
Secret Management
CI/CDThe practice of securely storing, accessing, and managing sensitive credentials, API keys, and certificates.
Blue-Green Infrastructure
CI/CDMaintaining two identical production environments to enable instant switching between versions.
Observability
ObservabilityThe ability to understand a system's internal state from its external outputs, encompassing metrics, logs, and traces.
Monitoring
ObservabilityThe continuous observation of system performance, availability, and health using automated tools and dashboards.
Puppet
Infrastructure as CodeA configuration management tool that automates the provisioning and management of infrastructure.
Rollback
CI/CDThe process of reverting a system to a previous version or state after a failed deployment or update.
Prometheus
ObservabilityAn open-source monitoring and alerting toolkit designed for reliability and scalability in cloud-native environments.
Chaos Engineering
Site ReliabilityThe discipline of experimenting on distributed systems to build confidence in their ability to withstand turbulent conditions.