Overview
A set of practices combining software development and IT operations to shorten the development lifecycle and deliver continuous value.
More in DevOps & Infrastructure
Site Reliability Engineering
Site ReliabilityA discipline applying software engineering principles to infrastructure and operations to create scalable, reliable systems.
Immutable Infrastructure
Infrastructure as CodeAn approach where infrastructure components are never modified after deployment but replaced entirely with updated versions.
Observability
ObservabilityThe ability to understand a system's internal state from its external outputs, encompassing metrics, logs, and traces.
Alerting
ObservabilityAutomated notifications triggered when system metrics or conditions exceed predefined thresholds.
Incident Management
Site ReliabilityThe processes and tools for detecting, responding to, resolving, and learning from service disruptions.
Helm
Containers & OrchestrationA package manager for Kubernetes that simplifies the deployment and management of applications using charts.
Blue-Green Infrastructure
CI/CDMaintaining two identical production environments to enable instant switching between versions.
Capacity Planning
Site ReliabilityThe process of determining the production capacity needed to meet changing demands for an organisation's products.