Overview
A method of tracking requests as they flow through distributed systems to diagnose latency and failure points.
More in DevOps & Infrastructure
Build Automation
CI/CDThe process of automating the compilation, testing, and packaging of software applications.
Chef
Infrastructure as CodeA configuration management tool using Ruby-based scripts to automate infrastructure setup and maintenance.
ChatOps
CI/CDA collaboration model connecting tools, processes, and automation with team chat platforms for operations management.
High Availability
Site ReliabilityA system design approach that ensures a certain degree of operational continuity during a given measurement period.
Vertical Scaling
CI/CDIncreasing the resources (CPU, RAM, storage) of an existing machine to handle more load.
Mean Time Between Failures
CI/CDThe average time between system failures, measuring reliability and availability.
Runbook
Site ReliabilityA documented set of procedures for handling routine operations and troubleshooting common issues.
Post-Mortem Analysis
CI/CDA structured review conducted after an incident to identify root causes and prevent recurrence.