Overview
The specialised hardware, software, and networking stack required to train and serve AI models at scale, including GPU clusters, high-bandwidth interconnects, and model serving frameworks.
Cross-References(2)
More in Cloud Computing
Multi-Cloud Strategy
Strategy & EconomicsAn approach that distributes workloads across multiple cloud providers to avoid vendor lock-in, optimise costs, meet regulatory requirements, and improve resilience.
Availability Zone
InfrastructureAn isolated location within a cloud region with independent power, cooling, and networking for high availability.
Container
InfrastructureA lightweight, portable software package that bundles application code with all its dependencies for consistent execution.
API
Architecture PatternsApplication Programming Interface — a set of protocols and tools for building and integrating software applications.
Event-Driven Architecture
Architecture PatternsAn architecture pattern where the flow of the system is determined by events — significant changes in state.
Spot Instances
Service ModelsSpare cloud computing capacity offered at steep discounts compared to on-demand pricing, available when the provider has excess resources but subject to interruption.
Platform Engineering
Deployment & OperationsThe practice of building and maintaining internal developer platforms that provide self-service capabilities, standardised tooling, and golden paths for software delivery teams.
Edge Computing
Architecture PatternsProcessing data near the source of data generation rather than in a centralised cloud data centre.