Overview
Direct Answer
Serverless computing is a cloud execution model in which developers deploy code without provisioning or managing underlying infrastructure, with the provider automatically scaling resources and charging exclusively for actual execution time measured in milliseconds.
How It Works
Developers package functions or application code, which the cloud provider executes on-demand when triggered by events such as API requests, database changes, or scheduled timers. The platform abstracts infrastructure management, automatically scaling compute capacity across multiple isolated execution environments and terminating instances immediately after code execution completes.
Why It Matters
Organisations reduce operational overhead by eliminating server management responsibilities whilst optimising costs through per-millisecond billing rather than reserved capacity. This model accelerates time-to-market for event-driven workloads and enables teams to focus engineering effort on business logic rather than infrastructure operations.
Common Applications
Real-world use cases include RESTful API backends triggered by HTTP requests, image processing pipelines activated by file uploads to cloud storage, real-time data processing from streaming sources, and scheduled batch jobs. Financial services use this model for transaction processing, whilst media and entertainment organisations employ it for content transformation workflows.
Key Considerations
Cold start latency—the delay when initialising a new execution environment—can impact latency-sensitive applications. Organisations must also evaluate vendor lock-in risks, monitor distributed function behaviour across multiple invocations, and manage stateless architecture constraints.
More in Cloud Computing
Microservices
Architecture PatternsAn architectural style structuring an application as a collection of loosely coupled, independently deployable services.
Sovereign Cloud
Strategy & EconomicsCloud infrastructure operated within national boundaries under local jurisdiction, ensuring data sovereignty, regulatory compliance, and protection from foreign government access.
Internal Developer Portal
Deployment & OperationsA centralised web interface that provides developers with self-service access to infrastructure, services, documentation, and templates within their organisation.
Load Balancer
InfrastructureA device or software that distributes network traffic across multiple servers to ensure no single server is overwhelmed.
OAuth
Strategy & EconomicsAn open standard for token-based authentication and authorisation on the internet.
Green Cloud Computing
Service ModelsCloud computing practices that minimise environmental impact through renewable energy usage, efficient cooling, workload consolidation, and carbon-aware scheduling of compute tasks.
Hypervisor
InfrastructureSoftware that creates and manages virtual machines, allowing multiple operating systems to share a single hardware host.
GPU Cloud Computing
Service ModelsCloud infrastructure providing on-demand access to graphics processing units optimised for AI training and inference, enabling organisations to scale compute without capital investment.