Online Learning — Technology Wiki

Overview

Direct Answer

Online learning is a machine learning paradigm in which models are continuously updated incrementally as individual data points or small batches arrive, rather than retraining on the complete dataset at once. This approach enables systems to adapt dynamically to concept drift and non-stationary environments.

How It Works

Models process each incoming observation (or mini-batch) sequentially, updating internal parameters through algorithms such as stochastic gradient descent or adaptive learning rules. The system discards or downweights old data, allowing it to reflect recent patterns whilst maintaining computational efficiency by avoiding full retraining cycles.

Why It Matters

Organisations benefit from reduced memory overhead, lower latency in adapting to changing data distributions, and the ability to process unbounded data streams in real-time. This is critical for applications where retraining on historical data is impractical or where rapid response to emerging patterns directly impacts business decisions.

Common Applications

Practical deployments include recommendation systems that personalise suggestions as user behaviour evolves, fraud detection systems that adjust to new attack patterns, sensor monitoring in IoT networks, and stock price prediction in financial markets. Autonomous vehicle perception systems and web search ranking similarly exploit this capability.

Key Considerations

Trade-offs include potential instability from individual noisy samples, difficulty in tuning hyperparameters without cross-validation datasets, and the risk of catastrophic forgetting in neural networks. Practitioners must carefully balance learning rates and implement safeguards to prevent degradation on earlier learned concepts.

Cross-References(1)

Machine Learning

Referenced By1 term mentions Online Learning

Other entries in the wiki whose definition references Online Learning — useful for understanding how this concept connects across Machine Learning and adjacent domains.

Bandit Algorithm·Machine Learning

Related in MLOps & Production

Machine Learning

A subset of AI that enables systems to automatically learn and improve from experience without being explicitly programmed.

Supervised Learning

A machine learning paradigm where models are trained on labelled data, learning to map inputs to known outputs.

Unsupervised Learning

A machine learning approach where models discover patterns and structures in data without labelled examples.

Reinforcement Learning

A machine learning paradigm where agents learn optimal behaviour through trial and error, receiving rewards or penalties.

Multi-Task Learning

A machine learning approach where a model is simultaneously trained on multiple related tasks to improve generalisation.

Batch Learning

Training a machine learning model on the entire dataset at once before deployment, as opposed to incremental updates.

Active Learning

A machine learning approach where the algorithm interactively queries a user or oracle to label new data points.

Ensemble Learning

Combining multiple machine learning models to produce better predictive performance than any single model.

Feature Selection

The process of identifying and selecting the most relevant input variables for a machine learning model.

Epoch

One complete pass through the entire training dataset during the machine learning model training process.

Model Serialisation

The process of converting a trained model into a format that can be stored, transferred, and later reconstructed for inference.

Model Serving

The infrastructure and processes for deploying trained machine learning models to production environments for real-time predictions.

More in Machine Learning

Loss Function

Training Techniques

A mathematical function that measures the difference between predicted outputs and actual target values during model training.

t-SNE

Unsupervised Learning

t-Distributed Stochastic Neighbour Embedding — a technique for visualising high-dimensional data in two or three dimensions.

Collaborative Filtering

Unsupervised Learning

A recommendation technique that makes predictions based on the collective preferences and behaviour of many users.

A/B Testing

Training Techniques

A controlled experiment comparing two variants to determine which performs better against a defined metric.

Naive Bayes

Supervised Learning

A probabilistic classifier based on applying Bayes' theorem with the assumption of independence between features.

SHAP Values

MLOps & Production

A game-theoretic approach to explaining individual model predictions by computing each feature's marginal contribution, based on Shapley values from cooperative game theory.

Gradient Descent

Training Techniques

An optimisation algorithm that iteratively adjusts parameters in the direction of steepest descent of the loss function.

Matrix Factorisation

Unsupervised Learning

A technique that decomposes a matrix into constituent matrices, widely used in recommendation systems and dimensionality reduction.