Overview
When a model learns the training data too well, including noise, resulting in poor performance on unseen data.
More in Machine Learning
Content-Based Filtering
Unsupervised LearningA recommendation approach that suggests items similar to those a user has previously liked, based on item attributes.
Model Serialisation
MLOps & ProductionThe process of converting a trained model into a format that can be stored, transferred, and later reconstructed for inference.
Feature Store
MLOps & ProductionA centralised repository for storing, managing, and serving machine learning features, ensuring consistency between training and inference environments across an organisation.
Class Imbalance
Feature Engineering & SelectionA situation where the distribution of classes in a dataset is significantly skewed, with some classes vastly outnumbering others.
Model Calibration
MLOps & ProductionThe process of adjusting a model's predicted probabilities so they accurately reflect the true likelihood of outcomes, essential for risk-sensitive decision-making.
K-Means Clustering
Unsupervised LearningA partitioning algorithm that divides data into k clusters by minimising the distance between points and their cluster centroids.
Multi-Task Learning
MLOps & ProductionA machine learning approach where a model is simultaneously trained on multiple related tasks to improve generalisation.
Experiment Tracking
MLOps & ProductionThe systematic recording of machine learning experiment parameters, metrics, artifacts, and code versions to enable reproducibility and comparison across training runs.