Overview
Artificially generated datasets that preserve the statistical properties of real data while protecting privacy, used for testing, development, and sharing across organisational boundaries.
More in Data Science & Analytics
Concept Drift
Statistics & MethodsChanges in the underlying patterns that a model was trained to capture, requiring model adaptation.
Data Catalogue
Data GovernanceA metadata management tool that helps organisations find, understand, and manage their data assets.
Outlier Detection
Statistics & MethodsIdentifying data points that differ significantly from other observations in a dataset.
Data Quality
Data EngineeringThe measure of data's fitness for its intended purpose based on accuracy, completeness, consistency, and timeliness.
Cohort Analysis
Applied AnalyticsA behavioural analytics technique that groups users with shared characteristics to track metrics over time.
Data Storytelling
VisualisationThe practice of building narratives around data insights using visualisations and narrative techniques.
Data Drift
Data GovernanceChanges in the statistical properties of data over time that can degrade machine learning model performance.
Data Silo
Statistics & MethodsAn isolated repository of data controlled by one department, inaccessible to other parts of the organisation.