Overview
Direct Answer
Market basket analysis is a data mining technique that identifies co-occurrence patterns and association rules between items purchased or selected together in transactional datasets. It uncovers which products, services, or behaviours frequently appear in combination, enabling predictive insights about customer purchasing patterns.
How It Works
The technique applies algorithms such as Apriori or Eclat to transaction data, calculating support (frequency of item co-occurrence), confidence (conditional probability), and lift (strength of association) metrics. These metrics generate association rules—such as 'if customer purchases item A, probability of purchasing item B increases by X%'—ranked by statistical significance and business relevance.
Why It Matters
Retailers and e-commerce organisations use these insights to optimise product placement, bundle offerings, and cross-sell strategies, directly improving transaction value and inventory efficiency. The technique reduces marketing waste by identifying genuine customer affinities rather than relying on demographic assumptions, yielding measurable returns on promotional spend.
Common Applications
Supermarkets analyse checkout data to position complementary products; online retailers use insights to personalise product recommendations and design bundled offers; financial services identify cross-sell opportunities for insurance and investment products; healthcare organisations analyse patient treatment sequences to improve clinical pathways.
Key Considerations
The quality of results depends heavily on data granularity and transaction volume; sparse datasets or those with excessive noise produce unreliable patterns. Discovered associations reflect historical behaviour and may not account for seasonality, market shifts, or causal relationships, requiring domain expertise to translate into actionable strategy.
Cross-References(1)
More in Data Science & Analytics
Data Pipeline
Data EngineeringAn automated set of processes that moves and transforms data from source systems to target destinations.
Synthetic Data for Analytics
Statistics & MethodsArtificially generated datasets that preserve the statistical properties of real data while protecting privacy, used for testing, development, and sharing across organisational boundaries.
Concept Drift
Statistics & MethodsChanges in the underlying patterns that a model was trained to capture, requiring model adaptation.
Semantic Layer
Statistics & MethodsAn abstraction layer that provides business-friendly definitions and consistent metrics on top of raw data, enabling self-service analytics with standardised terminology.
Data Wrangling
Statistics & MethodsThe process of cleaning, structuring, and enriching raw data into a desired format for analysis.
OLAP
Statistics & MethodsOnline Analytical Processing — a category of software tools enabling analysis of data stored in databases for business intelligence.
Customer Analytics
Applied AnalyticsThe practice of collecting and analysing customer data to understand behaviour, preferences, and lifetime value.
Natural Language Querying
VisualisationThe ability for users to ask questions about data in plain language and receive answers, with AI translating natural language into database queries and visualisations.