What is a Clustering?
Grouping similar items together without predefined categories, discovering natural clusters based on shared characteristics.
Clustering explained in plain English
Clustering groups similar items together without predefined categories. The system discovers natural clusters in data based on shared characteristics.
It is a core technique in unsupervised machine learning.
Analogy
Clustering is like organising a mixed box of photographs into piles of similar scenes — beaches, birthdays, landscapes — without anyone telling you the categories in advance.
Example
A retailer clusters shoppers into behaviour groups to tailor campaigns without manually defining segments upfront.
How is Clustering used?
Marketing teams segment customers by behaviour. News apps group related articles. Gene research groups similar DNA sequences.
Common misconceptions about Clustering
Clusters are not always meaningful — algorithms will group data even when no natural structure exists.
People also read
- A/B testing
A statistical way of comparing two (or more) techniques—the A and the B.
- ablation
A technique for evaluating the importance of a feature or component by temporarily removing it from a model.
- accuracy
The number of correct classification predictions divided by the total number of predictions.
- activation function
A function that enables neural networks to learn nonlinear (complex) relationships between features and the label.
- active learning
A training approach in which the algorithm chooses some of the data it learns from.
- adaptation
Synonym for tuning or fine-tuning.
- agglomerative clustering
See hierarchical clustering.
- anomaly detection
The process of identifying outliers.
- area under the PR curve
See PR AUC (Area under the PR Curve).
- area under the ROC curve
See AUC (Area under the ROC curve).