AIExplainer
Machine Learning Beginner 1 min read

What is a training set?

The subset of the dataset used to train a model.

The subset of the dataset used to train a model. Traditionally, examples in the dataset are divided into the following three distinct subsets: - a training set - a validation set - a test set Ideally, each example in the dataset should belong to only one of the preceding subsets. For example, a single example shouldn't belong to both the training set and the validation set. See Datasets: Dividing the original dataset in Machine Learning Crash Course for more information.

Practitioners refer to training set when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.