Machine Learning Beginner
training set
The subset of the dataset used to train a model.
Plain English Explanation
The subset of the dataset used to train a model. Traditionally, examples in the dataset are divided into the following three distinct subsets: - a training set - a validation set - a test set Ideally, each example in the dataset should belong to only one of the preceding subsets. For example, a single example shouldn't belong to both the training set and the validation set. See Datasets: Dividing the original dataset in Machine Learning Crash Course for more information.
How is it used?
Practitioners refer to training set when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.