What is a self-training?
A variant of self-supervised learning that is particularly useful when all of the following conditions are true: - The ratio of unlabeled examples to labeled examples in the dataset is high.
self-training explained in plain English
A variant of self-supervised learning that is particularly useful when all of the following conditions are true: - The ratio of unlabeled examples to labeled examples in the dataset is high. - This is a classification problem. Self-training works by iterating over the following two steps until the model stops improving: 1. Use supervised machine learning to train a model on the labeled examples. 2. Use the model created in Step 1 to generate predictions (labels) on the unlabeled examples, moving those in which there is high confidence into the labeled examples with the predicted label. Notice that each iteration of Step 2 adds more labeled examples for Step 1 to train on.
Example
Practitioners refer to self-training when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
People also read
- A/B testing
A statistical way of comparing two (or more) techniques—the A and the B.
- ablation
A technique for evaluating the importance of a feature or component by temporarily removing it from a model.
- accuracy
The number of correct classification predictions divided by the total number of predictions.
- activation function
A function that enables neural networks to learn nonlinear (complex) relationships between features and the label.
- active learning
A training approach in which the algorithm chooses some of the data it learns from.
- adaptation
Synonym for tuning or fine-tuning.
- agglomerative clustering
See hierarchical clustering.
- anomaly detection
The process of identifying outliers.
- area under the PR curve
See PR AUC (Area under the PR Curve).
- area under the ROC curve
See AUC (Area under the ROC curve).