What is a recall?
A metric for classification models that answers the following question: When ground truth was the positive class, what percentage of predictions did the model correctly identify as the positive class?
recall explained in plain English
A metric for classification models that answers the following question: When ground truth was the positive class, what percentage of predictions did the model correctly identify as the positive class? Here is the formula: \[\text{Recall} = \frac{\text{true positives}} {\text{true positives} + \text{false negatives}} \] where: - true positive means the model correctly predicted the positive class. - false negative means that the model mistakenly predicted the negative class. For instance, suppose your model made 200 predictions on examples for which ground truth was the positive class. Of these 200 predictions: - 180 were true positives. - 20 were false negatives. In this case: \[\text{Recall} = \frac{\text{180}} {\text{180} + \text{20}} = 0.9 \]
Recall is particularly useful for determining the predictive power of classification models in which the positive class is rare. For example, consider a class-imbalanced dataset in which the positive class for a certain disease occurs in only 10 patients out of a million. Suppose your model makes five million predictions that yield the following outcomes: - 30 True Positives - 20 False Negatives - 4,999,000 True Negatives - 950 False Positives The recall of this model is therefore:
Example
That high value of accuracy looks impressive but is essentially meaningless. Recall is a much more useful metric for class-imbalanced datasets than accuracy. --- See Classification: Accuracy, recall, precision and related metrics for more information.
People also read
- A/B testing
A statistical way of comparing two (or more) techniques—the A and the B.
- ablation
A technique for evaluating the importance of a feature or component by temporarily removing it from a model.
- accuracy
The number of correct classification predictions divided by the total number of predictions.
- activation function
A function that enables neural networks to learn nonlinear (complex) relationships between features and the label.
- active learning
A training approach in which the algorithm chooses some of the data it learns from.
- adaptation
Synonym for tuning or fine-tuning.
- agglomerative clustering
See hierarchical clustering.
- anomaly detection
The process of identifying outliers.
- area under the PR curve
See PR AUC (Area under the PR Curve).
- area under the ROC curve
See AUC (Area under the ROC curve).