What is an ROUGE?
A family of metrics that evaluate automatic summarization and machine translation models.
Stands for: Recall-Oriented Understudy for Gisting Evaluation
ROUGE explained in plain English
A family of metrics that evaluate automatic summarization and machine translation models. ROUGE metrics determine the degree to which a reference text overlaps an ML model's generated text. Each member of the ROUGE family measures overlap in a different way. Higher ROUGE scores indicate more similarity between the reference text and generated text than lower ROUGE scores. Each ROUGE family member typically generates the following metrics: - Precision - Recall - F1 For details and examples, see: - ROUGE-L - ROUGE-N - ROUGE-S
Example
Practitioners refer to rouge when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
People also read
- BERT
A model architecture for text representation.
- BLEU
A metric between 0.
- BLEURT
A metric for evaluating machine translations from one language to another, particularly to and from English.
- Character N-gram F-score
A metric to evaluate machine translation models.
- Embedding
A numerical representation of text, images, or other data that captures semantic meaning.
- encoder
In general, any ML system that converts from a raw, sparse, or external representation into a more processed, denser, or more internal representation.
- language model
A model that estimates the probability of a token or sequence of tokens occurring in a longer sequence of tokens.
- rotational invariance
In an image classification problem, an algorithm's ability to successfully classify images even when the orientation of the image changes.
- ROUGE-L
A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text.
- ROUGE-N
A set of metrics within the ROUGE family that compares the shared N-grams of a certain size in the reference text and generated text.