ROUGE
What does it stand for? Recall-Oriented Understudy for Gisting Evaluation
A family of metrics that evaluate automatic summarization and machine translation models.
Plain English Explanation
A family of metrics that evaluate automatic summarization and machine translation models. ROUGE metrics determine the degree to which a reference text overlaps an ML model's generated text. Each member of the ROUGE family measures overlap in a different way. Higher ROUGE scores indicate more similarity between the reference text and generated text than lower ROUGE scores. Each ROUGE family member typically generates the following metrics: - Precision - Recall - F1 For details and examples, see: - ROUGE-L - ROUGE-N - ROUGE-S
How is it used?
Practitioners refer to rouge when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.