AIExplainer

ROUGE

What does it stand for? Recall-Oriented Understudy for Gisting Evaluation

A family of metrics that evaluate automatic summarization and machine translation models.

A family of metrics that evaluate automatic summarization and machine translation models. ROUGE metrics determine the degree to which a reference text overlaps an ML model's generated text. Each member of the ROUGE family measures overlap in a different way. Higher ROUGE scores indicate more similarity between the reference text and generated text than lower ROUGE scores. Each ROUGE family member typically generates the following metrics: - Precision - Recall - F1 For details and examples, see: - ROUGE-L - ROUGE-N - ROUGE-S

Practitioners refer to rouge when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.