What is a ROUGE-L?
A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text.
ROUGE-L explained in plain English
A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text. The following formulas calculate recall and precision for ROUGE-L:
You can then use F1 to roll up ROUGE-L recall and ROUGE-L precision into a single metric:
Consider the following reference text and generated text. Text | --- | I want to understand a wide variety of things. | I want to learn plenty of things. | Therefore: - The longest common subsequence is 5 (I want to of things) - The number of words in the reference text is 9. - The number of words in the generated text is 7. Consequently:
Example
--- ROUGE-L ignores any newlines in the reference text and generated text, so the longest common subsequence could cross multiple sentences. When the reference text and generated text involve multiple sentences, a variation of ROUGE-L called ROUGE-Lsum is generally a better metric. ROUGE-Lsum determines the longest common subsequence for each sentence in a passage and then calculates the mean of those longest common subsequences.
Consider the following reference text and generated text. Text | --- | The surface of Mars is dry. Nearly all the water is deep underground. | Mars has a dry surface. However, the vast majority of water is underground. | Therefore:
People also read
- BERT
A model architecture for text representation.
- Character N-gram F-score
A metric to evaluate machine translation models.
- Embedding
A numerical representation of text, images, or other data that captures semantic meaning.
- encoder
In general, any ML system that converts from a raw, sparse, or external representation into a more processed, denser, or more internal representation.
- language model
A model that estimates the probability of a token or sequence of tokens occurring in a longer sequence of tokens.
- rotational invariance
In an image classification problem, an algorithm's ability to successfully classify images even when the orientation of the image changes.
- ROUGE
A family of metrics that evaluate automatic summarization and machine translation models.
- ROUGE-N
A set of metrics within the ROUGE family that compares the shared N-grams of a certain size in the reference text and generated text.
- ROUGE-S
A forgiving form of ROUGE-N that enables skip-gram matching.
- sentiment analysis
Using statistical or machine learning algorithms to determine a group's overall attitude—positive or negative—toward a service, product, organization, or topic.