What is a Character N-gram F-score?
A metric to evaluate machine translation models.
Character N-gram F-score explained in plain English
Character N-gram F-score is a metric for evaluating machine translation models. It measures the degree to which the N-grams in the reference text overlap the N-grams in an ML model's generated text.
Character N-gram F-score is similar to metrics in the ROUGE and BLEU families, except that:
- Character N-gram F-score operates on character N-grams.
- ROUGE and BLEU operate on word N-grams or tokens.
Example
Practitioners refer to Character N-gram F-score when building, training, or evaluating machine learning systems, most often machine translation models. The term appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.
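To make the overlap idea concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not a reference implementation: the function names `char_ngrams` and `char_ngram_fscore` are hypothetical, the score is computed for a single N-gram size (`n`), and the `beta` weight that balances recall against precision is a parameter introduced here for illustration. Published variants of the metric may average over several N-gram sizes and handle whitespace differently.

```python
from collections import Counter


def char_ngrams(text: str, n: int) -> Counter:
    """Count every contiguous run of n characters in text."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def char_ngram_fscore(reference: str, generated: str,
                      n: int = 3, beta: float = 1.0) -> float:
    """Simplified character N-gram F-score for one N-gram size.

    Assumption: beta weights recall relative to precision (beta=1 gives
    the plain harmonic mean). This is a sketch, not a library API.
    """
    ref = char_ngrams(reference, n)
    gen = char_ngrams(generated, n)
    # Counter intersection keeps the minimum count of each shared N-gram.
    overlap = sum((ref & gen).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(gen.values())  # share of generated N-grams that match
    recall = overlap / sum(ref.values())     # share of reference N-grams that match
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


if __name__ == "__main__":
    # "the cat" and "the hat" share the character trigrams "the" and "he ".
    print(round(char_ngram_fscore("the cat", "the hat"), 2))
```

In this sketch, "the cat" and "the hat" each yield five character trigrams and share two of them, so the generated text earns partial credit even though one of its two words differs from the reference. That partial credit is the practical difference from metrics that compare whole words or tokens.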
People also read
- BERT
A model architecture for text representation.
- Embedding
A numerical representation of text, images, or other data that captures semantic meaning.
- encoder
In general, any ML system that converts from a raw, sparse, or external representation into a more processed, denser, or more internal representation.
- language model
A model that estimates the probability of a token or sequence of tokens occurring in a longer sequence of tokens.
- rotational invariance
In an image classification problem, an algorithm's ability to successfully classify images even when the orientation of the image changes.
- ROUGE
A family of metrics that evaluate automatic summarization and machine translation models.
- ROUGE-L
A member of the ROUGE family focused on the length of the longest common subsequence in the reference text and generated text.
- ROUGE-N
A set of metrics within the ROUGE family that compares the shared N-grams of a certain size in the reference text and generated text.
- ROUGE-S
A forgiving form of ROUGE-N that enables skip-gram matching.
- sentiment analysis
Using statistical or machine learning algorithms to determine a group's overall attitude—positive or negative—toward a service, product, organization, or topic.