Natural Language Processing Large Language Models Intermediate 1 min read

What is a human evaluation?

A process in which people judge the quality of an ML model's output; for example, having bilingual people judge the quality of an ML translation model.

human evaluation explained in plain English

A process in which people judge the quality of an ML model's output; for example, having bilingual people judge the quality of an ML translation model. Human evaluation is particularly useful for judging models that have no one right answer. Contrast with automatic evaluation and autorater evaluation.

Example

Practitioners refer to human evaluation when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.

human evaluation explained in plain English

Example

People also read