AIExplainer

golden response

A response known to be good.

A response known to be good. For example, given the following prompt: 2 + 2 The golden response is hopefully: 4

Some evaluation metrics, such as ROUGE, compare reference text to a model's generated text. When there is a single right answer to a prompt, the golden response typically serves as the reference text. Some prompts have no one right answer. For example, the prompt Summarize this document would likely have many right answers. For such prompts, reference text is often impractical because a model can generate a very wide range of possible summaries. However, a golden response might be helpful in this situation. For example, a golden response containing a good document summary can help train an autorater to discover patterns of good document summaries. ---

Practitioners refer to golden response when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.