golden dataset
A set of manually curated data that captures ground truth.
Plain English Explanation
A set of manually curated data that captures ground truth. Teams can use one or more golden datasets to evaluate a model's quality. Some golden datasets capture different subdomains of ground truth. For example, a golden dataset for image classification might capture lighting conditions and image resolution.
How is it used?
Practitioners refer to golden dataset when building, training, or evaluating machine learning systems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.