AIExplainer

What is math-pass@k?

math-pass@k is a metric for an LLM's accuracy at solving math problems within k attempts. For example, math-pass@2 measures how often an LLM solves a math problem within two attempts. A score of 0.85 on math-pass@2 means that, for 85% of the problems, at least one of the model's two attempts was correct. math-pass@k is identical to the pass@k metric; the name math-pass@k is simply the one used for math evaluation.
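As a rough illustration of the definition above, the following Python sketch computes the score from graded attempts. The function names and the input layout (one list of booleans per problem, in attempt order) are assumptions made for this example and do not come from any particular evaluation library. The second function is the unbiased per-problem estimator commonly used for pass@k when more than k samples are drawn per problem; it is included only as one possible refinement.

```python
from math import comb


def math_pass_at_k(results, k):
    """Fraction of problems solved within the first k attempts.

    results: hypothetical input layout, one list of booleans per problem,
             where True means that attempt produced a correct answer.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if not results:
        raise ValueError("results must contain at least one problem")
    # A problem counts as solved if any of its first k attempts is correct.
    solved = sum(1 for attempts in results if any(attempts[:k]))
    return solved / len(results)


def pass_at_k_estimate(n, c, k):
    """Unbiased pass@k estimate for a single problem.

    n: total samples drawn, c: number of correct samples, k: attempt budget.
    Computes 1 - C(n - c, k) / C(n, k).
    """
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    return 1.0 - comb(n - c, k) / comb(n, k)


# Four problems, two graded attempts each (made-up data for illustration).
graded = [
    [True, False],   # solved on the first attempt
    [False, True],   # solved on the second attempt
    [False, False],  # never solved
    [True, True],    # solved on both attempts
]
print(math_pass_at_k(graded, 1))  # 0.5
print(math_pass_at_k(graded, 2))  # 0.75
```

In this made-up data, the second problem is missed at k = 1 but counted at k = 2, which is why the score rises as k grows.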

Practitioners refer to math-pass@k when building, training, or evaluating LLMs on math problems. It appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.