AIExplainer

What is a language model?

A model that estimates the probability of a token or sequence of tokens occurring in a longer sequence of tokens.
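To make the definition concrete, here is a minimal sketch of a bigram model, one of the simplest kinds of language model. It assumes a toy whitespace-tokenized corpus and estimates probabilities from raw counts with no smoothing. The function names (`train_bigram_model`, `next_token_probability`, `sequence_probability`) are hypothetical and chosen only for illustration. Modern language models use neural networks rather than counts, but they answer the same question.

```python
from collections import Counter, defaultdict


def train_bigram_model(tokens):
    """Count how often each token follows each preceding token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token_probability(counts, prev, nxt):
    """Estimate P(nxt | prev) as a relative frequency (no smoothing)."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    if total == 0:
        return 0.0  # prev was never seen with a following token
    return followers[nxt] / total


def sequence_probability(counts, tokens):
    """Estimate the probability of a sequence, given its first token,
    as the product of its bigram probabilities."""
    prob = 1.0
    for prev, nxt in zip(tokens, tokens[1:]):
        prob *= next_token_probability(counts, prev, nxt)
    return prob


# Toy corpus (an assumption for illustration only).
corpus = "the cat sat on the mat the cat ate".split()
model = train_bigram_model(corpus)

p_next = next_token_probability(model, "the", "cat")
p_seq = sequence_probability(model, ["the", "cat", "sat"])
print(f"P(cat | the) = {p_next:.3f}")
print(f"P(cat sat | the) = {p_seq:.3f}")
```

In this corpus "the" is followed by "cat" twice and by "mat" once, so the model assigns "cat" a probability of 2/3 after "the". Any token pair that never appears in the corpus gets probability 0, which is why practical count-based models add smoothing.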

Though it may seem counterintuitive, many models that evaluate text are not language models. For example, text classification models and sentiment analysis models are not language models: they assign a label or score to a piece of text rather than estimating the probability of tokens.

See "What is a language model?" in Machine Learning Crash Course for more information.

Practitioners use the term "language model" when building, training, or evaluating machine learning systems. The term appears in research papers, product documentation, and technical discussions about AI capabilities and limitations.