What does GPT stand for?

GPT stands for Generative Pre-trained Transformer. A family of large language models developed by OpenAI that generate human-like text.

AI Basics Large Language Models AI Companies AI Models Acronyms Beginner 2 min read

What is a GPT?

A family of large language models developed by OpenAI that generate human-like text.

Stands for: Generative Pre-trained Transformer

Pronunciation: /dʒiː piː tiː/

GPT explained in plain English

GPT (Generative Pre-trained Transformer) is a series of AI language models created by OpenAI. They are "generative" because they create text, "pre-trained" because they learn from vast datasets before being fine-tuned, and "Transformer" because they use the Transformer architecture.\n\nEach GPT version has been larger and more capable than the last, with GPT-4 demonstrating strong reasoning, coding, and analysis abilities.

Analogy

GPT is like an apprentice writer who has read the entire internet, practiced writing every type of content imaginable, and then received specialised coaching for particular tasks.

Example

A marketing team uses GPT-4 via API to draft social media posts, analyse customer feedback, and generate product descriptions — all customised through prompt engineering.

How is GPT used?

GPT models power ChatGPT, Microsoft Copilot, and thousands of third-party applications via the OpenAI API. Developers use them for writing, coding, analysis, and customer service.

Common misconceptions about GPT

GPT is not a single model but a family. "GPT" is often used colloquially to mean ChatGPT, but they are related products, not the same thing.

History

GPT-1 (2018) proved the pre-training approach. GPT-2 (2019) showed scalability. GPT-3 (2020) demonstrated emergent abilities. GPT-4 (2023) added multimodal capabilities. ChatGPT (2022) made GPT accessible to the public.

Related terms

LLM Transformer

Also known as

Generative Pre-trained Transformer

References

Language Models are Few-Shot Learners