GPT
What does it stand for? Generative Pre-trained Transformer
Pronunciation: /dʒiː piː tiː/
A family of large language models developed by OpenAI that generate human-like text.
Plain English Explanation
GPT (Generative Pre-trained Transformer) is a series of AI language models created by OpenAI. They are "generative" because they create text, "pre-trained" because they learn from vast datasets before being fine-tuned, and "Transformer" because they use the Transformer architecture.\n\nEach GPT version has been larger and more capable than the last, with GPT-4 demonstrating strong reasoning, coding, and analysis abilities.
Analogy
GPT is like an apprentice writer who has read the entire internet, practiced writing every type of content imaginable, and then received specialised coaching for particular tasks.
How is it used?
GPT models power ChatGPT, Microsoft Copilot, and thousands of third-party applications via the OpenAI API. Developers use them for writing, coding, analysis, and customer service.
Real-world Example
A marketing team uses GPT-4 via API to draft social media posts, analyse customer feedback, and generate product descriptions — all customised through prompt engineering.
Common Misconceptions
GPT is not a single model but a family. "GPT" is often used colloquially to mean ChatGPT, but they are related products, not the same thing.
History
GPT-1 (2018) proved the pre-training approach. GPT-2 (2019) showed scalability. GPT-3 (2020) demonstrated emergent abilities. GPT-4 (2023) added multimodal capabilities. ChatGPT (2022) made GPT accessible to the public.
Related Terms
See Also
Also known as: Generative Pre-trained Transformer