AI Basics Large Language Models AI Companies AI Models Acronyms Beginner

GPT

What does it stand for? Generative Pre-trained Transformer

Pronunciation: /dʒiː piː tiː/

A family of large language models developed by OpenAI that generate human-like text.

Plain English Explanation

GPT (Generative Pre-trained Transformer) is a series of AI language models created by OpenAI. They are "generative" because they create text, "pre-trained" because they learn from vast datasets before being fine-tuned, and "Transformer" because they use the Transformer architecture.\n\nEach GPT version has been larger and more capable than the last, with GPT-4 demonstrating strong reasoning, coding, and analysis abilities.

Analogy

GPT is like an apprentice writer who has read the entire internet, practiced writing every type of content imaginable, and then received specialised coaching for particular tasks.

How is it used?

GPT models power ChatGPT, Microsoft Copilot, and thousands of third-party applications via the OpenAI API. Developers use them for writing, coding, analysis, and customer service.

Real-world Example

A marketing team uses GPT-4 via API to draft social media posts, analyse customer feedback, and generate product descriptions — all customised through prompt engineering.

Common Misconceptions

GPT is not a single model but a family. "GPT" is often used colloquially to mean ChatGPT, but they are related products, not the same thing.

History

GPT-1 (2018) proved the pre-training approach. GPT-2 (2019) showed scalability. GPT-3 (2020) demonstrated emergent abilities. GPT-4 (2023) added multimodal capabilities. ChatGPT (2022) made GPT accessible to the public.

Related Terms

LLM Transformer

References

Language Models are Few-Shot Learners