Glossary term
Glossary term
Architecture
GPT is a family of generative language models pre-trained on massive datasets, capable of understanding and generating human-like text for applications ranging from chatbots to summarization and AI agents.
A family of Transformer-based large language models developed by OpenAI.
GPT variants can apply to multiple modalities, including:
image generation (for example, ImageGPT)
text-to-image generation (for example, DALL-E).
OpenAI released GPT-1 in 2018, GPT-2 in 2019, GPT-3 in 2020, GPT-4 in 2023, and GPT-5 in 2025.
ChatGPT, launched November 2022, popularised the GPT family with consumers.
The original GPT architecture is now adapted across Llama, Mistral, Qwen, and DeepSeek model families.
Created for this library
An enterprise legal team uses a GPT-style model with retrieval grounding to draft contract analyses from internal clause libraries.
A startup integrates a GPT model into its customer support workflow to draft replies that agents edit and send.
A marketing team uses a GPT model to generate variants of product descriptions for editorial review.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License