AI Dictionary

Large Language Model (LLM)

Definition

A language model consisting of a neural network with many parameters, trained on large quantities of unlabeled text.

Deep Dive

A Large Language Model (LLM) is a neural network, typically a transformer, with billions of parameters, trained on very large quantities of unlabeled text. Training is usually self-supervised: the model learns to predict the next token in a sequence, so no human-written labels are required. Through this objective the model picks up the grammatical rules, semantic relationships, and world knowledge present in its training corpora, which lets it generate and process human language fluently and coherently.
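To make the idea of learning from unlabeled text concrete, here is a minimal sketch of next-word prediction. It is a toy and not how an LLM is built: simple bigram counts stand in for the neural network, and the function names (`train_bigram`, `next_word_probs`, `generate`) and the sample corpus are hypothetical, chosen only for illustration.

```python
from collections import Counter, defaultdict


def train_bigram(text):
    """Count how often each word follows each other word in raw, unlabeled text."""
    tokens = text.lower().split()
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_word_probs(counts, word):
    """Turn the counts for `word` into a probability distribution over next words."""
    followers = counts.get(word, Counter())
    total = sum(followers.values())
    return {w: c / total for w, c in followers.items()} if total else {}


def generate(counts, start, max_words=5):
    """Greedy generation: repeatedly append the most likely next word."""
    out = [start]
    for _ in range(max_words):
        probs = next_word_probs(counts, out[-1])
        if not probs:
            break  # no observed continuation for this word
        out.append(max(probs, key=probs.get))
    return " ".join(out)


# Assumed toy corpus; a real LLM trains on billions of tokens.
corpus = "the cat sat on the mat . the cat ate the fish ."
model = train_bigram(corpus)
print(next_word_probs(model, "the"))
print(generate(model, "cat", max_words=3))
```

The training signal comes entirely from the text itself, since each word serves as the "label" for the word before it. An LLM follows the same principle, but replaces the count table with a neural network that conditions on a long context instead of a single preceding word.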

Examples & Use Cases

1. OpenAI's GPT series (e.g., GPT-3, GPT-4), used for text generation, summarization, and coding assistance
2. Google's Gemini models (the chatbot was originally released as Bard), used for conversational AI and information retrieval
3. Meta's LLaMA, a model family with openly available weights, used by developers for various NLP applications

Related Terms

Natural Language Processing (NLP), Generative AI, Transformer