The Technology Behind AI Chatbots
You've likely used an AI chatbot that wrote an email for you, explained a concept, or helped debug code. Behind almost all of these tools is something called a Large Language Model, or LLM. But what exactly is it, how does it work, and why does it sometimes get things embarrassingly wrong? This explainer cuts through the jargon.
What Is a Language Model?
At its core, a language model is a system trained to predict the next word in a sequence of text. Given the phrase "The sky is," a language model might predict "blue" as the most probable next word. This sounds simple — but when a model with billions of parameters is trained on vast amounts of text, something remarkable emerges: the ability to generate coherent, contextually relevant language across almost any topic.
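To make "predict the next word" concrete, here is a minimal sketch. The probability numbers are invented for illustration; a real LLM computes a distribution over tens of thousands of tokens using a neural network, not a hand-written table.

```python
# Hypothetical next-word distribution for the prompt "The sky is".
# A real model would produce these probabilities with a neural network.
next_word_probs = {
    "blue": 0.62,
    "clear": 0.18,
    "falling": 0.07,
    "gray": 0.05,
    "the": 0.02,
}

def predict_next(probs):
    """Greedy decoding: pick the single most probable next word."""
    return max(probs, key=probs.get)

print(predict_next(next_word_probs))  # -> blue
```

Picking the single most likely word every time is called greedy decoding; chatbots usually sample from the distribution instead, which is where the "temperature" setting (see the glossary below) comes in.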
What Makes an LLM "Large"?
The "large" refers to the number of parameters — essentially the numerical weights the model uses to make predictions. Modern LLMs can have tens to hundreds of billions of parameters. More parameters generally mean a greater capacity to model complex patterns in language, though they also require enormous computational resources to train and run.
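A quick back-of-the-envelope calculation shows why parameter count translates into enormous resource requirements. This sketch assumes a hypothetical 70-billion-parameter model; the only real facts used are bytes per number (2 for 16-bit precision, 4 for 32-bit).

```python
def model_memory_gb(num_params, bytes_per_param=2):
    """Rough memory needed just to store the weights.

    bytes_per_param: 2 for 16-bit (fp16/bf16) weights, 4 for 32-bit (fp32).
    Ignores activations, optimizer state, and other runtime overhead.
    """
    return num_params * bytes_per_param / 1e9

# A hypothetical 70-billion-parameter model:
print(model_memory_gb(70e9))                      # 140.0 GB in 16-bit
print(model_memory_gb(70e9, bytes_per_param=4))   # 280.0 GB in 32-bit
```

Even before any computation happens, just holding such a model in memory requires multiple high-end GPUs, which is why most people access LLMs through hosted services rather than running them locally.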
How LLMs Are Trained
- Pre-training: The model is fed huge amounts of text data (books, websites, code, etc.) and learns to predict missing or next tokens. This phase is by far the most computationally expensive and is typically done only once per model.
- Fine-tuning: The base model is then trained on narrower, curated datasets to improve performance on specific tasks or to align the model with human preferences.
- RLHF (Reinforcement Learning from Human Feedback): Human raters score outputs, and the model is updated to produce responses humans prefer. This is a key step in making models feel helpful and safe.
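The pre-training step above boils down to a simple objective: penalize the model when it assigns low probability to the token that actually came next. Here is a sketch of that loss for a single prediction step, using an invented probability distribution.

```python
import math

def next_token_loss(predicted_probs, true_token):
    """Cross-entropy loss for one prediction step.

    The lower the probability the model assigned to the token that
    actually appeared next in the training text, the higher the loss.
    Training nudges the weights to reduce this loss across billions
    of such steps.
    """
    return -math.log(predicted_probs[true_token])

# Invented distribution for "The sky is ...":
probs = {"blue": 0.62, "clear": 0.18, "falling": 0.07, "gray": 0.05, "the": 0.08}

confident = next_token_loss(probs, "blue")     # model was nearly right: small loss
surprised = next_token_loss(probs, "falling")  # model was wrong: large loss
print(confident < surprised)  # -> True
```

Fine-tuning and RLHF use different data and feedback signals, but the underlying mechanics are the same: adjust the parameters to make preferred outputs more probable.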
What LLMs Can and Can't Do
What They're Good At
- Summarizing and rephrasing text
- Writing and editing in various styles
- Answering questions based on their training data
- Generating code, translations, and structured content
- Reasoning through problems step by step
Their Limitations
- Hallucinations: LLMs can confidently state incorrect information. They generate plausible-sounding text, not verified facts.
- No real-time knowledge: Most LLMs have a training cutoff date and don't know recent events unless given tools to search the web.
- No true understanding: LLMs don't "know" things the way humans do — they model statistical relationships between words.
- Context window limits: They can only process a fixed amount of text at once; anything beyond that limit must be truncated or dropped.
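The context-window limitation has a practical consequence in chat applications: when a conversation grows too long, older messages are trimmed to make the input fit. This sketch assumes a hypothetical 8,192-token window with some room reserved for the reply; the numbers and strategy are illustrative, not any particular product's behavior.

```python
def fit_to_context(tokens, max_tokens=8_192, reserved_for_reply=256):
    """Keep only the most recent tokens that fit in the context window.

    Older conversation history is silently dropped, which is why a long
    chat can "forget" details from its beginning.
    """
    budget = max_tokens - reserved_for_reply
    return tokens[-budget:] if len(tokens) > budget else tokens

# A hypothetical 10,000-token conversation history:
history = [f"tok{i}" for i in range(10_000)]
kept = fit_to_context(history)
print(len(kept))  # -> 7936 (the oldest 2,064 tokens were dropped)
```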
Key Terms at a Glance
| Term | What It Means |
|---|---|
| Token | A chunk of text (roughly a word or part of a word) the model processes |
| Parameters | Numerical weights that define the model's behavior |
| Prompt | The input text you give the model |
| Temperature | A setting controlling how random/creative the output is |
| Context window | The maximum amount of text the model can consider at once |
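The temperature setting in the table above is easy to see in code. This sketch implements standard temperature-scaled softmax sampling over a few invented scores ("logits"); the token names and numbers are made up for illustration.

```python
import math
import random

def sample_with_temperature(logits, temperature=1.0, rng=random):
    """Softmax over temperature-scaled logits, then sample one token.

    Low temperature sharpens the distribution (output is near-deterministic);
    high temperature flattens it (output is more varied/"creative").
    """
    scaled = {tok: score / temperature for tok, score in logits.items()}
    max_score = max(scaled.values())  # subtract max for numerical stability
    exps = {tok: math.exp(s - max_score) for tok, s in scaled.items()}
    total = sum(exps.values())
    r, cumulative = rng.random(), 0.0
    for tok, e in exps.items():
        cumulative += e / total
        if r < cumulative:
            return tok
    return tok  # fallback for floating-point rounding

# Invented logits for the next token after "The sky is":
logits = {"blue": 4.0, "clear": 2.5, "falling": 1.0}

random.seed(0)
print(sample_with_temperature(logits, temperature=0.2))  # almost always "blue"
```

At temperature 0.2 the gap between scores is amplified, so "blue" is chosen with near-certainty; at temperature 2.0 the same logits would yield "clear" and even "falling" fairly often.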
Why It Matters
LLMs are reshaping how people write, code, research, and communicate. Understanding their fundamentals — not just their outputs — helps you use them more effectively and critically. Knowing that an LLM predicts text rather than "thinks" changes how you interpret its answers and when to double-check its work.