005-how-llms-work

How LLMs Really Work

Source: https://arpitbhayani.me/blogs/how-llms-work
Date: 2026-05-12

If you have used ChatGPT, Gemini, or Claude, you have already formed an intuition about what these systems do. You type something in, and text comes back that feels coherent, knowledgeable, and sometimes eerily human. But the machinery underneath is simultaneously simpler and stranger than most people expect.

Token	Logit ($x_i$)	Exponent ($e^{x_i}$)	Probability ($P_i$)
“jumps”	8.3	4023.9	90.7%
“leaped”	6.0	403.4	9.1%
“sat”	2.1	8.2	0.18%
“sleeps”	-1.5	0.2	0.005%
Sum		4435.7	100%
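
The arithmetic behind this table can be reproduced in a few lines of Python. This is a minimal sketch rather than the article's own code: the `softmax` helper and the `logits` dictionary are names chosen here for illustration, and the four-token vocabulary is the toy one from the table. One deliberate difference from the table: the sketch subtracts the largest logit before exponentiating, a common numerical-stability trick that changes the intermediate exponents but not the final probabilities.

```python
import math

def softmax(logits):
    """Map a {token: logit} dict to a {token: probability} dict."""
    # Shifting by the max logit leaves the probabilities unchanged
    # but keeps math.exp() from overflowing on large logits.
    shift = max(logits.values())
    exps = {tok: math.exp(x - shift) for tok, x in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

# Toy logits taken from the table (illustrative, not real model output).
logits = {"jumps": 8.3, "leaped": 6.0, "sat": 2.1, "sleeps": -1.5}

probs = softmax(logits)
for tok, p in probs.items():
    print(f"{tok:>7}  logit={logits[tok]:>5}  P={p:.4%}")
```

The probabilities always sum to 1, and because the exponential amplifies gaps between logits, a lead of 2.3 for “jumps” over “leaped” turns into roughly a tenfold difference in probability.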

Token	Logit	Prob ($T=1.0$)	Prob ($T=0.5$)	Prob ($T=2.0$)
“jumps”	8.3	90.7%	~99.0%	~73.0%
“leaped”	6.0	9.1%	~1.0%	~23.1%
“sat”	2.1	0.18%	~0.0%	~3.3%
“sleeps”	-1.5	0.005%	~0.0%	~0.5%
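
Temperature can be sketched the same way. The snippet below assumes the usual definition, in which every logit is divided by $T$ before the softmax, so that $P_i = e^{x_i/T} / \sum_j e^{x_j/T}$. The function name `softmax_with_temperature` and the toy `logits` dictionary are illustrative choices, not something the article defines.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Softmax over logits that have been divided by a temperature T > 0."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = {tok: x / temperature for tok, x in logits.items()}
    # Subtract the max for numerical stability; probabilities are unaffected.
    shift = max(scaled.values())
    exps = {tok: math.exp(x - shift) for tok, x in scaled.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

# Same toy logits as in the table (illustrative, not real model output).
logits = {"jumps": 8.3, "leaped": 6.0, "sat": 2.1, "sleeps": -1.5}

for t in (0.5, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    row = "  ".join(f"{tok}={p:.1%}" for tok, p in probs.items())
    print(f"T={t}: {row}")
```

Lowering $T$ stretches the gaps between logits, so the top token absorbs almost all of the probability mass. Raising $T$ compresses the gaps, so lower-ranked tokens get a larger share. The ranking of tokens never changes, only how sharply the distribution is peaked.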

How LLMs Really Work

Next-token Prediction Machine

What Training Actually Does

Logits, Softmax, and Why Probabilities Matter

Temperature

Why Outputs are Inherently Probabilistic

No Meta Knowledge

What “the model knows” Actually Means

Footnote