Token

The basic unit of text an LLM processes — typically a word fragment, not a full word.

What is Token?

LLMs do not see words; they see tokens. Tokens are pieces of text that the model's tokeniser has chunked the input into. "Cat" might be one token; "predominantly" might be three.

The rough rule: 1 token ≈ 4 characters of English text ≈ 0.75 words. A 1000-word document is about 1300–1400 tokens. Hindi, Tamil, and other Indian languages typically have higher tokens-per-character because they are less common in training data.

LLM pricing is per token (both input and output). Context windows are measured in tokens. Cost engineering = token engineering — shorter prompts are cheaper and lower latency.

Why this matters

Everything in production Gen AI economics is measured in tokens. Cost optimisation, latency, context limits all depend on token counts.

Real-world example (India)

A Chennai chatbot was costing ₹3 per conversation until the team rewrote prompts to be 60% shorter. Same quality, ₹1.20 per conversation — pure margin.

Related terms

Want to master this?

Learn Token in a structured cohort

3-month live program with mentors, real projects, and 50+ partner placement support.

View the program →

← All glossary termsLLM & Gen AI