- How accurate are the token estimates?
- Estimates are within ±5% of official tokenizers for English prose. Code, JSON, and non-Latin scripts will tokenize higher than shown.
- Why do different models have different token counts?
- Each model uses its own tokenizer. GPT uses cl100k (newer: o200k), Claude uses its own vocab, Gemini and Llama differ as well. We support the latest versions.
- Can I use this offline?
- Yes. All tokenization runs in your browser — your text never leaves the page.
- What's the difference between tokens and characters?
- Roughly, one token ≈ 4 characters or ¾ of a word for English text. Tokens are what AI models "see" and what you pay for in API requests.