Token Limits compresses context on every request so your Claude Code sessions last up to 2x longer. Set up in seconds, runs silently in the background.
By default, requests pass through our server for compression — nothing is stored or logged. Prefer full privacy? Use --local mode to keep everything on your machine.
Two commands to install. Zero changes to your workflow after that.
Sits between Claude Code and Anthropic. Compresses context on every request without changing your results.
Server mode works out of the box. Switch to local mode to keep everything on your machine.
See compression stats, request counts, and estimated savings in real-time at localhost:4800.
Claude receives the same meaningful information. Only noise and redundancy are removed.
Free, Pro, or Teams. Works with any Claude Code plan out of the box.
Run token-limits learn to analyze your sessions, find recurring tool errors, and auto-write fixes to CLAUDE.md with --apply.
In server mode, data passes through in memory and is never written to disk. Local mode never leaves your machine.
The dashboard runs locally — see your savings in real-time.
One plan. Everything included.
Everything you need. Cancel anytime.
100 free requests. No credit card.
Everything you need to know.
curl -fsSL https://tokenlimits.app/api/install | bashInstalltoken-limits setup <your-key>Setuptoken-limits setup <your-key> --localSetup (local mode)token-limits startStart compressiontoken-limits stopStop compressiontoken-limits updateUpdate to latesttoken-limits uninstallRemove everythingYes. Sign up with your email and get 100 free compressed requests — no credit card required. If you like it, upgrade for $5/month.
By default, yes — requests are routed through our server for compression, then forwarded to Anthropic. No data is stored or logged. If you prefer, use local mode: token-limits setup <key> --local. In local mode, compression runs on your machine and requests go directly to Anthropic.
No. Token Limits is transparent. Claude receives the same meaningful information and responds the same way.
Claude Code works normally. You just lose the compression benefit until you restart it.
All of them — Free, Pro, and Teams.
Yes. Claude Code runs in WSL on Windows, and Token Limits works in WSL out of the box.
This usually means Claude Code needs to be restarted so it picks up the proxy. Type /exit in Claude Code to quit, then start it again. If the error persists, run token-limits stop and token-limits start to restart the proxy, then reopen Claude Code.
Run token-limits stop to pause compression. Run token-limits uninstall to remove everything.