Built for Claude Code

Stop hitting context limits

Token Limits compresses context on every request so your Claude Code sessions last up to 2x longer. Set up in seconds, runs silently in the background.

100 free requests · No credit card · No data stored

By default, requests pass through our server for compression; nothing is stored or logged. Prefer full privacy? Use --local mode: compression runs on your machine and requests go directly to Anthropic.

terminal
$ curl -fsSL https://tokenlimits.app/api/install | bash
$ token-limits setup cc_live_a8f...
Token Limits is running
Dashboard → localhost:4800

How it works

Two commands to install. Zero changes to your workflow after that.

Transparent compression

Sits between Claude Code and Anthropic. Compresses context on every request without changing your results.
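
To see whether a shell session is routed through a proxy at all, a small check like the one below can help. It is only a sketch: it assumes the proxy is wired in through the ANTHROPIC_BASE_URL environment variable that Claude Code reads for custom endpoints, which this page does not confirm, and api_target is a hypothetical helper name.

```shell
# Report where Claude Code requests would be sent from this shell.
# Assumption: the proxy is configured via ANTHROPIC_BASE_URL (not confirmed by this page).
api_target() {
  if [ -n "${ANTHROPIC_BASE_URL:-}" ]; then
    printf 'custom endpoint: %s\n' "$ANTHROPIC_BASE_URL"
  else
    printf 'default Anthropic endpoint\n'
  fi
}

api_target
```

If the variable is unset in a freshly opened terminal, that may simply mean Token Limits configures Claude Code some other way.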

Server or local mode

Server mode works out of the box. Switch to local mode to run compression on your own machine.

Live dashboard

See compression stats, request counts, and estimated savings in real time at localhost:4800.

No quality loss

Claude receives the same meaningful information. Only noise and redundancy are removed.

Works with any plan

Free, Pro, or Teams. Works with any Claude Code plan out of the box.

Learn from failures

Run token-limits learn to analyze your sessions, find recurring tool errors, and auto-write fixes to CLAUDE.md with --apply.
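
One cautious way to use this is to keep a copy of CLAUDE.md before letting the tool rewrite it. The sketch below assumes --apply is passed to the learn subcommand as described above; backup_claude_md is a hypothetical helper, not part of the Token Limits CLI.

```shell
# Copy CLAUDE.md to CLAUDE.md.bak so applied fixes can be reviewed or reverted.
# backup_claude_md is a hypothetical helper, not a Token Limits command.
backup_claude_md() {
  file="${1:-CLAUDE.md}"
  [ -f "$file" ] || return 0          # nothing to back up yet
  cp "$file" "$file.bak"
}

if command -v token-limits >/dev/null 2>&1; then
  token-limits learn                  # analyze sessions, report recurring tool errors
  backup_claude_md CLAUDE.md
  token-limits learn --apply          # write the suggested fixes to CLAUDE.md
else
  echo "token-limits is not installed"
fi
```

Afterwards, diff CLAUDE.md.bak CLAUDE.md shows exactly what was written.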

Nothing stored

In server mode, data passes through in memory and is never written to disk. In local mode, compression runs on your machine and requests go directly to Anthropic.

localhost:4800 (running)
$34.20 estimated savings this session
Requests: 538
Compression: 70%

The dashboard runs locally, so you can see your savings in real time.

Simple pricing

One plan. Everything included.

$5/month

Everything you need. Cancel anytime.

Unlimited compression
Server + local mode
Live savings dashboard
macOS, Linux, WSL
Start free trial

100 free requests. No credit card.

Commands

Everything you need to know.

Install: curl -fsSL https://tokenlimits.app/api/install | bash
Setup: token-limits setup <your-key>
Setup (local mode): token-limits setup <your-key> --local
Start compression: token-limits start
Stop compression: token-limits stop
Update to latest: token-limits update
Remove everything: token-limits uninstall
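
For a quick health check after installing, something like the following could be used. It is a sketch under assumptions: token_limits_status is a hypothetical helper, and it treats an HTTP response from the dashboard address shown above (localhost:4800) as a sign that Token Limits is running.

```shell
# Summarize whether the CLI is on PATH and whether the dashboard answers.
# token_limits_status is a hypothetical helper; port 4800 is the dashboard address from this page.
token_limits_status() {
  if ! command -v token-limits >/dev/null 2>&1; then
    echo "not installed"
  elif curl -fsS -o /dev/null --max-time 2 http://localhost:4800 2>/dev/null; then
    echo "running"
  else
    echo "installed, dashboard not reachable"
  fi
}

token_limits_status
```

If it reports that the dashboard is not reachable, running token-limits start and checking again is a reasonable next step.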

FAQ

Yes, there is a free trial. Sign up with your email and get 100 free compressed requests, no credit card required. If you like it, upgrade for $5/month.

By default, requests are routed through our server for compression, then forwarded to Anthropic. No data is stored or logged. If you prefer, use local mode: token-limits setup <your-key> --local. In local mode, compression runs on your machine and requests go directly to Anthropic.

No, compression does not change Claude's responses. Token Limits is transparent: Claude receives the same meaningful information and responds the same way.

If Token Limits stops running, Claude Code works normally. You just lose the compression benefit until you restart Token Limits.

Token Limits works with every Claude Code plan: Free, Pro, and Teams.

Yes, Windows is supported through WSL. Claude Code runs in WSL on Windows, and Token Limits works in WSL out of the box.

If Claude Code shows an error after you set up Token Limits, it usually needs to be restarted so it picks up the proxy. Type /exit in Claude Code to quit, then start it again. If the error persists, run token-limits stop and then token-limits start to restart the proxy, then reopen Claude Code.

Run token-limits stop to pause compression. Run token-limits uninstall to remove everything.