Claude Max Plan 5-Hour Usage Window: How It Works and How to Stay Within It
Claude Max plan gives you high token limits but with a catch: 5-hour rolling usage windows. Use too many tokens in 5 hours, you hit a temporary limit. Token Limits proxy reduces token burn so you stay within the rolling window longer. This guide explains Max5 vs Max20, what counts toward the limit, and how to code efficiently on Max.
Claude Max plans: Max5 vs Max20
| Plan | Cost | Token Limit (5h) | Use Case |
|---|---|---|---|
| Max5 | $100/month | Varies (likely 2M tokens) | Heavy single-day use, research projects |
| Max20 | $150/month | Varies (likely higher) | Heavy sustained use across multiple days |
Exact limits are not published by Anthropic. Estimates based on user reports suggest Max5 allows 2-3M tokens per 5-hour window. Max20 allows higher limits across longer periods.
How the 5-hour rolling window works
- ✓Window is rolling: Every 5 hours, your token counter resets
- ✓If you use 2.5M tokens in 4 hours, you hit the limit immediately (no reset yet)
- ✓At the 5-hour mark, tokens from the first hour fall off, freeing space
- ✓You can resume immediately after the window rolls
What counts toward the 5-hour limit?
- ✓All input tokens: Your prompts, file contents, conversation history
- ✓All output tokens: Claude's responses
- ✓Tool call results: Everything returned by grep, ls, file reads, etc.
- ✓System tokens: Internal prompts and safety tokens
Everything counts. The 5-hour window applies to total token usage, not just conversation length.
Realistic usage: When you hit the 5-hour limit
- ✓Heavy coding day: 10+ hours of Claude Code, 50+ file edits, 100+ tool calls = likely hit limit around hour 4-5
- ✓Moderate coding day: 6-8 hours of Claude Code, 20-30 file edits, 40-60 tool calls = stay under limit
- ✓Light coding day: 2-3 hours, 5-10 files, 10-20 tool calls = plenty of space
Strategies to stay within the 5-hour window
- Take breaks: Step away for 5+ minutes every 2 hours to let old tokens roll off
- Monitor usage: Keep an eye on token count (if displayed)
- Clear history: Use /clear when you finish a task to free old tokens
- Use focused prompts: Longer prompts = more tokens = hit limit faster
- Batch tool calls: Minimize grep/ls calls; combine what you can
Token Limits extends your effective window
With Token Limits proxy, tool outputs shrink 60-80%. That same 10-hour heavy coding day now fits in the 5-hour window comfortably because tool calls use 1/5 the tokens.
- Install: npm install -g token-limits
- token-limits start
- Claude Code: Tools → API URL → http://localhost:4800
- 5-hour window now acts like 15-20 hour window effectively
Example: Max5 subscriber, heavy coding day
Without Token Limits: Use 2.5M tokens by hour 4.5, hit limit, wait for window to roll, continue.
With Token Limits: Use 2.5M tokens by hour 13-14 (same work, but compression makes tokens go 3-5x further), complete all work without interruption.
Extend Max plan 5-hour window 3-5x with Token Limits
Automatic 60-80% compression on every tool call. Work longer without hitting rolling window limits. Setup: 2 minutes.
FAQ
What happens when you hit the 5-hour limit on Max?
Claude stops accepting new messages temporarily. You can not make requests until the window rolls (5 hours from your first request in that window). It is a temporary throttle, not a permanent block.
Can you extend Max beyond the 5-hour window?
No. Max5 and Max20 have rolling windows built in. The only workaround is to reduce token usage with Token Limits.
Is Max20 better than Max5?
Max20 has higher limits across a longer period, so it is better for sustained heavy use. Max5 is for intense single-day projects.
Do I need Max if I use Token Limits?
Depends on your usage. Token Limits reduces token burn but does not eliminate it. Pro plan (Sonnet 4) may be sufficient with compression. Max is for extreme heavy use.
How do I know what my current 5-hour window usage is?
Anthropic does not currently expose a token counter in Claude.ai or Claude Code. You have to estimate based on message frequency and file size.