Developer Utility

See how rate limits behave before you ship them

Model token bucket, fixed window, and sliding window algorithms. Fire requests at different speeds. Watch what gets through and what gets blocked. All in your browser.

Open Simulator View Scenarios

Configure Your Limiter

Algorithm

Max Tokens / Requests

Window Size (ms)

Refill Rate (tokens/sec)

Request Pattern

Total Requests

Interval Between Requests (ms)

Load Preset

Timeline

Allowed: 0 Rejected: 0 Total: 0

Configure your limiter and press Run to see results.

Allowed Rejected Window Boundary

Scenario Walkthroughs

These examples show common situations engineers run into. Load one to see what happens.

The Burst at Window Edge

Fixed window counters reset at exact time boundaries. If a burst arrives right at the reset, it can get through even if the previous window was full. This can double your expected rate for a brief moment.

Slow Drain vs Sudden Flood

Token bucket refills slowly. A steady trickle of requests passes fine, but a sudden flood empties the bucket and everything after gets rejected until tokens refill. This is the most common surprise in production.

Sliding Window Fairness

Sliding window tracks each request timestamp. It avoids the boundary problem of fixed window but uses more memory. Here you see it smoothly rejecting requests that exceed the rolling limit.

Rate Limit Arms Race

A client retries aggressively after being throttled. The retries count against the limit, causing more rejections, causing more retries. This feedback loop can make a small spike look like a DDoS.

Common Mistakes and Reference

Ignoring Burst Tolerance

Setting a low request count without accounting for natural bursts in user behavior. Users click fast, scripts retry, and batch jobs fire many requests at once. If your limit is too tight, real users get blocked. Start with a higher burst allowance and tighten based on real traffic data.

Wrong Window Size

A 1-second window with 100 requests allows bursts of 100 in a single second. A 60-second window with 100 requests allows only ~1.67 requests per second on average. The same number feels very different depending on the window. Always check both values together.

Distributed Counters

This simulator models a single server. In production, you likely have multiple servers behind a load balancer. Each server may have its own counter, or you may use a shared store like Redis. Clock skew between servers can cause counters to drift. Test with your actual infrastructure.

Not Returning Retry-After

When you reject a request, the client needs to know when to try again. Without a Retry-After header, clients guess. Some back off too slowly and keep getting rejected. Some back off too fast and waste capacity. Always include timing hints in 429 responses.

Algorithm Comparison

Algorithm	Burst Handling	Memory	Boundary Issue	Best For
Token Bucket	Controlled by bucket size	Low	None	General APIs
Fixed Window	Full window at reset	Very Low	Double traffic at edges	Simple limits
Sliding Window	Smooth	Higher (timestamps)	None	Fair rate limiting