u/Electrical_Yam_9444

▲ 4 r/softwarearchitecture+1 crossposts

Most rate limiting discussions focus on algorithms: token bucket, leaky bucket, sliding window.

But in real systems I’ve noticed the algorithm often matters less than where the limiter sits in the request path.

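If the limiter sits too late in the request path, the system has already done most of the work for a request before it can reject it. Under overload you then end up with 503 Service Unavailable from exhausted downstream components instead of controlled 429 Too Many Requests responses.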
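To make the placement point concrete, here is a minimal sketch in Python. It is a toy model rather than anything from the article: `TokenBucket`, `Backend`, `handle`, and `burst` are hypothetical names, the backend never frees its slots, and the clock is injectable so the behaviour is deterministic. It compares a burst that hits a limiter at the edge with the same burst that reaches a saturated backend directly.

```python
import time


class TokenBucket:
    """Simple token bucket; `clock` is injectable so it can be frozen in tests."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum burst size
        self.clock = clock
        self.tokens = float(capacity)
        self.last = clock()

    def try_acquire(self):
        now = self.clock()
        # Refill according to elapsed time, capped at capacity.
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False


class Backend:
    """Hypothetical downstream service with a fixed number of slots."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.in_flight = 0
        self.work_started = 0       # requests that got this far

    def call(self):
        # By this point the request has already cost something
        # (parsing, auth, fan-out), even if it is about to fail.
        self.work_started += 1
        if self.in_flight >= self.capacity:
            return 503
        self.in_flight += 1         # never released in this toy model
        return 200


def handle(backend, limiter=None):
    # Edge placement: reject before any downstream work is started.
    if limiter is not None and not limiter.try_acquire():
        return 429
    return backend.call()


def burst(n, backend, limiter=None):
    return [handle(backend, limiter) for _ in range(n)]
```

With the limiter sized to the backend's capacity, the excess requests in a burst are turned away as 429 before `Backend.call` runs. Without it, every request reaches the backend and the overflow surfaces as 503. The algorithm is the same in both cases; only where the check happens changes.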

I wrote a short piece about thinking of rate limiters as system boundaries rather than just algorithms, including some examples of failure modes when capacity or placement is wrong.

Article:
https://medium.com/p/fe8ad3eae36c

Curious how others approach this in production systems.

u/Electrical_Yam_9444 — 15 days ago