
Distributed rate limiter? #350

Closed
javadevmtl opened this issue Feb 27, 2019 · 15 comments

@javadevmtl

Hi, unless I missed it: will there be support for backing the rate limiter with a distributed cache? For fault tolerance, performance, etc., we deploy at least 2 API instances side by side, so the rate limiter would somehow have to share the count between the instances.

RobWin commented Mar 13, 2019

No, we don't have a distributed cache for rate limiters.

@javadevmtl

Is it something worth looking at? Thanks

RobWin commented Mar 14, 2019

@storozhukBM Correct me if I'm wrong, but my opinion is that rate limiters should be fast and should not decrease response times or throughput.
A distributed cache introduces a lot of complexity and latency when caches must be synchronized or replicated.
I think the disadvantages outweigh the advantages.

Maybe you need a centralized rate limiter inside a load balancer or proxy instead.

@jwcarman

In the absence of some form of shared state for the rate limiters, what is the suggested approach for implementing rate limiting in an elastic/cloud environment? Using a single load balancer or proxy would create a single point of failure, so I don't think that's going to work for a lot of use cases.

RobWin commented Mar 21, 2019

There is no recommended approach for that with the current rate limiter implementation.
But a better solution than rate limiters with shared state could be adaptive capacity management.
See #201 and please have a look at the awesome talk linked there.

@storozhukBM has started on an implementation.

javadevmtl commented Mar 23, 2019

One way I guess it could be done: pre-generate a batch of counters, give each node a range, and when a node has exhausted its range, it goes back to the cache to get more...

Something like this: https://apacheignite.readme.io/docs/id-generator
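
A minimal sketch of that range-claiming idea, assuming a cluster-wide atomic counter (an `AtomicLong` stands in for the distributed cache here; the class name `RangeClaimingLimiter` and the batch size are made up for illustration):

```java
import java.util.concurrent.atomic.AtomicLong;

// Sketch of the range-claiming idea: each node reserves a batch of permits
// from a shared counter and only touches shared state when its batch runs out.
// The AtomicLong stands in for the distributed counter (e.g. Ignite's ID generator).
public class RangeClaimingLimiter {

    private static final long BATCH_SIZE = 1_000;  // hypothetical batch size

    private final AtomicLong sharedCounter;        // stand-in for the cluster-wide counter
    private long localNext;                        // next permit in the claimed range
    private long localEnd;                         // exclusive end of the claimed range

    public RangeClaimingLimiter(AtomicLong sharedCounter) {
        this.sharedCounter = sharedCounter;
        claimNextRange();
    }

    // Returns the next permit id, claiming a fresh range from the
    // shared counter only when the local range is exhausted.
    public synchronized long acquirePermit() {
        if (localNext >= localEnd) {
            claimNextRange();
        }
        return localNext++;
    }

    private void claimNextRange() {
        localEnd = sharedCounter.addAndGet(BATCH_SIZE);  // one shared round-trip per batch
        localNext = localEnd - BATCH_SIZE;
    }
}
```

Each node then pays for one round-trip to shared state per batch rather than per request, which is the same trade-off the Ignite ID generator makes.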

RobWin commented Mar 23, 2019

Do you want to implement rate limiting on the client side (source) or the server side (sink)?

If you want rate limiting on the server side with a rate limit per client, you could use an API gateway like Kong.

RobWin commented Mar 23, 2019

Resilience4j-ratelimiter is better suited to the client side. A client can also be another server.
If you just want to protect the sink server from overload, I still think that adaptive capacity management (or congestion control at the application layer) is a better choice than a distributed rate limiter. That's why we are working on an initial implementation of an adaptive bulkhead.

RobWin commented Mar 23, 2019

I don't think you should use Resilience4j to reimplement an API gateway on your own.

RobWin commented Mar 23, 2019

Netflix already implemented an adaptive bulkhead: https://github.com/Netflix/concurrency-limits
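
For context, here is a conceptual sketch of the AIMD (additive increase, multiplicative decrease) idea behind such adaptive limiters. This is not the concurrency-limits API; all names and numbers in it are assumptions:

```java
import java.util.concurrent.Semaphore;

// Conceptual sketch of an adaptive (AIMD-style) concurrency limit -- the idea
// behind Netflix/concurrency-limits, not that library's actual API.
// The limit grows additively while calls succeed and halves when the
// protected call signals overload (timeout, rejection, ...).
public class AimdConcurrencyLimiter {

    // Semaphore.reducePermits is protected, so expose it via a tiny subclass.
    private static final class AdjustableSemaphore extends Semaphore {
        AdjustableSemaphore(int permits) { super(permits); }
        void reduce(int n) { reducePermits(n); }
    }

    private final int maxLimit = 200;  // assumed upper bound
    private int limit = 20;            // assumed starting limit
    private final AdjustableSemaphore inFlight = new AdjustableSemaphore(limit);

    public boolean tryAcquire() {
        return inFlight.tryAcquire();
    }

    public synchronized void onSuccess() {
        inFlight.release();
        if (limit < maxLimit) {
            limit++;                   // additive increase
            inFlight.release();        // one extra permit widens the in-flight budget
        }
    }

    public synchronized void onOverload() {
        inFlight.release();
        int newLimit = Math.max(1, limit / 2);  // multiplicative decrease
        inFlight.reduce(limit - newLimit);      // shrink the budget without blocking
        limit = newLimit;
    }
}
```

The caller wraps each outbound request with `tryAcquire()` and reports back via `onSuccess()` or `onOverload()`, so the in-flight budget settles near what the downstream system can actually sustain.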

storozhukBM commented Mar 25, 2019

Totally agree with @RobWin's thoughts.
It is better for your "elastic/cloud" distributed application to avoid any type of shared state by all means, especially state shared across multiple nodes in your cluster.

To me, adaptive capacity management looks like the ideal solution in your case.

If that is not feasible or suitable for some reason, I'd recommend pre-calculating a target throughput per node and configuring it statically.

If your cluster is truly elastic and shrinks and grows under load, that automatically means you have some coordination solution (service discovery like Consul or Eureka, or some other form of coordination). In that case you already have some unavoidable shared state, so it would be convenient to use it to dynamically reconfigure the rate limiters on each node. A cluster configuration change is then a relatively rare disturbance, and the rest of the time everything works without unnecessary state sharing.
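
A sketch of that per-node approach using Resilience4j's actual `RateLimiterConfig` API; the global budget of 1000 requests/second, the limiter name, and the cluster-size callback are assumptions for illustration:

```java
import io.github.resilience4j.ratelimiter.RateLimiter;
import io.github.resilience4j.ratelimiter.RateLimiterConfig;

import java.time.Duration;

public class PerNodeRateLimiting {

    // Assumed global budget: 1000 requests/second across the whole cluster.
    private static final int GLOBAL_LIMIT_PER_SECOND = 1_000;

    // Each node statically takes its share of the global budget.
    public static RateLimiter createForCluster(int clusterSize) {
        RateLimiterConfig config = RateLimiterConfig.custom()
                .limitRefreshPeriod(Duration.ofSeconds(1))
                .limitForPeriod(Math.max(1, GLOBAL_LIMIT_PER_SECOND / clusterSize))
                .timeoutDuration(Duration.ZERO)  // fail fast instead of queueing callers
                .build();
        return RateLimiter.of("backendCalls", config);
    }

    // When service discovery (Consul, Eureka, ...) reports a new cluster size,
    // adjust the limiter in place; changeLimitForPeriod is part of the
    // Resilience4j RateLimiter interface.
    public static void onClusterSizeChange(RateLimiter rateLimiter, int newClusterSize) {
        rateLimiter.changeLimitForPeriod(Math.max(1, GLOBAL_LIMIT_PER_SECOND / newClusterSize));
    }
}
```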

RobWin closed this as completed Apr 3, 2019
astubbs commented Nov 3, 2020

I think there are more scenarios where shared-state rate limiting is a good option: for example, where you don't control the target system, where your source system scales dynamically for other reasons, or where rate-limit usage is not the same across all clients in the cluster.
I've found these interesting: https://github.com/vladimir-bukhtoyarov/bucket4j/blob/master/doc-pages/jcache-usage.md and https://github.com/mokies/ratelimitj
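
For anyone comparing options, a conceptual sketch of the shared-state token-bucket pattern those libraries apply over a distributed cache; this is not bucket4j's API, and a `ConcurrentMap` stands in for the JCache here, with capacity and refill rate assumed:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Sketch of the shared-state token-bucket pattern: the bucket state is an
// immutable value that is replaced atomically on every consume attempt.
public class SharedTokenBucket {

    record BucketState(long tokens, long lastRefillNanos) {}

    private static final long CAPACITY = 100;           // assumed bucket capacity
    private static final long REFILL_PER_SECOND = 100;  // assumed refill rate

    // Stand-in for the distributed cache (JCache, Redis, ...).
    private final ConcurrentMap<String, BucketState> cache = new ConcurrentHashMap<>();

    // Try to take one token for the given key; returns false when rate-limited.
    public boolean tryConsume(String key) {
        long now = System.nanoTime();
        boolean[] consumed = new boolean[1];
        cache.compute(key, (k, state) -> {
            long tokens = CAPACITY;
            long last = now;
            if (state != null) {
                // Refill proportionally to the time elapsed since the last refill,
                // advancing the refill mark only by the time the granted tokens cover.
                long refilled = (now - state.lastRefillNanos()) * REFILL_PER_SECOND / 1_000_000_000L;
                tokens = Math.min(CAPACITY, state.tokens() + refilled);
                last = state.lastRefillNanos() + refilled * 1_000_000_000L / REFILL_PER_SECOND;
            }
            consumed[0] = tokens > 0;
            return new BucketState(consumed[0] ? tokens - 1 : tokens, last);
        });
        return consumed[0];
    }
}
```

A real deployment would push the same compare-and-swap through the cache's atomic entry operations, which is where the extra network latency @RobWin mentioned comes in.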

RobWin commented Nov 3, 2020

Yes, but this would require a complete overhaul of our metric calculation and storage components.
We are open to contributions for Resilience4j 2.0.

astubbs commented Nov 3, 2020

Haha, yes, understood, but that's not what was being pointed out previously. I'm just on the hunt for such a solution and came across all of this, so I wanted to link it together for other people to find :)

@jonathannaguin

@RobWin do you have a general idea of which components we would need to change to support this? I also came across this thread, and although there seem to be alternatives, they don't seem as good as Resilience4j.
