Ratelimit-rewrite | Twilight | Page 1

solemn cloak · 2025-02-04T08:56:40.161Z

This is a breakout thread for 's http ratelimit rewrite started in [#2418](https://github.com/twilight-rs/twilight/pull/2418) (Some pings , )

solemn cloak Feb 4, 2025, 8:56 AM

#

This is a breakout thread for @deep raft's http ratelimit rewrite started in #2418

(Some pings @keen trout, @topaz mango)

GitHub

feat(http-ratelimiting)!: rewrite crate by vilgotf · Pull Request #...

Replaces InMemoryRatelimiter with a new implementation with proper path bucketing support and global limit accounting.
TODO:

Remove Ratelimiter trait (no limiter but a http proxy is planned as an...

#

I am not sure if channels are the best way to handle ratelimits as things can happen in a uncontrolled order, but I think a rewrite to remove them would probably be quite large.

#

I have also been looking a bit at another "twilight-proxy"-like proxy called nirn-proxy which seems to be able to handle arbitary paths which is something which would be cool if we could add without compromising how well it works.

topaz mango Feb 4, 2025, 10:45 AM

#

be careful with looking at nirn code btw, it is AGPL

#

I use nirn-proxy personally on top of serenity

solemn cloak Feb 4, 2025, 10:59 AM

#

topaz mango be careful with looking at nirn code btw, it is AGPL

Yeah I am aware

#

Its rather annoying tbh.

solemn cloak Feb 4, 2025, 11:00 AM

#

topaz mango be careful with looking at nirn code btw, it is AGPL

The issue with AGPL is that it is a bit strange what is part of the work covered by the license.

#

So we just need someone to read the code and describe it to us :)

solemn cloak Feb 4, 2025, 11:35 AM

#

Hmm are any of the nirn-proxy people here.

#

I don't think so.

deep raft Feb 4, 2025, 3:15 PM

#

solemn cloak I am not sure if channels are the best way to handle ratelimits as things can ha...

You mean the mpsc channel? I don't see a problem, it's collected and split into individual queues asap (similar to the gateway queue).

solemn cloak Feb 4, 2025, 3:23 PM

#

I have to read through your changes again but iirc you send your rate limit request through a queue, with a one shot to message back or am I completely off with that?

#

I have to read through your changes again but iirc you send your rate limit request through a queue, with a one shot to message back or am I completely off with that?

#

Hmm I did not send that twice??

#

And then we have some channels where we just run a sleep and then return which is more the place I could see issues or have those been removed?

deep raft Feb 4, 2025, 3:27 PM

#

Queues are sequential, but do not block each other

#

You could have multiple requests in-flight in parallel within a queue, but that complicates staying in sync with the returned limits (i.e. they may return out of order)

#

Also most of the time requests are sequential, so it's a somewhat niche optimization

#

I did however remove the ratelimiters receive timeout. I figure that the http client can manages that (dropping the oneshot receiver to unstuck the ratelimiter)

solemn cloak Feb 4, 2025, 3:45 PM

#

hmm if I am a consumer of a ratelimit and I want to know how long is left you can't do that with a channel even though it already is know right?

#

Or is there something that can change how long a ticket needs to sleep for after the initial creation?

#

And one I think we have seen is a 429 is hit and then the other requests in-flight for that endpoint still hit the 429 even though we know they will hit it, but I am not sure how to solve that.

#

I might just be talking shit though, because I am not sure how you make it in a better way without some rather hard central sync code.

deep raft Feb 4, 2025, 5:45 PM

#

solemn cloak hmm if I am a consumer of a ratelimit and I want to know how long is left you ca...

It is possible to implement something like that, but pending requests may unpredictably change your calculation. Note for example that my implementation does not decrement the queue's remaining/available count on making a request, but always assigns whatever Discord returns. Discord has stated to prefer a leaky bucket algorithm, but they have also stated that there may be exceptions to this.

deep raft Feb 5, 2025, 7:23 PM

#

The actor (runner function) is now feature complete if anyone wants to take a look. The last piece was interactions bypassing the global limit. It's incredibly dense and will definitely need lots of integration tests ⚠️

deep raft Feb 8, 2025, 1:33 PM

#

I added a new API in addition to acquire allowing for consumers to signal backpressure and inspect queues. Note that the predicate takes a function pointer and not a closure! You cannot reference any local variables with function pointers (the predicate runs in the actor, so closures would require allocations and Send bounds).

pub fn acquire(&self, path: Path) -> PermitFuture; // PermitFuture: Future<Permit>
pub fn acquire_if(&self, path: Path, predicate: fn(&Queue) -> bool) -> MaybePermitFuture; // MaybePermitFuture: Future<Option<Permit>>

I guess Queue will expose methods similar to what Bucket does today.

deep raft Feb 8, 2025, 1:57 PM

#

Rethinking this, taking a closure is probably better cause the allocation should never be a performance problem. Oh well

deep raft Feb 16, 2025, 12:23 PM

#

I have just marked the PR as ready for review and it is now also ready for early testing

deep raft Feb 16, 2025, 12:28 PM

#

solemn cloak I have also been looking a bit at another "twilight-proxy"-like proxy called nir...

I chose to exclude this for now. The rate limiter needs to just be able to hash the path, so dynamic parsing can easily be slipt in later

solemn cloak Feb 23, 2025, 11:16 AM

#

@deep raft Can I ask what the idea behind rehashing in actor.rs is?

deep raft Feb 23, 2025, 11:44 AM

#

Discord's bucket hash is not globally unique, but only unique within the "top level resource".
However, HashMap<Path, Vec<u8>> and HashMap<(TopLevelResource, Vec<u8>), Queue> doesn't play well together because of ownership over the bucket hash (Vec<u8>). That's why I use the hashbrowns HashTable and hash by hand.

solemn cloak Feb 23, 2025, 6:03 PM

#

Makes sense, I will continue looking through it

deep raft Feb 23, 2025, 7:03 PM

#

I hope that I have not produced write only code vikingblobsved

#

I honestly don't even remember how everything ties together. That is part of my motivation for writing integration tests haha

solemn cloak Feb 23, 2025, 7:13 PM

#

I do have a in-progress comment about documenting the macros, mostly so it is clear when to use them and such as they can be a bit hard to decipher.

deep raft Mar 7, 2025, 3:59 PM

#

solemn cloak I do have a in-progress comment about documenting the macros, mostly so it is cl...

I inlined Queue::pop into pop! (renamed from on_pop!). I this is a bit better

solemn cloak Mar 23, 2025, 11:29 AM

#

Finally finished a review

#

Don't really have many comments, but i looks nice

wanton seal Mar 24, 2025, 10:49 AM

#

@deep raft i deployed this fix to prod

#

because the gc removes queues bound to buckets it will eventually panic

#

atleast thats my thought

deep raft Mar 24, 2025, 10:51 AM

#

Did the crash happen after a gc? It is meant to only drop unused queues…

wanton seal Mar 24, 2025, 10:51 AM

#

deep raft Did the crash happen after a gc? It is meant to only drop unused queues…

not sure my traces weren't on

deep raft Mar 24, 2025, 10:52 AM

#

I look at writing some tests for the garbage collection and see if I can make it crash

#

Ideally your fix should not be necessary, as it is more of a work around for the gc being overly aggressive

deep raft Mar 24, 2025, 10:55 AM

#

wanton seal not sure my traces weren't on

GC occurs at a 6 hour interval so you could check if that lines up with your crash time

wanton seal Mar 24, 2025, 11:01 AM

#

ill check in a moment

#

bot started ~3:45 and people started complaining around ~10 that it was broken

#

so it was likely a gc

#

i was asleep when it broke so i dont have an exact time

wanton seal Mar 24, 2025, 11:25 AM

#

wanton seal because the gc removes queues bound to buckets it will eventually panic

yeah this isn't true clueless

#

at least with what i'm trying to do to replicate

wanton seal Mar 24, 2025, 11:41 AM

#

got it to crash

deep raft Mar 24, 2025, 11:50 AM

#

How did you replicate it?

wanton seal Mar 24, 2025, 12:23 PM

#

im sometimes getting "hash is unchanged' too

#

while testing

deep raft Apr 13, 2025, 10:00 AM

#

wanton seal <@131517556786855937> i deployed this fix to prod

The GC left invalid bucket references to removed queues. Fixed by removing the reference from the bucket map too

deep raft Apr 13, 2025, 10:02 AM

#

wanton seal im sometimes getting "hash is unchanged' too

I think this stems from the invalid bucket references too, so it should be resolved

deep raft May 14, 2025, 6:55 AM

#

@wanton seal are you still using the pr? And if so, did you run into any more issues?

deep raft May 21, 2025, 4:10 PM

#

@keen trout I managed to shorten the in_flight branch. I believe that one is the most complicated so PTAL regarding documentation

pure folio Jun 27, 2025, 9:28 PM

#

Is this ready to test yet?

deep raft Jun 28, 2025, 6:31 AM

#

Yes! Some internals are pending final approval but the API/functionality should be mostly stable

dawn vine Jun 28, 2025, 11:11 PM

#

been looking at this and i wonder for the http proxy if we still want this using that once the rewrite is in. it might be more stable (and correcter) for the proxy to not be tied to specific routes but do some generic parsing to find the major route and id. and collect them on the bucket id

solemn cloak Jun 29, 2025, 4:02 AM

#

Yeah we have been talking about doing that for a while

#Ratelimit-rewrite