no way around?
It doesn't have to be a firm rate limit, basically we are going for some limit of outstanding requests to not completely jam things up, but if they are answered fast we want to request more fast.
So kinda a mix between a rate limit and a semaphore
doesn't have to be exact
sounds like backpressure handling, fair