Hey everyone, I am aware of this implementation of parallel API requests to the OpenAI API: https://github.com/openai/openai-cookbook/blob/970d8261fbf6206718fe205e88e37f4745f9cf76/examples/api_request_parallel_processor.py
However, that code is too low-level for me. I'd rather have something built on the "openai" module itself, which would be easier to maintain.
I already have several threaded implementations, but the threaded approach doesn't use the token quota as well as the implementation above.
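To make the goal concrete, here is a rough sketch of the kind of token-aware pacing I mean. It is only an illustration, not the cookbook's implementation: `TokenBucket`, `run_all`, and `send` are names I made up, `send` stands in for whatever async call into the "openai" module you would use, and the per-request token estimates are assumed to be computed elsewhere.

```python
import asyncio
import time


class TokenBucket:
    """Budget that refills continuously at per_minute / 60 units per second."""

    def __init__(self, per_minute, clock=time.monotonic):
        self.capacity = float(per_minute)
        self.available = float(per_minute)
        self.rate = per_minute / 60.0
        self.clock = clock
        self.updated = clock()

    def _refill(self):
        # Credit the budget for the time elapsed since the last update.
        now = self.clock()
        gained = (now - self.updated) * self.rate
        self.available = min(self.capacity, self.available + gained)
        self.updated = now

    def try_consume(self, amount):
        # Take `amount` from the budget if possible; report success.
        self._refill()
        if amount <= self.available:
            self.available -= amount
            return True
        return False

    def seconds_until(self, amount):
        # How long to wait before `amount` units are available (0.0 if now).
        self._refill()
        return max(0.0, (amount - self.available) / self.rate)


async def run_all(jobs, send, requests_per_minute, tokens_per_minute):
    """Launch each job as soon as both the request and token budgets allow.

    jobs: list of (payload, estimated_tokens) pairs.
    send: hypothetical async callable that performs one API request.
    """
    request_budget = TokenBucket(requests_per_minute)
    token_budget = TokenBucket(tokens_per_minute)
    tasks = []
    for payload, estimated_tokens in jobs:
        if estimated_tokens > token_budget.capacity:
            raise ValueError("a single request exceeds the per-minute token limit")
        while True:
            wait = max(
                request_budget.seconds_until(1),
                token_budget.seconds_until(estimated_tokens),
            )
            if wait == 0:
                break
            await asyncio.sleep(wait)
        request_budget.try_consume(1)
        token_budget.try_consume(estimated_tokens)
        # Start the request without waiting for earlier ones to finish.
        tasks.append(asyncio.create_task(send(payload)))
    return await asyncio.gather(*tasks)
```

Requests are started as fast as the two budgets allow instead of being capped by a fixed number of threads, which is where I think the threaded versions lose throughput. This sketch leaves out retries and rate-limit error handling.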
Does anybody have code that fully utilizes the token limits but is easier to understand and implement than the approach above? Thanks!