We managed to 2x our Azure GPT-4 rate limits, by using a simple model switching strategy
Decided to open-source it for everybody else - (feel free to DM me if you need help 🙂 )
Here's a tutorial: https://github.com/BerriAI/reliableGPT/blob/main/examples/Azure_OpenAI_Model_Switching_demo.ipynb