#High latency in GPT-4o compared to GPT-4o-mini
2 messages · Page 1 of 1 (latest)
It appears you're experiencing delays with GPT-4o on Azure AI Foundry, while GPT-4o-mini is performing well. Here's a potential approach to troubleshoot:
-
Timeout Settings: If the service typically takes longer than 10 seconds, consider increasing the timeout to 30 seconds or more.
-
Retry Policies: Implement exponential backoff and retry policies can help manage temporary service interruptions or overloads