I'm curious about how your uptime detection for different model providers work. I want to use Azure OpenAI, but have realized they have spikey poor performance. In a dream world, I would use OpenRouter and it would use Azure (which is faster normally) whenever their performance is around their normal (i.e. sorting by throughput). How often is throughput measured? Is it the 30 second window you mentioned in your docs? How does this compare to the yellow status signal shown under Azure OpenAI 4o-mini currently. Is the yellow status detected or inferred?
Would you expect OpenRouter to just work for my use case, or would I have to implement something custom? I'm not just falling back on errors, more so on throughput and latency degradations.