High granularity average API output speed for popular models | OpenRouter | Page 1

crude cypress Aug 12, 2025, 2:52 PM

#

(This is a bit more of a wish)
I'd really love to see OR using it's massive usage data to do graphs of high granularity(like hourly or sub-hourly) output speed for popular models(say, 1B+ tokens per day), this would really help monitoring the state of things, for example, GPT-5 recently has pretty significant speed flutuations

crude cypress Aug 13, 2025, 3:37 PM

#

@last compass

last compass Aug 13, 2025, 9:48 PM

#

cc @warm ice - we're doing pretty fine-grained analytics on this now

#

will check but i think we should have been able to show this

warm ice Aug 13, 2025, 9:56 PM

#

I was actually just about to ship this today or tomorrow

#

ah i see the specific stats youre asking about now

#

yeah so the thing im shipping soon is for 1h/1d/1m views on your user activity charts

#

we've been laying the foundation for all the other charts (global token counts, throughput, latency) to show minute & hour granularity as well, and have em in our internal dashboards already

just need to polish them up a little more and figure out good spots to show them

crude cypress Aug 14, 2025, 5:08 PM

#

warm ice yeah so the thing im shipping soon is for 1h/1d/1m views on your user activity c...

ah super nice, thank you very much!

thin hazel Aug 14, 2025, 5:27 PM

#

That would be great. I'm seeing massive variance in toks/s on some models and it would be really helpful if OR helps show which providers have a consistent speed

graceful gale Aug 14, 2025, 6:04 PM

#

I'd also like to see the 1% lows and the 1% highs (basically, the lowest TPS recorded and the highest TPS recorded)

#High granularity average API output speed for popular models