Hi,
I've been using OpenRouter since 1-2 months, and never had issues.
I didn't use since few days, but now I see persistent issue, errors such:
This model's maximum context length is 8000 tokens. However, you reques
ted 11147 tokens (11147 in the messages, 0 in the completion). Please re
duce the length of either one, or use the "middle-out" transform to compress your prompt automatically.
Do you apply transforms by default?
Because I can't find even find one request which displayed Input / Output tokens size is correct: mostly they are higher, even for the output.
I ask that because for example this 11147 tokens prompt (lol) is in fact at MOST 6k tokens (but allegedly it was <4k)
We have implemented our own tokens quantity handlers, which makes it always very safe and fallbacks to bigger models as soon as the message + 30% reaches 95% of model's token limits.... ๐ฆ
Thanks for help