openrouter/quasar-alpha tokenizer is wrong | OpenRouter | Page 1

terse finch Apr 7, 2025, 12:46 AM

#

The maxium token input is around limit 750000 token

oblique dawn Apr 7, 2025, 12:46 AM

#

which tokenizer are you using?

terse finch Apr 7, 2025, 3:00 AM

#

I have a long context which is over 2million tokens. But I can only send around 75_0000, when I send longer than this. error will returned.

terse finch Apr 7, 2025, 3:26 AM

#

data: {"error":{"message":"This endpoint's maximum context length is 1000000 tokens. However, you requested about 1124905 tokens (1124905 of text input). Please reduce the length of either one, or use the "middle-out" transform to compress your prompt automatically.","code":400,"metadata":{"provider_name":"Stealth"}}}

#

looks like you are using charactor based token counting

oblique dawn Apr 7, 2025, 9:07 PM

#

yeah it depends on how you're tokenizing your input. we're not using character based token counting though

#

I can't reveal the exact tokenizer we use unfortunately, but all will be revealed in time!

terse finch Apr 8, 2025, 3:36 AM

#

oblique dawn yeah it depends on how you're tokenizing your input. we're not using character b...

Can you fix this issue ? openrouter/quasar-alpha clearly use gpt-4o tokenizer ( cl100k_base ). Can you switch to that ?

terse finch Apr 9, 2025, 1:10 AM

#

@oblique dawn

oblique dawn Apr 11, 2025, 4:07 AM

#

sorry about the delay

#openrouter/quasar-alpha tokenizer is wrong