#[โ Solved] Error query
1 messages ยท Page 1 of 1 (latest)
The error message youโre seeing means that the combined length of your prompt and expected response is greater than the allowed max tokens for the model youโre using. Specifically, for CosmosRP-2.5 (the free/default model), the maximum number of tokens per request is 8192 (8,192) if youโre not using an API key, and up to 16,384 with a key.
Hereโs whatโs happening in detail:
- Your prompt length: 8,485 tokens.
- Max allowed tokens (+ reply size): 8,192 tokens.
- Because your prompt alone exceeds this limit, the API cannot process the request.
How to fix:
- Shorten your prompt: Edit your message history, system prompt, or instructions to reduce the token count.
- Start a new chat: If youโre on a chat website, clear the conversation and start fresh.
- Check your API settings: If using a model with a larger context window, make sure your key has the corresponding supporter tier to access higher limits (see Models Channel for details).
Tip:
- You can group previous replies or delete some parts of older conversation history to save tokens.
- Always keep the total input + expected output under the modelโs token limit.
If you keep hitting this error, you may want to upgrade to a supporter tier for more tokens per request. You can find more info in the Limits Channel.
Please note: This is an automated response from @wicked void. While it strive to provide accurate assistance, It may occasionally make mistakes. If you find any inaccuracies or need further clarification, feel free to wait for our community helpers to give you further guidance. Thank you.
[โ Solved] Error query