#Quota exceeded: Claude 3.5 Sonnet

3 messages · Page 1 of 1 (latest)

meager kraken
#

Hi guys, you've reached max for Claude. Will they allow an increase?

Here is the error message I get, and I haven't used it at all in the last 24 hours:
{"type":"error","error":{"type":"rate_limit_error","message":"Number of request tokens has exceeded your per-minute rate limit (https://docs.anthropic.com/en/api/rate-limits); see the response headers for current usage. Please reduce the prompt length or the maximum tokens requested, or try again later. You may also contact sales at https://www.anthropic.com/contact-sales to discuss your options for a rate limit increase."}}

Anthropic

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

uncut sonnet
#

Same happening to me on Sonnet 3.5 (self moderated) but with Google Vertex as provider:

{ "result": { "error": { "code": 429, "message": "[{\n \"error\": {\n \"code\": 429,\n \"message\": \"Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.\",\n \"status\": \"RESOURCE_EXHAUSTED\"\n }\n}\n]" } }

I am about to launch an app based on this. Now I am doubting if I could really scale using OpenRouter, instead of connecting directly to Anthropic.

How can we avoid this in the future when my app scales to many users?

Thank you

wicked heart
#

Unfortunately I ask myself the same, I really like open router but I get constant errors or empty messages from different models. That does never happen when I connect to the providers directly...