Quota exceeded: Claude 3.5 Sonnet | OpenRouter | Page 1

meager kraken Sep 5, 2024, 10:09 PM

#

Hi guys, you've reached max for Claude. Will they allow an increase?

Here is the error message I get, and I haven't used it at all in the last 24 hours:
{"type":"error","error":{"type":"rate_limit_error","message":"Number of request tokens has exceeded your per-minute rate limit (https://docs.anthropic.com/en/api/rate-limits); see the response headers for current usage. Please reduce the prompt length or the maximum tokens requested, or try again later. You may also contact sales at https://www.anthropic.com/contact-sales to discuss your options for a rate limit increase."}}

Anthropic

Home - Anthropic

Contact Anthropic

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

uncut sonnet Sep 6, 2024, 2:21 AM

#

Same happening to me on Sonnet 3.5 (self moderated) but with Google Vertex as provider:

{ "result": { "error": { "code": 429, "message": "[{\n \"error\": {\n \"code\": 429,\n \"message\": \"Quota exceeded for aiplatform.googleapis.com/online_prediction_requests_per_base_model with base model: anthropic-claude-3-5-sonnet. Please submit a quota increase request. https://cloud.google.com/vertex-ai/docs/generative-ai/quotas-genai.\",\n \"status\": \"RESOURCE_EXHAUSTED\"\n }\n}\n]" } }

I am about to launch an app based on this. Now I am doubting if I could really scale using OpenRouter, instead of connecting directly to Anthropic.

How can we avoid this in the future when my app scales to many users?

Thank you

wicked heart Sep 6, 2024, 3:47 AM

#

Unfortunately I ask myself the same, I really like open router but I get constant errors or empty messages from different models. That does never happen when I connect to the providers directly...

#Quota exceeded: Claude 3.5 Sonnet