#Flex and Priority processing for Gemini models

1 messages · Page 1 of 1 (latest)

mystic stump
mystic stump
#

bump

#

is this implemented?

lament gale
#

Bump, and also, this is a common feature on other providers too like openAI which has basically the same feature and pricing scheme. I would like it on every major provider that support supports it. I believe Azure and Amazon Bedrock also have some models that support this. Let me use a :flex slug

wraith pike
#

in support website it says it supports openai but not google....tested and didn't see changes or pass through for the flex para

mystic stump
#

bump

#

we just want transparency man how hard is it to support flex processing for Google models, flash is costing me a ton

mystic stump
#

bump

indigo flame
#

openrouter finished. I found a solution and already setup LiteLLM and OpenWebUI and its working.

mystic stump
#

does it work?

#

I'm mainly using G3 flash for chat

#

which.... in terms of cost its not appealing

mystic stump
#

any updates?

#

bump

indigo flame
#

I'm spending SIGNIFICANTLY less with my setup. a few weeks ago I realized i'm paying2-3x times more than other developers using the same models. Now that I switched over OMG what a cost difference without any performance difference

#

the set up took a couple hours but the product is significantly better. Sure, I have to update myself and add new models as they come, but at least i wont experience the significant downtime and expense of using this crap platform.

#

FYI, i loved OpenRouter and recommended it to several friends. but there are limits and shutting out users and never updating us on whats going on is unacceptable

wraith pike
#

BaliHoo, get lost will you?

subtle finch
#

bump

mystic stump
#

bumpy bump bump meowslightsmile

indigo flame
#

You folks need to jump this sinking ship. You’re paying wayy too much for subpar service and no guarantee of uptime

mystic stump
#

using official google sdks is pita to work with though doable

wraith pike
#

where is the confirmation from the team? is this like soon, never or bottom of the priority list?

keen sphinx
#

bump

mystic stump
#

bump

mystic stump