#Help selecting provider

1 messages · Page 1 of 1 (latest)

polar horizon
#

I use WizardLM-2 8x22B through Venus.ai and I'd like to set it to use the provider DeepInfra only. But I can't find out where to put select this info on openrouter. Am I not able to do it on OpenRouter's end?
Thanks

flat jay
#

you can do it via our API, but it's up to Venus to implement support for provider selection

polar horizon
#

Yeah, I was playing around with that code from the page. I just didn't know where to put it.. Venus.ai doesn't have that feature that I can find. And google isn't really helping when I try to find out if any other character front ends do or don't.
I tried putting it in the system prompt when using chat on OpenRouter.ai... but that didn't seem to work even temporarily.

azure brook
#

example of provider injection in ST

polar horizon
polar horizon
#

If anybody comes back here in the future, I am now using Risu. It's like SillyTavern but looks much nicer, and super easy to select your provider.

azure brook
#

Upd: SillyTavern will be adding provider selection in update 1.12.0
This functionality is already available in staging branch

ember crypt
#

Help! I'm using langchain's ChatOpenAI and don't know how to switch providers

flat jay
#

can you pass in custom JSON for the request body?

ember crypt
#

In fact, the openai method adds a parameter, and langchain can be passed in.

#

@flat jay I want to know which provider is the most reliable and is there any way to measure it?

flat jay
#

We're planning a public dashboard soon to help people figure this out

#

@dapper pewter anything we could suggest here in the meantime?

dapper pewter
#

@ember crypt for one model in particular or just generally?

and mind expanding on what you mean by "reliable"? like thruput & uptime or "consistent quality responses with my chosen parameters"?

flat jay
#

I think the priority is probably uptime, followed by 0-token responses (eliminating those)

#

but curious about what @ember crypt says as well

ember crypt
#

By customizing the order of setting the provider myself, I found that the provider is still dynamically changing when actually used.😂
I'm guessing that your current call strategy is to sort the providers first, and then prioritize which ones are in low use at the same time(Throughput and latency trade-offs). Right?

flat jay
#

@ember crypt we forgot to update you here, but a couple days after this we disabled load balancing when you manually set an order yourself. sorry about that - very surprising behavior!

#

so it should work the way you'd think now