#Mixing OpenAI & local models

1 messages · Page 1 of 1 (latest)

simple cosmos
#

not sure how to fix

eager mesa
#

yeah, we'll have to have multiple clients at the same time. The routing idea is a good start

simple cosmos
#

The routing as we discussed it is implemented now

#

But, we need a more expressive way to configure the router now

#

Adding OLLAMA_... env variables will help.. but only if you use ollama. Don't want to lock ourselves too tightly to infra tooling

eager mesa
#

right it should be more generic as an "other" probably

#

are there ways to run gpt-* models outside of openai? We could be smart about the base_url for gpt-* models and send the rest to the base url. But then it still needs another name

simple cosmos
#

Maybe this is where LLM_BASE_URL and LLM_KEY come in, as a generic fallback?

eager mesa
#

yeah seems fine

#

we're basically assuming the fallback is an openai compatible api. Otherwise it'll need it's own client and router

#

safe bet