#Mixing OpenAI & local models
1 messages · Page 1 of 1 (latest)
yeah, we'll have to have multiple clients at the same time. The routing idea is a good start
The routing as we discussed it is implemented now
But, we need a more expressive way to configure the router now
Adding OLLAMA_... env variables will help.. but only if you use ollama. Don't want to lock ourselves too tightly to infra tooling
right it should be more generic as an "other" probably
are there ways to run gpt-* models outside of openai? We could be smart about the base_url for gpt-* models and send the rest to the base url. But then it still needs another name
Maybe this is where LLM_BASE_URL and LLM_KEY come in, as a generic fallback?