I've been trying to get a 100% local LLM arrangement to work, but keep running into this "500 Internal Server Error" no matter how I configure the model in AG Studio.
- AG Studio
A) I create a new model and populate only:
i) "model name" field = 'mistralai'
ii) "base URL" = "http://localhost:8000/v1" (confirmed same port as litellm reports)
B) Try the "test model" button and attempt model+agent+playground sanity test, but receive HTTP 500 - Internal Server Error 100% of the time. - See text snippet
-This seems like an AG Studio issue because:
>litellm acks the request -->INFO: 127.0.0.1:59266 - "POST /v1/chat/completions HTTP/1.1" 200 OK
>Passes to ollama, ollama loads and serves the LLM request (see 2nd attached log snippet)
>Ollama passes it back to litellm with HTTP 200 success message. -->[GIN] 2024/02/03 - 01:30:52 | 200 | 607.967781ms | 127.0.0.1 | POST "/api/generate"
I've tried a handful of LLM model config field permutations already (specifying "openai" as type, trying "ollama/mistral" as model name, etc.) with no change in results. Also tried using LM Studio server to try and debug/workaround but got the same result to litellm.