#Connect HA to vLLM

1 messages · Page 1 of 1 (latest)

lost hinge
#

I've been unhappy with the performance of Ollama when trying to use HA and OpenwebUi at the same time. The model loading and unloading takes too long when I simply need something done in my house.

What is the best way to connect HA to vLLM? Do I just use the ollama integration?

knotty oasis
#

All depends on your hardware your LLM is running on literally that's everything

lost hinge
#

While I don't disagree its partly.my hardware, vLLM always for concurrent requests. Ollama doesnt and uploads the model

cedar barn
lost hinge
#

Its the same model but I have tweaked the OWUI. Home Assistant has no way to set temperature or chunk size so it uploads it

#

@cedar barn

cedar barn
lost hinge
#

Which would make sense for most models but GPT-OSS:20b needs a but of tweaking to get it where I want it for other users

#

Bit

cedar barn
lost hinge
#

I need to look back into qwens. I just couldn't get all 280 entities to it. It would never find sunset and stuff like that. So I've even using 20b

cedar barn
lost hinge
#

I don't really know how to limit it too much more. I havent made any scripts or anything yet, but I have roughly 60 smart switches alone in my house lol

cedar barn
lost hinge
#

That last idea seems like the best option actually. I'm sure there are some switches I could get rid of, just havent figure out which ones yet.

#

So the script would be the best of both worlds I think

cedar barn
lost hinge
#

Awesome thank you so much for your help

cedar barn
knotty oasis
#

Theres a Tinyllama refined for HA only 1.0b

cloud nova
#

Nice