#Ollama, VA, & HA

1 messages · Page 1 of 1 (latest)

keen blade
#

Trying to get HA to work with Ollama and VA.

List of things I’m trying to do.

Ask VA about devices. ❌
Ask Ollama about devices. ✔️
Ask questions for Ollama thought VA. ❌

Where do I start?

misty ruin
keen blade
#

I think so. I’m at the gym right now and I will look when I get home. I know it’s an Nvidia.

past cape
#

are you just looking for a guide to set this up or your running into an issue?

keen blade
#

Setting up. I’ve got Ollama in Assist to see devices but not VA.

past cape
#

bit confused what you mean your LLM see the devices, HA and VA act in tandem and HA is what retains your devices and your VA simply looks at what is exposed in your HA to reference the entity ID for each call to command. So if you have your VA setup to only route commands through your LLM and not enabled the feature to run your commands locally first?

#

I think if you just go to your expose screen, you can setup aliases and should work without much problem

light plume
#

I'm curious why people mostly writing questions about their setup when they're not at home... 🙂

keen blade
keen blade
light plume
#

Probably your Pipeline setup is crooked. Or, as others said, your LLM is choking. Case is, that when you build it into pipeline, HA is adding system prompt to every message, that contains your exposed entities, and tools, exposed by HA. That adds up to the context a lot, and model requires more RAM and more time to process, comparaboe to using it directly.

keen blade
misty ruin
#

What model do you use? Are you sure the GPU is being used?

light plume
keen blade
past cape
#

Does ollama run native or in a docker?

#

If docker ensure u are using the --gpus=all foe the argument

misty ruin
#

No one sane uses docker run 😄

keen blade
past cape
#

Run a voice command and what your gpu levels to see if it's being used, not sure what GPU monitor you use

last tiger