Ollama is dumb with HA | Home Assistant | Page 1

sonic rune Aug 9, 2024, 12:18 AM

#

Need advise here

weak eagle Aug 9, 2024, 1:01 AM

#

seeing the same thing. I have ollama running on my macbook and it pulled down the most recent version too.

royal seal Aug 9, 2024, 1:15 AM

#

Same here

sonic rune Aug 9, 2024, 1:58 AM

#

@prime geyser should we open issue for integration, or it is known problem?
I saw Madalena's demo, and it went well - but everyone was surprised - so IDK if it's known behaviour, or just a fluke.

prime geyser Aug 9, 2024, 2:00 AM

#

The main contributors are working on extending the context window. I've been busy working on our voice hardware, so I haven't had time to try things out with Ollama lately 🙂

sonic rune Aug 9, 2024, 4:00 AM

#

Don't think it's the matter of context length. It just lies to me in both read-only and Assist modes... Doesn't look like lack of context. Especially when it is able to continue conversation - so it holds previous context flawlessly.

upper flicker Aug 9, 2024, 6:23 AM

#

same here, same model used. To be honest, based on the speed, thez quality of their TTS, I switched back to chatGPT (gpt4o-mini and GPT TTS)

#

it's instant, works quite well...and cost pennies

fading pagoda Aug 9, 2024, 10:44 AM

#

Getting comparable results with models that are as small as the ones available in ollama is a challenge. The context window size is not the only variable here. Language and model size are also important. Especially if you compare it with the cloud-hosted models which potentially are vastly larger in size than lamma3.1:8b

#

I can confirm your experience with llama3.1:8b though, it did the same for me 😄

sonic rune Aug 9, 2024, 12:33 PM

#

upper flicker same here, same model used. To be honest, based on the speed, thez quality of th...

Yeah for me privacy is the key. I use GPT now, but only manually, though script actions for refining already parsed info into more human-like form.

upper flicker Aug 9, 2024, 12:33 PM

#

yeah, well, I turn on my lights, I start playing music, ... this is not really what I'm calling privacy issues tbh

#

but I obviously do have LLMS/Ollama running since a long time

#

too bad I don't have a powerhorse to run these LLMS

sonic rune Aug 9, 2024, 12:36 PM

#

fading pagoda Getting comparable results with models that are as small as the ones available i...

I used Llama Conversation integration with weaker models than llama3.1, and it was working better.
Llama3.1 is really good, at least chatting with it is comparable to gpt3 turbo. So it should be able to do what it's intended to do here. I think the issue is somewhere in system message building. Would be cool to debug it and see raw API data.

upper flicker Aug 9, 2024, 12:36 PM

#

you could/dhould join us on the ollama discord server

#

another place where you'll meet knowledgeable people about that is the openwebui server

#

I only can confirm the results are not as good (for now) using different LLMs that gpt4o-mini

sonic rune Aug 9, 2024, 12:39 PM

#

upper flicker yeah, well, I turn on my lights, I start playing music, ... this is not really w...

All entities you have exposed are sent to it, no matter which request you have.
And if you restrict Assist to music and lights exclusively, then you cut the functionality. I want it to be functional, or not to be at all. Anyways, for now I use pure Assist, as I have other concerns about LLM. We'll see how it goes.

upper flicker Aug 9, 2024, 12:40 PM

#

as a matter of fact, I exposed approx 80 entities to it for now

#

but I guess a lot of people simply check them all, and voilà

sonic rune Aug 9, 2024, 12:41 PM

#

upper flicker I only can confirm the results are not as good (for now) using different LLMs th...

Whhooosh of course, comparing 7B model to GPT-4o is meaningless :))

upper flicker Aug 9, 2024, 12:41 PM

#

well, sure you could run crazy models, but you also know you'll have to dedicate a mac studio with 64G of unified ram to get this working 😛

sonic rune Aug 9, 2024, 12:41 PM

#

upper flicker but I guess a lot of people simply check them all, and voilà

Exactly. And then it starts hallucinating and opens your door for stranger.

sonic rune Aug 9, 2024, 12:45 PM

#

upper flicker well, sure you could run crazy models, but you also know you'll have to dedicate...

I don't have good hardware to run it, to be honest. Well, I do actually, but that PC is not intended to be turned on constantly. So I'm just playing now with it.
Curious if we'll get some cheap dedicated device like Tesla card, but cheaper, to be able running LLM on some ThinkCentre.

sonic rune Aug 9, 2024, 12:47 PM

#

upper flicker you could/dhould join us on the ollama discord server

Uhh, I'm holding myself from diving into another rabbit hole so far. It's really tempting, but time is crucial... 🙂

tender drum Aug 10, 2024, 7:45 PM

#

maybe try airllm i have not tried yet

#Ollama is dumb with HA