#Hardware performance and recommendations


civic vigil
#

I'm new to HA and diving into the deep end with Voice Assistant. I've tested on an RPi4 with 8 GB and an i7-7700T. In both cases I'm using an Atom Echo for the microphone. With local voice processing I can only get good enough speed on the i7; the RPi4 is too slow. I've also tried HomeLLM in HAOS and remote LLMs pointed at some other systems I have in the house. The time to action gets worse in every case. The ONLY situation where the time between my voice command and the switch toggling is acceptable is the i7 with local processing. That would be fine, except about 50% of the time the speech-to-text gets it wrong. I get a lot of "I don't see a device called off" responses. My first question: is the Atom Echo up to the task? If it should be, what else do I need to tweak to get faster-whisper accurate closer to 100% of the time?

civic vigil
#

Yes, local whisper. US English.

loud roost
#

how much ram you working with on your i7 setup?

civic vigil
#

16GB

loud roost
#

ok try running ONNX ASR instead of whisper

#

you have the ram for it and it runs very quickly with good success using the parakeet stt model

civic vigil
#

Nice! I'll give that a try

loud roost
#

i should point out that this is a very new thing

#

but from my testing it works great

#

once you install and add the add-on it can take a few minutes to come up the first time, but you can see its progress in the add-on's log

civic vigil
#

First pass wasn't much better. I'm restarting just to see if it makes a difference. Not expecting it to.

loud roost
#

did you update the pipeline config to switch over to it?

civic vigil
#

Yep, I did

#

might be slightly better after the restart

#

Still not "great"

loud roost
#

what type of mistakes is it making?

civic vigil
#

I'm practicing with a Matter device I've called "test plug". It just told me "sorry, I'm not aware of any device called touch plug". It's different every time. Sometimes it doesn't know a device called "on". I think I speak relatively clearly but I'm also not trying to change my voice. Don't want to think I have to slow down every time I use it.

loud roost
#

on the voice assistant settings page next to your pipeline hit the dots then select debug

#

this will show you the trace of a call and you can see exactly what it "thought" you said

loud roost
#

a better quality microphone (e.g. a voice-pe instead of an AE) may help with this but it's hard to be sure

civic vigil
#

Worked twice and then failed with this.

civic vigil
#

Any other options out there that lean towards free or cheap?

loud roost
#

also remember that the basic conversation agent is very basic and picky about phrasing

civic vigil
#

I'm probably ending up with the voice-pe anyway, just would like to prove this out before I do that.

loud roost
#

if you don't particularly need/want full recognition you can try using "speech to phrase" instead

#

it can't recognise wildcard text, but it specifically looks for phrases that are known to be usable

civic vigil
#

I'll give it a try

loud roost
#

so it precalculates "turn on/off test plug" (or whatever else you call stuff)
but it can't be used for something like "play **songtitle** by **artist**"

#

because it looks for specific patterns instead of actual decoding of voice
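
To make the closed-set idea concrete, here is a minimal Python sketch. It is not how Speech-to-Phrase is implemented (that works on the audio itself); it only illustrates why a fixed, precomputed phrase list can absorb a near miss like "touch plug" but cannot handle open-ended text. The templates, the device names, and the `snap_to_phrase` helper are all made up for this example.

```python
from difflib import get_close_matches
from itertools import product
from typing import Optional

# Hypothetical templates and device names, for illustration only.
TEMPLATES = ["turn on {name}", "turn off {name}"]
DEVICES = ["test plug", "desk lamp"]

# Precompute every phrase the assistant is allowed to recognise.
PHRASES = [t.format(name=d) for t, d in product(TEMPLATES, DEVICES)]


def snap_to_phrase(heard: str, cutoff: float = 0.6) -> Optional[str]:
    """Return the closest precomputed phrase, or None if nothing is close."""
    matches = get_close_matches(heard.lower(), PHRASES, n=1, cutoff=cutoff)
    return matches[0] if matches else None


print(snap_to_phrase("turn on touch plug"))             # turn on test plug
print(snap_to_phrase("play some jazz by miles davis"))  # None
```

The second call returns `None` because nothing in the precomputed list resembles it, which is the same limitation described above for "play songtitle by artist" style commands.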

civic vigil
#

which I might want when I get around to setting up MA

civic vigil
#

I'm testing with the app on my phone and it's doing pretty well with the recognition. Missed the first one. Been fine since that.

loud roost
#

yeah the phone mic will likely be better

civic vigil
#

Ok, so that's my problem now. It's fast enough, I just need a good mic

loud roost
#

yeah. the mic (and speaker) in the AE are limited. the voice pe has a 2-mic array and a dedicated audio chip to help clean up the audio

#

the AE is a great cheap device for testing/debugging stuff. i have one that i mess around with. but for practical purposes it falls down

civic vigil
#

I appreciate all the help! I'll see about picking up a PE. Probably leave the AE in an area where I can be the only one impacted by its challenges. 🙂

loud roost
#

yeah, it's fine for your desk but if other people want to use it then it's an issue

civic vigil
#

As a follow-up, I've now tried the same pipeline on both the RPi4 and the i7. The STT is my problem now. On the i7 it takes less than a second, 0.25 s in some cases. On the RPi it's consistently around 5 s to get through the STT step. I've tried both faster-whisper and onnx-asr. There doesn't seem to be a dramatic difference between the two STT engines.
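
For anyone wanting to repeat this kind of comparison outside the pipeline debug view, a rough Python sketch of a timing harness is below. It assumes you can wrap your STT engine in a plain function that takes audio bytes and returns text; `fake_transcribe` is only a placeholder standing in for that call, and `time_stt` is a hypothetical helper, not part of Home Assistant or any STT library.

```python
import statistics
import time
from typing import Callable, List


def time_stt(transcribe: Callable[[bytes], str], audio: bytes, runs: int = 5) -> float:
    """Return the median wall-clock seconds for one transcription call."""
    durations: List[float] = []
    for _ in range(runs):
        start = time.perf_counter()
        transcribe(audio)
        durations.append(time.perf_counter() - start)
    return statistics.median(durations)


def fake_transcribe(audio: bytes) -> str:
    # Placeholder standing in for a real STT engine; swap in your own call.
    time.sleep(0.01)
    return "turn on test plug"


if __name__ == "__main__":
    median = time_stt(fake_transcribe, b"\x00" * 32000)
    print(f"median STT latency: {median:.3f}s")
```

Running the same script with the same audio clip on each machine gives a like-for-like number; using the median rather than a single run smooths out first-call warm-up.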

#

Is there anything to be done to get faster STT on the RPi4?

loud roost
#

getting a pc with an N100 or N150 cpu is a great middle ground if you want it to be quicker but still keep low power

civic vigil
#

That's what I'm thinking. I have an N97 board that I can test with. Roughly the same performance as the N100, maybe slightly better. I'll give that a go before I spend money.

loud roost
#

likely won't be as fast as the i7 but likely a lot quicker than the rpi