#Wake Word
I'll let @keen widget give you advice.
Customising the wake word is a pretty complicated task. It is planned to get easier in the future, so if waiting is an option I would recommend it. However, if you want to dive into customising firmware and stuff, it can be done...
As for wake word selection: you really don't want a short wake word. That leads to a bunch of false positives. There is a reason the main wake words across all voice assistants have more syllables.
I don't mind waiting if it's under six months. I've seen the various posts around openWakeWord and reflashing the hardware and I really don't want to do that
Also my Alexa wakes up from "Alexa". I think I would get along just super with just "Jarvis" instead of "Hey Jarvis"
The VPE does not use OWW (openWakeWord), well, at least not by default. It uses MWW (microWakeWord), but even then you would currently have to add your model to the firmware.
I cannot give you a timeline. However, the backlog issue is HERE, and you can follow it; when there is movement on the task you will see stuff happening.
"Alexa" is 3 syllables, whereas "Jarvis" is 2.
a-lex-a vs jar-vis, which is why hey-jar-vis is used.
If you go with a 2-syllable wake word, you will end up with a bunch of false positives.
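If you want to sanity-check a candidate before spending time on training, here is a rough sketch that estimates syllables by counting runs of vowel letters. It is only a spelling heuristic (it will miscount words with a silent "e", for example), and `estimate_syllables` is a made-up helper name, not part of any wake word tooling.

```python
import re


def estimate_syllables(phrase: str) -> int:
    """Rough estimate: count runs of consecutive vowel letters (y included)."""
    return len(re.findall(r"[aeiouy]+", phrase.lower()))


# Compare the candidates discussed in this thread.
for candidate in ("Alexa", "Jarvis", "Hey Jarvis"):
    print(candidate, estimate_syllables(candidate))
```

For these three it gives 3, 2 and 3, which matches the a-lex-a / jar-vis / hey-jar-vis breakdown.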
Also, whilst making an MWW model is easier now than it used to be, without a bunch of live data sample sets it can be hit and miss, even with a good wake word.
I don't mind taking my chances 🙂
The general process is:
1. Train your MWW model.
2. Host your MWW model somewhere it's easy to grab (GitHub recommended).
3. Take control of the VPE firmware using the ESPHome Builder tool, and add the additional wake word lines below the main firmware package import.
4. Build and flash the firmware.
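For the hosting step, a trained model is normally published as a `.tflite` file next to a small JSON manifest, and the firmware config points at the manifest URL. Below is a minimal sketch that writes such a manifest. The field names follow my understanding of the version 2 manifest layout used by the published microWakeWord models, so treat them as assumptions and compare against an existing manifest before relying on them. Every value (cutoff, arena size, file names, author, URL) is a placeholder, and `build_manifest` is a hypothetical helper.

```python
import json


def build_manifest(wake_word: str, model_file: str, author: str, website: str) -> dict:
    """Assemble a manifest dict; field names and values are assumptions."""
    return {
        "type": "micro",
        "wake_word": wake_word,
        "author": author,
        "website": website,
        "model": model_file,              # .tflite file hosted next to the manifest
        "trained_languages": ["en"],
        "version": 2,
        "micro": {
            "probability_cutoff": 0.9,    # placeholder, tune for your model
            "sliding_window_size": 5,     # placeholder
            "feature_step_size": 10,      # placeholder
            "tensor_arena_size": 30000,   # placeholder, depends on the model
            "minimum_esphome_version": "2024.7.0",  # placeholder
        },
    }


if __name__ == "__main__":
    manifest = build_manifest("jarvis", "jarvis.tflite", "your-name", "https://example.com")
    with open("jarvis.json", "w", encoding="utf-8") as fh:
        json.dump(manifest, fh, indent=2)
```

You would then commit `jarvis.json` and `jarvis.tflite` together so the firmware build can fetch both from the same place.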
Waiting will solve the issue of loading it onto the VPE; however, you will still need to train a model.
Do you have a Linux system with an NVIDIA GPU and a couple of hundred gigs of SSD space to do this on?
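A quick way to answer that for a given machine is a small pre-flight check. This is only a sketch: the 200 GB threshold is my reading of "a couple of hundred gig", and looking for `nvidia-smi` on the PATH is a rough proxy for "has a working NVIDIA driver", not a guarantee that training will run.

```python
import shutil

MIN_FREE_GB = 200  # assumption: "a couple of hundred gig" of free space


def free_space_gb(path: str = ".") -> float:
    """Free space on the filesystem containing `path`, in gigabytes."""
    return shutil.disk_usage(path).free / 1e9


def has_nvidia_tooling() -> bool:
    """True if the nvidia-smi binary is on PATH (a proxy for an installed driver)."""
    return shutil.which("nvidia-smi") is not None


def ready_for_training(path: str = ".", min_free_gb: float = MIN_FREE_GB) -> bool:
    """Both conditions from the question: NVIDIA tooling present and enough disk."""
    return has_nvidia_tooling() and free_space_gb(path) >= min_free_gb


if __name__ == "__main__":
    print(f"free space: {free_space_gb():.0f} GB")
    print(f"nvidia-smi found: {has_nvidia_tooling()}")
    print(f"looks ready: {ready_for_training()}")
```

Point `path` at the drive you plan to train on, since the default only checks the current directory's filesystem.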
I'm sure there's going to be a million pre-trained wake words once you enable loading them into the VPE
It is a bit of an involved process currently, yes.
There are fewer options with this setup.
I do want to say that I'm very happy with how snappy and responsive the VPE is. It's an actual, realistic way of replacing Alexa and I'm grateful for it
I would tend to agree that once the functionality is in, there will be a rush of models being made, yeah.
I'll have to wait for others then. Hopefully you guys can at least provide the option for "Jarvis" instead of "Hey Jarvis" down the road
Yeah, the VPE is great. It has some limits but it's constantly improving.