#JARVIS assistant
also im a bit of an ironman nerd
and if you want to know more about my jarvis ask me here
I can ask my Karin where an ambulance passing by is going.
huh?
hallo o/ im also working on a similar voice assistant to help me manage my day to day activities, what tts are you using for ur va?
whisper
bcus i cant find a good one
home assistant and friends even have a "hey jarvis" wake word pre-installed.
i have everything set in my own code, i just want a nice tts so i can finish it 😅
im using openwakeword for the wake word management
Here is a short snippet of my assistant
disclaimer about faster-whisper: it takes a bit longer than some stt plugins, only a little bit longer though
also your snippet is coming
oh tht sounds pretty good
np take your time :D
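# Imports are assumed from how the names are used below
# (sounddevice as sd, scipy's wavfile.write, faster-whisper's WhisperModel).
import tempfile

import sounddevice as sd
from faster_whisper import WhisperModel
from scipy.io.wavfile import write


def record_audio(duration=5, samplerate=16000):
    print("🎤 Recording...")
    recording = sd.rec(int(duration * samplerate), samplerate=samplerate, channels=1, dtype='int16')
    sd.wait()  # Block until the recording has finished
    # Keep the temp file on disk so Whisper can read it afterwards
    path = tempfile.NamedTemporaryFile(suffix=".wav", delete=False).name
    write(path, samplerate, recording)
    return path


def offline_speech_to_text(duration=5):
    try:
        path = record_audio(duration)
        print("🤖 Transcribing...")
        model = WhisperModel("base", compute_type="int8")  # Use "tiny" for faster performance
        segments, _ = model.transcribe(path)
        return " ".join(segment.text for segment in segments).strip()
    except Exception as e:
        return f"[Offline STT Error] {e}"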
i think that's it
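Side note on the snippet: it builds a new WhisperModel on every call, so every transcription pays the model load time again. A minimal sketch of loading it once and reusing it. `get_model`, `transcribe_file` and the `loader` parameter are hypothetical names for illustration, not part of the original assistant, and the lazy import assumes faster-whisper is installed:

```python
# Sketch only: cache one loaded model per (size, compute_type) pair.
_MODELS = {}


def _load_whisper(size, compute_type):
    # Imported lazily so this module can be imported without faster-whisper.
    from faster_whisper import WhisperModel
    return WhisperModel(size, compute_type=compute_type)


def get_model(size="base", compute_type="int8", loader=_load_whisper):
    """Return a cached model, loading it only the first time it is requested."""
    key = (size, compute_type)
    if key not in _MODELS:
        _MODELS[key] = loader(size, compute_type)
    return _MODELS[key]


def transcribe_file(path, model=None):
    """Transcribe a WAV file with the shared model (or one passed in)."""
    model = model or get_model()
    segments, _ = model.transcribe(path)
    return " ".join(segment.text for segment in segments).strip()
```

With this, `offline_speech_to_text` could call `transcribe_file(record_audio(duration))` instead of constructing the model itself.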
if you pick home assistant as the basis for your jarvis the possibilities are almost endless
but you have to make a lot of scripts yourself for now
i have a full version of jarvis designed to work on a pc and a version designed to go on a raspberry pi
also same
ive just gotten into it the past few weeks and picking up a lot from friends
samee thts my end goal as well
whisper on a pi isnt that fast
guys wanna see my raspberry pi version of jarvis in action? because my full version has a lot of my personal info on it
yeah sadly
But you can run whisper on a remote system.
also i think this is stt? i was asking abt text-to-speech 😅
i wld love to
I have a script that makes a local dynamic dns to pick the fastest whisper available.
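A rough sketch of the "pick the fastest one" part, not the actual script mentioned above. It only measures how long a TCP connection takes to open, as a stand-in for real transcription speed, and does not update any DNS record. The host names in the usage note are made up, and `connect_latency` and `pick_fastest` are hypothetical helper names:

```python
import socket
import time


def connect_latency(host, port, timeout=1.0):
    """Return the seconds needed to open a TCP connection, or None if unreachable."""
    start = time.perf_counter()
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return time.perf_counter() - start
    except OSError:
        return None


def pick_fastest(servers, probe=connect_latency):
    """Return the (host, port) pair with the lowest latency, or None if none answer."""
    timings = []
    for host, port in servers:
        latency = probe(host, port)
        if latency is not None:
            timings.append((latency, (host, port)))
    if not timings:
        return None
    return min(timings, key=lambda item: item[0])[1]
```

Usage would look like `pick_fastest([("whisper-gpu.local", 10300), ("whisper-pi.local", 10300)])`, where both host names and the port are assumptions about the local setup.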
ohhh then idk i just use pyxtss or what ever it's called
ah i was also using that, but for me recently it kept stopping in the middle of the sentence and i have no idea how to fix it, so im looking for something more natural and free x)
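One thing that may be worth trying for the mid-sentence stops (a guess at a workaround, not a confirmed fix) is feeding the TTS engine one sentence at a time instead of one long string. A minimal sketch, assuming the library meant here is pyttsx3. `split_sentences` and `speak` are hypothetical helpers, and `say` defaults to `print` so the sketch runs without any TTS installed:

```python
import re


def split_sentences(text):
    """Split text after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def speak(text, say=print):
    """Hand the text to the `say` callable one sentence at a time."""
    for sentence in split_sentences(text):
        say(sentence)


# Assumed pyttsx3 wiring (needs `pip install pyttsx3`):
#   import pyttsx3
#   engine = pyttsx3.init()
#   def say_with_engine(sentence):
#       engine.say(sentence)
#       engine.runAndWait()
#   speak("Good morning. You have two meetings today.", say=say_with_engine)
```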
ayy, excited
ahh dang it it's too long of a video, i might be able to go into a vc? i will be deafened tho
and show you what he can do
idk man
well ok i will send a video but it will be a bit short ok
np np
also for ur hardware setup, are u running just a pi (5/4), or do you have anything else connected to it?
i dont have a pi yet
im getting one soon and plus i cant send you a video because its still too long
cant you compress it?
if you recorded it on a phone there are good free working apps
or else ask an AI for the linux compress command
ummm i will have to give it to you in snippets but it will be only like 3
wanna see the code for yourself?
@haughty anchor
@west junco
np
sure
also i wont be able to respond for a while cus im gonna go study for my exams xD
also
i found a tts called coqui tts
imma try it out and let uk
gonna need an explanation on this one lol
Yeah i had this too
Also i might train a voice model for Jarvis with coqui tts
I'll start doing that once my exams are over
Have u implemented a music player yet?
Bram-diederik/home-assistant-LLM-scripts (GitHub): scripts to get real live data into your LLM.
Hi, "hey Jarvis" doesn't work on my devices (Voice PE), only "Ok Nabu" works. I didn't find a solution to get this to work, does anyone have an idea what to do?
It's in the openwakeword setup, open openwakeword and use that
Hey Jarvis is built into Microwakeword. Also using PE with Openwakeword requires a lot of changes to YAML, and results in losing some functionality (like STOP word).
I thought it's not possible to change the wake words on Voice PE, you just have "ok Nabu", "hey mycroft" and "hey jarvis", which are fixed, am I wrong?
So far there's three official words for MWW, yes.
Well, in theory you can train your own wake word for Microwakeword (and for Openwakeword too, where it's easier). There's a notebook available for training. But for good results, you'd have to get real voice samples and a big positive/negative set. For example, Ok Nabu is the only wake word for Microwakeword that was trained on a big synthetic dataset plus ~20000 real samples donated by the community.
The same goes for Openwakeword, but training and tuning models for it is a bit easier. However, the official PE firmware doesn't work with Openwakeword (which runs on the HA side), and uses Microwakeword (which is an on-device ESPHome algorithm).
"Hey Jarvis" works better than "Ok Nabu" for some people. It heavily depends on voice pitch, stress, accent and pronunciation. Also, because of the synthetic dataset, it doesn't work well with children's/women's voices.
Ok, but then I need to change the configuration files of the Voice PE, right? I know how to train wake words, I already did this, but I'm not sure if I really want to do this with Voice PE, as I've often run into problems when an ESP update arrived.
i'm working on an iteration where i'll re-introduce OpenWakeWord (or anything compatible) wake word detection, but that's for experimenting only. I think MicroWakeWord is superior.
Well, MWW works only for ESP32. OWW is universal, although server based... Each has advantages and disadvantages.
Yes, you'd need to take control and make changes to the YAML.