I have a bot set up with the simple recording example and it works perfectly, and I have a realtime STT setup that can read WAV files. The only thing I'm missing is a way to capture what a user is saying as they say it. My goal was to use voice state detection, similar to how discord.js does it. This bit of JS just tells you who is speaking as soon as they start:
connection.receiver.speaking.on('start', (userId) => {
  console.log(`${userId} is speaking`);
});
That way, once the state is 'speaking' I could fire off the recording, and once speaking is done, close the file and send the WAV off to STT. But from what I can tell it's still not a feature? (Please correct me if I'm wrong.)
I've been pulling my hair out about this all night trying to find a way to do it, but I can't get STT working in Node.js and I can't get speaking state in Python. I'm losing it.
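
For reference, here is roughly the Python side I have in mind once a speaking signal exists. This is only a sketch: UtteranceRecorder and its three methods (speaking_start, audio, speaking_stop) are hypothetical names of my own, not part of any library, and the defaults assume the decoded audio is 48 kHz 16-bit stereo PCM. Whatever ends up providing the start/stop events would just call into it:

```python
import io
import wave


class UtteranceRecorder:
    """Buffers raw PCM per user between a speaking-start and a speaking-stop signal.

    Hypothetical helper: the caller is assumed to supply the start/stop events
    and the decoded PCM chunks from wherever the voice library exposes them.
    """

    def __init__(self, on_utterance, channels=2, sample_width=2, sample_rate=48000):
        self.on_utterance = on_utterance  # called as on_utterance(user_id, wav_bytes)
        self.channels = channels
        self.sample_width = sample_width  # bytes per sample (2 = 16-bit)
        self.sample_rate = sample_rate
        self._buffers = {}  # user_id -> bytearray of raw PCM

    def speaking_start(self, user_id):
        # Start a fresh buffer for this user.
        self._buffers[user_id] = bytearray()

    def audio(self, user_id, pcm):
        # Keep audio only while the user is marked as speaking.
        buf = self._buffers.get(user_id)
        if buf is not None:
            buf.extend(pcm)

    def speaking_stop(self, user_id):
        # Wrap the buffered PCM in an in-memory WAV and hand it off.
        pcm = self._buffers.pop(user_id, None)
        if not pcm:
            return
        out = io.BytesIO()
        with wave.open(out, "wb") as wav:
            wav.setnchannels(self.channels)
            wav.setsampwidth(self.sample_width)
            wav.setframerate(self.sample_rate)
            wav.writeframes(bytes(pcm))
        self.on_utterance(user_id, out.getvalue())
```

The on_utterance callback is where the WAV bytes would go to STT, so nothing has to touch disk unless the STT side needs a real file.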

