One of the reasons Neuro and evil stick to thier role even when the human part of the stream seems to be down,angry,sad or wants to speak about a serious stuff.
Is, that neuro and evil cant detect emotions in the voice of a human they take everything like they are reading it from a book.
Here are some projects that maybe can help with that https://medium.com/@marko.briesemann/emotion-recognition-with-ai-c7f831332ed3
#Neuro needs to understand emotions in voices (so she is serious at the right point)
1 messages · Page 1 of 1 (latest)
I think this might be solved with intelligence upgrades and time
There’s only so much you can get from text. Even humans can’t fully understand emotions through it.
Oh right she doesn't just get the audio she has to convert it to text I'm stupid
She actually can recognize some sounds so... maybe we're on the right track? But maybe is just another AI saying: "Someone clapped"
So... she actually needs to... "listen" like us 
More work for vedal...
She’ll probably be completely multimodal kinda like chatgpt 4o eventually when its cheaper and more efficient to do so. Rn tho, something like that probably adds latency and would somehow be unethical or something
Am i missing something or isn't she already multimodel
imagine the l*tency