This is especially relevant for games like Among Us since a lot of people tend talk over one another, rather than one at a time. Specifically during discussions in the emergency meetings.
I believe there was already a similar discussion thread to this, regarding ways Neuro could discern which person is saying what in future group collabs like the dating shows, and gameshow collabs etc. without needing to implement complicated stuff like voice recognition.
There was talk about many different ways to do it. like everyone being in 2 calls for example (by being in 2 calls at once, they meant 1 for the group and one for Neuro to hear individually)
Which was apparently too complicated.
And another idea, was to simply use discord voice activity to detect which people are speaking at any given time. Which nobody seemed to have a problem with, and the discussion kinda ended there.
But i still don’t think that would adequately solve the problem. Because as soon as more than one person starts talking at once, then simply knowing who in the call is speaking at any given moment isn’t the same as knowing which words are coming from what person.
And also, if Neuro isn’t able to somehow “hear” every person individually, she might not even be able to pick up any coherent sentences at all whenever multiple people speak at once.
During an emergency meeting in an among us collab for example. If two players are arguing over each other at once regarding which one is the imposter or something, it really doesn’t matter if Neuro can discern that it’s those two people speaking if it all goes through one audio channel anyway and all she “hears” is an amalgamation of noise rather than two separate arguements.
It’s just no good if two sentences turn into one scuffed amalgamation that retains none of the meaning from either.
Knowing where it came from means nothing at that point.
TLDR: people loud, what do?
seems very interested in keeping latency to a minimum
, I’d imagine this probably isn’t too high on the list of priorities at the moment.
