#Subtitle bugs

1 messages · Page 1 of 1 (latest)

restive lodge
#

These are probably already known, but I will post them anyway. I noticed all of these in the current dev stream, by the way.

For both the V1 and V2 voices:

  • Sometimes, when Neuro says a message, the subtitles are appended to the end of the previous message's subtitles.
  • Sometimes, the first sentence is cleared from the subtitles after Neuro starts the second sentence.

For the V2 voice:

  • Subtitles are not formed dynamically (i.e., they are not formed word-by-word as they are spoken).
waxen pivot
#
  • the subtitles disappear too quickly
    also gets cut off more when interrupted
karmic ridge
#

this would be OBS/player fault mostly, it can't detect the changes fast enough, I have seen the best implementation done by character.ai, where they steam it to the web app instead (json) by sending chunks of the tts AudioSegments as base64. then have the web app handle syncing/showing the text and playing the audio:

"speech": "Base64AudioSegementToPlaydGVzdA==",
"text":"the text component to display"
}```

the audio segment is only about 3-5 sec long, thus eliminating the lag to rendering. Its always synced every few words.

And web app can render extra data along the subtitle, such as the sender username, original chat, and Twitch identity badges, etc.
waxen pivot