# Adding Pauses to Speech Synthesis
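Hey. I did figure something out. What I do is prompt the model to synthesize a string of dots, like "...". The more dots, the longer the pause. Then I cache that output. Whenever I need a pause, I can simply concatenate those cached bytes with the audio for whatever I need to be said. At the byte level, you can just slam it all together, injecting your pause audio bytes wherever you want them. That works as long as all the clips share the same format and sample rate (with a container like WAV you'd need to deal with the header first).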
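
Here's a rough sketch of the idea in Python, in case it helps. It isn't tied to any particular TTS library: `synthesize` is a hypothetical function that takes text and returns audio bytes, and `PauseCache`, `speak_with_pauses`, and `fake_synthesize` are names I made up for illustration. It assumes the output is raw, headerless audio (e.g. PCM) in one consistent format, so plain byte concatenation is valid.

```python
from typing import Callable, Dict, List, Union

# Hypothetical signature: text in, raw headerless audio bytes out.
Synthesizer = Callable[[str], bytes]


class PauseCache:
    """Synthesizes each pause length once and reuses the bytes afterwards."""

    def __init__(self, synthesize: Synthesizer) -> None:
        self._synthesize = synthesize
        self._cache: Dict[int, bytes] = {}

    def pause(self, dots: int) -> bytes:
        # More dots means a longer pause; only call the model on a cache miss.
        if dots not in self._cache:
            self._cache[dots] = self._synthesize("." * dots)
        return self._cache[dots]


def speak_with_pauses(
    synthesize: Synthesizer,
    cache: PauseCache,
    parts: List[Union[str, int]],
) -> bytes:
    """Strings are spoken; ints are pauses of that many dots."""
    chunks = []
    for part in parts:
        if isinstance(part, int):
            chunks.append(cache.pause(part))
        else:
            chunks.append(synthesize(part))
    # Assumes every chunk is the same raw format, so joining bytes is enough.
    return b"".join(chunks)


def fake_synthesize(text: str) -> bytes:
    # Stand-in for a real TTS call so the sketch runs on its own.
    return text.encode("ascii")


cache = PauseCache(fake_synthesize)
audio = speak_with_pauses(fake_synthesize, cache, ["Hello", 3, "world"])
```

With a real model you'd swap `fake_synthesize` for your actual TTS call. The cache is keyed by dot count, so each pause length only costs one synthesis request.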