#What techniques can I use to create more consistent intonation across different generations of TTS
1 messages · Page 1 of 1 (latest)
To create more consistent intonation across different TTS generations, you can use the following techniques:
-
Use Consistent Training Audio: Ensure the audio used for training reflects the delivery style you want, maintaining uniform pacing and tone. Avoid mixing varied delivery styles, as this can lead to inconsistencies in the AI-generated voice.
-
Limit Break Tags: If you're using break tags to introduce pauses, limit them to one or two per generation. Using more than two or three can cause instability and faster speech. For longer pauses, consider using punctuation like dashes instead.
-
Regenerate Audio: If the output seems inconsistent or has pacing issues, try regenerating the audio. Two free regenerations are available for the same prompt and settings within two hours of the original generation, as long as you don’t refresh the page. This option is exclusive to the Speech Synthesis website and not applicable in Projects or via API.
These steps can help improve the consistency of intonation and pacing in your TTS outputs.
i'm doing it via API