#Clips of the same speaker generated different voices
1 messages · Page 1 of 1 (latest)
It sounds like you're experiencing inconsistencies in the voice output for your project. This can sometimes happen due to variations in the original voice samples used for voice cloning.
The quality and volume of voice samples in the voice library can vary significantly, as they come from users with different recording setups - ranging from professional equipment to basic amateur setups. This variation can lead to discrepancies in the sonic characteristics when generating different lines or even regenerating the same line.
To help improve consistency, here are a few suggestions:
- Check your voice settings to ensure they're consistent across all clips.
- Try regenerating specific clips where the voice changed unexpectedly.
- Select a voice from the Voice Library with the HQ badge, as these tend to be more consistent.
- Adjust the stability and similarity settings to find a balance that works for your project.
If you're using the dubbing tool, make sure the voice settings are standardized across the entire audio track. Sometimes, different settings can be accidentally applied to various sections of the text.
For best results, it's advisable to test multiple short sentences with the voices you're considering to evaluate their performance and consistency before committing to a full project.
If you continue to experience issues after trying these steps, please don't hesitate to reach out for further assistance.
The generated clips from the default voice "callum" are all different. Can I copy the voice on one of the clips and paste it to another so they sound the same and are consistent?
Currently, it is not possible to directly copy the voice characteristics from one generated clip and apply them to another. However, there are steps you can take to improve consistency in generated audio:
-
Use High-Quality Input Samples: If you're cloning a voice, ensure the input audio is of high quality, with no background noise, consistent tone, and a single speaker. For Professional Voice Cloning, providing 30 minutes to 3 hours of consistent audio is recommended.
-
Adjust Voice Settings: If you're experiencing inconsistency, try tweaking the
stabilityandsimilarity_boostparameters in the voice settings. Increasing stability can help maintain a more uniform sound across generations. -
Break Text into Smaller Segments: Generating shorter clips instead of long passages can help reduce variability in the output.
-
Re-clone the Voice: If the inconsistency persists, consider re-cloning the voice using more consistent and higher-quality samples. This can help the AI better capture the nuances of the voice.
If you're still facing issues, you may want to experiment with these adjustments or reach out to support for further assistance.
I did not clone the voice, it is a default generated voice