#if i submit a professional voice cloning am i able to get it retrained later when i get more sample
1 messages · Page 1 of 1 (latest)
No, once a Professional Voice Clone (PVC) is created, it cannot be retrained or updated with additional samples. If you want to include more audio samples, you will need to delete the existing PVC and create a new one using the updated dataset.
To delete your PVC, go to Voices > Personal, click on "View" for your PVC, and then select "Delete." Keep in mind that if your PVC has been shared in the voice library with a notice period, you will not be able to delete it until the notice period has elapsed.
For the best results when creating a new PVC, ensure you provide high-quality audio samples, ideally 30 minutes to 3 hours of clear recordings without background noise or reverb.
should my audio be in stereo or mono
For Professional Voice Cloning, your audio should ideally be in mono format. This helps reduce file size without compromising quality. If your audio is currently in stereo, you can convert it to mono before uploading.
Additionally, ensure the audio is high-quality, with no background noise, consistent tone and volume, and only contains speech without long silences or other sounds. For more details, you can refer to the Professional Voice Cloning guide.
is the result better if it's in mono? or is it just a quality of life thing
The provided content does not specify whether stereo or mono audio is preferred for Professional Voice Cloning. However, the focus should be on ensuring the audio is of high quality—clear, consistent, and free of background noise or interference. The format (stereo or mono) is less critical than the overall quality of the recording.
If you have further questions or need clarification, feel free to ask!
how much audio should i record
For Professional Voice Cloning, it is recommended to provide 30 minutes to 3 hours of high-quality audio for the best results. While 30 minutes is the minimum, closer to 3 hours will yield a more accurate and detailed voice clone. Ensure the audio is clear, features a single speaker, and is consistent in tone and style.
For Instant Voice Cloning, only 1 to 2 minutes of good-quality audio is required. However, the quality of the audio is more important than the length.
is professional voice cloning meant to be like 1:1?
-
Retraining a Professional Voice Clone: Once a Professional Voice Clone (PVC) is created, it cannot be retrained or updated with additional samples. You would need to create a new PVC if you want to include more audio data.
-
Mono vs. Stereo: Your audio should be in mono. This is not just a quality-of-life preference; mono recordings are essential for ensuring consistent and accurate results during the cloning process.
-
Amount of Audio: For Professional Voice Cloning, you should provide at least 30 minutes of high-quality audio. However, for the best results, closer to 3 hours of audio is recommended. The more high-quality data you provide, the more accurate the clone will be.
-
1:1 Accuracy: Professional Voice Cloning is designed to create a highly accurate and realistic replica of your voice, capturing intricate details like tone, accent, and delivery style. However, the quality of the result depends heavily on the quality and consistency of the audio samples you provide.
any specific audio presets i should pre-process my audio with
1. Can you retrain Professional Voice Clone later?
No, once a Professional Voice Clone (PVC) is created, it cannot be retrained or updated with additional samples. If you want to improve the clone, you would need to delete the current PVC and start the process again with new audio samples.
2. Should your audio be in stereo or mono?
Your audio should be in mono format. This ensures consistency and avoids potential issues during the cloning process.
3. Is mono better for results or just convenience?
Using mono is not just a quality-of-life choice—it is essential for achieving the best results. Stereo recordings can introduce inconsistencies between channels, which may negatively affect the cloning process.
4. How much audio should you record?
For Professional Voice Cloning, you should record at least 30 minutes of audio, but closer to 3 hours is recommended for optimal results. The more high-quality data you provide, the better the clone will be.
5. Is Professional Voice Cloning meant to be 1:1?
Yes, Professional Voice Cloning is designed to create a highly accurate, near 1:1 replica of your voice. It captures intricate details like tone, cadence, and even emotional range, provided the input audio is of high quality and consistent.
6. Should you pre-process your audio with specific presets?
Pre-processing is optional but can help improve the quality of your audio. If you choose to pre-process, follow these guidelines:
- Use light compression to balance volume levels.
- Avoid excessive processing like heavy noise reduction, EQ, or normalization, as these can degrade the natural quality of the audio.
- Ensure the audio is clean, with no background noise, distortion, or artifacts.
Keep the processing minimal to preserve the natural characteristics of your voice.
what do the labels do for the voice
Labels and descriptions used during the creation of a Professional Voice Clone do not impact the quality or outcome of the voice cloning. They are primarily for organizational purposes, helping you identify and manage your voice clones within the Voice Library.
same goes for the description?