#I need more input and guidance on how to create my professional voice clone
To create a high-quality Professional Voice Clone (PVC), follow these steps:
- Access VoiceLab: Go to the "Voices" tab in your VoiceLab.
- Add a New Voice: Click on "Add Generative or Cloned Voice" and select "Professional Voice Cloning".
- Read Guidelines: Confirm that you have read the Guidelines and Rules, then click "Start".
- Name and Upload: Name your PVC, choose the language, and upload your recordings.
- Add Details: Add labels and a description for your PVC, then click "Create Professional Voice".
- Verify Your Voice: Ensure your browser has access to your microphone and that you are not muted. Use the same equipment and match the tone and delivery of your uploaded samples. Click "Start Recording" and read the generated captcha.
- Fine-Tuning: Once verified, your PVC will be queued for fine-tuning. This process can take several hours.
- Use Your PVC: After fine-tuning, find your PVC in your VoiceLab and click "use" to generate the audio you need.
Recording Key Considerations
- Recording Quality: Use high-quality equipment and ensure clear audio with no background noise.
- Single Speaker: Ensure only one voice is present in the recordings.
- Audio Length: Provide at least 30 minutes of audio; closer to 3 hours gives the best results.
- Consistent Delivery: Maintain a consistent speaking style throughout the recordings.
- Language: Use samples in the same language as your PVC model.
Recording Quality Guidelines
- Equipment: Use a professional XLR mic and audio interface.
- Pop-Filter: Minimize plosives with a pop-filter.
- Microphone Distance: Position yourself about two fists away from the mic.
- Noise-Free: Record in a quiet, acoustically-treated room.
- Volume Control: Maintain a consistent volume level.
- Audio Format: Use mono, .wav files with a minimum 44.1 kHz sample rate and 16-bit depth.
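If you want to check your files against the audio format guideline before uploading, a minimal sketch like the one below may help. It uses Python's standard `wave` module, which only reads PCM `.wav` files, so other encodings would need a different tool. The function name `check_wav_format` and the constant names are made up for this example and are not part of any VoiceLab tooling.

```python
import wave

MIN_SAMPLE_RATE_HZ = 44_100   # minimum sample rate from the guideline above
MIN_SAMPLE_WIDTH_BYTES = 2    # 16-bit depth = 2 bytes per sample


def check_wav_format(path):
    """Return a list of problems found in a PCM .wav file (empty if none)."""
    problems = []
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        rate = wav.getframerate()
        width = wav.getsampwidth()
    if channels != 1:
        problems.append(f"expected mono, found {channels} channels")
    if rate < MIN_SAMPLE_RATE_HZ:
        problems.append(f"sample rate {rate} Hz is below 44.1 kHz")
    if width < MIN_SAMPLE_WIDTH_BYTES:
        problems.append(f"bit depth {8 * width} is below 16-bit")
    return problems
```

Calling `check_wav_format("take_01.wav")` on a hypothetical file returns an empty list when the file is mono, at least 44.1 kHz, and at least 16-bit. It only inspects the file header and says nothing about noise or room acoustics.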
Performance Guidelines
- Emphasis and Emotion: Align your intonations and emotions with the context of the text.
- Consistent Voice: Avoid changing voices for different characters in a single recording.
- Pronunciation: Ensure clear and articulate pronunciation.
By following these steps and guidelines, you can create a high-quality Professional Voice Clone.
No, I need more resources on how this process works. What kind of samples can I record? Someone mentioned having a separate audio for a whispering voice for example
To create a high-quality Professional Voice Clone (PVC), follow these guidelines:
- Recording Quality: Use high-quality recording equipment, such as an XLR mic with a dedicated audio interface. Ensure your recordings are free from background noise, room reverb, and other unwanted sounds.
- Clear Audio: Make sure there’s only a single speaking voice throughout the audio. Avoid background music or sound effects.
- Audio Length: Provide at least 30 minutes of audio, but closer to 3 hours is recommended for the best results. You can upload multiple files, but ensure they have consistent audio quality.
- Consistent Delivery Style: The speaking style in your samples will be replicated in the output. Use one style per upload. For example, if you’re creating a voice model for audiobooks, submit recordings of yourself reading books in a consistent style.
- Language: Use samples in the language you primarily intend the PVC for. Cloning a voice from a different language may result in accents or mispronunciations.
- Clone Your Own Voice: For now, you can only clone your own voice. You will need to go through a verification process before submitting your fine-tuning request.
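To see how much audio you have across several files, you could total the durations yourself. The sketch below is one way to do that with Python's standard `wave` module (PCM `.wav` only). Comparing channel count, sample rate, and bit depth across files is only a rough stand-in for "consistent audio quality" and is an assumption of this example; the name `summarize_recordings` is hypothetical.

```python
import wave

MIN_SECONDS = 30 * 60          # "at least 30 minutes"
TARGET_SECONDS = 3 * 60 * 60   # "closer to 3 hours is recommended"


def summarize_recordings(paths):
    """Total the duration of several PCM .wav files and flag format mismatches."""
    total_seconds = 0.0
    formats = set()
    for path in paths:
        with wave.open(path, "rb") as wav:
            total_seconds += wav.getnframes() / wav.getframerate()
            # Same channels, rate, and bit depth is an assumed proxy for
            # consistency; it cannot detect a change of room or microphone.
            formats.add((wav.getnchannels(), wav.getframerate(), wav.getsampwidth()))
    return {
        "total_seconds": total_seconds,
        "meets_minimum": total_seconds >= MIN_SECONDS,
        "meets_target": total_seconds >= TARGET_SECONDS,
        "consistent_format": len(formats) <= 1,
    }
```

Pass it the list of files you plan to upload. If `meets_minimum` is false, you have less than the 30 minutes mentioned above.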
Recording Key Considerations
- Professional Recording Equipment: Use high-quality equipment for optimal results.
- Use a Pop-Filter: This minimizes plosives when recording.
- Microphone Distance: Position yourself about two fists away from the microphone.
- Noise-Free Recording: Ensure the audio input doesn’t have any interference.
- Room Acoustics: Record in an acoustically-treated room to reduce unwanted echoes.
- Audio Pre-processing: Consider light compression or other tools to improve your audio files.
- Volume Control: Maintain a consistent volume that’s loud enough to be clear but not so loud that it causes distortion.
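For the volume point, one rough way to compare takes is to measure their peak and RMS levels in dBFS (decibels relative to full scale). The sketch below assumes 16-bit PCM `.wav` input and uses only the Python standard library; `measure_levels` is a made-up name, and no specific target level is implied here because the guidelines above do not state one. It reads the whole file into memory, so it is best suited to shorter takes.

```python
import array
import math
import sys
import wave


def _to_dbfs(value):
    """Convert a 0..1 linear level to dBFS; silence maps to -inf."""
    return 20 * math.log10(value) if value > 0 else float("-inf")


def measure_levels(path):
    """Return (peak_dbfs, rms_dbfs) for a 16-bit PCM .wav file."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        samples = array.array("h", wav.readframes(wav.getnframes()))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV sample data is little-endian
    if not samples:
        raise ValueError("file contains no audio frames")
    full_scale = 32768.0
    peak = max(abs(s) for s in samples) / full_scale
    rms = math.sqrt(sum(s * s for s in samples) / len(samples)) / full_scale
    return _to_dbfs(peak), _to_dbfs(rms)
```

A peak at or very near 0 dBFS suggests clipping, which matches the distortion warning above. RMS values that differ a lot between takes suggest the volume was not consistent.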
Performance Guidelines
- Emphasis and Intonation: Align appropriately with the context of the text.
- Emotional Range: Vary your tone and pace naturally when reading.
- Consistent Voice: Avoid changing voices for different characters in a single recording.
- Pronunciation: Ensure correct and articulate pronunciation.
Scripts
Choose a text or script that conveys the emotion you want to capture and read in the tone of voice you want to use. Here are some examples:
By following these guidelines, you can create a high-quality Professional Voice Clone that accurately replicates your voice.
Do you have scripts in German for an informative podcast?