#pvc
1 messages · Page 1 of 1 (latest)
When creating a Professional Voice Clone, it's best to use samples in the language that the clone will mainly be used for. If you're planning to use the clone to speak English, it would be better to provide training audio in English. This is because if you train the model with Italian audio and then use it to speak English, the resulting voice may have an Italian accent when speaking English and might mispronounce words and inflections.
However, it's important to note that the AI can speak any language that is currently supported. The application will reject your sample if it is recorded in an unsupported language. The bare minimum recommended is 30 minutes of audio, but for the optimal result and the most accurate clone, it's recommended to provide closer to 3 hours of audio.
Remember, the AI will try to replicate everything it hears in the audio, including the speaking style. So, the training data should correspond to the style you want (e.g., if you're looking to voice an audiobook with a clone of your voice, the audio you submit for training should be a recording of you reading a book in the tone of voice you want to use).
Lastly, ensure that the audio quality is high, with no background noise or interference, and that there's only a single speaking voice throughout the audio. The AI works best with clean, uncluttered audio.
quick question, is there a way also using bees to translate an audio/video directly with a personalized voice or not?
Yes, you can clone your voice speaking any language that is supported by the Multilingual V2 model. However, the language of the voice you want to clone should match the language you want the AI to speak. This is because the AI needs both the text context and the voice context to function optimally. The language is determined by the text, and the accent and pronunciation are determined by the voice itself.
For your case, if you want to clone an Italian voice and use it to speak English, it's recommended to provide training data in Italian. This is because the AI will try to mimic the tonality of the speaker in the training data. However, the results might be unpredictable when the cloned voice speaks English, as the AI has been trained on Italian data.
As for your second question, currently, it is not possible to use any subtitle files, such as SRT or SBV, to generate speech that fits a specific time frame. The closest solution at the moment is dubbing, which will attempt to do this automatically by cloning the original voice, transcribing and translating the text, and retaining the original performance. However, this technology is still in progress and might not provide perfect results.
I mean when you upload a video to do dubbing, for example from English to Italian, initially it is translated with the basic voice and then you have to modify each single track with the customized voice, there is a way to do dubbing of a video directly with the chosen voice ?
Based on the information provided, when you start the dubbing process, you will be asked to select the original language and the language it will be dubbed into. You will also be asked to select the video or audio you want to dub. You can either upload a video or audio file that will be dubbed or import a video directly from various platforms.
However, the information does not specify whether you can directly dub a video with a chosen voice. It seems that the AI handles the dubbing process, including separating speakers from the background and keeping the soundtrack and sound effects intact.
For more detailed instructions and options, it's recommended to read the full documentation provided by the service.
What format should I use to do the transcript to speech with the API?
I'm sorry, but the information provided does not include details on the format required for transcript to speech with the API.
by uploading csv file in dubbing, is’s possible to select a custom voice?
Yes, when you're creating a dub and choose the manual dubbing option, you can upload a CSV file with specific information. This file should include the names of the speakers, the start and end time of when they are speaking, the original text, and the translated text. However, the provided content does not explicitly mention the ability to select a custom voice through the CSV file. The custom voice selection seems to be done within the dubbing studio, where you can change settings for individual clips or the whole track, including the voice settings.
vorrei sapere qual'è il miglior script da leggere per allenare il professional voice cloning per doppiare dei video di un canale di videogiochi dall'italiano all'inglese
Il tipo di script che scegli per l'allenamento del Professional Voice Cloning dipende molto dal tipo di tono e stile di consegna che desideri per il clone della voce. L'IA cercherà di imitare tutto ciò che sente in una voce: la qualità tonale, l'accento, l'inflessione e molti altri dettagli intricati. Riprodurrà come pronunci certe parole, vocali e consonanti, ma non le parole stesse. Quindi, è meglio scegliere un testo o uno script che trasmetta l'emozione che vuoi catturare e leggere con il tono di voce che desideri utilizzare.
Se stai cercando di doppiare video di un canale di videogiochi, potrebbe essere utile scegliere uno script che sia in linea con il tipo di dialogo o commento che si trova tipicamente in quei video. Ad esempio, potresti voler leggere un commento di un videogioco o un dialogo da un videogioco popolare. Ricorda, il tono e lo stile di consegna che usi durante la lettura dello script saranno replicati nell'output, quindi dovresti cercare di leggere lo script nel modo in cui vorresti che il clone della voce parlasse.
Inoltre, è importante notare che se stai cercando di clonare la tua voce mentre parli italiano e poi vuoi che parli inglese, è molto probabile che avrà un accento italiano quando parla inglese. Pertanto, se desideri un clone della voce che parli inglese senza un accento italiano, potrebbe essere necessario fornire campioni di te stesso che parli inglese per l'allenamento.
ma se il mio inglese non è molto buono e io parlo inglese per allenare il pvc quando poi eseguo il doppiaggio avrà il mio accento sbagliato, se invece io parlassi in italiano non sarebbe meglio?