#what is the impact if you use multiple voices for training a multilingual model where you have two
1 messages · Page 1 of 1 (latest)
When using multiple voices for training a multilingual model, especially with different voices for different languages, there is a possibility of language and accent drift. This means that the AI might switch languages or accents within a single generation, particularly in longer texts.
To minimize this issue, it's recommended to use a properly cloned voice that has been trained in the specific language and accent you want the AI to speak. This approach provides the AI with the most context for accurate pronunciation and language selection. Additionally, using the Projects feature can help mitigate these issues by allowing more control over the generation process.
is there a liklihood that the voices would blend slightly and both languages would have characteristics of both voices?
Using multiple voices for training a multilingual model can lead to some blending of characteristics between the voices, especially if the training data includes mixed language audio recordings. This can result in a situation where the AI might not maintain clear distinctions between the voices for each language, potentially causing some overlap in characteristics.
To achieve optimal results, it's recommended to use training data that consists of recordings in a single language for each voice. This helps avoid confusion during the cloning process and ensures that each voice maintains its unique characteristics without bleeding into the other language.
In regards to your comment "This can result in a situation where the AI might not maintain clear distinctions between the voices for each language, potentially causing some overlap in characteristics." if you stick to voice generation that in one language (without intending to switch back and forth) is there a possibility that the model will start speaking in the second langugage