#Applio speaker embedding
1 messages · Page 1 of 1 (latest)
- if not sure, use the default contentvec
- depend on the language as it is mostly optimized for English
- make sure the model dataset and the inference sample articulate well
Thanks. If I wabt to use a dataset with two linguage? Is it better make two different model? One for every linguage?
I wouldn't say a single bilingual dataset really bad, you gotta try and see
I'm doing some tests. The Italian accent isn't very good; it sounds like an Englishman singing in Italian. Which speaker embedding could I use for Italian?
How long is your dataset?
7 min
Does it only consists of you singing or also speaking?
singing in italian and english and speaking in italian and english. But the sing parts are longer.
If you really want the output to sound Italian then I'd recommend to only use Italian in your dataset and use the index to improve accent
Ok. What do you mean with use the index? I use replay to create cover. Don't know if this software use the index file too...
I've never heard of that software
Why don't you use Applio to make covers
I'm learning how to use it now and I'm starting to make my model voice.
you may want it longer to improve
they are 7 min of HQ sound anyway.