As in title:
I bought a 2-hour, full-cast audio drama in MP3, and the main character is voiced by an impersonator rather than the real actor who played the same character in the TV series. And he sounds awful. I want to change his voice to the real actor's voice using AI. But here's the catch: I don't want to spoil the plot for myself. So I need a method or program to automatically find every instance of that one actor's voice in the whole file (maybe based on a short voice sample, idk), and replace only that voice with AI. Is this possible?
#Changing voice of one actor in an audio drama, with auto detection of that voice, not manually.
What's your PC's GPU?
You can just give RVC the audio file, then the trained model of the original voice actor, without needing to check the file manually
However, you need a trained model of the original voice actor
GTX 1060, a very old graphics card
But come on, that's only 2 hours of audio, and only around 20 minutes of it is that character, so it can't take that much time
By original, do you mean the bad impersonator from the audiobook or the real actor I want to hear?
the opposite
You would need a model of the original voice actor you want to hear
Okay, I used the /find command and already found one; it sounds good enough
Eh, you could run it locally; it should work, it just might be slow
If you want, there's cloud (remote good PC), which would be faster with a better GPU, though it's tied to a limited time
Don't care, I don't need it right away, how long could it take?
Do you want it to run on your PC (locally), or use a cloud service?
Sorry but we can't really estimate that
Even with a large margin, just indicatively, how long? Hour, hours, days?
I would rather choose my pc
Maybe an hour or more, but I'm not sure, I can tell you it won't take a day+ though
Okay, thank you, I will do it tomorrow or at the weekend, thanks for help. I will be updating this thread in case of needing help. Once again, thanks.
Alright yw
What exactly do you mean by voice model? There is a good model on Weights but I don't know how to download it
To Download a Model from Weights.com:
- Login
- Click the 3 dots at the right of the image of the model
- Click download
- Download Anyways
- Unzip the zip, and you might wanna rename the pth and index since all models on weights are renamed as 'model'
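The unzip-and-rename step above could also be scripted. This is only a minimal sketch using Python's standard library; the function name `extract_and_rename` and the example name `my_voice` are made up for illustration, and it assumes the zip contains at most one `.pth` and one `.index` file per folder:

```python
import zipfile
from pathlib import Path


def extract_and_rename(zip_path, dest_dir, new_name):
    """Unzip a downloaded model and rename its .pth/.index files.

    `new_name` is a hypothetical label chosen by the user; the files
    keep their original extensions.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)

    # Extract everything from the archive into the destination folder.
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(dest)

    renamed = []
    # sorted() materializes the file list before any renaming happens.
    for path in sorted(dest.rglob("*")):
        if path.is_file() and path.suffix.lower() in {".pth", ".index"}:
            target = path.with_name(new_name + path.suffix.lower())
            if target != path:
                if target.exists():
                    # Refuse to overwrite, e.g. two .pth files in one folder.
                    raise FileExistsError(f"{target} already exists")
                path.rename(target)
            renamed.append(target)
    return renamed
```

For example, `extract_and_rename("model.zip", "models/my_voice", "my_voice")` would leave `my_voice.pth` and `my_voice.index` in that folder, assuming the archive contained `model.pth` and `model.index`.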
If you have an audio file with two different persons speaking
I don't know of any model that would separate them into two different files and preserve the timings
and I think that's what you want
all the advice above would replace ALL voices in the audio with the same voice
and I don't think that's what you want
Well, that's exactly what I want to do XD
there are no good models for separating people speaking together; they mostly compromise or ruin the result quality, or aren't effective at all
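For what it's worth, if the time ranges where only that one character speaks were known (say, marked by hand or by some speaker-detection tool, which is an assumption here and not something confirmed in this thread), the "replace only that voice and keep the timings" part could be sketched like this. It assumes you converted the whole file with the voice model, so `converted` has the same length as `original`; the function name `splice_converted` and its parameters are hypothetical, and it does nothing for lines where people talk over each other:

```python
def splice_converted(original, converted, segments, sample_rate, fade_ms=10):
    """Return a copy of `original` where samples inside `segments`
    are taken from `converted`, with a short linear crossfade.

    original, converted: equal-length sequences of mono samples (floats)
    segments: list of (start_seconds, end_seconds) for the target speaker
    """
    if len(original) != len(converted):
        raise ValueError("original and converted must have the same length")

    n = len(original)
    fade = int(sample_rate * fade_ms / 1000)  # crossfade length in samples
    out = list(original)

    for start_s, end_s in segments:
        # Convert seconds to sample indices, clamped to the file length.
        start = max(0, int(round(start_s * sample_rate)))
        end = min(n, int(round(end_s * sample_rate)))
        for i in range(start, end):
            # Weight of the converted audio: ramps up at the start of the
            # segment and back down at the end to avoid clicks.
            w = 1.0
            if fade > 0:
                w = min(1.0, (i - start + 1) / fade, (end - i) / fade)
            out[i] = (1.0 - w) * original[i] + w * converted[i]
    return out
```

Everything outside the listed ranges is left untouched, so the other actors keep their own voices and the total length stays the same.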
@lime patio btw how's it going? You could also try some karaoke model to separate the vocals
I just listened to the audio drama normally. After some time I managed to get used to the impersonator's voice, and the plot was great, so it wasn't that much of an issue
Although I still experimented in Weights with Capaldi AI
I might make another post later to ask about some technical stuff with AI, but this little project I'm making is far from even being started; I have a lot of things going on around me now
So should this post be kept open?
Oh, sorry, I didn't know I should close it. Of course it can be closed.