How to get rid of AI vocal tone | AI HUB

abstract elk Jun 2, 2024, 11:18 PM

#

theres no way to remove "ai vocal tone"

#

thats impossible lmao

#

it depends

#

on inference

plain herald Jun 2, 2024, 11:22 PM

#

can i refine my inference to prevent it somehow?

#

ive heard examples of almost flawless sounding ais without that sort of tonality

#

i feel like there has to be something i can do to aid it

flint pilot Jun 3, 2024, 3:12 AM

#

plain herald can i refine my inference to prevent it somehow?

Use RMVPE

plain herald Jun 3, 2024, 3:13 AM

#

That's how it sounds using RMVPE during inferencing

flint pilot Jun 3, 2024, 3:15 AM

#

plain herald That's how it sounds using RMVPE during inferencing

Try decreasing index ratio

#

It will have less accent but will sound better

#

And make sure you're using clean .wav vocals
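
A rough sketch of why a lower index ratio trades accent for smoothness. It assumes the index ratio works as a simple linear mix between the features of the input vocal and the features retrieved from the trained voice's index. The function name and the toy numbers are made up for illustration and are not RVC's actual code.

```python
def blend_features(source_feats, retrieved_feats, index_ratio):
    """Linearly mix per-frame features; index_ratio=0 keeps the source untouched."""
    if not 0.0 <= index_ratio <= 1.0:
        raise ValueError("index_ratio must be between 0 and 1")
    if len(source_feats) != len(retrieved_feats):
        raise ValueError("feature sequences must have the same length")
    return [
        index_ratio * r + (1.0 - index_ratio) * s
        for s, r in zip(source_feats, retrieved_feats)
    ]


# Toy numbers only: 0.0 stands for "input singer", 1.0 for "trained voice".
source = [0.0, 0.0, 0.0]
retrieved = [1.0, 1.0, 1.0]
for ratio in (0.75, 0.5, 0.25):
    print(ratio, blend_features(source, retrieved, ratio))
```

Under that assumption, dropping the ratio pulls the result toward the input performance, which would explain getting less of the target's accent in exchange for fewer retrieval artifacts.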

plain herald Jun 3, 2024, 3:16 AM

#

this is the input for inferencing, is this clean enough?

flint pilot Jun 3, 2024, 3:17 AM

#

plain herald this is the input for inferencing, is this clean enough?

Don’t post copyrighted content

#

Delete

#

Make sure it’s wav or flac

#

(Don’t convert mp3 to wav, that won’t do anything)

#

Audio downloaded from YouTube is too compressed as well
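
One way to double check what a file really is before inferencing is to look at its first bytes instead of trusting the extension. The helper below is a minimal sketch and its name is made up. It only catches a lossy file that was renamed to .wav. An mp3 that was properly re-encoded to wav will show a genuine WAV header while still carrying the compression damage, so this check cannot replace knowing where the audio came from.

```python
def sniff_audio_container(path):
    """Return 'wav', 'flac', 'mp3' or 'unknown' based on the first bytes of the file."""
    with open(path, "rb") as f:
        head = f.read(12)
    # WAV: a RIFF container whose form type is WAVE.
    if head[:4] == b"RIFF" and head[8:12] == b"WAVE":
        return "wav"
    # FLAC streams start with the marker "fLaC".
    if head[:4] == b"fLaC":
        return "flac"
    # MP3 files usually start with an ID3 tag or an MPEG frame sync
    # (0xFF followed by a byte whose top three bits are set).
    if head[:3] == b"ID3":
        return "mp3"
    if len(head) >= 2 and head[0] == 0xFF and (head[1] & 0xE0) == 0xE0:
        return "mp3"
    return "unknown"
```

Calling `sniff_audio_container("vocals.wav")` (a hypothetical file name) and getting back `"mp3"` would mean the file was only renamed and should be replaced with a real lossless source.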

plain herald Jun 3, 2024, 3:18 AM

#

Yeah it is not from YouTube

#

let me test with a raw wav file

flint pilot Jun 3, 2024, 3:19 AM

#

plain herald let me test with a raw wav file

Make sure to isolate vocals

plain herald Jun 3, 2024, 3:20 AM

#

I always do that using BS-RoFormer via UVR

flint pilot Jun 3, 2024, 3:20 AM

#

k

plain herald Jun 3, 2024, 3:25 AM

#

That is definitely an improvement, is there anything I can do to aid it more, or is that about the max I can do?

cobalt socket Jun 3, 2024, 12:57 PM

#

plain herald That is definitely an improvement, is there anything I can do to aid it more, or...

I don't think so.

#

The only way to get an almost 1:1 model of an artist done is using raw vocal takes from sessions.

#

At least i think so.

plain herald Jun 3, 2024, 12:59 PM

#

That is using raw vocal takes

#

The only problem with the session is that a lot of takes are the same line repeated

#

Just in a different tone or way

#

Does it make sense to include that in my dataset?

cobalt socket Jun 3, 2024, 3:50 PM

#

plain herald Does it make sense to include that in my dataset?

I think it doesn't hurt to use repeated lines with different tone.

#

As long as these are raw

plain herald Jun 3, 2024, 7:39 PM

#

yeah they're fully raw

#

no processing at all

cobalt socket Jun 3, 2024, 8:30 PM

#

plain herald no processing at all

Then it should be fine.

plain herald Jun 3, 2024, 9:53 PM

#

cobalt socket Then it should be fine.

How much data is too much data though?

#

I'm refusing to put some sessions in because they were recorded differently

#

as in the acoustics and whatnot

#

but i have a good amount of raw raw data

cobalt socket Jun 3, 2024, 9:54 PM

#

plain herald How much data is too much data though?

Umm.. over 30-35 mins is too much data

#

With less than 30 mins it should be fine if the audio is raw anyway

#

But you can also use as much data as you want
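
To see where a dataset sits relative to the 30 minute ballpark mentioned above, the total length can be summed up before training. This is a minimal sketch using only Python's standard `wave` module, which reads plain PCM .wav files and may reject other encodings. The function name and the folder layout (all clips directly inside one folder) are assumptions.

```python
import wave
from pathlib import Path


def dataset_minutes(folder):
    """Sum the duration of every .wav file directly inside `folder`, in minutes."""
    total_seconds = 0.0
    for wav_path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(wav_path), "rb") as w:
            # Duration = number of sample frames / frames per second.
            total_seconds += w.getnframes() / w.getframerate()
    return total_seconds / 60.0
```

For example, `print(round(dataset_minutes("dataset/raw_sessions"), 1))` with a hypothetical folder name prints the total in minutes, which makes it easier to decide whether to leave out sessions recorded under different acoustics.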

plain herald Jun 3, 2024, 9:56 PM

#

Alrightie, sounds good!

plain herald Jun 3, 2024, 9:56 PM

#

I'll train with more data and we will see how it goes

cobalt socket Jun 3, 2024, 9:56 PM

#

plain herald I'll train with more data and we will see how it goes

It's a matter of testing

plain herald Jun 3, 2024, 11:08 PM

#

Of course, different variants can turn out better

#

How do I know what to fine tune/change?

cobalt socket Jun 3, 2024, 11:16 PM

#

plain herald How do I know what to fine tune/change?

I can't really help you on that.

#

The only thing is, if your output sounds kinda noisy, you could double-check the dataset and clean it further

plain herald Jun 3, 2024, 11:23 PM

#

Okay fair

#

Trial and error is the way

#

Thank you for your help!

plain herald Jun 4, 2024, 1:54 AM

#

Alright, running it with 28 minutes of raw session data

#

Wish me luck

#

OOPS swapped d and g

#

LMAO

#

we're good now

plain herald Jun 4, 2024, 3:02 AM

#

its slightly different

cobalt socket Jun 4, 2024, 1:24 PM

#

plain herald we're good now

#

1 - I would suggest you retrain with Titan 32k

#

2 - You misplaced the paths to the G and D.

#

The G path field is where the path to the G file goes, not the D file

#

In the G path, swap the D file for the G one, and in the D path, swap the G file for the D one

#

3 - Maybe you can also try retraining with KLM4 Test 2
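
The swapped G and D paths from point 2 are easy to catch with a small check before starting a run. This is only a sketch. It assumes the pretrain file names start with the role letter, optionally after an `f0` prefix, which is a naming convention and not a guarantee. The helper names and the example file names are hypothetical.

```python
import re
from pathlib import PurePosixPath

# Assumption: the role letter (G or D) opens the file name, optionally after "f0",
# and is not the first letter of a longer word such as "Dataset".
_ROLE = re.compile(r"^(?:f0)?([GD])(?![A-Za-z])")


def guess_role(path):
    """Return 'G', 'D', or None if the file name gives no hint."""
    match = _ROLE.match(PurePosixPath(path).name)
    return match.group(1) if match else None


def check_pretrain_paths(g_path, d_path):
    """Return a list of warnings; an empty list means nothing looked off."""
    warnings = []
    if guess_role(g_path) == "D":
        warnings.append(f"G path points at what looks like a D file: {g_path}")
    if guess_role(d_path) == "G":
        warnings.append(f"D path points at what looks like a G file: {d_path}")
    return warnings


# Hypothetical file names, swapped on purpose.
for line in check_pretrain_paths("pretrained/D_32k.pth", "pretrained/G_32k.pth"):
    print(line)
```

File names that do not follow the assumed pattern produce no warning, so a clean result here does not prove the paths are right.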

plain herald Jun 4, 2024, 2:53 PM

#

cobalt socket 2 - You misplaced the paths to the G and D.

I did, and I fixed that right after

plain herald Jun 4, 2024, 2:53 PM

#

cobalt socket 1 - i would suggest you to retrain with Titan 32k

Why is that?

plain herald Jun 4, 2024, 2:53 PM

#

cobalt socket 3 - Maybe *you can also* try retraining with KLM4 Test 2

And this?

cobalt socket Jun 4, 2024, 2:54 PM

#

plain herald Why is that?

SCRFilms (a contributor) and I have both tested it, and 32k is kinda less artifacty

cobalt socket Jun 4, 2024, 2:55 PM

#

plain herald And this?

If your dataset has little to no noise, you can use that pretrain, since it was trained on singing vocals.

plain herald Jun 4, 2024, 2:56 PM

#

Oh okay, I'll attempt with both tonight and I'll share my results here again

#

Thank you so much for your help btw! I really appreciate it

plain herald Jun 4, 2024, 4:08 PM

#

trying with klm4 test 2

plain herald Jun 4, 2024, 4:29 PM

#

KLM4 is not working

#

Will retry with titan 32k

#

Nothing 32k is working

plain herald Jun 4, 2024, 4:45 PM

#

Is it the repo im using to train?

#

https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI

#

Theres no option for 32

cobalt socket Jun 4, 2024, 4:49 PM

#

plain herald Theres no option for 32

Try clicking on v1 and then click on V2 again

plain herald Jun 4, 2024, 11:14 PM

#

cobalt socket Try clicking on v1 and then click on V2 again

That fixed it!

#

I wonder why its like that

cobalt socket Jun 4, 2024, 11:14 PM

#

plain herald I wonder why its like that

Maybe it's a GUI bug

plain herald Jun 4, 2024, 11:15 PM

#

i presume so

#

thats pretty strange though haha

#

alright ill retrain and share my results

#

wish me luck!

cobalt socket Jun 4, 2024, 11:16 PM

#

plain herald wish me luck!

If anything comes up, you can also use the #✨│ai-help channel.

plain herald Jun 4, 2024, 11:16 PM

#

I will if needed, thank you!

plain herald Jun 5, 2024, 1:23 AM

#

im happy with this model :D thank you for your help!!

cobalt socket Jun 5, 2024, 1:36 AM

#

Wow, sounds almost real.

plain herald Jun 5, 2024, 1:36 AM

#

having raw sessions helped a lot

cobalt socket Jun 5, 2024, 1:37 AM

#

plain herald having raw sessions helped a lot

And using an HQ pretrain like KLMV2 helped too.

plain herald Jun 5, 2024, 1:37 AM

#

Yup especially klm 4 v2

cobalt socket Jun 5, 2024, 1:37 AM

#

plain herald Yup especially klm 4 v2

God i misspelled

#

It's KLM4V2

plain herald Jun 5, 2024, 1:38 AM

#

yeah i noticed that i just followed what you said haha

#

now to record in his tonality 😭

#

that's gonna be a process in and of itself

cobalt socket Jun 5, 2024, 1:39 AM

#

Also, please don't include instrumentals when showing samples.

#

Delete the files where you used insts.

plain herald Jun 5, 2024, 1:39 AM

#

done!
