#✨│ai-help | AI HUB | Page 203

crude flame Jan 5, 2025, 4:57 PM

#

T de-ess is

#

RX is not

#

but you can get RX "for free"

peak falcon Jan 5, 2025, 4:57 PM

#

cracked

crude flame Jan 5, 2025, 4:57 PM

#

perhaps

low shard Jan 5, 2025, 4:57 PM

#

crude flame but you can get RX "for free"

🏴‍☠️

peak falcon Jan 5, 2025, 4:58 PM

#

I have a de esser from FabFilter but I have no idea how to use it properly

#

It's called Pro-DS

proven hill Jan 5, 2025, 4:59 PM

#

please no cracked stuff pay full price.

peak falcon Jan 5, 2025, 5:01 PM

#

[image attachment]

#

Everything cracked

crude flame Jan 5, 2025, 5:02 PM

#

peak falcon Everything cracked

you mean you paid full price, right? because you are an outstanding citizen

peak falcon Jan 5, 2025, 5:02 PM

#

Yes sir

#

Of course I did

#

especially FL Studio

crude flame Jan 5, 2025, 5:03 PM

#

ight, thank you for your honesty

bold yarrow Jan 5, 2025, 5:03 PM

#

peak falcon Everything cracked

someones about to get banned

peak falcon Jan 5, 2025, 5:04 PM

#

For using cracked software?

bold yarrow Jan 5, 2025, 5:04 PM

#

yes

#

isnt that in the rules

peak falcon Jan 5, 2025, 5:04 PM

#

I don't know, I didn't read them

#

I just joined for the Ai models because I tried to make a producer tag

low shard Jan 5, 2025, 5:06 PM

#

bold yarrow someones about to get banned

everyone 🏴‍☠️ it

#

it's literally said in our docs lol

bold yarrow Jan 5, 2025, 5:07 PM

#

oh

low shard Jan 5, 2025, 5:07 PM

#

bold yarrow isnt that in the rules

well sharing the link of it causes us troubles

bold yarrow Jan 5, 2025, 5:07 PM

#

well i embarrassed myself

low shard Jan 5, 2025, 5:07 PM

#

but just saying "google how to be a pirate" won't get u banned

bold yarrow Jan 5, 2025, 5:07 PM

#

could piss off your isp

#

well i just downloaded de-esser

#

but i can't find it anywhere

#

did i download a virus from ruislip or something

#

ill show them

peak falcon Jan 5, 2025, 5:12 PM

#

what DAW are you using?

bold yarrow Jan 5, 2025, 5:12 PM

#

daw?

peak falcon Jan 5, 2025, 5:13 PM

#

how do you mix the vocals?

bold yarrow Jan 5, 2025, 5:13 PM

#

with the instrumentals?

#

i use plain old audacity

peak falcon Jan 5, 2025, 5:13 PM

#

Oh

#

I have no idea how that works

tame mica Jan 5, 2025, 6:41 PM

#

get a free daw like reaper

#

audacity is not ideal for audio mixing

#

or you can yk "buy" other daws

proven hill Jan 5, 2025, 6:48 PM

#

reaper best

vapid mantle Jan 5, 2025, 7:30 PM

#

@hot ledge what should the Target Sample Rate be?

hot ledge Jan 5, 2025, 7:30 PM

#

32

#

@vapid mantle 32k

vapid mantle Jan 5, 2025, 7:31 PM

#

ok, thanks a lot

brittle wing Jan 5, 2025, 9:52 PM

#

can sm1 send me the voice changer file for windows? i cant find it

#

@tame mica

crystal jetty Jan 5, 2025, 10:02 PM

#

Hello everyone, can someone help me? I'm generally 0 in these matters(

azure marshBOT Jan 5, 2025, 10:02 PM

#

crystal jetty Hello everyone, can someone help me? I'm generally 0 in these matters(

Hey, crystal jetty! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:

General RVC help: #✨│ai-help
W-Okada / Realtime RVC: #🔍│help-w-okada
AI image related: #🔍│help-ai-art

low shard Jan 5, 2025, 10:49 PM

#

brittle wing can sm1 send me the voice changer file for windows? i cant find it

that's wokada, tell me ur pc gpu in #🔍│help-w-okada

low shard Jan 5, 2025, 10:49 PM

#

crystal jetty Hello everyone, can someone help me? I'm generally 0 in these matters(

in what?

latent cypress Jan 5, 2025, 11:18 PM

#

do you guys prefer klm 5.0 mini or klm 4.3 x2?

jaunty shale Jan 6, 2025, 12:43 AM

#

is there a way to convert .safetensors file to .pth file so I can use it in applio?

simple ore Jan 6, 2025, 12:46 AM

#

safetensors of what? gptsovits?

jaunty shale Jan 6, 2025, 12:49 AM

#

I used okada to make a merged voice

#

it created a safetensor file in the model_dir folder

#

#

i run it on browser (it works better than window one for me)

static oar Jan 6, 2025, 2:01 AM

#

so i just used Google Colab to make a voice model from audio clips, right? it works well with the voice changer, but i was wondering what i can use to apply that voice model to audio. any ideas?

rough star Jan 6, 2025, 2:09 AM

#

Does that mean that I have to create a new user on my PC?

lime otter Jan 6, 2025, 2:15 AM

#

-colab

azure marshBOT Jan 6, 2025, 2:15 AM

#

lime otter -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

simple ore Jan 6, 2025, 2:17 AM

#

rough star Does that mean that I have to create a new user on my PC?

it literally means do not run it as the Administrator user

#

you can run it as a normal user that has local admin privileges

red gale Jan 6, 2025, 2:21 AM

#

does anyone of you know any uhh,good free voice changers?

rough star Jan 6, 2025, 2:22 AM

#

simple ore you can run it as a normal user that has local admin privileges

Okay so double click wouldn't run it as admin right?

simple ore Jan 6, 2025, 2:25 AM

#

rough star Okay so double click wouldn't run it as admin right?

just download a compiled version, my dude

#

https://huggingface.co/IAHispano/Applio/tree/main/Compiled/Windows

knotty moth Jan 6, 2025, 6:21 AM

#

vapid mantle @hot ledge what should the Target Sample Rate be?

the result will always be bad even if you upscale it to 32k

fleet trail Jan 6, 2025, 7:54 AM

#

hey

#

i just formatted my pc

#

i wanna

#

reinstall it

latent kettle Jan 6, 2025, 11:02 AM

#

fleet trail reinstall it

What

proven hill Jan 6, 2025, 11:29 AM

#

fleet trail reinstall it

reinstall what?

proven hill Jan 6, 2025, 11:30 AM

#

red gale does anyone of you know any uhh,good free voice changers?

#🔍│help-w-okada

red gale Jan 6, 2025, 12:13 PM

#

proven hill #🔍│help-w-okada

Figured it out already,no worries

jaunty shale Jan 6, 2025, 1:52 PM

#

jaunty shale is there a way to convert .safetensors file to .pth file so I can use it in appl...

is there way to convert it so I don't have to make a dataset for the merged model?

bold yarrow Jan 6, 2025, 2:00 PM

#

hey guys

#

is there a website where i can accurately predict how many epochs my model would need without becoming either undertrained or overtrained

proven hill Jan 6, 2025, 2:02 PM

#

bold yarrow is there a website where i can accurately predict how many epochs my model would...

tensorboard

proven hill Jan 6, 2025, 2:02 PM

#

jaunty shale is there way to convert it so I don't have to make a dataset for the merged mode...

why would you do that, no its not possible

jaunty shale Jan 6, 2025, 2:02 PM

#

proven hill why would you do that, no its not possible

wanted to create a merged voice and use it to convert audio voices into that voice-

proven hill Jan 6, 2025, 2:03 PM

#

you can simply merge two voices

jaunty shale Jan 6, 2025, 2:03 PM

#

I can't.

#

I tried in applio it doesn't work anymore.

#

The merged voice I have has more than 2 voices combined

proven hill Jan 6, 2025, 2:03 PM

#

merge one at a time

#

also wdym it doesnt work anymore?

jaunty shale Jan 6, 2025, 2:04 PM

#

it shows error

#

I would have to check what kind of error in a bit

proven hill Jan 6, 2025, 2:06 PM

#

are the files youre merging the same sample rate?

jaunty shale Jan 6, 2025, 2:06 PM

#

Yep

proven hill Jan 6, 2025, 2:06 PM

#

can you show me the error?

jaunty shale Jan 6, 2025, 2:07 PM

#

will do

craggy brook Jan 6, 2025, 2:08 PM

#

Is there any other way to access the 15.ai site? Or is there a site as good as this site with better sound?

proven hill Jan 6, 2025, 2:08 PM

#

craggy brook Is there any other way to access the 15.ai site? Or is there a site as good as t...

what do you need?

craggy brook Jan 6, 2025, 2:09 PM

#

voicing for a character I want to use. realistically, with emotions and without distorting the tone of voice

proven hill Jan 6, 2025, 2:10 PM

#

craggy brook voicing for a character I want to use. realistically, with emotions and without ...

try weights.gg

jaunty shale Jan 6, 2025, 2:13 PM

#

ok now it worked-

#

..somehow

#

but I already have premade merged voice. Is there like a long enough audio to convert in real time?

proven hill Jan 6, 2025, 2:15 PM

#

jaunty shale but I already have premade merged voice. Is there like a long enough audio to co...

wdym

jaunty shale Jan 6, 2025, 2:16 PM

#

proven hill wdym

So you know how you can use voice in real time? Not only I can use my voice but also audio to convert into in real time pretty much. It works very well and I did it multiple times, but I just need some audio that is long enough to train the merged audio I did in okada.

#

(I use soundpad to make it work)

proven hill Jan 6, 2025, 2:17 PM

#

jaunty shale So you know how you can use voice in real time? Not only I can use my voice but ...

-rt

azure marshBOT Jan 6, 2025, 2:17 PM

#

proven hill -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

proven hill Jan 6, 2025, 2:17 PM

#

first link

solid arch Jan 6, 2025, 2:17 PM

#

yo

#

how can i use my gpu to rvc?

proven hill Jan 6, 2025, 2:17 PM

#

solid arch how can i use my gpu to rvc?

what do you need to do?
whats your gpu?

jaunty shale Jan 6, 2025, 2:18 PM

#

proven hill -rt

I have okada already.

proven hill Jan 6, 2025, 2:18 PM

#

jaunty shale I have okada already.

then what you need exactly?

#

they are already doing that

jaunty shale Jan 6, 2025, 2:19 PM

#

proven hill then what you need exactly?

I only need some audio to play through my soundboard to convert it to that merged voice and record it.
Some audio that is long enough, so I can make a dataset.

knotty moth Jan 6, 2025, 2:19 PM

#

proven hill they are already doing that

nvm I scrolled too much

solid arch Jan 6, 2025, 2:20 PM

#

proven hill what do you need to do? whats your gpu?

its stuck on cpu dawg

#

T-T

#

im using rx 570

proven hill Jan 6, 2025, 2:20 PM

#

jaunty shale I only need some audio to play through my soundboard to convert it to that merge...

i dont understand this at all honestly

solid arch Jan 6, 2025, 2:20 PM

#

or it is set that way

proven hill Jan 6, 2025, 2:20 PM

#

solid arch im using rx 570

cooked badly

solid arch Jan 6, 2025, 2:20 PM

#

idk abt anything but its working

solid arch Jan 6, 2025, 2:20 PM

#

proven hill cooked badly

very cooked

#

only works when i record

proven hill Jan 6, 2025, 2:21 PM

#

so no file input?

knotty moth Jan 6, 2025, 2:23 PM

#

solid arch how can i use my gpu to rvc?

if u mean the inference, use applio with zluda. I suppose it will be okay with at least 4 gb vram

heady gorge Jan 6, 2025, 2:23 PM

#

Do I really need winzip for the voice changer

proven hill Jan 6, 2025, 2:24 PM

#

you have to unpack it somehow.
but i suggest 7zip

solid arch Jan 6, 2025, 2:24 PM

#

best way to like stop cutting the audio out?

knotty moth Jan 6, 2025, 2:25 PM

#

heady gorge Do I really need winzip for the voice changer

7zip or winrar is preferable

proven hill Jan 6, 2025, 2:25 PM

#

solid arch best way to like stop cutting the audio out?

realtime?

solid arch Jan 6, 2025, 2:25 PM

#

proven hill Jan 6, 2025, 2:25 PM

#

knotty moth 7zip or winrar is preferable

hun, 7zip>>>

proven hill Jan 6, 2025, 2:25 PM

#

solid arch

set gpu to your gpu

heady gorge Jan 6, 2025, 2:25 PM

#

Why do voice changers have to cost?

proven hill Jan 6, 2025, 2:25 PM

#

set f0 to rmvpe

solid arch Jan 6, 2025, 2:25 PM

#

proven hill set gpu to your gpu

aint working tho only cpu

proven hill Jan 6, 2025, 2:25 PM

#

heady gorge Why do voice changers have to cost?

the one we have is free

proven hill Jan 6, 2025, 2:25 PM

#

solid arch aint working tho only cpu

download the modified version

#

-rt

azure marshBOT Jan 6, 2025, 2:25 PM

#

proven hill -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

proven hill Jan 6, 2025, 2:25 PM

#

first link

#

theres the guide

solid arch Jan 6, 2025, 2:26 PM

#

oh

heady gorge Jan 6, 2025, 2:26 PM

#

Yeah, but I need winzip

proven hill Jan 6, 2025, 2:26 PM

#

heady gorge Yeah, but I need winzip

download 7zip

heady gorge Jan 6, 2025, 2:26 PM

#

Is that free

knotty moth Jan 6, 2025, 2:26 PM

#

proven hill hun, 7zip>>>

both work at least

proven hill Jan 6, 2025, 2:26 PM

#

yes

solid arch Jan 6, 2025, 2:26 PM

#

i download it after

proven hill Jan 6, 2025, 2:26 PM

#

knotty moth both work at least

true

knotty moth Jan 6, 2025, 2:26 PM

#

heady gorge Is that free

7zip is open source

solid arch Jan 6, 2025, 2:26 PM

#

this

jaunty shale Jan 6, 2025, 2:26 PM

#

proven hill i dont understand this at all honestly

..okay let me explain it differently.

I use Voicemeeter Banana to make this happen. B2 is input device that is used to convert anything that can be played in it. B1 is something that will be played through the actual microphone. When I play audio through soundboard, I can play it in the B2 to convert it into the different voice in real time. (since I cannot convert it in applio). With that, I can make a dataset.

heady gorge Jan 6, 2025, 2:26 PM

#

knotty moth 7zip is open source

Can you send me the link

proven hill Jan 6, 2025, 2:26 PM

#

solid arch this

you have AMD

proven hill Jan 6, 2025, 2:27 PM

#

heady gorge Can you send me the link

google is free

solid arch Jan 6, 2025, 2:27 PM

#

jaunty shale I have okada already.

so ur telling me u want to feed some audio into okada so okada converts it into a different voice and plays it thru ur mic

#

audio coming from ur soundboard?

#

soundboard > okada > mic

jaunty shale Jan 6, 2025, 2:27 PM

#

solid arch ure telling me that u want some audios to be converted into a voice which is to ...

correct. So I can make a dataset.

#

I really don't wanna spend more hours to make it in applio again.

solid arch Jan 6, 2025, 2:28 PM

#

i use voicemod for the soundboard

#

works clearly when i use it

heady gorge Jan 6, 2025, 2:28 PM

#

Are you sure this is safe

solid arch Jan 6, 2025, 2:28 PM

#

sends any kind of audio thru my mic

proven hill Jan 6, 2025, 2:28 PM

#

jaunty shale ..okay let me explain it differently. I use Voicemeeter Banana to make this hap...

you wanna make a dataset out of converted audios?

proven hill Jan 6, 2025, 2:28 PM

#

heady gorge Are you sure this is safe

yes, and if you dont trust it, check the internet

jaunty shale Jan 6, 2025, 2:28 PM

#

proven hill you wanna make a dataset out of converted audios?

yes. I did it before in applio and it worked out just fine.

proven hill Jan 6, 2025, 2:29 PM

#

jaunty shale yes. I did it before in applio and it worked out just fine.

why would you make a dataset of an already existing model

solid arch Jan 6, 2025, 2:29 PM

#

imma extract the file

jaunty shale Jan 6, 2025, 2:29 PM

#

because that's not a .pth file...

#

I cannot use safetensor file in applio

proven hill Jan 6, 2025, 2:30 PM

#

where do you eeveeen get a safetensor file

#

TO WORK IN OKADA TOO

jaunty shale Jan 6, 2025, 2:31 PM

#

#

I never knew it makes a different file until I merged it yesterday.

knotty moth Jan 6, 2025, 2:32 PM

#

jaunty shale I cannot use safetensor file in applio

use the original model file before you added it into the voice changer

heady gorge Jan 6, 2025, 2:32 PM

#

I just installed 7zip

#

What's the link to the voice changer

jaunty shale Jan 6, 2025, 2:33 PM

#

merged voice makes only a safetensor file.

knotty moth Jan 6, 2025, 2:34 PM

#

jaunty shale merged voice makes only a safetensor file.

only if you do it within the voice changer

proven hill Jan 6, 2025, 2:34 PM

#

heady gorge What's the link to the voice changer

-rt

azure marshBOT Jan 6, 2025, 2:34 PM

#

proven hill -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

proven hill Jan 6, 2025, 2:34 PM

#

first link

proven hill Jan 6, 2025, 2:34 PM

#

knotty moth only if you do it within the voice changer

that explains it

#

yea you need to merge in mainline (discontinued), applio (suggested) or ilaria rvc mainline (discontinued)

jaunty shale Jan 6, 2025, 2:34 PM

#

knotty moth only if you do it within the voice changer

in okada, yes.

heady gorge Jan 6, 2025, 2:35 PM

#

I'm not good with PC stuff But I can try

proven hill Jan 6, 2025, 2:35 PM

#

heady gorge I'm not good with PC stuff But I can try

just follow the guide, its easy :)

solid arch Jan 6, 2025, 2:36 PM

#

chat why im on my browser 😭

jaunty shale Jan 6, 2025, 2:36 PM

#

I'll just figure it out. I only need a long audio from youtube.

solid arch Jan 6, 2025, 2:36 PM

#

where i should put my models?

#

proven hill Jan 6, 2025, 2:37 PM

#

please follow the guide

proven hill Jan 6, 2025, 2:37 PM

#

jaunty shale I'll just figure it out. I only need a long audio from youtube.

again follow what i said

low shard Jan 6, 2025, 2:37 PM

#

solid arch chat why im on my browser 😭

That's wokada, use #🔍│help-w-okada

solid arch Jan 6, 2025, 2:37 PM

#

oh

#

mb ggang

jaunty shale Jan 6, 2025, 2:37 PM

#

proven hill again follow what i said

?

proven hill Jan 6, 2025, 2:38 PM

#

jaunty shale ?

dont merge with the voice changer

heady gorge Jan 6, 2025, 2:38 PM

#

Okay. Can I have the link to the voice changer

proven hill Jan 6, 2025, 2:38 PM

#

-rt

azure marshBOT Jan 6, 2025, 2:38 PM

#

proven hill -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

proven hill Jan 6, 2025, 2:38 PM

#

first link

jaunty shale Jan 6, 2025, 2:38 PM

#

proven hill dont merge with the voice changer

oh right, will keep in mind for now, like I said I never knew it makes safetensor file in the first place.

proven hill Jan 6, 2025, 2:39 PM

#

jaunty shale oh right, will keep in mind for now, like I said I never knew it makes safetenso...

merged files in applio will be in pth, exactly what you need

heady gorge Jan 6, 2025, 2:39 PM

#

I don't see the Voice changer in 7zip

low shard Jan 6, 2025, 2:40 PM

#

heady gorge I don't see the Voice changer in 7zip

Realtime voice changer for calls? Tell me ur PC GPU in #🔍│help-w-okada

heady gorge Jan 6, 2025, 2:41 PM

#

it keeps on changing

craggy brook Jan 6, 2025, 2:43 PM

#

proven hill try weights.gg

How to make characters voices with emotions such as angry, confused, sad etc.

proven hill Jan 6, 2025, 2:44 PM

#

craggy brook How to make characters voices with emotions such as angry, confused, sad etc.

you cant

#

assuming youre using tts

craggy brook Jan 6, 2025, 2:45 PM

#

proven hill assuming youre using tts

what ?

proven hill Jan 6, 2025, 2:46 PM

#

craggy brook what ?

are you using text to make them speak?

craggy brook Jan 6, 2025, 2:46 PM

#

yes

proven hill Jan 6, 2025, 2:47 PM

#

craggy brook yes

then you cant

winged crane Jan 6, 2025, 2:48 PM

#

yo mods can i get permission to share my screen?

proven hill Jan 6, 2025, 2:49 PM

#

winged crane yo mods can i get permission to share my screen?

for?

winged crane Jan 6, 2025, 2:50 PM

#

i wannashre my screen when im playing dragonball sparkeling zero with a friend

#

if thats ok?

#

share

proven hill Jan 6, 2025, 2:50 PM

#

idk

#

@low shard

winged crane Jan 6, 2025, 2:50 PM

#

you can join and see aswell if you want to

knotty moth Jan 6, 2025, 2:52 PM

#

winged crane you can join and see aswell if you want to

ask mods for streaming permission, and be sure to avoid showing any inappropriate stuff

winged crane Jan 6, 2025, 2:53 PM

#

yea im not doing dat im married

#

thx a lot

hallow thistle Jan 6, 2025, 2:53 PM

#

proven hill Jan 6, 2025, 2:54 PM

#

laught

winged crane Jan 6, 2025, 2:56 PM

#

@quick jungle can i get permission to share my screen

quick jungleBOT Jan 6, 2025, 2:56 PM

#

winged crane @quick jungle can i get permission to share my screen

Watchu want? :3

proven hill Jan 6, 2025, 2:56 PM

#

its a bot

winged crane Jan 6, 2025, 2:56 PM

#

oh

#

lol

flint geyser Jan 6, 2025, 3:26 PM

#

yo do you want me to still fix it

low shard Jan 6, 2025, 3:28 PM

#

flint geyser yo do you want me to still fix it

I mean if u could fix it it would be good, but it's been left to rot for months

proven hill Jan 6, 2025, 3:28 PM

#

yea dont waste your time

#

people use ilaria rvc zero anyway

peak falcon Jan 6, 2025, 4:47 PM

#

Eight by Eight

#

36 zero waist

honest junco Jan 6, 2025, 7:08 PM

#

Sorry, I tried a couple of times but my RVC keeps showing "Frequent errors occur. Please check if the model of the framework being targeted is loaded."

#

And my colab is showing that my server "is not an accepted origin. (further occurrences of this error will be logged with level INFO)"

#

I tried to search on github but I still can't find any way to fix it

#

f_cry

#

(Fun story: I spent 3 hours doing pip install pip==24.0)

proven hill Jan 6, 2025, 7:12 PM

#

honest junco (Fun story I used 3 hours to do pip install pip==24.0

download a precompiled version

honest junco Jan 6, 2025, 7:13 PM

#

proven hill download a precompiled version

What that mean (Sorry I'm a script nub

proven hill Jan 6, 2025, 7:14 PM

#

honest junco What that mean (Sorry I'm a script nub

it comes with everything preinstalled basically

honest junco Jan 6, 2025, 7:14 PM

#

Oh you mean the rar

#

I can'

proven hill Jan 6, 2025, 7:14 PM

#

honest junco Oh you mean the rar

ye

honest junco Jan 6, 2025, 7:14 PM

#

can't*

#

I tried

proven hill Jan 6, 2025, 7:14 PM

#

why?

honest junco Jan 6, 2025, 7:14 PM

#

it has 20k ms response time

#

💀

#

Thats why I need to use colab

proven hill Jan 6, 2025, 7:15 PM

#

what colab are you using?

honest junco Jan 6, 2025, 7:15 PM

#

google colab

proven hill Jan 6, 2025, 7:16 PM

#

yes i mean

#

give me the link

honest junco Jan 6, 2025, 7:16 PM

#

Alr

honest junco Jan 6, 2025, 7:21 PM

#

proven hill give me the link

dmed u

proven hill Jan 6, 2025, 7:22 PM

#

honest junco dmed u

please dont dm me, send it here

#

also i saw it was ngrok

honest junco Jan 6, 2025, 7:22 PM

#

alr mb

proven hill Jan 6, 2025, 7:22 PM

#

-ngrok

honest junco Jan 6, 2025, 7:22 PM

#

yeah

proven hill Jan 6, 2025, 7:22 PM

#

damn

honest junco Jan 6, 2025, 7:22 PM

#

https://colab.research.google.com/github/w-okada/voice-changer/blob/master/Realtime_Voice_Changer_on_Colab.ipynb#scrollTo=lLWQuUd7WW9U

#

This

proven hill Jan 6, 2025, 7:23 PM

#

ohhh its a voice changer

honest junco Jan 6, 2025, 7:23 PM

#

yeah

proven hill Jan 6, 2025, 7:23 PM

#

i think this is old

#

why not use your gpu?

honest junco Jan 6, 2025, 7:24 PM

#

Cuz when I use my gpu

#

the res of it

#

are 20k ms

#

basically it takes 20s to transfer my voice to u know

proven hill Jan 6, 2025, 7:26 PM

#

what gpu do you have?

honest junco Jan 6, 2025, 7:27 PM

#

honest junco Jan 6, 2025, 7:27 PM

#

proven hill what gpu do you have?

6800

#

lfg

proven hill Jan 6, 2025, 7:27 PM

#

are you using the forked version?

honest junco Jan 6, 2025, 7:27 PM

#

forked version?

proven hill Jan 6, 2025, 7:27 PM

#

the new version

honest junco Jan 6, 2025, 7:27 PM

#

Yes

proven hill Jan 6, 2025, 7:27 PM

#

better support for amd cards

honest junco Jan 6, 2025, 7:27 PM

#

1.5.3.18a

proven hill Jan 6, 2025, 7:28 PM

#

nah thats old

#

-rt

azure marshBOT Jan 6, 2025, 7:28 PM

#

proven hill -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

proven hill Jan 6, 2025, 7:28 PM

#

first link

honest junco Jan 6, 2025, 7:31 PM

#

alr tysm

low shard Jan 6, 2025, 7:32 PM

#

honest junco alr tysm

also use #🔍│help-w-okada

proven hill Jan 6, 2025, 7:32 PM

#

np

proven hill Jan 6, 2025, 7:32 PM

#

low shard also use #🔍│help-w-okada

https://tenor.com/view/boo-gif-19787475173016375

Tenor

honest junco Jan 6, 2025, 7:33 PM

#

low shard also use #🔍│help-w-okada

My brain was shorted

#

sorry

#

😔

low shard Jan 6, 2025, 7:33 PM

#

proven hill https://tenor.com/view/boo-gif-19787475173016375

skill issue

low shard Jan 6, 2025, 7:33 PM

#

honest junco sorry

dw all fine

proven hill Jan 6, 2025, 7:34 PM

#

low shard skill issue

smh

honest junco Jan 6, 2025, 7:34 PM

#

Fun fact I paid the google colab

#

proven hill Jan 6, 2025, 7:35 PM

#

honest junco Fun fact I paid the google colab

why….

honest junco Jan 6, 2025, 7:35 PM

#

I spent 56.5 for using that

honest junco Jan 6, 2025, 7:35 PM

#

proven hill why….

I thought google was p2w 💀

proven hill Jan 6, 2025, 7:36 PM

#

lmao

low shard Jan 6, 2025, 7:36 PM

#

honest junco I thought google was p2w 💀

u can use it for free about 4 hours daily (not guaranteed), or use ur own gpu

honest junco Jan 6, 2025, 7:36 PM

#

low shard u can use it for free about 4 hours daily (not guaranteed), or use ur own gpu

I even used the limit of ngrok

#

I need to use my 2nd google account to log in to use it Skullflushed

low shard Jan 6, 2025, 7:37 PM

#

honest junco I even used the limit of ngrok

u can use horizon

#

it's another tunnel

#

anyways, ur gpu should be good enough

honest junco Jan 6, 2025, 7:38 PM

#

honest junco Jan 6, 2025, 7:38 PM

#

low shard anyways, ur gpu should be good enough

More than enough ig

low shard Jan 6, 2025, 7:39 PM

#

honest junco

old colab too, https://colab.research.google.com/github/hinabl/voice-changer-colab/blob/master/Hina_Modified_Realtime_Voice_Changer_on_Colab.ipynb is the updated one

Google Colab

low shard Jan 6, 2025, 7:39 PM

#

honest junco More than enough ig

yeah, u should follow the wokada deiteris fork guide

proven hill Jan 6, 2025, 7:39 PM

#

let him download the fork

honest junco Jan 6, 2025, 7:39 PM

#

low shard yeah, u should follow the wokada deiteris fork guide

Idid

low shard Jan 6, 2025, 7:39 PM

#

ye i was just saying

honest junco Jan 6, 2025, 7:39 PM

#

setting up tho

sudden tree Jan 6, 2025, 10:23 PM

#

where can you find models like the melband roformer karaoke model by viper?

#

do they upload them I am too nervous to download some random ckpt on huggingface lmfao

proven hill Jan 6, 2025, 10:27 PM

#

sudden tree where can you find models like the melband roformer karaoke model by viper?

@viscid moss you got this

sudden tree Jan 6, 2025, 10:33 PM

#

he already sent me the mega but idk where he obtained file from

#

I wanna know the source

viscid moss Jan 6, 2025, 10:34 PM

#

sudden tree where can you find models like the melband roformer karoake model by viper?

The ViperX models are here:
https://github.com/TRvlvr/model_repo/releases/tag/all_public_uvr_models

And most of the UVR5 models

proven hill Jan 6, 2025, 10:36 PM

#

sudden tree he already sent me the mega but idk where he obtained file from

*she, btw

viscid moss Jan 6, 2025, 10:36 PM

#

sudden tree do they upload them I am too nervous to download some random ckpt on huggingface...

well u can check the model hash, that way you make sure it's the same original just re-uploaded
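
The hash check suggested here can be sketched in Python. This is only a minimal sketch, assuming the original uploader publishes a SHA-256 digest for the model file; the function names are hypothetical, and the path and expected digest are placeholders you would supply yourself.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel keeps reading until read() returns b""
        for block in iter(lambda: f.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """True if the local file's digest equals the published one (assumed SHA-256)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

If the two digests match, the re-uploaded file is byte-for-byte the same as the original; a mismatch only tells you the files differ, not why.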

proven hill Jan 6, 2025, 10:37 PM

#

also huggingface is safe :)

viscid moss Jan 6, 2025, 10:37 PM

#

ye

sudden tree Jan 6, 2025, 10:40 PM

#

thanks a lot guys

#

well doesnt huggingface literally virus check each file regardless

#

just wondering why isnt the viperx karaoke and stuff included in uvr 5 model download set

viscid moss Jan 6, 2025, 10:42 PM

#

sudden tree just wondering why isnt the viperx karaoke and stuff included in uvr 5 model dow...

anjok is working on that

sudden tree Jan 6, 2025, 10:42 PM

#

oh thats sick

viscid moss Jan 6, 2025, 10:42 PM

#

Will be available soon ig

sudden tree Jan 6, 2025, 10:42 PM

#

nice

#

thank you

#

i wonder why everyone recs the mvsep when the queue is ungodly lmfao

#

not worth

valid spruce Jan 6, 2025, 10:45 PM

#

What sample rate should I use?

proven hill Jan 6, 2025, 10:45 PM

#

valid spruce What sample rate should I use?

32k

valid spruce Jan 6, 2025, 10:46 PM

#

proven hill 32k

Alright, thanks for the help.

proven hill Jan 6, 2025, 10:46 PM

#

no problem!

sudden tree Jan 6, 2025, 10:50 PM

#

everyone always says you dont need to cut audio yourself, but I realized when I train with my own clips, my Crepe models even beat the RMVPE models! I think the auto clipping of applio causes the ai to be confused. Has anyone else experienced this?

analog obsidian Jan 6, 2025, 10:51 PM

#

sudden tree everyone always says you dont need to cut audio yourself, but I realized when I ...

wdym by train with my own clips
like, slicing the dataset yourself and disabling rvc's splitting?

sudden tree Jan 6, 2025, 10:51 PM

#

yeah

#

exactly

analog obsidian Jan 6, 2025, 10:52 PM

#

sudden tree exactly

are you using the script found in the docs? you're supposed to slice the whole dataset in chunks of 3 seconds with an overlap of 0.3
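
To make the "chunks of 3 seconds with an overlap of 0.3" idea concrete, here is a rough Python sketch that only computes the chunk boundaries. It is not the script from the docs; the function name is hypothetical, and it assumes the overlap is measured in seconds and that the last chunk is simply allowed to be shorter.

```python
def chunk_bounds_ms(total_ms, chunk_ms=3000, overlap_ms=300):
    """Return (start, end) pairs in milliseconds that cover the whole audio.

    Consecutive chunks share overlap_ms of audio, so each chunk starts
    chunk_ms - overlap_ms after the previous one.
    """
    if overlap_ms >= chunk_ms:
        raise ValueError("overlap must be shorter than the chunk length")
    step = chunk_ms - overlap_ms
    bounds = []
    start = 0
    while start < total_ms:
        end = min(start + chunk_ms, total_ms)
        bounds.append((start, end))
        if end == total_ms:
            break  # the audio is fully covered
        start += step
    return bounds


# A 10 second file gives four chunks, the last one shorter than 3 s
print(chunk_bounds_ms(10_000))
```

Working in integer milliseconds avoids floating point drift when the step (2.7 s here) is added many times.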

#

and crepe vs rmvpe the difference is subtle, crepe models are softer while rmvpe are more harsh

sudden tree Jan 6, 2025, 10:53 PM

#

i just basically use a audio ceiling to prevent white noise

#

and then split the audio into like 5 min chunks

#

rmvpe has always sounded better to me tho

analog obsidian Jan 6, 2025, 10:53 PM

#

sudden tree rmvpe has always sounded better to me tho

yes bc it's more robust and newer, crepe is literally from 2018

sudden tree Jan 6, 2025, 10:53 PM

#

i wish there was just a vid of someone training with the doc settings

analog obsidian Jan 6, 2025, 10:53 PM

#

rmvpe was made in 2023

sudden tree Jan 6, 2025, 10:54 PM

#

most vids just do what I do and throw the audio in the training

#

I did not use the settings in the doc lmfao

analog obsidian Jan 6, 2025, 10:54 PM

#

sudden tree and then split the audio into like 5 min chunks

this is bad, hifigan can't read files over 5 secs

#

you're not slicing it yourself, rvc is slicing the dataset for u

sudden tree Jan 6, 2025, 10:54 PM

#

no, but the pretrain split it for me automatically in applio

#

Yes i use the applio pre cut setting

analog obsidian Jan 6, 2025, 10:55 PM

#

so you didn't disable rvc splitting

sudden tree Jan 6, 2025, 10:55 PM

#

and the process

#

no I didnt

#

but still my old crepe model sounds better when i split it by hand

analog obsidian Jan 6, 2025, 10:55 PM

#

every training is different + batch size matters

sudden tree Jan 6, 2025, 10:55 PM

#

true, I just run a batch size of like 8 even though i have 12 GB vram bc i train in fp32

analog obsidian Jan 6, 2025, 10:55 PM

#

u can actually get different results using the same exact parameters

sudden tree Jan 6, 2025, 10:56 PM

#

was activating fp32 a big mistake maybe?

analog obsidian Jan 6, 2025, 10:56 PM

#

sudden tree was activating fp32 a big mistake maybe?

enabling fp32 is a W move

#

fp16 is too unstable
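
One way to see why fp16 can be fragile is to look at the number format itself. The sketch below uses Python's standard struct module to round values to half precision (fp16) and single precision (fp32). It only illustrates the formats; it makes no claim about how any particular trainer handles precision, and the helper name is hypothetical.

```python
import struct


def round_trip(x, fmt):
    """Round a Python float to the given struct format and back.

    "<e" is IEEE 754 half precision (fp16), "<f" is single precision (fp32).
    """
    return struct.unpack(fmt, struct.pack(fmt, x))[0]


tiny = 1e-8  # think of a very small gradient or weight update
print(round_trip(tiny, "<e"))    # 0.0, the value underflows in fp16
print(round_trip(tiny, "<f"))    # still a small non-zero number in fp32
print(round_trip(2049.0, "<e"))  # 2048.0, fp16 cannot represent 2049 exactly

try:
    round_trip(70000.0, "<e")
except OverflowError:
    print("70000.0 does not fit in fp16 (largest finite value is 65504)")
```

Small values vanishing and large values overflowing are the kind of effects that can make half precision training less stable than fp32, at the cost of fp32 using more memory.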

sudden tree Jan 6, 2025, 10:56 PM

#

or maybe i should deactivate the process audio preset in applio?

analog obsidian Jan 6, 2025, 10:56 PM

#

sudden tree or maybe i should deactivate the process audio preset in applio?

nope leave it enabled

sudden tree Jan 6, 2025, 10:56 PM

#

also should I make the input audio loud or just leave it as is?

#

some of the audio i train is raw vocal dataset and is quite

analog obsidian Jan 6, 2025, 10:57 PM

#

i can tell you the """right""""(not really) way to preprocess a dataset

sudden tree Jan 6, 2025, 10:57 PM

#

sure

analog obsidian Jan 6, 2025, 10:58 PM

#

so open audacity, open your dataset (if your dataset are multiple audio files, merge them into one audio before doing this), select the whole audio, find truncate silence and use these settings:

#

damn i forgot

#

before doing that, convert the dataset to mono

sudden tree Jan 6, 2025, 10:58 PM

#

why mono/

#

dont you lose data quality

analog obsidian Jan 6, 2025, 10:59 PM

#

bc rvc cant read stereo files

sudden tree Jan 6, 2025, 10:59 PM

#

oh shit so could that have ruined my training?

analog obsidian Jan 6, 2025, 10:59 PM

#

sudden tree oh shit so could that have ruined my training?

no bc applio converts it to mono anyways

sudden tree Jan 6, 2025, 10:59 PM

#

ah lmfao

analog obsidian Jan 6, 2025, 10:59 PM

#

but since you're doing this method, you should convert it to mono

sudden tree Jan 6, 2025, 10:59 PM

#

isnt truncate silence same as doing noise gate in fl

analog obsidian Jan 6, 2025, 10:59 PM

#

sudden tree isnt truncate silence same as doing noise gate in fl

no

#

this literally removes silences

#

and leaves only the speech audio

sudden tree Jan 6, 2025, 11:00 PM

#

ohhhhh

#

thats why im failing

#

its probably training on the silences?

analog obsidian Jan 6, 2025, 11:00 PM

#

so like this

analog obsidian Jan 6, 2025, 11:00 PM

#

sudden tree its probably training on the silences?

yup

#

after you have your truncate silence dataset do the next step

#

#

use these settings and you should be fine

#

only use simple mode if you truncated the silence

#

never use it for datasets that have silence

sudden tree Jan 6, 2025, 11:01 PM

#

ok thank you a lot for this

#

is there a full guide so i can train

#

My only question is why truncate instead of just using the auto setting?

analog obsidian Jan 6, 2025, 11:02 PM

#

sudden tree is there a full guide so i can train

https://docs.ai-hub.wtf/rvc/resources/training/

Training

Last update: Dec 24, 2024

analog obsidian Jan 6, 2025, 11:03 PM

#

sudden tree My only question is why truncate instead of just using the auto setting?

automatic mode leaves more silences, is not technically bad but you require more epochs to train

sudden tree Jan 6, 2025, 11:03 PM

#

ok thanks where do you find these links

analog obsidian Jan 6, 2025, 11:04 PM

#

-docs

azure marshBOT Jan 6, 2025, 11:04 PM

#

analog obsidian -docs

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

analog obsidian Jan 6, 2025, 11:04 PM

#

^

sudden tree Jan 6, 2025, 11:04 PM

#

thnx

#

what is batch size can you help me

azure marshBOT Jan 6, 2025, 11:04 PM

#

sudden tree what is batch size can you help me

Hey, hypeslxyer! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:

General RVC help: #✨│ai-help
W-Okada / Realtime RVC: #🔍│help-w-okada
AI image related: #🔍│help-ai-art

sudden tree Jan 6, 2025, 11:04 PM

#

is 8 batch size good for 13 min vocal data

analog obsidian Jan 6, 2025, 11:04 PM

#

sudden tree is 8 batch size good for 13 min vocal data

yes

sudden tree Jan 6, 2025, 11:04 PM

#

what about 12 or 4

analog obsidian Jan 6, 2025, 11:05 PM

#

4 is unsafe but works in some cases where the dataset is very monotone and repetitive

#

12 works in some cases as well

#

where 8 fails

sudden tree Jan 6, 2025, 11:05 PM

#

do you just use the built in pretrain?

analog obsidian Jan 6, 2025, 11:05 PM

#

yes, original pretrain

sudden tree Jan 6, 2025, 11:05 PM

#

ok just wondering if going to 12 batch size would help it

analog obsidian Jan 6, 2025, 11:06 PM

#

sudden tree ok just wondering if going to 12 batch size would help it

8 is safer

sudden tree Jan 6, 2025, 11:06 PM

#

hopefully truncate silence will remove all random noises in a studio sesh

analog obsidian Jan 6, 2025, 11:06 PM

#

as long as they're below -42.5 dB
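
For a rough feel of what a -42.5 dB threshold means, here is a minimal sketch that converts the level to a linear amplitude and checks whether a stretch of audio stays under it. It assumes float samples where 1.0 is full scale; `db_to_amplitude` and `is_below_threshold` are made-up helper names, not Audacity functions, and Audacity's own detection may work differently.

```python
def db_to_amplitude(db: float) -> float:
    """Convert a level in dBFS to a linear amplitude (1.0 = full scale)."""
    return 10 ** (db / 20)


def is_below_threshold(samples, threshold_db: float = -42.5) -> bool:
    """True if every sample's absolute value is under the threshold."""
    limit = db_to_amplitude(threshold_db)
    return all(abs(s) < limit for s in samples)


# Hypothetical sample values, only for illustration.
quiet_hiss = [0.002, -0.003, 0.001]   # peaks at roughly -50 dBFS
cough = [0.002, 0.2, -0.15]           # peaks at roughly -14 dBFS

print(round(db_to_amplitude(-42.5), 4))
print(is_below_threshold(quiet_hiss), is_below_threshold(cough))
```

So quiet room hiss would count as silence, while a cough or chair squeak that peaks above the threshold would be kept.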

sudden tree Jan 6, 2025, 11:06 PM

#

also do you personally use the melband roformer karaoke models to isolate leads?

analog obsidian Jan 6, 2025, 11:07 PM

#

sudden tree also do you personally use the melband roformer karaoke models to isolate leads?

yuh, only good model we have for vocals that have harmonies

sudden tree Jan 6, 2025, 11:07 PM

#

lmfao, audacity doesnt take m4as jeez

analog obsidian Jan 6, 2025, 11:08 PM

#

😭

glacial pollen Jan 6, 2025, 11:08 PM

#

sudden tree lmfao, audacity doesnt take m4as jeez

you can use ffmpeg

#

a must have for everyone working with audio

analog obsidian Jan 6, 2025, 11:09 PM

#

my beloved

sudden tree Jan 6, 2025, 11:09 PM

#

yeah it would just degrade quality lmfao

#

double compression

glacial pollen Jan 6, 2025, 11:09 PM

#

well no, m4a is not a codec

analog obsidian Jan 6, 2025, 11:09 PM

#

bro convert it to wav

glacial pollen Jan 6, 2025, 11:09 PM

#

it is a container

analog obsidian Jan 6, 2025, 11:09 PM

#

lmao

sudden tree Jan 6, 2025, 11:09 PM

#

oh i thought m4a was codec

glacial pollen Jan 6, 2025, 11:09 PM

#

could hold aac, opus or vorbis

#

nope

sudden tree Jan 6, 2025, 11:10 PM

#

nice for a while i thought youtube had opus but i dont see it anymore to download

glacial pollen Jan 6, 2025, 11:10 PM

#

it's kinda mp4 counterpart

#

just the difference is, it doesn't contain " video layer "

analog obsidian Jan 6, 2025, 11:10 PM

#

opus is still the best you can get from youtube

#

im still getting it

glacial pollen Jan 6, 2025, 11:10 PM

#

essentially, it's " MPEG-4 Audio Layer "

sudden tree Jan 6, 2025, 11:10 PM

#

lmfao now i gotta learn cmd ffmpeg fuck!

glacial pollen Jan 6, 2025, 11:10 PM

#

Ye, Opus is in fact a very good codec

#

for lossy stuff

#

direct successor of vorbis ( ogg )

analog obsidian Jan 6, 2025, 11:10 PM

#

.\ffmpeg -i audio.m4a audio.wav

glacial pollen Jan 6, 2025, 11:11 PM

#

^ ye, will unwrap the container and get it to wave

#

Tho as Lyery said, if you're working on audio from youtube, use yt-dlp.exe ( a cli tool )

#

it'll fetch the audio from yt servers in best possible available quality ( mostly opus and rarely aac ) And then convert that using ffmpeg
(( that's my exact workflow for ' yt sourced audio ' ))
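
The yt-dlp then ffmpeg workflow above could be scripted roughly like this. It is only a sketch: it assumes `yt-dlp` and `ffmpeg` are on PATH, the file names are placeholders, and the helper names are invented for illustration. The commands are built as lists and the actual calls are left commented out.

```python
import subprocess  # only needed if you uncomment the run() calls below


def ytdlp_extract_cmd(url: str) -> list[str]:
    """Build the yt-dlp call; -x asks it to extract audio only."""
    return ["yt-dlp", "-x", url]


def ffmpeg_to_wav_cmd(src: str, dst: str, sample_rate: int = 44100) -> list[str]:
    """Build an ffmpeg call that decodes src to 32-bit float PCM WAV."""
    return [
        "ffmpeg", "-i", src,
        "-c:a", "pcm_f32le",        # 32-bit float PCM
        "-ar", str(sample_rate),    # output sample rate
        dst,
    ]


print(ytdlp_extract_cmd("URL_GOES_HERE"))
print(ffmpeg_to_wav_cmd("audio.opus", "audio.wav"))

# To actually run them (placeholder names, adjust to your files):
# subprocess.run(ytdlp_extract_cmd("URL_GOES_HERE"), check=True)
# subprocess.run(ffmpeg_to_wav_cmd("audio.opus", "audio.wav"), check=True)
```

No channel downmix flag is set on purpose, so the stereo layout is left alone at this stage.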

sudden tree Jan 6, 2025, 11:12 PM

#

damn never knew m4a was a container does it just hold mp3 atp?

glacial pollen Jan 6, 2025, 11:12 PM

#

I believe aac

sudden tree Jan 6, 2025, 11:12 PM

#

glacial pollen it'll fetch the audio from yt servers in best possible available quality ( mostl...

i use ytdlp but cmd version and i can only see m4as and mp4s i just grab best m4a tbh

glacial pollen Jan 6, 2025, 11:12 PM

#

use -x arg

sudden tree Jan 6, 2025, 11:12 PM

#

damn wtf converting it to wav made filesize 10x

glacial pollen Jan 6, 2025, 11:12 PM

#

#

that gets you opus ( if it's available )

analog obsidian Jan 6, 2025, 11:13 PM

#

.\yt-dlp.exe -x url

glacial pollen Jan 6, 2025, 11:13 PM

#

else, aac or m4a ( still aac I believe. )

sudden tree Jan 6, 2025, 11:13 PM

#

yeah -x usually grabs the video tho ngl

glacial pollen Jan 6, 2025, 11:13 PM

#

well no

analog obsidian Jan 6, 2025, 11:13 PM

#

no

sudden tree Jan 6, 2025, 11:13 PM

#

oh sh

#

if wav is container, how does it make file size increase if bitrate stays exact same

glacial pollen Jan 6, 2025, 11:14 PM

#

The thing is

#

wave pcm is not using any compression

#

so effectively, whatever would be ( which is not as file comes from lossy compression )

#

gets ' 0 filled '

#

that's a thing that has to be done, no other way.

#

All the missing data is just filled in
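
Put as arithmetic: uncompressed PCM stores every sample at a fixed size, so the bitrate does not stay the same after conversion. The detail the lossy codec threw away is not restored, the file just gets bigger. A small sketch, assuming a 128 kbps lossy source and 16-bit stereo output (both numbers are assumptions for illustration, not measured from anyone's file):

```python
def pcm_bytes(sample_rate: int, channels: int, bit_depth: int, seconds: int) -> int:
    """Size of the raw audio data in an uncompressed PCM file."""
    return sample_rate * channels * (bit_depth // 8) * seconds


def lossy_bytes(bitrate_kbps: int, seconds: int) -> int:
    """Approximate size of a constant-bitrate lossy stream."""
    return bitrate_kbps * 1000 // 8 * seconds


wav_size = pcm_bytes(44100, 2, 16, 60)   # one minute, 16-bit stereo
aac_size = lossy_bytes(128, 60)          # one minute at an assumed 128 kbps

print(wav_size, aac_size, round(wav_size / aac_size, 1))
```

Under those assumptions the WAV comes out around 11 times larger, which lines up with the "10x" seen above.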

sudden tree Jan 6, 2025, 11:15 PM

#

ok so i am a noob

glacial pollen Jan 6, 2025, 11:15 PM

#

So yea, whatever you have or get from yt-dlp -> wave

#

that wave after editing / processing -> 32 bit float 44.1khz

sudden tree Jan 6, 2025, 11:15 PM

#

and i just did a zoom out on the audacity and the audio is completely peaked out now

#

tf

#

oh i zoomed on the db lmfao hahaha

sudden tree Jan 6, 2025, 11:16 PM

#

glacial pollen that wave after editing / processing -> 32 bit float 44.1khz

32 bit float wave?

glacial pollen Jan 6, 2025, 11:16 PM

#

yes

#

it's the bit depth

#

32 bit float is the " target end " for files that rvc processes anyways

sudden tree Jan 6, 2025, 11:16 PM

#

and if you are using different songs etc we can import multiple files into the training at different db normalizations?

glacial pollen Jan 6, 2025, 11:16 PM

#

you avoid potential issues during editing

sudden tree Jan 6, 2025, 11:17 PM

#

or should each seperate session be normalized to equal level

glacial pollen Jan 6, 2025, 11:17 PM

#

Well, the whole dynamics aspect of rvc is a lil skewed up anyways

#

Biggest issue is, if you mangle with dynamics on your own ( be it rms, peak norm or compression )

#

it can screw up model's ability to express itself well at high volumes. it'll cause tearing

#

so at best... if you have to, tame the peaks and maybe add a tiny bit of compression

sudden tree Jan 6, 2025, 11:18 PM

#

export audio as mono wav or stereo?

glacial pollen Jan 6, 2025, 11:18 PM

#

wave

sudden tree Jan 6, 2025, 11:18 PM

#

i forgot to make the audio mono lmfao

glacial pollen Jan 6, 2025, 11:18 PM

#

just copy one channel into a blank file
( aka, do not use any " merge channels " algos or such )

sudden tree Jan 6, 2025, 11:18 PM

#

?

analog obsidian Jan 6, 2025, 11:18 PM

#

sudden tree export audio as mono wav or stereo?

mono wav

sudden tree Jan 6, 2025, 11:18 PM

#

confused

#

i just downloaded audacity so i dont know how to do that

glacial pollen Jan 6, 2025, 11:18 PM

#

you wanna delete 1 channel from the audio
either L or R

#

and just save it as mono wave

analog obsidian Jan 6, 2025, 11:19 PM

#

sudden tree i just downloaded audacity so i dont know how to do that

tracks > mix > convert stereo to mono

glacial pollen Jan 6, 2025, 11:19 PM

#

Alternatively, copy / highlight just 1 channel of your choice and paste it over

#

Cause like, depending on what audacity does

sudden tree Jan 6, 2025, 11:19 PM

#

i see

glacial pollen Jan 6, 2025, 11:19 PM

#

if it fuses the channels / centers em, it's pretty bad

#

that's a " merged mono " and not true mono

sudden tree Jan 6, 2025, 11:19 PM

#

lyery is just trying to get me to fuse you are just saying delete one to prevent the wrongful merge and distortion

analog obsidian Jan 6, 2025, 11:19 PM

#

hear him not me

#

lol

sudden tree Jan 6, 2025, 11:19 PM

#

how to delete one track?

glacial pollen Jan 6, 2025, 11:19 PM

#

I mean, what he says isn't wrong

#

but just no ideal imo

analog obsidian Jan 6, 2025, 11:20 PM

#

he knows better than me

glacial pollen Jan 6, 2025, 11:20 PM

#

Because rather than raw mono, you get a fuse of channels, ish ( as long audacity does that which I am not 100% sure of

sudden tree Jan 6, 2025, 11:20 PM

#

yeah how do you do it

glacial pollen Jan 6, 2025, 11:20 PM

#

as it does kind of algo magic and averaging of phase and such

analog obsidian Jan 6, 2025, 11:20 PM

#

he told u above how

#

takes u a few clicks

glacial pollen Jan 6, 2025, 11:20 PM

#

hold on

sudden tree Jan 6, 2025, 11:21 PM

#

also clipping every 3 sec with a .3 sec overlap seems like a disaster intuitevely to me idk why

glacial pollen Jan 6, 2025, 11:21 PM

#

You'll have uhhh

sudden tree Jan 6, 2025, 11:21 PM

#

if you are training on only 3 sec you are guaranteed to get clipping on harmonization it seems like that would mess up the fluency

glacial pollen Jan 6, 2025, 11:22 PM

#

" split stereo track "

#

or so

#

#

Gonna be somewhere here

analog obsidian Jan 6, 2025, 11:22 PM

#

sudden tree if you are training on only 3 sec you are guaranteed to get clipping on harmoniz...

no

glacial pollen Jan 6, 2025, 11:22 PM

#

glacial pollen

Then select the other one ( which ever you want but I personally use RX and do params measurements on both channels to pick the better one ) and delete it

#

leaving only just 1 channel ( and so, you have your file mono in the end
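
Why keeping one channel can be safer than a merge, as a toy sketch. It assumes interleaved float samples (L, R, L, R, ...) and a downmix that simply averages the two channels. Whether a given editor does exactly that is an assumption here, and the worst case below (right channel is an inverted copy of left) is deliberately extreme.

```python
def pick_channel(interleaved, channel=0):
    """Keep one channel of interleaved stereo samples (0 = left, 1 = right)."""
    return interleaved[channel::2]


def average_downmix(interleaved):
    """Merge both channels by averaging each left/right pair."""
    left, right = interleaved[0::2], interleaved[1::2]
    return [(l + r) / 2 for l, r in zip(left, right)]


# Made-up stereo data where the right channel is phase-inverted.
stereo = [0.5, -0.5, 0.25, -0.25, -0.75, 0.75]

print(pick_channel(stereo, 0))    # left channel survives untouched
print(average_downmix(stereo))    # averaging cancels everything out
```

Real recordings are rarely fully out of phase, but partial phase differences cause partial cancellation in the same way.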

sudden tree Jan 6, 2025, 11:22 PM

#

maybe mine is mono lmfao i can select differently

#

haha

analog obsidian Jan 6, 2025, 11:22 PM

#

trolley

glacial pollen Jan 6, 2025, 11:22 PM

#

#

if it looks like so, it is stereo

sudden tree Jan 6, 2025, 11:23 PM

#

wait couldnt you just pan 100% stereo lmfao

glacial pollen Jan 6, 2025, 11:23 PM

#

Cause well ye, you have 2 channels visible

analog obsidian Jan 6, 2025, 11:23 PM

#

glacial pollen Then select the other one ( which ever you want but I personally use RX and do p...

im interested in this 🥹

glacial pollen Jan 6, 2025, 11:23 PM

#

#

I mean sure, panning

#

but it's just 1 click

#

#

then x one the other track

#

#

Done

#

That simple

sudden tree Jan 6, 2025, 11:24 PM

#

ah nice i figured it out thanks'

glacial pollen Jan 6, 2025, 11:24 PM

#

Nice

sudden tree Jan 6, 2025, 11:24 PM

#

analog obsidian no

i wonder why

glacial pollen Jan 6, 2025, 11:24 PM

#

In any case

#

always go for 44.1

#

for yt

sudden tree Jan 6, 2025, 11:24 PM

#

and you can train multiple files or do you need to merge them into 1 audio file

glacial pollen Jan 6, 2025, 11:24 PM

#

no difference really

#

But best is imo to just use 1 track, 1 file. And do processing on 1 file ( to keep the uniformness

sudden tree Jan 6, 2025, 11:25 PM

#

yeah

analog obsidian Jan 6, 2025, 11:25 PM

#

sudden tree i wonder why

uuuh i can't remember the explanation lmao 😭 but dw this method is fine

glacial pollen Jan 6, 2025, 11:27 PM

#

sudden tree also clipping every 3 sec with a .3 sec overlap seems like a disaster intuitevel...

well

#

the only reason overlap exists is to avoid the discontinuity in " context "

#

naturally, if you can afford to split it all on your own, properly, you can bypass it

#

But that's the best we have if it's automation

#

( I tried various methods to better it, sadly didn't work out well / significantly. Such as envelope or better rms methods )

#

Tho ye, dw about it. As lyery said, it's alright
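
The 3 s chunks with 0.3 s overlap can be pictured with a small sketch. This is not the actual RVC or Applio slicer, just an illustration of fixed-length windows that each start 2.7 s after the previous one, so neighbouring chunks share 0.3 s of context. `slice_bounds` is a made-up name.

```python
def slice_bounds(total_samples, sample_rate, length_s=3.0, overlap_s=0.3):
    """Return (start, end) sample indices for overlapping fixed-length chunks."""
    length = round(length_s * sample_rate)
    step = length - round(overlap_s * sample_rate)
    if step <= 0:
        raise ValueError("overlap must be shorter than the chunk length")

    bounds = []
    start = 0
    while start < total_samples:
        end = min(start + length, total_samples)
        bounds.append((start, end))
        if end == total_samples:
            break  # the last chunk reached the end of the audio
        start += step
    return bounds


# 9 seconds of audio at 40 kHz
for start, end in slice_bounds(9 * 40000, 40000):
    print(start / 40000, end / 40000)
```

Note the last chunk here is shorter than 3 s, which is the kind of leftover the discussion further down is about.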

sudden tree Jan 6, 2025, 11:29 PM

#

i see

#

thank you

#

the last question i have is regarding the normalization

#

like one of my sessions is higher peaks and normalization

#

just wondering if it will mess up training

#

should i lower the gain on the loud one

#

like i just ran a normalization of -10db to match to look of each

glacial pollen Jan 6, 2025, 11:39 PM

#

sudden tree like i just ran a normalization of -10db to match to look of each

tbf, normalization won't help here much either way ( not as much as compression which in the same time can break stuff but whatev. )

sudden tree Jan 6, 2025, 11:39 PM

#

i see so dont worry about it

glacial pollen Jan 6, 2025, 11:39 PM

#

But then, it all should go well if your audio comes from " same source "

sudden tree Jan 6, 2025, 11:39 PM

#

well its not

#

thats the problem

glacial pollen Jan 6, 2025, 11:39 PM

#

in that case, you can more or less match the " overal " volume levels

#

per clip / track

sudden tree Jan 6, 2025, 11:40 PM

#

some are way loiuder than others so i am just normalizing until the clips look the same

glacial pollen Jan 6, 2025, 11:40 PM

#

doesn't have to be perfect but it'll help

#

you can just do each track at -3 dB norm

#

that's because rvc does normalize each anyways ( each cut 3 sec segment )
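
Peak normalizing each track to -3 dB comes down to one gain factor per track. A minimal sketch, assuming float samples with 1.0 as full scale; `peak_normalize` is a hypothetical helper and not what Audacity or RVC run internally.

```python
def peak_normalize(samples, target_db=-3.0):
    """Scale samples so the loudest one sits at target_db (dBFS)."""
    peak = max(abs(s) for s in samples)
    if peak == 0:
        return list(samples)  # pure silence, nothing to scale
    gain = 10 ** (target_db / 20) / peak
    return [s * gain for s in samples]


# Made-up clips from two sessions recorded at very different levels.
loud_session = [0.9, -0.4, 0.2]
quiet_session = [0.1, -0.05, 0.02]

for clip in (loud_session, quiet_session):
    out = peak_normalize(clip)
    print(round(max(abs(s) for s in out), 4))
```

Both clips end up with the same peak, while the dynamics inside each clip stay as they were, since every sample gets the same gain.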

sudden tree Jan 6, 2025, 11:40 PM

#

o shit i just normalized one to -8db to match the other

#

ok i see

glacial pollen Jan 6, 2025, 11:40 PM

#

but ye, getting em to similar levels is a nice thing to do regardless

sudden tree Jan 6, 2025, 11:40 PM

#

is there a way to collapse all tracks in audacity to one continuous singular

glacial pollen Jan 6, 2025, 11:40 PM

#

Yes

#

That's actually the only reason I keep audacity ( and use it just for that lol )

sudden tree Jan 6, 2025, 11:41 PM

#

how?

#

yeah audacity is pretty fire now that i see it

glacial pollen Jan 6, 2025, 11:41 PM

#

#

tracks > align tracks > end to end

#

or however it's localized for you

sudden tree Jan 6, 2025, 11:41 PM

#

thank you

#

are you slovak?

glacial pollen Jan 6, 2025, 11:41 PM

#

Polish

analog obsidian Jan 6, 2025, 11:41 PM

#

glacial pollen doesn't have to be perfect but it'll help

do i need to use this before exporting one of the stereo tracks as mono?

sudden tree Jan 6, 2025, 11:41 PM

#

haha nice

#

im slav

glacial pollen Jan 6, 2025, 11:42 PM

#

Nicee haha

glacial pollen Jan 6, 2025, 11:42 PM

#

analog obsidian do i need to use this before exporting one of the stereo tracks as mono?

Oh, well no

#

the way I do it is copy 1 channel

#

and paste it into new blank file

#

length will auto match

analog obsidian Jan 6, 2025, 11:42 PM

#

oooh ok ok

#

ty

glacial pollen Jan 6, 2025, 11:42 PM

#

pick one that seems better btw, the channel

analog obsidian Jan 6, 2025, 11:42 PM

#

Okayge

glacial pollen Jan 6, 2025, 11:42 PM

#

for instance, one that has better sdr levels or dc or peaks, you know the deal

sudden tree Jan 6, 2025, 11:43 PM

#

do you recommend alternativing sides like left-right-left for clips or only using left-left-left

glacial pollen Jan 6, 2025, 11:43 PM

#

glacial pollen tracks > align tracks > end to end

After that you export it

#

and import again

#

Then you have one continuous file

sudden tree Jan 6, 2025, 11:43 PM

#

i see thanks

glacial pollen Jan 6, 2025, 11:43 PM

#

sudden tree do you recommend alternativing sides like left-right-left for clips or only usin...

Not really

#

as in, doesn't matter tbh

#

best channel per track

#

but if it's the same source..

#

for instance, 1 anime but different episodes

#

I always want to believe the recording session and so on was set more or less similar

#

so in that case I do pick the same channel throughout my project ( just in case ✨ ) (( Unless one is explicitly bad or worse

sudden tree Jan 6, 2025, 11:45 PM

#

damn the align track thing isnt working ugh!

glacial pollen Jan 6, 2025, 11:46 PM

#

how so?

sudden tree Jan 6, 2025, 11:46 PM

#

nevermind i figured it out

glacial pollen Jan 6, 2025, 11:46 PM

#

a

sudden tree Jan 6, 2025, 11:47 PM

#

should we pan center after splitting stereo?

#

@glacial pollen

glacial pollen Jan 6, 2025, 11:47 PM

#

no

sudden tree Jan 6, 2025, 11:47 PM

#

aight

glacial pollen Jan 6, 2025, 11:48 PM

#

At least I never do that

#

guess you can try one time

#

but I don't see any point in that personally

sudden tree Jan 6, 2025, 11:48 PM

#

shit i just figured out something

glacial pollen Jan 6, 2025, 11:48 PM

#

I guess the biggest clue whether you should try that or not is seeing how phase behaves
if channels differ significantly in that aspect, perhaps you could try

sudden tree Jan 6, 2025, 11:48 PM

#

if you export as mono and you have left and right tracks it just mutes the right one

#

hahahaha

#

ill just pan left on the rights

glacial pollen Jan 6, 2025, 11:49 PM

#

oh then problem's solved, if there's no mixing or centering algo involved

sudden tree Jan 6, 2025, 11:49 PM

#

haha

glacial pollen Jan 6, 2025, 11:49 PM

#

then you good to go

sudden tree Jan 6, 2025, 11:49 PM

#

thanks for all the help brother

#

means a lot

#

and then what settings for applio you recommend for the preprocess?

#

manual splitting setttings?

glacial pollen Jan 6, 2025, 11:50 PM

#

Given the most propable case for you uhhh, go for default

#

and do include preprocessing

#

it's the normalization + butterworth filtering ( 0-57hz iirc )

sudden tree Jan 6, 2025, 11:50 PM

#

i see i was just asking bc the laf dude was saying 3sec with .3 sec overlap

glacial pollen Jan 6, 2025, 11:50 PM

#

ye, that's the default

sudden tree Jan 6, 2025, 11:50 PM

#

i see

#

he said simple tho

glacial pollen Jan 6, 2025, 11:50 PM

#

automatic + 3 / 0.3

#

Simple can work too

#

but if you truncate stuff, that is
Silence truncation

sudden tree Jan 6, 2025, 11:51 PM

#

i see i guess since we already truncated so simple makes sense

#

i will do

#

thank you for the help

glacial pollen Jan 6, 2025, 11:51 PM

#

yup

#

Np man, best of luck

sudden tree Jan 6, 2025, 11:51 PM

#

is 8 batch size good for 15 min dataset?

#

and how many epochs you rec

glacial pollen Jan 6, 2025, 11:51 PM

#

It really depends
for instance I used to work with bs 12 / 14 and 16 for most of my above 10 or 13 min sets

#

yet sometimes that works like crap and 7,8, 9 are safer

#

As always I recommend bs range finding

#

train the model at: 4, 8, 12, 16 ( each for 400-500 epochs )

sudden tree Jan 6, 2025, 11:52 PM

#

oh wow, I didnt know people did that

glacial pollen Jan 6, 2025, 11:52 PM

#

if you're aiming for " perfectionist " model

sudden tree Jan 6, 2025, 11:52 PM

#

lots of effort haha

#

makes sense tho

glacial pollen Jan 6, 2025, 11:52 PM

#

well no, people do not do that

#

but I just recommend that workflow if you're a perfectionist like me lol

sudden tree Jan 6, 2025, 11:52 PM

#

yeah i am lmfao

glacial pollen Jan 6, 2025, 11:52 PM

#

( tho in reality, both learning rate and batch size should be picked individually )

#

Oh ye, in that case def go for that

#

and from there see on graphs + do some inference testing on various epochs

sudden tree Jan 6, 2025, 11:53 PM

#

yeah i dont even know how to modify learning rate in applio

glacial pollen Jan 6, 2025, 11:53 PM

#

n see which one does the best

#

from there, you can finetune it even further as in, do -/+ 1 batch from the base batch size ( one that performed the best )
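
The batch size range finding above, written out as a tiny plan. The coarse values come from the messages above; picking the "best" one is still done by ear and from the graphs, so the `best` value below is a placeholder, and `refine_around` is a made-up helper.

```python
COARSE_SWEEP = [4, 8, 12, 16]   # first pass, each trained for 400-500 epochs


def refine_around(best, minimum=1):
    """Second pass: try one batch size below and one above the best coarse value."""
    return [b for b in (best - 1, best + 1) if b >= minimum]


best = 8  # placeholder: whichever coarse run sounded best in inference tests
print(COARSE_SWEEP)
print(refine_around(best))
```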

sudden tree Jan 6, 2025, 11:53 PM

#

you rec 48k sample rate>

#

?

#

also how do you change LR in appluo

glacial pollen Jan 6, 2025, 11:54 PM

#

what's the frequency response of your files?

sudden tree Jan 6, 2025, 11:54 PM

#

not sure haha i am super noob

glacial pollen Jan 6, 2025, 11:54 PM

#

sudden tree also how do you change LR in appluo

nah, that thing don't touch was just giving examples

sudden tree Jan 6, 2025, 11:54 PM

#

lmfao

glacial pollen Jan 6, 2025, 11:54 PM

#

sudden tree not sure haha i am super noob

You can try spek software

sudden tree Jan 6, 2025, 11:54 PM

#

i usually just put 48k because its highest level

glacial pollen Jan 6, 2025, 11:54 PM

#

pretty basic but will do

sudden tree Jan 6, 2025, 11:54 PM

#

is that bad

glacial pollen Jan 6, 2025, 11:54 PM

#

model's sr should be more or less aligned with your files

#

with some minor exceptions

#

for example, a deviation of 1-2khz shouldn't hurt or 3

sudden tree Jan 6, 2025, 11:55 PM

#

can i use spectrograph on audacity

glacial pollen Jan 6, 2025, 11:55 PM

#

For instance, if I have somewhat imperfect audio ( can be compression ) that's ranging anywhere from 41 to 43khz or even 44

sudden tree Jan 6, 2025, 11:55 PM

#

its peaking at 19k

glacial pollen Jan 6, 2025, 11:55 PM

#

I'll use 48k model ( because those extra 2,3 or 4 khz does mean clarity and fidelity, esp in respiration

sudden tree Jan 6, 2025, 11:55 PM

#

well all my audio is ripped from youtube

#

so woulkdnt it peak 20khz

glacial pollen Jan 6, 2025, 11:55 PM

#

In that case 40khz model ye

sudden tree Jan 6, 2025, 11:55 PM

#

i wonder why

glacial pollen Jan 6, 2025, 11:56 PM

#

yt should never be used for 48k

sudden tree Jan 6, 2025, 11:56 PM

#

why 40khz if audio is hitting 20khz

#

oh it actually looks like its hitting 18khz

glacial pollen Jan 6, 2025, 11:56 PM

#

because that's nyquist range

sudden tree Jan 6, 2025, 11:56 PM

#

i see haha

glacial pollen Jan 6, 2025, 11:56 PM

#

Essentially

#

spectrograms

sudden tree Jan 6, 2025, 11:56 PM

#

damn you have to become a audio expert for this

glacial pollen Jan 6, 2025, 11:56 PM

#

for them you do *2 the sr and that's your true sr
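
The "times two" rule is the Nyquist relation: a file whose spectrogram tops out at some frequency only carries content up to a sample rate of twice that. A rough sketch of turning a spectrogram cutoff into a model choice, assuming pretrains at 32k, 40k and 48k and a "smallest rate that still covers it" rule. Both are assumptions for illustration, not a hard rule.

```python
PRETRAIN_RATES = (32000, 40000, 48000)  # assumed set of available pretrains


def effective_sample_rate(cutoff_hz):
    """Nyquist: content up to cutoff_hz needs a sample rate of 2 * cutoff_hz."""
    return 2 * cutoff_hz


def suggest_model_rate(cutoff_hz, rates=PRETRAIN_RATES):
    """Pick the smallest pretrain rate that covers the file's real content."""
    needed = effective_sample_rate(cutoff_hz)
    for rate in sorted(rates):
        if rate >= needed:
            return rate
    return max(rates)


print(suggest_model_rate(19000))   # spectrogram peaking around 19 kHz
print(suggest_model_rate(21500))   # content reaching about 21.5 kHz
```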

sudden tree Jan 6, 2025, 11:56 PM

#

so using 48khz was ruining my models possibly?

glacial pollen Jan 6, 2025, 11:57 PM

#

Quite possible yes

sudden tree Jan 6, 2025, 11:57 PM

#

wow!

#

thank you

glacial pollen Jan 6, 2025, 11:57 PM

#

because the models are trained on specific frequency ranges ( pretrained models

sudden tree Jan 6, 2025, 11:57 PM

#

i see

#

that makes sense due to pretrained

glacial pollen Jan 6, 2025, 11:57 PM

#

they are accustomed to working within a giving frequency spectrum ye

#

Yup

#

but it's not a 'hardcoded rule'

sudden tree Jan 6, 2025, 11:57 PM

#

damn thanks so i gotta download the legit studio rips to be able to go to the 48khz range

glacial pollen Jan 6, 2025, 11:57 PM

#

For instance

sudden tree Jan 6, 2025, 11:57 PM

#

or find raw vocals with really good mics

glacial pollen Jan 6, 2025, 11:58 PM

#

My Kurisu ( best model I ever made )
was 38-42 ( variable ) sr

#

yet trained on 48k

#

https://www.youtube.com/watch?v=2CW2Nyhtio8

YouTube

Codename;0

Kurisu Makise - Oki ni mesu mama / お気に召すまま by EVE「 AI Cover 」

One of my fave tracks from Eve. Remember back in my worst days I used to spam it a lot. Oh yeah, I kinda love how I don't have to readjust Kurisu's pitch with Eve's stuff, they just click on " 0 ". Enjoy ~

Original song by Eve and all people associated with the project:
https://www.youtube.com/watch?v=nROvY9uiYYk

Cover details
Inferenced ...


#

Yet she sounds nice, right

sudden tree Jan 6, 2025, 11:58 PM

#

yeah

glacial pollen Jan 6, 2025, 11:58 PM

#

So there's no strict strict rule, yet it's highly advisable to stick to what I mentioned yup

sudden tree Jan 6, 2025, 11:58 PM

#

yeah i wonder if the mismatch causes audio ripping or the glitching noises

glacial pollen Jan 6, 2025, 11:59 PM

#

not quite

#

it primarily affects the model's potential / generalization or generally adapting to your voice ( finetuning potential

sudden tree Jan 6, 2025, 11:59 PM

#

damn cant find the custom cutting in applio

sudden tree Jan 6, 2025, 11:59 PM

#

glacial pollen it primarily affects the model's potential / generalization or generally adaptin...

that makes sense actually haha

#

where is audio cutting setting located in training

#

cant seem to find

glacial pollen Jan 7, 2025, 12:04 AM

#

show ss

sudden tree Jan 7, 2025, 12:05 AM

#

#

i just see this

#

i cant customize the cutting @glacial pollen

glacial pollen Jan 7, 2025, 12:06 AM

#

Which applio version you running?

#

Seems like outdated one

sudden tree Jan 7, 2025, 12:06 AM

#

newest

glacial pollen Jan 7, 2025, 12:06 AM

#

hmmm

sudden tree Jan 7, 2025, 12:06 AM

#

i saw it a while ago it disappeared for somereason

glacial pollen Jan 7, 2025, 12:06 AM

#

show me the full ui ss

#

upper part

sudden tree Jan 7, 2025, 12:07 AM

#

im just gonna reboot rq

analog obsidian Jan 7, 2025, 12:08 AM

#

Ah thats the latest compiled, yes its outdated

sudden tree Jan 7, 2025, 12:08 AM

#

so fucking weird i cant find the simple cutting @analog obsidian

analog obsidian Jan 7, 2025, 12:08 AM

#

use latest main branch repo

glacial pollen Jan 7, 2025, 12:08 AM

#

ah, if it's compiled and not from repo

sudden tree Jan 7, 2025, 12:08 AM

#

im using 3.2.8

glacial pollen Jan 7, 2025, 12:08 AM

#

then outdated

#

ye but precompiled / zip packages aren't updated in-line with repo atm

sudden tree Jan 7, 2025, 12:09 AM

#

can i just not use simple lmfao

glacial pollen Jan 7, 2025, 12:09 AM

#

download the repo and use 3.2.8's env folder

#

and if that doesn't work, delete the borrowed 3.2.8's env folder and redownload all ( using install-applio .bat file )

#

Lyery will help you hopefully as I have to get back to my work

sudden tree Jan 7, 2025, 12:09 AM

#

alr

#

imma just use default atp

knotty moth Jan 7, 2025, 12:13 AM

#

sudden tree im using 3.2.8

without the bugfix is broken

sudden tree Jan 7, 2025, 12:14 AM

#

wym

#

it is bugfixed 328

analog obsidian Jan 7, 2025, 12:15 AM

#

knotty moth without the bugfix is broken

tldr of this convo:
we teach him the truncate silence method of slicing

#

he can't do it because he's using the latest compiled version, which is outdated

#

#

just do this and decompress it in your applio folder

#

don't use run-install.bat

#

no need to reinstall anything

#

run applio

knotty moth Jan 7, 2025, 12:18 AM

#

or use codename's fork

sudden tree Jan 7, 2025, 12:49 AM

#

i mean @analog obsidian i can still use my current version and just use default splitting?

analog obsidian Jan 7, 2025, 12:49 AM

#

sudden tree i mean @analog obsidian i can still use my current version and just use def...

no

sudden tree Jan 7, 2025, 12:49 AM

#

it seems to have worked

#

what

#

fuck i am at epoch 100 alr

#

why not codename said i could

analog obsidian Jan 7, 2025, 12:49 AM

#

if u want to use default splitting then don't use truncate silence

sudden tree Jan 7, 2025, 12:49 AM

#

what why

#

it shouldnt matter

#

it may try to truncate for me but i alr did

analog obsidian Jan 7, 2025, 12:50 AM

#

sudden tree it shouldnt matter

well the idea of this method is that the chunks are meant to be consistent, rvc old slicing is not consistent so yeah

sudden tree Jan 7, 2025, 12:51 AM

#

i see

analog obsidian Jan 7, 2025, 12:51 AM

#

does not affect quality

sudden tree Jan 7, 2025, 12:51 AM

#

i did it default tho

#

after truncating

analog obsidian Jan 7, 2025, 12:51 AM

#

just means your model is gonna take more epochs

sudden tree Jan 7, 2025, 12:51 AM

#

i see but all the 16k splits are legit 3 seconds anyways

analog obsidian Jan 7, 2025, 12:51 AM

#

the truncate silence method helps rvc to learn the dataset faster

sudden tree Jan 7, 2025, 12:51 AM

#

i see

#

i just dont understand the diff between simple cutting and default

analog obsidian Jan 7, 2025, 12:52 AM

#

there is not an audible difference between this method and the casual old method of automatic slicing anyways

#

only thing that changes is how fast rvc learns the dataset

sudden tree Jan 7, 2025, 12:52 AM

#

i see it just prevents those like 1 second clipped audios?

#

so the epochs for same audio is lower?

analog obsidian Jan 7, 2025, 12:52 AM

#

yuh

#

1 second clips are ass for rvc

#

very bad

sudden tree Jan 7, 2025, 12:53 AM

#

makes sense tbh tho its slicing it fire ngl

#

quality degredation ?

analog obsidian Jan 7, 2025, 12:54 AM

#

sudden tree quality degredation ?

rvc kinda ignores those samples

sudden tree Jan 7, 2025, 12:54 AM

#

oh i didnt know that

#

lmfao

#

i wonder what would happen if you set the time for each to 10 seconds

analog obsidian Jan 7, 2025, 12:54 AM

#

not really ignoring but it separates them from the rest

sudden tree Jan 7, 2025, 12:54 AM

#

i used to use like 7 sec samples

analog obsidian Jan 7, 2025, 12:54 AM

#

so the model learns the dataset even slower

#

since it has to learn two things at the same time

#

instead of 1

sudden tree Jan 7, 2025, 12:55 AM

#

i see

#

why dont we just use 7 sec samples or 10 sec

#

would be faster

analog obsidian Jan 7, 2025, 12:55 AM

#

every 3 sec chunk get paired and every 1 sec chunk gets paired

#

and rvc learns them individually

#

smth like that

#

pepoPray

analog obsidian Jan 7, 2025, 12:56 AM

#

sudden tree why dont we just use 7 sec samples or 10 sec

iirc hifigan only accepts 5 sec max

#

i might be wrong with this tho no idea

sudden tree Jan 7, 2025, 12:56 AM

#

ah so its the new training

#

i remember using 10 sec samples in 2023

analog obsidian Jan 7, 2025, 12:56 AM

#

no u didn't, rvc sliced them

#

trolley

sudden tree Jan 7, 2025, 12:56 AM

#

oh haha

#

troll moment

analog obsidian Jan 7, 2025, 12:57 AM

#

dont worry it will not kill your quality

#

the model will learn the dataset a bit slower

#

but thats really it

#

you can continue using the old slicing method if you wish

sudden tree Jan 7, 2025, 12:58 AM

#

ngl you said truncating wouldnt help but my model seems to be improving way more consistently this time

#

just looking at the loss values in cmd

analog obsidian Jan 7, 2025, 12:58 AM

#

yea because like i told you, it learns it faster

sudden tree Jan 7, 2025, 12:58 AM

#

i see

analog obsidian Jan 7, 2025, 12:58 AM

#

so you notice it sounds good because its learning faster

sudden tree Jan 7, 2025, 12:59 AM

#

just removing the silences = less dead space = faster training

#

makes sense

#

you are really just maximizing the roi

analog obsidian Jan 7, 2025, 12:59 AM

#

u still need silence for training

sudden tree Jan 7, 2025, 12:59 AM

#

just not a lot

analog obsidian Jan 7, 2025, 12:59 AM

#

thats why the setting keeps a bit of it

#

yuh

#

rvc injects 2 silences in your dataset

#

this is bc you have to teach the model to understand what silence is

#

at least thats what noobies told me

glacial pollen Jan 7, 2025, 12:59 AM

#

2 of them is really enough for typical dataset

sudden tree Jan 7, 2025, 1:00 AM

#

haha do you think rmvpe is the ultimate development of this technology?

#

i wonder if rvc can even improve atp

analog obsidian Jan 7, 2025, 1:00 AM

#

sudden tree haha do you think rmvpe is the ultimate development of this technology?

very robust and good

glacial pollen Jan 7, 2025, 1:00 AM

#

definitely not ultimate

#

but so far the best we've got

sudden tree Jan 7, 2025, 1:00 AM

#

the problem is realtime translation

#

it still sounds blocky on my end with large chunk size

analog obsidian Jan 7, 2025, 1:00 AM

#

well realtime performance heavily depends on the dataset

#

singing datasets are bad for speech

sudden tree Jan 7, 2025, 1:00 AM

#

yessir

analog obsidian Jan 7, 2025, 1:01 AM

#

while speech datasets are okayish-mid for singing (it depends)

sudden tree Jan 7, 2025, 1:01 AM

#

we need to be able to develop some agi ig to make the tech flawless

glacial pollen Jan 7, 2025, 1:01 AM

#

analog obsidian while speech datasets are okayish-mid for singing (it depends)

^ There

#

a set thats colorful and vibrant in emotions and pitch can sing well

sudden tree Jan 7, 2025, 1:01 AM

#

yeah i try speech on juice wrld model and it works since he raps haha

sudden tree Jan 7, 2025, 1:01 AM

#

glacial pollen a colorful and vibrant in emotions and pitch set can sing well

lmfao reminds me of alex jones set

glacial pollen Jan 7, 2025, 1:01 AM

#

Any tsundere anime set will do as well

#

lol

sudden tree Jan 7, 2025, 1:01 AM

#

https://tenor.com/view/ahhh-screaming-shout-info-wars-alex-jones-gif-16531120

Tenor

#

https://www.youtube.com/watch?v=vZLpHI87ktI

YouTube

soulnull

[AI] Alex Jones SINGS "I Don't Care Anymore" by Phil Collins

made using RVCv2.

#

what epoch level do yall tend to set the models at

#

like 300 for 15 mins is peak usually?

analog obsidian Jan 7, 2025, 1:03 AM

#

depends on your batch size

#

but real answer: its random

#

u cannot predict it

sudden tree Jan 7, 2025, 1:04 AM

#

yeah makes sense

#

since its practically learning different vocal tunes and aspects

analog obsidian Jan 7, 2025, 1:04 AM

#

sadly old graphs are not accurate enough to show you which epoch to choose since they only tell you the latest value in that specific epoch

sudden tree Jan 7, 2025, 1:04 AM

#

have any of yall looked into onnx model conversion for realtime

#

apparently you can offload to cpu?

analog obsidian Jan 7, 2025, 1:05 AM

#

at least when i tested onnx it degraded my model quality a bit

#

and also on nvidia its slow af

sudden tree Jan 7, 2025, 1:05 AM

#

analog obsidian sadly old graphs are not accurate enough to show you which epoch to choose since...

i just use g/loss ngl

#

total g loss

#

then max the smoothing

analog obsidian Jan 7, 2025, 1:05 AM

#

yea g/total (from 3.2.8) is outdated by now

sudden tree Jan 7, 2025, 1:05 AM

#

wait why is it inaccurate

#

insane how 3.2.8 is alr outdated

analog obsidian Jan 7, 2025, 1:06 AM

#

in simple words, it only tells you the latest value of that specific epoch
this means you might have a better value in another epoch and you'll never know

#

so if ur lowest g/total was 29
u might actually have another low one hidden

#

the new graphs fixed this

sudden tree Jan 7, 2025, 1:06 AM

#

what cant you see on the graph every epoch?

#

i never had that issue

#

i can see all of them on the graph

analog obsidian Jan 7, 2025, 1:07 AM

#

no like you see that if you hover your mouse over a random point it tells you something like "value: 32,5"

#

well that value is one of multiple values in each epoch

#

so 32,5 in the epoch 100 (for example) is just its latest value
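To make the "latest value vs. all values in the epoch" point concrete, here is a small sketch comparing the two ways of logging. The numbers are made up and `summarize_epoch` is a hypothetical helper, not code from RVC or Applio.

```python
from statistics import mean

def summarize_epoch(step_losses):
    """Return (last_step_value, epoch_mean) for one epoch's per-step losses."""
    return step_losses[-1], mean(step_losses)

# Hypothetical per-step g/total values for two epochs (made-up numbers).
epochs = {
    100: [29.0, 35.5, 31.0, 32.5],
    101: [34.0, 33.0, 36.0, 30.0],
}

for epoch, losses in epochs.items():
    last, avg = summarize_epoch(losses)
    print(f"epoch {epoch}: last step = {last:.2f}, epoch mean = {avg:.2f}")
```

With these numbers the last-step value makes epoch 101 look better, while the per-epoch mean favours epoch 100, so the two logging styles can disagree about which epoch to pick.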

sudden tree Jan 7, 2025, 1:07 AM

#

i see

#

didnt know that

#

is 3.2.8 really outdated?

#

i just installed it in the last 2 weeks

analog obsidian Jan 7, 2025, 1:08 AM

#

very outdated

#

new applio has a new optimizer

sudden tree Jan 7, 2025, 1:08 AM

#

damn just no updates to the package installer huh

simple ore Jan 7, 2025, 1:08 AM

#

i would not call it outdated

#

it is the official release

sudden tree Jan 7, 2025, 1:08 AM

#

so just zip the github repo and unzip in the same folder and replace all folders?

analog obsidian Jan 7, 2025, 1:08 AM

#

yuh

#

no need to reinstall the env

sudden tree Jan 7, 2025, 1:09 AM

#

rip to all my premade model logs lmfao

analog obsidian Jan 7, 2025, 1:09 AM

#

F

simple ore Jan 7, 2025, 1:11 AM

#

there's some cleanup in progress, so your stuff may break

analog obsidian Jan 7, 2025, 1:11 AM

#

oh damn

#

nails

simple ore Jan 7, 2025, 1:12 AM

#

like need to update filelist.txt and replace "mute/v2_" with "mute/"

analog obsidian Jan 7, 2025, 1:12 AM

#

oo

simple ore Jan 7, 2025, 1:12 AM

#

and need to add soxr using env/python -m pip install soxr
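The filelist.txt change mentioned above (replacing "mute/v2_" with "mute/") can be scripted. This is a minimal sketch, assuming filelist.txt is a plain UTF-8 text file; `patch_filelist` is a hypothetical helper, and where the file lives depends on your setup, so the path is left for you to fill in. Keep a copy of the file before running it.

```python
from pathlib import Path

def patch_filelist(path, old="mute/v2_", new="mute/"):
    """Rewrite a filelist in place and return how many lines changed."""
    p = Path(path)
    lines = p.read_text(encoding="utf-8").splitlines(keepends=True)
    patched = [line.replace(old, new) for line in lines]
    changed = sum(a != b for a, b in zip(lines, patched))
    if changed:
        # Only touch the file when something actually needs replacing.
        p.write_text("".join(patched), encoding="utf-8")
    return changed
```

Calling `patch_filelist("path/to/your/filelist.txt")` returns the number of lines it rewrote; a second run should return 0. The soxr step is the separate command already given in the chat.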

sudden tree Jan 7, 2025, 1:15 AM

#

shit cant i just pip install entire project haha

simple ore Jan 7, 2025, 1:16 AM

#

waste of time

sudden tree Jan 7, 2025, 3:16 AM

#

does anyone else have the issue when converting vocals?

#

its like the converted file is longer than input causing the vocals to be off beat

hallow thistle Jan 7, 2025, 3:22 AM

#

What a waste of time.

#

analog obsidian Jan 7, 2025, 3:26 AM

#

sudden tree its like the converted file is longer than input causing the vocals to be off be...

enable split audio option in applio's inference

sudden tree Jan 7, 2025, 3:54 AM

#

wait is g/loss/total reliable

#

apparently the smoothed one starts consistently increasing at 14k steps

#

but the higher steps still sound vastly better

#

like 20k even sounds amazing

#

it really sounds more real

analog obsidian Jan 7, 2025, 3:55 AM

#

sudden tree wait is g/loss/total reliable

g/total is the combined generator loss, it adds up mel, fm and kl (plus the adversarial gen loss)

sudden tree Jan 7, 2025, 3:55 AM

#

is mel better?

#

keep training?

analog obsidian Jan 7, 2025, 3:56 AM

#

sudden tree is mel better?

mel is the clarity of your model, this metric keeps improving the longer you train, which is why you feel it sounds more real

sudden tree Jan 7, 2025, 3:56 AM

#

hmmmm didnt know that

analog obsidian Jan 7, 2025, 3:56 AM

#

but your g/total graph stopped improving the moment it started rising

sudden tree Jan 7, 2025, 3:56 AM

#

what would you do w this

analog obsidian Jan 7, 2025, 3:57 AM

#

sudden tree what would you do w this

i choose generalization over everything (g/total) so i always use the lowest point in g/total before overtraining
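Picking "the lowest point in g/total" can be sketched like this. The smoothing here is a debiased exponential moving average, which is my assumption of how the graph's smoothing slider behaves; `ema_smooth` and `lowest_point` are hypothetical helpers and the loss values are made up.

```python
def ema_smooth(values, weight=0.9):
    """Debiased exponential moving average; weight must be in [0, 1)."""
    smoothed, last = [], 0.0
    for i, v in enumerate(values, start=1):
        last = last * weight + (1.0 - weight) * v
        # Divide by the debias term so early points are not pulled toward 0.
        smoothed.append(last / (1.0 - weight ** i))
    return smoothed

def lowest_point(values, weight=0.9):
    """Return (index, smoothed_value) of the minimum of the smoothed curve."""
    smoothed = ema_smooth(values, weight)
    idx = min(range(len(smoothed)), key=smoothed.__getitem__)
    return idx, smoothed[idx]

# Made-up g/total values, one per logged point.
g_total = [40.0, 35.0, 31.0, 29.5, 30.0, 30.5, 31.5, 33.0]
idx, value = lowest_point(g_total, weight=0.6)
print(f"lowest smoothed g/total at index {idx}: {value:.2f}")
```

Heavier smoothing lags behind the raw curve, so the index of the smoothed minimum may not line up with the lowest raw value.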

sudden tree Jan 7, 2025, 3:57 AM

#

is generalization just the ability to be applied in any scenario?

analog obsidian Jan 7, 2025, 3:57 AM

#

sudden tree is generalization just the ability to be applied in any scenario?

generalization is the ability of the model to generate new audio

#

overtrained epochs have distorted frequencies and other bad stuff

sudden tree Jan 7, 2025, 3:58 AM

#

because it seriously sounds much much more accurate to juice wrld at even 30k steps

#

even though it started rising at 14k

analog obsidian Jan 7, 2025, 3:58 AM

#

yuh because mel is still improving

#

so the spectrogram is clearer

#

which gives the feel the model is more realistic

sudden tree Jan 7, 2025, 3:59 AM

#

i see, why does everyone use generalization when clarity is much more important for realism

knotty moth Jan 7, 2025, 3:59 AM

#

mel is more likely to keep going down even more than 1k epochs

sudden tree Jan 7, 2025, 4:00 AM

#

so generalization = more flexibility

analog obsidian Jan 7, 2025, 4:00 AM

#

sudden tree i see, why does everyone use generalization when clarity is much more important ...

because a model that can generalize well sounds good no matter the audio u give to it

sudden tree Jan 7, 2025, 4:00 AM

#

i see that makes sense

analog obsidian Jan 7, 2025, 4:01 AM

#

but anyways like i said before the old graphs only log the last step of the epoch, we can't tell if your model is actually overtrained or not since the graph is inaccurate

sudden tree Jan 7, 2025, 4:01 AM

#

so at .999 smooth the graph starts increasing at 240 epochs but the lowest recorded loss was at 270

analog obsidian Jan 7, 2025, 4:01 AM

#

the new ones are like this

sudden tree Jan 7, 2025, 4:01 AM

#

should i use 280 epoch or 240 as the base

knotty moth Jan 7, 2025, 4:01 AM

#

sudden tree wait is g/loss/total reliable

the applio main branch and codename's fork use more accurate loss values, which are the average of each epoch instead of the epoch's last step (which is what the mainline rvc versions log)

analog obsidian Jan 7, 2025, 4:02 AM

#

analog obsidian the new ones are like this

and i can confirm every epoch in the rising zone sounds like shit

sudden tree Jan 7, 2025, 4:02 AM

#

knotty moth the applio main branch and codename's fork use more accurate loss values which a...

so if i update will i be able to see the averages or will need to retrain?

analog obsidian Jan 7, 2025, 4:02 AM

#

i believe your model is not overtrained and the g/total is just fluctuating

#

but we will never know

#

the log is inaccurate

sudden tree Jan 7, 2025, 4:03 AM

#

oh really?

analog obsidian Jan 7, 2025, 4:03 AM

#

yuh

#

no way to tell

#

besides hearing them

#

thats how it used to be before

sudden tree Jan 7, 2025, 4:03 AM

#

wow so i should just take whichever sounds best then since its inaccurate

knotty moth Jan 7, 2025, 4:03 AM

#

sudden tree so if i update will i be able to see the averages or will need to retrain?

you cant change the logged values unless you start over training

sudden tree Jan 7, 2025, 4:03 AM

#

makes sense

analog obsidian Jan 7, 2025, 4:03 AM

#

yeah

#

choose the one u like the most since the graphs will not help at all

sudden tree Jan 7, 2025, 4:03 AM

#

this is the graph for reference @knotty moth

#

its 15 min training data batch size 8
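Since the chat keeps switching between steps and epochs, here is a small conversion sketch. It assumes one step is one batch and that an epoch covers every segment once; the segment count of 480 is a made-up number, not something read from this dataset, and both function names are hypothetical.

```python
import math

def steps_per_epoch(num_segments, batch_size):
    """Number of batches needed to see every segment once."""
    return math.ceil(num_segments / batch_size)

def step_to_epoch(step, num_segments, batch_size):
    """Epoch (1-based) that a given global step falls into."""
    return math.ceil(step / steps_per_epoch(num_segments, batch_size))

# Hypothetical dataset: 480 sliced segments, batch size 8.
segments, batch = 480, 8
print("steps per epoch:", steps_per_epoch(segments, batch))
for step in (14_000, 20_000, 30_000):
    print(f"step {step} is in epoch {step_to_epoch(step, segments, batch)}")
```

A larger batch size means fewer steps per epoch, which is one reason an epoch count that works for one setup does not transfer directly to another.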

analog obsidian Jan 7, 2025, 4:04 AM

#

i remember back then i used to choose an epoch based on the mel graph

#

i felt it was a bit more reliable than g/total

sudden tree Jan 7, 2025, 4:04 AM

#

yeah it is honestly sounding like 30k steps is the best

#

500 epochs sounds a bit overtrained ngl

#

idk tho

#

it would theoretically make sense to choose the mel graph if converting rap vocals to rap vocals imo

analog obsidian Jan 7, 2025, 4:05 AM

#

a rap model will always be good at inferencing rap songs regardless

#

its literally what its made for

sudden tree Jan 7, 2025, 4:06 AM

#

true haha

#

yeah i am getting robotic noises at 30k steps

#

thats overtraining correct?

analog obsidian Jan 7, 2025, 4:06 AM

#

yeah

#

robotic sounds happen when the model is overtrained

#

as long as the model is not robotic, its fine

sudden tree Jan 7, 2025, 4:06 AM

#

imma just download a conversion and compare

#

haha

#

ill update because the new g total is accurate right?

analog obsidian Jan 7, 2025, 4:07 AM

#

overtraining is pretty easy to spot, literally if it sounds robotic, its overtrained

analog obsidian Jan 7, 2025, 4:07 AM

#

sudden tree ill update because the new g total is accurate right?

yeah new g/total is accurate

#

every new graph is reliable now

#

u can trust them

sudden tree Jan 7, 2025, 4:07 AM

#

alright sounds great thank you

#

i will see how to update

astral jungle Jan 7, 2025, 4:32 AM

#

-train

azure marshBOT Jan 7, 2025, 4:32 AM

#

astral jungle -train

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.