covert anchor Nov 30, 2024, 9:49 AM

#

only train button

rare gobletBOT Nov 30, 2024, 9:49 AM

#

Ayo? @covert anchor level 3 !!! lfg

knotty moth Nov 30, 2024, 9:49 AM

#

sorry no idea cuz I'm not using the latest applio

covert anchor Nov 30, 2024, 9:50 AM

#

oh :c

knotty moth Nov 30, 2024, 9:50 AM

#

btw check the model name, should be only alphanumeric without spaces as safe bet

covert anchor Nov 30, 2024, 9:50 AM

#

well, dw, you tried and i apreciate that

#

this is the name

#

it's a champion from league, on spanish

#

the name of the dataset, "mezcla" means mix

#

and that's all

#

probably im just gonna wait untill tomorrow and get help of a friend that uses the latest version of applio

#

anyways, thx!

simple ore Nov 30, 2024, 10:16 AM

#

for 10+ hours, sure

#

slice the audio. You cant train with one 9 min file unsliced.

covert anchor Nov 30, 2024, 10:18 AM

#

simple ore slice the audio. You cant train with one 9 min file unsliced.

what, why?

simple ore Nov 30, 2024, 10:20 AM

#

because that's how it works. Either you slice the file youself in using 3-5s slices, or you let Applio slice it in preprocess

covert anchor Nov 30, 2024, 10:23 AM

#

how do i let applio do it?

#

i will do it tomorrow btw but i want to know

simple ore Nov 30, 2024, 10:47 AM

#

covert anchor how do i let applio do it?

dont uncheck the box in preprocess

#

covert anchor Nov 30, 2024, 10:47 AM

#

okay okay

#

i will try it tomorrow

#

thk so much ❤️

boreal sluice Nov 30, 2024, 12:13 PM

#

does anybody knows how to use kaggle?

#

the modified version?

flint solar Nov 30, 2024, 1:32 PM

#

boreal sluice does anybody knows how to use kaggle?

there is a guide

flint solar Nov 30, 2024, 1:32 PM

#

boreal sluice does anybody knows how to use kaggle?

https://rentry.co/RVC-Mainline-Kaggle

boreal sluice Nov 30, 2024, 1:46 PM

#

flint solar https://rentry.co/RVC-Mainline-Kaggle

yhanks so muchh

rare gobletBOT Nov 30, 2024, 1:46 PM

#

Ayo? @boreal sluice level 1 !!! lfg

dusty bone Nov 30, 2024, 2:42 PM

#

whenever i use any voice it rlly just cracks alot lol

#

is there a fix or is my mic the problem

knotty moth Nov 30, 2024, 3:30 PM

#

dusty bone whenever i use any voice it rlly just cracks alot lol

try not to infer doubled vocals

gritty zinc Nov 30, 2024, 3:52 PM

#

What is best ai changer for pc

#

Tried voice.ai

#

Crash alot

odd shale Nov 30, 2024, 3:54 PM

#

gritty zinc Crash alot

I would suggest Deiteris' w-okada fork

odd shale Nov 30, 2024, 3:55 PM

#

gritty zinc Tried voice.ai

https://rentry.co/ForkVoiceChangerGuide

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update October 6th, 2024: Multi PC setup explanation added
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVo...

#

It's kinda better than the OG version.

#

And if you ask, nope, we can't give support nor troubleshooting for voice.ai issues

odd shale Nov 30, 2024, 3:56 PM

#

dusty bone whenever i use any voice it rlly just cracks alot lol

The source of the problem can be either 2 of these things: Or the model you're using wasn't properly made/trained or it simply doesn't fit your voice.

gritty zinc Nov 30, 2024, 3:56 PM

#

Make sense

#

Ty

knotty moth Nov 30, 2024, 3:57 PM

#

gritty zinc Crash alot

gritty zinc Nov 30, 2024, 3:57 PM

#

Fr

#

Agree

#

Badly

odd shale Nov 30, 2024, 4:01 PM

#

gritty zinc Ty

You're welcome.

#

Tiny fact: The models made with tiny/moderated-size dataset prolly won't perform properly on W-Okada nor Voice.ai

#

Tho there can be tiny/lucky exceptions

gritty zinc Nov 30, 2024, 4:03 PM

#

Make sense

rare gobletBOT Nov 30, 2024, 4:03 PM

#

Ayo? @gritty zinc level 1 !!! lfg

gritty zinc Nov 30, 2024, 4:03 PM

#

So this One work on everything?

odd shale Nov 30, 2024, 4:03 PM

#

gritty zinc So this One work on everything?

Yep, it should.

#

Deiteris fork is kinda more optimized than others.

#

Also it's matter of playing around with settings and reading the guide.

gritty zinc Nov 30, 2024, 5:05 PM

#

@odd shale

#

i want use rick voice

#

soooo

#

#

hm

odd shale Nov 30, 2024, 5:06 PM

#

gritty zinc hm

Read the docs i gave you above.

#

I'm not sure which version of W-Okada you're using

#

https://rentry.co/ForkVoiceChangerGuide

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update October 6th, 2024: Multi PC setup explanation added
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVo...

gritty zinc Nov 30, 2024, 5:06 PM

#

oh yh ty

pure pecan Nov 30, 2024, 5:15 PM

#

I have f5-tts installed, how do I put weighted voices on it? it only seems to let me upload audio samples, and not .zip of weighted

dusty bone Nov 30, 2024, 5:20 PM

#

odd shale The source of the problem can be either 2 of these things: Or the model you're u...

thank u! Im guessing it must be voice

#

i do have a deeper voice

#

soo ill find smth that fits more

low shard Nov 30, 2024, 5:21 PM

#

pure pecan I have f5-tts installed, how do I put weighted voices on it? it only seems to le...

F5 tts is only 0shot

pure pecan Nov 30, 2024, 5:22 PM

#

low shard F5 tts is only 0shot

Sorry, what is 0shot?

low shard Nov 30, 2024, 5:22 PM

#

pure pecan Sorry, what is 0shot?

0shot = no actual training needed, just an audio file to work, inferior in terms of quality to few-shots
few-shots = training needed, an example is GPT-SoVITS (TTS) & RVC (STS)

#

You can't upload RVC models to GPT-SoVITS nor F5 TTS

pure pecan Nov 30, 2024, 5:23 PM

#

huh, could have sworn I installed a version of f5-tts that could do training based on the github tutorial, oh well, thanks fo the information

low shard Nov 30, 2024, 5:23 PM

#

RVC models can be used only in programs that use RVC, such as W-okada

low shard Nov 30, 2024, 5:24 PM

#

pure pecan huh, could have sworn I installed a version of f5-tts that could do training bas...

The training is related to the model who does 0shot, not actual single model training

gritty zinc Nov 30, 2024, 5:24 PM

#

hm

low shard Nov 30, 2024, 5:24 PM

#

You can train the big model that's being used for 0shot, not train every voice you want to a model in F5 tts

gritty zinc Nov 30, 2024, 5:25 PM

#

low shard You can train the big model that's being used for 0shot, not train every voice y...

can i smh link it to disc?

low shard Nov 30, 2024, 5:28 PM

#

gritty zinc can i smh link it to disc?

You mean as a realtime voice changer for calls? you could technically do that for any TTS

Google Docs

AIs for TTS

Table Of Contents Introduction Index of the best TTS 1. ElevenLabs/11Labs: 2. GPT-SoVITS: 3. Fish Speech: 4. F5 TTS: 5. Edge TTS: 6. StyleTTS2: 7. XTTS2: 8. OpenVoice v2: 9. MeloTTS: Use TTS in Realtime on calls (ONLY PC) Introduction TTS Means Text To Speech! Inference means when you use the...

#

But it's not really realtime

#

You'd have to type the words and let the audio play so

gritty zinc Nov 30, 2024, 5:28 PM

#

eh beter than nothing

low shard Nov 30, 2024, 5:28 PM

#

gritty zinc eh beter than nothing

There's actually Wokada, which uses RVC models in realtime

#

So it's Speech To Speech, rather than Text To Speech

#

What's your pc gpu?

gritty zinc Nov 30, 2024, 5:29 PM

#

this is too complicated for my dump ass i just want act in vc for online class

#

thought it would be good idea

#

prob no

low shard Nov 30, 2024, 5:29 PM

#

gritty zinc this is too complicated for my dump ass i just want act in vc for online class

It's AI

#

Open Source AI

low shard Nov 30, 2024, 5:30 PM

#

gritty zinc prob no

There's guides

#

Leo gave you the wokada guide above

#

which has everything you need to know

gritty zinc Nov 30, 2024, 5:30 PM

#

yes buttt

rare gobletBOT Nov 30, 2024, 5:30 PM

#

Ayo? @gritty zinc level 2 !!! lfg

gritty zinc Nov 30, 2024, 5:31 PM

#

it kinda NOT what im looking for excatly

low shard Nov 30, 2024, 5:31 PM

#

gritty zinc it kinda NOT what im looking for excatly

what are u looking for exactly

gritty zinc Nov 30, 2024, 5:33 PM

#

low shard what are u looking for exactly

smt like the text speech thing

#

i use that prob

low shard Nov 30, 2024, 5:36 PM

#

gritty zinc smt like the text speech thing

Yea you can install any TTS in the guide and use it for calls

Really depends tho
If you want generic voices and easy: edge tts
If you want custom voices and easy: F5 TTS or Fish Speech
If you want custom voices and best quality (but more complex): gpt-sovits

#

the process for using each of those in calls is kind of the same

#

Also, depends if your pc gpu is good enough tho

gritty zinc Nov 30, 2024, 5:39 PM

#

alr tyy

true ravine Nov 30, 2024, 6:01 PM

#

pls help

rare gobletBOT Nov 30, 2024, 6:01 PM

#

Ayo? @true ravine level 1 !!! lfg

pure pecan Nov 30, 2024, 6:13 PM

#

ok I downloaded RVC off of pinokio, what tab and where do I put the weighted file?

rare gobletBOT Nov 30, 2024, 6:13 PM

#

Ayo? @pure pecan level 1 !!! lfg

true ravine Nov 30, 2024, 6:14 PM

#

why did you guys get ai voice changer

#

whats the point of this

pure pecan Nov 30, 2024, 6:25 PM

#

wait, is RVC only for singing, I thought it was a text to speech

timid valve Nov 30, 2024, 6:27 PM

#

it's speech to speech actually

#

so that includes singing

pure pecan Nov 30, 2024, 6:30 PM

#

well I founded a folder called weights, and put my model in there, but when im in the rvc ui, I don't see any option to select it.

pure pecan Nov 30, 2024, 6:49 PM

#

nvm figrued it out, had to take the pth file out

#

just wish i could use tts instead of voice...

#

I want to script something and have it read it all out

#

blah i just keep getting errors trying to convert voice, oh well, thanks anyway

pure pecan Nov 30, 2024, 7:12 PM

#

what am i doing wrong

#

well crepe woked

rare gobletBOT Nov 30, 2024, 7:14 PM

#

Ayo? @pure pecan level 2 !!! lfg

pure pecan Nov 30, 2024, 7:14 PM

#

sounds awful with my voice though

brittle wing Nov 30, 2024, 7:20 PM

#

pure pecan what am i doing wrong

What is that

pure pecan Nov 30, 2024, 7:20 PM

#

brittle wing What is that

rvc

brittle wing Nov 30, 2024, 7:20 PM

#

Colab or local

pure pecan Nov 30, 2024, 7:20 PM

#

local

true ravine Nov 30, 2024, 7:23 PM

#

Hello

#

How to hide app?

#

crude flame Nov 30, 2024, 7:24 PM

#

true ravine How to hide app?

windows button + tab then create a new desktop and use that

pure pecan Nov 30, 2024, 7:33 PM

#

maybe the model was just bad, tried another one and it's really good (1000 epocs hatsune miku)

#

seems to not work with recordings that are longer than a minute

#

and rmvpe doesnt work at all

#

yeah i think im gonna need some 1 on 1 help in call or something i just dont get what im doing

neon hemlock Nov 30, 2024, 7:49 PM

#

hirari

pure pecan Nov 30, 2024, 7:53 PM

#

yeah?

blazing crane Nov 30, 2024, 8:19 PM

#

Need help
No errors, just the RVC bugs out and stops whenever i try to train
INFO:torch.nn.parallel.distributed:Reducer buckets have been rebuilt in this iteration

rare gobletBOT Nov 30, 2024, 8:19 PM

#

Ayo? @blazing crane level 1 !!! lfg

blazing crane Nov 30, 2024, 8:20 PM

#

Using Ov2 pretrain at 40k

brittle wing Nov 30, 2024, 8:20 PM

#

Last time, which one is the best alternative for denoising?

blazing crane Nov 30, 2024, 8:20 PM

#

blazing crane Need help No errors, just the RVC bugs out and stops whenever i try to train INF...

pure pecan Nov 30, 2024, 8:21 PM

#

neon hemlock hirari

yes?

blazing crane Nov 30, 2024, 8:21 PM

#

blazing crane Need help No errors, just the RVC bugs out and stops whenever i try to train INF...

brittle wing Nov 30, 2024, 8:22 PM

#

-colab

azure marshBOT Nov 30, 2024, 8:22 PM

#

brittle wing -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

brittle wing Nov 30, 2024, 8:22 PM

#

brittle wing Last time, which one is the best alternative for denoising?

@crude flame

crude flame Nov 30, 2024, 8:23 PM

#

brittle wing <@673327878288703519>

2

brittle wing Nov 30, 2024, 8:24 PM

#

crude flame 2

I didn't put a Number which one.

#

The one in the colab.???

pure pecan Nov 30, 2024, 8:24 PM

#

crude flame 2

what is 2 bro speak up since u at least getting help

crude flame Nov 30, 2024, 8:24 PM

#

oh

#

misread

#

i thought it was several images

brittle wing Nov 30, 2024, 8:26 PM

#

crude flame i thought it was several images

Which one to be exact

#

It's a collage...

#

Yh

#

Just asking

crude flame Nov 30, 2024, 8:27 PM

#

for colab melband it is the second one for mvsep you can use either standard or aggressive for denoise by aufr33 and xminus you can use either of the melband

brittle wing Nov 30, 2024, 8:27 PM

#

crude flame for colab melband it is the second one for mvsep you can use either standard or ...

What's the best option among all MelBand Roformers

#

#

Cause results are different everywhere

crude flame Nov 30, 2024, 8:28 PM

#

the first two are the same (mvsep and colab) and idk about xminus

brittle wing Nov 30, 2024, 8:28 PM

#

crude flame the first two are the same (mvsep and colab) and idk about xminus

They're the same?

crude flame Nov 30, 2024, 8:28 PM

#

just use the one that sounds best

brittle wing Nov 30, 2024, 8:28 PM

#

Which one is it

brittle wing Nov 30, 2024, 8:29 PM

#

crude flame the first two are the same (mvsep and colab) and idk about xminus

Number 1 or two in the Colab

crude flame Nov 30, 2024, 8:29 PM

#

2

brittle wing Nov 30, 2024, 8:29 PM

#

I don't wanna waste time or GPU

brittle wing Nov 30, 2024, 8:29 PM

#

crude flame 2

THIS???

crude flame Nov 30, 2024, 8:29 PM

#

yes

#

thats the second one

#

thats the one with a 2

brittle wing Nov 30, 2024, 8:30 PM

#

crude flame thats the second one

The one I circled

#

In the Colab?

crude flame Nov 30, 2024, 8:30 PM

#

yes

brittle wing Nov 30, 2024, 8:30 PM

#

With high Chunk size and overlap 16?

crude flame Nov 30, 2024, 8:30 PM

#

thats fine

brittle wing Nov 30, 2024, 8:31 PM

#

Is this dereverb model okay for future datasets?

#

Should I use overlap 8 or 16?

covert anchor Nov 30, 2024, 8:55 PM

#

simple ore dont uncheck the box in preprocess

tsm it worked ❤️

blazing crane Nov 30, 2024, 8:57 PM

#

blazing crane

could i get some help please?

marsh schooner Nov 30, 2024, 8:59 PM

#

what do u think about this?

simple ore Nov 30, 2024, 9:00 PM

#

1hr+ - 8

#

<1hr - 4

#

10 hr+ - 16

#

something like that

#

more data makes a batch more uniform

covert anchor Nov 30, 2024, 9:02 PM

#

now i have another problem

#

i train too slow, idk why

#

52 sec every epoch

marsh schooner Nov 30, 2024, 9:02 PM

#

simple ore <1hr - 4

so if its less than 1 hour 4 right?

simple ore Nov 30, 2024, 9:03 PM

#

marsh schooner so if its less than 1 hour 4 right?

it really depends on the variety of the dataset, if there's a different content using larger batch kinda levels all outliers

#

with smaller batch those outliers have more noticeable effeect

marsh schooner Nov 30, 2024, 9:04 PM

#

simple ore it really depends on the variety of the dataset, if there's a different content ...

what if there was like a 2-8 hour data set i would still use 8 right?

simple ore Nov 30, 2024, 9:04 PM

#

sure you can try batch 8 for 30m+

marsh schooner Nov 30, 2024, 9:05 PM

#

simple ore sure you can try batch 8 for 30m+

do they affect quality at all?

#

or is it just the time

simple ore Nov 30, 2024, 9:05 PM

#

it does affect the training result

#

the model is trying to find optimal parameters to generate a voice, with different batches it may overshoot or undershoot the target or circle around local minima

marsh schooner Nov 30, 2024, 9:06 PM

#

oh dang should i try to reatrain then bc yesterday i made like a 22 min data set on 8 batch size and it doesnt sound like the voice i was making but it sounds realistic

simple ore Nov 30, 2024, 9:07 PM

#

https://pytorch-optimizers.readthedocs.io/en/latest/visualization/

#

just to give an idea

covert anchor Nov 30, 2024, 9:08 PM

#

there's any way i can improve the speed on the training?

simple ore Nov 30, 2024, 9:08 PM

#

covert anchor there's any way i can improve the speed on the training?

what's your gpu and dataset size?

covert anchor Nov 30, 2024, 9:08 PM

#

i should be training at 2 or 3 secs on every epoch, or at least i think i should

covert anchor Nov 30, 2024, 9:08 PM

#

simple ore what's your gpu and dataset size?

rtx 4060

marsh schooner Nov 30, 2024, 9:08 PM

#

simple ore https://pytorch-optimizers.readthedocs.io/en/latest/visualization/

no clue on wwhat these pics r

simple ore Nov 30, 2024, 9:08 PM

#

it was training 2-3sec/epoch because you had an empty set lol

marsh schooner Nov 30, 2024, 9:08 PM

#

covert anchor rtx 4060

no u shouldnt

#

lol

covert anchor Nov 30, 2024, 9:09 PM

#

simple ore it was training 2-3sec/epoch because you had an empty set lol

empty set?

simple ore Nov 30, 2024, 9:09 PM

#

with 2 mute files and discarded 9 min audio

simple ore Nov 30, 2024, 9:09 PM

#

marsh schooner no clue on wwhat these pics r

it is an example how the model looks for the optimal solution (the peak in the middle) doing small incremental steps towards it

covert anchor Nov 30, 2024, 9:10 PM

#

simple ore with 2 mute files and discarded 9 min audio

discarded? im not understanding, idk if it was for me

simple ore Nov 30, 2024, 9:11 PM

#

covert anchor discarded? im not understanding, idk if it was for me

when you tried to use unsliced 9 min file, it was not used at all

#

but now that you've sliced it the training actually uses it and for 9 min file and 4060 52/s epoch is a good speed

marsh schooner Nov 30, 2024, 9:12 PM

#

simple ore it is an example how the model looks for the optimal solution (the peak in the m...

this better?

covert anchor Nov 30, 2024, 9:12 PM

#

oh

rare gobletBOT Nov 30, 2024, 9:12 PM

#

Ayo? @marsh schooner level 4 !!! lfg

covert anchor Nov 30, 2024, 9:12 PM

#

okay okay

#

thx again, i will train at that speed and check if it was good

marsh schooner Nov 30, 2024, 9:31 PM

#

the goal is to get it as far down as possible right?

#

the further down means more realism most likely

#

and is it fine going backwards like this?

modern surge Nov 30, 2024, 9:49 PM

#

-rvc

azure marshBOT Nov 30, 2024, 9:49 PM

#

modern surge -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

modern surge Nov 30, 2024, 9:49 PM

#

-colab

azure marshBOT Nov 30, 2024, 9:49 PM

#

modern surge -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

simple ore Nov 30, 2024, 9:51 PM

#

marsh schooner and is it fine going backwards like this?

that's restarting from some previous saved set of weights

simple ore Nov 30, 2024, 9:52 PM

#

marsh schooner the further down means more realism most likely

g total is a summof feature match (most important), mel loss (medium), and kl

#

mel loss is how close the spectrogram of the generated audio is close to the original

#

less is better

#

fm is more complex measure, generally it should go down or stay somewhat stable

marsh schooner Nov 30, 2024, 10:02 PM

#

simple ore g total is a summof feature match (most important), mel loss (medium), and kl

so if i trained something and its too high from my liking is there something i can change in my training settings to get it to sound closer to the original

sharp rampart Nov 30, 2024, 10:25 PM

#

hey does rvc work in ableton or any daw for real time or no

#

like can you talk through rvc and get the outputs /inputs into your daw like fl studio, ableton, etc in realtime with no delayt

#

or is that simply not possible

marsh schooner Nov 30, 2024, 10:26 PM

#

sharp rampart like can you talk through rvc and get the outputs /inputs into your daw like fl ...

do u mean rvc okada aka the voice changere

sharp rampart Nov 30, 2024, 10:26 PM

#

this

rare gobletBOT Nov 30, 2024, 10:26 PM

#

Ayo? @sharp rampart level 1 !!! lfg

sharp rampart Nov 30, 2024, 10:26 PM

#

or do i have the wrong thing to do it

marsh schooner Nov 30, 2024, 10:26 PM

#

that i have no clue i use the voice changer i thought that was just text to speech though

sharp rampart Nov 30, 2024, 10:27 PM

#

oh thats probably what thats for then

#

let me go check

marsh schooner Nov 30, 2024, 10:28 PM

#

sharp rampart oh thats probably what thats for then

theres a voice changer for real time called okada but it impossible not to have delay but u could still have full convos with ppl using it and u dont need a crazy gpu

#

i run mine using my 3060 ti

sharp rampart Nov 30, 2024, 10:28 PM

#

i need it to have no delay so i can make music with it

#

thats not possible yet is it?

marsh schooner Nov 30, 2024, 10:28 PM

#

i have no clue i dont do music ai

sharp rampart Nov 30, 2024, 10:28 PM

#

i got a 40 series

#

ok

marsh schooner Nov 30, 2024, 10:31 PM

#

marsh schooner so if i trained something and its too high from my liking is there something i c...

bc im trying to get my voice to sound closer to the original is there a setting that helps that or no?

unique rock Nov 30, 2024, 10:58 PM

#

For how many minutes of dataset can I use this batch size without the colab disconnecting and for how many times so that the model does not sound bad?

marsh schooner Nov 30, 2024, 11:20 PM

#

isnt this slow for a 4090 22-25 min dataset

steady geyser Nov 30, 2024, 11:29 PM

#

Where do i get a virtual mic

simple ore Dec 1, 2024, 12:16 AM

#

marsh schooner isnt this slow for a 4090 22-25 min dataset

33s per 25 min set, about right?

chilly ridge Dec 1, 2024, 12:27 AM

#

Hello everyone! I would need the advice of someone who knows about RVC please, I can't find what I'm looking for on the forums or it's old (or I don't know how to search). I'm looking to train an rvc model, the problem is that I only have 2min of audio in wav files of about 2s. So I've tried 50, 100, 300 Epoch, changing the batch size... I still have a "robotic" voice and all the Ss or CHs are "metallic". Is it possible to train my model with all these wav files, or is it better to increase the size of my dataset? All my audio files are studio quality, without reverb ect ect... Thank you!

#

I run it locally on a quadro RTX 4000 Ada 8Go

analog obsidian Dec 1, 2024, 12:29 AM

#

chilly ridge Hello everyone! I would need the advice of someone who knows about RVC please, I...

increase the size of the dataset that fixes the robotic S, ch

#

it does not really fix them but makes them appear way less

kindred swallow Dec 1, 2024, 12:29 AM

#

"frequent errors occured" anyone know how to fix

chilly ridge Dec 1, 2024, 12:30 AM

#

analog obsidian increase the size of the dataset that fixes the robotic S, ch

Do you think it's better to concatenate all my files in one 2min wav file ?

analog obsidian Dec 1, 2024, 12:31 AM

#

chilly ridge Do you think it's better to concatenate all my files in one 2min wav file ?

never let rvc to slice full wav files for you, is bad
ideally every sample should be 3s-5s long

#

use audacity audio labelling

chilly ridge Dec 1, 2024, 12:32 AM

#

All my files is around 2s long, it's better to increase manually the duration ?

#

I can make a python script for that, 3s of full audio, without silence

#

3s average

analog obsidian Dec 1, 2024, 12:33 AM

#

chilly ridge I can make a python script for that, 3s of full audio, without silence

is not going to help because your dataset is just 2 minutes of audio

chilly ridge Dec 1, 2024, 12:33 AM

#

Okay thx

analog obsidian Dec 1, 2024, 12:33 AM

#

10 minutes is bare minimum for okay perfomance
models hit the "realistic" tone at around 40 minutes

chilly ridge Dec 1, 2024, 12:40 AM

#

I have a another model with 8min of audio also in 2s wave file, but it's a really "specific" voice, it's really glitched when i use it. (It's Pat from Mickey in french), even if i try to sound like the original voice to help rvc it's the same. But if i set the pitch to -12 it's super clean, but, tooooooo deep

chilly ridge Dec 1, 2024, 12:40 AM

#

analog obsidian 10 minutes is bare minimum for okay perfomance models hit the "realistic" tone a...

I'll try 10min audio

oak inlet Dec 1, 2024, 12:40 AM

#

yii

analog obsidian Dec 1, 2024, 12:40 AM

#

chilly ridge I have a another model with 8min of audio also in 2s wave file, but it's a reall...

use batch size 4
train for not longer than 200 epochs

oak inlet Dec 1, 2024, 12:40 AM

#

yoo

#

wsg

chilly ridge Dec 1, 2024, 12:41 AM

#

analog obsidian use batch size 4 train for not longer than 200 epochs

And increase the dataset size ?

rare gobletBOT Dec 1, 2024, 12:41 AM

#

Ayo? @chilly ridge level 2 !!! lfg

analog obsidian Dec 1, 2024, 12:42 AM

#

chilly ridge And increase the dataset size ?

yes also when a voice does not match the original is due to:
undertraining (most common)
dataset has too much variety, for example a person using two "voices" at the same time

oak inlet Dec 1, 2024, 12:42 AM

#

analog obsidian yes also when a voice does not match the original is due to: undertraining (most...

does uvr5 also give both

#

vocal and audio

analog obsidian Dec 1, 2024, 12:43 AM

#

oak inlet does uvr5 also give both

for separating vocals? yea, use mvsep bs roformer

#

or mel roformer kim

oak inlet Dec 1, 2024, 12:43 AM

#

analog obsidian for separating vocals? yea, use mvsep bs roformer

alr

oak inlet Dec 1, 2024, 12:43 AM

#

analog obsidian or mel roformer kim

which do u prefer personally

rare gobletBOT Dec 1, 2024, 12:43 AM

#

Ayo? @oak inlet level 1 !!! lfg

chilly ridge Dec 1, 2024, 12:44 AM

#

analog obsidian yes also when a voice does not match the original is due to: undertraining (most...

Audio came from kingdom hearts game, so, i have file with just him yelling. Does i need to use these kind of files ?

analog obsidian Dec 1, 2024, 12:44 AM

#

oak inlet which do u prefer personally

i prefer mel roformer, bs roformer is known to have muddy instrumentals and vocals

oak inlet Dec 1, 2024, 12:44 AM

#

oh

analog obsidian Dec 1, 2024, 12:44 AM

#

chilly ridge Audio came from kingdom hearts game, so, i have file with just him yelling. Does...

avoid yelling and laughing audios

#

you can keep a few laughs

#

but be sure to not add too much of them

#

might confuse ai believing thats how the person sounds

oak inlet Dec 1, 2024, 12:45 AM

#

how do i add ffprobe and ffmpeg to root. Im kinda restarted

chilly ridge Dec 1, 2024, 12:46 AM

#

analog obsidian might confuse ai believing thats how the person sounds

Thx for all these advice ! I'll try theme all

analog obsidian Dec 1, 2024, 12:46 AM

#

oak inlet how do i add ffprobe and ffmpeg to root. Im kinda restarted

are u on linux? sorry idk

oak inlet Dec 1, 2024, 12:46 AM

#

nah win

#

11

analog obsidian Dec 1, 2024, 12:46 AM

#

if ur new to rvc check this https://docs.ai-hub.wtf/

oak inlet Dec 1, 2024, 12:46 AM

#

ok

simple ore Dec 1, 2024, 12:53 AM

#

chilly ridge Hello everyone! I would need the advice of someone who knows about RVC please, I...

sibilants are made out of noise "columns" that is shifted in frequency

#

#

so RVC needs about 5-10k attempts to make a proper 'S' or 'Ch' or similar sounds out of pure noise. That is why undertrained model produces metallic S.

chilly ridge Dec 1, 2024, 12:54 AM

#

simple ore sibilants are made out of noise "columns" that is shifted in frequency

Can you explain please ?

#

Ho i see

analog obsidian Dec 1, 2024, 12:54 AM

#

chilly ridge Can you explain please ?

too small dataset causes them to appear often due to lack of data

simple ore Dec 1, 2024, 12:55 AM

#

with a too small dataset you need to train for like 2000+ epochs

#

but chances are that while it may fix S sounds it may ruin voiced parts (those wavy lines)

chilly ridge Dec 1, 2024, 12:55 AM

#

I'll use audio file from all kingdom hearts game instead of 1 to increase the dataset size

#

I think i can reach 8-10min

chilly ridge Dec 1, 2024, 12:56 AM

#

simple ore with a too small dataset you need to train for like 2000+ epochs

That's a lot, even with my gpu

simple ore Dec 1, 2024, 12:56 AM

#

the example aboive took 5000 loops

chilly ridge Dec 1, 2024, 12:56 AM

#

The image ?

simple ore Dec 1, 2024, 12:57 AM

#

analog obsidian Dec 1, 2024, 12:57 AM

#

chilly ridge I think i can reach 8-10min

it will not fully remove them but they're still going to appear moderately

#

he's trying to explain you that the more steps you train, they appear less

#

xD

simple ore Dec 1, 2024, 12:57 AM

#

not, they dont appear less, they take a proper shape

chilly ridge Dec 1, 2024, 12:57 AM

#

Wow

#

It's a big difference

simple ore Dec 1, 2024, 12:58 AM

#

the noise column shifts into the right frequency range

#

so instead of metallic z it is a proper hissing s

analog obsidian Dec 1, 2024, 12:58 AM

#

simple ore not, they dont appear less, they take a proper shape

well this is a better way to say it lol

chilly ridge Dec 1, 2024, 12:58 AM

#

Haha

#

Ok so, with 10min dataset. 3 batch size and 2000epoch ? 🙃

analog obsidian Dec 1, 2024, 12:59 AM

#

chilly ridge Ok so, with 10min dataset. 3 batch size and 2000epoch ? 🙃

do 4 and no more than 200 epochs

chilly ridge Dec 1, 2024, 1:00 AM

#

Ok, i'll try it tomorrow (it's 2am here)

#

Good night, thx for all

analog obsidian Dec 1, 2024, 1:00 AM

#

doggowave

simple ore Dec 1, 2024, 1:00 AM

#

it would depend on how many times there's S in the dataset and if you're lucky enough for the training to hit that S slice of audio

knotty moth Dec 1, 2024, 1:08 AM

#

simple ore 10 hr+ - 16

I dont think even 10h+ dataset from 3k voice recordings for a specific person's voice is necessary like this lol https://www.techspot.com/news/105764-panasonic-resurrects-long-dead-founder-ai-share-management.html

TechSpot

Panasonic resurrects long-dead founder as an AI to share his manage...

simple ore Dec 1, 2024, 1:09 AM

#

anyway, with a pretrain the requirements are not that high.. i've been testing it from scratch

simple ore Dec 1, 2024, 1:12 AM

#

knotty moth I dont think even 10h+ dataset from 3k voice recordings for a specific person's ...

with a good dataset 10-20 epochs with a pretrain get you a recognizable voice of a specific person

#

nowhere perfect, but like 50% of the work is done there

#

after that it is just small touches here and there that slowly shape up the voice

#

but anyway, rvc is not a real voice clone, there are very important characteristics it can not reproduce such as peronal inter-phoneme microdelays and mannerisms

knotty moth Dec 1, 2024, 1:23 AM

#

covert anchor i should be training at 2 or 3 secs on every epoch, or at least i think i should

~30-40 sec per epoch for 10 min dataset & batch size 8 should be normal

dull ledge Dec 1, 2024, 1:24 AM

#

@analog obsidian Hi Lyery, are you there?

analog obsidian Dec 1, 2024, 1:24 AM

#

dull ledge <@775545133448953856> Hi Lyery, are you there?

doggowave

dull ledge Dec 1, 2024, 1:25 AM

#

You have a really cool Goth Mommy model, unfortunately the link is not working, did you delete it? If not, mind to share it with me if its still public? Thank you 😄
The link is this one: https://voice-models.com/model/1ucea3z45g5

analog obsidian Dec 1, 2024, 1:26 AM

#

dull ledge You have a really cool Goth Mommy model, unfortunately the link is not working, ...

i no longer have it sry, lost it when i upgraded my ssd

dull ledge Dec 1, 2024, 1:27 AM

#

Was the best one I heard 🤭 . No worries, thank you

oak inlet Dec 1, 2024, 1:48 AM

#

yo

#

i got an error

azure marshBOT Dec 1, 2024, 1:49 AM

#

oak inlet i got an error

Hey, alt136735! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:

General RVC help: #✨│ai-help
W-Okada / Realtime RVC: #🔍│help-w-okada
AI image related: #🔍│help-ai-art

oak inlet Dec 1, 2024, 1:49 AM

#

File "C:\Users\vedant\Desktop\Retrieval-based-Voice-Conversion-WebUI-main\infer\modules\vc\modules.py", line 172, in vc_single
self.hubert_model = load_hubert(self.config)
File "C:\Users\vedant\Desktop\Retrieval-based-Voice-Conversion-WebUI-main\infer\modules\vc\utils.py", line 23, in load_hubert
models, _, _ = checkpoint_utils.load_model_ensemble_and_task(
File "C:\Users\vedant\AppData\Local\Programs\Python\Python310\lib\site-packages\fairseq\checkpoint_utils.py", line 423, in load_model_ensemble_and_task
raise IOError("Model file not found: {}".format(filename))
OSError: Model file not found: assets/hubert/hubert_base.pt

#

!howtoask

patent trellisBOT Dec 1, 2024, 1:51 AM

#

oak inlet !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

azure marshBOT Dec 1, 2024, 2:13 AM

#

📚 Audioguías y Herramientas

Creating Datasets for RVC using iZotope RX11, por Cauthess
Gathering and Isolating Audio, por SCRFilms ❄
Instrumental and vocal & stems separation & mastering guide, por deton24
Vocal Mixing Tutorial, por Roomie
https://mvsep.com/

magic badge Dec 1, 2024, 2:13 AM

#

/collab

#

-colab

azure marshBOT Dec 1, 2024, 2:14 AM

#

magic badge -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

wet fable Dec 1, 2024, 7:09 AM

#

hi there, i'd like to ask whether RVC is better than weights.gg
RVC needs lots of time to generate, while weights.gg not
idk if there's differences. Thanks!

crude flame Dec 1, 2024, 7:19 AM

#

wet fable hi there, i'd like to ask whether RVC is better than weights.gg RVC needs lots o...

weights uses rvc

wet fable Dec 1, 2024, 7:20 AM

#

I'm sorry so it's exactly the same thing?

knotty moth Dec 1, 2024, 7:21 AM

#

wet fable hi there, i'd like to ask whether RVC is better than weights.gg RVC needs lots o...

if u have a 5090, it might be faster

crude flame Dec 1, 2024, 7:21 AM

#

wet fable I'm sorry so it's exactly the same thing?

yup

wet fable Dec 1, 2024, 7:22 AM

#

Ok thanks!

low shard Dec 1, 2024, 9:50 AM

#

wet fable hi there, i'd like to ask whether RVC is better than weights.gg RVC needs lots o...

The difference is the GPU

#

I don't remember if it was an AI specialized GPU like A100 or an rtx 4090 tho

flint solar Dec 1, 2024, 9:57 AM

#

hello chat

#

oh fuck wrong chat

prisma kite Dec 1, 2024, 11:41 AM

#

pomocy wyskoczyło mi No module named 'gradio'
i nie wiem jak to naprawić

#

help, I got No module named 'gradio'
and I don't know how to fix it****

simple ore Dec 1, 2024, 12:55 PM

#

prisma kite help, I got No module named 'gradio' and I don't know how to fix it****

if you got the source, you need to install all requirements

#

but better just get a compiled version

#

with all the required packages included

gentle hollow Dec 1, 2024, 1:06 PM

#

Guys plsss tell me wat Ai vocal remover y’all useee

young perch Dec 1, 2024, 1:27 PM

#

Guys, what should I do if I experience a large delay in my voice changer? When I test it in the program, delay is about 3-4 seconds, but when I join the game, it shows around 30 000 ms. Please Help(((

sage wind Dec 1, 2024, 1:45 PM

#

ummm guys

#

when i opened voice models channel theres literally 0 models

low shard Dec 1, 2024, 1:46 PM

#

sage wind when i opened voice models channel theres literally 0 models

refresh discord, be sure your connection is stable

#

might be a discord moment

sage wind Dec 1, 2024, 1:46 PM

#

low shard refresh discord, be sure your connection is stable

bro 100mb per sec and also cable how is that can be unstable 😭😭😭

low shard Dec 1, 2024, 1:46 PM

#

sage wind bro 100mb per sec and also cable how is that can be unstable 😭😭😭

see if there's an 'X' next to the search thing, could be that your last research got stuck

sage wind Dec 1, 2024, 1:47 PM

#

low shard see if there's an 'X' next to the search thing, could be that your last research...

o ty

low shard Dec 1, 2024, 1:47 PM

#

sage wind o ty

did that solve it?

sage wind Dec 1, 2024, 1:48 PM

#

low shard did that solve it?

yeah

rare gobletBOT Dec 1, 2024, 1:48 PM

#

Ayo? @sage wind level 1 !!! lfg

sage wind Dec 1, 2024, 1:48 PM

#

i can even send screenshot but i dont have permission

low shard Dec 1, 2024, 1:49 PM

#

sage wind i can even send screenshot but i dont have permission

yea you need to be level 1 by chatting to send images in help channels

low shard Dec 1, 2024, 1:49 PM

#

sage wind yeah

yw 🔥

sage wind Dec 1, 2024, 1:49 PM

#

#

🔥🔥🔥🔥

#

ty yall

low shard Dec 1, 2024, 1:50 PM

#

yw

chilly ridge Dec 1, 2024, 2:26 PM

#

analog obsidian use batch size 4 train for not longer than 200 epochs

I increased the dataset to 10min with every file duration between 5 and 8s (idk why but audacity don't want to work with less duration), it's actually in training for 200 epochs

#

20s/epoch

dawn bronze Dec 1, 2024, 2:32 PM

#

@low shard How can I convert vocals from huggingface.co to A Mp3

analog obsidian Dec 1, 2024, 2:37 PM

#

chilly ridge I increased the dataset to 10min with every file duration between 5 and 8s (idk ...

8s is too long, but it doesn't really matter because rvc is going to slice those long samples
(and audacity already removed long silences by itself so the chances of the rvc slider adding long silences are very low)

low shard Dec 1, 2024, 2:38 PM

#

dawn bronze <@911742715019001897> How can I convert vocals from huggingface.co to A Mp3

I'm guessing you're talking about the vocals output you get from Ilaria RVC Zero?

#

Why would you want to convert them from .wav to mp3? mp3 is lossy and lower quality than wav

#

wav is way better, it's lossless

#

There's no reason to convert them,

If you really want to do that, you can just google any wav to mp3 converting site like this

WAV to MP3 (Online & Free) — Convertio

Best way to convert your WAV to MP3 file in seconds. 100% free, secure and easy to use! Convertio — advanced online tool that solving any problems with any files.

#

But It's really not suggested

knotty moth Dec 1, 2024, 2:53 PM

#

dawn bronze <@911742715019001897> How can I convert vocals from huggingface.co to A Mp3

use audacity (with ffmpeg) or any other audio editor, or the online sites like suggested above

true ravine Dec 1, 2024, 3:22 PM

#

crude flame windows button + tab then create a new desktop and use that

thanks men

azure marshBOT Dec 1, 2024, 3:34 PM

#

✍ Suggestions

Search for it in AI HUB Docs or Applio Docs. You will probably find your answer there 📚
Ask for help in #🔍│help-w-okada if it's related to real time voice changing but make sure to read #1297207135469305866 first
Ask for help in #✨│ai-help for general help, but use the command !howtoask first to learn how to structure your question properly and increase your chances of getting a reply
Last but not least, ask for help in #🔍│help-ai-art if it's related to AI images.

chilly ridge Dec 1, 2024, 3:41 PM

#

analog obsidian 8s is too long, but it doesn't really matter because rvc is going to slice those...

It's better but sometimes it's robotic again

#

but it's really really better

analog obsidian Dec 1, 2024, 3:42 PM

#

chilly ridge It's better but sometimes it's robotic again

yes its normal, usually you stop getting robotic sibilances at the 30 minute mark

#

anything below that use a batch size of 4

#

and pray for getting them less often

chilly ridge Dec 1, 2024, 3:43 PM

#

okay, and if i try more epoch ? like 400 ? it will just getting worst ?

analog obsidian Dec 1, 2024, 3:44 PM

#

chilly ridge okay, and if i try more epoch ? like 400 ? it will just getting worst ?

not worse but the improvement is very marginal and you have the risk of overfitting

#

not audible difference + unnecesary risks

chilly ridge Dec 1, 2024, 3:44 PM

#

ok i'll try to increase the dataset again

#

do you know how i can extract automatically a voice from a certain caracter from a movie file ?

#

it look really long to do it manually

analog obsidian Dec 1, 2024, 3:46 PM

#

chilly ridge do you know how i can extract automatically a voice from a certain caracter from...

that is called speaker diarization, currently the only one that exists is named pyannote and is extremely ass

chilly ridge Dec 1, 2024, 3:46 PM

#

i tryed with python and speechbrain but it's not convincing

rare gobletBOT Dec 1, 2024, 3:47 PM

#

Ayo? @chilly ridge level 3 !!! lfg

analog obsidian Dec 1, 2024, 3:47 PM

#

sadly is better to separate speakers manually

low shard Dec 1, 2024, 3:47 PM

#

analog obsidian that is called speaker diarization, currently the only one that exists is named ...

nah not that ass

chilly ridge Dec 1, 2024, 3:47 PM

#

analog obsidian that is called speaker diarization, currently the only one that exists is named ...

i tryed that too

analog obsidian Dec 1, 2024, 3:47 PM

#

low shard nah not that ass

was very ass for me

#

what

chilly ridge Dec 1, 2024, 3:47 PM

#

it juste identifyed the whole movie like it's the character i'm looking for

low shard Dec 1, 2024, 3:48 PM

#

analog obsidian was very ass for me

idk why it wasn't for me

analog obsidian Dec 1, 2024, 3:48 PM

#

chilly ridge it juste identifyed the whole movie like it's the character i'm looking for

yea its trash dont use it

low shard Dec 1, 2024, 3:48 PM

#

I haven't played with it since a while tho https://github.com/sanctuary-osai/Pyannote-Speaker-Diarization-3.1

chilly ridge Dec 1, 2024, 3:48 PM

#

low shard idk why it wasn't for me

if it work for you, so explained how to use it

analog obsidian Dec 1, 2024, 3:48 PM

#

low shard I haven't played with it since a while tho https://github.com/sanctuary-osai/Pya...

better not having the risks of ai getting confused at voices

#

they do have a paid version i havent tried

#

probably he tried that

low shard Dec 1, 2024, 3:49 PM

#

analog obsidian probably he tried that

How could I?

analog obsidian Dec 1, 2024, 3:49 PM

#

low shard How could I?

idk i was guessing

low shard Dec 1, 2024, 3:49 PM

#

Nuh uh

#

Not paying for any shit

#

I love Open Source

analog obsidian Dec 1, 2024, 3:50 PM

#

me 2

chilly ridge Dec 1, 2024, 3:51 PM

#

i'll try again with diarization

#

maybe i did something wrong

low shard Dec 1, 2024, 3:52 PM

#

chilly ridge if it work for you, so explained how to use it

the parameters are explained in the ui of the project i sent

low shard Dec 1, 2024, 3:52 PM

#

analog obsidian me 2

🔥

chilly ridge Dec 1, 2024, 3:52 PM

#

low shard the parameters are explained in the ui of the project i sent

thx

low shard Dec 1, 2024, 3:55 PM

#

analog obsidian better not having the risks of ai getting confused at voices

That's true tho, better double check it

#

but maybe could help, im saying it just for that

#

imagine having to watch a whole movie to make a model 😭

chilly ridge Dec 1, 2024, 6:30 PM

#

analog obsidian yea its trash dont use it

diarization does't worked

#

any idea how i can export all the movie scene with that character ?

#

clearelly i don't like the idea to do it mannualy

analog obsidian Dec 1, 2024, 6:31 PM

#

chilly ridge any idea how i can export all the movie scene with that character ?

do it manually

#

nails

chilly ridge Dec 1, 2024, 6:31 PM

#

sad

#

for just 2min of audio :)

oak inlet Dec 1, 2024, 6:56 PM

#

oak inlet File "C:\Users\vedant\Desktop\Retrieval-based-Voice-Conversion-WebUI-main\infer\...

anyone know fix

chilly ridge Dec 1, 2024, 7:01 PM

#

analog obsidian yes its normal, usually you stop getting robotic sibilances at the 30 minute mar...

i have 1h of dataset, do i also use 4 for batch size and 200Epoch ?

glacial pollen Dec 1, 2024, 7:03 PM

#

so, where is it

analog obsidian Dec 1, 2024, 7:03 PM

#

chilly ridge i have 1h of dataset, do i also use 4 for batch size and 200Epoch ?

hard to answer this since it depends on the variety of the dataset

glacial pollen Dec 1, 2024, 7:03 PM

#

ah

#

I mean

#

It should be pretty simple

#

🤔

#

you lack a model ( a component of rvc, that is )

oak inlet Dec 1, 2024, 7:04 PM

#

oh

chilly ridge Dec 1, 2024, 7:04 PM

#

analog obsidian hard to answer this since it depends on the variety of the dataset

same as always, i have 1h of audio with only 3s-4s audio file

glacial pollen Dec 1, 2024, 7:04 PM

#

https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main

#

@oak inlet

analog obsidian Dec 1, 2024, 7:04 PM

#

chilly ridge same as always, i have 1h of audio with only 3s-4s audio file

for simplicity sake use batch 8

glacial pollen Dec 1, 2024, 7:04 PM

#

Dl all you lack from there

chilly ridge Dec 1, 2024, 7:04 PM

#

thx

oak inlet Dec 1, 2024, 7:04 PM

#

glacial pollen <@1311544405361954909>

THAT WAS IT

glacial pollen Dec 1, 2024, 7:04 PM

#

hubert .pth goes into assets/hubert

#

rmvpe goes into assets/rmvpe

oak inlet Dec 1, 2024, 7:04 PM

#

I DIDNT KNOW WHAT TO DO ON THAT TY

glacial pollen Dec 1, 2024, 7:05 PM

#

oof

#

You see

#

always look at the last part of traceback

oak inlet Dec 1, 2024, 7:05 PM

#

glacial pollen always look at the last part of traceback

ty

#

legend fr

glacial pollen Dec 1, 2024, 7:05 PM

#

( + additionally, the root of the issue

#

but typically the last line tells you what is the main deal

oak inlet Dec 1, 2024, 7:05 PM

#

glacial pollen ( + additionally, the root of the issue

ye

glacial pollen Dec 1, 2024, 7:05 PM

#

Alr, glad I could help. best of luck man

oak inlet Dec 1, 2024, 7:05 PM

#

glacial pollen Alr, glad I could help. best of luck man

same to u

#

is it rvmpe.pt?

glacial pollen Dec 1, 2024, 7:06 PM

#

yes

#

.pt or .pth are pytorch models format
~ for the record

oak inlet Dec 1, 2024, 7:07 PM

#

oh

#

is it hubert base

glacial pollen Dec 1, 2024, 7:07 PM

#

yes

oak inlet Dec 1, 2024, 7:07 PM

#

oh ty

glacial pollen Dec 1, 2024, 7:07 PM

#

rmvpe is for f0 extraction
hubert is for features

waxen kelp Dec 1, 2024, 7:07 PM

#

hello how do i send pictures for help?

oak inlet Dec 1, 2024, 7:08 PM

#

glacial pollen rmvpe is for f0 extraction hubert is for features

ohh

glacial pollen Dec 1, 2024, 7:08 PM

#

waxen kelp hello how do i send pictures for help?

First you must get to some level, I think 5, 7 or 10 to have an ability to ( in case you aren't able to rn
You level up by being active on chat

oak inlet Dec 1, 2024, 7:08 PM

#

its levek 2

glacial pollen Dec 1, 2024, 7:08 PM

#

oh, then 2 then. Seems like the threshold got lowered

oak inlet Dec 1, 2024, 7:08 PM

#

i can do it rn

waxen kelp Dec 1, 2024, 7:08 PM

#

glacial pollen First you must get to some level, I think 5, 7 or 10 to have an ability to ( in ...

oh so i just yap?

oak inlet Dec 1, 2024, 7:09 PM

#

waxen kelp oh so i just yap?

yues

glacial pollen Dec 1, 2024, 7:09 PM

#

I mean yea kinda, but you can just go #🤖│bots I suppose

#

or somethin'
to not clog the main chats

#

Anyway, I gotta go back to work now

oak inlet Dec 1, 2024, 7:16 PM

#

yo i got another error now there is no trace back it procceses for a few sec then it says error

#

2024-12-01 11:13:13 | INFO | fairseq.tasks.hubert_pretraining | current directory is C:\Users\vedant\Desktop\Retrieval-based-Voice-Conversion-WebUI-main
2024-12-01 11:13:13 | INFO | fairseq.tasks.hubert_pretraining | HubertPretrainingTask Config {'_name': 'hubert_pretraining', 'data': 'metadata', 'fine_tuning': False, 'labels': ['km'], 'label_dir': 'label', 'label_rate': 50.0, 'sample_rate': 16000, 'normalize': False, 'enable_padding': False, 'max_keep_size': None, 'max_sample_size': 250000, 'min_sample_size': 32000, 'single_target': False, 'random_crop': True, 'pad_audio': False}
2024-12-01 11:13:13 | INFO | fairseq.models.hubert.hubert | HubertModel Config: {'_name': 'hubert', 'label_rate': 50.0, 'extractor_mode': default, 'encoder_layers': 12, 'encoder_embed_dim': 768, 'encoder_ffn_embed_dim': 3072, 'encoder_attention_heads': 12, 'activation_fn': gelu, 'layer_type': transformer, 'dropout': 0.1, 'attention_dropout': 0.1, 'activation_dropout': 0.0, 'encoder_layerdrop': 0.05, 'dropout_input': 0.1, 'dropout_features': 0.1, 'final_dim': 256, 'untie_final_proj': True, 'layer_norm_first': False, 'conv_feature_layers': '[(512,10,5)] + [(512,3,2)] * 4 + [(512,2,2)] * 2', 'conv_bias': False, 'logit_temp': 0.1, 'target_glu': False, 'feature_grad_mult': 0.1, 'mask_length': 10, 'mask_prob': 0.8, 'mask_selection': static, 'mask_other': 0.0, 'no_mask_overlap': False, 'mask_min_space': 1, 'mask_channel_length': 10, 'mask_channel_prob': 0.0, 'mask_channel_selection': static, 'mask_channel_other': 0.0, 'no_mask_channel_overlap': False, 'mask_channel_min_space': 1, 'conv_pos': 128, 'conv_pos_groups': 16, 'latent_temp': [2.0, 0.5, 0.999995], 'skip_masked': False, 'skip_nomask': False, 'checkpoint_activations': False, 'required_seq_len_multiple': 2, 'depthwise_conv_kernel_size': 31, 'attn_type': '', 'pos_enc_type': 'abs', 'fp16': False}
2024-12-01 11:13:14 | INFO | infer.modules.vc.pipeline | Loading rmvpe model,assets/rmvpe/rmvpe.pt

flint solar Dec 1, 2024, 7:21 PM

#

oak inlet 2024-12-01 11:13:13 | INFO | fairseq.tasks.hubert_pretraining | current director...

this isn't an error

oak inlet Dec 1, 2024, 7:21 PM

#

oh

flint solar Dec 1, 2024, 7:21 PM

#

its info

oak inlet Dec 1, 2024, 7:21 PM

#

but it says error

#

in the webui

flint solar Dec 1, 2024, 7:22 PM

#

oak inlet in the webui

its normal

flint solar Dec 1, 2024, 7:22 PM

#

oak inlet yo i got another error now there is no trace back it procceses for a few sec the...

nothing seems wrong here

waxen kelp Dec 1, 2024, 7:23 PM

#

hi! i try to use RVC but in the end it says this.does anyone know whats the problem and how to fix it

oak inlet Dec 1, 2024, 7:23 PM

#

flint solar nothing seems wrong here

should i rec it

flint solar Dec 1, 2024, 7:23 PM

#

waxen kelp hi! i try to use RVC but in the end it says this.does anyone know whats the prob...

this gui is extremely outdated

waxen kelp Dec 1, 2024, 7:23 PM

#

?

#

i thought i have the latest version

oak inlet Dec 1, 2024, 7:24 PM

#

waxen kelp ?

very out of date

#

https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md

#

this is new

distant turtle Dec 1, 2024, 7:24 PM

#

-colab

azure marshBOT Dec 1, 2024, 7:24 PM

#

distant turtle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

waxen kelp Dec 1, 2024, 7:25 PM

#

oak inlet this is new

oh snap is there a tuto on how to download it?

#

seems a lot of stuff

#

im not aware of much github

oak inlet Dec 1, 2024, 7:27 PM

#

waxen kelp oh snap is there a tuto on how to download it?

nah there isnt

#

wait

#

make sure u have pip 24.0

#

3.10

#

rc1

waxen kelp Dec 1, 2024, 7:28 PM

#

oak inlet make sure u have pip 24.0

tf is that?

oak inlet Dec 1, 2024, 7:28 PM

#

waxen kelp tf is that?

u need it to install the requirements

rare gobletBOT Dec 1, 2024, 7:28 PM

#

Ayo? @oak inlet level 3 !!! lfg

oak inlet Dec 1, 2024, 7:28 PM

#

wait

#

u wanna call or sm?

#

through dc

#

discoerd

waxen kelp Dec 1, 2024, 7:29 PM

#

nah its aight i was just tryna make a silly cover

oak inlet Dec 1, 2024, 7:29 PM

#

oh

waxen kelp Dec 1, 2024, 7:29 PM

#

thank you very much helping tho!

rare gobletBOT Dec 1, 2024, 7:29 PM

#

Ayo? @waxen kelp level 2 !!! lfg

low shard Dec 1, 2024, 8:05 PM

#

waxen kelp thank you very much helping tho!

what's ur pc gpu?

#

and yea u shouldn't use rvc gui

waxen kelp Dec 1, 2024, 8:06 PM

#

low shard what's ur pc gpu?

its GX 840

low shard Dec 1, 2024, 8:08 PM

#

waxen kelp its GX 840

you sure it's that?

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
and here u should find the gpu memory

#

I have never heard of that gpu + can't find it online

gloomy lynx Dec 1, 2024, 8:09 PM

#

I just realized my version of rvc gui is outdated too damn

rare gobletBOT Dec 1, 2024, 8:09 PM

#

Ayo? @gloomy lynx level 1 !!! lfg

low shard Dec 1, 2024, 8:11 PM

#

gloomy lynx I just realized my version of rvc gui is outdated too damn

yea nah u should never use rvc gui

#

what's ur pc gpu

gloomy lynx Dec 1, 2024, 8:11 PM

#

nvidia geforce rtx 3060 ti

low shard Dec 1, 2024, 8:11 PM

#

gloomy lynx nvidia geforce rtx 3060 ti

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
Mainline: The original RVC

gloomy lynx Dec 1, 2024, 8:11 PM

#

Oh alr

#

what do you thinks better?

low shard Dec 1, 2024, 8:13 PM

#

gloomy lynx what do you thinks better?

applio imo

gloomy lynx Dec 1, 2024, 8:13 PM

#

Alr lemme try it

chilly ridge Dec 1, 2024, 8:40 PM

#

analog obsidian for simplicity sake use batch 8

10min25/Epoch is normal ??

analog obsidian Dec 1, 2024, 8:42 PM

#

chilly ridge 10min25/Epoch is normal ??

what, 10 minutes per epoch or 25 seconds?

chilly ridge Dec 1, 2024, 8:42 PM

#

Both

#

10min and 25s

analog obsidian Dec 1, 2024, 8:42 PM

#

thats not normal, it should be 1-2-3 minute per epoch if your dataset has 1 hour of data

#

how much vram do you have

chilly ridge Dec 1, 2024, 8:42 PM

#

I reach 48min without silence in audio

chilly ridge Dec 1, 2024, 8:43 PM

#

analog obsidian how much vram do you have

8Go Vram, 32Go ram and a big xenon processor

#

I have an IA gpu normally

#

I dont' know if it's change something

analog obsidian Dec 1, 2024, 8:44 PM

#

oh that explains

chilly ridge Dec 1, 2024, 8:44 PM

#

Quadro ADA RTX 4000 8Go

analog obsidian Dec 1, 2024, 8:44 PM

#

chilly ridge Quadro ADA RTX 4000 8Go

this is what are you using rn?

chilly ridge Dec 1, 2024, 8:44 PM

#

Yep

#

I cheked my cuda installation and everything look ok

analog obsidian Dec 1, 2024, 8:45 PM

#

idk sorry i have no idea, i don't use these types of gpu

chilly ridge Dec 1, 2024, 8:46 PM

#

With the 10min dataset and 4batch it was 7s/epoch

#

I checked the "charge dataset in gpu"

analog obsidian Dec 1, 2024, 8:46 PM

#

chilly ridge I checked the "charge dataset in gpu"

oohhh

#

this explains

#

caching dataset in gpu in huge datasets cause massive vram usage

#

you're using system RAM

#

this is why became so slow

#

system is using fallback memory

#

disable that on big datasets

chilly ridge Dec 1, 2024, 8:47 PM

#

Ok so i need to uncheck this option ?

analog obsidian Dec 1, 2024, 8:47 PM

#

yes

chilly ridge Dec 1, 2024, 8:47 PM

#

Okay, i try

#

Thx

chilly ridge Dec 1, 2024, 8:58 PM

#

analog obsidian caching dataset in gpu in huge datasets cause massive vram usage

Now it's 2min/Epoch

rare gobletBOT Dec 1, 2024, 8:58 PM

#

Ayo? @chilly ridge level 4 !!! lfg

chilly ridge Dec 1, 2024, 8:58 PM

#

Thx 🙏

analog obsidian Dec 1, 2024, 8:58 PM

#

chilly ridge Now it's 2min/Epoch

yes, now is normal

#

no problem

chilly ridge Dec 1, 2024, 8:58 PM

#

I'll credit you if i finish my project one day

oak hearth Dec 1, 2024, 10:17 PM

#

so uhwhere do i download

rare gobletBOT Dec 1, 2024, 10:17 PM

#

Ayo? @oak hearth level 2 !!! lfg

oak hearth Dec 1, 2024, 10:18 PM

#

my thingy flatlined, it TECHnically is going down and it isnt going up do i just download the latest .pth saved?

#

or do i let it go until it goes up?

simple ore Dec 1, 2024, 10:30 PM

#

click

knotty moth Dec 1, 2024, 10:34 PM

#

chilly ridge I checked the "charge dataset in gpu"

~~you need an A100 to do that~~
there's no point of using that option tho

oak hearth Dec 1, 2024, 10:45 PM

#

simple ore click

oml thank you how did it get like that

knotty moth Dec 1, 2024, 10:50 PM

#

chilly ridge Quadro ADA RTX 4000 8Go

I dont recall there's RTX ada with only 8 GB (you could get a cheaper normal rtx one with 12/16 gb vram lmao)

oak hearth Dec 1, 2024, 11:51 PM

#

so uh is there any way to resume training without a g and d file

gloomy lynx Dec 2, 2024, 12:02 AM

#

i dont think so?

#

I havent made a model in awhile so im not sure

chilly ridge Dec 2, 2024, 12:06 AM

#

knotty moth I dont recall there's RTX ada with only 8 GB (you could get a cheaper normal rtx...

I get it for free, so i don't really care 🙃

chilly ridge Dec 2, 2024, 12:08 AM

#

knotty moth I dont recall there's RTX ada with only 8 GB (you could get a cheaper normal rtx...

I just checked and you'r right, i have a quadro rtx 4000 8Go, the 4000 ada 20Go is in my boss's workstation

#

Mb

knotty moth Dec 2, 2024, 12:09 AM

#

chilly ridge I get it for free, so i don't really care 🙃

you can resell it, profit! goofy

chilly ridge Dec 2, 2024, 12:10 AM

#

Haha, no, i prefer keep it

pastel fiber Dec 2, 2024, 12:11 AM

#

can someone help

azure marshBOT Dec 2, 2024, 12:11 AM

#

pastel fiber can someone help

Hey, jinxss! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:

General RVC help: #✨│ai-help
W-Okada / Realtime RVC: #🔍│help-w-okada
AI image related: #🔍│help-ai-art

chilly ridge Dec 2, 2024, 12:11 AM

#

knotty moth you can resell it, profit! <:goofy:1159397241199534152>

I have a 3060 8Go in my laptop. With same dataset and setting, wich one you think have less min/epoch ?

timber hamlet Dec 2, 2024, 12:11 AM

#

What’s the

pastel fiber Dec 2, 2024, 12:12 AM

#

why does it say the input is the vb output, it also says output is vb input

timber hamlet Dec 2, 2024, 12:12 AM

#

We’re are the buzzes poles

#

People

#

What’s a good room too be in

rare gobletBOT Dec 2, 2024, 12:13 AM

#

Ayo? @timber hamlet level 1 !!! lfg

timber hamlet Dec 2, 2024, 12:14 AM

#

How do u find that iam new🫤

#

Why u yell

meager comet Dec 2, 2024, 12:29 AM

#

how much faster is an a100 compared to colab t4

knotty moth Dec 2, 2024, 12:42 AM

#

chilly ridge I just checked and you'r right, i have a quadro rtx 4000 8Go, the 4000 ada 20Go ...

there's only RTX 4000 ada 20Gogs, your boss must have modded yours to 8Gogs (if not physically, perhaps only bios tweaking). performance wise, it's roughly between 4060 Ti (including 16Gogs) and 4070 (12 Gogs).

chilly ridge Dec 2, 2024, 12:42 AM

#

No no

#

I said i don't have an ada

#

Just a quadro rtx 4000

#

My boss's gpu is an rtx 4000 ada

knotty moth Dec 2, 2024, 12:44 AM

#

chilly ridge Quadro ADA RTX 4000 8Go

bruh you did say this

chilly ridge Dec 2, 2024, 12:44 AM

#

chilly ridge I just checked and you'r right, i have a quadro rtx 4000 8Go, the 4000 ada 20Go ...

Here

knotty moth Dec 2, 2024, 12:45 AM

#

chilly ridge Here

the name seems a bit ambiguous, perhaps the old Turing one?
https://www.techpowerup.com/gpu-specs/quadro-rtx-4000.c3336

chilly ridge Dec 2, 2024, 12:46 AM

#

Haha yeah, it's this one

#

I thought I had the same computer as my boss

#

But it look like he keep the big gpu for him

knotty moth Dec 2, 2024, 12:48 AM

#

chilly ridge Haha yeah, it's this one

in comparison, there's RTX 2070 with the same gogs and similar performance

chilly ridge Dec 2, 2024, 12:49 AM

#

I see, so, my laptop with the 3060 8Go could be faster/epoch ?

knotty moth Dec 2, 2024, 12:51 AM

#

chilly ridge I see, so, my laptop with the 3060 8Go could be faster/epoch ?

#

also RTX laptops are usually easier to overheat than desktop ones

chilly ridge Dec 2, 2024, 12:52 AM

#

I put my laptop upside down in front of the AC 🤫

chilly ridge Dec 2, 2024, 12:53 AM

#

knotty moth

I don't understand the graph

knotty moth Dec 2, 2024, 12:53 AM

#

chilly ridge I don't understand the graph

skill issue

#

it literally shows the same performance as 2060

chilly ridge Dec 2, 2024, 12:54 AM

#

Okay, not bad for a free gpu

shut goblet Dec 2, 2024, 1:21 AM

#

how the fuck do i fix this

simple ore Dec 2, 2024, 1:26 AM

#

shut goblet how the fuck do i fix this

probbaly something like vc redist is missing

#

or cuda tools

shut goblet Dec 2, 2024, 1:28 AM

#

simple ore probbaly something like vc redist is missing

well when i was opening the rvc file

#

it showed up a error

#

is there a proper way to open the file?

simple ore Dec 2, 2024, 2:40 AM

#

is failing to load because some dependency is missing

simple ore Dec 2, 2024, 2:41 AM

#

shut goblet is there a proper way to open the file?

there's a way to find the missing depencency

#

https://github.com/lucasg/Dependencies/releases/tag/v1.11.1 - download x64 release, unzip, run the dependenciesgui.exe

#

from that open the dll shown on the screenshot, it will list what it needs

#

but my guess is either vc++ redist or CUDA toolkit

frank owl Dec 2, 2024, 3:30 AM

#

Check out this creation I made on Weights.gg! https://www.weights.gg/shared/cm43ggokn1qqwogar8nn0oibe?inviteCode=4619f

wet fable Dec 2, 2024, 4:29 AM

#

Are there any accessible applications that could have the same effect as RVC?

blazing solar Dec 2, 2024, 5:22 AM

#

-cOLAB

azure marshBOT Dec 2, 2024, 5:22 AM

#

blazing solar -cOLAB

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

rare gobletBOT Dec 2, 2024, 5:22 AM

#

Ayo? @blazing solar level 1 !!! lfg

low shard Dec 2, 2024, 7:20 AM

#

wet fable Are there any accessible applications that could have the same effect as RVC?

wdym same effects as RVC

#

what’s ur pc gpu and what are u looking for?

wet fable Dec 2, 2024, 7:22 AM

#

low shard wdym same effects as RVC

Can do the same thing as RVC I mean

low shard Dec 2, 2024, 7:23 AM

#

wet fable Can do the same thing as RVC I mean

You can just use RVC

#

RVC is the best speech to speech program

low shard Dec 2, 2024, 7:23 AM

#

low shard what’s ur pc gpu and what are u looking for?

Reply to this

knotty moth Dec 2, 2024, 7:41 AM

#

wet fable Are there any accessible applications that could have the same effect as RVC?

do you mean any frontend applications using RVC? I'd recommend any of following:

#

-rvc

azure marshBOT Dec 2, 2024, 7:41 AM

#

knotty moth -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

shut goblet Dec 2, 2024, 7:44 AM

#

#

tf is a gradio

low shard Dec 2, 2024, 7:50 AM

#

shut goblet tf is a gradio

gradio is one of the most used framework for making webuis in AI projects

#

so a requirement for eddy’s UVR UI

#

by checking its github, https://github.com/Eddycrack864/UVR5-UI

GitHub

GitHub - Eddycrack864/UVR5-UI: Ultimate Vocal Remover 5 with Gradio...

Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models - Eddycrack864/UVR5-UI

#

you seem to be on windows, did you run UVR5-UI-installer.bat Without administrator?

shut goblet Dec 2, 2024, 7:54 AM

#

low shard you seem to be on windows, did you run UVR5-UI-installer.bat Without administrat...

Yes

low shard Dec 2, 2024, 7:58 AM

#

shut goblet Yes

Are you sure it’s in the C drive and there’s no special characters in whatever folder it is?

#

it doesn’t seem to be on the C drive

marsh schooner Dec 2, 2024, 9:44 AM

#

how much hours do yall think is too much hours on a data set to be used on a voice changer

#

like if i wanted to avoid it being overtrained what would i do

#

for the maximum

frosty coral Dec 2, 2024, 9:46 AM

#

why does rvc keep crashing whenever i click start audio conversion? (not responding)

brittle wing Dec 2, 2024, 9:47 AM

#

-hf

azure marshBOT Dec 2, 2024, 9:47 AM

#

brittle wing -hf

🤗 Huggingface Spaces

UVR5 UI, by Eddy and Ilaria Huggingface Spaces
Ilaria RVC Zero, by thestingerx Huggingface Spaces
RVC⚡ZERO, by r3gm Huggingface Spaces
Applio, by IA Hispano Huggingface Spaces
🆕 FaceFusion UI, by Nick088 Huggingface Spaces

flint solar Dec 2, 2024, 9:56 AM

#

frosty coral why does rvc keep crashing whenever i click start audio conversion? (not respond...

What are u using to run rvc

frosty coral Dec 2, 2024, 9:59 AM

#

as in gpu?

marsh schooner Dec 2, 2024, 10:03 AM

#

flint solar What are u using to run rvc

do u know that maximum dataset length a voice can have without being overtrained

flint solar Dec 2, 2024, 10:04 AM

#

marsh schooner do u know that maximum dataset length a voice can have without being overtrained

There is no constant length u can’t predict anything

#

But if I were to choose

#

I’d go for 7-10 min dataset of clean and diverse data

marsh schooner Dec 2, 2024, 10:04 AM

#

minutes???

flint solar Dec 2, 2024, 10:04 AM

#

marsh schooner minutes???

Yes

marsh schooner Dec 2, 2024, 10:04 AM

#

i was ab to go for like a 2-3 hours

#

dataset

flint solar Dec 2, 2024, 10:05 AM

#

marsh schooner i was ab to go for like a 2-3 hours

No dats bad

low shard Dec 2, 2024, 10:07 AM

#

frosty coral as in gpu?

program & gpu

#

be sure to NOT follow yt tutd

flint solar Dec 2, 2024, 10:08 AM

#

low shard be sure to NOT follow yt tutd

Someone need to put out a new one 😭

low shard Dec 2, 2024, 10:08 AM

#

flint solar Someone need to put out a new one 😭

I can’t do it for local rvc

flint solar Dec 2, 2024, 10:08 AM

#

Ik what he using

low shard Dec 2, 2024, 10:08 AM

#

Can’t people just read our guides

#

its not that hard to read

flint solar Dec 2, 2024, 10:09 AM

#

It’s the gui wit harvest and pm

#

The old ass one

low shard Dec 2, 2024, 10:09 AM

#

flint solar It’s the gui wit harvest and pm

smh

low shard Dec 2, 2024, 10:09 AM

#

flint solar The old ass one

smh

#

fuck old yt tuts

marsh schooner Dec 2, 2024, 10:14 AM

#

flint solar No dats bad

how is that bad

flint solar Dec 2, 2024, 10:17 AM

#

marsh schooner how is that bad

Dats what will most likely happen

knotty moth Dec 2, 2024, 10:23 AM

#

flint solar Dats what will most likely happen

it seems he resumed training with different batch size from before

flint solar Dec 2, 2024, 10:23 AM

#

knotty moth it seems he resumed training with different batch size from before

It was fucked before they even stopped training

knotty moth Dec 2, 2024, 10:25 AM

#

he also kept overtraining, perhaps till more than 1k epochs 💀

vale raptor Dec 2, 2024, 10:27 AM

#

hi I have problem, I tried to import voice model to gui but got this error:
size mismatch for enc_p.emb_phone.weight: copying a param with shape torch.Size([192, 768]) from checkpoint, the shape in current model is torch.Size([192, 256])

how can I fix it?

#

I'm sorry for my mistakes, english is not my native language

flint solar Dec 2, 2024, 10:32 AM

#

vale raptor hi I have problem, I tried to import voice model to gui but got this error: size...

What are u using to run rvc

#

locally or colab

vale raptor Dec 2, 2024, 10:32 AM

#

flint solar What are u using to run rvc

rvc gui from there https://github.com/Tiger14n/RVC-GUI?tab=readme-ov-file

marsh schooner Dec 2, 2024, 10:32 AM

#

knotty moth it seems he resumed training with different batch size from before

so 2-3 hours would be fine?

flint solar Dec 2, 2024, 10:33 AM

#

vale raptor rvc gui from there https://github.com/Tiger14n/RVC-GUI?tab=readme-ov-file

This is an architecture mismatch error

#

The gui ur using is very out of date

#

It’s only compatible with rvc v1 models

flint solar Dec 2, 2024, 10:35 AM

#

marsh schooner so 2-3 hours would be fine?

1 hour is my maximum length

vale raptor Dec 2, 2024, 10:35 AM

#

flint solar This is an architecture mismatch error

oh what have I use instead?

flint solar Dec 2, 2024, 10:36 AM

#

vale raptor oh what have I use instead?

-colab

azure marshBOT Dec 2, 2024, 10:36 AM

#

flint solar -colab

Suggestions for @vale raptor

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

flint solar Dec 2, 2024, 10:36 AM

#

vale raptor oh what have I use instead?

Use applio it’s the first link

vale raptor Dec 2, 2024, 10:37 AM

#

flint solar Use applio it’s the first link

thanks I'll try

flint solar Dec 2, 2024, 10:38 AM

#

vale raptor thanks I'll try

Let me know if something goes wrong

low shard Dec 2, 2024, 10:47 AM

#

flint solar -colab

it’s better to ask whats the user pc gpu first than giving colab

#

in case he has a good pc to run it on

flint solar Dec 2, 2024, 10:47 AM

#

low shard in case he has a good pc to run it on

True

low shard Dec 2, 2024, 10:47 AM

#

vale raptor oh what have I use instead?

what’s ur pc gpu?

low shard Dec 2, 2024, 10:48 AM

#

flint solar True

yea bc i seen people using colab while they got an rtx 3060, just because the helper gave them a colab

#

The user should always be asked what hardware they got so the helper can give them the tools to use

amber fjord Dec 2, 2024, 10:54 AM

#

flint solar True

Could you explain how to download it?

flint solar Dec 2, 2024, 10:55 AM

#

amber fjord Could you explain how to download it?

Download what

amber fjord Dec 2, 2024, 10:57 AM

#

azure marsh Suggestions for <@1177104427363090445>

he

vale raptor Dec 2, 2024, 10:57 AM

#

low shard what’s ur pc gpu?

rtx 2060s

flint solar Dec 2, 2024, 10:58 AM

#

vale raptor rtx 2060s

U can run rvc locally

#

No need to use colab

knotty moth Dec 2, 2024, 11:03 AM

#

vale raptor rvc gui from there https://github.com/Tiger14n/RVC-GUI?tab=readme-ov-file

-gui

azure marshBOT Dec 2, 2024, 11:03 AM

#

knotty moth -gui

https://cdn.discordapp.com/attachments/1122285248844144733/1203460490475343953/caption.gif?ex=65d12cec&is=65beb7ec&hm=bd2fb8d010006dd7c6e3c1c67d3ae846fd1478e1a3124c544c31b43086fe54aa&

low shard Dec 2, 2024, 11:08 AM

#

flint solar U can run rvc locally

see

#

that’s why every staff should always ask for the hardware first

low shard Dec 2, 2024, 11:09 AM

#

vale raptor rtx 2060s

your pc is good enough to run it locally (use it on ur pc), colab is just a cloud service (run it on remote good pc, for people who got bad pc)

#

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
Mainline: The original RVC

low shard Dec 2, 2024, 11:09 AM

#

amber fjord Could you explain how to download it?

elaborate whats ur pc gpu and what u want to do

amber fjord Dec 2, 2024, 11:10 AM

#

rtx 4060

rare gobletBOT Dec 2, 2024, 11:10 AM

#

Ayo? @amber fjord level 1 !!! lfg

low shard Dec 2, 2024, 11:11 AM

#

amber fjord rtx 4060

and what do u want to do

amber fjord Dec 2, 2024, 11:11 AM

#

low shard and what do u want to do

trolling my friends

low shard Dec 2, 2024, 11:11 AM

#

amber fjord trolling my friends

trolling can mean anything

#

im guessing you want realtime voice changer for calls?

amber fjord Dec 2, 2024, 11:12 AM

#

yeah

low shard Dec 2, 2024, 11:12 AM

#

Yea its the best u always be specific when asking for help

low shard Dec 2, 2024, 11:12 AM

#

amber fjord yeah

-rt

azure marshBOT Dec 2, 2024, 11:12 AM

#

low shard -rt

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

amber fjord Dec 2, 2024, 11:13 AM

#

?

#

me need click?

low shard Dec 2, 2024, 11:13 AM

#

amber fjord ?

1st link, its the wokada (program for using rvc , speech to speech, models in realtime for calls) fork (modified version, this one has better performance)

#

the 1st link is a guide to install it (which u will use the nvidia way as you got an rtx)

#

By reading the guide, you have everything you need to know, if you run into any issues, ask in #🔍│help-w-okada as this isn’t the correct help channel for that

amber fjord Dec 2, 2024, 11:14 AM

#

thank

low shard Dec 2, 2024, 11:15 AM

#

yw

vale raptor Dec 2, 2024, 11:21 AM

#

flint solar Let me know if something goes wrong

I'm sorry for silly questions but how can I add voice model

rare gobletBOT Dec 2, 2024, 11:21 AM

#

Ayo? @vale raptor level 2 !!! lfg

flint solar Dec 2, 2024, 11:23 AM

#

vale raptor I'm sorry for silly questions but how can I add voice model

Download tab then paste the huggingface model link

vale raptor Dec 2, 2024, 11:24 AM

#

I see

vale raptor Dec 2, 2024, 11:28 AM

#

flint solar Download tab then paste the huggingface model link

damn i did it I finally did it it's victory

#

thank you very much

simple ore Dec 2, 2024, 11:30 AM

#

flint solar It was fucked before they even stopped training

since the whole thing is in 35.3-36.4 range, the growing value is not a big deal. but the value itself is high, so yeah, likely just bad dataset

#

too much variety in the samples, i guess?

flint solar Dec 2, 2024, 12:08 PM

#

simple ore too much variety in the samples, i guess?

Too much of something is always bad not just in rvc trolley

flint solar Dec 2, 2024, 12:09 PM

#

simple ore since the whole thing is in 35.3-36.4 range, the growing value is not a big deal...

He trained it for way too long I think dats why the value is high

simple ore Dec 2, 2024, 12:12 PM

#

it started with 35 and ended with 36.4?...so training duration played little difference

#

I have an opposite problem

#

amber fjord Dec 2, 2024, 12:22 PM

#

what is it?

simple ore Dec 2, 2024, 12:23 PM

#

I wish people stopped placing projects into 'special' folders

simple ore Dec 2, 2024, 12:24 PM

#

amber fjord what is it?

move the vc folder to C:\vc

#

in short, programs like voice changer are not designed with the best Windows SDK guidelines. VC executed by you, without admin permissions, is trying to write stuff into semi-protected Program Files folder, which Windows does not allow, because the software should utilize a user folder instead

flint solar Dec 2, 2024, 12:33 PM

#

simple ore

I never really understood what the fm graph is good for

opal verge Dec 2, 2024, 12:34 PM

#

hey im using google RVC colab added index didnt crated ); everthing eles goes well what shuold i do?

low shard Dec 2, 2024, 12:35 PM

#

opal verge hey im using google RVC colab added index didnt crated ); everthing eles goes w...

elaborate more, which colab and whats the error when training the index?

opal verge Dec 2, 2024, 12:35 PM

#

rvc V2

simple ore Dec 2, 2024, 12:36 PM

#

flint solar I never really understood what the fm graph is good for

there's a comparison between original slice of audio and a generated slice of audio, the distriminator (the teacher in GAN algorithm) compares them together by looking thru several filters and calculates a mean difference

#

since it does not compare the exact values, it is possible that one part of the generated audio gets better while the other gets worse, but the average difference stays about the same

amber fjord Dec 2, 2024, 12:38 PM

#

maybe I don't understand something, but no matter where I put it, the program doesn't work for me

flint solar Dec 2, 2024, 12:39 PM

#

simple ore since it does not compare the exact values, it is possible that one part of the ...

How do I know if the model is improving or not thru feature matching

simple ore Dec 2, 2024, 12:41 PM

#

amber fjord maybe I don't understand something, but no matter where I put it, the program do...

👉 https://discord.com/channels/1159260121998827560/1159290161683767298

simple ore Dec 2, 2024, 12:42 PM

#

flint solar How do I know if the model is improving or not thru feature matching

generally the difference between real and fake audio, and thus fm value, should be going down

flint solar Dec 2, 2024, 12:43 PM

#

simple ore generally the difference between real and fake audio, and thus fm value, should...

My fm be going down sometimes

#

Actually most of the time

simple ore Dec 2, 2024, 12:43 PM

#

here I had a model try to produce a single sample

#

#

you can hear the difference and the FM chart reflects that

flint solar Dec 2, 2024, 12:44 PM

#

flint solar My fm be going down sometimes

I meant up** skullsob

flint solar Dec 2, 2024, 12:46 PM

#

simple ore you can hear the difference and the FM chart reflects that

What are the factors that influence fm

low shard Dec 2, 2024, 12:46 PM

#

opal verge rvc V2

which colab? Send link and show the error

simple ore Dec 2, 2024, 12:46 PM

#

yeah, best case I see something like

low shard Dec 2, 2024, 12:47 PM

#

simple ore 👉 https://discord.com/channels/1159260121998827560/1159290161683767298

real, so many people still use this channel for wokada

simple ore Dec 2, 2024, 12:49 PM

#

cuz someone needs to rename the channel 🙂

#

help-rvc (not VC!)

#

help-w-okada (the VC!)

flint solar Dec 2, 2024, 12:50 PM

#

simple ore yeah, best case I see something like

Yeah dats what I be getting too

#

I thought it was bad

simple ore Dec 2, 2024, 12:52 PM

#

I think the value going up is expected as the model figures out the parameters to use, but it should stabilize and settle at some value or around it without much deviation

#

like that 'cement' example above

#

the discriminator used in RVC is not stellar, so in most cases the model just finds some local minima for parameters and settles around that

#

and it does not get out of that hole no matter how much more you train it

#

there's a chance that fm going up is just that hump on the chart after which the value would go down once the model learns to reproduce the audio better, but I've yet too train for that long to see it

flint solar Dec 2, 2024, 1:02 PM

#

simple ore the discriminator used in RVC is not stellar, so in most cases the model just fi...

I guess improving the discriminator is the next move

simple ore Dec 2, 2024, 1:03 PM

#

that would require a whole new set of pretrains

#

I've tested a new loss function that does not require much change, seems to be doing better

flint solar Dec 2, 2024, 1:07 PM

#

simple ore I've tested a new loss function that does not require much change, seems to be d...

When is it releasing

#

trolley

fallen grotto Dec 2, 2024, 1:17 PM

#

does anyone have any colab i can use to make ai covers

rare gobletBOT Dec 2, 2024, 1:17 PM

#

Ayo? @fallen grotto level 2 !!! lfg

flint solar Dec 2, 2024, 1:17 PM

#

fallen grotto does anyone have any colab i can use to make ai covers

-colab

azure marshBOT Dec 2, 2024, 1:17 PM

#

flint solar -colab

Suggestions for @fallen grotto

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

flint solar Dec 2, 2024, 1:17 PM

#

Use applio

fallen grotto Dec 2, 2024, 1:17 PM

#

thanks ho

low shard Dec 2, 2024, 1:30 PM

#

simple ore help-rvc (not VC!)

😭

azure marshBOT Dec 2, 2024, 1:47 PM

#

Not available yet

grim bay Dec 2, 2024, 2:27 PM

#

Any order of model to use to extract vocals from a song to make AI cover?
I'm both using MVSEP and UVR

simple ore Dec 2, 2024, 3:04 PM

#

flint solar When is it releasing

i have a branch created for applio 3.2.7

rare bear Dec 2, 2024, 3:10 PM

#

Sorry if this is the wrong place to ask this question but I'm brand new to AI voices and what can be done with them.
I was wondering if the w-okada voice changer is the only 'app' available for real-time changing of voices on a Mac or are there any other apps I could try RVC files out on a Mac with? Thanks.

knotty moth Dec 2, 2024, 3:12 PM

#

rare bear Sorry if this is the wrong place to ask this question but I'm brand new to AI vo...

read the pinned guide there: #🔍│help-w-okada

rare bear Dec 2, 2024, 3:14 PM

#

knotty moth read the pinned guide there: <#1159290161683767298>

So basically w-okada is the only way to use RVC voices in real-time on a Mac then? There aren't any other apps that can be used instead?

flint solar Dec 2, 2024, 3:19 PM

#

simple ore i have a branch created for applio 3.2.7

Can u link it

simple ore Dec 2, 2024, 3:21 PM

#

https://github.com/IAHispano/Applio/pull/895

knotty moth Dec 2, 2024, 3:23 PM

#

rare bear So basically w-okada is the only way to use RVC voices in real-time on a Mac the...

nope, not even voice.ai

rare bear Dec 2, 2024, 3:24 PM

#

knotty moth nope, not even voice.ai

Okay thanks. Seems rather strange but it is what it is I guess.

rare gobletBOT Dec 2, 2024, 3:24 PM

#

Ayo? @rare bear level 1 !!! lfg

low shard Dec 2, 2024, 3:47 PM

#

rare bear So basically w-okada is the only way to use RVC voices in real-time on a Mac the...

W-okada is the best lfg

rare bear Dec 2, 2024, 3:51 PM

#

low shard W-okada is the best <:lfg:1159355870119993496>

Well from what I've been told so far it's the only one so would be the best wouldn't it? 😉

low shard Dec 2, 2024, 3:51 PM

#

lol

flint solar Dec 2, 2024, 4:29 PM

#

knotty moth nope, not even voice.ai

voice ai so ass bro omg

low shard Dec 2, 2024, 4:38 PM

#

flint solar voice ai so ass bro omg

🙏

inner cloak Dec 2, 2024, 6:40 PM

#

When I try to start Applio (colab), I am getting this:

An error occurred connecting to Discord: Could not find Discord installed and running on this machine.
Traceback (most recent call last):
File "/content/program_ml/app.py", line 90, in <module>
inference_tab()
File "/content/program_ml/tabs/inference/inference.py", line 418, in inference_tab
choices=get_speakers_id(model_file.value),
File "/content/program_ml/tabs/inference/inference.py", line 325, in get_speakers_id
model_data = torch.load(model, map_location="cpu")
File "/usr/local/lib/python3.10/dist-packages/torch/serialization.py", line 1004, in load
with _open_zipfile_reader(opened_file) as opened_zipfile:
File "/usr/local/lib/python3.10/dist-packages/torch/serialization.py", line 456, in init
super().init(torch._C.PyTorchFileReader(name_or_buffer))
OSError: [Errno 22] Invalid argument

It just stays like this with no link to gradio. Anyone else?

#

It was working last night, so I am guessing it needs to be fixed. Edit: nvm

marsh schooner Dec 2, 2024, 6:42 PM

#

how do i train 2 datasets at once

#

im on a 4090

shut goblet Dec 2, 2024, 7:41 PM

#

how do i fix this

#

why must you be this way

rare gobletBOT Dec 2, 2024, 7:45 PM

#

Ayo? @shut goblet level 3 !!! lfg

simple ore Dec 2, 2024, 7:51 PM

#

if I had to guess, some translation is messed up

spice wing Dec 2, 2024, 7:56 PM

#

Where can i find Voice models for AllTalk Ai?
pth files dont work there

shut goblet Dec 2, 2024, 7:58 PM

#

simple ore if I had to guess, some translation is messed up

i dont understand what this means skullsob

unique rock Dec 2, 2024, 8:10 PM

#

Hey, I need to remove the breaths from my dataset, for example: the breaths when a singer is about to sing du verse and takes a breath or releases it when he finishes singing.

low shard Dec 2, 2024, 8:27 PM

#

spice wing Where can i find Voice models for AllTalk Ai? pth files dont work there

There aren't here
80% of the models are RVC (Best few shots Speech To Speech)
20% GPT-SoVITS (Best few shots Text To Speech)

low shard Dec 2, 2024, 8:28 PM

#

shut goblet how do i fix this

Be sure to put it in the C drive

#

also @viscid moss

wild yoke Dec 2, 2024, 8:46 PM

#

Why is .wav not recognized as a format?

simple ore Dec 2, 2024, 9:17 PM

#

shut goblet i dont understand what this means <:skullsob:1159372531992645662>

i18n is a package that is used to translate UI to different languages

simple ore Dec 2, 2024, 9:17 PM

#

wild yoke Why is .wav not recognized as a format?

not an actual .wav maybe?

marsh schooner Dec 2, 2024, 11:00 PM

#

what does uvr denoise even do

#

i feel like it does nothing

viscid moss Dec 2, 2024, 11:05 PM

#

shut goblet how do i fix this

Did you solve it?

#

I thought that bug was fixed. Anyway, how to fix that is described here

GitHub

UVR5-UI/info/troubleshooting.md at main · Eddycrack864/UVR5-UI

Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models - Eddycrack864/UVR5-UI

viscid moss Dec 2, 2024, 11:10 PM

#

low shard also <@274566299349155851>

Thanks for lmk Nick

oak hearth Dec 2, 2024, 11:17 PM

#

What are pretrains? If I want to clone a voice, should I use a pretrain or spend more time making it myself? What will be more quality?

rare gobletBOT Dec 2, 2024, 11:17 PM

#

Ayo? @oak hearth level 3 !!! lfg

oak hearth Dec 2, 2024, 11:17 PM

#

Are pretrains just faster and better and should i use them to make the most lifelike realistic clone?

analog obsidian Dec 2, 2024, 11:41 PM

#

oak hearth What are pretrains? If I want to clone a voice, should I use a pretrain or spend...

pretrains are pre-made models trained on days worth of audio (in rvc case)
making them from scratch is a very hard process even for people that have the knowledge
always use a pretrain model when training on rvc
their purpose is to have a baseline during the finetuning process (when you are training a model)
They can affect the final result quality since audio upscaling is involved
If you train without a pretrain your model is going to sound like shit because rvc has no prior knowledge of sounds

analog obsidian Dec 2, 2024, 11:44 PM

#

oak hearth Are pretrains just faster and better and should i use them to make the most life...

always use the original pretrain as the custom made pretrains dont reconstruct your dataset frequencies as good as the original

#

for a realistic model train an expressive non monotone dataset of 30 minutes and above

the more data, the better and realistic

be sure your dataset has variety, for example avoid monotone dialogue or audios repeating the same sentence/words etc

coral frigate Dec 3, 2024, 12:02 AM

#

Can someone give me advice? I’m training a model with 15 hours of data and I don’t want to use a pretrain on it. Which one should I select? Custom makes me add my own pretrain I think but I’m not sure.

rare gobletBOT Dec 3, 2024, 12:02 AM

#

Ayo? @coral frigate level 5 !!! lfg

analog obsidian Dec 3, 2024, 12:03 AM

#

coral frigate Can someone give me advice? I’m training a model with 15 hours of data and I don...

15 hours is too short for a pretrain

#

aim for the amount of time as vctk dataset

crude flame Dec 3, 2024, 12:04 AM

#

coral frigate Can someone give me advice? I’m training a model with 15 hours of data and I don...

are you making a from scratch pretrain or a finetune

coral frigate Dec 3, 2024, 12:04 AM

#

I’m not aiming to make a pretrain. I just want to make a voice model with my dataset but I don’t want to use any of the pretrains applio offers

analog obsidian Dec 3, 2024, 12:04 AM

#

coral frigate I’m not aiming to make a pretrain. I just want to make a voice model with my dat...

what i said still applies

coral frigate Dec 3, 2024, 12:04 AM

#

If that makes sense

analog obsidian Dec 3, 2024, 12:04 AM

#

rvc will not be able to give you a good result

#

use a pretrain instead

#

when you remove a pretrain you remove the knowledge of sounds

coral frigate Dec 3, 2024, 12:06 AM

#

Im still new to making models but everytime I’ve used a pretrain, the voice has sounded strange and very off. Is there a way I can fix that then if I have to use a pretrain?

crude flame Dec 3, 2024, 12:06 AM

#

coral frigate I’m not aiming to make a pretrain. I just want to make a voice model with my dat...

de-select this

analog obsidian Dec 3, 2024, 12:07 AM

#

coral frigate Im still new to making models but everytime I’ve used a pretrain, the voice has ...

you've used a pretrain with this exact dataset?

coral frigate Dec 3, 2024, 12:08 AM

#

Yes with 150 epoch. It sounds nothing like the dataset and just sounds really off. It has been the case whenever I use a pretrain. In this case I used the contentvec pretrain.

analog obsidian Dec 3, 2024, 12:08 AM

#

coral frigate Yes with 150 epoch. It sounds nothing like the dataset and just sounds really of...

contentvec is not a pretrained model

#

is an embedder

crude flame Dec 3, 2024, 12:09 AM

#

coral frigate Yes with 150 epoch. It sounds nothing like the dataset and just sounds really of...

contentvec is for feature extraction

analog obsidian Dec 3, 2024, 12:09 AM

#

for your huge dataset of 15 hours you can try batch 16, and it should take 1 or 2 days to give good results with a pretrain

coral frigate Dec 3, 2024, 12:10 AM

#

crude flame contentvec is for feature extraction

Ow I assumed that they were all just pretrains.

coral frigate Dec 3, 2024, 12:10 AM

#

analog obsidian for your huge dataset of 15 hours you can try batch 16, and it should take 1 or ...

Can you suggest which pretrain I should use from the list for an English model?

analog obsidian Dec 3, 2024, 12:11 AM

#

coral frigate Can you suggest which pretrain I should use from the list for an English model?

original pretrain

coral frigate Dec 3, 2024, 12:11 AM

#

Sorry if this is a dumb question but How do I use an original pretrain?

crude flame Dec 3, 2024, 12:12 AM

#

coral frigate Sorry if this is a dumb question but How do I use an original pretrain?

dont touch this

analog obsidian Dec 3, 2024, 12:12 AM

#

coral frigate Sorry if this is a dumb question but How do I use an original pretrain?

are you on applio? simply don't change anything related to pretrains, it'll use the original

#

yea that

#

leave it like that

coral frigate Dec 3, 2024, 12:13 AM

#

Ow ok I just keep that unchecked and I’m good to go?

coral frigate Dec 3, 2024, 12:13 AM

#

analog obsidian are you on applio? simply don't change anything related to pretrains, it'll use ...

And yes I’m on applio

analog obsidian Dec 3, 2024, 12:13 AM

#

coral frigate Ow ok I just keep that unchecked and I’m good to go?

yep

coral frigate Dec 3, 2024, 12:13 AM

#

Ok thank you so much guys

coral frigate Dec 3, 2024, 12:16 AM

#

coral frigate Can someone give me advice? I’m training a model with 15 hours of data and I don...

So would it matter which one of these I select if I’m unchecking the box anyway?

crude flame Dec 3, 2024, 12:16 AM

#

coral frigate So would it matter which one of these I select if I’m unchecking the box anyway?

leave it on contentvec

coral frigate Dec 3, 2024, 12:16 AM

#

Ok thank you

knotty moth Dec 3, 2024, 12:40 AM

#

coral frigate I’m not aiming to make a pretrain. I just want to make a voice model with my dat...

you dont need shitton hours of a single speaker dataset from 3k voice recordings like this https://www.techspot.com/news/105764-panasonic-resurrects-long-dead-founder-ai-share-management.html

30m-2h can already produce good results

TechSpot

Panasonic resurrects long-dead founder as an AI to share his manage...

coral frigate Dec 3, 2024, 12:42 AM

#

knotty moth you dont need shitton hours of a single speaker dataset from 3k voice recordings...

I was just told when I started that the bigger the dataset the t he better

knotty moth Dec 3, 2024, 12:46 AM

#

coral frigate I was just told when I started that the bigger the dataset the t he better

do you even have enough time and effort to clean the massive dataset lmao

#

remember: quality > quantity

analog obsidian Dec 3, 2024, 12:51 AM

#

the more the better but i agree 15 hours is too much 😭
also you have to be sure your dataset has no possible sounds that could cause problems later

coral frigate Dec 3, 2024, 12:51 AM

#

Yeah sadly I did have to listen through all 15 hours like a podcast

analog obsidian Dec 3, 2024, 12:52 AM

#

i mean technically a 15 hour dataset is better than a 30 min or an 1 hour one
but rvc already becomes realistic at the 30 minute mark

#

technically speaking is not bad
but is too much effort

coral frigate Dec 3, 2024, 12:53 AM

#

Idk I’m dumb and still figuring rvc out. I’ve actually been trying to train this for 3 weeks now but applio hasn’t been able to train it. Just keep getting an error

analog obsidian Dec 3, 2024, 12:54 AM

#

coral frigate Idk I’m dumb and still figuring rvc out. I’ve actually been trying to train this...

for realistic purposes i recommend using a dataset of 1 hour

#

max 2 hours
higher than that is not bad, but not worth the effort

covert axle Dec 3, 2024, 12:54 AM

#

-colab

azure marshBOT Dec 3, 2024, 12:54 AM

#

covert axle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

coral frigate Dec 3, 2024, 12:55 AM

#

analog obsidian max 2 hours higher than that is not bad, but not worth the effort

Would’ve been nice to know before going through hours and hours of data

analog obsidian Dec 3, 2024, 12:56 AM

#

coral frigate Would’ve been nice to know before going through hours and hours of data

yea... we tell people the more the better because most believe a model is going to sound realistic at 5 minutes of data

knotty moth Dec 3, 2024, 12:58 AM

#

coral frigate Yeah sadly I did have to listen through all 15 hours like a podcast

surely the cleaning effort could take more than double the audio duration itself, unless you hire a few freelancers (that still waste of money)

coral frigate Dec 3, 2024, 12:58 AM

#

analog obsidian yea... we tell people the more the better because most believe a model is going ...

Is it normal for feature extraction to take 10-15 minutes? I’m trying to troubleshoot why I still can’t train the model

analog obsidian Dec 3, 2024, 12:58 AM

#

coral frigate Is it normal for feature extraction to take 10-15 minutes? I’m trying to trouble...

yes because your dataset is 15 hours

#

@knotty moth is rmvpe on applio using gpu now?

#

or they're still using rmvpe cpu

#

if applio is still using rmvpe cpu, is going to be super slow

#

if applio is using rmvpe gpu it should be done in 30 mins approx

analog obsidian Dec 3, 2024, 1:00 AM

#

knotty moth surely the cleaning effort could take more than double the audio duration itself...

what i can say about this is at least his 15 hour model will be able to do much more than a 1 or 2 hour one

#

so the effort will at least be worth a little bit

analog obsidian Dec 3, 2024, 1:02 AM

#

analog obsidian what i can say about this is at least his 15 hour model will be able to do much ...

but still, rvc is really limited because hifigan and bad code

coral frigate Dec 3, 2024, 1:03 AM

#

ive been getting this error in the cmd for a month and im probably doing something dumb thats failing the extraction. have any suggestions on how else i can trouble shoot it?

analog obsidian Dec 3, 2024, 1:03 AM

#

coral frigate ive been getting this error in the cmd for a month and im probably doing somethi...

reduce cpu cores to 4

knotty moth Dec 3, 2024, 1:04 AM

#

analog obsidian <@681186927151546397> is rmvpe on applio using gpu now?

not sure but it still has some gpu usage

coral frigate Dec 3, 2024, 1:04 AM

#

ill try that

analog obsidian Dec 3, 2024, 1:04 AM

#

knotty moth not sure but it still has some gpu usage

hmm he got oom so i suppose is using rmvpe gpu

coral frigate Dec 3, 2024, 1:05 AM

#

analog obsidian reduce cpu cores to 4

do i use 4 for both preprocess and extract?

analog obsidian Dec 3, 2024, 1:06 AM

#

coral frigate do i use 4 for both preprocess and extract?

yep, it doesnt really a speed difference, or at least i havent noticed it

#

maybe it has one but is very marginal

knotty moth Dec 3, 2024, 1:07 AM

#

analog obsidian hmm he got oom so i suppose is using rmvpe gpu

I only recall oom happens when inferring a big whole audio depending on ram

analog obsidian Dec 3, 2024, 1:07 AM

#

knotty moth I only recall oom happens when inferring a big whole audio depending on ram

oom also happens when you're using too much cpu cores during feature extraction

#

staring at crepe

knotty moth Dec 3, 2024, 1:08 AM

#

analog obsidian oom also happens when you're using too much cpu cores during feature extraction

not sure, it simply hangs the system for a while and then crashes

analog obsidian Dec 3, 2024, 1:09 AM

#

knotty moth not sure, it simply hangs the system for a while and then crashes

it gave me a bsod once 😭

#

crashed that bad

coral frigate Dec 3, 2024, 1:18 AM

#

analog obsidian reduce cpu cores to 4

It’s in the middle of extracting all the audio files but as its going in seeing that it still gives the same error line for each one

analog obsidian Dec 3, 2024, 1:19 AM

#

coral frigate It’s in the middle of extracting all the audio files but as its going in seeing ...

uh.... are you sure you reduced the cpu cores during feature extraction?
whats going on is that your system is getting out of vram during the feature extraction

coral frigate Dec 3, 2024, 1:19 AM

#

Yh it’s at 4 now for both

#

Could it be because I have 33 files in the dataset folder maybe?

analog obsidian Dec 3, 2024, 1:21 AM

#

coral frigate Could it be because I have 33 files in the dataset folder maybe?

i dont think so, first process is the preprocessing, this slices your audios since hifigan does not work in long audios
after that feature extraction, extracts the features of the sliced audios

#

so is doing it on 3s samples

#

it goes out of vram when is doing a lot of them

#

at the same time

coral frigate Dec 3, 2024, 1:22 AM

#

This has been an issue for a while now. Idk what else I can do to fix the issue

analog obsidian Dec 3, 2024, 1:22 AM

#

coral frigate This has been an issue for a while now. Idk what else I can do to fix the issue

last resort is to try using 2 cpu cores instead of 4
if that doesn't work im lost sorry, have never faced an issue like that on applio before

#

might be related to applio only

coral frigate Dec 3, 2024, 1:24 AM

#

analog obsidian last resort is to try using 2 cpu cores instead of 4 if that doesn't work im los...

and nothing is wrong in this section?

analog obsidian Dec 3, 2024, 1:25 AM

#

coral frigate and nothing is wrong in this section?

nope, looks fine to me

coral frigate Dec 3, 2024, 1:25 AM

#

Well then I’m doomed

analog obsidian Dec 3, 2024, 1:26 AM

#

coral frigate Well then I’m doomed

cpu cores to 2 also shouldn't affect the speed quality
since it looks applio is finally using rmvpe gpu
like u could try that

#

cpu cores only affect feature speed when you're using an cpu based extractor

coral frigate Dec 3, 2024, 1:28 AM

#

I’ll try that but I don’t think it’ll change much seeing that my cpu should’ve been able to handle 4

#

Maybe redownloading applio will do something

analog obsidian Dec 3, 2024, 1:28 AM

#

coral frigate I’ll try that but I don’t think it’ll change much seeing that my cpu should’ve b...

your cpu is not being used here, your gpu is

#

the gpu is doing the extractor

#

hence why it gets out of vram

#

in older versions of applio rmvpe was actually cpu based

#

but is very slow compared to rmvpe gpu

coral frigate Dec 3, 2024, 1:29 AM

#

Even then my gpu should’ve been able to handle 4 without vram issues. I’ll try redownloading applio and hope that fixes something

analog obsidian Dec 3, 2024, 1:30 AM

#

yeah exactly it should be able to do it, i find weird its getting out of vram

#

i hope redownloading fixes the issue

coral frigate Dec 3, 2024, 1:31 AM

#

Thanks. I hope so

rare gobletBOT Dec 3, 2024, 1:31 AM

#

Ayo? @coral frigate level 6 !!! lfg

coral frigate Dec 3, 2024, 1:31 AM

#

I’m out of options after that

glacial pollen Dec 3, 2024, 1:33 AM

#

coral frigate Could it be because I have 33 files in the dataset folder maybe?

Limit each file's length to 10 mins ( maybe 15 )

#

use 4 or 8 threads and switch to rmvpe ( non gpu variant )

#

btw, would be helpful if you provided info on your amount of ram, ( and vram if you actually did use rmvpe gpu variant )
Also, total length of the set and length per file ( can be avg )

analog obsidian Dec 3, 2024, 1:36 AM

#

glacial pollen btw, would be helpful if you provided info on your amount of ram, ( and vram if ...

is applio using rmvpe gpu?

#

is weird he's getting oom with a 4090

glacial pollen Dec 3, 2024, 1:36 AM

#

analog obsidian is applio using rmvpe gpu?

not sure as I don't use applio
perhaps it's a 2 in 1 ?

#

in any case, 4090 shouldn't have any ooms like that

analog obsidian Dec 3, 2024, 1:37 AM

#

glacial pollen Limit each file's length to 10 mins ( maybe 15 )

its a 15 hour dataset
he is using rvc slicer to slice those files

glacial pollen Dec 3, 2024, 1:37 AM

#

only reasonable way out of it is.. the extraction's done on cpu and / or per-file length

#

oh

#

That's an overkill

#

what's the length per file? ( as I assume it's not big sample but split

coral frigate Dec 3, 2024, 1:37 AM

#

glacial pollen use 4 or 8 threads and switch to rmvpe ( non gpu variant )

How Do I switch to the none gpu variant? I have 32gb ram and the file lengths vary from 20 mins to the longest being 50 minutes

glacial pollen Dec 3, 2024, 1:38 AM

#

coral frigate How Do I switch to the none gpu variant? I have 32gb ram and the file lengths va...

ye that's the thing

#

above 20 or maybe 25 ( but I'd stick to 20 per file ) mins, it can cause issues

analog obsidian Dec 3, 2024, 1:38 AM

#

oh but even if he lets the slicer to do it?

glacial pollen Dec 3, 2024, 1:38 AM

#

oh huh

#

lemme think

analog obsidian Dec 3, 2024, 1:38 AM

#

yea this is new to me too xD

glacial pollen Dec 3, 2024, 1:39 AM

#

oh yea

#

that can surely be the case cause, 1 set where extraction is loading 15 hours of data vs sequential 1 by 1 sid processing ( even 40 or 100 hours )

#

is different

analog obsidian Dec 3, 2024, 1:39 AM

#

interesting, rvc always giving surprises

glacial pollen Dec 3, 2024, 1:40 AM

#

or so I suspect at least

#

there's no other way out of it

#

other than applio being jammed ( wouldn't be surprised at this point

analog obsidian Dec 3, 2024, 1:40 AM

#

glacial pollen other than applio being jammed ( wouldn't be surprised at this point

i believe this might be the problem actually lmao

glacial pollen Dec 3, 2024, 1:41 AM

#

lemme prepare something

#

might have a 'temp' solution

#

@coral frigate Amount of threads you have?

#

I'll assume you can afford to use 8

coral frigate Dec 3, 2024, 1:45 AM

#

How do I check? But I definitely would

glacial pollen Dec 3, 2024, 1:45 AM

#

that's alright then, gimme a sec

knotty moth Dec 3, 2024, 1:47 AM

#

analog obsidian is weird he's getting oom with a 4090

have we known his cpu? if it's intel 13/14th, could be its degradation issue

coral frigate Dec 3, 2024, 1:48 AM

#

Ryzen 9 9950x

glacial pollen Dec 3, 2024, 1:49 AM

#

okay so

#

now the question is, are you willing to spend a lil bit of time on that?

knotty moth Dec 3, 2024, 1:50 AM

#

coral frigate Ryzen 9 9950x

check your cpu & gpu stability, shouldnt be overvolted

coral frigate Dec 3, 2024, 1:50 AM

#

glacial pollen now the question is, are you willing to spend a lil bit of time on that?

Sure on what?

glacial pollen Dec 3, 2024, 1:51 AM

#

@coral frigate Cause basically, you'd have try 2 things

Try to feature extract on single 30-50 min file
If that failed, you'd have to use another fork or just, mainline for the sake of extraction ( and then move stuff back to applio ) (( that'd confirm whether applio is the issue or just the situation itself isn't favored by rvc in general ))
If that failed too.. rip, in that case it'd mean rvc's not optimized for single speaker 15h at once extraction

coral frigate Dec 3, 2024, 1:54 AM

#

Ok how do I move the feature extract back into applio if that’s the issue ?

#✨│ai-help

AI HUB Docs

🍏 Applio Docs

How To Troubleshoot

AI HUB Docs

🍏 Applio Docs