#✨│ai-help

coarse heron
#

thanks again, Deiteris is indeed a lot better and more stable than okada.

marsh schooner
#

does someone have an attractive female voice model, free or something i can pay for, idc. im tired of making datasets that turn out mid

azure osprey
#

Could this be why it doesn't want to work? It's happened every time and I probably should have said so from the start, but it happens every time I try a fresh install

#

Some of them just get stuck and don't download and it's not as if there's a "Retry" button

#

As someone who prides themselves on being able to usually fix a problem with a computer this has got me brain burnt

#

Wouldn't be so bad if it didn't take upwards of 20 minutes to complete

#

the ones that aren't above 19 minutes are frozen 🙃

mental dawn
#

What would you all say is important when trying to make a real-time RVC that can keep up with someone whose pitch often changes between 75 and 300 fx? Trying to develop a model that can keep up with someone who has that kind of speech pattern.

knotty moth
azure osprey
#

I think I got it to work!

rare gobletBOT
#

Ayo? @azure osprey level 2 !!! lfg

azure osprey
#

As of right now, disregard my crying

knotty moth
#

if the input audio is too short, try extending it to at least around 10-20 sec

unique rock
#

How do I make my model sound good? For example, before saying a phrase or part of a song, there is a type of breathing, right? So I want this not to sound too robotic, and I train my models without this type of breathing, just the voice. What do you recommend?

azure osprey
#

Now it crashes whenever I try a custom voice, lemme watch a few more tutorials before I post anything more about this issue

#

This shit just doesn't want to be easy whygod

azure osprey
#

Checking around it doesn't seem like I'm doing anything wrong but getting an error code and crash when I use any custom voice

#

probably leaked some important info there idk, don't really care

azure osprey
#

please ping me if you have a fix

#

or any ideas

latent kettle
azure osprey
#

I was connected to the internet ;-;

latent kettle
#

Then try to delete user settings. And restart it again

azure osprey
#

I did that too, but I'll try it again

latent kettle
#

If not working re-extract it. And try to launch it again

#

Delete the old one too

azure osprey
#

whygod Alright, I'll do it again

rare gobletBOT
#

Ayo? @azure osprey level 3 !!! lfg

azure osprey
#

This was the first time the installation went off without a hitch

#

But I'll do it again

latent kettle
azure osprey
#

Ah, will do, I asked here cuz it had to do with voices, my bad

latent kettle
#

@pastel oak is a great helper

azure osprey
#

He was helping me earlier, great guy

latent kettle
azure osprey
#

no...

#

I figured the part he was helping me with out on my own

#

But it's no fault of his

latent kettle
#

Okay then re install it

azure osprey
#

Yeah doing that now

#

This thing has fought me the entire way lmfao

#

I appreciate your help btw

#

Alright all done and connected, default voices work, I'm going to try a custom one now

#

Same thing

#

Crashes on Custom Voice

brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

magic musk
#

why cant i go over 1000 epoch when trying to train??

#

like i stopped, reentered my g&d but i still cannot go over 1000 epoch

#

like it detects its already 1000 and stops

knotty moth
#

though practically most models will be likely to overtrain in less than 1000 epochs

low shard
#

i highly suggest u not to do it

#

more epochs dont mean more quality

#

use the tensorboard

low shard
#

also be sure to not use yt tuts

#

-rt

azure marshBOT
low shard
#

be sure to use only the 1st link, the w-okada fork

low shard
#

realtime voice changer for calls?

#

-rt

azure marshBOT
low shard
#

1st link, w-okada fork

low shard
#

this channel got really messy tbh

azure osprey
low shard
#

oop very weird, thought u had some pre-historical thing

pastel oak
azure osprey
#

I got them to work now, the new issue is posted in the okada channel

pastel oak
#

Okok

azure osprey
#

I do appreciate your help though

wary apex
#

can someone tell me how to use it on Mac

latent kettle
low shard
magic musk
#

what file should i use?

#

like, to import my model in the voice changer

#

i guess the added_... as the index

#

but what pth?

simple ore
#

pth is the voice model

#

index is a cherry on top
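
To make the file question concrete, here is a minimal sketch for listing the candidate files in a training output folder. It assumes the voice model is a `.pth` file and the index is the file whose name starts with `added_`, as mentioned in the chat; the function name and the flat folder layout are hypothetical and not part of any RVC tool.

```python
from pathlib import Path


def find_model_files(folder):
    """Return (pth_files, index_files) found directly inside `folder`.

    Assumption: the index to load is the one named "added_*.index".
    The function name and flat folder layout are illustrative only.
    """
    root = Path(folder)
    pth_files = sorted(p.name for p in root.glob("*.pth"))
    index_files = sorted(p.name for p in root.glob("added_*.index"))
    return pth_files, index_files
```

The `.pth` is the part the voice changer cannot do without; the index is optional, so an empty second list is not necessarily an error.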

simple ore
#

trying to load sovits model?

#

or v1 pretrain into v2 training

#

768 is the number of channels in rvc v2 model

#

256 is in v1
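
A small sketch of the 768 vs 256 point, assuming you already know the feature width of the model and of the pretrain you are pairing it with. The helper names are hypothetical; only the two numbers come from the messages above.

```python
# Feature widths quoted in the chat: 256 for RVC v1, 768 for RVC v2.
FEATURE_DIM_TO_VERSION = {256: "v1", 768: "v2"}


def guess_rvc_version(feature_dim):
    """Map a feature width to an RVC version label, or raise if unknown."""
    try:
        return FEATURE_DIM_TO_VERSION[feature_dim]
    except KeyError:
        raise ValueError(f"unexpected feature width: {feature_dim}") from None


def check_compatible(model_dim, pretrain_dim):
    """True when the model and the pretrain use the same feature width."""
    return model_dim == pretrain_dim
```

Mixing a 256-wide v1 pretrain into a 768-wide v2 training run is the kind of mismatch this check would flag before anything is loaded.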

#

what are you trying to do?

#

using a 2-year-old RVC app or something?

rare gobletBOT
#

Ayo? @limpid cradle level 1 !!! lfg

simple ore
#

v1 model should work for inference

#

at least Applio supports both v1 and v2

#

dunno about mainline

#

-rvc

azure marshBOT
simple ore
#

err

lavish escarp
#

-colab

distant turtle
#

-colab

brisk nova
#

what do i need to do to run RVC

#

it always closes itself when i try to open it

#

i open the other file

#

and its says that

#

help?

coral frigate
#

Does anyone have any recommendations where I can find more pretrains online? Besides the pretrain section on the server. I can’t seem to find any online

pastel oak
#

pretrains are not recommended anyway

glacial pollen
#

No idea where you took that info from

glacial pollen
#

erhm

#

Try to guess ( date )

#

You wanna pick either original rvc ( mainline ) or Applio

brisk nova
#

soo its not released?

glacial pollen
#

no, it is just simply outdated lol

brisk nova
#

soo its aready dead?

glacial pollen
#

🤦‍♂️

#

rvc gui =/= rvc

#

If I may ask, where did you find out about rvc gui, yt? or someone recommended you it?

#

'rvc gui' is outdated and not used anymore ( for a long while now, in fact )

brisk nova
#

from this

#

someone recommended me it

glacial pollen
#

well, rip my dude

brisk nova
#

oh...

glacial pollen
brisk nova
#

soo its no longer possible to install rvc?

glacial pollen
#

Applio is a fork of rvc
whereas rvc one is the original

#

Pick whichever you want

brisk nova
#

ok

#

thanks

glacial pollen
#

Just please, carefully read the repository and instructions

brisk nova
#

alright

glacial pollen
#

@brisk nova
in case you got lost or something

#

Gluck ~

distant turtle
#

-colab

brisk nova
rare gobletBOT
#

Ayo? @brisk nova level 2 !!! lfg

glacial pollen
glacial pollen
#

I literally attached the msg

brisk nova
#

ok

glacial pollen
brisk nova
#

which one

glacial pollen
#

just read

brisk nova
glacial pollen
#

focus

#

Don't be brainrot, just focus and you'll be good

brisk nova
#

hummm

glacial pollen
#

uhhh, I see you went the harder way

#

F

brisk nova
#

but i downloaded these 2

glacial pollen
#

?

brisk nova
glacial pollen
#

Well then go for applio, simple

brisk nova
glacial pollen
#

Cmon, don't act like if you had no brain

#

The instructions are literally on the repo, nah, even better, a pre-compiled package is there for you to download

#

Like, really, no offense, but I am reading papers rn, coding and working on rvc upgrades, and I can't afford to assist people who, despite being asked to, do not carefully read instructions or msgs. I am pretty sure that's understandable to a lot of people, and someone has to say it out loud already.
Because if you ask me? If I see incompetent people trying to play with AI, I can only expect huge tragedies in the future.

marsh schooner
#

what does ko-fi mean when looking to buy voice models

glacial pollen
#

aka, you tip ( pay ) the creators, you get the models / commission someone for making them

#

same goes for paypal and such ( these are used quite often too, though they're not exactly the same as ko-fi per se )

simple ore
# brisk nova

delete what you got and download a proper compiled release

brisk nova
#

i dont understandddd

glacial pollen
#

that's all you have to do ^

#

You can't get it any simpler man

brisk nova
#

dude

#

i need to download more shit?

#

i download 1 i download 2 i download 3

#

how many times do i need to download?

#

all i want is just an AI voice

#

i already have my model

#

that my friend gave me

crude flame
# brisk nova dude

Maybe you shouldn’t be using Ai since you are having a hard time reading simple instructions

brisk nova
#

theres 3 of them and the RVC

#

and this comand

glacial pollen
#
  1. You're given clear instructions
  2. You're asked to carefully read what is meant to be read by you to avoid redundant questions in here
  3. You download what you should not download and then make a problem out of it
#

You had one simple job. Read the instructions.
Then you'd know that **the only thing you're meant to download is the "precompiled package"**
So please, don't get hysterical just because you can't read

brisk nova
#

i will put them all in trash

#

what do i need to install first

glacial pollen
#

boohooh you can't be helped, I am sorry for your loss.

brisk nova
#

why

glacial pollen
#

@red kayak save me 😭

#

lmao

red kayak
#

im here to save :3

glacial pollen
#

basically, idk how to handle them

#

maybe you have some ideas

red kayak
#

@brisk nova hey mate!

brisk nova
red kayak
#

we've got guides available that give u a step by step tut on how to run RVC

red kayak
#

check those out first and then you can reach out for help if needed

brisk nova
#

but what do i need to install first

rare gobletBOT
#

Ayo? @brisk nova level 3 !!! lfg

brisk nova
#

?

glacial pollen
#

Uhhh, don't think it's a good idea Litsa

red kayak
#

then run the go-web.bat

glacial pollen
#

if they can't read guides right, I doubt they can handle visual c and faiss

brisk nova
#

i have one

brisk nova
red kayak
#

We'll see

red kayak
glacial pollen
#

Again arg, do not use gui, as I said it's outdated and nobody uses it

red kayak
#

download the one i gave ya

glacial pollen
brisk nova
#

ok

red kayak
glacial pollen
#

ikrrr

#

I still wonder who the heck recommends these, as apparently it was an actual 'someone' recommending them the thing(?)

glacial pollen
#

well maybe, but that makes me think

#

are there still no proper guides anywhere? like, 2024 edition

red kayak
#

yeah none

glacial pollen
#

oof, maybe I should legit do one sometime, would save everyone tons of time lol

brisk nova
#

Its installing

crude flame
glacial pollen
#

rip

knotty moth
glacial pollen
# brisk nova But its not ai Voice?

You have to understand one simple thing
'Rvc gui' is outdated.
Nobody maintains it, nobody can promise it's bug-free, performance is most likely worse and, obviously, you lose the features that are in the newer stuff

#

In fact, rvc gui in here is treated as a meme

brisk nova
#

alright

brisk nova
#

what do i need to do

nova plaza
#

how do I safely quit this, I just want to use the current Epoch 150, don't want to go further.
also how do I find the .pth and index file

simple ore
coarse heron
#

the AI cant hum well, is there a way to make it smoother or make it sound better?

coarse heron
#

im using deiteris, none of the models can hum well. I was wondering if i need to tweak something. cant be my mic either.

flint solar
coarse heron
#

sigh* sheesh... i kinda thought about it but didn't know that was indeed the case.

flint solar
coarse heron
#

i guess so... can i train existing models?

flint solar
coarse heron
#

okay bro, I guess I'll go right ahead and learn about AI training kittystare

distant turtle
#

-colab

gloomy flame
#

can someone give me a tutorial on how to install the voice changer

glacial kiln
#

Please how do i create a .PTH file and .INDEX file on the step that says upload a voice model.

I'm having difficulties doing that

glacial kiln
low shard
#

Applio or Mainline?

glacial kiln
rare gobletBOT
#

Ayo? @glacial kiln level 2 !!! lfg

low shard
glacial kiln
low shard
flint geyser
#

mewo

tired raft
#

this is the section to create an ai voice model?

glacial pollen
#

yes

tired raft
#

what do i do?

#

@glacial pollen

glacial pollen
#

you should read the guides carefully

#

but generally, you do not touch that box

#

here's an example of how one could set it

brittle wing
#

i cant send images for help :(

glacial pollen
#

(1) you name the model, pick samplerate that fits your dataset, select version v2

(2) you put the path to your sample, for instance: "bla/bla/bla/your_sample.wav"

(3) feature / f0 extraction

(4) train index button

(5) set the saving frequency to 5 as well, not gonna go too deep into that.

(6) amount of epochs uhhh, well, again not gonna go into advanced details but, pick 200 or 100, maybe 300 if you have larger dataset

(7) batch size: you can try 4, 8, 12, 16 ( if your hardware can handle it, go for 8 or 16 )

for choices pick: yes, no, yes
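
The steps above can be written down as a small settings sketch. The key names and the `validate` helper are hypothetical and do not match any real RVC or Applio config file; the values simply mirror the suggestions in the message (sample rate is left out because it depends on your dataset).

```python
# Hypothetical key names; the values mirror the suggestions in the chat.
training_settings = {
    "model_name": "my_voice",  # placeholder name
    "version": "v2",
    "dataset_path": "bla/bla/bla/your_sample.wav",
    "save_every_epoch": 5,
    "total_epochs": 200,  # 100-300 depending on dataset size
    "batch_size": 8,  # 4, 8, 12 or 16 depending on hardware
}


def validate(settings):
    """Return a list of problems; an empty list means the values are in the suggested ranges."""
    problems = []
    if settings["version"] != "v2":
        problems.append("version should be v2")
    if not 100 <= settings["total_epochs"] <= 300:
        problems.append("total_epochs outside the suggested 100-300 range")
    if settings["batch_size"] not in (4, 8, 12, 16):
        problems.append("batch_size is not one of 4, 8, 12, 16")
    if settings["save_every_epoch"] <= 0:
        problems.append("save_every_epoch must be positive")
    return problems
```

Calling `validate(training_settings)` on the values above returns an empty list; the ranges are starting points from the chat, not hard limits.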

brittle wing
#
Running with the system Python.
Traceback (most recent call last):
  File "C:\Users\admin\Downloads\RVC-GUI-main\RVC-GUI-main\rvcgui.py", line 23, in <module>
    from vc_infer_pipeline import VC
  File "C:\Users\admin\Downloads\RVC-GUI-main\RVC-GUI-main\vc_infer_pipeline.py", line 1, in <module>
    import numpy as np, parselmouth, torch, pdb
ModuleNotFoundError: No module named 'parselmouth'
glacial pollen
#

That's all I can tell you as I am busy rn and can't go too indepth

glacial pollen
brittle wing
glacial pollen
#

Aside, for future.
If you see things like " ModuleNotFoundError: No module named 'parselmouth' "

#

module not found, that means you're lacking some python package or modules ( scripts )

#

typically you could try to install such with pip ( if applicable at given situation )
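
A minimal sketch of that check, using only the standard library: it reports which of the modules imported on the failing line of the traceback are missing from the current Python environment. The helper name is made up for illustration.

```python
import importlib.util


def missing_modules(names):
    """Return the module names from `names` that cannot be imported here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Modules imported on the failing line of the traceback.
needed = ["numpy", "parselmouth", "torch", "pdb"]
for name in missing_modules(needed):
    print(f"missing: {name} (try installing it with pip)")
```

Keep in mind that the pip package name is not always the module name. As far as I know, `parselmouth` is published on PyPI as `praat-parselmouth`, so it is worth checking the project's page before installing.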

tired raft
glacial pollen
brittle wing
#

where can i find any of those

glacial pollen
#

<name> github

#

in google

brittle wing
#

ok

glacial pollen
#

But I'd rather recommend you applio as it's easier to set

brittle wing
#

alright

glacial pollen
#

@brittle wing

#

that's pretty much all for " easiest " option

brittle wing
#

alrighty

rare gobletBOT
#

Ayo? @brittle wing level 1 !!! lfg

tired raft
#

@glacial pollen

#

thats for TB

#

what do we do?

#

@glacial pollen

rare gobletBOT
#

Ayo? @tired raft level 3 !!! lfg

glacial pollen
#

I'd appreciate if you did not spam the pings
I am busy working on my project as mentioned + I reply to many people rn

tired raft
#

sorry

glacial pollen
#

What's the issue?

tired raft
#

the TB thing

#

doesn't work

glacial pollen
#

no I meant, what's the issue in your case
as in, what happened

#

docs 404 ?

#

Not really from website / docs department so can't say much more but

tired raft
#

tensorboard doesn't work

#

error 404

glacial pollen
#

have you tried this one?

#

Unless it's the same one you use ( can't see well on the ss )

tired raft
# tired raft

i used the link you sent, and for the tensorboard thing there was a "here" link and it didn't work as u can see in this screenshot

glacial pollen
#

In that case I can't help much

#

again, I am not responsible for the docs, nor did I take part in creating them, so I wouldn't know what happened and / or if it was moved somewhere

tired raft
#

yeah i understand

#

thanks for the help though

glacial pollen
#

Tensorboard is a very complex topic tho so. For now you're alright without it, as for your first model ( in my opinion at least )

ember bay
#

how to download rvc?

#

w-okada isn't working for me

nocturne mural
#

the same old method ( stealing accounts )

brittle wing
#

what is this about

crude flame
#

nvm found it

tired raft
#

The model is training rn

#

My friend did it

#

I think it's at 550 epochs rn (we put 1000)

oak edge
#

what are these embedded models can anyone guide me through? (My training set is in a south asian language)

simple ore
oak edge
#

soo

#

chinese means chinese lang training set?

simple ore
#

a model trained with chinese hubert only works with chinese hubert for inference

oak edge
#

wait so it isn't about language that im trying to convert

#

so I'm using training sets in tamil, and also converting voices to my training set voice in tamil only

simple ore
#

one more time... a hubert feature extractor is a model that 'transcribes' speech into codes.. imagine one person writing down a chinese greeting as "ni hao ma" and another as "你好吗"

rare gobletBOT
#

Ayo? @simple ore level 39 !!! lfg

simple ore
#

which person does it better?

#

contentvec is a person transcribing any language into roman characters

#

close enough for most purposes

#

there's a custom hubert you can use for Tamil

#

but again I have to say that if you do that, you have to use the same custom hubert for inference
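
The rule above (same feature extractor for training and inference) can be sketched as a simple guard. The embedder labels and the function name are hypothetical; real tools store and name this setting in their own way.

```python
def check_embedder(trained_with, inferring_with):
    """Raise if the embedder chosen for inference differs from the training one."""
    if trained_with != inferring_with:
        raise ValueError(
            f"model was trained with '{trained_with}' features "
            f"but inference is set to '{inferring_with}'"
        )
    return True


# Example: a model trained on contentvec features, used with contentvec.
check_embedder("contentvec", "contentvec")
```

The language of the dataset does not enter the check at all; what matters is that both sides 'transcribe' speech into the same kind of codes.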

coral frigate
#

can someone help me figure out why this is the output im receiving when i try to train my model?

#

when i run the training i go to 1000 epochs in about 5 mins so theres something definitely wrong

simple ore
#

already indicates something is wrong, do not go train

#

figure out what is wrong first

#

and stop using mangio ffs

#

you did not preprocess properly

#

you did not extract features properly

#

you've trained 1000 epochs on two mute files at best

coral frigate
low shard
coral frigate
#

i downloaded the newest version of applio and tried using it and im having the same issue. im wondering if it has to do with the size of my dataset (24hours) because its the first time ive faced this problem. ive made sure the audio is in the folder and that its linked properly. the preprocess takes about 40 minutes to complete which makes sense. my GPU is selected yet it still wont work and im not sure how else to troubleshoot this. im not trying to be annoying or dumb, im just still trying to figure all this out.

knotty moth
# coral frigate i downloaded the newest version of applio and tried using it and im having the s...

that preprocess time doesn't sound normal, it shouldn't be that slow even on a crappy hdd. instead of using a single huge dataset file, I'd suggest following the audio labeling section in this guide: https://rentry.co/RVC-dataset-RX11

coral frigate
brittle wing
#

-colab

dense drift
#

.np

brazen gorgeBOT
dense drift
glacial pollen
#

speaking of mangio.. the heck is up with Kalo 🤔 did he just stop all rvc or ai altogether?

dense drift
glacial pollen
#

a

low shard
glacial pollen
#

o, well, at least something he enjoys

#

so that's good def

pearl cove
#

-rvc

azure marshBOT
mint stag
#

.

scarlet wedge
#

how to use this?

#

how do i upload my voice model?

wild yoke
#

is there a way to use a voice model to change the voice of an existing mp3 file? everywhere i looked doesn't seem to allow custom voice models.

brave garnetBOT
#

Settings for Nvidia GPUs

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low
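
For convenience, the chunk and extra suggestions above can be kept as a lookup table. This is only a restatement of the numbers in the bot message; the dictionary layout and the `suggest` helper are hypothetical and not something the voice changer reads.

```python
# (min_chunk, max_chunk) and extra values copied from the settings message above.
NVIDIA_SETTINGS = {
    "RTX 40": {"chunk": (80, 96), "extra": 16384},
    "RTX 30": {"chunk": (96, 112), "extra": 16384},
    "RTX 20": {"chunk": (112, 128), "extra": 16384},
    "GTX 16": {"chunk": (128, 192), "extra": 8192},
    "GTX 10": {"chunk": (128, 192), "extra": 8192},
}


def suggest(series):
    """Return the suggested chunk range and extra value for a GPU series label."""
    if series not in NVIDIA_SETTINGS:
        raise KeyError(f"no suggestion listed for '{series}'")
    return NVIDIA_SETTINGS[series]
```

For example, `suggest("RTX 30")` gives a chunk range of 96 to 112 with 16384 extra; treat these as starting points and adjust by ear.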

glacial pollen
#

what you wanna do instead is a separation ( to obtain extracted vocals and music / background whatever ) and then inferencing ( aka changing the voice ) then combine it all together
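
The last step of that workflow, combining the converted vocals with the instrumental, can be sketched in plain Python. Separation and inference are assumed to have been done already in other tools; the function name, the mono float samples in the range -1.0 to 1.0, and the `vocal_gain` parameter are assumptions made for illustration.

```python
def mix_tracks(vocals, instrumental, vocal_gain=1.0):
    """Sum two mono tracks sample by sample and clip the result to [-1.0, 1.0].

    The shorter track is padded with silence so nothing is cut off.
    """
    length = max(len(vocals), len(instrumental))
    mixed = []
    for i in range(length):
        v = vocals[i] * vocal_gain if i < len(vocals) else 0.0
        m = instrumental[i] if i < len(instrumental) else 0.0
        mixed.append(max(-1.0, min(1.0, v + m)))
    return mixed
```

In practice most people do this step in an audio editor or DAW, where the vocal level can be balanced by ear instead of with a fixed gain.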

wild yoke
#

I have extracted vocals

#

and everything else

#

idk where id do that, or if i can

rare gobletBOT
#

Ayo? @wild yoke level 1 !!! lfg

glacial pollen
# wild yoke idk where id do that, or if i can

to change the voice you're using a model ( we call that process inferencing )
Now, where can you use the models? you see, these are RVC models, so logically, you'd use RVC or Applio ( think of it as custom rvc with few things here n there )

#

but given that handling rvc isn't really noob-friendly

wild yoke
#

XD

#

thanks

glacial pollen
#

what's so funny?

wild yoke
#

the way you explained it to me. idk, not a bad thing

glacial pollen
#

Trust me, there's too many people that barely can read
at this point it's for me to avoid redundant repetition

cobalt cairn
#

who here is Russian

#

help

#

needed

#

the neural net isn't working

#

pls help me out

#

pls help ai not working:(

candid meteor
#

can someone help?

glacial pollen
#

@candid meteor go ask on audio separation discord

candid meteor
#

what

#

where can i find one

glacial pollen
#

There are people responsible for uvr, bsrof and so on and so on

#

or at least those that work on these

candid meteor
#

ok thanks

glacial pollen
#

In any case, they'll help you

candid meteor
#

ok ill find it

flint solar
#

It has all the models u will need

rugged solar
#

-audio

azure marshBOT
rare gobletBOT
#

Ayo? @rugged solar level 1 !!! lfg

heavy bough
#

-colab

brave garnetBOT
#

Settings for AMD GPUs

Don't forget that your models need to be converted to ONNX!

F0 Det.: rmvpe_onnx (suggested for all series)

7xxx XT cards: 112-128 chunk | +16384 extra
6xxx XT cards: 128-192 chunk | +16384 extra
5xxx XT cards: 192-256 chunk | +8192 extra

RX 580: 192-256 chunk | +8192 extra
RX 570: 192-256 chunk | +8192 extra
RX 560: 256-384 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

robust halo
#

Anyone know why my RVC is talking garbage?

#

😭

#

@scenic gale

#

@brittle wing

#

Like what I'm hearing isn't what I'm saying.

olive vale
#

Hello my rvc is just saying "waiting generating pipeline and pipeline not installed" looped how do i install the pipeline?

brave garnetBOT
low shard
brittle wing
#

What's the best model for echo removal?

#

On MVSEP

glacial pollen
#

as far as I know

flint solar
#

I js use uvr normal de echo

#

Not de echo de reverb

brittle wing
#

Mvsep or x-minus

flint solar
#

Uvr section

brittle wing
# flint solar Mvsep

Aggressive or normal also what do you use for reverb removal also noise?
You first remove reverb then echo?

brave garnetBOT
#

Local Forks 🖥️


Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.

Applio
Simplified, suggested for all, by the Applio team.

RVC Studio
Simplified, suggested for all, by SayanoAI.

Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.

AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.

Replay
From the creators of weights.gg, excellent product for everyone.

brittle wing
#

is it possible to just make my voice slightly deeper and change nothing else

tame mica
#

why did it use k means for whatever reason

simple ore
#

when you get past ~4000 slices it starts to combine features

brittle wing
jaunty quarry
#

i need help with something: if i set the voice changer to use my graphics card it doesnt work, it only works with my CPU

flint solar
brittle wing
flint solar
#

then on rx 10
-de click
-de crackle
-mouth de click

flint solar
brittle wing
flint solar
brittle wing
flint solar
brittle wing
flint solar
brittle wing
flint solar
brittle wing
#

I can use Audacity, but only the web version, I'm on mobile

flint solar
brittle wing
#

Latest BSRoformer right?

flint solar
brittle wing
flint solar
#

and resample to 32khz when exporting as wav

brittle wing
flint solar
brittle wing
flint solar
#

what are u isolating on mvsep?

brittle wing
brittle wing
flint solar
brittle wing
flint solar
brittle wing
flint solar
brittle wing
flint solar
brittle wing
glacial pollen
#

I'd opt more towards plosives handling

glacial pollen
#

Yea, what rvc does in preprocessing is

#

normalization ish, butterworth filtering of low low hz

#

in fact, on the user-end, main things that should be in-check is dynamics, noise and maybe few other things ( aside of obvious reverb and delay, tho slight reverb isn't as destructive really )
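
A rough sketch of the two preprocessing ideas mentioned above: filtering out the very low frequencies and normalizing the level. This uses a simple first-order high-pass as a stand-in for the Butterworth filter, so it is not what RVC actually runs; the 48 Hz cutoff and the 0.9 target peak are assumptions chosen for illustration.

```python
import math


def high_pass(samples, sample_rate, cutoff_hz=48.0):
    """First-order high-pass filter (simple RC form) that attenuates very low frequencies."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate
    alpha = rc / (rc + dt)
    out = []
    prev_in = 0.0
    prev_out = 0.0
    for x in samples:
        y = alpha * (prev_out + x - prev_in)
        out.append(y)
        prev_in, prev_out = x, y
    return out


def peak_normalize(samples, target_peak=0.9):
    """Scale the samples so the loudest one reaches `target_peak`; silence is returned unchanged."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)
    scale = target_peak / peak
    return [s * scale for s in samples]
```

Neither step touches dynamics, noise or reverb, which is why those stay on the user's side of the dataset work.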

potent saffron
#

how do i get the weights cover ai to sound good? when i select a good voice it doesn't sound like them in some parts of the song, not all. any tips?

rare gobletBOT
#

Ayo? @potent saffron level 1 !!! lfg

glacial pollen
#

Yet, given that plosives tend to reside at around 60-70 to 150 / 200 hz ( depending on the voice ), rvc won't tackle that
so the user has to take care of it

flint solar
glacial pollen
#

then each is normalized so, if breathing is captured in there, it'll be fine

#

Only case when you could get rid of such is when they're too contaminated with noise

#

where you have a suspicion rvc would mismatch it with noise

flint solar
glacial pollen
#

F
I mean, you can always add some in, to the dataset but that sometimes might not be ideal

brittle wing
flint solar
#

but its the best

glacial pollen
#

for dereverb I'll always recommend a thing I use in fl ( an AI vst )

brittle wing
flint solar
glacial pollen
#

nope

#

dialogue isolate is actually pretty bad

#

the vst is from waves

#

waves clarity vx dereverb pro

glacial pollen
flint solar
brittle wing
#

It's there too

glacial pollen
#

Nevermind then lol
Wanted to dereverb it for you

brittle wing
flint solar
#

actually really bad

brittle wing
#

Cause I ran out of minutes on X-Minus have to wait til next week

brittle wing
flint solar
brittle wing
#

I mean yesss

#

I know

#

Mel use as is?

rare gobletBOT
#

Ayo? @brittle wing level 17 !!! lfg

flint solar
brittle wing
#

I remember it takes from the vocals too

flint solar
brittle wing
#

Understand?

flint solar
brittle wing
glacial pollen
#

@flint solar check it out
Output isn't tiptop perfect as I gave it a very hard scenario to handle to showcase the performance ( reflections cranked up almost to max aside of reverb )

brittle wing
#

The noise is stuck in embedded

glacial pollen
#

?

glacial pollen
#

well unfortunately yea, unless you know where to find stuff

flint solar
#

ur still de reverbing ur audio

brittle wing
#

BSRoformer dereverb takes away a lot from the lead vocals

flint solar
#

its the same model

brittle wing
brittle wing
glacial pollen
#

Anyway, I'm about to start my work so, would appreciate no unnecessary @ s

barren fern
#

hey can someone suggest me a google colab that works for making ai covers?

rare gobletBOT
#

Ayo? @barren fern level 1 !!! lfg

brittle wing
cedar nymph
#

any idea how to fix: [Failed to fetch
TypeError: Failed to fetch] when trying to download a checkpoint

rare gobletBOT
#

Ayo? @cedar nymph level 1 !!! lfg

flint solar
brittle wing
flint solar
#

yes

brittle wing
#

It's very aggressive there

brittle wing
#

But bsroformer dereverb takes from the vocals

flint solar
brittle wing
glacial pollen
# brittle wing But not as full

You gotta accept the way it is
No current existing method will give you 100% perfect or full vocals

roformer type models will give you the best result but might be aggressive compared to mdx.
Mdx models, on the other hand, are less aggressive ( they typically do not damage the audio ) but their results are questionable and oftentimes quite poor

#

it is a matter of " pick your devil "

knotty moth
#

MDX dereverb has worse quality imo

mellow nest
#

hi, i forgot how to use this stuff. when i download a model, what files do I need? right now im downloading a bin file, does that have the .pth in it?

flint solar
knotty moth
jaunty quarry
#

is there a way to get less delay that works well? because when i use my voice changer it takes several seconds to work

simple veldt
#

When I train a voice with RVC it generates the .pth file(s) but not the .index file(s). Is this a common issue?

brave garnetBOT
rugged solar
#

-uvr

azure marshBOT
# rugged solar -uvr
Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tool.

turbid root
#

How to fix "RuntimeError: Error(s) in loading state_dict for SynthesizerTrnMs768NSFsid:"

#

In RVC Disconnected Training

#

I can't train cause of that error

dusty mural
#

does anyone know that one website that is used for making ai covers? if anyone knows what im talking about please let me know

dusty mural
#

thank u sm

glacial pollen
brittle wing
frosty python
#

Hi, it might be the wrong place to ask this but i recently used applio and suddenly everytime i convert, there would be segments, i did restart and reset the output setting but it seems to stay there, is there anything i can do to convert without the segmentation?

Edit: Nevermind, it worked by simply converting and not opening the output tab, thank you all

robust halo
#

@glacial pollen

#

Is there something not laggy?

glacial pollen
#

the heck you talk about

tropic nymph
#

hey everyone why is my voice so glitchy

knotty moth
brittle wing
knotty moth
brittle wing
tropic nymph
brittle wing
#

The more aggressive or less

knotty moth
knotty moth
brittle wing
#

I mean what steps and models

#

What are the actual settings for denoising

knotty moth
# brittle wing Nice how do I make a dataset through using this colab

you can follow this datasetting guide: https://rentry.co/RVC-dataset-RX11
(use vpn if you can't load some contents)

brittle wing
#

Use test time augmentation?

knotty moth
# brittle wing What should I tick/untick on your Colab
  • extract_instrumental: includes inversion of the target stem
  • use_modelname: the model name is included in the output file name (Model Test Mode in UVR)
  • use_modelconf: some config params (overlap & chunk_size) are included in the output file name
  • use_customconfig: will use the custom config below it
  • not 100% sure but I think TTA is not really necessary
brittle wing
#

Use custom configuration or not?

knotty moth
brittle wing
#

That's what I'm asking

brittle wing
#

@knotty moth is this correct

knotty moth
brittle wing
#

You said chunk size 352800, Mel roformer dereverb normal

#

For reverb removal

brittle wing
knotty moth
brittle wing
#

1 or 2?

knotty moth
brittle wing
#

Thanks

robust halo
#

RX 570.

glacial pollen
#

lagging is to be expected

#

alternative you have is to just, go for onnx and w-okada but with quite a lot of latency so sadly yea, you won't get any actual 'realtime' experience

brittle wing
knotty moth
brittle wing
#

Cuz 0.3 still leaves out

knotty moth
brittle wing
knotty moth
# brittle wing For deecho

btw I have been working on mostly good ol' rock & metal songs, barely on the modern 10's and 20's pop songs that may contain such difficult echoes

knotty moth
brittle wing
knotty moth
brittle wing
knotty moth
#

and also sometimes love live and weeb songs

#

the echoes are still quite easily removed by UVR/dereverb models

brittle wing
brittle wing
knotty moth
brittle wing
#

UVr Denoiser at 0.5 or Mel-roformer denoise 1?

knotty moth
brittle wing
brittle wing
pastel kiln
#

so how do i make ai covers guys

knotty moth
pastel kiln
#

🇬 🅰️ 🇾

#

GUYS HOW

frosty python
#

the easiest ive had experience with is Applio, just run the bat and you can even train model yourself if you have the resource

knotty moth
# brittle wing You said chunk size 352800, Mel roformer dereverb normal

I found a secret sauce: when I tried on kim's melroformer, chunk_size = 485100 turns to be optimal one since it corresponds to dim_t = 1101. it is also used in unwa's models in their config file, and I think it should also apply to other roformer models, yea including the dereverb & denoise model as well.

@brittle wing notice this also..
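
The two chunk sizes mentioned in this thread (352800 and 485100) are both consistent with `chunk_size = hop_length * (dim_t - 1)` when `hop_length` is 441. That hop length is an assumption inferred from the numbers, not something stated in the chat, so check the `hop_length` in your own model's config before relying on it. A minimal sketch of the conversion:

```python
HOP_LENGTH = 441      # assumption: inferred from the numbers, not stated in the chat
SAMPLE_RATE = 44100   # assumption: only used to express the chunk in seconds


def chunk_size_from_dim_t(dim_t, hop_length=HOP_LENGTH):
    """Samples per chunk for a given number of STFT frames (dim_t)."""
    return hop_length * (dim_t - 1)


def dim_t_from_chunk_size(chunk_size, hop_length=HOP_LENGTH):
    """Inverse of chunk_size_from_dim_t for chunk sizes that divide evenly."""
    return chunk_size // hop_length + 1


if __name__ == "__main__":
    for dim_t in (801, 1101):
        size = chunk_size_from_dim_t(dim_t)
        print(dim_t, size, size / SAMPLE_RATE)
```

Under these assumptions, dim_t = 801 gives the 352800 chunk (8 seconds) and dim_t = 1101 gives the 485100 chunk (11 seconds).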

brittle wing
#

You're smart

polar raft
#

hey , i js started to use the voice changer , is there any way for it to mute the app so i dont hear myself echoing on a call

tame mica
azure marshBOT
tame mica
#

^

hallow thistle
tame mica
#

agfdsgh

knotty moth
brittle wing
#

@knotty moth is it okay if I use BSRoformer for Acapella and then the Mel roformer karaoke model for lead vocals and after I use Mel roformer dereverb normal in the colab then UVr deecho at 0.3 and Mel roformer Denoiser 1 in the Colab? Will that help me get model maker

brittle wing
#

@alisa how do you prepare your samples for inference tho

shell zodiac
#

I trained a voice and i have a G_2333333 and a D_233333 Data and i cant use it in RVC GUI why?

knotty moth
brittle wing
#

For noise UVR Denoiser or Mel Denoiser? one last question

knotty moth
knotty moth
brittle wing
brittle wing
#

It actually adds its own noise in the output

#

It has been proven I remember thinking of it and someone posted proof of that through spectrogram analysis

#

YES

#

So Mel roformer Denoise for dataset and stuff

#

But Uvr Denoise on uvronline is the best at times it doesn't add noise or does it ...

#

But yeah UVr Denoise is good for denoising UVr outputs of instrumentals.

knotty moth
#

nevertheless, imo Renegate plugin can act as a final denoising process (it works as noise gating but in smarter way)

brittle wing
#

It's the best but noisy

#

But I wouldn't use that model for Denoise cause it generates its own noise! For real, I noticed it myself and there was even a post w proof

knotty moth
brittle wing
#

@knotty moth stop suggesting me unwa models for Acapella they make the lead vocals sound muffled, I tried duality V1 and just no and the dereverb eats them out even more.
I prefer the official BSRoformer model made by the true developers, thanks

knotty moth
brittle wing
knotty moth
brittle wing
#

Beta 4 oh the background vocals and reverb are very stubborn

#

These models don't work for me.

#

If they work for you fine.

knotty moth
brittle wing
#

BSRoformer is the best.

knotty moth
brittle wing
#

They say they don't plan on releasing it

knotty moth
brittle wing
#

Nah I ran out of minutes tho

stoic forum
#

Hi I'm new to applio and I was wondering if there's some way to tie settings to a voice model. Specifically TTS Voice, TTS Speed, and the Pitch.

#

Like is there maybe some kind of settings file that could be made and put in the model folder that could set it up?

low shard
stoic forum
#

Rip

tame mica
stoic forum
#

-# I've been found

fallow linden
#

Hello

fallow linden
#

I forgot how to make ai covers soo.... it's been a long time since i last did it

stoic forum
#

I've noticed a few voice models come with a "trained" and an "added" index file. What is the difference between them?

azure marshBOT
unreal yarrow
#

Hey guys I just joined and how da heck do I get the list of existing models, voice models shows no list. TY HUGGZ

golden walrus
#

kittypawbite guys, why do my models always have a quiet voice, like barely hear it unless i put it 200% output

flint solar
brittle wing
#

@quasi gate what's w the moon sounds

#

-colab

azure marshBOT
# brittle wing -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

simple ore
golden walrus
low shard
#

@brittle wing do not send datasets here for copyright reasons

brittle wing
#

💀

low shard
# brittle wing 💀

If you crashed 4 times out of RAM, your PC is not powerful enough to do it locally

#

you can use cloud (remote good pc)

brittle wing
low shard
#

the server got already taken down once

brittle wing
#

bruh

low shard
#

we don't want this again

low shard
# brittle wing bruh

You can run RVC on cloud (remote good pc):

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
    • Mainline (UI, No guide as of right now)
    • Applio (UI, No guide as of right now)

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time

  3. Be sure to know about the tensorboard

If you are looking for the easiest free way, use https://weights.gg, which ofc uses RVC

simple ore
golden walrus
#

but the result is so much smaller

simple ore
#

what are you using for training?

analog obsidian
golden walrus
#

like if i use normal mic, it's normal, whenever i use rvc, it got really small

analog obsidian
golden walrus
#

yes, very.

analog obsidian
#

ahh then if your mic volume is fine, its the model fault

golden walrus
#

but let me try again, maybe it was my fault somewhere

analog obsidian
#

so u can check if your real mic volume is loud enough

golden walrus
#

kittyblush okay, i will do that, thank you for your time

simple ore
golden walrus
#

kittypawbite other models have the same issue tho

golden walrus
#

kittypawbite but my input is fine

knotty moth
golden walrus
#

i used applio to train model

knotty moth
#

and inference?

golden walrus
#

and this

#

kittypawbite i don't know the name so i just call it rvc

knotty moth
golden walrus
#

but i set it at 100

knotty moth
#

try speak louder since you apply Sup2, or how about when Sup2 turned off?

golden walrus
#

same volume when turn off sup 2

brittle wing
#

Hello, I am using local RVC, I have a save every 30 epochs. Last night I was training a model and the PC shut down but I still have my saves. Can I continue training from models with these saves?

golden walrus
#

aye, maybe i will reinstall this

#

-rvc

azure marshBOT
analog obsidian
golden walrus
#

i read somewhere doing so will drastically reduce the quality right ?

analog obsidian
golden walrus
#

ahhhhhhhhhhhh

knotty moth
analog obsidian
#

be sure to not increase it too loud

golden walrus
#

i got it

analog obsidian
#

when it reach distortion/clipping the model quality degrades
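
A quick way to check whether a clip is already clipping, or how much gain it can take before it does, can be sketched as below. This assumes the audio is already loaded as float samples in the range -1.0 to 1.0; the 0.999 clipping threshold and the 1 dB headroom are illustrative choices, not values from this chat.

```python
import math


def peak_dbfs(samples):
    """Peak level in dB relative to full scale (0 dBFS = 1.0)."""
    peak = max(abs(s) for s in samples)
    return -math.inf if peak == 0 else 20 * math.log10(peak)


def clipped_fraction(samples, threshold=0.999):
    """Share of samples sitting at or above the (assumed) clipping threshold."""
    return sum(1 for s in samples if abs(s) >= threshold) / len(samples)


def max_safe_gain(samples, headroom_db=1.0):
    """Largest linear gain that keeps the peak headroom_db below full scale."""
    peak = max(abs(s) for s in samples)
    if peak == 0:
        raise ValueError("silent clip, gain is undefined")
    return 10 ** (-headroom_db / 20) / peak


if __name__ == "__main__":
    quiet_clip = [0.0, 0.05, -0.1, 0.08]
    print(peak_dbfs(quiet_clip), max_safe_gain(quiet_clip))
```

If `clipped_fraction` is noticeably above zero after boosting, the boost was too much and the clip is better re-exported at a lower gain.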

golden walrus
#

catblush i get it now

#

thank you

golden walrus
#

ah 1 last question.
if i train a model, those D and G are pre-trained right ?
It stores the data I feed it, so if i want a model for singing in my language, i can throw a bunch of songs then train ?
After that i use a voice i want to mimic to train on top of that pretrain ?

simple ore
#

D and G are model weights used for training

analog obsidian
simple ore
#

default pretrain is generalized model that is average for everything

#

when you train a new model on top of it, you change generalized model to more specific

analog obsidian
simple ore
#

training a new model on top of a different model may not be beneficial

golden walrus
#

ohhhh, so how to allow a model to sing tho ?

analog obsidian
#

they dont have singing data so they are never gonna sound as good as a model trained on singing

brittle wing
golden walrus
#

pepecry maybe i think too simple about these models

#

so i have to pick 1 between singing and talking model

analog obsidian
# brittle wing Yes, I've registered them both. How can I continue the training?

go to the training tab
in the model name use the same exact name of your model (if you named it lyery before, then it has to be exactly lyery)
don't preprocess, don't pitch extract (very important too)
set the same batch size you set (VERY IMPORTANT) if you used batch 8 while training, use batch 8 again
same epoch amount and same save freq amount

#

if everything is the same, you can now click start training

#

and is going to continue training using the latest g and d
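
Before clicking start, it can help to confirm that the model folder really holds a matching G and D save. The sketch below assumes the saves are named `G_<step>.pth` and `D_<step>.pth` (based on the names people mention in this channel); the trainer itself decides what it loads, so this is only a sanity check, and `latest_pair` is a hypothetical helper.

```python
import re

# Assumed naming scheme: G_<step>.pth / D_<step>.pth
CKPT_PATTERN = re.compile(r"^([GD])_(\d+)\.pth$")


def latest_pair(filenames):
    """Return the newest (G, D) save that exists for the same step, or None."""
    steps = {"G": set(), "D": set()}
    for name in filenames:
        match = CKPT_PATTERN.match(name)
        if match:
            steps[match.group(1)].add(int(match.group(2)))
    common = steps["G"] & steps["D"]
    if not common:
        return None
    step = max(common)
    return f"G_{step}.pth", f"D_{step}.pth"


if __name__ == "__main__":
    import os
    # "." is a placeholder; point it at your model's log folder.
    print(latest_pair(os.listdir(".")))
```

If a G save has no matching D save (for example because the PC shut down mid-write), the helper falls back to the last step where both exist.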

analog obsidian
#

singing models suck at speech

golden walrus
#

ah i get it

analog obsidian
#

and don't mix singing and speech in a dataset

#

you'll get better result training a pure speech/singing model than a mixed

golden walrus
#

by the way, is embedder model important ?

analog obsidian
golden walrus
#

ah so pretrained is where i pick which one suitable for my language ?

analog obsidian
#

despite the original one being on english only, you can train any language with good results (i train mostly spanish models and they have good pronunciation even if the pretrained is english only)

#

avoid using custom pretraineds and use only the original

#

you can use them but you may get "exclusive" problems related to that specific custom pretrained used

golden walrus
#

new thing learned

#

right, i will get it to work, but do you have an example of epoch number ? or just purely observe the graph

analog obsidian
#

me personally i dont train over 200 epochs

#

most of my models are usable at around 100-150 ish

golden walrus
#

kittyblush ah okay

#

cuz i only do 120 then i look at graph

#

but sometimes graph is flat

#

no low point

#

just flat

analog obsidian
#

or both together mixed

golden walrus
#

i set batch size at 8 and most of the data is around 40mins

#

kittypawbite let me try this time, maybe it's different.

analog obsidian
golden walrus
#

i set it 0.982 like in doc said

analog obsidian
golden walrus
analog obsidian
#

but smooth graphs in big datasets is normal anyways

#

not really a bad thing

#

as long you hit low points is fine

#

and the graph is not rising

#

small datasets have multiple low points but that doesn't mean its good

#

(actually thats kind of bad lol)

golden walrus
#

kittypawbite so how do you know when it get overtrained

#

if you train a small dataset

analog obsidian
# golden walrus if you train a small dataset

u have to hear the epochs, smaller datasets suffer from overfitting rather than overtraining
for example you'll hear robotic sibilances (s, ch sounds), they also sound unnatural due to the lack of data

golden walrus
#

gru i have to check one by one ?

analog obsidian
#

choose the mel low points, not the g/total ones

#

g/total is the generator loss, it's merely the sum of the mel, kl and fm terms (plus the adversarial one)

golden walrus
#

ahhhhhhh

#

1 step closer to Chamber's voice

analog obsidian
#

you can also get those on big datasets but those happen very late in training (but the big dataset is always going to sound more natural)

#

while on smol datasets that happen extremely early

golden walrus
#

catblush that explained why 300 epoch sound so bad

analog obsidian
#

10 minutes is the bare minimum, it still overfits quite fast but not as fast as a 5 minute one

#

the more data u get, the later the model is going to overfit

#

and the more natural is going to sound

#

with more data

#

you don't need to de-ess the dataset to fix the robotic SH sounds, this is just a myth

#

is merely a dataset length thing

golden walrus
#

ah, so can i add data to the model ? like i got 1 data that is 10 mins, i train it. then later i got more data, can i add on top of it ? or i have to train again ?

analog obsidian
#

but you have to train it again

#

from 0

#

a new model with the added data

brittle wing
analog obsidian
#

you can't add data to an already existing model

golden walrus
#

kittypawbite interesting

#

well, that's all for my curiosity. You r awesome

analog obsidian
#

no prob training model is easy, whats hard is to be sure the dataset is not damaged hehe

#

keep in mind that every model trained on mvsep/separation models is damaged

#

so the quality is degraded a lot compared to a non mvsep model

#

so if you notice the model sounds a bit... broken? is just because of that

quasi dagger
golden walrus
#

kittyblush oh, that explains why my chamber voice is so robotic

#

like

#

metallic af

analog obsidian
golden walrus
#

i will redo the data

#

thank youuuuuuuuuuu

quasi dagger
analog obsidian
#

codename has a tutorial explaining model merging

knotty moth
analog obsidian
#

im praying everything goes well, this sounds amazing

quasi dagger
analog obsidian
#

not only separation models but every type of damage will do it

#

ideally you want the dataset to have very little post processing or in best case, no post processing at all, raw quality

quasi dagger
#

I guess every separation model would do that in UVR as well?

analog obsidian
#

private models on mvsep and public uvr models

quasi dagger
#

So a clean acapella would be the best i guess

#

Like leaked stems

analog obsidian
#

raw acapella, without effects

quasi dagger
#

Hard to find 🤣

analog obsidian
#

i agree 😭

knotty moth
knotty moth
analog obsidian
#

anyways i notice what kills the quality in the dataset is when we use isolated de-reverb datasets

quasi dagger
#

I noticed that too

analog obsidian
#

i have a couple of mel roformer-only models (that lacks any type of reverb/sound effect) and they have very good quality

#

aka streamer models

#

so is not like mel roformer/bs is gonna destroy the quality by itself

#

is when you use multiple separation models to remove multiple sound effects

#

so more effects removed, more robotic the end result

#

ideally to remove reverb use clarity vx de-reverb

#

is better than the separation models since it doesnt destroy the quality too much (still expect a slightly metallic model)

knotty moth
quasi dagger
#

That's why i prefer live/intimate performances as they have less reverb, such as the tiny desk concerts

quasi dagger
analog obsidian
#

tldr less separation models used, less metallic the model will be

knotty moth
#

may also depend on amount of background noise & instruments
I think the BGM in vtuber talking streams are not as loud as song tracks, so the extracted vocals would be less muffled enough

analog obsidian
#

natural background noise tho

#

not synthetic noise

#

rvc is pretty robust towards noise so dont worry too much about that

#

focus more on how muffled/damaged the audio sounds

#

and keep the least damaged audio in the dataset

knotty moth
#

the reason why I barely made dataset from song tracks...

#

only for inference

simple ore
#

sibilants are made from white noise "columns"

#

but during training this white noise gets reshaped, not exactly in a good way - this is not baked enough metallic thing

#

after 3000 attempts:

#

and after 5.5k it is close to the original

#

this was trained on a single 0.5s sample

analog obsidian
simple ore
#

unfortunately the default training method is random, so you cant guarantee it would hit every c, s, ch x 5000+ times during a training loop

analog obsidian
simple ore
#

so that's where the size of the dataset or number of epochs comes in play

#

during one epoch loop a random 1/10th of a standard 3sec slice of each sample is used

#

if you decide to slice your training set to 5+ sec samples, it is even less than 1/10th
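
To get a feel for what "a random 1/10th per epoch" means, here is a deliberately simplified model: treat each epoch as an independent draw that covers a given moment of a slice with probability equal to the fraction used. Real segment sampling has edge effects, so treat the numbers as rough estimates under that assumption, not as the trainer's actual behavior.

```python
import math


def seen_probability(fraction, epochs):
    """Chance that a given moment was used at least once after `epochs` epochs."""
    if not 0 < fraction < 1:
        raise ValueError("fraction must be between 0 and 1")
    return 1.0 - (1.0 - fraction) ** epochs


def epochs_for_coverage(fraction, target):
    """Smallest epoch count where seen_probability reaches `target`."""
    if not 0 < fraction < 1 or not 0 < target < 1:
        raise ValueError("fraction and target must be between 0 and 1")
    return math.ceil(math.log(1.0 - target) / math.log(1.0 - fraction))


if __name__ == "__main__":
    for epochs in (10, 50, 100, 200):
        print(epochs, round(seen_probability(0.1, epochs), 4))
```

In this model a moment has roughly a 65% chance of being seen within 10 epochs at a 1/10 fraction, and longer slices (a smaller fraction) push that number down, which matches the point about 5+ second samples.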

analog obsidian
#

or it was 3?

#

cant remember lol

simple ore
#

usually it is 3, unless there's a lot of silence so it cuts smaller pieces around silence gaps

#

I made a modification of the training loop, so it goes thru the entire set each epoch

#

0.5s slices with a small overlap
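
For illustration only, this is one way fixed-length slices with a small overlap could be laid out so that every sample of a clip lands in at least one slice. It is not the actual modification described here; the 40000 Hz sample rate and the 0.05 s overlap are assumed values, and `overlapping_slices` is a hypothetical helper.

```python
def overlapping_slices(n_samples, size, overlap):
    """Return (start, end) index pairs that together cover 0..n_samples."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be in [0, size)")
    if n_samples <= size:
        # Clip shorter than one slice: use it whole (or nothing if empty).
        return [(0, n_samples)] if n_samples > 0 else []
    step = size - overlap
    windows = [(start, start + size)
               for start in range(0, n_samples - size + 1, step)]
    if windows[-1][1] < n_samples:
        # Add one final slice aligned to the end so the tail is not dropped.
        windows.append((n_samples - size, n_samples))
    return windows


if __name__ == "__main__":
    sample_rate = 40000                 # assumed
    size = int(0.5 * sample_rate)       # 0.5 s slices, as in the message above
    overlap = int(0.05 * sample_rate)   # assumed "small overlap"
    print(len(overlapping_slices(3 * sample_rate, size, overlap)))
```

Iterating over all of these windows once per epoch is what would guarantee every bit of data gets used, at the cost of more steps per epoch.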

analog obsidian
#

that helped with sibilances?

#

making them better than default rvc?

simple ore
#

well, it guarantees every bit of data is being used

#

so like 12min data set is more or less equal to 2hr normal rvc dataset

analog obsidian
#

it can be used in normal finetuning?

simple ore
#

I need to adjust learning rate, I think, so it does not overfit over long epoch.. or maybe save a model more often

analog obsidian
#

ow i see

simple ore
#

12min x 15 epoch of singing data

#

lil undercooked, but still pretty good for what it was made from

#

I may add it to Applio as 'experimental' training method

analog obsidian
#

uhm yea i suppose fully cooking it would give better results than original rvc

knotty moth
quasi dagger
simple ore
#

I guess it's an unvoiced piece of the audio