coarse heron Nov 19, 2024, 12:30 AM

#

thanks again, Deitris is indeed a lot better and more stable than okada.

marsh schooner Nov 19, 2024, 1:33 AM

#

does someone have an attractive female voice, free or something i can pay for, idc, im tired of making datasets that turn out mid

azure osprey Nov 19, 2024, 1:36 AM

#

Could this be why it doesn't want to work? It's happened every time and I probably should have said so from the start, but it happens every time I try a fresh install

#

Some of them just get stuck and don't download and it's not as if there's a "Retry" button

#

As someone who prides themselves on being able to usually fix a problem with a computer this has got me brain burnt

#

#

Wouldn't be so bad if it didn't take upwards of 20 minutes to complete

#

the ones that aren't above 19 minutes are frozen 🙃

mental dawn Nov 19, 2024, 1:45 AM

#

What would you all say is important when trying to make a real time rvc that can keep up with someone whose pitch often changes between 75 and 300 Hz? Trying to develop a model that can keep up with someone who has that kind of speech pattern.

knotty moth Nov 19, 2024, 1:55 AM

#

no wonder this channel is so messy, you should discuss that topic in #🔍│help-w-okada

azure osprey Nov 19, 2024, 1:57 AM

#

I think I got it to work!

rare gobletBOT Nov 19, 2024, 1:57 AM

#

Ayo? @azure osprey level 2 !!! lfg

azure osprey Nov 19, 2024, 1:57 AM

#

As of right now, disregard my crying

knotty moth Nov 19, 2024, 2:00 AM

#

if the input audio is too short, try extending it to at least around 10-20 sec
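
A quick way to check a clip against that length before using it. This is a minimal sketch with Python's standard `wave` module, assuming the input is an uncompressed WAV file. The function names and the 10-second default are illustrative and not part of any voice changer.

```python
import wave


def wav_duration_seconds(path):
    """Return the length of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_long_enough(path, minimum_seconds=10.0):
    # 10 s is the lower end of the "10-20 sec" suggestion above (assumed default).
    return wav_duration_seconds(path) >= minimum_seconds
```

If `is_long_enough("input.wav")` is false, the clip can be padded or re-recorded until it passes.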

unique rock Nov 19, 2024, 2:03 AM

#

How do I make my model sound good? For example, before saying a phrase or part of a song, there is a type of breathing, right? So I want this not to sound too robotic, and I train my models without this type of breathing, just the voice. What do you recommend?

azure osprey Nov 19, 2024, 2:10 AM

#

Now it crashes whenever I try a custom voice, lemme watch a few more tutorials before I post anything more about this issue

#

This shit just doesn't want to be easy whygod

azure osprey Nov 19, 2024, 2:33 AM

#

Checking around it doesn't seem like I'm doing anything wrong but getting an error code and crash when I use any custom voice

#

probably leaked some important info there idk, don't really care

azure osprey Nov 19, 2024, 3:37 AM

#

please ping me if you have a fix

#

or any ideas

latent kettle Nov 19, 2024, 3:43 AM

#

azure osprey Checking around it doesn't seem like I'm doing anything wrong but getting an err...

Connect your internet. It will surely work. There is a plug-in named sup3 that is loaded over the internet.

azure osprey Nov 19, 2024, 3:44 AM

#

I was connected to the internet ;-;

latent kettle Nov 19, 2024, 3:44 AM

#

Then try to delete user settings. And restart it again

azure osprey Nov 19, 2024, 3:44 AM

#

I did that too, but I'll try it again

latent kettle Nov 19, 2024, 3:45 AM

#

If not working re-extract it. And try to launch it again

#

Delete the old one too

azure osprey Nov 19, 2024, 3:46 AM

#

whygod Alright, I'll do it again

rare gobletBOT Nov 19, 2024, 3:46 AM

#

Ayo? @azure osprey level 3 !!! lfg

azure osprey Nov 19, 2024, 3:46 AM

#

This was the first time the installation went off without a hitch

#

But I'll do it again

latent kettle Nov 19, 2024, 3:46 AM

#

Also ask in #🔍│help-w-okada

azure osprey Nov 19, 2024, 3:46 AM

#

Ah, will do, I asked here cuz it had to do with voices, my bad

latent kettle Nov 19, 2024, 3:47 AM

#

@pastel oak is a great helper

azure osprey Nov 19, 2024, 3:47 AM

#

He was helping me earlier, great guy

latent kettle Nov 19, 2024, 3:47 AM

#

azure osprey He was helping me earlier, great guy

So did you get your answer?

azure osprey Nov 19, 2024, 3:47 AM

#

no...

#

I figured the part he was helping me with out on my own

#

But it's no fault of his

latent kettle Nov 19, 2024, 3:48 AM

#

Okay then re install it

azure osprey Nov 19, 2024, 3:48 AM

#

Yeah doing that now

#

This thing has fought me the entire way lmfao

#

I appreciate your help btw

#

Alright all done and connected, default voices work, I'm going to try a custom one now

#

Same thing

#

Crashes on Custom Voice

brittle wing Nov 19, 2024, 6:22 AM

#

-colab

azure marshBOT Nov 19, 2024, 6:22 AM

#

brittle wing -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

magic musk Nov 19, 2024, 6:51 AM

#

why cant i go over 1000 epoch when trying to train??

#

like i stopped, reentered my g&d but i still cannot go over 1000 epoch

#

like it detects its already 1000 and stops

knotty moth Nov 19, 2024, 7:10 AM

#

magic musk why cant i go over 1000 epoch when trying to train??

for original RVC, you can raise the maximum of the total epoch slider in the GUI by finding this part in infer-web.py and changing it like this:

                    total_epoch11 = gr.Slider(
                        minimum=2,
                        maximum=10000,
                        step=1,
                        label=i18n("总训练轮数total_epoch"),
                        value=500,
                        interactive=True,
                    )

#

though practically most models will be likely to overtrain in less than 1000 epochs
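
The same change can be scripted instead of edited by hand. This is a rough sketch, assuming the slider is declared the way the snippet above shows (`total_epoch11 = gr.Slider(` with `maximum=` before any closing parenthesis). The helper name is hypothetical; back up infer-web.py before writing anything to it.

```python
import re

# Matches everything from the slider assignment up to the digits after "maximum=".
_SLIDER_MAX = re.compile(r"(total_epoch11\s*=\s*gr\.Slider\([^)]*?maximum\s*=\s*)\d+")


def raise_epoch_slider_maximum(source, new_maximum=10000):
    """Return the file text with the total_epoch11 slider maximum replaced."""
    patched, count = _SLIDER_MAX.subn(
        lambda m: m.group(1) + str(new_maximum), source, count=1
    )
    if count == 0:
        # The layout differs from the snippet above; edit the file manually instead.
        raise ValueError("total_epoch11 slider not found")
    return patched
```

Usage would be reading infer-web.py as text, passing it through the function, and writing the result back.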

low shard Nov 19, 2024, 8:38 AM

#

magic musk like it detects its already 1000 and stops

tbh its really worthless to train over 1k

#

i highly suggest u not to do it

#

more epochs dont mean more quality

#

use the tensorboard

low shard Nov 19, 2024, 8:57 AM

#

azure osprey Now it crashes whenever I try a custom voice, lemme watch a few more tutorials b...

What’s ur pc gpu?

#

also be sure to not use yt tuts

#

-rt

azure marshBOT Nov 19, 2024, 8:57 AM

#

low shard -rt

💻 Local Realtime RVC

low shard Nov 19, 2024, 8:57 AM

#

be sure to use only the 1st link, the wokada fork

low shard Nov 19, 2024, 8:58 AM

#

unique rock How do I make my model sound good? For example, before saying a phrase or part o...

you need a better dataset and check the tensorboard

#

realtime voice changer for calls?

#

-rt

azure marshBOT Nov 19, 2024, 9:00 AM

#

low shard -rt

💻 Local Realtime RVC

low shard Nov 19, 2024, 9:00 AM

#

1st link, wokada fork

#

FOR WOKADA (REALTIME VOICE CHANGER FOR CALLS) ASK IN #🔍│help-w-okada

knotty moth Nov 19, 2024, 9:01 AM

#

knotty moth no wonder this channel is so messy, you should discuss that topic in <#115929016...

skullsob

low shard Nov 19, 2024, 9:01 AM

#

low shard # FOR WOKADA (REALTIME VOICE CHANGER FOR CALLS) ASK IN #🔍│help-w-okada

knotty moth Nov 19, 2024, 9:02 AM

#

low shard # FOR WOKADA (REALTIME VOICE CHANGER FOR CALLS) ASK IN #🔍│help-w-okada

pinned tskr matsuripray

low shard Nov 19, 2024, 9:02 AM

#

knotty moth no wonder this channel is so messy, you should discuss that topic in <#115929016...

Real

low shard Nov 19, 2024, 9:02 AM

#

knotty moth pinned tskr matsuripray

next time someone asks about wokada maybe we should just tell them to use #🔍│help-w-okada

#

this channel got really messy tbh

azure osprey Nov 19, 2024, 9:06 AM

#

low shard What’s ur pc gpu?

RTX 4060

#

more info about my problem in #🔍│help-w-okada

low shard Nov 19, 2024, 9:10 AM

#

oop very weird, thought u had some pre-historical thing

pastel oak Nov 19, 2024, 9:14 AM

#

azure osprey Could this be why it doesn't want to work? It's happened every time and I probab...

Well probably yea.

You can manually download these files from huggingface and drag them into the folders, i can send a link later

azure osprey Nov 19, 2024, 9:14 AM

#

I got them to work now, the new issue is posted in the okada channel

pastel oak Nov 19, 2024, 9:14 AM

#

Okok

azure osprey Nov 19, 2024, 9:15 AM

#

I do appreciate your help though

wary apex Nov 19, 2024, 1:07 PM

#

can someone tell me how to use on Mac

latent kettle Nov 19, 2024, 2:01 PM

#

wary apex can somone tell me how to use on Mac

What ?

low shard Nov 19, 2024, 2:10 PM

#

wary apex can somone tell me how to use on Mac

Btw already helped in #🧬│ai-chat message

magic musk Nov 19, 2024, 3:18 PM

#

low shard tbh its really worthless to train over 1k

okay, wont then

#

what file should i use?

#

like, to import my model in the voice changer

#

i guess the added_... as the index

#

but what pth?

simple ore Nov 19, 2024, 3:31 PM

#

pth is the voice model

#

index is a cherry on top

simple ore Nov 19, 2024, 4:28 PM

#

trying to load sovits model?

#

or v1 pretrain into v2 training

#

768 is the number of channels in rvc v2 model

#

256 is in v1

#

what are you trying to do?

#

using an 2-year old RVC app or something?
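
The channel counts mentioned above (768 for v2, 256 for v1) can be turned into a small lookup for when an error message shows one of those numbers. This is only a sketch: the function name is made up, and it assumes the number in the error really is the feature dimension.

```python
# Feature dimensions as stated above: 256 for RVC v1, 768 for RVC v2.
FEATURE_DIM_TO_VERSION = {256: "v1", 768: "v2"}


def guess_rvc_version(feature_dim):
    """Map a feature dimension seen in an error message to an RVC version."""
    try:
        return FEATURE_DIM_TO_VERSION[feature_dim]
    except KeyError:
        raise ValueError(f"unexpected feature dimension: {feature_dim}") from None
```

If a model reports one dimension and the pretrain or app expects the other, the versions are probably mixed up.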

rare gobletBOT Nov 19, 2024, 4:30 PM

#

Ayo? @limpid cradle level 1 !!! lfg

simple ore Nov 19, 2024, 4:30 PM

#

v1 model should work for inference

#

at least Applio supports both v1 and v2

#

dunno about mainline

#

-rvc

azure marshBOT Nov 19, 2024, 4:31 PM

#

simple ore -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

simple ore Nov 19, 2024, 4:31 PM

#

err

lavish escarp Nov 19, 2024, 4:52 PM

#

-colab

azure marshBOT Nov 19, 2024, 4:52 PM

#

lavish escarp -colab

distant turtle Nov 19, 2024, 5:37 PM

#

-colab

azure marshBOT Nov 19, 2024, 5:37 PM

#

distant turtle -colab

brisk nova Nov 19, 2024, 6:53 PM

#

what do i need to do to run RVC

#

it always closes itself when i try to open it

#

i open the other file

#

and its says that

#

help?

coral frigate Nov 19, 2024, 7:18 PM

#

Does anyone have any recommendations where I can find more pretrains online? Besides the pretrain section on the server. I can’t seem to find any online

pastel oak Nov 19, 2024, 7:47 PM

#

coral frigate Does anyone have any recommendations where I can find more pretrains online? Bes...

there are none online its not a big thing

#

pretrains are not recommended anyway

glacial pollen Nov 19, 2024, 8:13 PM

#

brisk nova what do i need to do to run RVC

You see.. You are not meant to use rvc gui

#

No idea where you took that info from

brisk nova Nov 19, 2024, 8:13 PM

#

glacial pollen You see.. You are not meant to use rvc gui

why

glacial pollen Nov 19, 2024, 8:13 PM

#

erhm

#

#

Try to guess ( date )

#

You wanna pick either original rvc ( mainline ) or Applio

brisk nova Nov 19, 2024, 8:14 PM

#

soo its not released?

glacial pollen Nov 19, 2024, 8:14 PM

#

no, it is just simply outdated lol

brisk nova Nov 19, 2024, 8:14 PM

#

soo its aready dead?

glacial pollen Nov 19, 2024, 8:15 PM

#

🤦‍♂️

#

rvc gui =/= rvc

#

If I may ask, where did you find out about rvc gui, yt? or someone recommended you it?

#

'rvc gui' is outdated and not used anymore ( for a long while now, in fact )

brisk nova Nov 19, 2024, 8:16 PM

#

from this

#

https://github.com/Tiger14n/RVC-GUI/releases

#

someone recommended me it

glacial pollen Nov 19, 2024, 8:16 PM

#

well, rip my dude

brisk nova Nov 19, 2024, 8:16 PM

#

oh...

glacial pollen Nov 19, 2024, 8:16 PM

#

https://github.com/IAHispano/Applio
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/tree/main

brisk nova Nov 19, 2024, 8:17 PM

#

soo its no longer to install rvc?

glacial pollen Nov 19, 2024, 8:17 PM

#

Applio is a fork of rvc
whereas rvc one is the original

#

Pick whichever you want

brisk nova Nov 19, 2024, 8:17 PM

#

ok

#

thanks

glacial pollen Nov 19, 2024, 8:17 PM

#

Just please, carefully read the repository and instructions

brisk nova Nov 19, 2024, 8:17 PM

#

alright

glacial pollen Nov 19, 2024, 8:18 PM

#

azure marsh

Here's some useful info, docs and such just in case

#

@brisk nova
in case you got lost or something

#

Gluck ~

distant turtle Nov 19, 2024, 8:21 PM

#

-colab

azure marshBOT Nov 19, 2024, 8:22 PM

#

distant turtle -colab

brisk nova Nov 19, 2024, 10:02 PM

#

glacial pollen Here's some useful info, docs and such just in case

i done it, what do i need to do now

rare gobletBOT Nov 19, 2024, 10:02 PM

#

Ayo? @brisk nova level 2 !!! lfg

glacial pollen Nov 19, 2024, 10:02 PM

#

brisk nova i done it, what do i need to do now

I mean, everything is in the docs or on this server here n there

brisk nova Nov 19, 2024, 10:02 PM

#

glacial pollen I mean, everything is in the docs or on this server here n there

where is the doc

glacial pollen Nov 19, 2024, 10:03 PM

#

I literally attached the msg

brisk nova Nov 19, 2024, 10:03 PM

#

ok

glacial pollen Nov 19, 2024, 10:03 PM

#

cmon bro
#✨│ai-help message

brisk nova Nov 19, 2024, 10:03 PM

#

which one

glacial pollen Nov 19, 2024, 10:03 PM

#

just read

brisk nova Nov 19, 2024, 10:03 PM

#

glacial pollen Nov 19, 2024, 10:03 PM

#

focus

#

Don't be brainrot, just focus and you'll be good

brisk nova Nov 19, 2024, 10:04 PM

#

hummm

glacial pollen Nov 19, 2024, 10:04 PM

#

uhhh, I see you went the harder way

#

F

brisk nova Nov 19, 2024, 10:04 PM

#

but i downloaded these 2

glacial pollen Nov 19, 2024, 10:04 PM

#

?

brisk nova Nov 19, 2024, 10:04 PM

#

glacial pollen Nov 19, 2024, 10:05 PM

#

Well then go for applio, simple

brisk nova Nov 19, 2024, 10:05 PM

#

glacial pollen Nov 19, 2024, 10:06 PM

#

glacial pollen Just please, carefully read the repository and instructions

Look, no offense but, can you please read what's being said to you

#

#

Cmon, don't act like if you had no brain

#

nails

#

The instructions are literally on the repo, nah, even better, a pre-compiled package is there for you to download

#

Like, really, no offense but I am reading papers rn, coding and working on rvc upgrades and I can't afford to assist people who, despite being asked to, do not carefully read instructions or msgs. I am pretty sure that's understandable to a lot of people and someone has to say it out loud already.
Because if you asked me? If I see incompetent people trying to play with AI, I can only suspect huge tragedies in future.

marsh schooner Nov 19, 2024, 10:09 PM

#

what does ko-fi mean when looking to buy voice models

glacial pollen Nov 19, 2024, 10:12 PM

#

marsh schooner what does ko-fi mean when looking to buy voice models

Ko-fi is a platform where creators can receive support from their audience by allowing people to "buy them a coffee."

#

aka, you tip ( pay ) the creators, you get the models / commission someone for making them

#

same goes for paypal and such ( these are in use quite often too, but they're not exactly like ko-fi per se )

simple ore Nov 19, 2024, 10:15 PM

#

brisk nova

delete what you got and download a proper compiled release

brisk nova Nov 19, 2024, 10:17 PM

#

i dont understandddd

glacial pollen Nov 19, 2024, 10:18 PM

#

brisk nova i dont understandddd

https://github.com/IAHispano/Applio/releases

#

#

that's all you have to do ^

#

You can't get it any simpler man

brisk nova Nov 19, 2024, 10:25 PM

#

dude

#

i need to download more shit?

#

i download 1 i download 2 i download 3

#

how many times do i need to download?

#

all i want is just an AI voice

#

i already have my model

#

that my friend gave me

crude flame Nov 19, 2024, 10:28 PM

#

brisk nova dude

Maybe you shouldn’t be using Ai since you are having a hard time reading simple instructions

brisk nova Nov 19, 2024, 10:29 PM

#

crude flame Maybe you shouldn’t be using Ai since you are having a hard time reading simple ...

what do you mean by that?

#

theres 3 of them and the RVC

#

and this command

#

glacial pollen Nov 19, 2024, 10:31 PM

#

brisk nova what do you mean by that?

No man, this is a clear skill issue

#

You're given clear instructions
You're asked to carefully read what is meant to be read by you to avoid redundant questions in here
You download what you should not download and then make a problem out of it

#

You had one simple job. Read the instruction.
Then you'd know that **the only thing you're meant to download is " precompiled package " **
So please, don't get hysterical just because you can't read

brisk nova Nov 19, 2024, 10:34 PM

#

i will put them all in trash

#

what do i need to install first

glacial pollen Nov 19, 2024, 10:34 PM

#

boohooh you can't be helped, I am sorry for your loss.

brisk nova Nov 19, 2024, 10:35 PM

#

why

glacial pollen Nov 19, 2024, 10:35 PM

#

@red kayak save me 😭

#

lmao

red kayak Nov 19, 2024, 10:35 PM

#

glacial pollen @red kayak save me 😭

whaar

#

im here to save :3

glacial pollen Nov 19, 2024, 10:35 PM

#

basically, idk how to handle them

#

maybe you have some ideas

red kayak Nov 19, 2024, 10:36 PM

#

@brisk nova hey mate!

brisk nova Nov 19, 2024, 10:36 PM

#

red kayak @brisk nova hey mate!

what does that mean?

red kayak Nov 19, 2024, 10:36 PM

#

we've got guides available that give u a step by step tut on how to run RVC

brisk nova Nov 19, 2024, 10:36 PM

#

red kayak we've got guides available that give u a step by step tut on how to run RVC

alright

red kayak Nov 19, 2024, 10:37 PM

#

check those out first and then you can reach out for help if needed

brisk nova Nov 19, 2024, 10:37 PM

#

but what do i need to install first

rare gobletBOT Nov 19, 2024, 10:37 PM

#

Ayo? @brisk nova level 3 !!! lfg

brisk nova Nov 19, 2024, 10:38 PM

#

?

red kayak Nov 19, 2024, 10:39 PM

#

brisk nova ?

https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z

#

grab this

#

unzip it

glacial pollen Nov 19, 2024, 10:39 PM

#

Uhhh, don't think it's a good idea Litsa

red kayak Nov 19, 2024, 10:39 PM

#

then run the go-web.bat

glacial pollen Nov 19, 2024, 10:39 PM

#

if they can't read guides right, I doubt they can handle visual c and faiss

brisk nova Nov 19, 2024, 10:39 PM

#

i have one

red kayak Nov 19, 2024, 10:39 PM

#

glacial pollen if they can't read guides right, I doubt they can handle visual c and faiss

bOoOoOOoOoOoO

brisk nova Nov 19, 2024, 10:39 PM

#

red kayak Nov 19, 2024, 10:39 PM

#

We'll see

red kayak Nov 19, 2024, 10:40 PM

#

brisk nova

no bad

glacial pollen Nov 19, 2024, 10:40 PM

#

Again arg, do not use gui, as I said it's outdated and nobody uses it

red kayak Nov 19, 2024, 10:40 PM

#

download the one i gave ya

glacial pollen Nov 19, 2024, 10:40 PM

#

nails

brisk nova Nov 19, 2024, 10:40 PM

#

ok

red kayak Nov 19, 2024, 10:40 PM

#

glacial pollen Again arg, do not use gui, as I said it's outdated and nobody uses it

i havent seen that in a year ohh god

glacial pollen Nov 19, 2024, 10:40 PM

#

ikrrr

#

I still wonder who the heck recommends these as apparently it was an actual ' someone ' recommending them(?) the thing

red kayak Nov 19, 2024, 10:41 PM

#

glacial pollen I still wonder who the heck recommends these as apparently it was an actual ' so...

old yt vids im guessing

glacial pollen Nov 19, 2024, 10:41 PM

#

well maybe, but that makes me think

#

is there still no proper guides anywhere? like, 2024 edition

red kayak Nov 19, 2024, 10:41 PM

#

yeah none

glacial pollen Nov 19, 2024, 10:41 PM

#

oof, maybe I should legit do one sometime, would save everyone tons of time lol

red kayak Nov 19, 2024, 10:42 PM

#

glacial pollen oof, maybe I should legit do one sometime, would save everyone tons of time lol

owo yes please

brisk nova Nov 19, 2024, 10:58 PM

#

It's installing

crude flame Nov 19, 2024, 11:01 PM

#

glacial pollen is there still no proper guides anywhere? like, 2024 edition

oh there are people just dont read them

glacial pollen Nov 19, 2024, 11:01 PM

#

rip

knotty moth Nov 20, 2024, 12:56 AM

#

brisk nova

-gui

azure marshBOT Nov 20, 2024, 12:56 AM

#

knotty moth -gui

https://cdn.discordapp.com/attachments/1122285248844144733/1203460490475343953/caption.gif?ex=65d12cec&is=65beb7ec&hm=bd2fb8d010006dd7c6e3c1c67d3ae846fd1478e1a3124c544c31b43086fe54aa&

brisk nova Nov 20, 2024, 12:58 AM

#

azure marsh https://cdn.discordapp.com/attachments/1122285248844144733/1203460490475343953/c...

What

#

But its not ai Voice?

glacial pollen Nov 20, 2024, 1:15 AM

#

brisk nova But its not ai Voice?

You have to understand one simple thing
'Rvc gui' is outdated.
Nobody maintains it, nobody can promise if it's bug-free, there's most likely worse performance expectations and obviously, you lose features that are in new stuff

#

In fact, rvc gui in here is treated as a meme

brisk nova Nov 20, 2024, 1:27 AM

#

alright

brisk nova Nov 20, 2024, 1:28 AM

#

red kayak download the one i gave ya

i downloaded

#

what do i need to do

nova plaza Nov 20, 2024, 3:59 AM

#

how do I safely quit this, I just want to use the current Epoch 150, don't want to go further.
also how do I find the .pth and index file

simple ore Nov 20, 2024, 4:35 AM

#

nova plaza how do I safely quit this, I just want to use the current Epoch 150, don't want ...

did it save the model into assets/weights?
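
For locating the files, a small sketch that lists what is on disk. The `assets/weights` folder comes from the message above and the `added_` index prefix from earlier in the chat. Searching the whole install folder for index files is an assumption, and the function name is made up.

```python
from pathlib import Path


def find_model_files(rvc_root):
    """Return (pth_files, index_files) found under an RVC install folder."""
    root = Path(rvc_root)
    # Saved voice models, per the assets/weights hint above.
    weights = sorted((root / "assets" / "weights").glob("*.pth"))
    # Index files are assumed to start with "added_" and may sit in any subfolder.
    indexes = sorted(root.rglob("added_*.index"))
    return weights, indexes
```

Calling `find_model_files(r"C:\path\to\RVC")` with the real install path prints nothing by itself; loop over the two lists to see the full paths.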

coarse heron Nov 20, 2024, 8:59 AM

#

the AI cant hum well, is there a way to make it smoother or make it sound better?

flint solar Nov 20, 2024, 9:04 AM

#

coarse heron the AI cant hum well, is there a way to make it smoother or make it sound better...

What model were u using

coarse heron Nov 20, 2024, 9:07 AM

#

im using deitris, all the models can't hum well. I was wondering if i need to tweak something. cant be my mic either.

flint solar Nov 20, 2024, 9:08 AM

#

coarse heron im using deitris, all the models caant hum well. I was wondering if i need to tw...

Simply because the model wasn’t trained on humming

coarse heron Nov 20, 2024, 9:09 AM

#

sigh* sheesh... i kinda thought about it but didn't know that was indeed the case.

flint solar Nov 20, 2024, 9:09 AM

#

coarse heron sigh* sheesh... i kinda thought about it but didn't know that was indeed the cas...

A fix for this is training ur own

coarse heron Nov 20, 2024, 9:10 AM

#

i guess so... can i train existing models?

flint solar Nov 20, 2024, 9:46 AM

#

coarse heron i guess so... can i train existing models?

I meant u train with a dataset that you'll make and that will contain humming

coarse heron Nov 20, 2024, 9:47 AM

#

okay bro, I guess I'll go right ahead and learn about AI training kittystare

distant turtle Nov 20, 2024, 11:21 AM

#

-colab

azure marshBOT Nov 20, 2024, 11:21 AM

#

distant turtle -colab

gloomy flame Nov 20, 2024, 2:41 PM

#

can someone give me a tutorial on how to install the voice changer

glacial kiln Nov 20, 2024, 2:42 PM

#

Please how do i create a .PTH file and .INDEX file on the step that says upload a voice model.

I'm having difficulties doing that

low shard Nov 20, 2024, 2:43 PM

#

glacial kiln Please how do i create a .PTH file and .INDEX file on the step that says upload ...

Which RVC are u using

glacial kiln Nov 20, 2024, 2:44 PM

#

low shard Which RVC are u using

I'm installing locally

low shard Nov 20, 2024, 2:45 PM

#

glacial kiln I'm installing locally

Yeah which

#

Applio or Mainline?

glacial kiln Nov 20, 2024, 2:47 PM

#

low shard Yeah which

Mainline

rare gobletBOT Nov 20, 2024, 2:47 PM

#

Ayo? @glacial kiln level 2 !!! lfg

low shard Nov 20, 2024, 2:49 PM

#

glacial kiln Mainline

Did you firstly make the dataset

glacial kiln Nov 20, 2024, 2:49 PM

#

low shard Did you firstly make the dataset

I'm new to this.
how do i make dataset?

low shard Nov 20, 2024, 2:50 PM

#

glacial kiln I'm new to this. how do i make dataset?

Check this out first https://docs.ai-hub.wtf/rvc/resources/datasets/

Datasets

Last update: Mar 8, 2024

glacial kiln Nov 20, 2024, 2:51 PM

#

low shard Check this out first https://docs.ai-hub.wtf/rvc/resources/datasets/

ok

flint geyser Nov 20, 2024, 2:53 PM

#

mewo

tame mica Nov 20, 2024, 2:55 PM

#

https://tenor.com/view/pokemon-pokemon-first-movie-pokemon-the-first-movie-mewtwo-mewtwo-pokemon-gif-26672925

Tenor

tired raft Nov 20, 2024, 7:27 PM

#

this is the section to create an ai voice model?

glacial pollen Nov 20, 2024, 7:28 PM

#

yes

tired raft Nov 20, 2024, 7:28 PM

#

what do i do?

#

@glacial pollen

glacial pollen Nov 20, 2024, 7:31 PM

#

you should read the guides carefully

#

but generally, you do not touch that box

#

here's an example of how one could set it

brittle wing Nov 20, 2024, 7:31 PM

#

i cant send images for help :(

glacial pollen Nov 20, 2024, 7:31 PM

#

(1) you name the model, pick samplerate that fits your dataset, select version v2

(2) you put the path to your sample, for instance: "bla/bla/bla/your_sample.wav"

(3) feature / f0 extraction

(4) train index button

(5) set the saving frequency to 5 as well, not gonna go too deep into that.

(6) amount of epochs uhhh, well, again not gonna go into advanced details but, pick 200 or 100, maybe 300 if you have larger dataset

(7) batch size: you can try 4, 8, 12, 16 ( if your hardware can handle it, go for 8 or 16 )

for choices pick: yes, no, yes
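
The numbers from the steps above, written down as a checklist you can sanity-check before pressing train. This is only a sketch: the class and field names are hypothetical and are not RVC's own config keys, and the ranges simply restate the suggestions above.

```python
from dataclasses import dataclass


@dataclass
class TrainingSettings:
    model_name: str
    sample_rate: int            # should match the dataset
    version: str = "v2"         # step (1)
    save_every_epoch: int = 5   # step (5)
    total_epochs: int = 200     # step (6): 100-300 suggested
    batch_size: int = 8         # step (7): 4, 8, 12 or 16

    def warnings(self):
        """Return a list of notes where a value leaves the suggested range."""
        notes = []
        if self.version != "v2":
            notes.append("version v2 was suggested")
        if not 100 <= self.total_epochs <= 300:
            notes.append("100-300 epochs was suggested for a first model")
        if self.batch_size not in (4, 8, 12, 16):
            notes.append("batch size 4, 8, 12 or 16 was suggested")
        return notes
```

An empty list from `TrainingSettings("my_model", 40000).warnings()` means the values match the suggestions; it says nothing about dataset quality.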

brittle wing Nov 20, 2024, 7:31 PM

#

Running with the system Python.
Traceback (most recent call last):
  File "C:\Users\admin\Downloads\RVC-GUI-main\RVC-GUI-main\rvcgui.py", line 23, in <module>
    from vc_infer_pipeline import VC
  File "C:\Users\admin\Downloads\RVC-GUI-main\RVC-GUI-main\vc_infer_pipeline.py", line 1, in <module>
    import numpy as np, parselmouth, torch, pdb
ModuleNotFoundError: No module named 'parselmouth'

brittle wing Nov 20, 2024, 7:32 PM

#

brittle wing ``` Running with the system Python. Traceback (most recent call last): File "C...

what do i do for this

glacial pollen Nov 20, 2024, 7:34 PM

#

glacial pollen (1) you name the model, pick samplerate that fits your dataset, select version v...

@tired raft

#

That's all I can tell you as I am busy rn and can't go too indepth

glacial pollen Nov 20, 2024, 7:34 PM

#

brittle wing ``` Running with the system Python. Traceback (most recent call last): File "C...

You should not use rvcgui It is obsolete and nobody should use it

brittle wing Nov 20, 2024, 7:34 PM

#

glacial pollen You should not use rvcgui It is obsolete and nobody should use it

what is used nowadays?

glacial pollen Nov 20, 2024, 7:35 PM

#

Aside, for future.
If you see things like " ModuleNotFoundError: No module named 'parselmouth' "

#

module not found, that means you're lacking some python package or modules ( scripts )

#

typically you could try to install such with pip ( if applicable at given situation )
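
A minimal sketch of that check using only the standard library. It reports which top-level modules are missing and builds the pip command for each. The helper names are made up. The one mapping included reflects that `parselmouth` is published on PyPI as `praat-parselmouth`; other modules are assumed to share their import name with the package, which is not always true.

```python
import importlib.util

# Import name -> PyPI package name, for cases where the two differ.
PIP_NAMES = {"parselmouth": "praat-parselmouth"}


def missing_modules(names):
    """Return the top-level module names that cannot be imported."""
    # Only pass top-level names (no dots) to find_spec here.
    return [name for name in names if importlib.util.find_spec(name) is None]


def pip_command(module_name):
    """Build the pip command that would install the given module."""
    package = PIP_NAMES.get(module_name, module_name)
    return f"python -m pip install {package}"
```

Run the resulting command with the same Python that launches the app, otherwise the package lands in a different environment.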

tired raft Nov 20, 2024, 7:35 PM

#

glacial pollen @tired raft

im doing it with my friend. everything goes well for now

glacial pollen Nov 20, 2024, 7:35 PM

#

brittle wing what is used nowadays?

RVC or Applio for inferencing and training

rvc's built in real-time voice changer or w-okada for real-time voice changing

brittle wing Nov 20, 2024, 7:35 PM

#

where can i find any of those

glacial pollen Nov 20, 2024, 7:36 PM

#

<name> github

#

in google

brittle wing Nov 20, 2024, 7:36 PM

#

ok

glacial pollen Nov 20, 2024, 7:36 PM

#

But I'd rather recommend you applio as it's easier to set

brittle wing Nov 20, 2024, 7:36 PM

#

alright

glacial pollen Nov 20, 2024, 7:36 PM

#

https://huggingface.co/IAHispano/Applio/resolve/main/Compiled/Windows/ApplioV3.2.7.zip

#

@brittle wing

#

that's pretty much all for " easiest " option

brittle wing Nov 20, 2024, 7:38 PM

#

alrighty

rare gobletBOT Nov 20, 2024, 7:38 PM

#

Ayo? @brittle wing level 1 !!! lfg

tired raft Nov 20, 2024, 7:39 PM

#

#

@glacial pollen

#

thats for TB

#

what do we do?

#

@glacial pollen

rare gobletBOT Nov 20, 2024, 7:40 PM

#

Ayo? @tired raft level 3 !!! lfg

glacial pollen Nov 20, 2024, 7:41 PM

#

I'd appreciate if you did not spam the pings
I am busy working on my project as mentioned + I reply to many people rn

tired raft Nov 20, 2024, 7:41 PM

#

sorry

glacial pollen Nov 20, 2024, 7:41 PM

#

What's the issue?

tired raft Nov 20, 2024, 7:41 PM

#

the TB thing

#

doesn't work

tired raft Nov 20, 2024, 7:41 PM

#

glacial pollen I'd appreciate if you did not spam the pings I am busy working on my project as ...

yeah im sorry

glacial pollen Nov 20, 2024, 7:42 PM

#

no I meant, what's the issue in your case
as in, what happened

#

docs 404 ?

#

Not really from website / docs department so can't say much more but

tired raft Nov 20, 2024, 7:42 PM

#

tensorboard doesn't work

#

error 404

glacial pollen Nov 20, 2024, 7:42 PM

#

https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/

How to Make Voice Models

In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.

#

have you tried this one?

#

Unless it's the same one you use ( can't see well on the ss )

tired raft Nov 20, 2024, 7:44 PM

#

tired raft

i used the link you sent, and for the tensorboard thing there was a "here" link and it didn't work as u can see in this screenshot

glacial pollen Nov 20, 2024, 7:48 PM

#

In that case I can't help much

#

again, I am not responsible for the docs, nor did I take part in creating them, so I wouldn't know what happened and / or if it was moved somewhere

tired raft Nov 20, 2024, 7:49 PM

#

yeah i understand

#

thanks for the help though

glacial pollen Nov 20, 2024, 7:49 PM

#

Tensorboard is a very complex topic tho, so for now you're alright without it for your first model ( in my opinion at least )

ember bay Nov 20, 2024, 8:27 PM

#

how to download rvc?

#

w-okada isn't working for me

nocturne mural Nov 20, 2024, 8:38 PM

#

the same old method ( stealing accounts )

#

N_C_like

brittle wing Nov 20, 2024, 8:52 PM

#

#

what is this about

crude flame Nov 20, 2024, 9:04 PM

#

tired raft tensorboard doesn't work

where is this? Im fixing up the docs

#

nvm found it

tired raft Nov 20, 2024, 9:21 PM

#

The model is training rn

#

My friend did it

#

I think it's at 550 epochs rn (we put 1000)

nocturne mural Nov 20, 2024, 10:30 PM

#

brittle wing

https://huggingface.co/IAHispano/Applio/resolve/main/Compiled/Windows/ApplioV3.2.7.zip?download=true try the precompiled

oak edge Nov 21, 2024, 3:52 AM

#

what are these embedded models can anyone guide me through? (My training set is in a south asian language)

simple ore Nov 21, 2024, 8:09 AM

#

oak edge what are these embedded models can anyone guide me through? (My training set is ...

each one is trained specifically for the features of a particular language

oak edge Nov 21, 2024, 8:09 AM

#

soo

#

chinese means chinese lang training set?

simple ore Nov 21, 2024, 8:09 AM

#

a model trained with chinese hubert only works with chinese hubert for inference

oak edge Nov 21, 2024, 8:11 AM

#

wait so it isn't about language that im trying to convert

#

so I'm using training sets in tamil, and also converting voices to my training set voice in tamil only

simple ore Nov 21, 2024, 8:16 AM

#

one more time... a hubert feature extractor is a model that 'transcribes' speech into codes.. imagine one person writing down a chinese greeting as "ni hao ma" and another as "你好吗"

rare gobletBOT Nov 21, 2024, 8:16 AM

#

Ayo? @simple ore level 39 !!! lfg

simple ore Nov 21, 2024, 8:16 AM

#

which person does it better?

#

contentvec is a person transcribing any language into roman characters

#

close enough for most purposes

#

there's a custom hubert you can use for Tamil

#

https://huggingface.co/utter-project/mHuBERT-147

#

but again I have to say that if you do that, you have to use the same custom hubert for inference
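The "same embedder for training and inference" rule can be sketched as a simple guard. Everything here is hypothetical and only for illustration: `VoiceModel`, `check_embedder` and the model name are not part of RVC or Applio; the embedder strings are just labels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VoiceModel:
    """Hypothetical record of a trained model and the embedder it was trained with."""
    name: str
    embedder: str


def check_embedder(model: VoiceModel, inference_embedder: str) -> None:
    """Raise if inference would use a different feature extractor than training."""
    if model.embedder != inference_embedder:
        raise ValueError(
            f"{model.name} was trained with '{model.embedder}' features, "
            f"but inference is set to '{inference_embedder}'"
        )


if __name__ == "__main__":
    tamil_model = VoiceModel(name="my-tamil-voice", embedder="mHuBERT-147")
    check_embedder(tamil_model, "mHuBERT-147")  # same embedder: passes silently
    try:
        check_embedder(tamil_model, "contentvec")
    except ValueError as err:
        print(err)
```

In practice this just means writing down which embedder you picked when training and selecting that same one in the inference tab.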

coral frigate Nov 21, 2024, 9:06 AM

#

can someone help me figure out why this is the output im receiving when i try to train my model?

#

when i run the training i go to 1000 epochs in about 5 mins so theres something definitely wrong

simple ore Nov 21, 2024, 9:10 AM

#

that already indicates something is wrong, do not go ahead and train

#

figure out what is wrong first

#

and stop using mangio ffs

#

you did not preprocess properly

#

you did not extract features properly

#

you've trained 1000 epochs on two mute files at best
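A quick sanity check before training, as a rough sketch: count the WAV files in the dataset folder and add up their duration. The helper name `summarize_wavs` is made up, and this assumes plain PCM WAV files, since Python's built-in `wave` module cannot read float-format WAVs.

```python
import wave
from pathlib import Path


def summarize_wavs(folder):
    """Return (file_count, total_seconds) for the PCM .wav files in a folder."""
    count = 0
    seconds = 0.0
    for path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # Duration of one file = number of frames / frames per second.
            seconds += wav.getnframes() / wav.getframerate()
        count += 1
    return count, seconds
```

If this reports a couple of files or only a few seconds of audio for what should be hours of data, the problem is in the dataset or preprocessing step, and more epochs will not help.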

coral frigate Nov 21, 2024, 10:10 AM

#

simple ore and stop using mangio ffs

Yh I downloaded applio and im trying it on that instead now

low shard Nov 21, 2024, 10:37 AM

#

simple ore and stop using mangio ffs

@idle adder go update it smh

coral frigate Nov 21, 2024, 10:48 AM

#

i downloaded the newest version of applio and tried using it and im having the same issue. im wondering if it has to do with the size of my dataset (24hours) because its the first time ive faced this problem. ive made sure the audio is in the folder and that its linked properly. the preprocess takes about 40 minutes to complete which makes sense. my GPU is selected yet it still wont work and im not sure how else to troubleshoot this. im not trying to be annoying or dumb, im just still trying to figure all this out.

knotty moth Nov 21, 2024, 11:04 AM

#

coral frigate i downloaded the newest version of applio and tried using it and im having the s...

that preprocess time doesn't sound normal, it shouldn't be hella slow even on a crappy hdd. instead of using a single huge dataset file, I'd suggest following the audio labeling section in this guide: https://rentry.co/RVC-dataset-RX11

Creating Datasets for RVC Using iZotope RX11

In this guide I will be explaining how to use a "Paid Software" to clean audio for training models.
iZotope RX is known to be the software for denoising audio and the one used by "every" good model maker.
Compared to the previous guide with RX10, there has been drastic changes...

coral frigate Nov 21, 2024, 11:13 AM

#

knotty moth that preprocess sounds not normal, shouldnt be hell slow on even a crappy hdd. i...

Ow ok thank you. I assumed it was normal for it to take so long because it was such a big dataset. Thank you

brittle wing Nov 21, 2024, 12:20 PM

#

-colab

azure marshBOT Nov 21, 2024, 12:20 PM

#

brittle wing -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

dense drift Nov 21, 2024, 3:50 PM

#

.np

brazen gorgeBOT Nov 21, 2024, 3:50 PM

#

Last track for butnobodycamee

Detention

Melanie Martinez • K-12

dense drift Nov 21, 2024, 3:51 PM

#

/set last.fm

glacial pollen Nov 21, 2024, 3:52 PM

#

low shard @idle adder go update it smh

actually, a lot of what was mangio ended up in my fork lol

#

speaking of mangio.. the heck is up with Kalo 🤔 did he just stop all rvc or ai altogether?

dense drift Nov 21, 2024, 3:53 PM

#

,set last.fm

low shard Nov 21, 2024, 4:05 PM

#

glacial pollen speaking of mangio.. the heck is up with Kalo 🤔 did he just stop all rvc or ai ...

ig

glacial pollen Nov 21, 2024, 4:05 PM

#

a

low shard Nov 21, 2024, 4:06 PM

#

glacial pollen a

last time i talked with him he just said he does guitar now or smt

glacial pollen Nov 21, 2024, 4:06 PM

#

o, well, at least something he enjoys

#

so that's good def

pearl cove Nov 21, 2024, 5:36 PM

#

-rvc

azure marshBOT Nov 21, 2024, 5:36 PM

#

pearl cove -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

mint stag Nov 21, 2024, 6:42 PM

#

.

scarlet wedge Nov 21, 2024, 6:47 PM

#

how to use this?

#

how do i upload my voice model?

wild yoke Nov 21, 2024, 6:51 PM

#

is there a way to use a voice model to change the voice of an existing mp3 file? everywhere i looked doesn't seem to allow custom voice models.

brave garnetBOT Nov 21, 2024, 7:01 PM

#

Voicechanger Settings (Okada)

⠀

Settings for Nvidia GPUs

F0 Det.: rmvpe (suggested for all series)

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

⠀

glacial pollen Nov 21, 2024, 7:08 PM

#

wild yoke is there a way to use a voice model to change the voice of an existing mp3 file....

well, not quite in a literal sense / not in the way you probs intend to

#

what you wanna do instead is separation ( to obtain the extracted vocals and the music / background, whatever ), then inferencing ( aka changing the voice ), then combining it all back together
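The last step (combining the converted vocals with the instrumental) can be sketched like this. It's a bare-bones illustration on lists of float samples in the -1.0 to 1.0 range; `mix_tracks` and its gain parameters are made up here, and a DAW or audio editor does the same job with far more control.

```python
def mix_tracks(vocals, instrumental, vocal_gain=1.0, inst_gain=1.0):
    """Sum two mono tracks sample by sample, then scale down if the mix clips."""
    length = max(len(vocals), len(instrumental))
    mixed = []
    for i in range(length):
        # Treat the shorter track as silence once it runs out.
        v = vocals[i] if i < len(vocals) else 0.0
        m = instrumental[i] if i < len(instrumental) else 0.0
        mixed.append(vocal_gain * v + inst_gain * m)
    peak = max((abs(s) for s in mixed), default=0.0)
    if peak > 1.0:
        # Bring the loudest sample back to full scale to avoid clipping.
        mixed = [s / peak for s in mixed]
    return mixed


if __name__ == "__main__":
    converted_vocals = [0.5, 0.5, -0.25]
    backing_track = [0.25, -0.5]
    print(mix_tracks(converted_vocals, backing_track, vocal_gain=1.0, inst_gain=0.8))
```

Both tracks need the same sample rate and the same start point for this to line up, which is the case when they come out of the same separation run.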

wild yoke Nov 21, 2024, 7:09 PM

#

I have extracted vocals

#

and everything else

#

i just need to change the voice using a model, like from #1175430844685484042

#

idk where id do that, or if i can

rare gobletBOT Nov 21, 2024, 7:10 PM

#

Ayo? @wild yoke level 1 !!! lfg

glacial pollen Nov 21, 2024, 7:14 PM

#

wild yoke idk where id do that, or if i can

to change the voice you're using a model ( we call that process inferencing )
Now, where can you use the models? you see, these are RVC models, so logically, you'd use RVC or Applio ( think of it as custom rvc with few things here n there )

#

but given that handling rvc isn't really noob-friendly

#

https://huggingface.co/IAHispano/Applio/resolve/main/Compiled/Windows/ApplioV3.2.7.zip
https://github.com/IAHispano/Applio/releases

GitHub

Releases · IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance - IAHispano/Applio

wild yoke Nov 21, 2024, 7:15 PM

#

XD

#

thanks

glacial pollen Nov 21, 2024, 7:15 PM

#

what's so funny?

wild yoke Nov 21, 2024, 7:16 PM

#

the way you explained it to me. idk, not a bad thing

glacial pollen Nov 21, 2024, 7:16 PM

#

Trust me, there's too many people that can barely read
at this point I do it to avoid repeating myself

cobalt cairn Nov 21, 2024, 9:07 PM

#

anyone Russian here?

#

help

#

needed

#

the neural net isn't working

#

pls help

#

pls help ai not working:(

candid meteor Nov 21, 2024, 9:12 PM

#

can someone help?

#

SpongeCry

glacial pollen Nov 21, 2024, 9:43 PM

#

@candid meteor go ask on audio separation discord

candid meteor Nov 21, 2024, 9:43 PM

#

what

#

where can i find one

glacial pollen Nov 21, 2024, 9:43 PM

#

There are people responsible for uvr, bs-roformer and so on and so on

#

or at least those that work on these

candid meteor Nov 21, 2024, 9:43 PM

#

ok thanks

glacial pollen Nov 21, 2024, 9:45 PM

#

candid meteor ok thanks

can't send links to discord servs so, just search it up

#

In any case, they'll help you

candid meteor Nov 21, 2024, 9:46 PM

#

ok ill find it

flint solar Nov 21, 2024, 10:37 PM

#

candid meteor can someone help?

Just use mvsep.com

#

It has all the models u will need

rugged solar Nov 21, 2024, 11:50 PM

#

-audio

azure marshBOT Nov 21, 2024, 11:50 PM

#

rugged solar -audio

📚 Audio Guides & Tools

Creating Datasets for RVC using iZotope RX11, by Cauthess
Gathering and Isolating Audio, by SCRFilms ❄
Instrumental and vocal & stems separation & mastering guide, by deton24
Vocal Mixing Tutorial, by Roomie
https://mvsep.com/

rare gobletBOT Nov 21, 2024, 11:50 PM

#

Ayo? @rugged solar level 1 !!! lfg

brave garnetBOT Nov 22, 2024, 5:04 AM

#

RVC Colabs and Spaces

⠀

Google Colabs

⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.

AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, fixed by Eddy, Hina and Gdr.

RVC Disconnected
To train new voice models, by Kit Lemonfoot.

EasyGUI
The OG interface, by Rejekts.
⠀

#

Voicechanger Download (Okada)

⠀
Download for Nvidia GPUs
Version 18a cuda

Download for AMD GPUs
Version 18a directml

Download for Intel GPUs
Version 18a directml

Download for Mac
Version 17b Mac
⠀

brave garnetBOT Nov 22, 2024, 6:33 AM

#

Voicechanger Settings (Okada)

⠀

Settings for AMD GPUs

Don't forget that your models need to be converted to ONNX!

F0 Det.: rmvpe_onnx (suggested for all series)

7xxx XT cards: 112-128 chunk | +16384 extra
6xxx XT cards: 128-192 chunk | +16384 extra
5xxx XT cards: 192-256 chunk | +8192 extra

RX 580: 192-256 chunk | +8192 extra
RX 570: 192-256 chunk | +8192 extra
RX 560: 256-384 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

⠀

robust halo Nov 22, 2024, 6:39 AM

#

Anyone know why my RVC is talking garbage?

#

😭

#

@scenic gale

#

@brittle wing

#

Like what I'm hearing isn't what I'm saying.

olive vale Nov 22, 2024, 8:47 AM

#

Hello, my rvc is just saying "waiting generating pipeline and pipeline not installed" on a loop. how do i install the pipeline?

brave garnetBOT Nov 22, 2024, 8:48 AM

#

RVC Colabs and Spaces

⠀

HuggingFace Spaces 🤗

⠀
Ilaria RVC
EasyGUI port with some improvements, by Ilaria.

RVC-HFv2
Applio port, by r3gm.

AICoverGen
AICoverGen port, by r3gm.

Advanced RVC Inference
Extended version of the GUI with advanced settings, r3gm.
⠀

low shard Nov 22, 2024, 11:40 AM

#

olive vale Hello my rvc is just saying "waiting generating pipeline and pipeline not instal...

Try to delete pretrain and model_dir folders and relaunch.

#

Also next time use #🔍│help-w-okada

brittle wing Nov 22, 2024, 2:06 PM

#

What's the best model for echo removal?

#

On MVSEP

glacial pollen Nov 22, 2024, 2:09 PM

#

brittle wing What's the best model for echo removal?

sadly for pure echo ( not reverb ) there's not a single one that'd do the job without damaging the audio

#

as far as I know

brittle wing Nov 22, 2024, 2:09 PM

#

glacial pollen sadly for pure echo ( not reverb ) there's not single one that'd do the job wit...

Yeah but which one?

brittle wing Nov 22, 2024, 2:09 PM

#

glacial pollen sadly for pure echo ( not reverb ) there's not single one that'd do the job wit...

I know:(

flint solar Nov 22, 2024, 2:16 PM

#

I js use uvr normal de echo

#

Not de echo de reverb

brittle wing Nov 22, 2024, 2:28 PM

#

flint solar I js use uvr normal de echo

On which site

#

Mvsep or x-minus

flint solar Nov 22, 2024, 2:52 PM

#

brittle wing On which site

Mvsep

#

Uvr section

brittle wing Nov 22, 2024, 2:54 PM

#

flint solar Mvsep

Aggressive or normal? Also what do you use for reverb removal, and for noise?
Do you first remove reverb, then echo?

brave garnetBOT Nov 22, 2024, 2:56 PM

#

RVC Colabs and Spaces

⠀

Local Forks 🖥️

⠀
Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.

Applio
Simplified, suggested for all, by the Applio team.

RVC Studio
Simplified, suggested for all, by SayanoAI.

Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.

AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.

Replay
From the creators of weights.gg, excellent product for everyone.
⠀

brittle wing Nov 22, 2024, 3:02 PM

#

is it possible to just make my voice slightly deeper and change nothing else

tame mica Nov 22, 2024, 3:02 PM

#

wgat

#

why did it use k means for whatever reason

simple ore Nov 22, 2024, 3:28 PM

#

when you get past ~4000 slices it starts to combine features

flint solar Nov 22, 2024, 3:36 PM

#

brittle wing Aggressive or normal also what do you use for reverb removal also noise? You fir...

noraml

#

normal

brittle wing Nov 22, 2024, 3:37 PM

#

flint solar noraml

What's your dataset making procedure?

jaunty quarry Nov 22, 2024, 3:38 PM

#

i need help with something: if i set my graphics card for the voice changer it doesn't work, it only works with my CPU

flint solar Nov 22, 2024, 3:39 PM

#

brittle wing What's your dataset making procedure?

-bs roformer
-melband karaoke (if needed to separate lead from back vocals)
-bs roformer de reverb
-uvr de echo normal
-uvr denoise (0.5 aggressiveness)

brittle wing Nov 22, 2024, 3:39 PM

#

flint solar -bs roformer -melband karaoke (if needed to separate lead from back vocals) -bs ...

BSRoformer dereverb: use as is or extract vocals?

tame mica Nov 22, 2024, 3:39 PM

#

simple ore when you get past ~4000 slices it starts to combine features

icic

flint solar Nov 22, 2024, 3:40 PM

#

then on rx 10
-de click
-de crackle
-mouth de click

flint solar Nov 22, 2024, 3:40 PM

#

brittle wing BSRoformer derverb use as Is or extract vocals?

extract

brittle wing Nov 22, 2024, 3:40 PM

#

flint solar extract

Does this method work

flint solar Nov 22, 2024, 3:40 PM

#

brittle wing Does this method work

dats how i isolate my songs

brittle wing Nov 22, 2024, 3:40 PM

#

flint solar dats how i isolate my songs

And you got model maker?

flint solar Nov 22, 2024, 3:41 PM

#

brittle wing And you got model maker?

im model master

brittle wing Nov 22, 2024, 3:41 PM

#

flint solar im model master

So your method is very effective for dataset?

flint solar Nov 22, 2024, 3:41 PM

#

brittle wing So your method is very effective for dataset?

pretty much

brittle wing Nov 22, 2024, 3:41 PM

#

flint solar pretty much

Only with MVSep?

flint solar Nov 22, 2024, 3:41 PM

#

brittle wing Only with MVSep?

rx 10 and audacity

brittle wing Nov 22, 2024, 3:42 PM

#

flint solar rx 10 and audacity

Hm well will it still work without

#

I can use audacity but web version I'm on mobile

flint solar Nov 22, 2024, 3:42 PM

#

brittle wing Hm well will it still work without

its better to use rx 10

brittle wing Nov 22, 2024, 3:42 PM

#

Latest BSRoformer right?

flint solar Nov 22, 2024, 3:42 PM

#

brittle wing Latest BSRoformer right?

yes the august ver

brittle wing Nov 22, 2024, 3:43 PM

#

flint solar its better to use rx 10

I'm on mobile

flint solar Nov 22, 2024, 3:43 PM

#

brittle wing I can use audacity but web version I'm on mobile

use it to eq low end and noise gate ur audio

#

and resample to 32khz when exporting as wav

brittle wing Nov 22, 2024, 3:44 PM

#

flint solar and resample to 32khz when exporting as wav

Can goldwave be useful too?

flint solar Nov 22, 2024, 3:45 PM

#

brittle wing Can goldwave be useful too?

i never used it, u can use it if it does the job

brittle wing Nov 22, 2024, 3:45 PM

#

flint solar i never used it, u can use it if it does the job

Is the mvsep process enough?

flint solar Nov 22, 2024, 3:45 PM

#

brittle wing Is the mvsep process enough?

depends

#

what are u isolating on mvsep?

brittle wing Nov 22, 2024, 3:45 PM

#

flint solar depends

On what

brittle wing Nov 22, 2024, 3:45 PM

#

flint solar what are u isolating on mvsep?

Song

flint solar Nov 22, 2024, 3:46 PM

#

brittle wing Song

from who?

brittle wing Nov 22, 2024, 3:46 PM

#

flint solar from who?

Jhope, BTS
Why you asking?

flint solar Nov 22, 2024, 3:48 PM

#

brittle wing Is the mvsep process enough?

can't really say

brittle wing Nov 22, 2024, 3:48 PM

#

flint solar can't really say

What did you make models of?

flint solar Nov 22, 2024, 3:48 PM

#

brittle wing What did you make models of?

wym

brittle wing Nov 22, 2024, 3:49 PM

#

flint solar -bs roformer -melband karaoke (if needed to separate lead from back vocals) -bs ...

Melband karaoke: extract from mixture or vocals?

flint solar Nov 22, 2024, 3:49 PM

#

brittle wing Mel and karaoke-extract from mixture or vocals?

both are the same but from mixture is faster

brittle wing Nov 22, 2024, 3:50 PM

#

flint solar both are the same bu from mixture is faster

Extract from Vocals takes away from the lead vocals

glacial pollen Nov 22, 2024, 3:53 PM

#

flint solar use it to eq low end and noise gate ur audio

tho, doing the low-end filtering on your own is pretty much useless, as rvc already does it better with a butterworth high-pass filter ( cutting roughly 0-48 hz )

#

I'd opt more towards plosives handling

flint solar Nov 22, 2024, 3:55 PM

#

glacial pollen tho, doing filtering on your own is pretty much useless as rvc does it better wi...

frl?

#

i didn't know dat

glacial pollen Nov 22, 2024, 3:56 PM

#

Yea, what rvc does in preprocessing is

#

normalization ish, plus butterworth filtering of the very low hz

#

in fact, on the user end, the main things that should be kept in check are dynamics, noise and maybe a few other things ( aside from the obvious reverb and delay, tho slight reverb isn't really that destructive )
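To make the "butterworth filtering of low hz" part concrete, here is a small sketch of a Butterworth high-pass at 48 Hz in plain Python. It's a second-order biquad using the standard audio-EQ-cookbook coefficients with Q = 1/sqrt(2); the order and exact implementation RVC uses aren't stated here, so treat the function name and the order as assumptions made for illustration.

```python
import math


def butterworth_highpass(samples, sample_rate, cutoff_hz=48.0):
    """Second-order Butterworth high-pass (biquad, direct form I)."""
    w0 = 2.0 * math.pi * cutoff_hz / sample_rate
    cos_w0 = math.cos(w0)
    q = 1.0 / math.sqrt(2.0)  # Q that gives a Butterworth response
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    # Coefficients normalized by a0.
    b0 = (1.0 + cos_w0) / 2.0 / a0
    b1 = -(1.0 + cos_w0) / a0
    b2 = b0
    a1 = -2.0 * cos_w0 / a0
    a2 = (1.0 - alpha) / a0
    out = []
    x1 = x2 = y1 = y2 = 0.0
    for x0 in samples:
        y0 = b0 * x0 + b1 * x1 + b2 * x2 - a1 * y1 - a2 * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out


if __name__ == "__main__":
    sr = 16000
    # A constant (DC) offset is far below the cutoff, so it should die away.
    dc = butterworth_highpass([1.0] * sr, sr)
    print("DC residue after 1 s:", abs(dc[-1]))
    # A 1 kHz tone is far above the cutoff, so it should pass almost untouched.
    tone = [math.sin(2.0 * math.pi * 1000.0 * n / sr) for n in range(sr)]
    filtered = butterworth_highpass(tone, sr)
    print("1 kHz peak after filtering:", max(abs(s) for s in filtered[-1600:]))
```

The takeaway matches the chat: rumble and DC below ~48 Hz get removed either way, so a manual low cut adds little, while anything living above that cutoff is left for the user to handle.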

potent saffron Nov 22, 2024, 3:56 PM

#

how do i get weights cover ai to sound good? when i select a good voice it doesn't sound like them, some parts of the song do, not all. any tips?

rare gobletBOT Nov 22, 2024, 3:56 PM

#

Ayo? @potent saffron level 1 !!! lfg

glacial pollen Nov 22, 2024, 3:58 PM

#

Yet, given how plosives tend to reside at around 60-70 to 150 / 200 hz ( depending on the voice ), rvc's filter won't tackle that
so the user has to take care of it

flint solar Nov 22, 2024, 3:58 PM

#

glacial pollen in fact, on the user-end, main things that should be in-check is dynamics, noise...

how can i know if i should cut certain breaths in my dataset

glacial pollen Nov 22, 2024, 3:59 PM

#

flint solar how can i know if i should cut certain breaths in my dataset

tbh, you shouldn't
reason is, rvc slices the audio into 3 or 3.7 sec segments where there's always 0.3 or so secs of overlap ( between consecutive segments )

#

then each is normalized so, if breathing is captured in there, it'll be fine

#

Only case when you could get rid of such is when they're too contaminated with noise

#

where you suspect rvc would mistake it for noise
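The slicing described above can be sketched roughly as follows. The 3.7 s segment length and 0.3 s overlap come from the message; the function names, the 0.9 peak target and the exact boundary handling are assumptions for illustration, not RVC's actual preprocessing code.

```python
def slice_bounds(num_samples, sample_rate, segment_s=3.7, overlap_s=0.3):
    """Return (start, end) sample indices of overlapping segments."""
    if overlap_s >= segment_s:
        raise ValueError("overlap must be shorter than the segment")
    seg = round(segment_s * sample_rate)
    hop = round((segment_s - overlap_s) * sample_rate)
    bounds = []
    start = 0
    while start < num_samples:
        end = min(start + seg, num_samples)
        bounds.append((start, end))
        if end == num_samples:
            break
        # Step forward by less than a full segment so neighbours overlap.
        start += hop
    return bounds


def peak_normalize(segment, target=0.9):
    """Scale a segment so its loudest sample sits at the target level."""
    peak = max((abs(s) for s in segment), default=0.0)
    if peak == 0.0:
        return list(segment)  # silent segment: nothing to scale
    return [s * target / peak for s in segment]


if __name__ == "__main__":
    sr = 40000
    for start, end in slice_bounds(10 * sr, sr):
        print(f"{start / sr:.1f}s -> {end / sr:.1f}s")
```

Because every segment shares its edges with its neighbours and is levelled on its own, a breath that falls inside a segment is just part of that slice and doesn't need to be cut out by hand.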

flint solar Nov 22, 2024, 4:00 PM

#

glacial pollen tbh, you shouldn't reason is, rvc's slicing the audios into 3 or 3.7 sec segment...

good lord i used to cut every single breath at sum point 😭

glacial pollen Nov 22, 2024, 4:00 PM

#

F
I mean, you can always add some in, to the dataset but that sometimes might not be ideal

brittle wing Nov 22, 2024, 4:00 PM

#

flint solar how can i know if i should cut certain breaths in my dataset

BSRoformer dereverb sadly takes away from the vocals

flint solar Nov 22, 2024, 4:00 PM

#

brittle wing BSRoformer derverb sadly takes away from the vocals

its aggressive yeah

#

but its the best

glacial pollen Nov 22, 2024, 4:01 PM

#

for dereverb I'll always recommend a thing I use in fl ( an AI vst )

brittle wing Nov 22, 2024, 4:01 PM

#

flint solar its aggressive yeah

Yeah but what do I do if it takes from the vocals...

flint solar Nov 22, 2024, 4:01 PM

#

glacial pollen for dereverb I'll always recommend a thing I use in fl ( an AI vst )

is it the one similar to dialogue isolate?

glacial pollen Nov 22, 2024, 4:01 PM

#

nope

#

dialogue isolate is actually pretty bad

#

the vst is from waves

#

waves clarity vx dereverb pro

glacial pollen Nov 22, 2024, 4:02 PM

#

brittle wing Yeah but what do I do if it takes from the vocals...

is it long? the thing you're working with ( that has reverb )

flint solar Nov 22, 2024, 4:02 PM

#

brittle wing Yeah but what do I do if it takes from the vocals...

use mel but set preprocess as is

brittle wing Nov 22, 2024, 4:02 PM

#

flint solar use mel but set preprocess as is

Is that okay?

#

#

It's there too

glacial pollen Nov 22, 2024, 4:03 PM

#

Nevermind then lol
Wanted to dereverb it for you

brittle wing Nov 22, 2024, 4:04 PM

#

glacial pollen Nevermind then lol Wanted to dereverb it for you

? No need

flint solar Nov 22, 2024, 4:04 PM

#

brittle wing

mdx is bad

#

actually really bad

brittle wing Nov 22, 2024, 4:04 PM

#

Cause I ran out of minutes on X-Minus have to wait til next week

brittle wing Nov 22, 2024, 4:04 PM

#

flint solar actually really bad

Why

flint solar Nov 22, 2024, 4:04 PM

#

brittle wing Why

it adds noise

brittle wing Nov 22, 2024, 4:05 PM

#

flint solar it adds noise

YES

#

I mean yesss

#

I know

#

Mel use as is?

rare gobletBOT Nov 22, 2024, 4:05 PM

#

Ayo? @brittle wing level 17 !!! lfg

flint solar Nov 22, 2024, 4:05 PM

#

brittle wing Mel use as is?

yh

brittle wing Nov 22, 2024, 4:06 PM

#

flint solar yh

That stuff?

#

I remember it takes from the vocals too

flint solar Nov 22, 2024, 4:07 PM

#

brittle wing I remember it takes from the vocals too

u can use mdx if u want, but then youll have to deal with the mdx noise 😂

brittle wing Nov 22, 2024, 4:08 PM

#

flint solar u acn use mx if u want, but then youll have to deal with the mdx noise 😂

I have dealt w it already it's unfixable

#

Understand?

flint solar Nov 22, 2024, 4:08 PM

#

brittle wing I have dealt w it already it's unfixable

de reverb doesnt remove delay

brittle wing Nov 22, 2024, 4:08 PM

#

flint solar de reverb doesnt remove delay

I meant "dealt"

glacial pollen Nov 22, 2024, 4:09 PM

#

@flint solar check it out
Output isn't tiptop perfect as I gave it a very hard scenario to handle to showcase the performance ( reflections cranked up almost to max aside of reverb )

brittle wing Nov 22, 2024, 4:09 PM

#

The noise is stuck in embedded

brittle wing Nov 22, 2024, 4:10 PM

#

glacial pollen @flint solar check it out Output isn't tiptop perfect as I gave it a ve...

I'm on mobile

glacial pollen Nov 22, 2024, 4:10 PM

#

?

flint solar Nov 22, 2024, 4:10 PM

#

glacial pollen @flint solar check it out Output isn't tiptop perfect as I gave it a ve...

is it paid

glacial pollen Nov 22, 2024, 4:10 PM

#

well unfortunately yea, unless you know where to find stuff

flint solar Nov 22, 2024, 4:10 PM

#

glacial pollen well unfortunately yea, unless you know where to find stuff

fsfs

flint solar Nov 22, 2024, 4:12 PM

#

brittle wing The noise is stuck in embedded

i dont rlly understand what u mean

#

ur still de reverbing ur audio

brittle wing Nov 22, 2024, 4:12 PM

#

flint solar i dont rlly understand what u mean

I don't use computer

#

BSRoformer dereverb takes away very much from the lead vocals

flint solar Nov 22, 2024, 4:13 PM

#

brittle wing BSRoformer derverb takes away very much from the lead vocals

there is no diff between mobile and computer

#

its the same model

brittle wing Nov 22, 2024, 4:14 PM

#

flint solar its the same model

Uh I got confused

brittle wing Nov 22, 2024, 4:14 PM

#

glacial pollen @flint solar check it out Output isn't tiptop perfect as I gave it a ve...

I thought you sent this @flint solar

glacial pollen Nov 22, 2024, 4:14 PM

#

brittle wing I thought you sent this @flint solar

?? Yea I did send it but I don't get what you're on about

#

boohooh

#

Anyway, I'm about to start my work so, would appreciate no unnecessary @ s

barren fern Nov 22, 2024, 5:09 PM

#

hey can someone suggest me a google colab that works for making ai covers?

rare gobletBOT Nov 22, 2024, 5:09 PM

#

Ayo? @barren fern level 1 !!! lfg

brittle wing Nov 22, 2024, 5:14 PM

#

barren fern hey can someone suggest me a google colab that works for making ai covers?

https://docs.google.com/document/d/1dW9t5UxO6vga_dIg4chfnf6veaYYFXXyTKdQkDAirz4/edit?usp=drivesdk

brittle wing Nov 22, 2024, 5:17 PM

#

flint solar there is no diff between mobile and computer

UVR de-echo normal at what aggressiveness?

cedar nymph Nov 22, 2024, 5:17 PM

#

any idea how to fix: [Failed to fetch
TypeError: Failed to fetch] when trying to download a checkpoint

rare gobletBOT Nov 22, 2024, 5:17 PM

#

Ayo? @cedar nymph level 1 !!! lfg

flint solar Nov 22, 2024, 5:17 PM

#

brittle wing UVr deecho normal at what aggressiveness?

keep it at 0.3

brittle wing Nov 22, 2024, 5:17 PM

#

flint solar keep it at 0.3

Default?

flint solar Nov 22, 2024, 5:18 PM

#

yes

brittle wing Nov 22, 2024, 5:20 PM

#

flint solar yes

I used MDX23C Dereverb on the colab
Is that okay?

#

It's very aggressive there

flint solar Nov 22, 2024, 5:22 PM

#

brittle wing I used MDX23xc Dereverb on the colab Is that okay?

no

brittle wing Nov 22, 2024, 5:22 PM

#

flint solar no

Why

#

But bsroformer dereverb takes from the vocals

flint solar Nov 22, 2024, 5:27 PM

#

brittle wing But bsroformer dereverb takes from the vocals

u still got ur lead vocals bro

brittle wing Nov 22, 2024, 5:28 PM

#

flint solar u still got ur lead vocals bro

But not as full

glacial pollen Nov 22, 2024, 5:52 PM

#

brittle wing But not as full

You gotta accept the way it is
No currently existing method will give you 100% perfect or full vocals

roformer type models will give you the best result but might be aggressive compared to mdx.
Mdx models on the other hand are less aggressive ( typically do not damage the audio ) but their results are questionable and often quite poor

#

it is a matter of " pick your devil "

knotty moth Nov 22, 2024, 5:55 PM

#

brittle wing But bsroformer dereverb takes from the vocals

use the recent melroformer dereverb (not the archived one): https://huggingface.co/anvuew/dereverb_mel_band_roformer/tree/main

anvuew/dereverb_mel_band_roformer at main

#

MDX dereverb has worse quality imo

mellow nest Nov 22, 2024, 7:37 PM

#

hi i forgot how to use this stuff. when i download a model, what files do I need? right now im downloading a bin file, does that have the .pth in it?

flint solar Nov 22, 2024, 8:23 PM

#

mellow nest hi i forgot how to use this stuff when im download a model what files do I need?...

it has to have .pth and the added .index file
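A tiny sketch for checking a downloaded model folder for those two files. The helper name `find_model_files` is made up, and it only looks at the top level of the folder, so unzip the download first.

```python
from pathlib import Path


def find_model_files(folder):
    """Return (pth_files, index_files) found directly inside a folder."""
    root = Path(folder)
    pth_files = sorted(p.name for p in root.glob("*.pth"))
    index_files = sorted(p.name for p in root.glob("*.index"))
    return pth_files, index_files
```

If the first list comes back empty (for example when the download is only a .bin file), what you grabbed is most likely not an RVC voice model.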

knotty moth Nov 22, 2024, 10:38 PM

#

mellow nest hi i forgot how to use this stuff when im download a model what files do I need?...

that doesn't seem to be an RVC model, where did you get it from?

jaunty quarry Nov 22, 2024, 11:13 PM

#

is there a way to get less delay that works well? because when i use my voice changer it takes several seconds to work

simple veldt Nov 22, 2024, 11:48 PM

#

When I train a voice with RVC it generates the .pth file(s) but not the .index file(s). Is this a common issue?

brave garnetBOT Nov 22, 2024, 11:54 PM

#

RVC Colabs and Spaces

⠀

Google Colabs

⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.

AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, fixed by Eddy, Hina and Gdr.

RVC Disconnected
To train new voice models, by Kit Lemonfoot.

EasyGUI
The OG interface, by Rejekts.
⠀

rugged solar Nov 23, 2024, 12:15 AM

#

-uvr

azure marshBOT Nov 23, 2024, 12:15 AM

#

rugged solar -uvr

Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tools.

turbid root Nov 23, 2024, 1:38 AM

#

How to fix "RuntimeError: Error(s) in loading state_dict for SynthesizerTrnMs768NSFsid:"

#

In RVC Disconnected Training

#

I can't train cause of that error

dusty mural Nov 23, 2024, 1:56 AM

#

does anyone know that one website that is used for making ai covers? if anyone knows what im talking about please let me know

glacial pollen Nov 23, 2024, 1:59 AM

#

dusty mural does anyone know that one website that is used for making ai covers? if anyone k...

weights.gg ?

dusty mural Nov 23, 2024, 2:04 AM

#

thank u sm

glacial pollen Nov 23, 2024, 2:05 AM

#

dusty mural thank u sm

np ✨

brittle wing Nov 23, 2024, 2:06 AM

#

glacial pollen You gotta accept the way it is No current existing method will give you 100% per...

Nice

brittle wing Nov 23, 2024, 2:46 AM

#

knotty moth use the recent melroformer dereverb (not the archived one): https://huggingface....

I don't have a computer.Any Colab/alternative?

frosty python Nov 23, 2024, 2:53 AM

#

Hi, it might be the wrong place to ask this but i recently used applio and suddenly everytime i convert, there would be segments, i did restart and reset the output setting but it seems to stay there, is there anything i can do to convert without the segmentation?

Edit: Nevermind, it worked by simply converting and not opening the output tab, thank you all

robust halo Nov 23, 2024, 3:35 AM

#

@glacial pollen

#

Is there something not laggy?

glacial pollen Nov 23, 2024, 3:37 AM

#

robust halo @glacial pollen

mh?

#

the heck you talk about

tropic nymph Nov 23, 2024, 4:04 AM

#

hey everyone why is my voice so glitchy

knotty moth Nov 23, 2024, 4:05 AM

#

brittle wing I don't have a computer.Any Colab/alternative?

it is included in my tweaked version of MSST colab: #1159290752195633273 message
it's not included in the original jarredou's but you can also add it in this code part:

brittle wing Nov 23, 2024, 4:05 AM

#

knotty moth it is included in my tweaked version of MSST colab: https://discord.com/channels...

Can you link the colab I lost it also at what settings

knotty moth Nov 23, 2024, 4:06 AM

#

brittle wing Can you link the colab I lost it also at what settings

mine or jarredou's colab?

brittle wing Nov 23, 2024, 4:07 AM

#

knotty moth mine or jarredou's colab?

Got the link, what are the settings

tropic nymph Nov 23, 2024, 4:07 AM

#

tropic nymph hey everyone why is my voice so glitchy

hey can I still get help with this

brittle wing Nov 23, 2024, 4:07 AM

#

The more aggressive or less

knotty moth Nov 23, 2024, 4:09 AM

#

brittle wing The more aggressive or less

use the normal one, and you can leave these settings (or overlap 8 for faster processing).
chunk value 0 means it uses the config file's setting

brittle wing Nov 23, 2024, 4:09 AM

#

knotty moth use the normal one, and you can leave these settings (or overlap 8 for faster pr...

Chunk size zero?Why

knotty moth Nov 23, 2024, 4:10 AM

#

brittle wing Chunk size zero?Why

yea it uses the config setting
for melrofo dereverb it is 352800
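
The "0 means use the config" behaviour described above can be pictured as a simple fallback. This is only a sketch: `resolve_chunk_size` and the dictionary layout are hypothetical names made up for illustration, not the Colab's actual code. The only value taken from the chat is 352800 for the mel-roformer dereverb config.

```python
def resolve_chunk_size(user_value: int, model_config: dict) -> int:
    """Return the chunk size to use for inference.

    A user value of 0 is treated as "not set", so the value stored in the
    model's config is used instead (hypothetical config layout).
    """
    if user_value < 0:
        raise ValueError("chunk size cannot be negative")
    return user_value if user_value > 0 else model_config["chunk_size"]


# Value quoted in the chat for the mel-roformer dereverb model.
dereverb_config = {"chunk_size": 352800}

print(resolve_chunk_size(0, dereverb_config))       # falls back to the config
print(resolve_chunk_size(485100, dereverb_config))  # explicit override wins
```

So leaving the field at 0 should be equivalent to typing in the config's own value by hand.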

#

brittle wing Nov 23, 2024, 4:10 AM

#

knotty moth yea it uses the config setting for melrofo dereverb it is 352800

Nice how do I make a dataset through using this colab

#

I mean what steps and models

#

What are the actual settings for denoising

knotty moth Nov 23, 2024, 4:11 AM

#

brittle wing Nice how do I make a dataset through using this colab

you can follow this datasetting guide: https://rentry.co/RVC-dataset-RX11
(use vpn if you can't load some contents)

Creating Datasets for RVC Using iZotope RX11

In this guide I will be explaining how to use a "Paid Software" to clean audio for training models.
iZotope RX is known to be the software for denoising audio and the one used by "every" good model maker.
Compared to the previous guide with RX10, there has been drastic changes...

brittle wing Nov 23, 2024, 4:12 AM

#

knotty moth you can follow this datasetting guide: https://rentry.co/RVC-dataset-RX11 (use v...

What should I tick/untick on your Colab

#

#

Use test time augmentation?

knotty moth Nov 23, 2024, 4:18 AM

#

brittle wing What should I tick/untick on your Colab

- extract_instrumental: includes inversion of the target stem
- use_modelname: the model name is included in the output file name (Model Test Mode in UVR)
- use_modelconf: some config params (overlap & chunk_size) are included in the output file name
- use_customconfig: will use the custom config below it
not 100% sure but I think TTA is not really necessary

brittle wing Nov 23, 2024, 4:18 AM

#

knotty moth - extract_instrumental: includes inversion of the target stem - use_modelname: t...

Yeah but what settings should I leave

#

Use custom configuration or not?

knotty moth Nov 23, 2024, 4:20 AM

#

brittle wing Yeah but what settings should I leave

yes, unless you want to use overlap 2 from the model's config file

brittle wing Nov 23, 2024, 4:21 AM

#

knotty moth yes, unless you want to use overlap 2 from the model's config file

What settings should I leave ticked and which ones unticked?

#

That's what I'm asking

knotty moth Nov 23, 2024, 4:22 AM

#

brittle wing What settings should I leave ticked and which ones unticked?

refer to the pic I gave you above
https://cdn.discordapp.com/attachments/1159290139609137264/1309734764177133598/image.png?ex=6742a90d&is=6741578d&hm=f6386a03983b18f2796f558dd7eb8b0d23a8575effdfd979d19fd5924da20233&

brittle wing Nov 23, 2024, 4:22 AM

#

knotty moth refer to the pic I gave you above https://cdn.discordapp.com/attachments/1159290...

Only use custom configuration?

#

@knotty moth is this correct

#

knotty moth Nov 23, 2024, 4:26 AM

#

brittle wing

it's okay

brittle wing Nov 23, 2024, 4:26 AM

#

knotty moth it's okay

Are the settings correct?

#

You said chunk size 352800, Mel roformer dereverb normal

#

For reverb removal

knotty moth Nov 23, 2024, 4:27 AM

#

brittle wing You said chunk size 352800, Mel roformer dereverb normal

yes

brittle wing Nov 23, 2024, 4:27 AM

#

knotty moth yes

Nice what are the settings for denoising

knotty moth Nov 23, 2024, 4:28 AM

#

brittle wing Nice what are the settings for denoising

you can use the same settings

brittle wing Nov 23, 2024, 4:28 AM

#

knotty moth you can use the same settings

Which model

#

1 or 2?

knotty moth Nov 23, 2024, 4:29 AM

#

brittle wing Which model

the normal mel denoise (1)

brittle wing Nov 23, 2024, 4:29 AM

#

knotty moth the normal mel denoise (1)

Uh right

#

Thanks

robust halo Nov 23, 2024, 4:31 AM

#

glacial pollen mh?

Is there an ai voice changer that doesn’t lag as much?

#

RX 570.

glacial pollen Nov 23, 2024, 4:33 AM

#

robust halo Is there an ai voice changer that doesn’t lag as much?

well, rx 570 isn't anything beefy really

#

lagging is to be expected

#

the alternative you have is to just go for onnx and w-okada, but with quite a lot of latency, so sadly yea, you won't get any actual 'realtime' experience

brittle wing Nov 23, 2024, 4:54 AM

#

knotty moth the normal mel denoise (1)

For deecho

knotty moth Nov 23, 2024, 5:09 AM

#

brittle wing For deecho

you can use UVR de-echo in mvsep

rare gobletBOT Nov 23, 2024, 5:09 AM

#

Ayo? @knotty moth level 47 !!! lfg

brittle wing Nov 23, 2024, 5:10 AM

#

knotty moth you can use UVR de-echo in mvsep

How much aggressiveness setting

#

Cuz 0.3 still leaves out

knotty moth Nov 23, 2024, 5:13 AM

#

brittle wing How much aggressiveness setting

if you tried higher values: 0.5 and 0.7, and still the same, I don't think there's any good de-echo model yet

brittle wing Nov 23, 2024, 5:13 AM

#

knotty moth if you tried higher values: 0.5 and 0.7, and still the same, I don't think there...

The one on x-minus *cough* but I ran out of minutes, I have to wait til next week

knotty moth Nov 23, 2024, 5:18 AM

#

brittle wing For deecho

btw I have been working on mostly good ol' rock & metal songs, barely on the modern 10's and 20's pop songs that may contain such difficult echoes

brittle wing Nov 23, 2024, 5:18 AM

#

knotty moth btw I have been working on mostly good ol' rock & metal songs, barely on the mod...

And?

knotty moth Nov 23, 2024, 5:19 AM

#

brittle wing And?

shrug

brittle wing Nov 23, 2024, 5:20 AM

#

knotty moth shrug

You make datasets out of these?

knotty moth Nov 23, 2024, 5:20 AM

#

brittle wing You make datasets out of these?

nah just for making covers

brittle wing Nov 23, 2024, 5:21 AM

#

knotty moth nah just for making covers

How do you remove the echo

knotty moth Nov 23, 2024, 5:21 AM

#

and also sometimes love live and weeb songs

#

the echoes are still quite easily removed by UVR/dereverb models

brittle wing Nov 23, 2024, 5:21 AM

#

knotty moth and also sometimes love live and weeb songs

How do you prepare your samples for inferencing

brittle wing Nov 23, 2024, 5:22 AM

#

knotty moth the echoes are still quite easily removed by UVR/dereverb models

Yes but at 0.3 aggressiveness?

knotty moth Nov 23, 2024, 5:22 AM

#

brittle wing Yes but at 0.3 aggressiveness?

yeah, the equivalent value in UVR gui is 30 (of 100)

brittle wing Nov 23, 2024, 5:22 AM

#

UVr Denoiser at 0.5 or Mel-roformer denoise 1?

knotty moth Nov 23, 2024, 5:23 AM

#

brittle wing UVr Denoiser at 0.5 or Mel-roformer denoise 1?

the latter, and it's fullband

brittle wing Nov 23, 2024, 5:23 AM

#

knotty moth yeah, the equivalent value in UVR gui is 30 (of 100)

I know that, I'm very familiar w UVr architecture it's the best one

brittle wing Nov 23, 2024, 5:23 AM

#

knotty moth the latter, and it's fullband

Mel

pastel kiln Nov 23, 2024, 6:07 AM

#

so how do i making ai covers guys

knotty moth Nov 23, 2024, 6:07 AM

#

pastel kiln Nov 23, 2024, 6:08 AM

#

🇬 🅰️ 🇾

#

GUYS HOW

rare gobletBOT Nov 23, 2024, 6:09 AM

#

Ayo? @pastel kiln level 1 !!! lfg

frosty python Nov 23, 2024, 6:39 AM

#

the easiest ive had experience with is Applio, just run the bat and you can even train model yourself if you have the resource

rare gobletBOT Nov 23, 2024, 6:39 AM

#

Ayo? @frosty python level 2 !!! lfg

knotty moth Nov 23, 2024, 7:24 AM

#

brittle wing You said chunk size 352800, Mel roformer dereverb normal

I found a secret sauce: when I tried on kim's melroformer, chunk_size = 485100 turns out to be the optimal one since it corresponds to dim_t = 1101. it is also used in unwa's models in their config file, and I think it should also apply to other roformer models, yea including the dereverb & denoise models as well.

@brittle wing notice this also..
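
One way to see why 485100 pairs with dim_t = 1101: if the model's STFT hop length is 441 samples (an assumption here, check the hop length in the model's own config), then chunk_size = hop_length * (dim_t - 1). The sketch below only does that arithmetic at an assumed 44.1 kHz sample rate. The function names are made up for illustration, and other model configs may not follow the same relation.

```python
SAMPLE_RATE = 44100  # assumed sample rate
HOP_LENGTH = 441     # assumed STFT hop length; verify against the model config


def chunk_size_from_dim_t(dim_t: int, hop_length: int = HOP_LENGTH) -> int:
    """Samples per chunk that correspond to dim_t STFT frames."""
    return hop_length * (dim_t - 1)


def dim_t_from_chunk_size(chunk_size: int, hop_length: int = HOP_LENGTH) -> int:
    """Inverse mapping; the chunk size must be a multiple of hop_length."""
    if chunk_size % hop_length != 0:
        raise ValueError("chunk_size is not a multiple of hop_length")
    return chunk_size // hop_length + 1


# The two chunk sizes mentioned in the chat.
for chunk in (352800, 485100):
    frames = dim_t_from_chunk_size(chunk)
    seconds = chunk / SAMPLE_RATE
    print(f"chunk_size={chunk} -> dim_t={frames}, {seconds:.1f} s of audio")
```

Under these assumptions the config default of 352800 lines up with dim_t = 801 in the same way.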

brittle wing Nov 23, 2024, 7:25 AM

#

knotty moth I found a secret sauce: when I tried on kim's melroformer, chunk_size = 485100 t...

Imo this is the best dereverb model it doesn't take away from the vocals as much as the BSRoformer one

#

You're smart

polar raft Nov 23, 2024, 7:26 AM

#

hey , i js started to use the voice changer , is there any way for it to mute the app so i dont hear myself echoing on a call

rare gobletBOT Nov 23, 2024, 7:26 AM

#

Ayo? @polar raft level 2 !!! lfg

tame mica Nov 23, 2024, 7:28 AM

#

pastel kiln 🇬 🅰️ 🇾

-rvc

azure marshBOT Nov 23, 2024, 7:28 AM

#

tame mica -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

tame mica Nov 23, 2024, 7:29 AM

#

^

hallow thistle Nov 23, 2024, 7:29 AM

#

tame mica Nov 23, 2024, 7:30 AM

#

agfdsgh

knotty moth Nov 23, 2024, 7:50 AM

#

brittle wing Imo this is the best dereverb model it doesn't take away from the vocals as much...

yea lol feels like I should apply these new methods on some cover songs and past vtuber datasets I have been working on, plus unwa's inst v1e is goated for my covers

brittle wing Nov 23, 2024, 7:52 AM

#

@knotty moth is it okay if I use BSRoformer for Acapella and then the Mel roformer karaoke model for lead vocals, and after that I use Mel roformer dereverb normal in the colab, then UVR de-echo at 0.3 and Mel roformer Denoiser 1 in the Colab? Will that help me get model maker

brittle wing Nov 23, 2024, 7:52 AM

#

knotty moth yea lol feels like I should apply these new methods on some cover songs and past...

Yes you should

#

@alisa how do you prepare your samples for inference tho

shell zodiac Nov 23, 2024, 7:54 AM

#

I trained a voice and i have a G_2333333 and a D_233333 Data and i cant use it in RVC GUI why?

knotty moth Nov 23, 2024, 7:58 AM

#

brittle wing @knotty moth is it okay if I use BSRoformer for Acapella and then the Mel...

kim's melroformer is my primary choice for vocals, though I may also consider beta4 for a bit more fullness or mvsep/unwa's BS rofo as secondary one to mitigate some bleeds (personally some hip-hop/rap, k-pop and weeb songs may be little more difficult to deal with)

brittle wing Nov 23, 2024, 7:59 AM

#

knotty moth kim's melroformer is my primary choice for vocals, though I may also consider be...

Mhm the method I listed actually works

#

For noise UVR Denoiser or Mel Denoiser?one last question

knotty moth Nov 23, 2024, 7:59 AM

#

brittle wing @alisa how do you prepare your samples for inference tho

after vocal extraction, dereverb and denoise as usual, also I used renegate as final process for both cover & dataset making

knotty moth Nov 23, 2024, 8:00 AM

#

brittle wing For noise UVR Denoiser or Mel Denoiser?one last question

melband one, and plus renegate plugin (idk if there's similar one for mobile)

brittle wing Nov 23, 2024, 8:00 AM

#

knotty moth after vocal extraction, dereverb and denoise as usual, also I used renegate as f...

Denoise mhm I have a bandlab preset that makes vocals realistic and filters out noise

brittle wing Nov 23, 2024, 8:01 AM

#

knotty moth melband one, and plus renegate plugin (idk if there's similar one for mobile)

Seriously isn't UVr Denoise better

#

It actually adds its own noise in the output

#

It has been proven I remember thinking of it and someone posted proof of that through spectrogram analysis

#

YES

#

So Mel roformer Denoise for dataset and stuff

#

But Uvr Denoise on uvronline is the best at times it doesn't add noise or does it ...

#

But yeah UVr Denoise is good for denoising UVr outputs of instrumentals.

knotty moth Nov 23, 2024, 8:05 AM

#

brittle wing Seriously isn't UVr Denoise better

I feel like UVR denoise is such an old method (though many novices may still use it); it has a 17.5 kHz cutoff and also somewhat aggressively removes quiet voices (including whispering) under -27 dB
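
To put the -27 dB figure in perspective: a level in dB relative to full scale converts to linear amplitude as 10^(dB/20). The helper below is a generic conversion written for illustration, not anything taken from UVR itself.

```python
import math


def db_to_amplitude(level_db: float) -> float:
    """Convert a level in dB (relative to full scale = 1.0) to linear amplitude."""
    return 10.0 ** (level_db / 20.0)


def amplitude_to_db(amplitude: float) -> float:
    """Convert a positive linear amplitude back to dB."""
    if amplitude <= 0:
        raise ValueError("amplitude must be positive")
    return 20.0 * math.log10(amplitude)


threshold = db_to_amplitude(-27.0)
print(f"-27 dB is about {threshold:.4f} of full scale")
```

In other words, anything peaking below roughly 4.5% of full scale sits under that threshold, which is why whispers are at risk.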

#

nevertheless, imo Renegate plugin can act as a final denoising process (it works as noise gating but in smarter way)

brittle wing Nov 23, 2024, 8:07 AM

#

knotty moth I feel like UVR denoise is such an old method (though many novices may still use...

But I like it UVr architecture is my personal favorite

#

It's the best but noisy

#

But I wouldn't use that model for Denoise cause it generates its own noise! For real, I noticed it myself and there was even a post w proof

knotty moth Nov 23, 2024, 8:12 AM

#

brittle wing But I wouldn't use that model for Denoise cause it generates its own noise! For ...

ig the noise it may add is just chunk artifacts, but the updated MSS colab should eliminate it by internally using batch_size = 2 instead of 1 in the model config file

brittle wing Nov 23, 2024, 8:15 AM

#

@knotty moth stop suggesting me unwa models for Acapella they make the lead vocals sound muffled, I tried duality V1 and just no and the dereverb eats them out even more.
I prefer the official BSRoformer model made by the true developers, thanks

knotty moth Nov 23, 2024, 8:23 AM

#

brittle wing @knotty moth stop suggesting me unwa models for Acapella they make the ...

the vocals in even mvsep and viperx 1296/1297 BS roformer are not less muffled than kim's melrofo, I haven't found truly full vocals that are better than unwa's. you may also try Lew's vocal enhancer though may not be ideal for dataset making.
personally I haven't made datasets using song tracks though.

brittle wing Nov 23, 2024, 8:25 AM

#

knotty moth the vocals in even mvsep and viperx 1296/1297 BS roformer are not less muffled t...

No the vocals enhancer adds noise even more to deal with
Which model by unwa are you talking about

#

????

knotty moth Nov 23, 2024, 8:26 AM

#

brittle wing No the vocals enhancer adds noise even more to deal with Which model by unwa ar...

beta4 and duality v1, though the latter has more noise

brittle wing Nov 23, 2024, 8:26 AM

#

knotty moth beta4 and duality v1, though the latter has more noise

Duality v1 makes my dataset sound muffled as hell 🤦

#

Beta 4 oh the background vocals and reverb are very stubborn

#

These models don't work for me.

#

If they work for you fine.

knotty moth Nov 23, 2024, 8:29 AM

#

brittle wing Duality v1 makes my dataset sound muffled as hell 🤦

ig you'd say the same thing on kim's melrofo? (as I judged on the high-end fullness)
though true that BS rofo 2024.08 is quite solid

brittle wing Nov 23, 2024, 8:30 AM

#

knotty moth ig you'd say the same thing on kim's melrofo? (as I judged on the high-end fulln...

I don't use Kim's Mel roformer for datasets also it's a good one for instrumentals why

#

BSRoformer is the best.

knotty moth Nov 23, 2024, 8:34 AM

#

brittle wing BSRoformer is the best.

tbh it is my SOTA in early to mid 2024 (2024.04 then .08)
I wish it could be eventually downloadable and usable for UVR and MSST colab

brittle wing Nov 23, 2024, 8:35 AM

#

knotty moth tbh it is my SOTA in early to mid 2024 (2024.04 then .08) I wish it could be eve...

I know.

#

They say they don't plan on releasing it

knotty moth Nov 23, 2024, 9:08 AM

#

brittle wing The one on x-minus*cough* but I ran out of minutes I have to wait til next week

ah I see x-minus server is unstable rn 💀

brittle wing Nov 23, 2024, 9:08 AM

#

Nah I ran out of minutes tho

stoic forum Nov 23, 2024, 10:20 AM

#

Hi I'm new to applio and I was wondering if there's some way to tie settings to a voice model. Specifically TTS Voice, TTS Speed, and the Pitch.

#

Like is there maybe some kind of settings file that could be made and put in the model folder that could set it up?

low shard Nov 23, 2024, 10:31 AM

#

stoic forum Hi I'm new to applio and I was wondering if there's some way to tie settings to ...

You can't really force it to be tied to that specific settings for tts

stoic forum Nov 23, 2024, 10:32 AM

#

Rip

tame mica Nov 23, 2024, 10:37 AM

#

stoic forum Rip

morenatsu shocked

stoic forum Nov 23, 2024, 10:37 AM

#

I've been found

rare gobletBOT Nov 23, 2024, 10:37 AM

#

Ayo? @stoic forum level 1 !!! lfg

fallow linden Nov 23, 2024, 10:39 AM

#

Hello

rare gobletBOT Nov 23, 2024, 10:39 AM

#

Ayo? @fallow linden level 1 !!! lfg

fallow linden Nov 23, 2024, 10:40 AM

#

I forgot how to make ai covers soo... it's been a long time since I last did it

low shard Nov 23, 2024, 10:59 AM

#

fallow linden I forgot how to make ai covers soo... it's been a long time since I last did it

What's your PC GPU?

stoic forum Nov 23, 2024, 11:06 AM

#

I've noticed a few voice models come with a "trained" and an "added" index file. What is the difference between them?

azure marshBOT Nov 23, 2024, 11:06 AM

#

stoic forum I've noticed a few voice models come with a "trained" and an "added" index file....

Faiss Integration (.index file): The Faiss library enables efficient approximate nearest neighbor search in RVC during inference, retrieving and combining training audio segments with the closest embeddings. For your final RVC model, include the index whose file name starts with "added".

Example: added_IVF157_Flat_nprobe_myModel.index
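
The bot's rule of thumb (keep the index whose name starts with "added") is easy to apply by filtering file names. This is a minimal sketch: `pick_added_index` is a hypothetical helper, and the "trained_..." and ".pth" names in the sample list are invented to mirror the example above.

```python
def pick_added_index(file_names: list[str]) -> list[str]:
    """Return the .index files whose names start with 'added', sorted by name."""
    return sorted(
        name
        for name in file_names
        if name.startswith("added") and name.endswith(".index")
    )


# Hypothetical contents of a downloaded model folder.
files = [
    "myModel.pth",
    "trained_IVF157_Flat_nprobe_myModel.index",
    "added_IVF157_Flat_nprobe_myModel.index",
]

print(pick_added_index(files))
```

The same filter also tells you when a download is incomplete: an empty result means there is no "added" index to load for inference.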

unreal yarrow Nov 23, 2024, 11:49 AM

#

Hey guys I just joined and how da heck do I get the list of existing models, voice models shows no list. TY HUGGZ

golden walrus Nov 23, 2024, 12:22 PM

#

kittypawbite guys, why do my models always have a quiet voice, like barely hear it unless i put it 200% output

flint solar Nov 23, 2024, 12:36 PM

#

stoic forum I've noticed a few voice models come with a "trained" and an "added" index file....

the added index is the one used for inference

brittle wing Nov 23, 2024, 1:43 PM

#

@quasi gate what's w the moon sounds

#

-colab

azure marshBOT Nov 23, 2024, 1:45 PM

#

brittle wing -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

simple ore Nov 23, 2024, 1:49 PM

#

golden walrus kittypawbite guys, why do my models always have a quiet v...

fix the volume in the trained data

golden walrus Nov 23, 2024, 1:58 PM

#

simple ore fix the volume in the trained data

so i need to fix the sound file before training right ?

low shard Nov 23, 2024, 2:08 PM

#

@brittle wing do not send datasets here for copyright reasons

brittle wing Nov 23, 2024, 2:08 PM

#

💀

low shard Nov 23, 2024, 2:08 PM

#

brittle wing 💀

If you crashed 4 times out of RAM, your PC is not powerful enough to do it locally

#

you can use cloud (remote good pc)

brittle wing Nov 23, 2024, 2:08 PM

#

low shard @brittle wing do not send datasets here for copyright reasons

where you saw copyrights?

low shard Nov 23, 2024, 2:09 PM

#

brittle wing where you saw copyrights?

we don't allow datasets to be shared here

#

the server already got taken down once

brittle wing Nov 23, 2024, 2:09 PM

#

bruh

low shard Nov 23, 2024, 2:09 PM

#

we don't want this again

low shard Nov 23, 2024, 2:09 PM

#

brittle wing bruh

You can run RVC on cloud (remote good pc):

Prepare the Dataset
Setup RVC:
Choose a cloud option for running RVC:

Google Colabs (about 4 hours of free GPU time daily, not many hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only 15 free credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time

Be sure to know about the tensorboard

If you are looking for the easiest free option, use https://weights.gg, which ofc uses RVC

#

or #1159289738314919936 / #1191429836321849435

simple ore Nov 23, 2024, 2:33 PM

#

golden walrus so i need to fix the sound file before training right ?

if the source audio is quiet, then the trained model will be quiet
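
If the dataset audio really is quiet, one common fix is peak normalization before training. The sketch below assumes float samples in the range [-1, 1]. The 0.9 target is an arbitrary choice for illustration, not a value recommended in this chat, and the "recording" is a synthetic tone.

```python
import math


def peak_normalize(samples: list[float], target_peak: float = 0.9) -> list[float]:
    """Scale audio so its loudest sample has magnitude target_peak."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # pure silence: nothing to scale
    gain = target_peak / peak
    return [s * gain for s in samples]


# Synthetic stand-in for a quiet recording: a 220 Hz tone peaking near 0.05.
quiet = [0.05 * math.sin(2 * math.pi * 220.0 * n / 44100) for n in range(44100)]
louder = peak_normalize(quiet)

print(f"peak before: {max(abs(s) for s in quiet):.3f}")
print(f"peak after:  {max(abs(s) for s in louder):.3f}")
```

Because every sample is multiplied by the same gain, the waveform shape is unchanged; only the level moves.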

golden walrus Nov 23, 2024, 2:38 PM

#

simple ore if the source audio is quiet, then the trained model will be quiet

kittypawbite but i checked multiple time, the source audio volume is pretty good

#

but the result is so much smaller

simple ore Nov 23, 2024, 2:44 PM

#

what are you using for training?

analog obsidian Nov 23, 2024, 2:46 PM

#

golden walrus but the result is so much smaller

this sounds like a very obvious question
but
is your real mic volume… low?

golden walrus Nov 23, 2024, 2:47 PM

#

analog obsidian this sounds like a very obvious question but is your real mic volume… low?

gru i put everything to 100 but still low

#

like if i use normal mic, it's normal, whenever i use rvc, it got really small

analog obsidian Nov 23, 2024, 2:48 PM

#

golden walrus like if i use normal mic, it's normal, whenever i use rvc, it got really small

are you close to the mic?

golden walrus Nov 23, 2024, 2:48 PM

#

yes, very.

analog obsidian Nov 23, 2024, 2:49 PM

#

ahh then if your mic volume is fine, it's the model's fault

golden walrus Nov 23, 2024, 2:49 PM

#

but let me try again, maybe it was my fault somewhere

analog obsidian Nov 23, 2024, 2:49 PM

#

golden walrus but let me try again, maybe it was my fault somewhere

be sure to hear your real voice first

#

so u can check if your real mic volume is loud enough

golden walrus Nov 23, 2024, 2:50 PM

#

kittyblush okay, i will do that, thank you for your time

simple ore Nov 23, 2024, 2:55 PM

#

golden walrus kittyblush okay, i will do that, thank you for your time

if other model works fine, but yours does not, then the problem is obvious

golden walrus Nov 23, 2024, 2:56 PM

#

kittypawbite other models have the same issue tho

rare gobletBOT Nov 23, 2024, 2:56 PM

#

Ayo? @golden walrus level 5 !!! lfg

golden walrus Nov 23, 2024, 2:57 PM

#

kittypawbite but my input is fine

knotty moth Nov 23, 2024, 2:59 PM

#

golden walrus kittypawbite but i checked multiple time, the source audi...

is the inferred audio really quieter than the source one?
check the volume envelope setting in the inference, the default one should be fine
(did you use ilaria RVC or Applio, tho?)

golden walrus Nov 23, 2024, 3:00 PM

#

i used applio to train model

knotty moth Nov 23, 2024, 3:00 PM

#

and inference?

golden walrus Nov 23, 2024, 3:00 PM

#

and this

#

kittypawbite i don't know the name so i just call it rvc

knotty moth Nov 23, 2024, 3:01 PM

#

golden walrus and this

ah, then it may be the mic settings

golden walrus Nov 23, 2024, 3:03 PM

#

but i set it at 100

knotty moth Nov 23, 2024, 3:03 PM

#

try speak louder since you apply Sup2, or how about when Sup2 turned off?

golden walrus Nov 23, 2024, 3:04 PM

#

same volume when turn off sup 2

brittle wing Nov 23, 2024, 3:04 PM

#

Hello, I am using local RVC, I have a save every 30 epochs. Last night I was training a model and the PC shut down but I still have my saves. Can I continue training from models with these saves?

golden walrus Nov 23, 2024, 3:04 PM

#

aye, maybe i will reinstall this

#

-rvc

azure marshBOT Nov 23, 2024, 3:05 PM

#

golden walrus -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

analog obsidian Nov 23, 2024, 3:06 PM

#

golden walrus and this

increase it to around 150

golden walrus Nov 23, 2024, 3:06 PM

#

i read somewhere doing so will drastically reduce the quality right ?

analog obsidian Nov 23, 2024, 3:07 PM

#

golden walrus i read somewhere doing so will drastically reduce the quality right ?

no, this setting increases your mic volume
in cases like this where the real input volume is low you need to increase it

golden walrus Nov 23, 2024, 3:07 PM

#

ahhhhhhhhhhhh

knotty moth Nov 23, 2024, 3:07 PM

#

golden walrus i read somewhere doing so will drastically reduce the quality right ?

only if it is loud enough to cause clipping

analog obsidian Nov 23, 2024, 3:07 PM

#

be sure to not increase it too loud

golden walrus Nov 23, 2024, 3:07 PM

#

i got it

analog obsidian Nov 23, 2024, 3:07 PM

#

when it reaches distortion/clipping the model quality degrades
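
To make the clipping warning concrete, here is a small sketch of what a percentage input gain does to samples. It assumes the setting acts as a plain multiplier (150 means x1.5) and that audio is float in [-1, 1]. Both are assumptions about the voice changer, and the sample values are invented.

```python
def apply_input_gain(samples: list[float], gain_percent: float) -> tuple[list[float], int]:
    """Apply a percentage gain and hard-clip to [-1.0, 1.0].

    Returns the processed samples and how many of them had to be clipped.
    """
    gain = gain_percent / 100.0
    processed = []
    clipped = 0
    for sample in samples:
        value = sample * gain
        if value > 1.0 or value < -1.0:
            clipped += 1
            value = max(-1.0, min(1.0, value))
        processed.append(value)
    return processed, clipped


# Invented microphone samples with moderate peaks.
mic = [0.2, -0.5, 0.6, -0.65]

for percent in (100, 150, 200):
    _, clipped = apply_input_gain(mic, percent)
    print(f"gain {percent}%: {clipped} clipped sample(s)")
```

With these made-up samples a gain of 150 stays clean while 200 starts clipping, which is the trade-off described above: raise the gain until the level is healthy, then stop.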

golden walrus Nov 23, 2024, 3:08 PM

#

catblush i get it now

#

thank you

brittle wing Nov 23, 2024, 3:10 PM

#

brittle wing Hello, I am using local RVC, I have a save every 30 epochs. Last night I was tra...

Any ideas?

golden walrus Nov 23, 2024, 3:10 PM

#

ah 1 last question.
if i train a model, those D and G are pre-trained right ?
It stores the data I feed it, so if i want a model for singing in my language, i can throw a bunch of songs then train ?
After that i use a voice i want to mimic to train on top of that pretrain ?

simple ore Nov 23, 2024, 3:12 PM

#

D and G are model weights used for training

analog obsidian Nov 23, 2024, 3:12 PM

#

golden walrus ah 1 last question. if i train a model, those D and G are pre-trained right ? ...

g is the generator of the model, d is the discriminator, it doesn't work like that

simple ore Nov 23, 2024, 3:12 PM

#

the default pretrain is a generalized model that is average for everything

#

when you train a new model on top of it, you change generalized model to more specific

analog obsidian Nov 23, 2024, 3:13 PM

#

brittle wing Hello, I am using local RVC, I have a save every 30 epochs. Last night I was tra...

did you save the g and d files? if you did it then you can continue training, if not, no

simple ore Nov 23, 2024, 3:13 PM

#

training a new model on top of a different model may not be beneficial

golden walrus Nov 23, 2024, 3:14 PM

#

ohhhh, so how to allow a model to sing tho ?

analog obsidian Nov 23, 2024, 3:14 PM

#

golden walrus ohhhh, so how to allow a model to sing tho ?

speech models will always suck at singing

#

they dont have singing data so they are never gonna sound as good as a model trained on singing

brittle wing Nov 23, 2024, 3:14 PM

#

analog obsidian did you save the g and d files? if you did it then you can continue training, if...

Yes, I've saved them both. How can I continue the training?

golden walrus Nov 23, 2024, 3:15 PM

#

pepecry maybe i think too simple about these models

#

so i have to pick 1 between singing and talking model

analog obsidian Nov 23, 2024, 3:17 PM

#

brittle wing Yes, I've saved them both. How can I continue the training?

go to the training tab
in the model name use the same exact name of your model (if you named it lyery before, then it has to be exactly lyery)
don't preprocess, don't pitch extract (very important too)
set the same batch size you set (VERY IMPORTANT) if you used batch 8 while training, use batch 8 again
same epoch amount and same save freq amount

#

if everything is the same, you can now click start training

#

and it is going to continue training using the latest g and d
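
Before resuming, it can help to confirm that both a G and a D save actually exist and to see which ones are the latest. The sketch below assumes checkpoints are named `G_<step>.pth` and `D_<step>.pth`, based on the file names mentioned elsewhere in this chat. The helper names and the file list are hypothetical.

```python
import re

# Assumed naming scheme: G_<step>.pth for the generator, D_<step>.pth for the discriminator.
CHECKPOINT_PATTERN = re.compile(r"^([GD])_(\d+)\.pth$")


def latest_checkpoints(file_names: list[str]) -> dict[str, str]:
    """Map 'G' and 'D' to the checkpoint file with the highest step number."""
    best: dict[str, tuple[int, str]] = {}
    for name in file_names:
        match = CHECKPOINT_PATTERN.match(name)
        if not match:
            continue  # not a checkpoint file
        kind, step = match.group(1), int(match.group(2))
        if kind not in best or step > best[kind][0]:
            best[kind] = (step, name)
    return {kind: name for kind, (_, name) in best.items()}


def can_resume(file_names: list[str]) -> bool:
    """Resuming needs both a generator (G) and a discriminator (D) save."""
    found = latest_checkpoints(file_names)
    return "G" in found and "D" in found


# Invented contents of a model's training folder.
files = ["G_1200.pth", "D_1200.pth", "G_2400.pth", "D_2400.pth", "config.json"]

print(latest_checkpoints(files))
print(can_resume(files))
```

If `can_resume` comes back false, the checklist above will not help, since there is nothing for training to pick up from.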

analog obsidian Nov 23, 2024, 3:19 PM

#

golden walrus so i have to pick 1 between singing and talking model

for realtime voice changing use a speech model

#

singing models suck at speech

golden walrus Nov 23, 2024, 3:20 PM

#

ah i get it

analog obsidian Nov 23, 2024, 3:20 PM

#

and don't mix singing and speech in a dataset

#

you'll get better result training a pure speech/singing model than a mixed

golden walrus Nov 23, 2024, 3:20 PM

#

by the way, is embedder model important ?

analog obsidian Nov 23, 2024, 3:20 PM

#

golden walrus by the way, is embedder model important ?

yes, always use contentvec

golden walrus Nov 23, 2024, 3:21 PM

#

ah so pretrained is where i pick which one suitable for my language ?

analog obsidian Nov 23, 2024, 3:22 PM

#

golden walrus ah so pretrained is where i pick which one suitable for my language ?

custom pretraineds have issues the original doesn't have
in simpler words the original pretrain is going to reconstruct your dataset frequencies better

#

despite the original one being trained on english only, you can train any language with good results (i train mostly spanish models and they have good pronunciation even though the pretrained is english only)

#

avoid using custom pretraineds and use only the original

#

you can use them but you may get "exclusive" problems related to that specific custom pretrained used

golden walrus Nov 23, 2024, 3:23 PM

#

catblush

#

new thing learned

#

right, i will get it to work, but do you have an example of epoch number ? or just purely observe the graph

analog obsidian Nov 23, 2024, 3:26 PM

#

golden walrus right, i will get it to work, but do you have an example of epoch number ? or ju...

an example amount would be 200
but the real answer is we can't give a "good amount" of epochs since it's random

#

me personally i dont train over 200 epochs

#

most of my models are usable at around 100-150 ish

golden walrus Nov 23, 2024, 3:27 PM

#

kittyblush ah okay

#

cuz i only do 120 then i look at graph

#

but sometimes graph is flat

#

no low point

#

just flat

analog obsidian Nov 23, 2024, 3:27 PM

#

golden walrus but sometimes graph is flat

batch size too high or dataset too big*

#

or both together mixed

golden walrus Nov 23, 2024, 3:28 PM

#

i set batch size at 8 and most of the data is around 40mins

#

kittypawbite let me try this time, maybe it's different.

analog obsidian Nov 23, 2024, 3:28 PM

#

golden walrus i set batch size at 8 and most of the data is around 40mins

you should be getting low points uhh, is your graph smoothing set too high? like 0.999

golden walrus Nov 23, 2024, 3:29 PM

#

i set it to 0.982 like the doc said

analog obsidian Nov 23, 2024, 3:29 PM

#

golden walrus i set it 0.982 like in doc said

ahh that explains it, you can set it lower and you'll see your low points
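
To see why a high smoothing value hides low points, here is a rough sketch. It assumes the graph smoothing is an exponential moving average, which is how smoothing sliders on training graphs generally behave. The loss values are made up for illustration.

```python
def smooth(values, weight):
    """Exponential moving average: higher weight means a flatter curve."""
    smoothed, last = [], values[0]
    for v in values:
        last = weight * last + (1 - weight) * v
        smoothed.append(last)
    return smoothed


# Invented mel loss curve with a clear dip at index 5.
mel = [20.0, 19.5, 19.0, 18.5, 18.0, 15.0, 18.0, 18.2, 18.4, 18.6]

for weight in (0.982, 0.5, 0.0):
    curve = smooth(mel, weight)
    lowest = curve.index(min(curve))
    print(f"smoothing={weight}: lowest point at index {lowest}")
```

With the weight at 0.982 the dip is averaged away and the curve only drifts slowly, while lower weights keep the dip where it actually happened.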

golden walrus Nov 23, 2024, 3:30 PM

#

catblush

analog obsidian Nov 23, 2024, 3:30 PM

#

but smooth graphs in big datasets is normal anyways

#

not really a bad thing

#

as long as you hit low points it's fine

#

and the graph is not rising

#

small datasets have multiple low points but that doesn't mean it's good

#

(actually thats kind of bad lol)

golden walrus Nov 23, 2024, 3:32 PM

#

kittypawbite so how do you know when it get overtrained

#

if you train a small dataset

analog obsidian Nov 23, 2024, 3:36 PM

#

golden walrus if you train a small dataset

u have to hear the epochs, smaller datasets suffer from overfitting rather than overtraining
for example you'll hear robotic sibilances (s, ch sounds), they also sound unnatural due to the lack of data

golden walrus Nov 23, 2024, 3:37 PM

#

gru i have to check one by one ?

analog obsidian Nov 23, 2024, 3:37 PM

#

golden walrus gru i have to check one by one ?

nah only low points

#

choose the mel low points, not the g/total ones

#

g/total is the total generator loss, it's merely mel, kl and fm combined (plus the adversarial term)
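
Picking candidate epochs from the mel graph can be sketched like this. It assumes you have the mel loss per saved epoch as (epoch, loss) pairs; the numbers below are invented, and `low_points` is a hypothetical helper, not part of any RVC tool.

```python
def low_points(losses, window=2):
    """Return epochs whose mel loss is the unique minimum among nearby saves.

    losses: list of (epoch, mel_loss) pairs in training order.
    window: how many neighbours on each side to compare against.
    """
    picks = []
    for i, (epoch, loss) in enumerate(losses):
        lo, hi = max(0, i - window), min(len(losses), i + window + 1)
        neighbours = [value for _, value in losses[lo:hi]]
        if loss == min(neighbours) and neighbours.count(loss) == 1:
            picks.append(epoch)
    return picks


# Invented values for illustration only.
mel_by_epoch = [(10, 19.2), (20, 18.4), (30, 18.9), (40, 18.7),
                (50, 17.8), (60, 18.3), (70, 18.5), (80, 18.1)]
candidates = low_points(mel_by_epoch)
```

The result is only a shortlist of epochs to listen to; the final choice is still made by ear.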

golden walrus Nov 23, 2024, 3:38 PM

#

analog obsidian choose the mel low points, not the g/total ones

about this, does this apply to all quantity of dataset or just smol one ?

#

ahhhhhhh

#

1 step closer to Chamber's voice

analog obsidian Nov 23, 2024, 3:38 PM

#

golden walrus about this, does this apply to all quantity of dataset or just smol one ?

the robotic sibilance and unnatural sounding model? yeah this happens due to smol dataset only

#

you can also get those on big datasets but those happen very late in training (but the big dataset is always going to sound more natural)

#

while on smol datasets that happen extremely early

golden walrus Nov 23, 2024, 3:39 PM

#

catblush that explains why 300 epochs sound so bad

analog obsidian Nov 23, 2024, 3:40 PM

#

10 minutes is the bare minimum, it still overfits quite fast but not as fast as a 5 minute one

#

the more data u get, the later the model is going to overfit

#

and the more natural it's going to sound

#

with more data

#

you don't need to de-ess the dataset to fix the robotic SH sounds, this is just a myth

#

it's merely a dataset length thing
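
A quick way to check a dataset against the 10 minute figure, as a sketch only. It assumes the dataset is a folder of PCM `.wav` files (the standard-library `wave` module cannot read other formats), and the threshold is just the number quoted above, not a hard limit.

```python
import wave
from pathlib import Path

MIN_MINUTES = 10  # "bare minimum" quoted in the chat


def dataset_minutes(folder):
    """Sum the duration of every .wav file in folder, in minutes."""
    total_seconds = 0.0
    for path in sorted(Path(folder).glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            total_seconds += wav.getnframes() / wav.getframerate()
    return total_seconds / 60


def is_long_enough(folder, minimum=MIN_MINUTES):
    """True when the folder holds at least `minimum` minutes of audio."""
    return dataset_minutes(folder) >= minimum
```

Length is only one side of it: the audio still has to be clean, as discussed further down.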

golden walrus Nov 23, 2024, 3:41 PM

#

ah, so can i add data to the model ? like i got 1 data that is 10 mins, i train it. then later i got more data, can i add on top of it ? or i have to train again ?

analog obsidian Nov 23, 2024, 3:42 PM

#

golden walrus ah, so can i add data to the model ? like i got 1 data that is 10 mins, i train ...

as long as all of the data is the same quality, and not damaged, sure you can add more

#

but you have to train it again

#

from 0

#

a new model with the added data

brittle wing Nov 23, 2024, 3:42 PM

#

analog obsidian and is going to continue training using the latest g and d

Okay, I'm trying, thank you very much

analog obsidian Nov 23, 2024, 3:42 PM

#

you can't add data to an already existing model

golden walrus Nov 23, 2024, 3:43 PM

#

kittypawbite interesting

#

well, that's all for my curiosity. You're awesome

analog obsidian Nov 23, 2024, 3:44 PM

#

no prob training model is easy, whats hard is to be sure the dataset is not damaged hehe

#

keep in mind that every model trained on audio from mvsep/separation models is damaged

#

so the quality is degraded a lot compared to a non mvsep model

#

so if you notice the model sounds a bit... broken? it's just because of that

quasi dagger Nov 23, 2024, 3:45 PM

#

analog obsidian you can't add data to an already existing model

Wouldn't be possible with model blends?

golden walrus Nov 23, 2024, 3:45 PM

#

kittyblush oh, that explains why my chamber voice is so robotic

#

like

#

metallic af

analog obsidian Nov 23, 2024, 3:46 PM

#

quasi dagger Wouldn't be possible with model blends?

you meant model fusion? not really this only merges the timbre

golden walrus Nov 23, 2024, 3:46 PM

#

i will redo the data

#

thank youuuuuuuuuuu

quasi dagger Nov 23, 2024, 3:46 PM

#

analog obsidian you meant model fusion? not really this only merges the timbre

Yes that's what I mean. Never tried that, can you explain more about it?

analog obsidian Nov 23, 2024, 3:47 PM

#

quasi dagger Yes that's what I mean. Never tried that, can you explain more about it?

merges the timbre of the models, this was meant to create new voices but you can also do it to try to "fix" a small dataset model by merging epochs

#

codename has a tutorial explaining model merging

knotty moth Nov 23, 2024, 3:48 PM

#

quasi dagger Wouldn't be possible with model blends?

hopefully the new codename's rvc will allow multispeaker, so you could add normal speaking, singing, screaming, etc. in a model

quasi dagger Nov 23, 2024, 3:48 PM

#

knotty moth hopefully the new codename's rvc will allow multispeaker, so you could add norma...

That sounds so cool

analog obsidian Nov 23, 2024, 3:49 PM

#

im praying everything goes well, this sounds amazing

#

pepoPray

quasi dagger Nov 23, 2024, 3:53 PM

#

analog obsidian so the quality is degraded a lot compared to a non mvsep model

Oh, also didn't know about this. I've been using the MelBand Roformer model to isolate vocals

analog obsidian Nov 23, 2024, 3:55 PM

#

quasi dagger Oh, also didn't know about this. I've been using the MelBand Roformer model to i...

yep every type of damage to the audios will make a model sound a bit robotic/metallic

#

not only separation models but every type of damage will do it

#

ideally you want the dataset to have very little post processing or in the best case, no post processing at all, raw quality

quasi dagger Nov 23, 2024, 3:57 PM

#

I guess every separation model would do that in UVR as well?

analog obsidian Nov 23, 2024, 3:57 PM

#

quasi dagger I guess every separation model would do that in UVR as well?

every separation model, yes correct

#

private models on mvsep and public uvr models

quasi dagger Nov 23, 2024, 3:57 PM

#

So a clean acapella would be the best i guess

#

Like leaked stems

analog obsidian Nov 23, 2024, 3:57 PM

#

quasi dagger Like leaked stems

exactly

#

raw acapella, without effects

quasi dagger Nov 23, 2024, 3:58 PM

#

Hard to find 🤣

analog obsidian Nov 23, 2024, 3:58 PM

#

i agree 😭

knotty moth Nov 23, 2024, 4:00 PM

#

quasi dagger Like leaked stems

some leaked stems I had have 16/17.5khz cutoff, have background noise, or may have a little bleed

knotty moth Nov 23, 2024, 4:01 PM

#

analog obsidian raw acapella, without effects

if it's mixed with lead vocals and harmonies, you'd have to separate it tho

analog obsidian Nov 23, 2024, 4:01 PM

#

knotty moth if it's mixed with lead vocals and harmonies, you'd have to separate it tho

tru, good luck finding leaked stems without harmonies 😭

#

anyways i notice what kills the quality in the dataset is when we use isolated de-reverb datasets

quasi dagger Nov 23, 2024, 4:03 PM

#

I noticed that too

analog obsidian Nov 23, 2024, 4:03 PM

#

i have a couple of mel roformer-only models (that lack any type of reverb/sound effect) and they have very good quality

#

aka streamer models

#

trolley

#

so it's not like mel roformer/bs is gonna destroy the quality by itself

#

it's when you use multiple separation models to remove multiple sound effects

#

so more effects removed, more robotic the end result

#

ideally to remove reverb use clarity vx de-reverb

#

it's better than the separation models since it doesnt destroy the quality too much (still expect a slightly metallic model)

knotty moth Nov 23, 2024, 4:05 PM

#

analog obsidian anyways i notice what kills the quality in the dataset is when we use isolated d...

separation, dereverb, and denoise models are never perfect (though still good enough for making covers), not as clean as raw studio sessions (and even those may have a little room reverb)

quasi dagger Nov 23, 2024, 4:05 PM

#

That's why i prefer live/intimate performances as they have less reverb, such as the tiny desk concerts

analog obsidian Nov 23, 2024, 4:05 PM

#

knotty moth every separation, dereverb, and denoise models are not too perfect (though still...

exactly

quasi dagger Nov 23, 2024, 4:06 PM

#

quasi dagger That's why i prefer live/intimate performances as they have less reverb, such as...

Also fewer instruments, so the separation turns out better

analog obsidian Nov 23, 2024, 4:08 PM

#

tldr less separation models used, less metallic the model will be

knotty moth Nov 23, 2024, 4:11 PM

#

may also depend on amount of background noise & instruments
I think the BGM in vtuber talking streams is not as loud as in song tracks, so the extracted vocals would be less muffled

analog obsidian Nov 23, 2024, 4:12 PM

#

knotty moth may also depend on amount of background noise & instruments I think the BGM in v...

ive trained noisy models and they dont sound robotic, rvc expects noise in the dataset

#

natural background noise tho

#

not synthetic noise

#

rvc is pretty robust towards noise so dont worry too much about that

#

focus more on how muffled/damaged the audio sounds

#

and keep the least damaged audio in the dataset

knotty moth Nov 23, 2024, 4:14 PM

#

the reason why I barely made dataset from song tracks...

#

only for inference

simple ore Nov 23, 2024, 4:35 PM

#

analog obsidian u have to hear the epochs, smaller datasets suffer from overfitting rather than ...

not exactly the lack of data, more like the lack of attempts to reproduce them

#

sibilants are made from white noise "columns"

#

#

but during training this white noise gets reshaped, not exactly in a good way - this is the "not baked enough" metallic thing

#

after 3000 attempts:

#

and after 5.5k it is close to the original

#

this was trained on a single 0.5s sample

analog obsidian Nov 23, 2024, 4:52 PM

#

simple ore not exactly the lack of data, more like the lack of attempts to reproduce them

yea this explains that better
Thanks

simple ore Nov 23, 2024, 4:53 PM

#

unfortunately the default training method is random, so you cant guarantee it would hit every c, s, ch x 5000+ times during a training loop

analog obsidian Nov 23, 2024, 4:54 PM

#

simple ore unfortunately the default training method is random, so you cant guarantee it wo...

why does more data make this happen less often?

simple ore Nov 23, 2024, 4:54 PM

#

so that's where the size of the dataset or number of epochs comes into play

#

during one epoch loop a random 1/10th of a standard 3sec slice of each sample is used

#

if you decide to slice your training set to 5+ sec samples, it is even less than 1/10th
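
The arithmetic behind that can be sketched as follows. It assumes the training window is about 0.3 s, which is only inferred from the "1/10th of a 3 sec slice" figure above; the constant and function names are hypothetical.

```python
SEGMENT_MS = 300  # assumed training window, inferred from "1/10th of 3 sec"


def fraction_seen_per_epoch(slice_ms, segment_ms=SEGMENT_MS):
    """Share of one dataset slice covered by a single random window."""
    return min(1.0, segment_ms / slice_ms)


def min_epochs_to_cover(slice_ms, segment_ms=SEGMENT_MS):
    """Fewest epochs in which every part of a slice could be seen once."""
    return -(-slice_ms // segment_ms)  # ceiling division on integers


for slice_ms in (3000, 5000):
    share = fraction_seen_per_epoch(slice_ms)
    print(f"{slice_ms} ms slice: {share:.0%} seen per epoch")
```

Because the window position is random, the real number of epochs needed to touch every part of a slice is higher than this lower bound.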

analog obsidian Nov 23, 2024, 4:56 PM

#

simple ore if you decide to slice your training set to 5+ sec samples, it is even less than...

basically default rvc

#

or it was 3?

#

cant remember lol

simple ore Nov 23, 2024, 4:57 PM

#

usually it is 3, unless there's a lot of silence so it cuts smaller pieces around silence gaps

#

I made a modification of the training loop, so it goes thru the entire set each epoch

#

0.5s slices with a small overlap
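
The idea of walking through a whole sample in overlapping 0.5 s windows could look roughly like this. It is an illustration of the slicing only, not the actual modified training loop; the sample rate and the overlap size are assumptions, since the chat only says "small overlap".

```python
def sliding_windows(total_samples, window, overlap):
    """Yield (start, end) sample ranges that cover [0, total_samples) in order."""
    if not 0 <= overlap < window:
        raise ValueError("overlap must be smaller than the window")
    step = window - overlap
    start = 0
    while start < total_samples:
        end = min(start + window, total_samples)
        yield start, end
        if end == total_samples:
            break
        start += step


SAMPLE_RATE = 40_000          # assumed sample rate
WINDOW = SAMPLE_RATE // 2     # 0.5 s, as mentioned in the chat
OVERLAP = SAMPLE_RATE // 20   # 0.05 s, an assumed "small overlap"

# Windows for one 3 second sample.
windows = list(sliding_windows(3 * SAMPLE_RATE, WINDOW, OVERLAP))
```

Unlike a random crop, every sample position falls inside at least one window, so each epoch sees all of the data.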

analog obsidian Nov 23, 2024, 4:58 PM

#

that helped with sibilances?

#

making them better than default rvc?

simple ore Nov 23, 2024, 4:59 PM

#

well, it guarantees every bit of data is being used

#

so like 12min data set is more or less equal to 2hr normal rvc dataset
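
The 12 min to 2 hr comparison follows from the 1/10th figure mentioned earlier. A back-of-the-envelope sketch, assuming the default loop really sees about a tenth of the data per epoch (the function name is hypothetical):

```python
from fractions import Fraction


def equivalent_random_minutes(full_pass_minutes, fraction_seen=Fraction(1, 10)):
    """Minutes of data a random-crop loop needs to see as much audio per epoch."""
    return full_pass_minutes / fraction_seen


minutes = equivalent_random_minutes(12)
hours = minutes / 60
```

This only compares how much audio is seen per epoch; it says nothing about variety, so 12 minutes of material is still 12 minutes of distinct speech or singing.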

analog obsidian Nov 23, 2024, 5:00 PM

#

simple ore so like 12min data set is more or less equal to 2hr normal rvc dataset

damn thats pretty cool, are there any downsides to what you did?

#

can it be used in normal finetuning?

simple ore Nov 23, 2024, 5:01 PM

#

I need to adjust learning rate, I think, so it does not overfit over long epoch.. or maybe save a model more often

analog obsidian Nov 23, 2024, 5:01 PM

#

ow i see

simple ore Nov 23, 2024, 5:01 PM

#

made a test set https://drive.google.com/file/d/1id1xD8jmqc17XsVS1tnVlS6WZ6SP5YQJ/view?usp=sharing

#

12min x 15 epoch of singing data

#

lil undercooked, but still pretty good for what it was made from

#

I may add it to Applio as 'experimental' training method

analog obsidian Nov 23, 2024, 5:07 PM

#

uhm yea i suppose fully cooking it would give better results than original rvc

knotty moth Nov 23, 2024, 5:18 PM

#

analog obsidian ive trained noisy models and they dont sound robotic, rvc expects noise in the d...

may somehow remind me of some old mid/early-2023 models

quasi dagger Nov 23, 2024, 5:30 PM

#

simple ore and after 5.5k it is close to the original

Very interesting, I assume the same applies to breath sounds?

simple ore Nov 23, 2024, 5:32 PM

#

I guess, it's an unvoiced piece of the audio