pastel oak Oct 6, 2024, 8:33 PM

#

Whats ur gpu and send screenshot of settings

#

Send screenshot

feral marsh Oct 6, 2024, 8:34 PM

#

you want a screenshot of my settings, yea?

pastel oak Oct 6, 2024, 8:34 PM

#

Yes

feral marsh Oct 6, 2024, 8:34 PM

#

pastel oak Oct 6, 2024, 8:35 PM

#

feral marsh

Chunk controls delay
Select 80 for example

feral marsh Oct 6, 2024, 8:35 PM

#

ima try that!

#

god damnit

#

thank you.....

#

that fixed it.

pastel oak Oct 6, 2024, 8:36 PM

#

easy

tight tiger Oct 6, 2024, 10:06 PM

#

pastel oak Whats ur gpu and send screenshot of settings

4060 ti

#

and my extra is like 13100 and my chunk is 2400

brave garnetBOT Oct 7, 2024, 12:17 AM

#

RVC Colabs and Spaces

⠀

Google Colabs

⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.

AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, by Eddy, Hina and Gdr.

RVC Disconnected
To train new voice models, by Kit Lemonfoot.

EasyGUI
The OG interface, by Rejects.
⠀

brave garnetBOT Oct 7, 2024, 12:42 AM

#

RVC Colabs and Spaces

⠀

Google Colabs

⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.

AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, by Eddy, Hina and Gdr.

RVC Disconnected
To train new voice models, by Kit Lemonfoot.

EasyGUI
The OG interface, by Rejects.
⠀

sick wraith Oct 7, 2024, 1:32 AM

#

Yeah I can't get the voices to sound like the people they should. I downloaded Robocop and can't get b the voice to sound right

odd shale Oct 7, 2024, 1:47 AM

#

sick wraith Yeah I can't get the voices to sound like the people they should. I downloaded R...

If you're using W-Okada, maybe you could try with different settings and also reading the docs below.

#

-realtime

azure marshBOT Oct 7, 2024, 1:47 AM

#

odd shale -realtime

This interaction has expired, use the command /guides realtime if you wish to see it again.

odd shale Oct 7, 2024, 1:47 AM

#

Check Deiteris' one.

brave garnetBOT Oct 7, 2024, 1:50 AM

#

RVC Colabs and Spaces

⠀

Google Colabs

⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.

AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, by Eddy, Hina and Gdr.

RVC Disconnected
To train new voice models, by Kit Lemonfoot.

EasyGUI
The OG interface, by Rejects.
⠀

wild vale Oct 7, 2024, 4:24 AM

#

Okay so im confused right now.

#

The reason noone could answer my questions were because my questions didnt make sense?

#

That is very confusing.

#

Its not like I am speaking a different language

#

The Buffering rises above the threshold of 512 to 1000, and the res goes to 4k plus which makes my voice robotic and hard to hear.

rare gobletBOT Oct 7, 2024, 4:28 AM

#

Ayo? @wild vale level 2 !!! lfg

wild vale Oct 7, 2024, 4:28 AM

#

What part of that is hard?

#

Do I edit the chunk or something or am I missing something?

#

When I use a voice in discord chat its fine

#

But when I use it in game or a heavy game
The Chunk goes from
buf:512
res: 12-128

to

buf: 3x the normal
res: Goes to places higher than 2.7k

simple ore Oct 7, 2024, 7:11 AM

#

wild vale But when I use it in game or a heavy game The Chunk goes from buf:512 res: 12-...

since there's no way to prioritize RVC over the game, yeah, that's gonna happen

#

get another GPU I guess

pastel oak Oct 7, 2024, 8:40 AM

#

tight tiger 4060 ti

Chunk could be too low for your gpu, id need to know the ms number instead of the 2400 number

Else download v1
https://rentry.co/voicechangerguide

Guide for W-Okada's RealTimeVoiceChangerClient

Github - Blanc-dot
Discord User - https://discord.com/users/824922747423031359
Special thanks to the following people : lusbert, poopmaster, felt, fazemasta, antasma, shadictl, x_hina, sushi
thanks are for anything added to guide, taken from any talks, settings added when previously collecting st...

pastel oak Oct 7, 2024, 8:44 AM

#

wild vale What part of that is hard?

I dont know who youre flaming here but youre in the wrong channel first of all but ok

You might be running into 100% GPU issues, so you have a few options to try:

reduce your ingame quality and cap fps to just above your monitors refresh rate
increase chunk and reduce extra for less gpu and cpu load from the voice changer
if that didnt work, try out the fork. Has very little resources used and runs better: https://rentry.co/ForkVoiceChangerGuide

If all fails and it turns out youre playing a very high end game that goes to 99% gpu usage either way, then upgrade gpu or get multi pc setup

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update October 6th, 2024: Multi PC setup explanation added
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVo...

odd valve Oct 7, 2024, 9:03 AM

#

anyone know the end of some words get cut out when u say a sentence with rvc

#

it doesnt cut out, but the ai gets weird and almost ignores the last few letters if you know what i mean

#

unless you really put emphasis, which just kind of makes it sound unrealistic

pastel oak Oct 7, 2024, 9:33 AM

#

odd valve anyone know the end of some words get cut out when u say a sentence with rvc

If your threshold is too high it might not pick up if you get quieter at the end of the sentence below the threshold

#

Move threshold/n gate to the left if its on the right

odd valve Oct 7, 2024, 9:34 AM

#

pastel oak If your threshold is too high it might not pick up if you get quieter at the end...

i have it on the lowest thing

#

pastel oak Oct 7, 2024, 9:34 AM

#

Oh ur on rvcs voice changer

#

Send full screenshot

odd valve Oct 7, 2024, 9:35 AM

#

pastel oak Send full screenshot

#

like i said it doesnt "cut out"

#

it just ignores some last letters

#

most of the time

pastel oak Oct 7, 2024, 9:37 AM

#

odd valve

Put extra to 2.70

odd valve Oct 7, 2024, 9:39 AM

#

pastel oak Put extra to 2.70

what does the extra do?

#

thats soo much better

pastel oak Oct 7, 2024, 9:42 AM

#

odd valve what does the extra do?

Extra is both the voice model quality and controls the length of a consistent tone, like if you hold a tone aaaaa you can hold it up to 2.7 before the voice breaks. And in this case, 2.7 is considered the max setting (most models struggle to go above this number, but some are capable of it)

#

In rvcs gui it does more damage than benefits to go above 2.7 from my testing

odd valve Oct 7, 2024, 9:44 AM

#

pastel oak In rvcs gui it does more damage than benefits to go above 2.7 from my testing

i see, thanks though it fixed basically everything

brave garnetBOT Oct 7, 2024, 9:59 AM

#

RVC Colabs and Spaces

⠀

Local Forks 🖥️

⠀
Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.

Applio
Simplified, suggested for all, by the Applio team.

RVC Studio
Simplified, suggested for all, by SayanoAI.

Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.

AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.

Replay
From the greators of weights.gg, excellent product for everyone.
⠀

next plinth Oct 7, 2024, 10:28 AM

#

How to use this AI 🥹

#

I'm newbie and i don't understand anything 😭

low shard Oct 7, 2024, 10:29 AM

#

next plinth How to use this AI 🥹

which ai ?

#

and whats ur pc gpu

next plinth Oct 7, 2024, 10:31 AM

#

I want to try Genshin rvc model by HuggingFace but the web i click on doesn't look like the old one 🥹

low shard Oct 7, 2024, 10:31 AM

#

next plinth I want to try Genshin rvc model by HuggingFace but the web i click on doesn't lo...

whats ur pc gpu?

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

#

Also, are u looking for ai covers or realtime

next plinth Oct 7, 2024, 10:36 AM

#

Is "Intel(R) UHD Graphics" a GPU? I don't understand 🥹💔

rare gobletBOT Oct 7, 2024, 10:36 AM

#

Ayo? @next plinth level 1 !!! lfg

low shard Oct 7, 2024, 10:37 AM

#

next plinth Is "Intel(R) UHD Graphics" a GPU? I don't understand 🥹💔

its the integrated graphics, which is bad, u cant dont do it locally (on ur pc) but can use cloud (remote good pc)

#

are you looking for ai covers or realtime for calls

next plinth Oct 7, 2024, 10:38 AM

#

Ai covers 🥹

low shard Oct 7, 2024, 10:38 AM

#

next plinth Ai covers 🥹

use ilaria rvc zero, a zerogpu (A100 paid by Ilaria) huggingface (biggest ai platform) space (service they offer to try ai), its the fastest way

brave garnetBOT Oct 7, 2024, 10:38 AM

#

Ilaria RVC: CLICK HERE 🤗

Guide on how to use it: CLICK HERE 📝

Don't forget to thank Ilaria if you find it useful! 💖

Ilaria RVC - a Hugging Face Space by TheStinger

next plinth Oct 7, 2024, 10:39 AM

#

O-okay

#

Thank you

sick wraith Oct 7, 2024, 12:11 PM

#

What do I need to do to get the voices to sound right in voice changer? They always sound off? Am I supposed to be tweaking the voices based on the sound file or voice I'm using?

odd shale Oct 7, 2024, 12:29 PM

#

sick wraith What do I need to do to get the voices to sound right in voice changer? They alw...

Probably some models you're using aren't meant to be used on W-Okada.

sick wraith Oct 7, 2024, 12:29 PM

#

Ah ok. Are rvc models not universal?

odd shale Oct 7, 2024, 12:32 PM

#

sick wraith Ah ok. Are rvc models not universal?

Some voices can work for everything, some don't.

#

It also depends (i think) how the author made the model. (dataset cleanup and length)

#

It can also depend on your settings and your own voice.

wild vale Oct 7, 2024, 12:38 PM

#

pastel oak I dont know who youre flaming here but youre in the wrong channel first of all b...

I have an
Processor Intel(R) Core(TM) i7-10700K CPU @ 3.80GHz 3.80 GHz
Installed RAM 32.0 GB
Device ID 60F634EE-B521-4FCF-A554-CDCB5FDC830E
Product ID 00330-80000-00000-AA684
System type 64-bit operating system, x64-based processor
Pen and touch No pen or touch input is available for this display
RTX 3060Ti

wild vale Oct 7, 2024, 12:38 PM

#

pastel oak I dont know who youre flaming here but youre in the wrong channel first of all b...

Whats the right channel then

analog obsidian Oct 7, 2024, 12:50 PM

#

sick wraith Ah ok. Are rvc models not universal?

Singing models are very bad at speech
While speech models are mid/bad at singing

#

and models not sounding like the original voice are undertrained or the dataset had timbre issues

odd shale Oct 7, 2024, 12:55 PM

#

analog obsidian and models not sounding like the original voice are undertrained or the dataset ...

Y algunas veces no es tan buena idea mezclar canto y dialogo en un mismo dataset verdad?

#

Porque soy fiel creyente que es mejor hacer 2 modelos distintos de la misma persona/personaje dependiendo del proposito que uno le quiera dar.

analog obsidian Oct 7, 2024, 12:57 PM

#

odd shale Y algunas veces no es tan buena idea mezclar canto y dialogo en un mismo dataset...

Es mejor hacer dos modelos distintos y no mezclar canto con dialogo

#

Okayge

odd shale Oct 7, 2024, 12:57 PM

#

analog obsidian Es mejor hacer dos modelos distintos y no mezclar canto con dialogo

Factos.

#

Es lo mismo que evito hacer.

#

Pero no mucha gente sabe de esto.

pastel oak Oct 7, 2024, 1:00 PM

#

wild vale Whats the right channel then

#🔍│help-w-okada

pastel oak Oct 7, 2024, 1:01 PM

#

wild vale I have an Processor Intel(R) Core(TM) i7-10700K CPU @ 3.80GHz 3.80 GHz Ins...

Ok, can you tell me what game youre running and what settings on wokada, send screenshot

sick wraith Oct 7, 2024, 1:09 PM

#

where does ilaria rvc store it's models?

#

i took a secondary model and placed it in it's root folder where model.pth is but it's not detecting it

low shard Oct 7, 2024, 1:17 PM

#

sick wraith where does ilaria rvc store it's models?

ilaria rvc zero?

brittle wing Oct 7, 2024, 1:56 PM

#

-colab

azure marshBOT Oct 7, 2024, 1:56 PM

#

brittle wing -colab

☁️ Google Colabs

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

rare gobletBOT Oct 7, 2024, 1:56 PM

#

Ayo? @brittle wing level 6 !!! lfg

ornate hawk Oct 7, 2024, 2:06 PM

#

Which option should I choose for a pre-trained model if the dataset is at 44,100 Hz? Titan only supports 32k, 40k, and 48k

pastel oak Oct 7, 2024, 2:08 PM

#

ornate hawk Which option should I choose for a pre-trained model if the dataset is at 44,100...

First check if its truly 44,1k hz because often times the waves show something different. You can check this with a program called "spek"

This is a debated topic: You can use 40k because you do not have those ranges from 44.1 - 48k in your dataset, so the model could get inaccurate. Some say use 48k because you wont hear the difference anyway and get more out of it

Imo it doesnt matter, i would probably still use 40k

radiant loom Oct 7, 2024, 2:19 PM

#

Does anyone know how to remove the robotic sound at the end of words?

low shard Oct 7, 2024, 2:52 PM

#

@jagged hawk im pinging gu in the right channel, whats ur pc gpu?

jagged hawk Oct 7, 2024, 2:53 PM

#

low shard <@226078787765272586> im pinging gu in the right channel, whats ur pc gpu?

4070 ti

low shard Oct 7, 2024, 2:54 PM

#

jagged hawk 4070 ti

alright good enough

#

the docs are temporary down so lemme send u the temp ones rq

jagged hawk Oct 7, 2024, 2:54 PM

#

Ok, thanks

low shard Oct 7, 2024, 2:55 PM

#

jagged hawk Ok, thanks

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, same quality tho
Mainline: The original RVC

jagged hawk Oct 7, 2024, 2:59 PM

#

Oh nice

#

I'm instaling Applio rn

#

Is it intuitive?

#

Or u recomend watching a tutorial?

low shard Oct 7, 2024, 3:01 PM

#

jagged hawk Or u recomend watching a tutorial?

The guide i sent is already a written guide

#

there is no updated video tutorial

jagged hawk Oct 7, 2024, 3:02 PM

#

Ohhh

rare gobletBOT Oct 7, 2024, 3:02 PM

#

Ayo? @jagged hawk level 2 !!! lfg

jagged hawk Oct 7, 2024, 3:02 PM

#

It's a link

#

I didn't realize 😅

#

Ty! ur the best

simple ore Oct 7, 2024, 3:16 PM

#

you can always just read the docs

wise lark Oct 7, 2024, 4:08 PM

#

#

How to watch voice model?

low shard Oct 7, 2024, 4:11 PM

#

wise lark

wtf, mind u refreshing the site?

wise lark Oct 7, 2024, 4:16 PM

#

I refreshed site and I restarted discord

#

but I still can't see it

nocturne mural Oct 7, 2024, 4:35 PM

#

wise lark

I've experienced something similar. It might be because you still have an active search in the top section. Try clearing it and maybe that will fix it.

#

sly sluice Oct 7, 2024, 5:33 PM

#

python trainset_preprocess_pipeline_print.py "/content/dataset/EVRAART" 40000 2 "/content/Mangio-RVC-Fork/logs/EVRAART" 1
python3: can't open file '/content/Mangio-RVC-Fork/trainset_preprocess_pipeline_print.py': [Errno 2] No such file or directory
python extract_f0_print.py "/content/Mangio-RVC-Fork/logs/EVRAART" 2 rmvpe 64
python3: can't open file '/content/Mangio-RVC-Fork/extract_f0_print.py': [Errno 2] No such file or directory
python extract_feature_print.py "device" 1 0 0 "/content/Mangio-RVC-Fork/logs/EVRAART" v2
python3: can't open file '/content/Mangio-RVC-Fork/extract_feature_print.py': [Errno 2] No such file or directory

i have the same error as this guy, except i did put .wav files, and that i already tried installing depencies again

brittle wing Oct 7, 2024, 5:37 PM

#

How much crepe hop length in inference and training

sly sluice Oct 7, 2024, 5:38 PM

#

sly sluice python trainset_preprocess_pipeline_print.py "/content/dataset/EVRAART" 40000 2 ...

i'm thinking about deleting the whole mango-RVC-fork folder to install it again is this a good idea?

analog obsidian Oct 7, 2024, 5:41 PM

#

sly sluice python trainset_preprocess_pipeline_print.py "/content/dataset/EVRAART" 40000 2 ...

mangio rvc fork is very outdated and most of the dependencies aren't compatible to each other anymore
for training use mainline (the original rvc) or applio (fork of mainline)
mainline has faster ui and in some cases, training is faster than applio
applio has slower ui but some claim they have better training speed there

both options give the same result in terms of model quality, etc

analog obsidian Oct 7, 2024, 5:41 PM

#

brittle wing How much crepe hop length in inference and training

64 for both

sly sluice Oct 7, 2024, 5:42 PM

#

analog obsidian mangio rvc fork is very outdated and most of the dependencies aren't compatible ...

k good thanks

rare gobletBOT Oct 7, 2024, 5:42 PM

#

Ayo? @sly sluice level 1 !!! lfg

brittle wing Oct 7, 2024, 5:42 PM

#

analog obsidian 64 for both

Wasn't it 64x2=128?

analog obsidian Oct 7, 2024, 5:43 PM

#

brittle wing Wasn't it 64x2=128?

128 is too innacurate when i tried training with that value (the model ended up having more voice cracks)

brittle wing Oct 7, 2024, 5:43 PM

#

analog obsidian 128 is too innacurate when i tried training with that value (the model ended up ...

Okie but I'm asking about inference rn...and yes you're correct

analog obsidian Oct 7, 2024, 5:43 PM

#

brittle wing Okie but I'm asking about inference rn...and yes you're correct

for inference try 64 or 32

brittle wing Oct 7, 2024, 5:44 PM

#

I remember training a model with 128 hop length and it sounded bad

hidden dew Oct 7, 2024, 6:30 PM

#

where do i find the saved models while training?

#

i cant find them

#

#

nevermind

#

i found them

#

lol

crude flame Oct 7, 2024, 6:47 PM

#

Bruh why am i getting this error now, applio was working just fine the other day

#

ive already tried reinstalling the newest complied version and last versions pre compiled and it still gave me that error

analog obsidian Oct 7, 2024, 6:50 PM

#

crude flame ive already tried reinstalling the newest complied version and last versions pre...

have u tried updating ur gpu drivers?

crude flame Oct 7, 2024, 6:55 PM

#

analog obsidian have u tried updating ur gpu drivers?

yup

nocturne mural Oct 7, 2024, 6:57 PM

#

.\env\python.exe -m pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --upgrade --index-url https://download.pytorch.org/whl/cu121

#

try reinstalling the torch dependencies again

crude flame Oct 7, 2024, 6:58 PM

#

still same error

nocturne mural Oct 7, 2024, 6:58 PM

#

DeadCat

analog obsidian Oct 7, 2024, 7:01 PM

#

crude flame still same error

what if u just delete python and everything related to it lols, thats what i do when shit stops working

noble vortex Oct 7, 2024, 7:15 PM

#

ive been trying to use rvc webui for model training, but when i click on one-click training, the output information box has been stuck on 'processing data' for the past hour. any suggestions?

twilit kernel Oct 7, 2024, 7:33 PM

#

Hi did anyone manage to download Mangio on Mac?

low shard Oct 7, 2024, 7:39 PM

#

twilit kernel Hi did anyone manage to download Mangio on Mac?

mangio is outdated, its better to use applio or mainline, however, mac can only inference (use models) locally (on ur pc), i would suggest to use cloud (remote good pc)

twilit kernel Oct 7, 2024, 7:40 PM

#

How can I cloud for RVC?

#

Can I download Applio on mac?

low shard Oct 7, 2024, 7:41 PM

#

twilit kernel How can I cloud for RVC?

for inference ilaria rvc zero or applio colab

Google Docs

Ilaria RVC Zero

Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...

Applio Colab

Last update: June 15, 2024

#

For rvc training cloud you can choose between:

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)

twilit kernel Oct 7, 2024, 7:42 PM

#

low shard for [inference ilaria rvc zero](https://docs.google.com/document/d/1YbXcLFPaGjhO...

Doees it work on Mac?

rare gobletBOT Oct 7, 2024, 7:42 PM

#

Ayo? @twilit kernel level 1 !!! lfg

low shard Oct 7, 2024, 7:42 PM

#

twilit kernel Can I download Applio on mac?

if u wanna do it locally, theres a mac installation: https://docs.applio.org/getting-started/installation

But i Would HIGHLY suggest to just use cloud

Installation - Applio

Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance

low shard Oct 7, 2024, 7:42 PM

#

twilit kernel Doees it work on Mac?

yes, its cloud, it doesnt run on ur pc, it runs on a remote good pc

twilit kernel Oct 7, 2024, 7:43 PM

#

And it makes good results?

low shard Oct 7, 2024, 7:43 PM

#

twilit kernel And it makes good results?

yes, rvc is the best Speech To Speech program

#

its used by like 90% of ai covers

twilit kernel Oct 7, 2024, 7:43 PM

#

Thanks I'll try

low shard Oct 7, 2024, 7:43 PM

#

yw

twilit kernel Oct 7, 2024, 7:44 PM

#

the cloud version works as good as localy?

low shard Oct 7, 2024, 7:44 PM

#

twilit kernel the cloud version works as good as localy?

the performance of the local one depends on ur pc, and cloud will run better than your mac

#

in terms of quality: yes, its just the same program

twilit kernel Oct 7, 2024, 7:45 PM

#

even if I have m1 pro?

low shard Oct 7, 2024, 7:45 PM

#

twilit kernel even if I have m1 pro?

yea, an A100 (The ZeroGPU used by Ilaria RVC Zero) is way better than that

twilit kernel Oct 7, 2024, 7:45 PM

#

thanks

low shard Oct 7, 2024, 7:45 PM

#

those use AI Specialized gpus, like T4, A100, etc

#

Yw

low shard Oct 7, 2024, 7:46 PM

#

twilit kernel thanks

for training id suggest Kaggle as it gives the most gpu time so u wont lose ur work

For inference i would suggest ilaria rvc zero

feral plank Oct 7, 2024, 7:59 PM

#

Is there a way I can do this on mobile

low shard Oct 7, 2024, 8:00 PM

#

feral plank Is there a way I can do this on mobile

Inference (use models) or train (make models)

feral plank Oct 7, 2024, 8:16 PM

#

low shard Inference (use models) or train (make models)

i don’t get it sorry

low shard Oct 7, 2024, 8:16 PM

#

feral plank i don’t get it sorry

are you trying to use models for pre-recorded audios like ai covers, or make models?

#

or are you trying to use modles in realtime for voice changing in calls?

feral plank Oct 7, 2024, 8:17 PM

#

low shard are you trying to use models for pre-recorded audios like ai covers, or make mod...

use models

low shard Oct 7, 2024, 8:17 PM

#

feral plank use models

for pre-recorded audios right?

feral plank Oct 7, 2024, 8:18 PM

#

low shard for pre-recorded audios right?

yeah, i’m trying to use a donald duck voice for some audios i found but the files won’t work, unless it’s not for mobile to do that

rare gobletBOT Oct 7, 2024, 8:18 PM

#

Ayo? @feral plank level 1 !!! lfg

low shard Oct 7, 2024, 8:19 PM

#

feral plank yeah, i’m trying to use a donald duck voice for some audios i found but the file...

idk what ur using, but this is RVC Technology

You could technically do it locally on ur phone but its on CPU so slow and not suggested

Its way better u use cloud (remote good pc), use ilaria rvc zero

Google Docs

Ilaria RVC Zero

Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...

radiant loom Oct 7, 2024, 8:28 PM

#

how many epochs should i train my IA model?

rare gobletBOT Oct 7, 2024, 8:28 PM

#

Ayo? @radiant loom level 1 !!! lfg

radiant loom Oct 7, 2024, 8:28 PM

#

I have 9:44 min of audio

keen pollen Oct 7, 2024, 8:28 PM

#

Hey, i downloaded my voice model and i cant find files, pls dm me

low shard Oct 7, 2024, 8:29 PM

#

radiant loom how many epochs should i train my IA model?

there isn't a right amount of epochs, see https://docs.ai-hub.wtf/rvc/resources/epochs--tensorboard/

Epochs & TensorBoard

Last update: Feb 10, 2024

keen pollen Oct 7, 2024, 8:30 PM

#

low shard there isn't a right amount of epochs, see https://docs.ai-hub.wtf/rvc/resources/...

Can you dm me?

low shard Oct 7, 2024, 8:31 PM

#

keen pollen Can you dm me?

its better to ask here than dms

keen pollen Oct 7, 2024, 8:31 PM

#

I cant send pic here

rare gobletBOT Oct 7, 2024, 8:31 PM

#

Ayo? @keen pollen level 1 !!! lfg

low shard Oct 7, 2024, 8:31 PM

#

keen pollen Hey, i downloaded my voice model and i cant find files, pls dm me

how did u train it? u sure u downloaded the .pth and .index?

keen pollen Oct 7, 2024, 8:31 PM

#

Nvm

#

now i can xD

keen pollen Oct 7, 2024, 8:31 PM

#

low shard how did u train it? u sure u downloaded the .pth and .index?

I downloaded folder

low shard Oct 7, 2024, 8:31 PM

#

Oh, sorry but i cant help much about local

radiant loom Oct 7, 2024, 8:31 PM

#

low shard there isn't a right amount of epochs, see https://docs.ai-hub.wtf/rvc/resources/...

tysn

low shard Oct 7, 2024, 8:32 PM

#

I don't do things locally, i use cloud

#

I would suggest using a more updated version like mainline or applio tho

low shard Oct 7, 2024, 8:32 PM

#

keen pollen Nvm

maybe this could help https://docs.ai-hub.wtf/rvc/local/mangio/#15-gather-models-files

Mangio

Last update: Mar 8, 2024

radiant loom Oct 7, 2024, 8:41 PM

#

@low shard can u help me installing this app?

#

idk what im doing wrong

#

low shard Oct 7, 2024, 8:47 PM

#

radiant loom <@911742715019001897> can u help me installing this app?

are you doing it locally or using google colab?

radiant loom Oct 7, 2024, 8:47 PM

#

locally

#

or idk

#

really

low shard Oct 7, 2024, 8:48 PM

#

radiant loom really

did you download mangio rvc on ur pc?

radiant loom Oct 7, 2024, 8:48 PM

#

i use mangio rvc

#

ye

low shard Oct 7, 2024, 8:48 PM

#

radiant loom ye

first, whats ur pc gpu?

radiant loom Oct 7, 2024, 8:48 PM

#

2060

#

im cooked?

simple ore Oct 7, 2024, 8:52 PM

#

radiant loom idk what im doing wrong

you ran tensorboard without any logs present?

radiant loom Oct 7, 2024, 8:53 PM

#

idkl what are u talking abt

#

im noob

sullen jungle Oct 7, 2024, 9:02 PM

#

Ive got these 2 models, both v2 and 40k, but it keeps saying they are different versions

rare gobletBOT Oct 7, 2024, 9:02 PM

#

Ayo? @sullen jungle level 2 !!! lfg

brittle wing Oct 7, 2024, 11:20 PM

#

How do I make my ai vocals sound more realistic?Like what kind of lowpass/highpass filter or settings do I use

#

https://vm.tiktok.com/ZGdJph7aE/
Like this video

frail plank Oct 7, 2024, 11:48 PM

#

you guys probably get this all the time

#

but how do you make voice models

brittle wing Oct 7, 2024, 11:59 PM

#

Rvc

#

Chat is there a website version for it frfr

brittle wing Oct 8, 2024, 1:58 AM

#

could anyone help im not hearing any outputs for the voice changer (rvc google colab) but I am for just regulaur in discord

prisma carbon Oct 8, 2024, 2:01 AM

#

how do i use a voice model 😦

brittle wing Oct 8, 2024, 2:08 AM

#

prisma carbon how do i use a voice model 😦

What system are you using

rare gobletBOT Oct 8, 2024, 2:08 AM

#

Ayo? @brittle wing level 1 !!! lfg

keen stratus Oct 8, 2024, 2:10 AM

#

how to make these settings:
Epoch: 620
Steps: 9000+
Pretrain: Snowie V3

#

in voice changer there is only: Gain, pitch, index, chunk and extra

fading bone Oct 8, 2024, 2:46 AM

#

when your using applio and you finish training a model. how do i export the pth and the index file to my downloads folder or google drive

spice owl Oct 8, 2024, 3:13 AM

#

-local

azure marshBOT Oct 8, 2024, 3:13 AM

#

spice owl -local

🖥️ Local stuff

🍏 Applio, by IA Hispano GitHub
Mangio-RVC-Fork, by Mangio621 Huggingface
RVC Studio, by SayanoAI Huggingface
AICoverGen, by SociallyIneptWeeb GitHub
Replay, by Replay Team Website
Original RVC, by the RVC-Project team GitHub
GPT-SoVITS, by RVC-Boss GitHub

Credits to Faze Masta and Antasma for compiling these links.

golden walrus Oct 8, 2024, 3:52 AM

#

guys, can i ask why i tried to train but it only process 1 step/epoch ?

simple ore Oct 8, 2024, 4:03 AM

#

golden walrus guys, can i ask why i tried to train but it only process 1 step/epoch ?

you provided unsupported audio, you did not split audio, extract features did nothing, you're training on two mute files

golden walrus Oct 8, 2024, 4:04 AM

#

kittypawbite but i splitted it

simple ore Oct 8, 2024, 4:04 AM

#

go to logs/yourmodel and see what files are there

golden walrus Oct 8, 2024, 4:05 AM

#

xd i stopped that one

#

btw, about pretrained

#

should i use these with high steps or those with low point

#

kittypawbite

simple ore Oct 8, 2024, 4:05 AM

#

use default pretrain

golden walrus Oct 8, 2024, 4:06 AM

#

i mean, default pretrain don't support my language so i tried to make my own pretrain xd

simple ore Oct 8, 2024, 4:07 AM

#

you can not make a good pretrain from scratch

golden walrus Oct 8, 2024, 4:07 AM

#

kittyblush so is there anyway to make 1

simple ore Oct 8, 2024, 4:07 AM

#

at least not without using some magical way the original one was made

#

I doubt the original pretrain had russian language, yet a model trained with it for just 20 steps does fine

golden walrus Oct 8, 2024, 4:09 AM

#

pepecry i tried to make one with vietnamese

simple ore Oct 8, 2024, 4:09 AM

#

it may take some extra source data to shape it up, but it is better to use an existing pretrain as a base

#

instead of trying to do it all from the scratch with 100+ hours of audio

golden walrus Oct 8, 2024, 4:11 AM

#

ah so, base pretrain + train one with my desired language, then use these D and G to train another voice i want ?

simple ore Oct 8, 2024, 4:12 AM

#

i mean... you can do that too

#

but I mean use pretrain with 30-60 min of audio in your desired language and voice

#

if it ends up not good enought, use D/G from it + more audio

golden walrus Oct 8, 2024, 4:13 AM

#

kittyblush i got 2 hours of audio

#

oh okay, i got it

simple ore Oct 8, 2024, 4:14 AM

#

you can always buid up on top of existing model

#

it simply adjusts weights

#

even just doing 5-10 epochs on top of default pretrain you should hear your trained voice, maybe not perfectly speaking certain syllables, but close enough and training longer should fix that

golden walrus Oct 8, 2024, 4:24 AM

#

-rc

#

-rvc

azure marshBOT Oct 8, 2024, 4:24 AM

#

golden walrus -rvc

Documentation

AI HUB Docs

https://docs.aihub.wtf/

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

molten relic Oct 8, 2024, 6:46 AM

#

Hey! I’m sorry I’m new to this AI stuff, where do I start to start learning to create a voice model? I apologize if this is an inconvenience to some, I’m very new and I’m just really want to learn! Very appreciate any help would be awesome! Thanks anyone that responses!

low shard Oct 8, 2024, 7:17 AM

#

molten relic Hey! I’m sorry I’m new to this AI stuff, where do I start to start learning to c...

whats ur pc gpu?

molten relic Oct 8, 2024, 7:51 AM

#

low shard whats ur pc gpu?

I’m in bed right now so unsure but how does a GPU effect training?

low shard Oct 8, 2024, 7:52 AM

#

molten relic I’m in bed right now so unsure but how does a GPU effect training?

You could run the ai locally (on ur pc), meaning it runs on ur pc

#

like the same way u need a good gpu for games, ai takes alot of computing

#

especially training

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

simple ore Oct 8, 2024, 7:55 AM

#

molten relic I’m in bed right now so unsure but how does a GPU effect training?

if you have a push-cart you cant move a mountain, that requires a quarry dump truck

#

AI training requires a tremendeous amount of number crunching with specialized hardware, you can't do it on a cheap laptop wit intergrated GPU

low shard Oct 8, 2024, 7:58 AM

#

simple ore AI training requires a tremendeous amount of number crunching with specialized h...

simple ore Oct 8, 2024, 8:03 AM

#

woosh

rare gobletBOT Oct 8, 2024, 8:03 AM

#

Ayo? @simple ore level 15 !!! lfg

runic schooner Oct 8, 2024, 8:56 AM

#

i need help

low shard Oct 8, 2024, 8:58 AM

#

runic schooner i need help

!howtoask

patent trellisBOT Oct 8, 2024, 8:58 AM

#

low shard !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

runic schooner Oct 8, 2024, 9:04 AM

#

low shard !howtoask

I’m having trouble finding the right file to download it my cpu is amd ryzen 5 3600 6-core processor

low shard Oct 8, 2024, 9:04 AM

#

runic schooner I’m having trouble finding the right file to download it my cpu is amd ryzen 5 3...

to download what?

#

are you looking for realtime voice changing for calls, use models on pre recorded audios or make models

#

and also, i need the gpu

runic schooner Oct 8, 2024, 9:07 AM

#

Rtx 2060

rare gobletBOT Oct 8, 2024, 9:07 AM

#

Ayo? @runic schooner level 1 !!! lfg

low shard Oct 8, 2024, 9:07 AM

#

runic schooner Rtx 2060

alr, so what are u looking for?

scenic arch Oct 8, 2024, 9:10 AM

#

is it just me or are the aihub docs down?

low shard Oct 8, 2024, 9:11 AM

#

scenic arch is it just me or are the aihub docs down?

yes, check #📰│dev-updates message

pastel oak Oct 8, 2024, 9:49 AM

#

low shard alr, so what are u looking for?

Bro does NOT wanna answer questions 😭

#

3 times in a row lmao

low shard Oct 8, 2024, 9:51 AM

#

pastel oak Bro does NOT wanna answer questions 😭

did he also do that to u too?

#

😭

pastel oak Oct 8, 2024, 9:53 AM

#

Nah nah just seeing u ask him for what he needs 3 times with no answer is funny

low shard Oct 8, 2024, 9:55 AM

#

pastel oak Nah nah just seeing u ask him for what he needs 3 times with no answer is funny

😭

simple ore Oct 8, 2024, 9:55 AM

#

HE'S A CATFISHER I TELL YOU

low shard Oct 8, 2024, 9:56 AM

#

simple ore HE'S A CATFISHER I TELL YOU

what

simple ore Oct 8, 2024, 9:56 AM

#

Hiding his nasty desires

#

not answering questions

#

naughty naughty

pastel oak Oct 8, 2024, 10:00 AM

#

SMH

#

EXPOSE HIM

low shard Oct 8, 2024, 10:01 AM

#

pastel oak EXPOSE HIM

REAL

low shard Oct 8, 2024, 10:02 AM

#

pastel oak SMH

BAN HIM FOR CATFISHING lfg

molten relic Oct 8, 2024, 1:11 PM

#

low shard You could run the ai locally (on ur pc), meaning it runs on ur pc

Hey nick! / anyone that reads this for this purpose i have a NVIDIA Geforce RTX 3050 Ti

knotty moth Oct 8, 2024, 1:18 PM

#

molten relic Hey nick! / anyone that reads this for this purpose i have a NVIDIA Geforce RTX ...

laptop 4 GB variant? nope, not recommended for training, only inference

molten relic Oct 8, 2024, 1:19 PM

#

knotty moth laptop 4 GB variant? nope, not recommended for training, only inference

32 GB Installed Ram

molten relic Oct 8, 2024, 1:19 PM

#

knotty moth laptop 4 GB variant? nope, not recommended for training, only inference

Thanks for responding fast

knotty moth Oct 8, 2024, 1:20 PM

#

molten relic Thanks for responding fast

no

#

system ram is irrelevant

#

https://tenor.com/view/facepalm-gif-4576513125411549651

Tenor

molten relic Oct 8, 2024, 1:23 PM

#

knotty moth laptop 4 GB variant? nope, not recommended for training, only inference

I am unsure what you are asking my apologies, im really new to some stuff like this. I dont want for you to waste your time though! I tottaly understand being busy or if im to new to understand a lot!

knotty moth Oct 8, 2024, 1:26 PM

#

molten relic I am unsure what you are asking my apologies, im really new to some stuff like t...

I'm also busy playing around with SD, but not an excuse to not respond to you

molten relic Oct 8, 2024, 1:28 PM

#

knotty moth I'm also busy playing around with SD, but not an excuse to not respond to you

Ah thats okay!

Let me know what you need from me and ill figure it out and give it to you!

low shard Oct 8, 2024, 1:32 PM

#

molten relic I am unsure what you are asking my apologies, im really new to some stuff like t...

the memory of the gpu

#

its suggested to have 8 or more gb of memory of gpu aka vram for training

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

#

you should also be able to find the gpu memory

#

Also is it a desktop or laptop ?

molten relic Oct 8, 2024, 1:33 PM

#

12 GB

knotty moth Oct 8, 2024, 1:35 PM

#

molten relic 12 GB

u prob mean desktop 3060

low shard Oct 8, 2024, 1:35 PM

#

molten relic 12 GB

12gb vram should be good

molten relic Oct 8, 2024, 1:35 PM

#

Techically a Desktop but i shoved some parts from a few laptops nto it

low shard Oct 8, 2024, 1:35 PM

#

molten relic Techically a Desktop but i shoved some parts from a few laptops nto it

alr, rtx 3060 12gb is good

#

for both training and inference

molten relic Oct 8, 2024, 1:36 PM

#

Let me know where to start and im ready, sir nick!

#

and alisa!

low shard Oct 8, 2024, 1:38 PM

#

molten relic Let me know where to start and im ready, sir nick!

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, same quality tho
Mainline: The original RVC

molten relic Oct 8, 2024, 1:40 PM

#

Downloading now

tepid atlas Oct 8, 2024, 2:12 PM

#

JSONDecodeError Traceback (most recent call last)
<ipython-input-6-75abb3770c40> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:

5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end

JSONDecodeError: Expecting value: line 1 column 1 (char 0)

#

uh help

low shard Oct 8, 2024, 2:21 PM

#

tepid atlas --------------------------------------------------------------------------- JSON...

Don't use outdated colabs

#

What's ur PC GPU?

#

Yt tuts are outdated

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

tepid atlas Oct 8, 2024, 2:25 PM

#

Intel(R) UHD Graphics 630 and AMD Radeon RX 6600 XT

#

problem 2: python3: can't open file '/content/infer-web.py': [Errno 2] No such file or directory

low shard Oct 8, 2024, 2:33 PM

#

tepid atlas problem 2: python3: can't open file '/content/infer-web.py': [Errno 2] No such f...

you are using an outdated colab

low shard Oct 8, 2024, 2:33 PM

#

tepid atlas Intel(R) UHD Graphics 630 and AMD Radeon RX 6600 XT

You could be able to use RVC Locally (on ur pc) via: https://docs.applio.org/getting-started/installation#amd-gpu-support-windows with the AMD GPU

Installation - Applio

Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance

#

Google Colab is a Cloud Computing service (remote good PC), so used for weak PC

Your pc should be able to handle it

#

Btw, you are looking for using models for pre-recorded audios, or making models, or using models in realtime for voice changing in calls/games?

low shard Oct 8, 2024, 2:38 PM

#

low shard Btw, you are looking for using models for pre-recorded audios, or making models,...

@tepid atlas bc the one i sent is applio, an rvc fork (modified version) for making and using models for pre-recorded audios

#

for realtime voice changing for calls theres another program

tepid atlas Oct 8, 2024, 2:45 PM

#

okay

tepid atlas Oct 8, 2024, 2:45 PM

#

low shard Btw, you are looking for using models for pre-recorded audios, or making models,...

both pre-recorded audios and making models

low shard Oct 8, 2024, 2:46 PM

#

tepid atlas both pre-recorded audios and making models

yea then stick with Applio locally

tepid atlas Oct 8, 2024, 2:48 PM

#

okay

rare gobletBOT Oct 8, 2024, 2:48 PM

#

Ayo? @tepid atlas level 1 !!! lfg

tepid atlas Oct 8, 2024, 2:48 PM

#

lfg

rugged solar Oct 8, 2024, 3:55 PM

#

-colab

azure marshBOT Oct 8, 2024, 3:55 PM

#

rugged solar -colab

☁️ Google Colabs

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

rancid heron Oct 8, 2024, 4:10 PM

#

What happens if you run inference with a different pitch extraction method than the model was trained on?

simple ore Oct 8, 2024, 4:12 PM

#

does not matter

#

it is just a method, the range of values is about the same

brittle wing Oct 8, 2024, 5:15 PM

#

Error

delicate oak Oct 8, 2024, 5:16 PM

#

Ilaria rvc doesn't work anymore ?

polar plaza Oct 8, 2024, 5:18 PM

#

#

Can someone please tell me why this keeps happening

delicate oak Oct 8, 2024, 5:20 PM

#

polar plaza

Same

knotty moth Oct 8, 2024, 5:29 PM

#

polar plaza

but other zerogpu spaces worked on you?

molten relic Oct 8, 2024, 6:19 PM

#

I have a problem that showed up when i started the Train Model Button

#

~~

📎 message.txt

#

Any ideas or anything would be highly apperciative! Thanks so far for everyones thats helped me so far! I am just sorta bad at this stuff lol

pastel oak Oct 8, 2024, 6:28 PM

#

molten relic ~~

Put the RVC1006 folder somewhere outside of OneDrive

#

Buuut tbh I dont see any error messages unless I'm blind

#

Epoch 1 started training, how long did you wait before sending this txt file?

molten relic Oct 8, 2024, 6:40 PM

#

About a half a hour ish

#

I let it do its thing and made lunch came back and it came to that

#

Nothing was moving so I was worried

pastel oak Oct 8, 2024, 6:42 PM

#

Would wait for someone else to comment on it then, but would still move it out of OneDrive just in case

brittle wing Oct 8, 2024, 6:56 PM

#

Hey guys i was adding a new voice model but have a CKPT file what do i do with this?

rare gobletBOT Oct 8, 2024, 6:56 PM

#

Ayo? @brittle wing level 3 !!! lfg

brittle wing Oct 8, 2024, 6:56 PM

#

I got the path file and CKPT but no index, what do i do? 🤔

low shard Oct 8, 2024, 6:56 PM

#

polar plaza

check: https://docs.google.com/document/d/1YbXcLFPaGjhOdG5NFkK3QrucCEpHZBwFUxkeMO8aB18/edit#heading=h.mfxmgfqiaevu

Google Docs

Ilaria RVC Zero

Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...

low shard Oct 8, 2024, 6:58 PM

#

delicate oak Same

check https://docs.google.com/document/d/1YbXcLFPaGjhOdG5NFkK3QrucCEpHZBwFUxkeMO8aB18/edit#heading=h.mfxmgfqiaevu

Google Docs

Ilaria RVC Zero

Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...

low shard Oct 8, 2024, 7:02 PM

#

knotty moth but other zerogpu spaces worked on you?

its a limitation of the inference time that it takes for converting in ZeroGPU huggingface spaces

#

i explained it above better in the guide

#

@proven hill can't u put the limit back to 300s (5 min) instead of 1 min on Ilaria RVC Zero?

molten relic Oct 8, 2024, 7:49 PM

#

low shard i explained it above better in the guide

Hey Nick any ideas for my issue?

molten relic Oct 8, 2024, 7:50 PM

#

pastel oak Would wait for someone else to comment on it then, but would still move it out o...

I just want another opinion on it as it could be just the OneDrive thing but just checking as they are unsure

low shard Oct 8, 2024, 8:14 PM

#

molten relic Hey Nick any ideas for my issue?

I don't do local sorry

timid olive Oct 8, 2024, 8:30 PM

#

low shard I don't do local sorry

In some parts of the songs, there are backing vocals; should they be separated?

#

You can listen to the sample sound below; it is only the vocal.

#

This example is from a single song.

#

Do you think I should separate them? Some songs sound different to me.

low shard Oct 8, 2024, 9:15 PM

#

timid olive In some parts of the songs, there are backing vocals; should they be separated?

Yea

rare gobletBOT Oct 8, 2024, 9:15 PM

#

Ayo? @low shard level 108 !!! lfg

low shard Oct 8, 2024, 9:15 PM

#

With HP KARAOKE 6 of UVR

timid olive Oct 8, 2024, 10:47 PM

#

low shard With HP KARAOKE 6 of UVR

First, I processed it through this model, then through the other model, and finally used this model to delete the reverb.

#

Do you think I did it correctly? Will it be of good quality? Also, some parts sound different to me; for example, the audio.

#

It sounds like the same song but with a different sound. Will it cause problems in training?

#

I’m curious about this.

molten relic Oct 9, 2024, 12:29 AM

#

low shard I don't do local sorry

what do you do then?

rare gobletBOT Oct 9, 2024, 12:29 AM

#

Ayo? @molten relic level 4 !!! lfg

knotty moth Oct 9, 2024, 12:44 AM

#

low shard its a limitation of the inference time that it takes for converting in ZeroGPU h...

I have successfully inferred ~3 mins audio, though the GPU task aborted issue also sometimes occured, but I think it should be still less demanding than even Flux1.Dev generation, etc

molten relic Oct 9, 2024, 12:52 AM

#

~~

📎 message.txt

#

Slightly differnt then last time but still no generation, not in a onedrive this time ago, anyone have simular problems or know how to fix?

simple ore Oct 9, 2024, 1:26 AM

#

molten relic Slightly differnt then last time but still no generation, not in a onedrive this...

says not in a onedrive, yet "C:\Users\storm\OneDrive"

#

but nothing on the is an 'error'

molten relic Oct 9, 2024, 1:51 AM

#

simple ore says not in a onedrive, yet "C:\Users\storm\OneDrive"

Huhhhhh? I switched the data set onto a external hard drive

simple ore Oct 9, 2024, 1:57 AM

#

logs and everything else is still on onedrive?

knotty moth Oct 9, 2024, 2:25 AM

#

molten relic Slightly differnt then last time but still no generation, not in a onedrive this...

uninstall onedrive

molten relic Oct 9, 2024, 3:17 AM

#

knotty moth uninstall onedrive

Son of a nugget I just got this and in bed I’ll resume tomarrow.

Would anything bad happen if I do so?

knotty moth Oct 9, 2024, 3:23 AM

#

molten relic Son of a nugget I just got this and in bed I’ll resume tomarrow. Would anything...

what

#

if you don't need that bloatware, why not

molten relic Oct 9, 2024, 3:44 AM

#

knotty moth if you don't need that bloatware, why not

I don’t know I thought was it something I needed

fleet cargo Oct 9, 2024, 4:02 AM

#

what should i use

#

for text to speech?

#

not okada i suppose?

#

@knotty moth

low shard Oct 9, 2024, 6:02 AM

#

knotty moth I have successfully inferred ~3 mins audio, though the GPU task aborted issue al...

The limitation isn't about your audio file length, it's about how much time it takes to inference that audio

If it takes more than 1 min to inference that audio, it gives task GPU aborted

low shard Oct 9, 2024, 6:06 AM

#

timid olive Do you think I did it correctly? Will it be of good quality? Also, some parts so...

i cant listen to it rn

low shard Oct 9, 2024, 6:06 AM

#

molten relic what do you do then?

I do Cloud, i got a bad pc so i use a remote good pc

low shard Oct 9, 2024, 6:08 AM

#

fleet cargo not okada i suppose?

wokada is for speech to speech, if u want realtime text to speech, u can look at https://docs.google.com/document/d/12hCYJqNCFl6jWKoVvCxtwt2V6nSoilgi5La8dkZa1KY/edit#heading=h.xweoq2pdv4uj or use the tts client https://github.com/w-okada/ttsclient (but cant really help for the second one)

GitHub

GitHub - w-okada/ttsclient

Contribute to w-okada/ttsclient development by creating an account on GitHub.

Google Docs

AIs for TTS

Table Of Contents Introduction Index of the best TTS 1. ElevenLabs/11Labs: 2. Bark TTS: 3. Edge TTS: 4. StyleTTS2: 6. XTTS2: 8. MetaVoice: 9. MeloTTS: 10. GPT-SoVITS: 11. gTTS: Use TTS in Realtime on calls (ONLY PC) Introduction TTS Means Text To Speech! Inference means when you use the TTS. ...

molten relic Oct 9, 2024, 7:49 AM

#

low shard I do Cloud, i got a bad pc so i use a remote good pc

What’s cloud?

low shard Oct 9, 2024, 7:51 AM

#

molten relic What’s cloud?

remote good pc, basically i run theprogram in a cloud computing service like google colab, kaggle or lightning.ai, instead of running them on my bad pc

molten relic Oct 9, 2024, 7:54 AM

#

low shard remote good pc, basically i run theprogram in a cloud computing service like goo...

Interesting

low shard Oct 9, 2024, 7:55 AM

#

molten relic Interesting

its way better to use it locally on a good pc but i got integrated graphics boohooh

dusk tulip Oct 9, 2024, 12:50 PM

#

.

pastel oak Oct 9, 2024, 1:00 PM

#

.

jaunty shale Oct 9, 2024, 1:56 PM

#

https://docs.aihub.wtf/ doesn't work for me. Trying to find RVC Disconnected guide so I can re-learn stuff again.

does anyone have link?

pastel oak Oct 9, 2024, 2:00 PM

#

jaunty shale https://docs.aihub.wtf/ doesn't work for me. Trying to find RVC Disconnected gui...

#📰│dev-updates

jaunty shale Oct 9, 2024, 2:01 PM

#

pastel oak <#1159380240271953940>

thank you so much! I appreciate it

lucid cove Oct 9, 2024, 2:56 PM

#

could anyone help me i got screenshots

languid lotus Oct 9, 2024, 4:49 PM

#

i need help with rvc can anyone help me

jagged hawk Oct 9, 2024, 4:58 PM

#

@low shard i finally created a voice model, how can i upload it in the channel voice-models?

low shard Oct 9, 2024, 5:41 PM

#

languid lotus i need help with rvc can anyone help me

!howtoask

patent trellisBOT Oct 9, 2024, 5:41 PM

#

low shard !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

low shard Oct 9, 2024, 5:41 PM

#

jagged hawk <@911742715019001897> i finally created a voice model, how can i upload it in th...

check https://docs.ai-hub.wtf/extra/model-maker-role/

Model Maker Role

Last update: May 20, 2024

noble dawn Oct 9, 2024, 6:15 PM

#

Anyone can help me train on local ? Rq

#

If they free

quasi lynx Oct 9, 2024, 6:17 PM

#

noble dawn Anyone can help me train on local ? Rq

Whatchu need?

noble dawn Oct 9, 2024, 6:18 PM

#

It’s giving me an error whenever I click train

#

Idk why

#

When I got a 3060Ti gpu nevida

#

😭

quasi lynx Oct 9, 2024, 6:19 PM

#

Might help if you send the error

noble dawn Oct 9, 2024, 6:19 PM

#

Yes

#

I got u ima dm u

#

Ty

#

Let me screen shot rq

remote trellis Oct 9, 2024, 9:00 PM

#

Hi

#

When I log in with new docs, the main menu opens but when I click on the app or any other button, it doesn't work. Why?

#

@timid olive

pastel oak Oct 9, 2024, 9:14 PM

#

remote trellis When I log in with new docs, the main menu opens but when I click on the app or ...

its not fixed yet

remote trellis Oct 9, 2024, 9:18 PM

#

So how do I make an AI cover with the applio link?

#

@pastel oak

pastel oak Oct 9, 2024, 9:19 PM

#

remote trellis So how do I make an AI cover with the applio link?

Are you not talking about the docs guide? the normal applio stuff should still work

remote trellis Oct 9, 2024, 9:19 PM

#

pastel oak Are you not talking about the docs guide? the normal applio stuff should still w...

Not

#

Can you send me the collab link?

#

For aı cover

pastel oak Oct 9, 2024, 9:20 PM

#

https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb

Google Colab

remote trellis Oct 9, 2024, 9:20 PM

#

Thanks

#

@pastel oak @pastel oak

#

Look, now I opened the "applio" and I couldn't download the sound I wanted. I paste the sound I wanted into this doenload, it says it downloaded in 1 second but it doesn't download, why? Sound model: https://applio.org/models?id=1218683186431660072

rare gobletBOT Oct 9, 2024, 9:33 PM

#

Ayo? @remote trellis level 2 !!! lfg

pastel oak Oct 9, 2024, 9:33 PM

#

I dont know applio colab

remote trellis Oct 9, 2024, 9:33 PM

#

pastel oak https://colab.research.google.com/github/iahispano/applio/blob/master/assets/App...

This

pastel oak Oct 9, 2024, 9:34 PM

#

thats not what i meant

#

i cant help with applio colab

remote trellis Oct 9, 2024, 9:34 PM

#

Or

pastel oak Oct 9, 2024, 9:34 PM

#

try ilaria rvc zero

brave garnetBOT Oct 9, 2024, 9:34 PM

#

Ilaria RVC: CLICK HERE 🤗

Guide on how to use it: CLICK HERE 📝

Don't forget to thank Ilaria if you find it useful! 💖

Ilaria RVC - a Hugging Face Space by TheStinger

pastel oak Oct 9, 2024, 9:34 PM

#

download model from applio and manual upload it

remote trellis Oct 9, 2024, 9:34 PM

#

So where can I find the hugging face of this applio voice model?

remote trellis Oct 9, 2024, 9:34 PM

#

pastel oak download model from applio and manual upload it

didnt worl

#

Worl

#

Work

pastel oak Oct 9, 2024, 9:35 PM

#

Did you even read what i said

#

Modeli applio'dan BİLGİSAYARINA indir
Ilaria RVC Zero'yu aç
"Model Loader" sekmesine git
.pth ve .index dosyasını yükle

remote trellis Oct 9, 2024, 9:37 PM

#

im mobile

#

@pastel oak

pastel oak Oct 9, 2024, 9:44 PM

#

Dont know

brave garnetBOT Oct 9, 2024, 11:04 PM

#

RVC Colabs and Spaces

⠀

Local Forks 🖥️

⠀
Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.

Applio
Simplified, suggested for all, by the Applio team.

RVC Studio
Simplified, suggested for all, by SayanoAI.

Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.

AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.

Replay
From the greators of weights.gg, excellent product for everyone.
⠀

next wharf Oct 10, 2024, 4:30 AM

#

what is the difference between FCPE and rmvpe?

low shard Oct 10, 2024, 5:38 AM

#

next wharf what is the difference between FCPE and rmvpe?

Fcpe is faster, rmvpe is better quality

next wharf Oct 10, 2024, 5:39 AM

#

low shard Fcpe is faster, rmvpe is better quality

thanks

low shard Oct 10, 2024, 5:40 AM

#

remote trellis So how do I make an AI cover with the applio link?

First of all. What's ur PC GPU?

low shard Oct 10, 2024, 9:24 AM

#

remote trellis When I log in with new docs, the main menu opens but when I click on the app or ...

its fixed now btw

low shard Oct 10, 2024, 9:24 AM

#

pastel oak its not fixed yet

fixed it https://github.com/AIHubCentral/docs/pull/7#event-14585577515

GitHub

fix hyperlinks for the temp docs by Nick088Official · Pull Request ...

As the main docs are down, the hyperlinks don’t work, so i changed the hyperlinks from the main docs to the temporary docs for now (so from docs.aihub.wtf to docs.ai-hub.wtf)

pastel oak Oct 10, 2024, 9:27 AM

#

low shard fixed it https://github.com/AIHubCentral/docs/pull/7#event-14585577515

Boss

low shard Oct 10, 2024, 9:27 AM

#

pastel oak Boss

there were 256 broke hyperlinks trolley

#

just fixed it with search and replace all yk lol

stark wadi Oct 10, 2024, 11:46 AM

#

I can't get applio to install on mac. It keeps saying that a java runtime wasn't found. Anyone know how to fix this?

rare gobletBOT Oct 10, 2024, 11:46 AM

#

Ayo? @stark wadi level 4 !!! lfg

brave garnetBOT Oct 10, 2024, 12:25 PM

#

Voicechanger Settings (Okada)

⠀

Settings for Nvidia GPUs

F0 Det.: rmvpe (suggested for all series)

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

⠀

brave garnetBOT Oct 10, 2024, 1:08 PM

#

Voicechanger Settings (Okada)

⠀

Settings for AMD GPUs

Don't forget that your models needs to be converted in ONNX!

F0 Det.: rmvpe_onnx (suggested for all series)

7xxx XT cards: 112-128 chunk | +16384 extra
6xxx XT cards: 128-192 chunk | +16384 extra
5xxx XT cards: 192-256 chunk | +8192 extra

RX 580: 192-256 chunk | +8192 extra
RX 570: 192-256 chunk | +8192 extra
RX 560: 256-384 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

⠀

languid lotus Oct 10, 2024, 2:15 PM

#

what is the best model pre train please awnser fast

pastel oak Oct 10, 2024, 2:16 PM

#

languid lotus what is the best model pre train please awnser fast

Original

languid lotus Oct 10, 2024, 2:17 PM

#

ok thank you

patent quarry Oct 10, 2024, 2:28 PM

#

How do I transfer to ONNX?

mighty vortex Oct 10, 2024, 2:39 PM

#

why can't I mount drive on the colab?

low shard Oct 10, 2024, 2:42 PM

#

mighty vortex why can't I mount drive on the colab?

What Google colab are you using and what's the issue specifically

mighty vortex Oct 10, 2024, 2:42 PM

#

i'm using disconnected, the error is "credential propagation was unsuccessful"

low shard Oct 10, 2024, 2:43 PM

#

mighty vortex i'm using disconnected, the error is "credential propagation was unsuccessful"

Can you send the Google colab link?

#

You need to be sure also to always allow the Google drive when you get the popup

mighty vortex Oct 10, 2024, 2:43 PM

#

https://colab.research.google.com/drive/1XIPCP9ken63S7M6b5ui1b36Cs17sP-NS#scrollTo=ZodNcumpg-JM

rare gobletBOT Oct 10, 2024, 2:43 PM

#

Ayo? @mighty vortex level 1 !!! lfg

mighty vortex Oct 10, 2024, 2:44 PM

#

I do allow

low shard Oct 10, 2024, 2:44 PM

#

mighty vortex https://colab.research.google.com/drive/1XIPCP9ken63S7M6b5ui1b36Cs17sP-NS#scroll...

Yeah that seems the right colab

#

Try re running it and give it access again

mighty vortex Oct 10, 2024, 2:45 PM

#

i've done that multiple times already

prisma grove Oct 10, 2024, 2:45 PM

#

do you guys know how long would it take to train an rvc model locally

#

compared to training on colab

pastel oak Oct 10, 2024, 2:45 PM

#

prisma grove do you guys know how long would it take to train an rvc model locally

Depends on gpu

prisma grove Oct 10, 2024, 2:45 PM

#

rtx 2080ti

pastel oak Oct 10, 2024, 2:46 PM

#

Thats probably faster than the colab gpu

prisma grove Oct 10, 2024, 2:46 PM

#

colab's got t4 tho

low shard Oct 10, 2024, 2:47 PM

#

mighty vortex i've done that multiple times already

lemme check it myself rq

pastel oak Oct 10, 2024, 2:47 PM

#

prisma grove colab's got t4 tho

Tesla T4 is worse than 2080ti

prisma grove Oct 10, 2024, 2:48 PM

#

oh

#

I didn't know that

#

cool

low shard Oct 10, 2024, 2:49 PM

#

mighty vortex i've done that multiple times already

just tried it myself, i ran the cell, it asks me permission, then i choose google account and allow everything

#

works fine no issue

#

be sure to not modify its permissions

prisma grove Oct 10, 2024, 2:51 PM

#

#

or just
| name of the dataset.zip
| | audio files

#

or should I not even zip it for local training? 😵‍💫

pastel oak Oct 10, 2024, 2:56 PM

#

prisma grove or should I not even zip it for local training? 😵‍💫

Dont zip, put in a random folder and select the folder path in rvc

mighty vortex Oct 10, 2024, 3:01 PM

#

how many epochs should you train a model with a 3 minute dataset?

pastel oak Oct 10, 2024, 3:02 PM

#

mighty vortex how many epochs should you train a model with a 3 minute dataset?

Everything is measured by tensorboard graphs

mighty vortex Oct 10, 2024, 3:02 PM

#

but is there a general amount?

pastel oak Oct 10, 2024, 3:04 PM

#

No, every run, every dataset is unique

prisma grove Oct 10, 2024, 3:08 PM

#

I found this though? https://www.desmos.com/calculator/yeqx4dmcfm?lang=pl

rare gobletBOT Oct 10, 2024, 3:08 PM

#

Ayo? @prisma grove level 2 !!! lfg

prisma grove Oct 10, 2024, 3:18 PM

#

honestly I'm confused now, the calculator says that 20 minute dataset is about 100 epochs

#

but that's very low

#

what does the loss stuff mean?

#

and why is each epoch taking so long to train?

#

is 1 epoch per minute the normal speed? I haven't used rvc in a while

#

and I don't think I've ever trained a model locally

tropic plover Oct 10, 2024, 3:30 PM

#

prisma grove honestly I'm confused now, the calculator says that 20 minute dataset is about 1...

Like Shad said, u have to rely on Tensorboard to determine when to stop and which epoch to pick

prisma grove Oct 10, 2024, 3:30 PM

#

where is tensorboard

tropic plover Oct 10, 2024, 3:30 PM

#

prisma grove what does the loss stuff mean?

Found this here some time ago
https://drive.google.com/drive/folders/1o1ZZuUHQ6MuclA6B-AtQZlHlC5Uf34pb

prisma grove Oct 10, 2024, 3:32 PM

#

tropic plover Found this here some time ago https://drive.google.com/drive/folders/1o1ZZuUHQ6M...

I don't understand anything in that tutorial

#

I'm just gonna test all 7 models (700 epochs, saving every 100) and see which one sound the best

#

what's more concerning to me is how long it takes to train every epoch

#

what is the normal speed?

tropic plover Oct 10, 2024, 3:36 PM

#

It all depends on ur GPU and batch size, dataset size, etc. My 4060 on 4 batch size, 30min dataset is taking abt 2:00 per epoch trolley

low shard Oct 10, 2024, 3:36 PM

#

prisma grove where is tensorboard

Idk about local but check https://docs.ai-hub.wtf/rvc/resources/epochs-tensorboard/

Epochs & TensorBoard

Last update: Feb 10, 2024

stark wadi Oct 10, 2024, 3:38 PM

#

does anyone know how to install on mac m3 pro?

pastel oak Oct 10, 2024, 3:41 PM

#

prisma grove where is tensorboard

Go to the docs guide Nick linked, download the file, put it in the same rvc folder where you launch it, run it, thats all

#

It opens another webui

simple ore Oct 10, 2024, 3:43 PM

#

prisma grove I don't understand anything in that tutorial

saving every 100 epochs is silly

#

go every 10

prisma grove Oct 10, 2024, 3:44 PM

#

I don't have that much free space on my drive

simple ore Oct 10, 2024, 3:44 PM

#

unless you got hours and hours of sample audio files, using 700 epochs is crazy

prisma grove Oct 10, 2024, 3:44 PM

#

then why are people doing 700 epochs for 5 minute datasets 😵‍💫

simple ore Oct 10, 2024, 3:44 PM

#

they are stupid

#

or they follow a stupid guide

#

running 700 epochs on 5 minute file is trying to squeese a gallon of juice from one lemon

#

you can get all you can from 5 minute file in 20-50 epochs

prisma grove Oct 10, 2024, 3:46 PM

#

I have 18:50 long dataset

#

as in 18 minutes 50 seconds

simple ore Oct 10, 2024, 3:46 PM

#

should be under 200 epochs at most

#

again, use tensorboard to check

prisma grove Oct 10, 2024, 3:47 PM

#

yeah I got it now

pastel oak Oct 10, 2024, 3:51 PM

#

prisma grove I don't have that much free space on my drive

You can delete the first 50 epochs safely while saving every 10 epochs, then delete as you go if you hit another OT point etc

prisma grove Oct 10, 2024, 3:53 PM

#

another OT point?

#

shouldn't I stop the training when it starts to OT?

#

also it's saving every 50 epochs anyway lol

#

I think the interval is limited to 50

serene horizon Oct 10, 2024, 4:09 PM

#

mighty vortex https://colab.research.google.com/drive/1XIPCP9ken63S7M6b5ui1b36Cs17sP-NS#scroll...

I have tried using this but it says “You cannot currently connect to a GPU due to usage limits in Colab” for the last two days.

What can I do?

low shard Oct 10, 2024, 4:10 PM

#

stark wadi does anyone know how to install on mac m3 pro?

mac can only inference (use models) locally (on ur pc) not train (make models)

#

i dont have a mac but u could check https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md

GitHub

Retrieval-based-Voice-Conversion-WebUI/docs/en/README.en.md at main...

Easily train a good VC model with voice data <= 10 mins! - RVC-Project/Retrieval-based-Voice-Conversion-WebUI

#

tbh id suggest to just use cloud (remote good pc)

simple ore Oct 10, 2024, 4:16 PM

#

serene horizon I have tried using this but it says “You cannot currently connect to a GPU due t...

wait until tomorrow?

#

or whenever the limit expires

analog obsidian Oct 10, 2024, 4:17 PM

#

prisma grove then why are people doing 700 epochs for 5 minute datasets 😵‍💫

its 2024 and some people still believes 1000 epochs are better despite not being true at all lmao
PandaShrug just use tensorboard and select the lowest point in your g/total graph, for a 18 minute dataset your model should be done in less than 200 epochs

serene horizon Oct 10, 2024, 4:18 PM

#

simple ore or whenever the limit expires

No idea on how long the limit is there?

low shard Oct 10, 2024, 4:19 PM

#

serene horizon I have tried using this but it says “You cannot currently connect to a GPU due t...

you can:

Use an alt google account
Use kaggle which gives more gpu time but its harder
Wait until tomorrow
Pay for colab pro

#

btw, whats ur pc gpu?

simple ore Oct 10, 2024, 4:19 PM

#

the free usage period goes down the more you use it

#

and slowly resets back when you dont

knotty moth Oct 10, 2024, 4:20 PM

#

analog obsidian its 2024 and some people still believes 1000 epochs are better despite not being...

imagine the graph keeps going down on above 1000 epochs for some short dataset

low shard Oct 10, 2024, 4:20 PM

#

simple ore the free usage period goes down the more you use it

which is why kaggle is on top

#

lightning.ai is also cool but lower limits (as in gpu time) so boohooh

analog obsidian Oct 10, 2024, 4:20 PM

#

knotty moth imagine the graph keeps going down on above 1000 epochs for some short dataset

batch 16 in 4 minute datasets be like

#

trolley

serene horizon Oct 10, 2024, 4:21 PM

#

low shard you can: - Use an alt google account - Use kaggle which gives more gpu time but ...

I tried registering another Google account but Google doesn’t allow me.

low shard Oct 10, 2024, 4:21 PM

#

serene horizon I tried registering another Google account but Google doesn’t allow me.

You mean like it asks u for phone verification when u make another google acc?

serene horizon Oct 10, 2024, 4:22 PM

#

low shard You mean like it asks u for phone verification when u make another google acc?

Yeah.

low shard Oct 10, 2024, 4:22 PM

#

also its WAY BETTER to use Kaggle

low shard Oct 10, 2024, 4:22 PM

#

serene horizon Yeah.

you can use the Phone Gmail app to make another google acc without phone verification

#

ofc u need a phone tho

#

just like open the app, make a new acc and it will do it without needing any verification

#

but i suggest u way better to just use kaggle, its a bit harder and needs just a single phone verification but gives u 30 hours weekly (yes they refresh)

#

its WAYYY better than google colab

#

and u dont have the risk to losing ur stuff for randomly getting disconnected as 30 hours are alot for free

serene horizon Oct 10, 2024, 4:24 PM

#

low shard you can use the Phone Gmail app to make another google acc without phone verific...

I actually did register another account on my phone, but later, when I went to log in on my laptop, it asked me to verify the account with a phone number! 🤦‍♂️

analog obsidian Oct 10, 2024, 4:24 PM

#

kaggle is a bit buggy but works good when decides not to randomly end the session

rare gobletBOT Oct 10, 2024, 4:24 PM

#

Ayo? @serene horizon level 3 !!! lfg

low shard Oct 10, 2024, 4:24 PM

#

analog obsidian kaggle is a bit buggy but works good when decides not to randomly end the sessio...

since when does it randomly end the session? Never happened to me

#

be sure to use encryption

low shard Oct 10, 2024, 4:25 PM

#

serene horizon I actually did register another account on my phone, but later, when I went to l...

wtf very weird

#

u should be able to login on ur pc of the same acc made on ur phone without phone verification

#

at this point i suggest u to use kaggle or wait

#

don’t u have even just 1 phone number ?

analog obsidian Oct 10, 2024, 4:25 PM

#

low shard since when does it randomly end the session? Never happened to me

it does for me lols, everything set up correctly then randomly decides to stop the session when im downloading the dependencies
i fix this by creating another version of the notebook

low shard Oct 10, 2024, 4:26 PM

#

analog obsidian it does for me lols, everything set up correctly then randomly decides to stop t...

skill issue ngl

#

never happened to me

#

nor heard it from others

analog obsidian Oct 10, 2024, 4:26 PM

#

PandaShrug

low shard Oct 10, 2024, 4:26 PM

#

trolley

rugged solar Oct 10, 2024, 4:27 PM

#

-colab

azure marshBOT Oct 10, 2024, 4:27 PM

#

rugged solar -colab

☁️ Google Colabs

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
🆕 FaceFusion UI, by Nick088 Google Colab
🆕 FaceFusion NO UI, by Nick088 Google Colab
🆕 EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

serene horizon Oct 10, 2024, 4:27 PM

#

low shard don’t u have even just 1 phone number ?

I do, but i already used it so it won’t accept it.

low shard Oct 10, 2024, 4:28 PM

#

serene horizon I do, but i already used it so it won’t accept it.

dw, kaggle is a different service than google colab

#

use kaggle, u will be able to use that phone number

#

As you dont got a good PC, its better you use cloud (remote good pc) for training an RVC Voice Model:

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)

#

here i sent all the cloud ways, use mainline kaggle

serene horizon Oct 10, 2024, 4:29 PM

#

low shard use kaggle, u will be able to use that phone number

Okay, I’ll have a look and see if I can work it out!

low shard Oct 10, 2024, 4:29 PM

#

i sent u the guide with the kaggle too

#

click on mainline from kaggle

serene horizon Oct 10, 2024, 4:30 PM

#

low shard i sent u the guide with the kaggle too

Thanks. I’ll give it a go now.

low shard Oct 10, 2024, 4:31 PM

#

yw

languid lotus Oct 10, 2024, 4:50 PM

#

can some one tell me if this model is good or not and how to know from this pic

#

its still in 150 ecpoh

#

18 minuts of training data

prisma grove Oct 10, 2024, 4:57 PM

#

is this OT?

rare gobletBOT Oct 10, 2024, 4:57 PM

#

Ayo? @prisma grove level 3 !!! lfg

simple ore Oct 10, 2024, 4:57 PM

#

prisma grove is this OT?

what batch size?

languid lotus Oct 10, 2024, 4:58 PM

#

languid lotus can some one tell me if this model is good or not and how to know from this pic

what about this

simple ore Oct 10, 2024, 4:58 PM

#

it tells nothing without the metrics

languid lotus Oct 10, 2024, 4:58 PM

#

what are they and how do i get them

simple ore Oct 10, 2024, 4:58 PM

#

run tensorboard

#

see scalars

languid lotus Oct 10, 2024, 4:59 PM

#

simple ore Oct 10, 2024, 4:59 PM

#

use smoothing, like 0.975 or a bit more

#

metrics loss - d/total, g/total, fm, mel, kl

languid lotus Oct 10, 2024, 5:00 PM

#

rare gobletBOT Oct 10, 2024, 5:00 PM

#

Ayo? @languid lotus level 2 !!! lfg

languid lotus Oct 10, 2024, 5:00 PM

#

#

what do they say

simple ore Oct 10, 2024, 5:00 PM

#

not grad

languid lotus Oct 10, 2024, 5:01 PM

#

#

#

#

#

what do they mean

simple ore Oct 10, 2024, 5:03 PM

#

just terible

languid lotus Oct 10, 2024, 5:03 PM

#

what

#

wdym

#

how can i make it better

simple ore Oct 10, 2024, 5:03 PM

#

what is the size of ther training set?

languid lotus Oct 10, 2024, 5:03 PM

#

18 minutes

simple ore Oct 10, 2024, 5:04 PM

#

you seem to have some terrible quality then

languid lotus Oct 10, 2024, 5:04 PM

#

do you want me to send you the drive link for the audio

#

im making a billie eilish one

#

https://drive.google.com/file/d/1tQgSnHJ1isRzrTfRQX4newovWE5YJcHG/view?usp=sharing

#

bro i used very high quality wdym

#

@simple ore

simple ore Oct 10, 2024, 5:10 PM

#

did you cut out all the silence?

languid lotus Oct 10, 2024, 5:10 PM

#

yes using audacity

simple ore Oct 10, 2024, 5:11 PM

#

that and there's some slight weird band over 16khz

#

languid lotus Oct 10, 2024, 5:11 PM

#

i dont know how to do all these stuff

simple ore Oct 10, 2024, 5:11 PM

#

the model needs to have silence gaps in order to learn a separation

languid lotus Oct 10, 2024, 5:12 PM

#

so i dont trim silince

#

??

simple ore Oct 10, 2024, 5:12 PM

#

#

you dont need to

#

unless you plan to replace Eminem voice in his rap

languid lotus Oct 10, 2024, 5:13 PM

#

so if i send you the raw voice will you make me a training data

#

just to know what ive been doing wrong

simple ore Oct 10, 2024, 5:13 PM

#

just take your original interview

serene horizon Oct 10, 2024, 5:14 PM

#

simple ore unless you plan to replace Eminem voice in his rap

Why does it say in the guide to truncate silence then?

simple ore Oct 10, 2024, 5:14 PM

#

before you cut the silence gaps out

languid lotus Oct 10, 2024, 5:14 PM

#

ill try

simple ore Oct 10, 2024, 5:14 PM

#

the training specifically inject a couple of mute audios for the model to train how to reproduce silence

#

but it is only 2 3sec files

languid lotus Oct 10, 2024, 5:15 PM

#

yeah another question do i need to make my voices cut or whole

simple ore Oct 10, 2024, 5:16 PM

#

you need to have only the voice of the target person

languid lotus Oct 10, 2024, 5:16 PM

#

if i do what app do i do it with

simple ore Oct 10, 2024, 5:16 PM

#

obviously

languid lotus Oct 10, 2024, 5:16 PM

#

simple ore you need to have only the voice of the target person

no like one audio file or multipile

serene horizon Oct 10, 2024, 5:16 PM

#

simple ore the training specifically inject a couple of mute audios for the model to train ...

Is this just for Applio? What if I use it with another RVC trainer?

simple ore Oct 10, 2024, 5:17 PM

#

here's a model i'm testing right now, there's no weird jumps or craziness

#

#

#

and it is only 10 minute set

simple ore Oct 10, 2024, 5:17 PM

#

serene horizon Is this just for Applio? What if I use it with another RVC trainer?

RVC mainline does the same

languid lotus Oct 10, 2024, 5:18 PM

#

and what pre train do i use original or titan or ov2rsuper

rare gobletBOT Oct 10, 2024, 5:18 PM

#

Ayo? @languid lotus level 3 !!! lfg

simple ore Oct 10, 2024, 5:18 PM

#

I think without the silence gaps the audio becomes too complex to learn

#

I use original

serene horizon Oct 10, 2024, 5:18 PM

#

simple ore RVC mainline does the same

But not RVC disconnect in Colab?

simple ore Oct 10, 2024, 5:19 PM

#

probably the same.. all the base code should be very similar

serene horizon Oct 10, 2024, 5:19 PM

#

simple ore probably the same.. all the base code should be very similar

Okay.

simple ore Oct 10, 2024, 5:20 PM

#

but I've seen some projects where they do not include silence for some reason

languid lotus Oct 10, 2024, 5:22 PM

#

bro if i want to make a rvc model of 21 but his voice is bad when i uvr look

#

@simple ore

simple ore Oct 10, 2024, 5:26 PM

#

yeah, has some echo and another voice blended in

#

not good

languid lotus Oct 10, 2024, 5:26 PM

#

any tips to make it better

simple ore Oct 10, 2024, 5:27 PM

#

you cant unbake a cake

languid lotus Oct 10, 2024, 5:27 PM

#

thats true unfortunately

serene horizon Oct 10, 2024, 5:48 PM

#

low shard yw

I got to the training point, but when I click train, it says “error.” 🤦‍♂️

But in Kaggle, I see it working.

So weird.

prisma grove Oct 10, 2024, 5:51 PM

#

simple ore what batch size?

5

#

I left it at default

#

#

it's at epoch 266

#

should I stop it or no

simple ore Oct 10, 2024, 5:54 PM

#

5???

#

look at the other charts

prisma grove Oct 10, 2024, 5:55 PM

#

you mean this right?

simple ore Oct 10, 2024, 5:55 PM

#

that's not right

low shard Oct 10, 2024, 5:56 PM

#

serene horizon I got to the training point, but when I click train, it says “error.” 🤦‍♂️ Bu...

show the error in the kaggle output

simple ore Oct 10, 2024, 5:56 PM

#

it has very little to do with GPU memory size

prisma grove Oct 10, 2024, 5:56 PM

#

which chart

simple ore Oct 10, 2024, 5:56 PM

#

last page, with fm, mel, kl

prisma grove Oct 10, 2024, 5:57 PM

#

serene horizon Oct 10, 2024, 5:57 PM

#

low shard show the error in the kaggle output

Where would I find that?

simple ore Oct 10, 2024, 5:57 PM

#

fm metric is weird and high

prisma grove Oct 10, 2024, 5:57 PM

#

wdym

#

hold on let me check what the model sounds like rn

low shard Oct 10, 2024, 5:58 PM

#

serene horizon Where would I find that?

in the kaggle site, where it shows the output of the cell u are running

simple ore Oct 10, 2024, 5:58 PM

#

the metric is not going down/not stabilizing

#

but you can check how it sounds

serene horizon Oct 10, 2024, 5:59 PM

#

low shard in the kaggle site, where it shows the output of the cell u are running

There’s a lot of text there.

It’s currently telling me what epoch it’s on.

simple ore Oct 10, 2024, 5:59 PM

#

g/total is kinda high, but that's probably of the batch size

low shard Oct 10, 2024, 6:00 PM

#

serene horizon There’s a lot of text there. It’s currently telling me what epoch it’s on.

there should be an 'traceback' part

#

show a screenshot

analog obsidian Oct 10, 2024, 6:00 PM

#

simple ore the metric is not going down/not stabilizing

what causes fm to go up anyways? i've never seen a fm graph that goes down lol

prisma grove Oct 10, 2024, 6:01 PM

#

sounds pretty normal?

simple ore Oct 10, 2024, 6:02 PM

#

analog obsidian what causes fm to go up anyways? i've never seen a fm graph that goes down lol

depending on the learning ratio, the model may overshoot the target, so fm goes up and down

prisma grove Oct 10, 2024, 6:02 PM

#

I had models that sound like static so

simple ore Oct 10, 2024, 6:03 PM

#

large variation of data in the set may result in fm going up and down

analog obsidian Oct 10, 2024, 6:03 PM

#

simple ore depending on the learning ratio, the model may overshoot the target, so fm goes ...

do you think is good to change the learning rate everytime we train a dataset? last time we spoke about this we conclude is better to not change the default lr (which is 1e-4 i believe)

serene horizon Oct 10, 2024, 6:03 PM

#

low shard there should be an 'traceback' part

First, I’m going to try from the start again. Maybe I missed something. I

simple ore Oct 10, 2024, 6:03 PM

#

you can probbaly set it to 5e-5 (half of default), the training may take longer

prisma grove Oct 10, 2024, 6:03 PM

#

?

#

set what

analog obsidian Oct 10, 2024, 6:04 PM

#

simple ore you can probbaly set it to 5e-5 (half of default), the training may take longer

this could prevent fm to overfit faster?

simple ore Oct 10, 2024, 6:04 PM

#

possibly... also using FP32 may prevent it too

analog obsidian Oct 10, 2024, 6:04 PM

#

simple ore possibly... also using FP32 may prevent it too

thanks! i'll try it

prisma grove Oct 10, 2024, 6:05 PM

#

this is what the 150e ckpt sounds like

#

this is all with no index because I'm too lazy to grab it

simple ore Oct 10, 2024, 6:05 PM

#

technically the metrics should go down or at least stabilize due to the learning rate automatically adjusting down

analog obsidian Oct 10, 2024, 6:05 PM

#

simple ore technically the metrics should go down or at least stabilize due to the learning...

yeah all of the metrics go down EXCEPT fm for some reason

prisma grove Oct 10, 2024, 6:05 PM

#

so what does that mean 😭

analog obsidian Oct 10, 2024, 6:06 PM

#

analog obsidian yeah all of the metrics go down EXCEPT fm for some reason

ive always believe this happens because the dataset is small but im pretty sure fm goes up even in big datasets

simple ore Oct 10, 2024, 6:06 PM

#

prisma grove this is what the 150e ckpt sounds like

neshi'zzzzzz'te

#

that's what I hear

#

at 7 seconds

prisma grove Oct 10, 2024, 6:07 PM

#

simple ore neshi'zzzzzz'te

nen ne shite

#

it's genkotsu yama no tanuki san

simple ore Oct 10, 2024, 6:08 PM

#

#

this part

prisma grove Oct 10, 2024, 6:09 PM

#

I mean yes

analog obsidian Oct 10, 2024, 6:09 PM

#

prisma grove this is what the 150e ckpt sounds like

sibilances are artifacting

prisma grove Oct 10, 2024, 6:09 PM

#

the sh sounds weird

#

tf is a sibilance

analog obsidian Oct 10, 2024, 6:09 PM

#

prisma grove tf is a sibilance

SH ch K sounds

#

and S ofc

#

u can decrease the artifacting by de-essing the dataset

prisma grove Oct 10, 2024, 6:10 PM

#

it's already mostly de-essed

#

also it doesn't do that at 50e

#

or 250e

analog obsidian Oct 10, 2024, 6:10 PM

#

prisma grove also it doesn't do that at 50e

because fm is not overfitted

prisma grove Oct 10, 2024, 6:10 PM

#

what does it mean

#

what is fm

#

and what does overfitted mean

analog obsidian Oct 10, 2024, 6:11 PM

#

prisma grove what is fm

type fm in the tensorboard, is a metric

analog obsidian Oct 10, 2024, 6:11 PM

#

prisma grove and what does overfitted mean

model confuses and starts to learn the same patterns over and over again

prisma grove Oct 10, 2024, 6:11 PM

#

#

this

#

what about it

prisma grove Oct 10, 2024, 6:11 PM

#

analog obsidian model confuses and starts to learn the same patterns over and over again

so overtraining

analog obsidian Oct 10, 2024, 6:12 PM

#

prisma grove so overtraining

similar

analog obsidian Oct 10, 2024, 6:12 PM

#

prisma grove what about it

prisma grove Oct 10, 2024, 6:12 PM

#

not the same?

analog obsidian Oct 10, 2024, 6:12 PM

#

prisma grove not the same?

nop

analog obsidian Oct 10, 2024, 6:12 PM

#

analog obsidian

it started to fluctuate here

#

so any epoch in that zone have a big chance of having broken S sounds

#

e50 probably is before that so is not doing it

prisma grove Oct 10, 2024, 6:13 PM

#

isn't ot determined by the loss/g/total metric thing

analog obsidian Oct 10, 2024, 6:13 PM

#

prisma grove isn't ot determined by the loss/g/total metric thing

every metric does something
g total is your average of:
fm
mel
kl

#

g total raising means your model start to degrade and overtrain

prisma grove Oct 10, 2024, 6:14 PM

#

it's not raising

analog obsidian Oct 10, 2024, 6:14 PM

#

fm going up means model overfitted the dataset features like sibilances

prisma grove Oct 10, 2024, 6:14 PM

#

rare gobletBOT Oct 10, 2024, 6:14 PM

#

Ayo? @prisma grove level 4 !!! lfg

analog obsidian Oct 10, 2024, 6:14 PM

#

prisma grove

yeah its fine thats what i was talking about
fm metric usually overfits very fast

prisma grove Oct 10, 2024, 6:14 PM

#

so what am I supposed to do

analog obsidian Oct 10, 2024, 6:15 PM

#

prisma grove so what am I supposed to do

choose the lowest point in g/total

#

if u don't have the exact epoch choose the closest

prisma grove Oct 10, 2024, 6:15 PM

#

it's still decreasing wdym

#

the lowest point is the latest epoch

analog obsidian Oct 10, 2024, 6:15 PM

#

prisma grove it's still decreasing wdym

show me g/total smoothing 0 and ignore scalars off

prisma grove Oct 10, 2024, 6:16 PM

#

analog obsidian Oct 10, 2024, 6:16 PM

#

prisma grove

#

try that epoch

#

that is your lowest point

prisma grove Oct 10, 2024, 6:17 PM

#

how do I know which one that is

analog obsidian Oct 10, 2024, 6:17 PM

#

prisma grove how do I know which one that is

hover your cursor in that low point

polar plaza Oct 10, 2024, 6:17 PM

#

#

Bruh

prisma grove Oct 10, 2024, 6:17 PM

#

analog obsidian hover your cursor in that low point

and?

analog obsidian Oct 10, 2024, 6:17 PM

#

prisma grove and?

check the step number

#

then find the epoch that is that step number

prisma grove Oct 10, 2024, 6:17 PM

#

14.4

analog obsidian Oct 10, 2024, 6:17 PM

#

or the most closest to that step number

analog obsidian Oct 10, 2024, 6:18 PM

#

prisma grove 14.4

so you need epoch step number 14.400k

simple ore Oct 10, 2024, 6:18 PM

#

checked my logs

prisma grove Oct 10, 2024, 6:18 PM

#

so that's epoch 150 😐

simple ore Oct 10, 2024, 6:18 PM

#

analog obsidian Oct 10, 2024, 6:18 PM

#

simple ore

this is default lr fp16?

prisma grove Oct 10, 2024, 6:18 PM

#

the one you said sounds bad

analog obsidian Oct 10, 2024, 6:19 PM

#

prisma grove the one you said sounds bad

yep that is your correct epoch and has broken sibilance

simple ore Oct 10, 2024, 6:19 PM

#

i dont remember, that was a small test set

analog obsidian Oct 10, 2024, 6:19 PM

#

it doesn't sound bad to me

prisma grove Oct 10, 2024, 6:19 PM

#

analog obsidian yep that is your correct epoch and has broken sibilance

https://tenor.com/view/math-calculate-confusing-figure-out-gif-6237717

#

the epoch that's the best is the most broken one ??

analog obsidian Oct 10, 2024, 6:20 PM

#

prisma grove the epoch that's the best is the most broken one ??

basically

prisma grove Oct 10, 2024, 6:20 PM

#

tf you mean

analog obsidian Oct 10, 2024, 6:20 PM

#

prisma grove tf you mean

is not bad, just the sibilance are artifacting

prisma grove Oct 10, 2024, 6:20 PM

#

and that makes it worse

#

no?

analog obsidian Oct 10, 2024, 6:21 PM

#

prisma grove no?

nah, artifacting happens randomly

#

dont worry

#

is a usuable model

prisma grove Oct 10, 2024, 6:21 PM

#

I don't want it to be usable I want it to be good

analog obsidian Oct 10, 2024, 6:22 PM

#

prisma grove I don't want it to be *usable* I want it to be good

hmm try this epoch and see if the sibilance are artifacting

#

or if they got better

prisma grove Oct 10, 2024, 6:23 PM

#

that's step 8.8k, the closest I have is 9400

#

e100

analog obsidian Oct 10, 2024, 6:23 PM

#

prisma grove e100

try that

analog obsidian Oct 10, 2024, 6:24 PM

#

simple ore i dont remember, that was a small test set

even in smaller datasets (5 minutes) fm should go down alongside g/total?

prisma grove Oct 10, 2024, 6:24 PM

#

analog obsidian Oct 10, 2024, 6:25 PM

#

prisma grove

way better

simple ore Oct 10, 2024, 6:25 PM

#

dunno about 5 minutes, that's barely enough

prisma grove Oct 10, 2024, 6:25 PM

#

so the calculator thing was right

#

?

simple ore Oct 10, 2024, 6:25 PM

#

my small set was like 12 min?

#

the result was not good anyway

analog obsidian Oct 10, 2024, 6:25 PM

#

simple ore dunno about 5 minutes, that's barely enough

yea im aware, i don't have a 10 minute dataset rn lol

prisma grove Oct 10, 2024, 6:25 PM

#

https://www.desmos.com/calculator/yeqx4dmcfm?lang=pl

#

it says for ~20 minute dataset you do ~100 epochs

analog obsidian Oct 10, 2024, 6:26 PM

#

prisma grove so the calculator thing was right

don't use this, just use tensorboard, more easy

prisma grove Oct 10, 2024, 6:26 PM

#

yeah easy af man

analog obsidian Oct 10, 2024, 6:26 PM

#

smoothing 0 and scalars off helps u choosing low points

simple ore Oct 10, 2024, 6:26 PM

#

well, 20 min / 100 epochs is about right, +50 maybe

analog obsidian Oct 10, 2024, 6:26 PM

#

analog obsidian smoothing 0 and scalars off helps u choosing low points

like what we did now

simple ore Oct 10, 2024, 6:26 PM

#

all depends on the content

prisma grove Oct 10, 2024, 6:26 PM

#

simple ore well, 20 min / 100 epochs is about right, +50 maybe

BUT 150 WAS BROKEN

#

analog obsidian Oct 10, 2024, 6:27 PM

#

yeah epoch 100 sounds fine to me

prisma grove Oct 10, 2024, 6:27 PM

#

boohooh

simple ore Oct 10, 2024, 6:27 PM

#

you used batch 5 instead of 4 like a weirdo

analog obsidian Oct 10, 2024, 6:27 PM

#

simple ore you used batch 5 instead of 4 like a weirdo

wasnt batch 5? lol

simple ore Oct 10, 2024, 6:27 PM

#

fixed

analog obsidian Oct 10, 2024, 6:27 PM

#

i notice the breathings are robotic in every epoch, probably the dataset lacked breaths

prisma grove Oct 10, 2024, 6:27 PM

#

simple ore you used batch 5 instead of 4 like a weirdo

THAT'S THE DEFAULT

prisma grove Oct 10, 2024, 6:28 PM

#

analog obsidian i notice the breathings are robotic in every epoch, probably the dataset lacked ...

YES BECAUSE IT'S MIKU

#

MIKU IS A ROBOT

#

FFS

analog obsidian Oct 10, 2024, 6:28 PM

#

prisma grove YES BECAUSE IT'S MIKU

miku has breaths

prisma grove Oct 10, 2024, 6:28 PM

#

WHICH ARE ROBOTIC

analog obsidian Oct 10, 2024, 6:28 PM

#

it was br1 in vocaloid iirc

simple ore Oct 10, 2024, 6:29 PM

#

prisma grove # THAT'S THE DEFAULT

and it is not right

analog obsidian Oct 10, 2024, 6:30 PM

#

simple ore and it is not right

nono, miku breath samples in vocaloid sounds like that

#

she is one of the voicebanks that has broken breathing samples

simple ore Oct 10, 2024, 6:30 PM

#

i mean batch being 5 as default

analog obsidian Oct 10, 2024, 6:30 PM

#

simple ore i mean batch being 5 as default

ah yeah

prisma grove Oct 10, 2024, 6:30 PM

#

why does it even matter? what does batch size even mean?

simple ore Oct 10, 2024, 6:31 PM

#

how many random sets of samples it trains in parallel

analog obsidian Oct 10, 2024, 6:31 PM

#

prisma grove why does it even matter? what does batch size even mean?

this is how rvc is gonna learn ur dataset
a bad batch size can lead to unstable training and potentially bad outcomes

prisma grove Oct 10, 2024, 6:31 PM

#

so it has to be 4?

simple ore Oct 10, 2024, 6:31 PM

#

for under 1hour use 4

analog obsidian Oct 10, 2024, 6:31 PM

#

yeah use 4

simple ore Oct 10, 2024, 6:31 PM

#

there's no speed benefit in using more

prisma grove Oct 10, 2024, 6:32 PM

#

who makes 1hr datasets

analog obsidian Oct 10, 2024, 6:32 PM

#

trolley

#

good question

prisma grove Oct 10, 2024, 6:32 PM

#

you can make decent voices with 5 minutes, I thought my 19 minutes was overkill

analog obsidian Oct 10, 2024, 6:33 PM

#

welp model quality is tied with dataset quality, so a 5 minute high quality dataset is gonna sound high quality, just unnatural compared to bigger datasets

prisma grove Oct 10, 2024, 6:33 PM

#

miku is not gonna sound natural ever 😭

analog obsidian Oct 10, 2024, 6:34 PM

#

prisma grove miku is not gonna sound natural ever 😭

miku should never sound natural trolley

prisma grove Oct 10, 2024, 6:34 PM

#

wdym by unnatural anyway

#

like with the s's?

low shard Oct 10, 2024, 6:34 PM

#

serene horizon First, I’m going to try from the start again. Maybe I missed something. I

alright

analog obsidian Oct 10, 2024, 6:34 PM

#

prisma grove like with the s's?

yeah and that smaller datasets are always gonna sound like a rvc model rather than a real human

prisma grove Oct 10, 2024, 6:35 PM

#

what do you consider a small dataset

low shard Oct 10, 2024, 6:35 PM

#

polar plaza

refresh it

prisma grove Oct 10, 2024, 6:35 PM

#

under 10 minutes? under 30? under an hour 😭 ??

analog obsidian Oct 10, 2024, 6:35 PM

#

prisma grove under 10 minutes? under 30? under an hour 😭 ??

under 30

#

it starts to get realistic at over 30 minutes

prisma grove Oct 10, 2024, 6:36 PM

#

how would that affect miku

analog obsidian Oct 10, 2024, 6:36 PM

#

prisma grove how would that affect miku

if u add more miku samples, she's just gonna sound like her exported vocaloid vocals

#

like no one is gonna tell is rvc

prisma grove Oct 10, 2024, 6:36 PM

#

so worse?

analog obsidian Oct 10, 2024, 6:37 PM

#

prisma grove so worse?

nope, good
you can't make her realistic with more minutes, you can only make her sound if the vocals were made in vocaloid rather than rvc

#

(which is why people prefer to just use vocaloid and not rvc)

#

for miku

prisma grove Oct 10, 2024, 6:38 PM

#

I want to make realistic miku

simple ore Oct 10, 2024, 6:38 PM

#

hmm... i wonder

#

gimme a sec

prisma grove Oct 10, 2024, 6:38 PM

#

cause it's also a matter of tuning, note transitions and all

analog obsidian Oct 10, 2024, 6:38 PM

#

prisma grove I want to make *realistic miku*

technically she can sound more realistic because with more minutes rvc can do more precise pitch changes

#

she's not gonna sound like a human but rather like a very well tuned vocaloid exported

prisma grove Oct 10, 2024, 6:39 PM

#

hm

rare gobletBOT Oct 10, 2024, 6:39 PM

#

Ayo? @prisma grove level 5 !!! lfg

prisma grove Oct 10, 2024, 6:39 PM

#

so how much should I do

#

30 minutes?

analog obsidian Oct 10, 2024, 6:39 PM

#

prisma grove 30 minutes?

30 yeah

prisma grove Oct 10, 2024, 6:39 PM

#

okay then

#

I'll try

analog obsidian Oct 10, 2024, 6:40 PM

#

prisma grove I'll try

your miku samples are her singing random vsqx files?

simple ore Oct 10, 2024, 6:40 PM

#

is there a source of this Miki voice?

analog obsidian Oct 10, 2024, 6:40 PM

#

simple ore is there a source of this Miki voice?

she cost real money (unless.... yw)

simple ore Oct 10, 2024, 6:40 PM

#

i just need 30 seconds

analog obsidian Oct 10, 2024, 6:41 PM

#

simple ore i just need 30 seconds

i dont have her installed srry, but if u want any audio then check this https://www.youtube.com/watch?v=swqbfMh467A

YouTube

googoo888

[30fps Full風] Romeo and Cinderella ロミオとシンデレラ -Hatsune Miku 初音ミク DIV...

Remake→https://www.youtube.com/watch?v=naQjypGoOHY
Cool Miku!(｀・ω・´)/ｸｰﾙﾐｸ!
ｵﾌｨｼｬﾙ Official Song: https://www.youtube.com/watch?v=kp-plPYAPq8
ｵﾌｨｼｬﾙ Official Channel: https://www.youtube.com/channel/UC9z_ByomNk9DafVViv5sKkA/featured
SEGA Project DIVA Channel: https://www.youtube.com/channel/UC6FTMCuI9X2ggdMiKFpRukw

( doriko VOCALOID ボーカロイドボカロ...

▶ Play video

prisma grove Oct 10, 2024, 6:41 PM

#

analog obsidian your miku samples are her singing random vsqx files?

random vsq/vsqx/vpr with different voicebanks (v3, english, v4, chinese) and also stuff ripped from project diva megamix, de-reverb'ed if needed

analog obsidian Oct 10, 2024, 6:42 PM

#

prisma grove random vsq/vsqx/vpr with different voicebanks (v3, english, v4, chinese) and als...

only use her japanese one

prisma grove Oct 10, 2024, 6:42 PM

#

y

simple ore Oct 10, 2024, 6:42 PM

#

gime me a wav

#

or mp3

analog obsidian Oct 10, 2024, 6:42 PM

#

prisma grove y

use actual vocaloid exports and nothing isolated

prisma grove Oct 10, 2024, 6:42 PM

#

I can't do tuning

#

so I wanted to include other people's tuning too

analog obsidian Oct 10, 2024, 6:43 PM

#

prisma grove I can't do tuning

don't worry just let the default tuning

#

anything that comes with the vsqx

prisma grove Oct 10, 2024, 6:43 PM

#

but why only japanese

analog obsidian Oct 10, 2024, 6:43 PM

#

she'll be able to sing any language in rvc despite being trained only in japanese

prisma grove Oct 10, 2024, 6:43 PM

#

no

analog obsidian Oct 10, 2024, 6:43 PM

#

yep

prisma grove Oct 10, 2024, 6:44 PM

#

I mean yes but some sounds are gonna sound wrong

#

like the english "r"

simple ore Oct 10, 2024, 6:44 PM

#

analog obsidian Oct 10, 2024, 6:44 PM

#

prisma grove I mean yes but some sounds are gonna sound wrong

you can do 2 rvc models, one with her japanese model and the other with the english one

prisma grove Oct 10, 2024, 6:44 PM

#

what 4

analog obsidian Oct 10, 2024, 6:44 PM

#

simple ore

bootleg miku

prisma grove Oct 10, 2024, 6:44 PM

#

why can't I just stuff it together

analog obsidian Oct 10, 2024, 6:45 PM

#

prisma grove why can't I just stuff it together

rvc learns better if there's consistency

simple ore Oct 10, 2024, 6:45 PM

#

quality is a bad because I used @prisma grove's song

analog obsidian Oct 10, 2024, 6:45 PM

#

so if every sample is in japanese is better

#

than japanese + english + chinese

prisma grove Oct 10, 2024, 6:45 PM

#

it is consistent because it's still miku

analog obsidian Oct 10, 2024, 6:45 PM

#

prisma grove it is consistent because it's still miku

yes but pronunciation changes

#

its up to you anyways

prisma grove Oct 10, 2024, 6:46 PM

#

wouldn't that be good? make it able to pronounce more stuff accurately?

analog obsidian Oct 10, 2024, 6:46 PM

#

prisma grove wouldn't that be good? make it able to pronounce more stuff accurately?

yes but is also going to get confused sometimes

#

and pronunciation might be worse

prisma grove Oct 10, 2024, 6:46 PM

#

how would it get confused?

#

it's gonna pronounce stuff like the source audio does

analog obsidian Oct 10, 2024, 6:46 PM

#

prisma grove how would it get confused?

is not going to know correctly which pronunciation will use

prisma grove Oct 10, 2024, 6:47 PM

#

if the source audio pronounces it right then it will too, no?

analog obsidian Oct 10, 2024, 6:47 PM

#

prisma grove if the source audio pronounces it right then it will too, no?

hit or miss

#

but higher chance of better pronunciation if all of the dataset has the same language

#

and that the inferenced audio is also the same langauge present in the dataset

prisma grove Oct 10, 2024, 6:48 PM

#

well it's still 90% japanese

analog obsidian Oct 10, 2024, 6:48 PM

#

prisma grove well it's still 90% japanese

so with this info you know the model is better at japanese than the rest

#

and also the model has a bias of japanese pronunciation

prisma grove Oct 10, 2024, 6:49 PM

#

doesn't rvc work by just picking whatever's the closest

#

I don't get it

analog obsidian Oct 10, 2024, 6:50 PM

#

prisma grove doesn't rvc work by just picking whatever's the closest

index 0 can help avoiding this bias

#

but not always

simple ore Oct 10, 2024, 6:50 PM

#

index blends original features from the audio and features of the voice model

#

by mapping original to voice model

analog obsidian Oct 10, 2024, 6:51 PM

#

if she has a bias of prononucing "la" like "ra" then she has a 99% chance of doing this even with index 0

simple ore Oct 10, 2024, 6:51 PM

#

um.. no

analog obsidian Oct 10, 2024, 6:51 PM

#

index just forces that to always happen

#

PandaShrug

simple ore Oct 10, 2024, 6:51 PM

#

that's not how the index works

analog obsidian Oct 10, 2024, 6:51 PM

#

simple ore that's not how the index works

doesnt force the dataset samples over the audio features?

#

tbh no one explained what index is

prisma grove Oct 10, 2024, 6:52 PM

#

oh my god

#

if the model is 90% japanese

analog obsidian Oct 10, 2024, 6:52 PM

#

bro just use your model if u like it

prisma grove Oct 10, 2024, 6:52 PM

#

then if you convert an audio of someone saying the word "Carrot", it's not gonna suddenly say "カロット"

analog obsidian Oct 10, 2024, 6:52 PM

#

already sounds good for me

analog obsidian Oct 10, 2024, 6:52 PM

#

prisma grove then if you convert an audio of someone saying the word "Carrot", it's not gonna...

ofc no

#

try inferencing more audio and see if u like the results

#

at the end of the day what matters is if you like the model

simple ore Oct 10, 2024, 6:53 PM

#

again, that's now how index works.... it take source audio feature, tried to look up something close enough from the voice model features

#

then it blends original and voice in selected ratio

analog obsidian Oct 10, 2024, 6:54 PM

#

simple ore again, that's now how index works.... it take source audio feature, tried to loo...

finally someone explains what index is lols

simple ore Oct 10, 2024, 6:54 PM

#

english audio + french speaker at 0 index has minimal accent, the accent comes in full force when you use index 1

analog obsidian Oct 10, 2024, 6:54 PM

#

most of the stuff i learned was from trial and error

simple ore Oct 10, 2024, 6:54 PM

#

#🔊│ai-development message

#

there you can check

#

I made a test

analog obsidian Oct 10, 2024, 6:55 PM

#

simple ore english audio + french speaker at 0 index has minimal accent, the accent comes i...

yea same i noticed this too

analog obsidian Oct 10, 2024, 6:55 PM

#

simple ore I made a test

i see makes sense now
sucks that when we are starting doing rvc models there's no info about anything in the internet

#

how are we gonna know what the metrics are? you go to the official rvc github site and there's nothing that tells u what even g total is

#

😭 its like only a couple of people actually know how this thing actually works

prisma grove Oct 10, 2024, 6:57 PM

#

this is the 90% japanese 100e model

#

it's pronouncing it better than I can with my shitty polish accent lol

simple ore Oct 10, 2024, 6:57 PM

#

dude./.. give me some good wav you're using for training

analog obsidian Oct 10, 2024, 6:58 PM

#

xD

prisma grove Oct 10, 2024, 6:59 PM

#

like, you want a wav of miku's singing converted through miku rvc?

analog obsidian Oct 10, 2024, 6:59 PM

#

prisma grove like, you want a wav of miku's singing converted through miku rvc?

he wants a sample of your dataset

#

any wav

#

an audio that u used for training

#✨│ai-help

Google Colabs

Google Colabs

Google Colabs

Local Forks 🖥️

AI HUB Docs

🍏 Applio Docs

How To Troubleshoot

How To Troubleshoot

Local Forks 🖥️

Settings for Nvidia GPUs

Advanced Settings

Settings for AMD GPUs

Advanced Settings

THAT'S THE DEFAULT