oblique viper May 26, 2025, 3:41 PM

#

do they have a more up to date guide somewhere?

analog obsidian May 26, 2025, 3:42 PM

#

oblique viper do they have a more up to date guide somewhere?

no idea but most probably, no, tho i can explain some things to you really quick
so applio has this avg_50 graphs, those are already smoothed by default, so to correctly read them you have to set your smoothing to 0.5 in the tensorboard site

#

you choose batch size depending on how big is your dataset

#

there are two slicing methods for datasets, simple mode, and automatic mode
Automatic mode is the same slicer of mainline
simple mode slices every 3s (by default), it doesn't take silence into account, so you have to remove silence in audacity using truncate silence

oblique viper May 26, 2025, 3:44 PM

#

I ran with the default settings and reached around 200 epochs with my model, I usually get caught by the errors that come from doing things that are said in the guide

analog obsidian May 26, 2025, 3:45 PM

#

ah i have no idea about colab specific errors, i gave up on them, too many errors

oblique viper May 26, 2025, 3:45 PM

#

ikr

analog obsidian May 26, 2025, 3:45 PM

#

noobies is one of the people maintaining the colabs

#

he knows more

oblique viper May 26, 2025, 3:45 PM

#

I tried to get zluda working because I am unfortunate enough to have 6700XT but that was a whole world of errors in itself, worse than colab

analog obsidian May 26, 2025, 3:46 PM

#

ive heard zluda training speeds are extremely slow so it's better to use a cloud solution anyway

crude flame May 26, 2025, 3:47 PM

#

oblique viper I tried to get zluda working because I am unfortunate enough to have 6700XT but ...

twins i also got a 6700xt

analog obsidian May 26, 2025, 3:47 PM

#

misc_lets_fucking_go

#

AMD

oblique viper May 26, 2025, 3:48 PM

#

crude flame twins i also got a 6700xt

my brother

crude flame May 26, 2025, 3:48 PM

#

this might be flux or placebo but i feel like amd gpus train models weird and give bad models

analog obsidian May 26, 2025, 3:49 PM

#

yt_nails

crude flame May 26, 2025, 3:49 PM

#

like i could compare my model trained locally and a model trained on a nvidia gpu and even with everything being the same the amd one sounds worse

oblique viper May 26, 2025, 3:49 PM

#

analog obsidian noobies is one of the people maintaining the colabs

he's super knowledgeable but the image I attached is a big problem for me trying to get help from him 😔

analog obsidian May 26, 2025, 3:49 PM

#

oblique viper he's super knowledgeable but the image I attached is a big problem for me trying...

thats literally him lmaoo

crude flame May 26, 2025, 3:49 PM

#

LOL WHY IS THAT SO ACCURATE

simple ore May 26, 2025, 3:59 PM

#

after you beat you head all sunday evening against a desk because someone cant follow basic instructions...

oblique viper May 26, 2025, 3:59 PM

#

simple ore after you beat you head all sunday evening against a desk because someone cant f...

most of my problems that I've been having are because of following ~~outdated~~ basic instructions...

simple ore May 26, 2025, 3:59 PM

#

analog obsidian ive heard zluda training speeds are extremely slow so it's better to use a cloud...

with 6700xt is it faster then colab

#

unfortunately for 20 people who follow the instructions and get the results there's someone who skips steps and misreads everything

#

I blame the ipad generation

analog obsidian May 26, 2025, 4:01 PM

#

simple ore with 6700xt is it faster then colab

oh thats great, T4 gpu is sooo ancient

simple ore May 26, 2025, 4:01 PM

#

nobody teaches the computer basics in school any more

cobalt carbon May 26, 2025, 4:03 PM

#

in cs2 voice is not working

#

i changed the mic

oblique viper May 26, 2025, 4:03 PM

#

out of the 5+ AI tools that I've used for various reasons, this is the only one that I had to bash my head against so much

#

and I've used Applio back when the guide was still not outdated, back then everything went smoothly too

hallow thistle May 26, 2025, 4:04 PM

#

If the system tells you "Pytorch is damaged", it indicates that Mac has flagged the W-Okada as malware, which is a false positive. For a solution, open a Terminal and follow in this guide. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#opening-on-mac

Deiteris' W Okada Fork

Last update: May 5, 2025

oblique viper May 26, 2025, 4:05 PM

#

simple ore nobody teaches the computer basics in school any more

maybe you don't have to assume that beginners have degrees in computer science and expert knowledge in AI

#

and me being a beginner, the average joe would struggle even more than I am

hallow thistle May 26, 2025, 4:07 PM

#

This is where to discuss about the program issue, not showing off your ego. cat_seriously

simple ore May 26, 2025, 4:08 PM

#

yeah, I dont know how to drive a push cart, lemme drive mclaren f1

#

great approarch for AI

oblique viper May 26, 2025, 4:09 PM

#

isn't this discord server and the guide meant to make this more accessible for beginners?

hallow thistle May 26, 2025, 4:09 PM

#

Don't take what everyone here says about you seriously, it ain't that deep.

cobalt carbon May 26, 2025, 4:09 PM

#

@low shard can u help me

crude flame May 26, 2025, 4:09 PM

#

oblique viper isn't this discord server and the guide meant to make this more accessible for b...

what is your issue with the colab? i just tried it and it worked

analog obsidian May 26, 2025, 4:10 PM

#

crude flame what is your issue with the colab? i just tried it and it worked

the ui one?

simple ore May 26, 2025, 4:10 PM

#

crude flame what is your issue with the colab? i just tried it and it worked

well, he tried crepe with hop 70.. and 1

crude flame May 26, 2025, 4:10 PM

#

analog obsidian the ui one?

yea

oblique viper May 26, 2025, 4:10 PM

#

crude flame what is your issue with the colab? i just tried it and it worked

the UI one doesn't load backups properly for some reason

hallow thistle May 26, 2025, 4:10 PM

#

Creepy.

hallow thistle May 26, 2025, 4:10 PM

#

cobalt carbon <@911742715019001897> can u help me

!howtoask

patent trellisBOT May 26, 2025, 4:10 PM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

cobalt carbon May 26, 2025, 4:10 PM

#

hallow thistle !howtoask

bro are u blind

cobalt carbon May 26, 2025, 4:11 PM

#

cobalt carbon in cs2 voice is not working

.

analog obsidian May 26, 2025, 4:11 PM

#

don't use crepe with a hop different than 160

#

tho i would just use rmvpe

oblique viper May 26, 2025, 4:11 PM

#

simple ore well, he tried crepe with hop 70.. and 1

well sorry for reading the tiny piece of text that told me decreasing it would make the pitch better at the cost of longer time (even though having hop at the default didn't help either)

crude flame May 26, 2025, 4:11 PM

#

oblique viper the UI one doesn't load backups properly for some reason

assuming you have your drive mounted do you have your model in ApplioBackup/modelname

analog obsidian May 26, 2025, 4:12 PM

#

oblique viper well sorry for reading the tiny piece of text that told me decreasing it would m...

yea that was an old misconception
back then people had no idea about anything rvc related

hallow thistle May 26, 2025, 4:12 PM

#

cobalt carbon bro are u blind

If I say yes, would you believe me? Ok, I know you are talking about W-Okada the realtime voice changer, but what you elaborated is too less.

analog obsidian May 26, 2025, 4:12 PM

#

mangio-crepe was a silly idea

oblique viper May 26, 2025, 4:12 PM

#

analog obsidian yea that was an old misconception back then people had no idea about anything rv...

and the text is still there.. to this day

analog obsidian May 26, 2025, 4:13 PM

#

oblique viper and the text is still there.. to this day

damn

crude flame May 26, 2025, 4:13 PM

#

crude flame assuming you have your drive mounted do you have your model in ApplioBackup/mode...

If you dont want to mount your drive you can create a folder in colab called 'drive' then another folder in it called 'MyDrive' then create the 'ApplioBackup' and model name folders

analog obsidian May 26, 2025, 4:13 PM

#

even if u use crepe with a hop of 160, rmvpe is still better

crude flame May 26, 2025, 4:14 PM

#

oblique viper and the text is still there.. to this day

in the aihub docs or applio docs? if its the aihub docs please please let me know of any issues

#

i take care of the ai hub docs now so ye

oblique viper May 26, 2025, 4:14 PM

#

crude flame assuming you have your drive mounted do you have your model in ApplioBackup/mode...

yeah I have my drive mounted, and it did make an Applio folder with the model in it after I loaded backup, but when I tried to run Applio it gave out an error

oblique viper May 26, 2025, 4:14 PM

#

crude flame in the aihub docs or applio docs? if its the aihub docs please please let me kno...

it's in the Applio thing where you extract and all that

analog obsidian May 26, 2025, 4:15 PM

#

crude flame i take care of the ai hub docs now so ye

things have gotten more complicated than before imo
like, how could we explain that the loss graphs don't help in choosing a good epoch??

#

they're there so you can monitor irregularities

#

or that g/total is innacurate

oblique viper May 26, 2025, 4:17 PM

#

analog obsidian or that g/total is innacurate

that's new knowledge to me

crude flame May 26, 2025, 4:17 PM

#

analog obsidian things have gotten more complicated than before imo like, how could we explain t...

you say that

"The tensorboard is for monitoring for any irregularities any issues. Do not depend on the tensorboard to find your best sounding epoch"

analog obsidian May 26, 2025, 4:17 PM

#

oblique viper that's new knowledge to me

applio has a new loss named gen adv which is a bit more accurate than g/total

#

but that is only in a specific branch

#

and even that doesn't help you in choosing an epoch

#

the loss graphs are really a bit useless

#

best metric is to hear your model

crude flame May 26, 2025, 4:18 PM

#

analog obsidian or that g/total is innacurate

mention how g/total is the combined loss of all the other losses

analog obsidian May 26, 2025, 4:18 PM

#

crude flame mention how g/total is the combined loss of all the other losses

yet still innacurate

#

u can see adv gen loss going up yet g/total may still go down

oblique viper May 26, 2025, 4:19 PM

#

I think a good step to help beginners is to have a strong suggestion that auto backups be turned on in the Applio colab tab, as I couldn't find any mention of turning auto backup on

Since it was in extras I assumed it's not necessary and I learned that the hard way when my colab ran out of GPU resources

crude flame May 26, 2025, 4:19 PM

#

analog obsidian u can see adv gen loss going up yet g/total may still go down

thats bec mel and kl are still going down

#

mel carries g/total iirc

analog obsidian May 26, 2025, 4:19 PM

#

crude flame thats bec mel and kl are still going down

these always go down

#

only time they go up is if u try silly stuff like loss balancer

crude flame May 26, 2025, 4:19 PM

#

analog obsidian only time they go up is if u try silly stuff like loss balancer

tehe

analog obsidian May 26, 2025, 4:19 PM

#

cat_dance

crude flame May 26, 2025, 4:20 PM

#

cat_blush

hallow thistle May 26, 2025, 4:20 PM

#

Texts blur when I look closely to them. How am I thar blind? It's more like I lose focus on a small topic too easily, especially when there's an ongoing bigger topic in chat or channel. cat_wtf

analog obsidian May 26, 2025, 4:20 PM

#

oblique viper I think a good step to help beginners is to have a strong suggestion that auto b...

most stuff needs to be updated to be honest, a lot of things are outdated

crude flame May 26, 2025, 4:20 PM

#

anyway i havent updated the docs on the new logging stuff bec it isnt in the mainline branch of applio

simple ore May 26, 2025, 4:21 PM

#

btw, removed hop length for crepe from UI in exp/f0 branch

analog obsidian May 26, 2025, 4:21 PM

#

google doesn't really like local ai stuff so they don't care if a random update kills ai training/infer
they just want you to use their ai instead

analog obsidian May 26, 2025, 4:21 PM

#

simple ore btw, removed hop length for crepe from UI in exp/f0 branch

:0 nice!

#

the colabs being buggy isn't really applio guys fault but more like google trying to savotage everything

#

kaggle is another option but imo is way more broken than colab (they also hate any deep fake related ai stuff)

cobalt carbon May 26, 2025, 4:24 PM

#

Can someone help me ? in cs rvc doesnt work what can i do ? i changed the input

oblique viper May 26, 2025, 4:24 PM

#

@analog obsidian what should I do step by step in the no UI colab to continue training on my ApplioBackup? I've been using the UI colab this whole time so I'm a bit confused

hallow thistle May 26, 2025, 4:24 PM

#

cobalt carbon Can someone help me ? in cs rvc doesnt work what can i do ? i changed the input

Do I need to remind you again?

#

W-Okada not working with Counter Strike 2 can occur with several reasons, like using an older and original version of W-Okada, VB-Cable and Voicemeeter seem to cause issue when using them with W-Okada on Windows, and your microphone.

analog obsidian May 26, 2025, 4:25 PM

#

oblique viper <@775545133448953856> what should I do step by step in the no UI colab to contin...

sorry i don't use it, i train locally

cobalt carbon May 26, 2025, 4:25 PM

#

hallow thistle W-Okada not working with Counter Strike 2 can occur with several reasons, like u...

i didnt ask to you

#

pls shut up

hallow thistle May 26, 2025, 4:25 PM

#

cobalt carbon i didnt ask to you

Then it's not my job to help you either, if you continue being a dick against me so.

cobalt carbon May 26, 2025, 4:26 PM

#

u didnt help me so no need your help

#

👍

oblique viper May 26, 2025, 4:26 PM

#

cobalt carbon i didnt ask to you

"can someone help me"
"i didnt ask you"

cobalt carbon May 26, 2025, 4:26 PM

#

oblique viper - "can someone help me" - "i didnt ask you"

womp womp ?

crude flame May 26, 2025, 4:26 PM

#

💀

hallow thistle May 26, 2025, 4:27 PM

#

oblique viper - "can someone help me" - "i didnt ask you"

Ignore him. He thinks he can help himself for that.

cobalt carbon May 26, 2025, 4:27 PM

#

hallow thistle W-Okada not working with Counter Strike 2 can occur with several reasons, like u...

im using the fork version

#

im not using voicemeter or anything else

oblique viper May 26, 2025, 4:28 PM

#

anyone who knows how the no UI colab works can help me figure out step by step on how to continue training on my ApplioBackup? I've been using the UI colab this whole time so I'm a bit confused

hallow thistle May 26, 2025, 4:29 PM

#

cobalt carbon im using the fork version

karinthink

analog obsidian May 26, 2025, 4:29 PM

#

oblique viper anyone who knows how the no UI colab works can help me figure out step by step o...

i think it works the same as the ui colab

#

cobalt carbon May 26, 2025, 4:29 PM

#

i tried to run it on steam web shift tab

#

it opens ui but browser doesnt have a mic perm

#

so it doesnt work with it too

analog obsidian May 26, 2025, 4:30 PM

#

cobalt carbon so it doesnt work with it too

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

#

read the guide

hallow thistle May 26, 2025, 4:31 PM

#

Mate, you said you didn't ask to me. Why did you switch up that fast?

cobalt carbon May 26, 2025, 4:31 PM

#

analog obsidian read the guide

i did it

analog obsidian May 26, 2025, 4:31 PM

#

cobalt carbon i did it

you got virtual audio cable?

cobalt carbon May 26, 2025, 4:31 PM

#

i did the same settings

#

ye bro

#

its working on dc

#

but not on cs

#

i think browser goes sleep mode when on cs

analog obsidian May 26, 2025, 4:32 PM

#

does cs allows you to choose which mic you wanna use in game?

cobalt carbon May 26, 2025, 4:32 PM

#

i dont know how to solve this

cobalt carbon May 26, 2025, 4:32 PM

#

analog obsidian does cs allows you to choose which mic you wanna use in game?

ye

#

i did the settings

analog obsidian May 26, 2025, 4:32 PM

#

hmm weird

cobalt carbon May 26, 2025, 4:32 PM

#

when i alt tab it works again

analog obsidian May 26, 2025, 4:32 PM

#

but even if the gui is frozen, the actual program is running in the cmd window

cobalt carbon May 26, 2025, 4:32 PM

#

but in game it does not work

analog obsidian May 26, 2025, 4:35 PM

#

see if restarting the voice changer fixes it

hallow thistle May 26, 2025, 4:35 PM

#

https://tenor.com/view/i-saw-what-you-deleted-cat-gif-16534979835682612734

Tenor

analog obsidian May 26, 2025, 4:35 PM

#

if that doesn't work try restarting ur pc

#

could be some weird windows interaction

cobalt carbon May 26, 2025, 4:36 PM

#

analog obsidian if that doesn't work try restarting ur pc

i tried it several times , when im on cs cmd goes sleep mode i think

analog obsidian May 26, 2025, 4:36 PM

#

or the browser is muting your mic when you close it/hide it

#

try a different browser

#

anything but operagx

cobalt carbon May 26, 2025, 4:37 PM

#

oki

#

im using brave

analog obsidian May 26, 2025, 4:37 PM

#

try chrome

cobalt carbon May 26, 2025, 4:37 PM

#

oke

oblique viper May 26, 2025, 4:37 PM

#

@simple ore could you explain which cells I have to run step by step in the No UI colab to keep training my backup? I'm getting this error
I've tried:
Mount google drive > Clone > Install > Load a Backup

analog obsidian May 26, 2025, 4:37 PM

#

yeah this weird issue comes due to w-okada being written in javascript which is extremely buggy

#

so every browser reacts differently to the gui

#

some are fine with it, some can't run it properly

cobalt carbon May 26, 2025, 4:40 PM

#

ye the problem is i think when im on cs browser is not using mic

#

same problem is on chrome too

hallow thistle May 26, 2025, 4:40 PM

#

Some say the Javascript is a trash programming language, but that's it. anime_nom

analog obsidian May 26, 2025, 4:40 PM

#

iirc the reason why is running in your browser is because the guy who made the fork noticed running it in the browser had better perfomance than running it in a window

cobalt carbon May 26, 2025, 4:41 PM

#

i hate java.

analog obsidian May 26, 2025, 4:41 PM

#

i may be wrong tho, that was long ago

analog obsidian May 26, 2025, 4:41 PM

#

cobalt carbon same problem is on chrome too

try edge

cobalt carbon May 26, 2025, 4:42 PM

#

it worked with edge

#

lol

analog obsidian May 26, 2025, 4:42 PM

#

xD

#

thats javascript for ya

#

buggy asf

cobalt carbon May 26, 2025, 4:43 PM

#

but it works bad

#

i need to change my pc

analog obsidian May 26, 2025, 4:43 PM

#

what gpu you have?

cobalt carbon May 26, 2025, 4:43 PM

#

1650

#

trash

analog obsidian May 26, 2025, 4:43 PM

#

ah yeah

#

i'd recommend a 4060 minimum

cobalt carbon May 26, 2025, 4:44 PM

#

i plan to buy a 5070 or 5070ti

analog obsidian May 26, 2025, 4:44 PM

#

nice, thats more than enough for this

#

at the moment you could try fcpe instead of rmvpe

#

fcpe is like a slightly less accurate rmvpe

#

but runs very fast

oblique viper May 26, 2025, 4:47 PM

#

at this point I would've had better luck training on my cpu for 42 hours straight than using colab angerysad

analog obsidian May 26, 2025, 4:47 PM

#

analog obsidian fcpe is like a slightly less accurate rmvpe

estimation wise, so dont worry, it doesn't affect the model's quality

oblique viper May 26, 2025, 4:49 PM

#

4 days of straight up 12 hours a day of trying to train a model using colab doesn't do good things to your brain

clever burrow May 26, 2025, 4:51 PM

#

sorry i forgot to say thank you 😭

analog obsidian May 26, 2025, 4:51 PM

#

oblique viper 4 days of straight up 12 hours a day of trying to train a model using colab does...

what if you start from 0 in the no ui colab
most models don't need more than 200 epochs

glacial pollen May 26, 2025, 4:51 PM

#

clever burrow sorry i forgot to say thank you 😭

👌

oblique viper May 26, 2025, 4:52 PM

#

analog obsidian what if you start from 0 in the no ui colab most models don't need more than 200...

that'd be my fourth time training the same model from 0

clever burrow May 26, 2025, 4:52 PM

#

i have one more question though

#

i've tried to download a model on applio but it doesn't seem to be popping up even after refreshing

analog obsidian May 26, 2025, 4:52 PM

#

oblique viper that'd be my fourth time training the same model from 0

so whats the problem? applio doesn't save the epochs?

oblique viper May 26, 2025, 4:53 PM

#

first time I tried training with CPU, that went bad, reached like epoch 70, then had to start again on colab, didn't have auto backup on so lost progress

#

most of what I did is a blur at this point, I'm on 2 hours of sleep

analog obsidian May 26, 2025, 4:53 PM

#

im pretty sure the autobackup option only saves g and d

#

you can convert the G file to a pth file actually

#

and use it as a normal model

oblique viper May 26, 2025, 4:55 PM

#

analog obsidian im pretty sure the autobackup option only saves g and d

no it saved epochs

clever burrow May 26, 2025, 4:55 PM

#

clever burrow i've tried to download a model on applio but it doesn't seem to be popping up ev...

oh nvm

analog obsidian May 26, 2025, 4:55 PM

#

so uhm you wanna train 200 epochs?

oblique viper May 26, 2025, 4:56 PM

#

I reached 235 epochs, I want to get to 300-500 to have as good quality as I can

analog obsidian May 26, 2025, 4:56 PM

#

oblique viper I reached 235 epochs, I want to get to 300-500 to have as good quality as I can

more epochs doesn't mean better quality

#

so in simple words
epochs = time the model has seen the whole dataset

#

if you force the ai to see the same thing a lot, it will believe it should only be able to clone the dataset and nothing more

#

it will quickly forget the pretrain knowledge

#

and become dumb

oblique viper May 26, 2025, 4:57 PM

#

analog obsidian more epochs doesn't mean better quality

how it feels reading this

analog obsidian May 26, 2025, 4:57 PM

#

analog obsidian it will quickly forget the pretrain knowledge

this translates as the model sounding robotic asf

#

and a lot of random weird problems

#

the model is going to try to clone your audio but it will not have the full knowledge of how to do it correctly

#

the only thing that takes 300-500 epochs, are pretrains

#

these are trained with like 50 hours of audio

#

your small dataset doesn't compare to that

#

so realistically speaking, you don't need more than 200 epochs

oblique viper May 26, 2025, 4:59 PM

#

I have 12 minutes of audio

analog obsidian May 26, 2025, 4:59 PM

#

most small models are done within the 100-150 epoch range

analog obsidian May 26, 2025, 5:00 PM

#

analog obsidian most small models are done within the 100-150 epoch range

but batch size also affects this

analog obsidian May 26, 2025, 5:00 PM

#

oblique viper I have 12 minutes of audio

i would use either batch size 8 or 4

oblique viper May 26, 2025, 5:00 PM

#

analog obsidian i would use either batch size 8 or 4

I used the default

analog obsidian May 26, 2025, 5:01 PM

#

oblique viper I used the default

train 200 epochs
save every 10
listen to all the epochs and choose the one you like best

oblique viper May 26, 2025, 5:02 PM

#

okay then I have all the epochs I need

analog obsidian May 26, 2025, 5:02 PM

#

there is a less biased method of choosing epochs but you're a beginner so stick to what i said, is easier

oblique viper May 26, 2025, 5:02 PM

#

time to test

oblique viper May 26, 2025, 5:02 PM

#

analog obsidian there is a less biased method of choosing epochs but you're a beginner so stick ...

what is the non biased method?

analog obsidian May 26, 2025, 5:02 PM

#

oblique viper what is the non biased method?

looking at the spectogram and see which epoch perfomed the best

#

you wanna check spectogram reproduction

oblique viper May 26, 2025, 5:03 PM

#

analog obsidian looking at the spectogram and see which epoch perfomed the best

how can I set it up if it's not too complicated?

analog obsidian May 26, 2025, 5:03 PM

#

oblique viper how can I set it up if it's not too complicated?

you know how to analyze spectograms? if thats the case, use rx11 since it's more precise

#

spek works too but rx11 allows for more precise analysis

oblique viper May 26, 2025, 5:04 PM

#

analog obsidian you know how to analyze spectograms? if thats the case, use rx11 since it's more...

tbf I never analyzed a spectogram, is it too complicated to learn or is it similar to learning how to read graphs?

analog obsidian May 26, 2025, 5:04 PM

#

oblique viper tbf I never analyzed a spectogram, is it too complicated to learn or is it simil...

is harder than reading graphs thats for sure

#

tho is easy to spot when the model is generating noise instead of data

#

you'll see missing harmonics

#

also some random artifacting

#

alongside more spectogram related issues

#

good epochs are able to do decent spectogram reproduction

#

so they sound less robotic and more natural

oblique viper May 26, 2025, 5:07 PM

#

I'll try to listen to some epochs ranging from 120 to 200 for now, maybe after I rest I'll try the spectogram way

analog obsidian May 26, 2025, 5:07 PM

#

sure, do some research about spectograms in general

#

you need that knowledge

#

at least for rvc is needed

#

you can also know if your batch size was either too much or too low by analyzing your model spectogram

#

but at the end, small datasets (10 minutes and below) give very random results, so for example, you can get a very bad result in your first training run, but if you train the same dataset a second time, you may probably get a better result than the first

oblique viper May 26, 2025, 5:21 PM

#

what are these? @analog obsidian how do they affect talking?

#

also thanks a ton for being kind and willing to help, you're very down to earth kanna_heart

analog obsidian May 26, 2025, 5:29 PM

#

oblique viper what are these? <@775545133448953856> how do they affect talking?

index = is a file where the accent of the dataset is stored, it is possible to use index files from another dataset tho i only use the index of my model
a safe value is 0.5, so a 50% of the dataset's accent will be added in the result, the other 50% will come from the pretrain
too high values may introduce artifacting (glitching, voice cracks, weird sounds)

volume envelope = rms normalization, in applio this is bugged, so don't use a value different than 1

protect voiceless = supposedly decreases the amount of robotic sibilants and breaths but a good model doesn't need this, in case you wanna play with this, start with a value of 0.33, bigger values decreases this protection, and lower values increases it

#

a value of 0.5 disables protect voiceless

#

for analyzing epochs don't use the index file

#

you can then use the index file after you find your best sounding epoch

analog obsidian May 26, 2025, 5:32 PM

#

analog obsidian for analyzing epochs don't use the index file

index may introduce some issues the model doesn't have to begin so thats why it's safer to analyze epochs without using the index

#

personally i have noticed using the index makes the model sound more true to the dataset

#

so better resemblance between the model and the real voice (this is probably why rvc-boss, the author of rvc, added index files)

oblique viper May 26, 2025, 5:49 PM

#

analog obsidian index may introduce some issues the model doesn't have to begin so thats why it'...

is it possible to change how strongly the index impacts the audio?

simple ore May 26, 2025, 5:50 PM

#

analog obsidian index = is a file where the accent of the dataset is stored, it is possible to u...

it does not work like that

#

the percentage is a blend between phonemes from the audio and the matching phoneme from the index file

analog obsidian May 26, 2025, 5:51 PM

#

oblique viper is it possible to change how strongly the index impacts the audio?

i think not but eh i noticed different values changes things a bit

simple ore May 26, 2025, 5:52 PM

#

0 - use phonemes from audio as is, 1 - use whatever the matches found in the index, anything between - blend the values

#

audio has 'th' phonemes, the closest faiss finds in the index is 'z', if you use 1 your model speaks english with german accent

oblique viper May 26, 2025, 5:53 PM

#

so if I use 0.5, my model speaks slightly german accent?

analog obsidian May 26, 2025, 5:53 PM

#

simple ore 0 - use phonemes from audio as is, 1 - use whatever the matches found in the ind...

i thought 0 was coming from the pretrain

analog obsidian May 26, 2025, 5:53 PM

#

oblique viper so if I use 0.5, my model speaks slightly german accent?

basically

#

obviously if your model has a english index, it'll have an american/brittish accent instead

simple ore May 26, 2025, 5:55 PM

#

the model still can make an incorrect preduction what the sound should be if it has not been trained on specific phonemes

#

there may still be a slight accent with 0 index

#

see #1376562269080649769 message

analog obsidian May 26, 2025, 5:59 PM

#

simple ore the model still can make an incorrect preduction what the sound should be if it ...

so the random voice cracks while using the index is because of this? like a japanese index trying to infer english audio

simple ore May 26, 2025, 6:00 PM

#

likely yes, it just finds a bad match/does not find anything

analog obsidian May 26, 2025, 6:00 PM

#

o nice to know

oblique viper May 26, 2025, 6:00 PM

#

where can I change the value of the index ?

simple ore May 26, 2025, 6:00 PM

#

and then the model is unable to produce anything because it has never seen such phoneme

analog obsidian May 26, 2025, 6:00 PM

#

oblique viper where can I change the value of the index ?

"search feature ratio" by default is 0.75

analog obsidian May 26, 2025, 6:01 PM

#

simple ore and then the model is unable to produce anything because it has never seen such ...

interesting, so i suppose dataset size also matters in this regard

oblique viper May 26, 2025, 6:02 PM

#

ohhh

#

feature ratio made it so much better

#

setting it higher

analog obsidian May 26, 2025, 6:02 PM

#

0.75 = 75% of the dataset accent blended in the result

#

find a sweespot where it works good for different audios tho, don't just use one

simple ore May 26, 2025, 6:03 PM

#

with 200k slices the index creation runs clusetering argorithm to group close enough samples to some average, then it runs a trim if there are more than 4000.. you can see it in the log with minibatch output

analog obsidian May 26, 2025, 6:04 PM

#

simple ore with 200k slices the index creation runs clusetering argorithm to group close en...

oh yes that kmeans thing

simple ore May 26, 2025, 6:04 PM

#

kmeans is 200k -> 4k

analog obsidian May 26, 2025, 6:04 PM

#

for bigger sets, lets say above 1 hour, is not worth to use faiss?

simple ore May 26, 2025, 6:05 PM

#

at most you can have ~4k unique samples

#

that's 3 hour set

#

and even then it will narrow it down to ~1200-1500

analog obsidian May 26, 2025, 6:05 PM

#

i was about to train a 3hour set

#

cat_dance

simple ore May 26, 2025, 6:06 PM

#

the largest index I saw about 1GB

#

you dont need to train the whole thing, you can just preprocess and extract features, then run the index creation

analog obsidian May 26, 2025, 6:06 PM

#

i see

analog obsidian May 26, 2025, 6:07 PM

#

simple ore and even then it will narrow it down to ~1200-1500

but isnt this bad?

#

or the index doesn't need that much data?

#

cat_sadcat

#

if i remember well, rvc-boss added kmeans because there was a bug that prevented index file generation while using sets above 1 hour

analog obsidian May 26, 2025, 6:11 PM

#

analog obsidian if i remember well, rvc-boss added kmeans because there was a bug that prevented...

okay this is not the case, seems like he did this to speed up the index generation

simple ore May 26, 2025, 6:14 PM

#

analog obsidian or the index doesn't need that much data?

i mean.. if it fine, but for realtime it may be taking extra time to look up phonemes

analog obsidian May 26, 2025, 6:15 PM

#

simple ore i mean.. if it fine, but for realtime it may be taking extra time to look up pho...

ohh good to know this, thank you cat_yes

simple ore May 26, 2025, 6:15 PM

#

i have not tested it, i need to find a big index and compare

swift thunder May 26, 2025, 7:05 PM

#

sudden quail May 26, 2025, 8:11 PM

#

@low shard is there a proper tutorial into using rvc i wanna use voice changer with discord

latent kettle May 26, 2025, 8:16 PM

#

sudden quail <@911742715019001897> is there a proper tutorial into using rvc i wanna use voic...

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#deiteris-w-okada-fork

Deiteris' W Okada Fork

Last update: May 5, 2025

languid cliff May 26, 2025, 8:55 PM

#

analog obsidian but batch size also affects this

So higher batch size produces a worse result? even with a very high end GPU?

So would you say generally 200 epochs is the middle ground for most samples between 10-40 minutes?

analog obsidian May 26, 2025, 8:56 PM

#

languid cliff So higher batch size produces a worse result? even with a very high end GPU? So...

batch size 8 in a 5 minute dataset is bad but on a 20 minute one is good

languid cliff May 26, 2025, 8:56 PM

#

analog obsidian batch size 8 in a 5 minute dataset is bad but on a 20 minute one is good

i have a sample size of 40 minutes right now i want to train. So i should pick 8 over 4?

analog obsidian May 26, 2025, 8:56 PM

#

and i can't predict where your model is going to start to overtrain since thats random

analog obsidian May 26, 2025, 8:56 PM

#

languid cliff i have a sample size of 40 minutes right now i want to train. So i should pick 8...

i would use 8 yes

languid cliff May 26, 2025, 8:57 PM

#

analog obsidian and i can't predict where your model is going to start to overtrain since thats ...

Is the overtrain detection Applio has actually useful?

#

or is it more of a gimmic

analog obsidian May 26, 2025, 8:57 PM

#

languid cliff Is the overtrain detection Applio has actually useful?

no, i have asked one of the devs to remove yet is still there

crude flame May 26, 2025, 8:57 PM

#

languid cliff or is it more of a gimmic

its a gimmick that doesnt even work

languid cliff May 26, 2025, 8:57 PM

#

ah, so its useless then

analog obsidian May 26, 2025, 8:58 PM

#

hearing overfitting/overtraining is kinda easy, at a certain point the model will sound very robotic

#

you just use anything before that happens

#

so lets anything above 150e sounds very bad but anything prior to that is "alright"

languid cliff May 26, 2025, 8:59 PM

#

Yeah fair enough, so i could just set the epoch to 400 and save every 20, and just see through all of them

analog obsidian May 26, 2025, 8:59 PM

#

yeah basically

#

graphs don't tell when your model is "done"

languid cliff May 26, 2025, 8:59 PM

#

the model i trained now sounded fine to me at 500 epochs, and now i added even more audio to it

analog obsidian May 26, 2025, 8:59 PM

#

they're there so ppl can check for divergence issues and such

languid cliff May 26, 2025, 9:00 PM

#

but doesnt longer audio usually mean you should use less epochs? or did i get that completely wrong

#

or is it more dependent on how varying your audio sample is

analog obsidian May 26, 2025, 9:00 PM

#

depends in your batch size

languid cliff May 26, 2025, 9:00 PM

#

with different words, tones, etc

analog obsidian May 26, 2025, 9:00 PM

#

and how different is the dataset compared to the pretrain

#

the og pretrain is trained using very monotone speech

languid cliff May 26, 2025, 9:01 PM

#

peepoNotes

#

i see

analog obsidian May 26, 2025, 9:01 PM

#

so if ur dataset is also monotone like the pretrain, the model will have a more easy task learning ur set

languid cliff May 26, 2025, 9:01 PM

#

my dataset is very varying in pitch and tone

analog obsidian May 26, 2025, 9:02 PM

#

what you can do is to train 200 epochs and if you notice e200 sounds fine, you can continue training until the model starts to sound very metallic/robotic

#

tho i personally never train over 100 epochs

languid cliff May 26, 2025, 9:03 PM

#

So im assuming its very easy to notice when its overtrained?

analog obsidian May 26, 2025, 9:03 PM

#

yea u dont need to be an audio nerd to notice when the model sounds unnatural and robotic

#

is pretty obvious

#

has a particular ugly robotic sound

languid cliff May 26, 2025, 9:05 PM

#

yeah fair enough. On my 500 epochs i noticed like a few words and pronounciations that sounded a bit bad, idk if that because of lack of data, or too much data (too many epochs)

#

i guess i could test the 400 and 300 one and see if its better

analog obsidian May 26, 2025, 9:05 PM

#

epochs = everytime the model has seen its full dataset

#

so your model has seen its own dataset 500 times

languid cliff May 26, 2025, 9:06 PM

#

yeah

analog obsidian May 26, 2025, 9:06 PM

#

smaller datasets don't have too much data to begin so they overfit pretty fast

#

for example a pretrain that has 50 hours is trained using 300 epochs

#

because there's too many stuff the model has to learn

#

but 300 epochs with a 5 minute dataset is overkill

#

more epochs don't mean better results

languid cliff May 26, 2025, 9:08 PM

#

yeah

#

soo a general rule then is larger dataset = more epochs?

#

BUT also dependent on batch size?

analog obsidian May 26, 2025, 9:09 PM

#

yup

#

you cant predict how many exactly

#

rvc is random

languid cliff May 26, 2025, 9:09 PM

#

So with a larger batch size, you want less epochs?

#

yeah i see

analog obsidian May 26, 2025, 9:09 PM

#

languid cliff So with a larger batch size, you want less epochs?

no way to tell, again, its random

#

too many factors

languid cliff May 26, 2025, 9:10 PM

#

ah okay, so its not like a rule of thumb for it

#

gotcha

analog obsidian May 26, 2025, 9:10 PM

#

yup

#

depends how hard is the dataset to learn

#

depends in a lot of things really

#

u can try two approach of selecting epochs
you can either train 200e and save every 10
or train 200e, save everything, and hear all until you find one that sounds more natural to you

languid cliff May 26, 2025, 9:12 PM

#

mhm

#

okay ill try at 200

#

Also, for "silent training files", even if the audio has no background noise, do you usually always leave this on 2?

analog obsidian May 26, 2025, 9:13 PM

#

yes, dont touch that

#

leave it set to 2

languid cliff May 26, 2025, 9:13 PM

#

okay

#

The "Fresh training" option, do you always check this ON when making a new model? or is that if you are making pre-trains?

#

same with the "Dataset Creator"

analog obsidian May 26, 2025, 9:15 PM

#

languid cliff The "Fresh training" option, do you always check this ON when making a new model...

that option deletes the G and D files, and the graph files (eval)

#

use it if you wanna start your training from 0

languid cliff May 26, 2025, 9:16 PM

#

Okay sounds good

analog obsidian May 26, 2025, 9:16 PM

#

but sure you can have it enabled when making a new model, nothing bad happens

#

just don't enable it when resuming training

#

otherwise all of the process will be lost

languid cliff May 26, 2025, 9:17 PM

#

yeah gotcha 👍

#

Its exciting trying to constantly improve the voice, the spectogram stuff sounds exciting too, but also sounds like a lot of stuff to learn

languid cliff May 26, 2025, 9:27 PM

#

analog obsidian train 200 epochs save every 10 listen to all the epochs and choose the one you l...

Is there a easy, fast and consistent way to do this, that you would recommend? Im assuming it would be best to somehow run a pre-recorded voice sample though the voice changer for consistency?

tough fiber May 26, 2025, 9:31 PM

#

anyone know is collab broken atm? my voice on deiters fork is bad

#

cutting and distorted my voice

analog obsidian May 26, 2025, 9:33 PM

#

languid cliff Is there a easy, fast and consistent way to do this, that you would recommend? I...

test the model in applio, don't use the voice changer for testing purposes

#

voice changer inference works differently

languid cliff May 26, 2025, 9:35 PM

#

analog obsidian voice changer inference works differently

so you WANT to use voice inference to test, right?

analog obsidian May 26, 2025, 9:35 PM

#

yep

#

if it works fine there, it'll work fine in realtime

languid cliff May 26, 2025, 9:35 PM

#

Okay, sounds good! will try that then

#

Great

merry shore May 26, 2025, 11:43 PM

#

Need help with the Okada Voice Changer, only Beatrice V2 models are producing audio output
I am using an app called audioRelay
To connect my phone to computer for microphone

light latch May 27, 2025, 1:19 AM

#

Hi! I wanted help with the Colab Research link to create ai covers, Whenever I click on an old link it doesn't go through, does anyone know what the new link is?

sudden quail May 27, 2025, 1:28 AM

#

does anyone have a tutorial for rvc/

#

?

knotty moth May 27, 2025, 2:49 AM

#

merry shore Need help with the Okada Voice Changer, only Beatrice V2 models are producing au...

only beatrice model is good in the og 2.x beta version

#

otherwise you should try the fork version: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Deiteris' W Okada Fork

Last update: May 5, 2025

stable tartan May 27, 2025, 2:55 AM

#

Hey, does anyone know how to make AI stuff like the clip on the bottom: https://www.facebook.com/reel/677256711796852/?rdid=DahxdqgAZes9Pdpp&share_url=https%3A%2F%2Fwww.facebook.com%2Fshare%2Fr%2F1JeV3Wtifi%2F

I've tried HeyGen but I can't get it to just stare at the screen and do subtle motion like they're reacting to the video. Much appreciated.

glass summit May 27, 2025, 6:54 AM

#

Hi anyone who used Foocus or any other ui who made lora can please join vc? i need a smal help

#

Please

reef haven May 27, 2025, 7:45 AM

#

where can i find some good ai cover sites?

modest dagger May 27, 2025, 9:04 AM

#

guys can anyone tell me free voice changer with rvc

#

live voice changer

red remnant May 27, 2025, 9:41 AM

#

Guys, who knows why the program doesn't load, I've already tried everything, all the components, updated, a lot of things, why the program just doesn't load, and after 2-3 minutes it gives the error Error: Could not load Voice Focus estimator. and when I resume the same thing, does anyone know how to solve this problem?

simple ore May 27, 2025, 10:04 AM

#

red remnant Guys, who knows why the program doesn't load, I've already tried everything, all...

what's your GPU?

red remnant May 27, 2025, 10:08 AM

#

simple ore what's your GPU?

NVIDIA GeForce RTX 3080

knotty moth May 27, 2025, 10:10 AM

#

red remnant Guys, who knows why the program doesn't load, I've already tried everything, all...

and the program version you're using?

red remnant May 27, 2025, 10:12 AM

#

knotty moth and the program version you're using?

MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a.zip

knotty moth May 27, 2025, 10:13 AM

#

red remnant MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a.zip

try this version https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

red remnant May 27, 2025, 10:14 AM

#

okay

wintry iron May 27, 2025, 11:48 AM

#

pls help how to change epoch

outer wasp May 27, 2025, 12:30 PM

#

why i cant choose model

simple ore May 27, 2025, 12:41 PM

#

outer wasp why i cant choose model

do you actually have a model in logs folder?

hallow thistle May 27, 2025, 1:27 PM

#

wintry iron pls help how to change epoch

Which epoch? Are you talking about RVC voice model or W-Okada?

#

!howtoask

patent trellisBOT May 27, 2025, 1:27 PM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

outer wasp May 27, 2025, 1:44 PM

#

simple ore do you actually have a model in logs folder?

Oh i have done it thank!

modern crane May 27, 2025, 1:48 PM

#

guys, the ai voice changer program opens start-https.bat and the console closes immediately what to do

hallow thistle May 27, 2025, 1:57 PM

#

Chunk doesn't make the audio sound better in quality; it's more like what makes the audio to delay. What makes better quality is Extra. A GPU has a contribution at converting audio in real time on W-Okada.

hallow thistle May 27, 2025, 1:58 PM

#

modern crane guys, the ai voice changer program opens start-https.bat and the console closes ...

You use the original version of W-Okada, which is old and outdated. I'm guessing you have followed a tutorial video on YouTube before. What is your PC GPU?

modern crane May 27, 2025, 1:59 PM

#

video card invidia gpu intel

#

MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a.zip

this file i download

hallow thistle May 27, 2025, 2:01 PM

#

modern crane video card invidia gpu intel

To check your PC GPU name, open Task Manager.

modern crane May 27, 2025, 2:02 PM

#

hallow thistle To check your PC GPU name, open Task Manager.

nvidia

hallow thistle May 27, 2025, 2:03 PM

#

modern crane nvidia

That's a brand name, not a full name of GPU. On Task Manager, go to Performance tab, spot where GPU 0 or GPU 1 is in the left side there, and click one of them to reveal its full name on the right side.

modern crane May 27, 2025, 2:03 PM

#

intel xeon cpu e5-2650 v2

hallow thistle May 27, 2025, 2:04 PM

#

modern crane intel xeon cpu e5-2650 v2

No. That's CPU.

#

For example, if your PC has NVIDIA GeForce RTX 4090, it's RTX 4090.

modern crane May 27, 2025, 2:04 PM

#

nvidia gtx 1070

tight ether May 27, 2025, 2:04 PM

#

noddingblob

hallow thistle May 27, 2025, 2:06 PM

#

modern crane nvidia gtx 1070

Download and use this better W-Okada instead, since you got NVIDIA GeForce GTX 1070 in an Intel Xeon PC. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

modern crane May 27, 2025, 2:06 PM

#

Download NVIDIA on Windows
The lastest version as of December 7th 2024 is: nvidia-b2332 (click here to download)
If you have a GTX 700 card or below, use AMD/Intel version instead.

this?

tight ether May 27, 2025, 2:07 PM

#

noddingblob

hallow thistle May 27, 2025, 2:07 PM

#

kazusasip

#

https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/image.png

modern crane May 27, 2025, 2:08 PM

#

hallow thistle https://cdn.discordapp.com/attachments/1159290139609137264/1371371778181431328/i...

this?

#

ok

tight ether May 27, 2025, 2:08 PM

#

y.u.p.

hallow thistle May 27, 2025, 2:08 PM

#

Yes.

modern crane May 27, 2025, 2:09 PM

#

hallow thistle Yes.

and after downloading, what should I do?

hallow thistle May 27, 2025, 2:09 PM

#

modern crane and after downloading, what should I do?

!howtoask

patent trellisBOT May 27, 2025, 2:09 PM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

hallow thistle May 27, 2025, 2:10 PM

#

Um. Which W-Okada version are you using? And what is your PC GPU?

tight ether May 27, 2025, 2:11 PM

#

namari is really busy in this channel.

hallow thistle May 27, 2025, 2:13 PM

#

It's possible to lower chunk number under 30 ms for less delay, extra number stays at 2.7 s, and also force to use fp32.

#

I don't know how to explain this. cat_deaed

#

Damn. Although you can set extra up to 5 s, most of the time you may experience audio cutting off a lot, so 2.7 s is best overall. If the audio quality still low, it can be a voice model you're currently using.

mellow ermine May 27, 2025, 2:24 PM

#

If I buy premium weights will I get a better quality image?

hallow thistle May 27, 2025, 2:31 PM

#

mellow ermine If I buy premium weights will I get a better quality image?

I'm not sure if paying for Weights Premium would help generating image in better quality, but these are privileges of having premium going there.

tough gale May 27, 2025, 3:03 PM

#

can i get some advice on making the ai voice changer sound "better". no matter what i seem to do with the pitch, format shift or index it always seems to sound off.

languid cliff May 27, 2025, 3:25 PM

#

Try a different model?

hallow thistle May 27, 2025, 4:20 PM

#

languid cliff Try a different model?

That's one of the tips on how to make W-Okada to sound good. misc_true

tough gale May 27, 2025, 4:25 PM

#

i swear iv tried so many lol

#

i still have not ruled out the ai not liking the british accent yet

analog obsidian May 27, 2025, 4:28 PM

#

just train a better one

oblique viper May 27, 2025, 5:30 PM

#

hallow thistle I'm not sure if paying for Weights Premium would help generating image in better...

doesn't look like they offer better quality images

queen sapphire May 27, 2025, 6:58 PM

#

yo whats the newest way to make ai covers of songs

#

havent done it since i used ilaria

#

??????

#

help pls

worldly ibex May 27, 2025, 7:41 PM

#

can apollo do from youtube? or even if not from youtube, is there any that seperate the audio convert it and combine them together?

narrow portal May 27, 2025, 9:43 PM

#

Is it just me, or the overtraining detector is not working?

#

Also I just pressed the Stop Training button and it's still training

edgy tangle May 27, 2025, 10:48 PM

#

narrow portal Is it just me, or the overtraining detector is not working?

The overtraining detector doesn't work very "well"

#

I think it is useless

#

You should just manually detect overtraining

#

And looks like the model didn't learned nothing after ~40k (I think :b)

narrow portal May 27, 2025, 10:50 PM

#

narrow portal Is it just me, or the overtraining detector is not working?

Yeah, latest epoch is 431 but the lowest point here comes from 240-270

#

So i think I might just stay with epoch 240-270 or something

edgy tangle May 27, 2025, 10:51 PM

#

How long is your dataset?

narrow portal May 27, 2025, 10:51 PM

#

29min

edgy tangle May 27, 2025, 10:51 PM

#

Hmmm

narrow portal May 27, 2025, 10:51 PM

#

All samples are 22hz so I used 30hz

edgy tangle May 27, 2025, 10:52 PM

#

After 60k is overtrained

#

I recommend you to check lowest points between 40-60k steps (Not only the lowest of them)

narrow portal May 27, 2025, 10:53 PM

#

Oh

#

I see

#

I don't have my PC rn so I will check that later

edgy tangle May 27, 2025, 10:53 PM

#

Probably the lowest is the best one, but it might be just noise and not really a good step

#

Just compare them and find the best one

#

cat_blep

analog obsidian May 27, 2025, 11:03 PM

#

narrow portal Is it just me, or the overtraining detector is not working?

graphs don't help in finding the best epoch nor when overtraining starts in rvc, g/total is a bit innacurate so don't rely on it too much
instead, hear the epochs (every 10 is ok) and keep the one who sound the best for you

narrow portal May 27, 2025, 11:13 PM

#

analog obsidian graphs don't help in finding the best epoch nor when overtraining starts in rvc,...

Oh, if you say so

languid cliff May 28, 2025, 1:09 AM

#

Probably a dumb question, but is longer dataset always better? or does it at one point end up making the final voice worse with too long dataset?

simple ore May 28, 2025, 1:15 AM

#

languid cliff Probably a dumb question, but is longer dataset always better? or does it at one...

it is a careful balance of getting the model to learn a new voice and not eroding allt he things it learned during original pretrain training

languid cliff May 28, 2025, 1:18 AM

#

simple ore it is a careful balance of getting the model to learn a new voice and not erodin...

Hmm ok, i have a solid 1 hour of very clean audio now, but idk if thats too overkill

knotty moth May 28, 2025, 1:22 AM

#

languid cliff Probably a dumb question, but is longer dataset always better? or does it at one...

given consistent quality and variation, it is like 85% to 95% to 99%

knotty moth May 28, 2025, 1:22 AM

#

languid cliff Hmm ok, i have a solid 1 hour of very clean audio now, but idk if thats too over...

it's still fine but you should also pay attention on the quality consistency

analog obsidian May 28, 2025, 1:23 AM

#

languid cliff Probably a dumb question, but is longer dataset always better? or does it at one...

there's no limit, just remember that a big dataset (over 1 hour) may require a higher batch size than just 8

languid cliff May 28, 2025, 1:23 AM

#

knotty moth it's still fine but you should also pay attention on the quality consistency

the quality is all the same, its all pure voice audio with no background noise removed

#

There might be a slight dB level change between the clips, does that matter?

#

if so, i can try to normalize it to 1 level

languid cliff May 28, 2025, 1:25 AM

#

analog obsidian there's no limit, just remember that a big dataset (over 1 hour) may require a h...

oh okay, its like 1 hour and 3 minutes now. Like 12 batch size? 8 maybe?

knotty moth May 28, 2025, 1:25 AM

#

languid cliff oh okay, its like 1 hour and 3 minutes now. Like 12 batch size? 8 maybe?

8 is fine

languid cliff May 28, 2025, 1:32 AM

#

knotty moth given consistent quality and variation, it is like 85% to 95% to 99%

Oaky, ill try 8. What do you mean by these percentages?

analog obsidian May 28, 2025, 1:33 AM

#

languid cliff oh okay, its like 1 hour and 3 minutes now. Like 12 batch size? 8 maybe?

i would use 16

languid cliff May 28, 2025, 1:34 AM

#

analog obsidian i would use 16

okay, 16 it is then 👍 . Idk if its too complicated to explain, but whats the technical reason for higher batch size for larger datasets?

dry marsh May 28, 2025, 1:35 AM

#

does anyone know about spectrograms

analog obsidian May 28, 2025, 1:36 AM

#

languid cliff okay, 16 it is then 👍 . Idk if its too complicated to explain, but whats the te...

gradients may be too noisy and unstable when using very low batch sizes in big datasets

languid cliff May 28, 2025, 1:36 AM

#

analog obsidian gradients may be too noisy and unstable when using very low batch sizes in big d...

Ohh okay, gotcha

dry marsh May 28, 2025, 1:37 AM

#

can any1 help w my forum

languid cliff May 28, 2025, 1:47 AM

#

Oh damn, this is actually really fast with 16 batch size

analog obsidian May 28, 2025, 1:50 AM

#

languid cliff Oh damn, this is actually really fast with 16 batch size

just in case: don't ever use batch size 16 in small datasets

#

thats for 1 hour and above

languid cliff May 28, 2025, 1:50 AM

#

Yeah

#

So 32 batch size for 2 hours and above?

#

Or does it not scale thst way

analog obsidian May 28, 2025, 1:51 AM

#

languid cliff Or does it not scale thst way

it doesn't but you can indeed use batch size 32 with 2 hours and above

#

be sure your dataset is expressive enough and not monontone speech

languid cliff May 28, 2025, 1:52 AM

#

Yeah, thats what im worried about if my dataset is varying enough, but its dedinitely not "monotone" at least

languid cliff May 28, 2025, 1:53 AM

#

analog obsidian it doesn't but you can indeed use batch size 32 with 2 hours and above

Batch size is heavily limited by VRAM right?

analog obsidian May 28, 2025, 1:53 AM

#

languid cliff Batch size is heavily limited by VRAM right?

yeah

#

16 is good enough for 2 hours and above too

#

so dw

languid cliff May 28, 2025, 1:53 AM

#

So thats usually whats stopping people from being able to do good 1hr+ datasets?

#

Yeah gotcha

analog obsidian May 28, 2025, 1:54 AM

#

languid cliff So thats usually whats stopping people from being able to do good 1hr+ datasets?

laziness

#

they take a hella time to clean

languid cliff May 28, 2025, 1:54 AM

#

Its using 16,6/32GB right now, so

analog obsidian May 28, 2025, 1:54 AM

#

i finished cleaning my 2 hour set in like 4 days

languid cliff May 28, 2025, 1:55 AM

#

With like removing instrumentals, background noise, etc?

analog obsidian May 28, 2025, 1:55 AM

#

yup

#

contentvec is very very noise sensitive

languid cliff May 28, 2025, 1:56 AM

#

Yeah feel that. Im lucky with this one since its just talking, pure voice, with a really good mic

#

So it takes me like 1,5hrs to capture 1 hr of datasets

#

So i dont mind doing 2 hrs, as long as it doesnt "hurt" the dataset with more data

analog obsidian May 28, 2025, 1:57 AM

#

the more you add, the more realistic the output

#

and better the results because you replace more stuff from the pretrain

#

if u rely too much in the pretrain (small datasets) things get weird

languid cliff May 28, 2025, 1:58 AM

#

Yeah, i might just add on to it then, see how good i can get it

#

Im still missing data from like whispering etc, so i might have to see if i can find any data for that

analog obsidian May 28, 2025, 1:58 AM

#

nono

#

dont add whispering

#

rvc hates it

languid cliff May 28, 2025, 1:59 AM

#

Oh okay

analog obsidian May 28, 2025, 1:59 AM

#

makes the whole model sound eww

languid cliff May 28, 2025, 2:00 AM

#

What about like yelling and laughing? And like.. mouth sounds like popping, humming, etc? Is that all bad too?

#

Because i might want to clean up my dataset based on that

analog obsidian May 28, 2025, 2:01 AM

#

languid cliff What about like yelling and laughing? And like.. mouth sounds like popping, humm...

about yelling and laughing im not sure, but i know too much of them fucks up things
and no, every mouth sound is bad

#

rvc randomly adds those sounds in the results

#

idk why but i know it does that

languid cliff May 28, 2025, 2:02 AM

#

Ahh okay

analog obsidian May 28, 2025, 2:02 AM

#

just train clean speech, keep every breath (very important), remove unwanted sounds and noise

#

and.. thats rlly it

languid cliff May 28, 2025, 2:02 AM

#

Yeah it might be like 1-2 minute of yelling out of 1 hour

#

Might have to remove it then

#

Wym keep every breath?

analog obsidian May 28, 2025, 2:03 AM

#

rvc cant clone yelling and laughing so its a bit pointless to add them in the dataset

analog obsidian May 28, 2025, 2:03 AM

#

languid cliff Wym keep every breath?

rvc has to learn how to do breath sounds

#

so it needs breathing samples

languid cliff May 28, 2025, 2:05 AM

#

Im a bit confused on that one, isnt breathing sounds considered background nosise? Because my dataset doesnt have any breathing sounds because of the noise gate, its only clear speech

analog obsidian May 28, 2025, 2:05 AM

#

no

#

breaths are part of the speech

#

they're the most important part of the dataset

#

without them, rvc wont be able to learn how to clone breathing

#

so your model will sound veeery robotic while trying to inference breath

#

never remove them

languid cliff May 28, 2025, 2:09 AM

#

Oh ok, but i havent removed them, there just is none in the dataset. Do you mean like, the inhaling and exhaling type sounds before and after a sentence?

analog obsidian May 28, 2025, 2:09 AM

#

languid cliff Oh ok, but i havent removed them, there just is none in the dataset. Do you mean...

yep these sounds

#

languid cliff May 28, 2025, 2:11 AM

#

analog obsidian yep these sounds

Hmm ok, yeah i dont think i have any clear sounds of that im my data set because of the noise gate being used. I wonder if this is something i could artificially add to teach the model? Or would that be hella work?

analog obsidian May 28, 2025, 2:12 AM

#

languid cliff Hmm ok, yeah i dont think i have any clear sounds of that im my data set because...

nope you cant add them, if you add breaths from another dataset there will be no consistency in the final dataset and rvc does not work very well when cloning a dataset without consistency

languid cliff May 28, 2025, 2:16 AM

#

Ahh, ok. Thats a bummer. I cant recall if my dataset has any of these breaths or not, because i havent been listening for that, but ill check whenever i come back to my pc. If anything, if i add more data, should i try to hunt down audio clips which has these breaths, or does ALL of the dataset need not have them for consistency?

analog obsidian May 28, 2025, 2:17 AM

#

languid cliff Ahh, ok. Thats a bummer. I cant recall if my dataset has any of these breaths or...

if the breaths come from the same source (same mic) i guess it should be fine, just add the breathing samples before a sentence

languid cliff May 28, 2025, 2:18 AM

#

analog obsidian if the breaths come from the same source (same mic) i guess it should be fine, j...

Ah, so if i get a sample from the same person/mic, i can artificially add it onto other sentences in the dataset?

#

If i were to add it to every sentence tho it would take hours hahah

analog obsidian May 28, 2025, 2:20 AM

#

languid cliff Ah, so if i get a sample from the same person/mic, i can artificially add it ont...

if u get the samples from the same person using the same mic, most probably yes (i have never tried something like this btw, but is not bad to try it in case gives good results)

analog obsidian May 28, 2025, 2:20 AM

#

languid cliff If i were to add it to every sentence tho it would take hours hahah

nah don't add them to every sentence lol

#

but rvc kinda needs a lot of breathing samples in order to learn them

#

idk how many exactly

#

you'll need to experiment with that

#

yt_nails

analog obsidian May 28, 2025, 2:24 AM

#

analog obsidian but rvc kinda needs a lot of breathing samples in order to learn them

and yes, i said samples, you need more than just one breathing sample

#

emoji_40

#

gather different unique breathing samples

graceful tinsel May 28, 2025, 2:36 AM

#

https://imgur.com/njXmLsm
not sure what do after making key any help?

#

I’m trying to run it through Google Collabs if that makes any difference

knotty moth May 28, 2025, 4:11 AM

#

languid cliff okay, 16 it is then 👍 . Idk if its too complicated to explain, but whats the te...

12 GB vram is recommended for batch 16

#

otherwise 8 could be okay unless the dataset is diverse

simple ore May 28, 2025, 5:16 AM

#

languid cliff okay, 16 it is then 👍 . Idk if its too complicated to explain, but whats the te...

More data, more variation, larger batch size allows to get a more stable estimate for gradient direction

#

larger the batch the less calculations for gradients have to be done every epoch, so slightly faster

#

but I would not go higher than 8 for an hour long dataset

#

but it is up for you to experiment with

languid cliff May 28, 2025, 5:36 AM

#

analog obsidian gather different unique breathing samples

Okay mhm will try and see what i can gather

languid cliff May 28, 2025, 5:45 AM

#

simple ore but I would not go higher than 8 for an hour long dataset

Oh okay! I finished the one with 16, i just started training the one with 8 batch size now, and ill see which one is better when i come home. I guess its all random and people have different experiences with diffdrent batch sizes

languid cliff May 28, 2025, 5:50 AM

#

simple ore but it is up for you to experiment with

Also, whats your perspective and experience with yelling and laughing in the dataset? Good or bad?

wild storm May 28, 2025, 8:04 AM

#

how ar u

dry marsh May 28, 2025, 8:38 AM

#

hi! does anyone know about facefusion.. because when i process the video the output doesnt come out..

pastel oak May 28, 2025, 8:56 AM

#

dry marsh hi! does anyone know about facefusion.. because when i process the video the out...

What kinda error

dry marsh May 28, 2025, 8:57 AM

#

pastel oak What kinda error

idk it just analyzes it

#

but the output doesnt come out

#

for some rsn

#

i used huggingface

pastel oak May 28, 2025, 8:57 AM

#

ic you asked in the facefusion server anyway wait for an answer there

dry marsh May 28, 2025, 8:57 AM

#

yea 😭

#

been dealing w this for 6 hours

#

do u maybe have any idea why this happened

knotty moth May 28, 2025, 9:00 AM

#

dry marsh hi! does anyone know about facefusion.. because when i process the video the out...

if you have a capable gpu, try setting up locally

dry marsh May 28, 2025, 9:00 AM

#

knotty moth if you have a capable gpu, try setting up locally

im on macOS

#

[FACEFUSION.CORE] Processing step 1 of 1
Analysing: 100% (334/334)

#

literally stops after

stiff goblet May 28, 2025, 12:58 PM

#

@glacial pollen this training is not going well, right ?

glacial pollen May 28, 2025, 1:04 PM

#

stiff goblet <@1239634084133601423> this training is not going well, right ?

Well my dude, this is up to you to judge / learn
This is not my job here really

#

I'm busy rn, watching some vtuber stream

stiff goblet May 28, 2025, 1:05 PM

#

glacial pollen Well my dude, this is up to you to judge / learn This is not my job here really

I'm just asking for the charts actually.

glacial pollen May 28, 2025, 1:05 PM

#

glacial pollen Well my dude, this is up to you to judge / learn This is not my job here really

And I've stressed it multiple times here and there

#

and I will once again, this is not my job on the server to keep guiding people on what's correct / good and or bad / wrong

#

You see, when I was learning, I had to do all on my own, that's the point of learning and understanding

stiff goblet May 28, 2025, 1:06 PM

#

glacial pollen and I will once again, **this is not my job on the server** to keep guiding peop...

Okay that's true. But this is your fork and software, that's why im asking you

glacial pollen May 28, 2025, 1:06 PM

#

Well it is, but as you should know " avg 50 " is not my invention

#

@simple ore He made it

#

I only ported it

#

So if anything, direct questions about that metric to him

#

I am on hiatus / inactive on the server ( doing a big break. Hence would appreciate lack of @ s

stiff goblet May 28, 2025, 1:06 PM

#

glacial pollen So if anything, direct questions about that metric to him

Alright, thanks anyway

simple ore May 28, 2025, 1:24 PM

#

stiff goblet Alright, thanks anyway

unclick

runic heath May 28, 2025, 5:10 PM

#

CPU too high usage when on cs2

river juniper May 28, 2025, 7:41 PM

#

Close cs2

#

.. well either that or make sure your model is using your gpu because it should use minimal cpu if it is

gilded robin May 28, 2025, 8:24 PM

#

"If you are still using CABLE instead of Line 1, I beg you to switch over because it is unironically better than CABLE in any way possible."

#

what line 1?

fair glade May 28, 2025, 8:46 PM

#

hey

simple ore May 28, 2025, 8:52 PM

#

gilded robin "If you are still using CABLE instead of Line 1, I beg you to switch over becaus...

gilded robin May 28, 2025, 8:57 PM

#

simple ore

ah i use cable C & cable D im guessing i cant do that option?

simple ore May 28, 2025, 8:59 PM

#

I believe the suggestion is to ditch vb cable and use the proper one

#

https://software.muzychenko.net/freeware/vac470lite.zip

gilded robin May 28, 2025, 9:01 PM

#

simple ore I believe the suggestion is to ditch vb cable and use the proper one

alright ty anyways i doubt i can do that since i use like 4 different routes

#

asio okadafork->reaper->peace->discord

fair glade May 28, 2025, 9:19 PM

#

hey so

#

my voice sound unrealistic

#

how can I make it more realistic

languid cliff May 28, 2025, 10:40 PM

#

try a different voice?

valid vine May 28, 2025, 10:46 PM

#

would anyone want to help me figure out why my AIs I made to play tag continue to be IDIOTS no matter what I try? I'm using pytorch and learned it with chatGPT so it probably led me astray somewhere but even after redoing the entire program 4 times I still feel kinda lost

#

I can't send the two models directly here bc no files

#

class TagStandardHide(neuro.Module):
    def __init__(self, Learning: bool = True, learnRate: float = 0.01):
        super(TagStandardHide, self).__init__()
        self.layer0 = neuro.Linear(4, 32)
        self.layer1 = neuro.Linear(32, 64)
        self.layer2 = neuro.Linear(64, 48)
        self.layer3 = neuro.Linear(48, 24)
        self.layer4 = neuro.Linear(24, 8)
        self.learning = Learning
        self.optimizer = torch.optim.Adam((self).parameters(), lr=learnRate)

    def forward(self, x):
        x = self.layer0(x)
        x = torch.relu(self.layer1(x))
        x = self.layer2(x)
        x = torch.relu(self.layer3(x))
        x = self.layer4(x)
        return x
    
    def updateModel(self, stateTensor: torch.FloatTensor, nextStateTensor: torch.FloatTensor, action, reward: float, gamma=0.99):
        optimizer = self.optimizer
        state = stateTensor.unsqueeze(0)
        next_state = nextStateTensor.unsqueeze(0)
        action = torch.tensor([action], dtype=torch.int64)
        reward = torch.tensor([reward], dtype=torch.float32)
        current_q_values = self(state).gather(1, action.unsqueeze(-1)).squeeze(-1)
        next_q_values = self(next_state).max(1)[0].detach()
        target_q_value = reward + gamma * next_q_values
        loss = torch.nn.functional.mse_loss(current_q_values, target_q_value)
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        return loss.item()
    
    def getState(self, location: tuple[int, int], seekerLocation: tuple[int, int], width, height) -> list[int | float]:
        locationNormalized = (location[0]/width, location[1]/height)
        seekerLocationNormalized = (seekerLocation[0]/width, seekerLocation[1]/height)
        gameState: list[int | float] = [locationNormalized[0], locationNormalized[1], seekerLocationNormalized[0], seekerLocationNormalized[1]]
        return gameState
    
    def getReward(self, gameState) -> float:
        #removed

        return reward```

#

I had to remove the reward function because characters but both of them look basically just like this

#

I've tried just an input and output, having 1 middle, having more middle layers with more and less neurons in each, none of that really seems to affect anything

simple ore May 28, 2025, 10:53 PM

#

your forward is funny

valid vine May 28, 2025, 10:54 PM

#

I've tried a bunch of random forwards none of them really seem to help

#

what specifically is weird about it though?

simple ore May 28, 2025, 10:54 PM

#

why activation only for 1 and 3?

valid vine May 28, 2025, 10:55 PM

#

simple ore why activation only for 1 and 3?

I haven't really thought about it

#

is it normal for them all to use relu? /some other activation function? I figured just having it use the layer would be fine

simple ore May 28, 2025, 10:59 PM

#

without an activation function there's no point in having separate layers as the math collapses them

#

there are different activation functions offering different activation probabilities

valid vine May 28, 2025, 11:01 PM

#

simple ore without an activation function there's no point in having separate layers as the...

but that seems weird, wouldn't having the other layers affect them still change it?

#

like if it was just one neuron each and the weights were right, from the first being say 0.35 the second layer could make it -0.12 which on the next relu would be 0

#

well that's a bad example I guess because if there was another relu it would make it 0 which would also make the last next on 0

simple ore May 28, 2025, 11:08 PM

#

anyway, I cant imagine what that code is supposed to do

#

your model has 4 inputs and 8 outputs

#

btw, use standard aliases. torch.nn -> nn

valid vine May 28, 2025, 11:15 PM

#

yeah I'll probably make it more readable whenever I do more

dreamy seal May 28, 2025, 11:15 PM

#

is there any aihub docs?

craggy bough May 28, 2025, 11:15 PM

#

dreamy seal is there any aihub docs?

https://docs.aihub.gg/

Home

Last update: May 5, 2025

dreamy seal May 28, 2025, 11:15 PM

#

craggy bough https://docs.aihub.gg/

👍

valid vine May 28, 2025, 11:15 PM

#

simple ore your model has 4 inputs and 8 outputs

the inputs are this players x, this player's y, seeker's x, and seeker's y, and the outputs correspond to the eight directions (the 4 cardinals and their combinations)

#

I did it like that so they can move more like a regular person can since you can press w and a at the same time, for example

valid vine May 29, 2025, 12:21 AM

#

simple ore your forward is funny

so the only thing that seems wrong is the forward function?

wispy perch May 29, 2025, 2:24 AM

#

it doesn't work, and whenever i talk it says this in cmd: 2025-05-29 05:22:15.7391809 [E:onnxruntime:, sequential_executor.cc:572 onnxruntime::ExecuteKernel] Non-zero status code returned while running Pad node. Name:'/rmvpe/mel_extractor/Pad' Status Message: CUDA error cudaErrorNoKernelImageForDevice:no kernel image is available for execution on the device

simple ore May 29, 2025, 2:26 AM

#

wispy perch it doesn't work, and whenever i talk it says this in cmd: 2025-05-29 05:22:15.73...

gpu?

wispy perch May 29, 2025, 2:27 AM

#

gtx 750

simple ore May 29, 2025, 2:28 AM

#

what application/version you're trying to use?

#

using a voice changer on anything below nvidia's <1000 series is almost impossible

wispy perch May 29, 2025, 2:30 AM

#

vcclient_win_cuda_2.0.78-beta

simple ore May 29, 2025, 2:30 AM

#

ancient

#

try https://github.com/deiteris/voice-changer/releases/download/b2332/voice-changer-windows-amd64-dml.zip

wispy perch May 29, 2025, 2:33 AM

#

ok thx

valid vine May 29, 2025, 2:34 AM

#

valid vine so the only thing that seems wrong is the forward function?

Noobies said that it basically just makes the extra layers useless but I still don't get it, why would it still not learn?

simple ore May 29, 2025, 2:52 AM

#

I'm not familiar with this kind of model training, so I have no other advice

#

other than in order to learn something complex the model has to have enough capacity and with your current forward function it is essentially just 3 layers

valid vine May 29, 2025, 3:19 AM

#

well it's basically two different models with one goal each

#

before I had it be just one model with a 5th paramater for if it was "it" or not and then changed the seeker location to be the nearest player but that wasn't working

#

so basically the hider model is just "go away from this point" and the seeker is "get as close as possible to this point"

#

and "this point" in each of them is just the 3rd parameter (x) and 4th parameter (y) and they're normalized to be 0-1

knotty moth May 29, 2025, 7:02 AM

#

valid vine ```python class TagStandardHide(neuro.Module): def __init__(self, Learning: ...

what is the clear purpose of the AI model? training to chase the player using reinforcement learning instead of obviously using the pathfinding algorithm?

simple ore May 29, 2025, 11:51 AM

#

knotty moth what is the clear purpose of the AI model? training to chase the player using re...

30 years ago it was "using internet", now it is 'using AI"

warm dragon May 29, 2025, 12:22 PM

#

-colab

patent trellisBOT May 29, 2025, 12:22 PM

#

warm dragon -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

languid cliff May 29, 2025, 1:48 PM

#

Does this mean i started overtraining around 40k steps? if im reading the charts right?

simple ore May 29, 2025, 2:08 PM

#

languid cliff Does this mean i started overtraining around 40k steps? if im reading the charts...

those are old loss charts

languid cliff May 29, 2025, 2:09 PM

#

simple ore those are old loss charts

Huh? im confused

simple ore May 29, 2025, 2:09 PM

#

no avg_50 loss?

languid cliff May 29, 2025, 2:10 PM

#

Pretty sure im looking at the tensorboard live

#

simple ore May 29, 2025, 2:19 PM

#

how big is the batch size?

languid cliff May 29, 2025, 2:19 PM

#

simple ore how big is the batch size?

16, dataset is 1 hour and 59 minutes

simple ore May 29, 2025, 2:19 PM

#

too much... way too much for both

#

try a model from ~30k steps

languid cliff May 29, 2025, 2:20 PM

#

like WAY overtrained right?

simple ore May 29, 2025, 2:20 PM

#

no, it is just way too much for finetuning

languid cliff May 29, 2025, 2:20 PM

#

nah im good

languid cliff May 29, 2025, 2:21 PM

#

simple ore no, it is just way too much for finetuning

wym "finetuning"? Is that a process you do after? or what do you mean by that

simple ore May 29, 2025, 2:21 PM

#

when you train a model on top a pretrain, it is technically a finetuning

languid cliff May 29, 2025, 2:23 PM

#

simple ore when you train a model on top a pretrain, it is technically a finetuning

ahh okay, gotcha. So its essentuially overtrained by the fact that im using a pretrain, but wouldnt be overtrained if i didnt use a pretrain

simple ore May 29, 2025, 2:27 PM

#

you have a big set, 4-12x size of the common size for voice models

#

so it keep using the same high learning rate longer

#

and with batch 16 is generalizes the model quite a lot

languid cliff May 29, 2025, 2:33 PM

#

simple ore and with batch 16 is generalizes the model quite a lot

oh okay gotcha. Would you personally use a batch size less than 16 for 2 hour dataset? or would going less make stuff worse

simple ore May 29, 2025, 2:33 PM

#

I use 12-16 for my 55h vctk set

languid cliff May 29, 2025, 2:34 PM

#

55h hour dataset?? damn

simple ore May 29, 2025, 2:34 PM

#

it is a pretrain

languid cliff May 29, 2025, 2:34 PM

#

yeah

simple ore May 29, 2025, 2:34 PM

#

so yeah, try just a 30-60min set with 8

#

pick the best content

languid cliff May 29, 2025, 2:41 PM

#

simple ore try a model from ~30k steps

The 28k steps definitely sounds the best to me. Everything after that has like a robotic tone to it, especially at the end of sentences

languid cliff May 29, 2025, 2:42 PM

#

simple ore so yeah, try just a 30-60min set with 8

hmm, i actually have a old version with 40 and 60 minutes data, i guess i can compare it to this one and see which one i like better

simple ore May 29, 2025, 2:51 PM

#

languid cliff hmm, i actually have a old version with 40 and 60 minutes data, i guess i can co...

you dont need to create a new dataset, just make a copy of filelist.txt

#

then cut it in half/quarter

languid cliff May 29, 2025, 2:52 PM

#

ooh okay

livid cosmos May 29, 2025, 2:58 PM

#

So I've installed RVC AI Cover Maker and after double clicking run.bat, I am getting this error

Traceback (most recent call last):
File "F:\RVC-AI-Cover-Maker-UI-1.0.5\programs\applio_code\rvc\lib\tools\prerequisites_download.py", line 3, in <module>
from tqdm import tqdm
ModuleNotFoundError: No module named 'tqdm'
Traceback (most recent call last):
File "F:\RVC-AI-Cover-Maker-UI-1.0.5\main.py", line 1, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'
An error occurred. Exiting...
Press any key to continue . . .

simple ore May 29, 2025, 2:59 PM

#

you did not install requirements?

livid cosmos May 29, 2025, 3:00 PM

#

I installed requirements

simple ore May 29, 2025, 3:00 PM

#

'no module named' says otherwise

#

use precompiled https://huggingface.co/Nick088/RVC-AI-Cover-Maker-UI-Precompiled/resolve/main/Windows/RVC-AI-Cover-Maker-UI-v1.0.5.zip

livid cosmos May 29, 2025, 3:09 PM

#

If you download the .zip from the release (here) make sure to rename the folder from "rvc-ai-cover-maker-ui-v1.0.5" to just "rvc-ai-cover-maker-ui" otherwise you may run into missing dependencies issues.

livid cosmos May 29, 2025, 3:09 PM

#

livid cosmos > If you download the .zip from the release (here) make sure to rename the folde...

Oh, that's (maybe) why it happened - my fault xD

gilded robin May 29, 2025, 3:53 PM

#

hey what's latest guide for making a voice model from scratch?

latent kettle May 29, 2025, 4:49 PM

#

gilded robin hey what's latest guide for making a voice model from scratch?

#1301535386572427364 message

valid vine May 29, 2025, 4:53 PM

#

knotty moth what is the clear purpose of the AI model? training to chase the player using re...

becuase "it automatically finds the best path based on a known algorithm" doesn't sound as cool as "it's an AI that learns how to play" tbh, but also because I want experience making AIs and if I just use a pathfinding algorithm I don't get that experience

gilded robin May 29, 2025, 5:48 PM

#

latent kettle https://discord.com/channels/1159260121998827560/1301535386572427364/13015353865...

ty alot

#

and can you train a pre-existing model on more emotion? or do you just have to make it from scratch

topaz slate May 29, 2025, 8:38 PM

#

hey is there anyone that can help me with the w-okada settings? i cant set it.

paper bloom May 29, 2025, 9:28 PM

#

hey idk if thats the right channel to ask questions in

#

but i have a question^^

#

are there good male and female voice that also sound like a real ^^

#

the one i have is good but some ppl recognize it but its very old idk if that makes a diffrent or are there like new once nowadays that are better?

silk sage May 29, 2025, 9:35 PM

#

How can i make my friend voice to an RVC model Zip for ai

valid vine May 29, 2025, 10:18 PM

#

valid vine becuase "it automatically finds the best path based on a known algorithm" doesn'...

oh I've also been working on and off so long I forgot, there was more complex movement options before too, I just removed it because I thought too many options were the problem

slim schooner May 29, 2025, 10:31 PM

#

hey guys its been a minute. just need some advice here.

if im going to train a voice model, should i use 32k, 40k, or 48k sample rate?

#

does higher sample rate require more training time?

simple ore May 29, 2025, 11:36 PM

#

slim schooner does higher sample rate require more training time?

48k is slower than 32/40k, whether to use it depends on your dataset

slim schooner May 29, 2025, 11:54 PM

#

simple ore 48k is slower than 32/40k, whether to use it depends on your dataset

gotcha, might go with 40k. thanks man.

knotty moth May 30, 2025, 12:51 AM

#

slim schooner gotcha, might go with 40k. thanks man.

remember:

#

https://cdn.discordapp.com/attachments/1159290139609137264/1344111265253032017/image.png?ex=6839ab6a&is=683859ea&hm=4477d92cccb1105be2f714364c303c6d9dbba6072a357fb4a2ba8bead93910c9&

knotty moth May 30, 2025, 1:00 AM

#

valid vine becuase "it automatically finds the best path based on a known algorithm" doesn'...

imo Jump King speedrunning is the one you should try exploring on
https://www.youtube.com/watch?v=e-iOd42mF4g

YouTube

Koyori ch. 博衣こより - holoX -

【JumpKing】検証：12時間耐久したらどんどんうまく...

※最後Youtubeの仕様で失われてしまったアーカイブ5分間はこちらで補完しました→ https://youtu.be/kyPb3-8bLMY

デビュー前からやりたいと思っていた「JumpKing耐久」！！！！！！
年末の日曜日！！満を持してやっちゃうぞ～！！！！！！！！！
12時間以内にクリア...

▶ Play video

valid vine May 30, 2025, 1:01 AM

#

what

#

are you saying I should start by trying to make an AI speedrun jumpking?

hallow thistle May 30, 2025, 1:04 AM

#

topaz slate hey is there anyone that can help me with the w-okada settings? i cant set it.

Which W-Okada version are you using? And what is your PC GPU?

hallow thistle May 30, 2025, 1:05 AM

#

paper bloom are there good male and female voice that also sound like a real ^^

#

There are no known realistic male and female voice models in #1175430844685484042

hallow thistle May 30, 2025, 1:06 AM

#

gilded robin hey what's latest guide for making a voice model from scratch?

-rvc

patent trellisBOT May 30, 2025, 1:06 AM

#

hallow thistle -rvc

📚 RVC Documentations

AI HUB Docs

https://docs.aihub.gg

🍏 Applio Docs

https://docs.applio.org

patent trellisBOT May 30, 2025, 1:06 AM

#

hallow thistle -rvc

📚 RVC Documentations

AI HUB Docs

https://docs.aihub.gg

🍏 Applio Docs

https://docs.applio.org

valid vine May 30, 2025, 1:06 AM

#

there are no known male and female voice models

slim schooner May 30, 2025, 1:06 AM

#

worked just fine for a first run but tried to run another training session and got this, anyone know whats the issue? nothing changed

knotty moth May 30, 2025, 1:06 AM

#

valid vine are you saying I should start by trying to make an AI speedrun jumpking?

you can make a simple platformer game like that
the character movement/jumping system can be simple as og mario

valid vine May 30, 2025, 1:06 AM

#

knotty moth you can make a simple platformer game like that the character movement/jumping s...

I feel like moving in 8 directions is even simpler though, is it not?

hallow thistle May 30, 2025, 1:07 AM

#

patent trellis

Did the bot just duplicate message?

valid vine May 30, 2025, 1:07 AM

#

(8 being the 4 cardinals + the diagonals)

knotty moth May 30, 2025, 1:08 AM

#

valid vine I feel like moving in 8 directions is even simpler though, is it not?

I was referring to the kind of platformer game, which is bidirectional plus vertical jumping

valid vine May 30, 2025, 1:09 AM

#

but also about an hour or two ago I made an even simpler test program that was just a single AI trying to get to a number that you could change and it DID learn how to find it pretty fast

valid vine May 30, 2025, 1:09 AM

#

knotty moth I was referring to the kind of platformer game, which is bidirectional plus vert...

yeah, but what I was already doing was literally just move up, left, right, down, upleft, upright, downleft, and downright

#

jumping seems like it'd be harder for the AI to understand

knotty moth May 30, 2025, 1:12 AM

#

valid vine jumping seems like it'd be harder for the AI to understand

the jumping trajectory can be traditionally calculated, but decision making on the paths and timing can be things for the AI to consider

late flicker May 30, 2025, 4:22 AM

#

Is this a final message of Mangio RVC local?
['extract_f0_print.py', 'C:\Users\Mike\Desktop\Mangio-RVC-v23.7.0_INFER_TRAIN\Mangio-RVC-v23.7.0/logs/test1', '22', 'rmvpe', '64']
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
['extract_feature_print.py', 'cuda:0', '1', '0', '0', 'C:\Users\Mike\Desktop\Mangio-RVC-v23.7.0_INFER_TRAIN\Mangio-RVC-v23.7.0/logs/test1', 'v2']
C:\Users\Mike\Desktop\Mangio-RVC-v23.7.0_INFER_TRAIN\Mangio-RVC-v23.7.0/logs/test1
load model(s) from hubert_base.pt
move model to cuda
no-feature-todo

hallow thistle May 30, 2025, 4:23 AM

#

late flicker Is this a final message of Mangio RVC local? ['extract_f0_print.py', 'C:\\Users...

Mangio RVC is old and no longer updated. There's Applio RVC.

late flicker May 30, 2025, 4:24 AM

#

I know about Applio, but sometimes, it didn't worked for me :(((

hallow thistle May 30, 2025, 4:25 AM

#

cat_seriously

late flicker May 30, 2025, 4:26 AM

#

Ik

#

I will reinstall it

hallow thistle May 30, 2025, 4:26 AM

#

Well, judging by your Mangio RVC folder path, you should never install any program directly on desktop. The path should be something like C:\Applio or D:\Applio if you use Applio.

late flicker May 30, 2025, 4:27 AM

#

Okay

#

Good point

hallow thistle May 30, 2025, 4:28 AM

#

Program shortcuts belong to desktop, not full programs within folders.

late flicker May 30, 2025, 4:29 AM

#

Making sence

#

-_-

#

This is the first time, it worked cat_blush

gusty sierra May 30, 2025, 9:02 AM

#

is this very bad? :D

simple ore May 30, 2025, 9:20 AM

#

gusty sierra is this very bad? :D

depends on what chart it is.. you left the name and Y axis numbers out

gusty sierra May 30, 2025, 9:21 AM

#

g/total

simple ore May 30, 2025, 9:28 AM

#

ouch.. is it like 1 minute set?

latent kettle May 30, 2025, 10:56 AM

#

@viscid moss sorry to bother you but I want to know which models are Best in UVR 5 UI to prepare a dataset from scratch. Like which is good for vocal and instrument separation, de eco, de reveb, de noise. Remove baking vocals etc..

viscid moss May 30, 2025, 12:29 PM

#

latent kettle <@274566299349155851> sorry to bother you but I want to know which models are B...

Sure, check this docs about it:
https://docs.aihub.gg/rvc/resources/dataset-isolation/#the-best-models-for-uvr-are

Dataset & Isolation

Last update: May 5, 2025

#

This was made by our QCs for RVC model creation

#

And here's the best models according to Music Separation server guys

latent kettle May 30, 2025, 12:31 PM

#

One question more, does these models works in UVR 5 GUI ? The little windowed exe ? Or these can only be used in UVR 5 UI. the browser one

viscid moss May 30, 2025, 12:31 PM

#

https://github.com/Eddycrack864/UVR5-UI/blob/main/info%2Fdocs.md

GitHub

UVR5-UI/info/docs.md at main · Eddycrack864/UVR5-UI

Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models - Eddycrack864/UVR5-UI

viscid moss May 30, 2025, 12:32 PM

#

latent kettle One question more, does these models works in UVR 5 GUI ? The little windowed ex...

Ye it can be used there, but u need to install the latest beta version + import every model manually

#

UVR5 UI does automatically

latent kettle May 30, 2025, 12:34 PM

#

I see. Thank you a lot anime_giveheart

viscid moss May 30, 2025, 12:48 PM

#

Ur welcome

river talon May 30, 2025, 1:43 PM

#

Hello everyone, does anyone know where I could find some data for a chat bot ? i'm trying to make one using pytorch and don't really know where to start, if anyone can give me a lead to start I would be grateful

slim schooner May 30, 2025, 2:48 PM

#

are these outputs good? It's for a 50min audio.

astral pine May 30, 2025, 3:05 PM

#

Traceback (most recent call last):
File "client.py", line 22, in <module>
File "asyncio\runners.py", line 194, in run
File "asyncio\runners.py", line 118, in run
File "asyncio\base_events.py", line 687, in run_until_complete
File "main.py", line 140, in main
File "main.py", line 81, in runServer
File "uvicorn\server.py", line 69, in serve
File "uvicorn\server.py", line 76, in serve
File "uvicorn\config.py", line 434, in load
File "uvicorn\importer.py", line 19, in import_from_string
File "importlib_init.py", line 90, in import_module
File "<frozen importlib._bootstrap>", line 1387, in _gcd_import
File "<frozen importlib._bootstrap>", line 1360, in _find_and_load
File "<frozen importlib._bootstrap>", line 1331, in _find_and_load_unlocked
File "<frozen importlib._bootstrap>", line 935, in load_unlocked
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "app.py", line 17, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "voice_changer\VoiceChangerManager.py", line 26, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "voice_changer\RVC\RVCr2.py", line 9, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "voice_changer\embedder\EmbedderManager.py", line 3, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "voice_changer\embedder\OnnxContentvec.py", line 2, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "voice_changer\common\OnnxLoader.py", line 1, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 384, in exec_module
File "onnx_init.py", line 77, in <module>
ImportError: DLL load failed while importing onnx_cpp2py_export: A dynamic link library (DLL) initialization routine failed.

Press Enter to continue...

#

help

simple ore May 30, 2025, 3:09 PM

#

river talon Hello everyone, does anyone know where I could find some data for a chat bot ? i...

LM studio, llama gguf model, python is only needed to exchange prompts and responses via API

analog obsidian May 30, 2025, 3:13 PM

#

slim schooner are these outputs good? It's for a 50min audio.

use tensorboard to check your model graphs
tho none of the graphs helps in choosing the "best" epoch
you use the graphs to see if your model is doing well in the training process

the "lowest value" in the cmd is one of the many random things applio has yt_nails

simple ore May 30, 2025, 3:29 PM

#

it is supposed to be agv_gen

slim schooner May 30, 2025, 3:34 PM

#

analog obsidian use tensorboard to check your model graphs tho none of the graphs helps in choos...

so lowest_value doesn't help in determining if the training is going well? alright, i'll download tensorboard.

analog obsidian May 30, 2025, 3:35 PM

#

slim schooner so lowest_value doesn't help in determining if the training is going well? alrig...

no need to download anything
just use "run-tensorboard"

that loss is the avg gen like noobies said, but that alone wont help in spotting irregularities in the training

brittle wing May 30, 2025, 3:36 PM

#

Sorry if this is a stupid question but is there a way to make an ai cover with any voice model? If so how?

slim schooner May 30, 2025, 3:37 PM

#

analog obsidian no need to download anything just use "run-tensorboard" that loss is the avg ge...

In Applio?

analog obsidian May 30, 2025, 3:38 PM

#

slim schooner In Applio?

yup

slim schooner May 30, 2025, 3:42 PM

#

analog obsidian yup

ahh i see, i'll rerun training and check the graphs, what should i be looking for? when do i know training is enough?

analog obsidian May 30, 2025, 3:45 PM

#

slim schooner ahh i see, i'll rerun training and check the graphs, what should i be looking fo...

check if the discriminator (d/total) is not weak or too strong

if its too weak it will have very high values (above 4.0) and always going up

if its too strong it will have very low values like 3.5 and always going down

#

id recommend trying to train 100 epochs and save every 10

slim schooner May 30, 2025, 3:46 PM

#

#

this is from my previous training

analog obsidian May 30, 2025, 3:46 PM

#

analog obsidian id recommend trying to train 100 epochs and save every 10

if e100 sounds great continue training up to 200 (u could also set max training epochs too 200)

analog obsidian May 30, 2025, 3:46 PM

#

slim schooner this is from my previous training

avg50 d/total

slim schooner May 30, 2025, 3:47 PM

#

this one?

analog obsidian May 30, 2025, 3:48 PM

#

yes but go to the scalars tab

#

ignore the grey graphs

slim schooner May 30, 2025, 3:50 PM

#

you mean this one?

analog obsidian May 30, 2025, 3:50 PM

#

scalars

#

read above

#

theres a big scalars name in the ui

slim schooner May 30, 2025, 3:50 PM

#

ooohh mb lmao

#

this is what im looking for?

analog obsidian May 30, 2025, 3:56 PM

#

slim schooner this is what im looking for?

yea

slim schooner May 30, 2025, 3:57 PM

#

so thats bad, too weak right? sorry im not an expert. do i change anything?

analog obsidian May 30, 2025, 3:58 PM

#

train more and see if it goes down to 4.1-4.0

slim schooner May 30, 2025, 3:59 PM

#

alright, i'll rerun the training and see if it improves

analog obsidian May 30, 2025, 3:59 PM

#

but always hear your model, since loss graphs most of the time go down even when the model already started overtraining

slim schooner May 30, 2025, 4:01 PM

#

thanks, i'll do that 👍

analog obsidian May 30, 2025, 4:01 PM

#

hearing it every 10 epochs is fine

#

if ur model begin to overtrain you'll notice every epoch past a certain step amount sounds robotic

slim schooner May 30, 2025, 4:03 PM

#

i would hear those in the "audio" tab right?

#

theyre like 2 sec segments

analog obsidian May 30, 2025, 4:03 PM

#

slim schooner i would hear those in the "audio" tab right?

nop unless you use a custom reference (this is only possible in the f0_spin branch)

#

so inference some expressive audio and hear how it sounds

slim schooner May 30, 2025, 4:07 PM

#

gotcha, do i need to move the saved models into the inference folder? it doesnt read them where they are normally stored

analog obsidian May 30, 2025, 4:07 PM

#

slim schooner gotcha, do i need to move the saved models into the inference folder? it doesnt ...

uhm weird, applio should be able to locate your pth files inside the logs folder

slim schooner May 30, 2025, 4:09 PM

#

analog obsidian uhm weird, applio should be able to locate your pth files inside the logs folder

it doesnt, unless i need to add the path to them?

languid cliff May 30, 2025, 4:10 PM

#

Is finetuning a voice with bigger pretrain dataset generally gonna take longer time than one with a smaller one? Assuming same batch and epochs?

analog obsidian May 30, 2025, 4:10 PM

#

languid cliff Is finetuning a voice with bigger pretrain dataset generally gonna take longer t...

no

languid cliff May 30, 2025, 4:12 PM

#

analog obsidian no

Oh ok, maybe i did something wrong then, or it takes some time to ramp up. When i was training with OG dataset i did like 5it/s. And now with the KLM i do like 2it/s. But i only looked at the first few epochs before i left

analog obsidian May 30, 2025, 4:13 PM

#

languid cliff Oh ok, maybe i did something wrong then, or it takes some time to ramp up. When ...

the first epoch will always take longer

languid cliff May 30, 2025, 4:13 PM

#

Yeah makes sense then

simple ore May 30, 2025, 4:34 PM

#

languid cliff Yeah makes sense then

2it/s is slower than 5it/s

unborn canopy May 30, 2025, 4:36 PM

#

hello everyone can someone help me install the voicechanger with phython i dont know what to do!?

simple ore May 30, 2025, 4:53 PM

#

unborn canopy hello everyone can someone help me install the voicechanger with phython i dont ...

unless you're trying to run some ancient version of the voice changer, you dont need to use python

#

download the compiled version for your gpu, unzip, run

#

https://rentry.co/forkvoicechangerguide

unborn canopy May 30, 2025, 4:55 PM

#

@simple ore thanks

languid cliff May 30, 2025, 4:57 PM

#

simple ore 2it/s is slower than 5it/s

Yeah, just got home, and its at a solid 4 now, so its all good it seems like 😄

simple ore May 30, 2025, 4:58 PM

#

the speed may go down if you run a game or something else that uses GPU and pushes the memory use into shared territory

jaunty shale May 30, 2025, 5:29 PM

#

just as soon as the model was done.

#

do I have to wait? (I use kaggle mainline)

lucid creek May 30, 2025, 5:46 PM

#

jaunty shale just as soon as the model was done.

delet old account and make new one with same email its work to me in engok

simple ore May 30, 2025, 5:52 PM

#

or just download the baked model.

idle bramble May 30, 2025, 6:10 PM

#

does converting to onnx affect the quality of the model?
also is rmvpe better than rmvpe onnx on nvidia? is it just better overall but more expensive to run?

safe echo May 30, 2025, 8:31 PM

#

hey guys, any idea how to resolve these two? i reinstalled RCV but my perf is now 300, but it was 30~ before. (using F0 fcpe)

also, with my usual voice, when i use a world with PU in it, it cuts it out like when i say anything that "pops" any idea? thank you Prayge

analog obsidian May 30, 2025, 8:36 PM

#

idle bramble does converting to onnx affect the quality of the model? also is rmvpe better th...

onnx quality is worse, the reason it's there is because back then amd gpus were unable to run .pth files

analog obsidian May 30, 2025, 8:38 PM

#

safe echo hey guys, any idea how to resolve these two? i reinstalled RCV but my perf is no...

rvc does not mean realtime voice changer
try increasing the chunk, maybe your gpu is too stressed

random hound May 30, 2025, 10:29 PM

#

how u create ur own voice like #1175430844685484042

cosmic frigate May 30, 2025, 11:46 PM

#

Why does my voice changer bugs when I play Roblox while doing voice chat on discord?

#

Like they can’t even hear me

crimson oyster May 31, 2025, 12:41 AM

#

uhhh is there a place where i can ask someone to make a ai model?

vast relic May 31, 2025, 1:44 AM

#

are people still using this version

peak path May 31, 2025, 1:56 AM

#

hey
should i use crying and laughing in my dataset files (.wav)?

simple ore May 31, 2025, 1:59 AM

#

vast relic are people still using this version

for 5070ti you should be using deitritis fork for 5000 series

peak path May 31, 2025, 2:06 AM

#

peak path hey should i use crying and laughing in my dataset files (`.wav`)?

@tight ether
sorry for the ping

summer cliff May 31, 2025, 3:25 AM

#

how do i get less ping while using the voice changer?

foggy belfry May 31, 2025, 7:33 AM

#

Why is this happeing with Applio? -I can't send picture

late flicker May 31, 2025, 7:41 AM

#

foggy belfry Why is this happeing with Applio? -I can't send picture

Same

proud valley May 31, 2025, 10:01 AM

#

I'm developing rvc moodel with colab

#

ummmm............

#

Should I use the paid version of CoLab to create an rvc model?

tight ether May 31, 2025, 10:08 AM

#

peak path <@245213101153189890> sorry for the ping

It might not be great for the gradients, but give it a try if it's really necessary.

proud valley May 31, 2025, 10:13 AM

#

ummm....

#

Is there anything other than the extra section?

simple ore May 31, 2025, 10:22 AM

#

proud valley Should I use the paid version of CoLab to create an rvc model?

depending on your dataset size you can get away by using the free GPU time, althoug the more you use it, the less you get next day

proud valley May 31, 2025, 10:26 AM

#

ok

#

Thank u

royal marsh May 31, 2025, 10:42 AM

#

Hello i have question but i cant put photos here

jaunty trellis May 31, 2025, 10:48 AM

#

Hi, I have a problem, idk why the app doesn't detect my microphone when I use an RVC model but when I use the "Beatrice jvs corpus" it does cat_deaed

pastel oak May 31, 2025, 11:28 AM

#

jaunty trellis Hi, I have a problem, idk why the app doesn't detect my microphone when I use an...

Delete what you have, whats your gpu

#

!give-media-perms @royal marsh 5h

#

Nvm use #1192011222023950368

pastel oak May 31, 2025, 11:31 AM

#

summer cliff how do i get less ping while using the voice changer?

Whats your gpu and voice changer version you downloaded

pastel oak May 31, 2025, 11:32 AM

#

vast relic are people still using this version

Yes

pastel oak May 31, 2025, 11:32 AM

#

cosmic frigate Why does my voice changer bugs when I play Roblox while doing voice chat on disc...

Shit gpu can cause that, whats ur gpu and version of voice changer you use

jaunty trellis May 31, 2025, 11:33 AM

#

pastel oak Delete what you have, whats your gpu

rx6600

pastel oak May 31, 2025, 11:34 AM

#

jaunty trellis rx6600

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Download virtual cable, amd on windows, read rest of the guide for setup

Deiteris' W Okada Fork

Last update: May 5, 2025

waxen root May 31, 2025, 12:50 PM

#

Where do I get voice models which are working for the German language too cannot find any

indigo jacinth May 31, 2025, 1:04 PM

#

hi i followed the guide but vc not working

cosmic frigate May 31, 2025, 1:37 PM

#

pastel oak Shit gpu can cause that, whats ur gpu and version of voice changer you use

Nvdia 3050 series

still phoenix May 31, 2025, 1:44 PM

#

did u figure out what was that?

pastel oak May 31, 2025, 2:10 PM

#

cosmic frigate Nvdia 3050 series

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

Download virtual cable, first nvidia on windows, read rest of the guide for setup

Deiteris' W Okada Fork

Last update: May 5, 2025

cosmic frigate May 31, 2025, 2:10 PM

#

I am using this one bro

pastel oak May 31, 2025, 2:11 PM

#

Send screenshot of voice changer

cosmic frigate May 31, 2025, 2:11 PM

#

Still bugs when I play Roblox

cosmic frigate May 31, 2025, 2:11 PM

#

pastel oak Send screenshot of voice changer

Okay

cosmic frigate May 31, 2025, 2:19 PM

#

pastel oak Send screenshot of voice changer

it doesnt let me send pictures here

pastel oak May 31, 2025, 2:28 PM

#

cosmic frigate it doesnt let me send pictures here

Make in #1192011222023950368

cosmic frigate May 31, 2025, 2:33 PM

#

pastel oak Make in <#1192011222023950368>

done

unborn canopy May 31, 2025, 2:35 PM

#

hello, can someone go in a voicall with me and tell me how to install the voicechanger because i am to stupid to make it myself even with instructions

ashen solstice May 31, 2025, 2:45 PM

#

My g/total is horizontal, with some down spikes

#

What could be the problem? 110e so far

languid cliff May 31, 2025, 3:40 PM

#

i aint no expert, prob means you are starting to overtrain

#

https://docs.applio.org/applio/getting-started/tensorboard

hallow thistle May 31, 2025, 3:42 PM

#

unborn canopy hello, can someone go in a voicall with me and tell me how to install the voicec...

!howtoask

patent trellisBOT May 31, 2025, 3:42 PM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

craggy saffron May 31, 2025, 4:59 PM

#

hi when i use a file as a input the output sounds good but when i use my own voice it sounds bad how can i fix this? or is this a microphone issue?
not about the pitch

#

i tried looking up if its about the microphone but the web says u dont need a better microphone

viral mason May 31, 2025, 5:03 PM

#

proud valley Should I use the paid version of CoLab to create an rvc model?

don't pay for ai :(

river cairn May 31, 2025, 6:03 PM

#

Is it possible to use an ASIO other than FlexASIO with Deiteris VCClient? I've tried the ASIO driver supplied by my Focusrite Scarlett 4i4 audio interface, as well as a virtual ASIO provided by VB-Audio Matrix, and both experience crackling and dropouts during realtime voice conversion. Buffer size 256, sample rate 48000 (as recommended by this guide: https://rentry.co/lessdelayasio)

marble vigil May 31, 2025, 6:57 PM

#

is the illaria rvc vocal isolation tool also not working to anyone else?

silent condor May 31, 2025, 7:26 PM

#

hey, im haveing fun makeing little ai covers on weights but the ai cover voice is kinda quiet, is there a way to fix it.

polar peak May 31, 2025, 7:46 PM

#

Whats the exact google collab with old gradio UI?

summer cliff May 31, 2025, 8:24 PM

#

pastel oak Whats your gpu and voice changer version you downloaded

i got a geforce rtx 4060 and i downloaded one and it brung me to the web version

pastel oak May 31, 2025, 8:29 PM

#

summer cliff i got a geforce rtx 4060 and i downloaded one and it brung me to the web version

web version is correct, it still runs local just displays everything on a webui. idk what you mean with ping theres not supposed to be any like that, if you mean delay then theres some stuff you can do:

use server mode with "windows wasapi" as a prefix on everything
lower chunk but never go below the "perf" number that you get on the graph in color

peak path May 31, 2025, 8:56 PM

#

i have a problem with https://github.com/blaisewf/rvc-cli
note that i use Google Colab
if i want to use resume option, i've got this error on Training session.

Autobackup Enabled

Starting backup loop...

/usr/local/lib/python3.10/dist-packages/librosa/util/files.py:10: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
  from pkg_resources import resource_filename

Backup Complete: 860 new, 0 updated, 0 deleted.
Backup Complete: 0 new, 1 updated, 0 deleted.

Files are up to date.

/usr/local/lib/python3.10/dist-packages/librosa/util/files.py:10: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
  from pkg_resources import resource_filename

/usr/local/lib/python3.10/dist-packages/torch/utils/data/dataloader.py:558: UserWarning: This DataLoader will create 4 worker processes in total. Our suggested max number of worker in current system is 2, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary.
  warnings.warn(_create_warning_msg(

Checking saved weights...
Using HiFi-GAN vocoder
Starting training...

Loaded checkpoint '/content/Applio/logs/voos/D_2500.pth' (epoch 100)
Loaded checkpoint '/content/Applio/logs/voos/G_2500.pth' (epoch 100)

#

/usr/local/lib/python3.10/dist-packages/torch/utils/data/dataloader.py:558: UserWarning: This DataLoader will create 4 worker processes in total. Our suggested max number of worker in current system is 2, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary.

  warnings.warn(_create_warning_msg(
/usr/local/lib/python3.10/dist-packages/librosa/util/files.py:10: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
  from pkg_resources import resource_filename

terminate called without an active exception

Backup Complete: 1 new, 0 updated, 0 deleted.
Files are up to date.

simple ore May 31, 2025, 9:16 PM

#

@peak path you dont need to use rvc-cli, if you gonna train anything, just use noUI colab.

#

dataloader warning is strange, have not seen it before

#

perhaps you failed to restore the dataset?

peak path May 31, 2025, 9:21 PM

#

simple ore perhaps you failed to restore the dataset?

restore the dataset?
i didn't load my dataset again.
because they are loaded into my Google Drive already.
should i load them again? in the resume session?

simple ore May 31, 2025, 9:21 PM

#

colab does not access your google drive directly, you need to move the dataset from the backup to the colab node.

peak path May 31, 2025, 9:22 PM

#

yes, i did it in resume tab.

#

there are 2 sections.

load data
set values

simple ore May 31, 2025, 9:23 PM

#

okay, so in the filesystem browser (folder icon) there should be your folder '/content/Applio/logs/voos' with a bunch of stuff inside.. f0, f0_voiced, extracted, sliced_audio folders, etc

peak path May 31, 2025, 9:24 PM

#

yes yes
they are there

#

/content/Applio/logs/voos

#

simple ore May 31, 2025, 9:25 PM

#

thats the backup

#

i'm talking about colab side

peak path May 31, 2025, 9:25 PM

#

yeah i know
i can see them in colab

#

let me do it again

simple ore May 31, 2025, 9:25 PM

#

peak path May 31, 2025, 9:25 PM

#

true

simple ore May 31, 2025, 9:26 PM

#

okay, so you should be able to select a different max epoch (> 100 you have saved), and it should resume the process

peak path May 31, 2025, 9:27 PM

#

simple ore <@1155563131104395388> you dont need to use rvc-cli, if you gonna train anything...

oh, i set that to 100 again
should i set that to 200 in the resume session?

#

my first session was 100

simple ore May 31, 2025, 9:28 PM

#

obviously, otherwise you are at 100 as trained before

peak path May 31, 2025, 9:28 PM

#

oh my lord
let me do it

peak path May 31, 2025, 9:29 PM

#

simple ore <@1155563131104395388> you dont need to use rvc-cli, if you gonna train anything...

you said i don't need to use https://github.com/blaisewf/rvc-cli
is there a better option?

#

i don't have the link

simple ore May 31, 2025, 9:29 PM

#

the regular noUI colab https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio_NoUI.ipynb

Google Colab

peak path May 31, 2025, 9:30 PM

#

ok, thank you so much
i thought that you have another option.
i used it before

simple ore May 31, 2025, 9:31 PM

#

RVC-CLI is a command line interface for Applio, but it is kinda redundant

peak path May 31, 2025, 9:40 PM

#

simple ore obviously, otherwise you are at 100 as trained before

i'm stupid
i set that to 150 in the resume session
works fine
tnx

wide perch May 31, 2025, 9:59 PM

#

Using RVC nvidia on Github.

As soon as I begin audio conversion, the entire process freezes and the command prompt is empty
Other people I talked to had this same issue
Anyone know how to fix it?

#

Fixed it, but the voice changer isn't working

#

Just getting "Audio Block Passed"

simple ore May 31, 2025, 10:05 PM

#

wide perch Fixed it, but the voice changer isn't working

trying to use an acient voice changer 2) trying to use to different device types (WDM mic vs MME line out)

wide perch May 31, 2025, 10:05 PM

#

simple ore 1) trying to use an acient voice changer 2) trying to use to different device ty...

wdym ancient voice changer

#

Also yes, I have both input and output on MME

#

RVC worked for me before, now its just spamming audio block passed and the voice changer isnt working at all

simple ore May 31, 2025, 10:06 PM

#

link your "RVC nvidia on Github"

wide perch May 31, 2025, 10:07 PM

#

simple ore link your "RVC nvidia on Github"

https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z

simple ore May 31, 2025, 10:09 PM

#

so yeah, ancient

wide perch May 31, 2025, 10:09 PM

#

simple ore so yeah, ancient

I don't understand what you mean by ancient.

simple ore May 31, 2025, 10:10 PM

#

in AI terms, project that have not been updated for 6+ month are hopelessly outdate, your's is like 2 years old

#

here's up to date one https://rentry.co/forkvoicechangerguide#download-for-nvidia-gpu-on-windows

wide perch May 31, 2025, 10:14 PM

#

Alr thanks

elfin dome May 31, 2025, 10:14 PM

#

guys , i need this files

fierce pivot May 31, 2025, 10:14 PM

#

Is there no longer a place to request someone create a model for you? Apparently I suck at it 🙂 and need someone to do it

wide perch May 31, 2025, 10:17 PM

#

simple ore here's up to date one https://rentry.co/forkvoicechangerguide#download-for-nvidi...

I set up everything but how I still can't hear myself when I click start

#

I set up my input and output correctly

simple ore May 31, 2025, 10:18 PM

#

elfin dome guys , i need this files

those component are usually get downloaded automatically

elfin dome May 31, 2025, 10:22 PM

#

simple ore those component are usually get downloaded automatically

oh , what now !

simple ore May 31, 2025, 10:23 PM

#

unless you're using some outdated app that point at non-existent repository

elfin dome May 31, 2025, 10:23 PM

#

hmm

polar sage May 31, 2025, 10:48 PM

#

Hello, what Colab are people using actually?

#

!colab

patent trellisBOT May 31, 2025, 10:49 PM

#

polar sage !colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

river cairn May 31, 2025, 11:10 PM

#

river cairn Is it possible to use an ASIO other than FlexASIO with Deiteris VCClient? I've t...

Just wanted to bump this. Is anyone able to answer?

winter burrow May 31, 2025, 11:30 PM

#

I haven’t used w okoda for a while, is it still the best realtime voice changer?

analog shale May 31, 2025, 11:33 PM

#

Hey, I'm trying to find a small language model that learns with prompts. Any suggestions?

polar sage May 31, 2025, 11:43 PM

#

Hello, what Colab are people using actually?

slim schooner Jun 1, 2025, 12:37 AM

#

what was the virtual audio cable that you need again for wokada?

#

I think it's called VIC or something, not sure

river cairn Jun 1, 2025, 12:41 AM

#

slim schooner what was the virtual audio cable that you need again for wokada?

Virtual Audio Cable (VAC)

#

By Muzychenko

slim schooner Jun 1, 2025, 12:42 AM

#

thanks bro 👍

outer wasp Jun 1, 2025, 2:22 AM

#

May I ask?

What if I want to create a speech model from scratch on applio or any speech model (meaning without download any pre-existing other model data)?
Is applio is right way to create a voice model?
How much voice recording data does it take to create a voice model?
Thank u for reading

simple ore Jun 1, 2025, 2:32 AM

#

outer wasp May I ask? 1. What if I want to create a speech model from scratch on applio or ...

foolish idea, the model wont be generalized enough to be able to infer things it has not seen during training
it is one way, but there are other ways.
for training a model on top of a pretrain, 10-60 minutes. More is not needed.

silent condor Jun 1, 2025, 2:39 AM

#

on weights is there a way to make the ai voice loudrr, its kinda quet on covers that i do

outer wasp Jun 1, 2025, 2:40 AM

#

simple ore 1) foolish idea, the model wont be generalized enough to be able to infer things...

Thank u! And i have some more question

How to know it a Quality model to use from sample? (No robotic voice or sth?)
If i have record my voice to train it, there are anything need to note?
What is the best way to create voice model u use?

simple ore Jun 1, 2025, 3:20 AM

#

outer wasp Thank u! And i have some more question 1. How to know it a Quality model to use ...

check tensorboard charts, check spectrogram of tests audios, listen, pick the best model you've trained
use a good mic, quiet room without echos, same loudness, same distance form the mic, clean up the recording
ask in model maker chat

crude flame Jun 1, 2025, 3:23 AM

#

to get model maker chat you need to apply for model maker first: https://discord.com/channels/1159260121998827560/1305524365810470963

median monolith Jun 1, 2025, 3:52 AM

#

I have a question about the Weights voice model creation feature, could I maybe do it here?

also, wasnt this channel named something with "help"?, its been a while since I asked for a question in the server, and I swear this channel was named something like this. just wanted to know.

latent kettle Jun 1, 2025, 4:09 AM

#

median monolith I have a question about the Weights voice model creation feature, could I maybe ...

There are a lot of changes in this server. Yes it was named as Help-RVC but now it's renamed.

#

You can ask questions about Realtime voice changer or any other help like RVC or something

devout tulip Jun 1, 2025, 4:11 AM

#

Which website to use to make AI cover?

soft tiger Jun 1, 2025, 4:35 AM

#

So where can i do google veo 3 stuff for free?

latent kettle Jun 1, 2025, 7:58 AM

#

devout tulip Which website to use to make AI cover?

If you have a good gpu do it locally. If not use huggingface spaces or colab or maybe kaggle

viral ruin Jun 1, 2025, 8:46 AM

#

Is there any working colab to train RVC ?

silent condor Jun 1, 2025, 8:51 AM

#

please somone le me know if theres a way to make the ai cover song thing on weights any louder, the voice isnt very loud

ancient portal Jun 1, 2025, 9:57 AM

#

hello, I need newest version of AICoverGen

edgy minnow Jun 1, 2025, 12:56 PM

#

what does this mean?

#

okada w

#

@tacit rampart #🧬│ai-chat message

#

How did you fix this

simple ore Jun 1, 2025, 1:10 PM

#

edgy minnow okada w

you're probably trying to run a second instance of the program

edgy minnow Jun 1, 2025, 1:11 PM

#

simple ore you're probably trying to run a second instance of the program

It did it first try, but restarting my PC seems to have fixed it.

young dirge Jun 1, 2025, 1:12 PM

#

when I use the ai voice changer sometimes its like where my voice kind of cuts out for a very small amount of time. Is there any way to fix this since it makes it sound way worse

simple ore Jun 1, 2025, 1:14 PM

#

young dirge when I use the ai voice changer sometimes its like where my voice kind of cuts o...

mic sensitivity? voice activation is generally bad

young dirge Jun 1, 2025, 1:15 PM

#

simple ore mic sensitivity? voice activation is generally bad

I beleive my mic should be pretty good. I am pretty new to this voice changer. So Idk if there is any specific like mic sensitivity or like how to check it. All I know is when I tried to record my voice with like obs I could hear how the voice cuts of alot

young dirge Jun 1, 2025, 1:16 PM

#

simple ore mic sensitivity? voice activation is generally bad

if you mean like the level you can change in the sound settinga i turned it up too 100%

mighty sinew Jun 1, 2025, 1:17 PM

#

im using applio and im having a problem where with one certain voice it wont produce a audio file but it says its been inferred succesfully any way to fix this as the voice seems to work for others and i want to use it

simple ore Jun 1, 2025, 1:25 PM

#

mighty sinew im using applio and im having a problem where with one certain voice it wont pro...

look at the other window with the error log

simple ore Jun 1, 2025, 1:26 PM

#

young dirge if you mean like the level you can change in the sound settinga i turned it up t...

so when you record an audio from your mic directly are there any issues with the audio dropping out?

young dirge Jun 1, 2025, 1:26 PM

#

simple ore so when you record an audio from your mic directly are there any issues with the...

nope. Only when I use the voice changer never normally

simple ore Jun 1, 2025, 1:26 PM

#

is there an issue when you use a voice changer and use mic as an input and headphones as an output?

young dirge Jun 1, 2025, 1:27 PM

#

I use the cable input to be able to speak on discord and more. And that is when the problems accure. I use my normal mic for the input and the cable input for my output

simple ore Jun 1, 2025, 1:28 PM

#

we'll get to that, please answer the question above

young dirge Jun 1, 2025, 1:29 PM

#

simple ore is there an issue when you use a voice changer and use mic as an input and headp...

so my normal headpones as an output and normal mic as an input. I dont think so since when I use the voice changer and hear myself it sounds pretty good

simple ore Jun 1, 2025, 1:30 PM

#

okay, so now mic input, line1 as output, discord line1 as input, push to talk = enabled

#

is there an issue when you use push to talk?

young dirge Jun 1, 2025, 1:33 PM

#

gonna try it with friend so he can let me know if it sounds good

#

its still quite laggy apparently when I try it

#

or like it cuts of alot

simple ore Jun 1, 2025, 1:44 PM

#

okay... is the noise canceling enabled in discord?

#

there are a few settings you can try

young dirge Jun 1, 2025, 1:45 PM

#

simple ore okay... is the noise canceling enabled in discord?

I turned it of

young dirge Jun 1, 2025, 1:55 PM

#

simple ore okay... is the noise canceling enabled in discord?

could I possibly text you in dm's so I could send a photo of what my settings are so I am not doing anything wrong?

viral ruin Jun 1, 2025, 2:17 PM

#

Is there any working colab to train RVC ? RVCDisconnected seems to be banned

young dirge Jun 1, 2025, 2:42 PM

#

I finally fixed it on discord but now its just not working very well on teamspeak

#✨│ai-help

How To Troubleshoot

How To Troubleshoot

How To Troubleshoot

How To Troubleshoot