#✨│ai-help
1 messages · Page 235 of 1
hey man i really want this fixed without having to NUKE my entire pc
do you maybe have any ideas what might be the problem
maybe try:
- running force gpu clocks again
- do ctrl+shift+windows+b
- check if there's any weird default settings in the nvidia control panel/nvidia app
- try force fp32 mode on in advanced settings
it happens just to you 😭
i know its crazy the unlucky person i am
LMAO
but ive always been unlucky icl
my life is on hardest difficulty all the time
Bro has a 40 series and has the worst issue possible 😂😂😭😭
bro
the luck is crazy
LMAOO
Fr man I feel bad for you
ATP you might as well run it on cloud
Lmaoooo, imagine I was training a voice and halfway through my browser crashed
💔
It’s such a random issue ngl
it just came out of nowhere to haunt me hahahah
Is there a massive delay?
Yeah word man 😂😂
a delay massive to me because thats not a normal delay
i used to run it well
i be playing schedule 1 resident evil 4 remake etc just fine i know my gpu is workinfg
That’s so weird tf
is your model converted to onnx/or you're using rmvpe_onnx?
no just rmvpe
yes but check if your model is converted to onnx
it is not the model is just the model itself.
i never converted it to onnx
and i recently had to download the model again
right?
im so confused.
and ive redownloaded ts like 20 times
lol
the okada itself
fork
have u tried like different browsers?
try another browser
ill try edge
okay
its disabled
i just open w-okada and it havent told me im using onnx
uh
try force fp32 mode
set it to on
see lyery
disable jit compilation
alright
might be a program bug to report to deiteris
set it to "on"
alr could be his hardware or a problem with deiteris
fr? I thought it is related to his gpu driver/cuda 
idk abt hardware my graphics card is pretty recently bought
💔
like maybe a month or two old now
every single other program works fine for him
yes
so the latest fork or nah?
yes
it seems a very super specific issue that he has only in the latest wokada deiteris fork
does that happen with the original w-okada?
nope.
original w-okada works just fine,
how are u sure about that
same chunk?
yes
okay
so i gotta message that guy now?
yeah maybe try, he's deiteris
was this explained well enough without any confusion?
im trying not to make it any harder than it already is.
hello is ilaria rvc still usable
seems fine, maybe explain better that it runs fine but has issues when it goes background
@reef flax just a quickie but, have you updated your drivers recently maybe?
which one are oyu talking about (like the link)? whats your pc gpu? what do you want to do?
Maybe there's some nvidia fuckettry going on that's clashing specifically with the fork?
yes.

could be maybe?
I don’t know:
Might be
I recently updated it because it didn’t work
Well man, try to install previous driver
There are two versions of Ilaria RVC: the lightweight one from Hugging Face and full version from GitHub.
My previous driver it didn’t work either. I updated it still didn’t work
that’s why I updated lol
when was it when it still worked 'fine'?
weird shit is i got his own same gpu and on latest drivers it worked fine
A while back, maybe a few weeks. I started it and it worked fine
hmmm yea, that becomes the problematic part
Hmmmm...
Download " latency mon "
and see if you get any spikes anywhere
when running the fork
@reef flax
alright I gotchu give me a. Moment.
just
shi mb i was using the outdated one, i js found the latest one anyways so thanks anyway
make sure nothing is running, absolutely nothing
not even malwarebytes if you use it
( not even discord. Null
then start
and run the fork
I don’t use it can discord still run
i feel super nerd for having latency mon prior to this conversation 
Or something to screenshot with
aight
we gotta make sure it's as clean as it can get
cause a lot of things can contribute to the system latency
Then, note what is the latency before running the fork
and after ( use the model )
You solved that yourself? That's wild.
alright
then, once you get the data / info
yes
( idk note it somewhere )
run other realtime clients or whatever works for you
and note again
LatencyMon 7.31?
pick the latest one
Now, if there'll be no apparent discrepancy ( some measurement error is allowed tho. ) then idk, really weird
Yea, close all you have, run it, wait for 1-2 mins and see the latency ( note it )
then run the fork and use the model ( note any spiking latency as well )
and yeah, follow what I wrote before
all iknow is i have to do this now
is there something else i have to do or no.
just use the model right
yup, casual usage
i gotchu.
You'll be notified in the program if there's any major latency issues ( or " having problems in real-time application usage " or something along that
alright it’s loading right now
im using my phone
I used task manager to close discord etc
good
“Your system seems to be suitable for handling real time audio and other tasks without dropouts.”
is what it’s saying as of right now.
note or screenshot the bars and stuff, metrics in general or take a pic with your phone
then, run the other realtime stuff ( w-okada
and try to compare the metrics / bars
bet
Should I pause it
Run the stuff
Then start it again
Or keep it running
close the fork and run other w-okada / og one or whatever else that works fine for you ( delay wise
ye
honestly I think my latency is gonna be fine💔
Either way appreciate it
but yeah downloading the most recent version of w-okada right now
which worked totally fine for me
alrighty
i want learn making agents
Just asking to be sure not because im dumb I probably already know the answer.
When opening the w okada using the model, i dont continue right just restart it (the latency test thingy.)
I want to make sure im not doing it wrong at least
yes
you start fresh the latency mon
Alright.
Starting it up now
I’m gonna use the model while testing latency
Using the original one it still says this
I’m gonna do fork now.
@reef flax you see, what you're looking for is:
that kind of latency ( especially coming from nvidia )
doesn't occur
you see the bars, ye
Well, that means it's something up with the fork or the way it interacts with your gpu
here's what you can try
oh okay thanks for speaking English 💔
disable the:
hardware accelerated gpu scheduling
uhhh where
oh, it was off for you?
if it’s off or on
hmmm...
i have it off because it speeds up training lol
well ye, typically ye
but there were some quirky systems
where it was doing the opposite or would improve certain things so, if something fails in speed or latency department and there's no clues, I recommend trying it on ( or off if it was on )
You have it off
interesting uh
i don't use deiteris for realtime, i use fumiama
havent noticed a problem with that
Anyway, try to turn it off
alright hten.
restart the pc and retry, see if delay improved
That's good for you yes
I have it off too
so we're good ✨

alright im back
ill try fork
oh btw
should i run it on administrator?
i used to always do that
I mean, you don't have to unless it asks you for it
alr, id ont think it matters taht much anyways
well ye, but it's a good habit to have
im hoping this fixes it
if it doesnt oh well
it IS giving the same yellow warning
yeah teh awful delay is still here
lol
yea, guess all you can do is wait now
sadly can't help much as I ain't from real-time department
np
thanks 2 everyone who tried eitherway
hope you can get it sorted out asap
it couldn't even start for me smh
-# sounds like a skill issue to me
did it start for u
amd 😎
r/AyyMD
We are a satirical PC hardware community dedicated to proving that AMD is clearly the better choice. Everyone is welcome, including non-AMD fanboys.
Don't want to burn your house down with Novideo GPUs or Shintel CPUs? Then AyyMD is the right place for you! From dank memes to mocking silly Nvidiots, we have it all.
that versus UserBenchmarks
@latent kettle dude i got the model to 225 epochs before it randomly stopped
i got the files tho how do i resume training? i've never been into this point
do i just do everything from the beggining and untick 'fresh training'??
Yes. You must keep every settings as before but don't check fresh training. If you have closed the kaggle you need to copy pre processed data and logs and latest checkpoints
i have not closed anything yet
do i reopen applio or just do it right away?
How?
You got any errors
no it just stopped updating on here
i waited like 10 minutes and it didnt like refresh
It detected over fitting
it was going to stop at 300 epochs anyways so i dont think its a huge loss
damn thats bad right
I don't think so
well can i add epochs to it without damaging it? or not cause of the overfitting?
More epochs is not equal to good quality
so do i just leave it like this?
Can you dm me tensorboard graphs
dude one problem that i faced is that i couldnt select the actual model on tensorboard just the [model]/eval one
and now its like this
"fresh training" is for starting over instead of resuming it
yea i figured that thank you tho
its not working when i click on it
Then send me the screenshot of latest logs
the ones on kagglio or are there logs on tensorboard?
This thing but from last epoch
oh no thats the last log
dont use overtrain detector
Is it broken?
it ends training prematurely
oh so do i resume training and disable that?
i feel like the model still sounds pretty robotic idk how to fix that
hey so bossman told me to use server instead of the other which did work but the output with server, when monitoring sounds good but when its an input on discord for example it sounds bad.
do you know a fix for that?
i have it on windows-direct-sound btw
use wasapi
thanks let me try, i feel annoying asf asking so many questions -- i swear it used to run well at first 😭
hey so update, server is working fine just like how it used to work, i dont know why i cant use the other option but. i guess ill just say im good now? thanks to @glacial pollen @knotty moth @analog obsidian and @low shard for all the help they provided, and being so patient with me. If there's anything like a kofi or something for this team, i would love to donate sometime. lmk :).
Cheers man ✨
Should i use silencefront on or off?
elaborare more
great
for W okada. in advanced, there is a thing called silencefront, is it recommended i keep it on or off?
silencefront reduces gpu usage while completely idle
kinda useless if you're talking
can someone help me, right now my W okada takes like 7 seconds to work
Show a screenshot of your wokada
!give-media-perms 1h @somber canyon
The site
dont judge me about the images...
Uninstall vb audio cable
Get vac lite in the virtual audio cable step of the guide
Set chunk to 192
Also it doesn't really look like lagging
Is it lagging only when playing games
its just that the sound is coming out way later. no matter if im playing or not
the perf stays between 50 and 90
yeah, and it's green, it's not lagging
so the problem is with the vc audio cable?
if you're trying to achieve 0ms delay, there's no such thing that has 0ms delay lol
vb audio cable is known to give issues, that could also add delay or literally just stop working
it's better to use vac lite
i cant find the app to uninstall it 😭
nvm
found it :D/
so i should use vac lite
thanks
yes, you might also want to try https://docs.aihub.gg/w-okada/local/w-okada/#reduce-more-delay
Last update: April 5, 2025
just know that you can use that to lower the delay, not have it to 0, there's no thing as 0 delay
do i click on setup.exe or setup64.exe or setup64a.exe?
setup64.exe
the vac lite part is explained at https://docs.aihub.gg/w-okada/local/w-okada/#virtual-audio-cable
Last update: April 5, 2025
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
im using the new UVR colab in #📰│dev-updates but uh
which models are the best to separate vocals and instrumental
and then harmonies from said vocals?
NVM fixed it, and if anyone got the same problem just try to clear settings and it should work
Robotic ?
Is it male or female
Try to adjust pitch
Are you converting it from female sample ?
the images are so fucking good man
this is hella based
care to explain gng
💔
<@&1159293204038955078> Where are my meds?😩
what the hell
I NEED MY VICODYNE
</3
Is this the same as in the program or is there a difference?
parabellum speedrun ban o - o
real
how many days did u time him out?
only time will tell us
Hello, I was about to download the files for the rtx5000 series on this link: https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335 but for no reason the z01 just gets deleted and idk why.
wdym just gets deleted?
also what's your pc gpu and what you want to do?
So the zip and z02 stays in my download folder but z01 just disappears after the download is finished. I got the 5090 and use the real time voice changer
are you using an anti virus?
also lucky asf having an rtx 5090
the anti virus might be silly and eating your file, i would suggest temporarely disabling it
Windows Defender, i already turned that off but didnt changed. And ye xD
does the file literally just do poof instantly after u finish download it?
No it stays for a few seconds
so, you download it, it stays for 5 seconds then gets shy and disappears?
😭
yup
are u sure u don't have any other anti viruses?
like 3rd party ones
like avast
didn't even know it worked as an AV too 
yo when i get to the mmvc part of the download do i just download all three?
what's ur pc gpu? what do u want to do?
what's ur pc gpu?
also that's wokada deiteris fork, realtime voice changer for calls
W for ultrakill pfp
yeah i know
im only trying to download it so i use the sam microsoft ai voice model
idk if i can download it tbh m driver version is 512
and the required version is 528
nvidia geforce GTX 1070
you need to download the normal nvidia one
the one with 3 files, is the rtx 5000 series one
you got a gtx 10 serie lol
theres 4 series difference
where is the normal link then?
Last update: April 5, 2025
is that about it after i downloaded the normal one do i have to download anything more?
read the whole guide, you also have to download the vac lite and set it up
Where are the voice actors in this server?
What should I put above and below to change my voice in Fortnite?
Is this the same as in the program or is there a difference?
put input to line 1
All settings in all games are like this?
But why in fork input is Hyper x
not line 1
yes
A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs
In Wokada context, it's used to get the output of wokada as the input in other programs
Should I put input Line 1?
no, input to line 1 is ONLY for other programs
you're getting the output from wokada to be the input in other programs
oh okay
In all other games and programs input is : Line 1
but in FORK input is my mic
output : line 1
Yes, I understand
yes
hi does anyone know how i can resume training from an ai rvc?? my computer shut down and cut itself off and i understand i have the backup but i don't know how to resume it so to speak.
hello anyone know if 6700xt 12gb is enough for wokada?
Yeah it's enoguh
-realtime
1st link Wokada deiteris fork
Huh
@patent trellis wake up smh
Here's the link https://docs.aihub.gg/w-okada/local/w-okada/
Last update: April 5, 2025
I mean its worse or better than nvidia? or just equal sorry for my english
@low shard bro which one is the correct one
i ended the training early to see how it was doing
im training on kaggle tho and these are heavy asl im scared to download them and being kicked out cause of bandwith
what emeddor was used for the refineganv2 pretrains? every time i try to use them i get an error
@analog obsidian ima just ping u here lmao
but i was saying for a singing model
10min dataset is there a pretrain that i could use
klm 4.9 is really decent
even for english speaking models?
yeah
would you say it generally performs better than the og version itself?
perfoms better in singing, but for speech i feel the og is better
so like to make sure a rapping model like mine (lil uzi vert) it could work on that pretrain?
just so i dont end up training and it sounds like shit lol
Test em both as Lyery says
as for batch, on top of what you know to try, so 4, I'd also suggest 8
as all in ML, it's non linear and we can't estimate anything, at least with 100% accuracy
well im saying for singing models in general
like my dataset isnt trash
its actually really clear
klm 4.9 got higher vocal range
thats all i can say
about the rest, no idea, you have to try
its just time consuming and im not a paitent guy
well i am paitent but when i have to retrain the same thing to test oof
but ima try it
want this model to be perfect as possible
you tell me about patience
my first model I was learning all that rvc and ml / ai on, took me 3-4 months of trials, experimenting and learning
( which I wanted to make ' perfect '... I mean, it paid off lol
tho, it won't be that bad tho, it's just 10 mins of set
rather quick
Anyway, fcpe works so, gonna push the update
let me know when the update is there so i can update to the recent version on google colab
bet
if ur talking abt your fork
ye
also my dataset, has very minor not really noticeable clap bleeds, only way i can explain it is when ur playing music really loudly n you can hear it in the recording some of the song when its loud asf if that makes sense
should i keep it
or remove it
if yk
sorry for the ping again 
just wanting to know if i should
rmvpe should be able to handle that
but i would remove those
so i can make the ai job easier

keep in mind that rvc is going to clone every sound in your dataset
if something repeats often in your set, rvc is going to randomly add that sound in the inference results
oh i was gunna try sending u a sample of the clap bleed
but i cant send it
its only one song that has it
so its not really repetitve
but i can hear it lol
oh wait nevermind i can send it
its only 2 seconds long anyways
idk if u can hear it
its after he says his verse
yeah rmvpe can handle that
should i remove the sample i sent 
i mean its only 1 second but still
maybe 
i removed it in case
that is if it's spectral foot-print matches the learned feature
just to clear it up
1 second? then definitely yeet it
even if, out of 10 minutes, you'd have to yeet idk 1 minute of ' awful samples ' it is still worth it
machine learning really really hates outliers
quality over quantity is the golden rule
also noticed some of the dataset has some like jewlery shaking sounds when he raps or something, 
its very minimal
is there a way i can remove it
with an ai model on mvsep or uvr
or would that have to be done with rx
there are some models for sure but I am not a specialist in terms of sfx removal
@low shard should know more about it
bru its very minimal
but u can hear it
well, I need more context
frick dk if i should keep it
yeah let me see if i can get u a sample yea im pretty sure
not sure if it overlaps here but i found this
its like very little stuff in the dataset like clap bleed / shi like this
well
you can just yeet it
- ye, if those occur and are separated from the voice, get rid of them
else if model learned them and some of your input audios had some instru bleeding
it could be present in the voice output ( after infer
if you want legit good voice separation, you can try gabox fv4 I think it was
ye, melband roformer architecture, model's gabox fv4
https://huggingface.co/GaboxR67/MelBandRoformers/tree/main/melbandroformers/vocals
( well okay, voc fv4 is the name
dam ok i mean the dataset is really clean its just those annoying ass little sounds that are in the dataset might just have to manual remove at the end or try that one u gave me
what model did you use to isolate em?
but also keep in mind its not very often or repttive its just like once and a while, and it was only one song that had that kind of bleed, heard rvmpe just takes care of it
but i dont wanna risk it
you can send me a lil bit of some song or whatever you have ( dm ) i'll run it through fv4
and you can hear how it'd sound
because what you have sounds pretty bleached / squashed in lower range
it's not that it takes care of it
f0 / pitch is unrelated to such things in training
i used melband reformer bas curitz version , it has the highest vocal fulness n sdr rn better than any model currently for isolating (found in a guide in audio seperation )
the worst concern here would be the feature extraction part
if contentvec mistakes it for something related to voice, it could be contained within your voice's features
and then, if AI decides it's a good idea to copy it, it will do so
you want the whole song or just part of a song to use ?
send me some 20 seconds, pick the worst part of the song ( where most instrus occur for example
Voice tends to come in delay from wokada, is there any way to reduce the delay?
I use virtual audio cable
use some best roformer model or Bandit v2
if fails, yeet it out
I can still safely say that voc fv4 is one of the best models out there
then there was also bigbeta 4 and 5 I think
a lil better fullness but awful artifacts
bandit v2 may still have little quality compromise, also it separates laughters and some non-verbal even from the main speaker
even better than bs reformer?
i heard there some issues with voc fv4
bs roformers were the first roformer models afaik
then mel ones came in
now we have plenty of this and that arch
buttttt, imo fv4 wins
when it comes to vocal fullness and less bleed you say that one does better than any model?
sdr is quite low i think 🥀
BS roformer is rather muddy (with bright bgm), fv4 has better fullness
muddy voices could cause more robotic sounds
there are bleedless models out there but the results can be very muddy sometimes
ps, pro tip
do not use stereo files for rvc
pick one channel and yeet the other one, then export as mono
else there's ( afaik ) channel weighting or centering done? ( can be wrong in that )
yet for safety, always best to use mono
im going to compare with the mel reformer model i used and see if theres anything noticeable or anything different
it does sound good tho
im surpised removed that hi hat bleed too
MDX and older models have that issue, and the best roformer ones can remove it
now, lemme try out some magic tools on it
in the mel reformer model i was using it was still present in the vocal, it was a crazy hi hat that was played in the song , when u clean it up its more present
but in the model he used i cant hear it at all
hmm might have to try the model after all
iirc the first mel/BS may still fail on it
are you dealing with some hip hop/rap music esp?
there you go ( ps, that's lazy de-reverb, can be done better
sadly, eq and lil bitcrush effects are still on tho ye.
if you give it some lil care, you'll have a decent set
hearing from the audio there is some weird slight distortion + slight echo bleed
but yeah
and it does sound quite more fuller
compared to which a bit of echo or reverb, isn't as damaging, at times won't even break the model
yea.. that's why I stopped at fv4
bigbeta 5 for example had a bit better fullness but sadly noise too
dc high freq noise, pretty stubborn to yeet
also im hearing that becruilys melband reformer karakoke model is performing better than any other for removing adlibs / back vocals n such
should i use that one
or stick to bve
or stick to melband karoke
in that I can't help
I'd typically only use bve
( for input voice for infer, that is
cause it is too destructive for training purposes
better ask others I suppose
u think bve would clean this up without an issue , like from keeping the vocal from being damaged or
yea will do
but apprecaite the help i will try that model out
gotta go update that gogole colab too lol
Doubt it, bve and most old-gen models are pretty destructive in that matter I suppose
Either way, gluck~
i got a “no healthy upstream” error when logging into the weights.com site
Weights.com server side can go down sometimes, but right now the site is still up.
so im super new, I want to be able to create my own narrations with the clone wars narrator voice, as seen at the beginning of each episode, anyone able to guide me through that?
Are you generally speaking between Nvidia's and AMDs GPUs?
Everything is in the docs https://docs.aihub.gg
Last update: Oct 21, 2024
it came back up about half an hour after i posted so we’re all good now
I was testing the colab link of Applio, but I don't see any tensorboard. I actually found weird that the page under the code when training is a giant 404 white windows from chrome. It is normal? Because I don't see anyway of testing in a tensorboard way each model it generates.
I'm using this colab link
https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb
im training a model but idk how long it should take
I'm converting a 10 second video on facefusion Colab, I don't install it locally because I've already tried and my PC overheats too much, but after about 2 hours of loading the video, i.e. almost the end, the runtime disconnects because I have 0 computing units, any advice?
should it take 30-60 minutes?
colab? or local?
local
I don't know, I use colab usually the maximum I put on colab if I have a poor connection is 60 min the minimum is about 20/30 min
my dataset is about 9:30 minutes
so i guess it would take a lil while
i'm 1 hour in
I also use about 10 min of dataset, but I'm telling you the duration on colab which I think is similar to local
Not any fixed time
It's totally up to your PC spes, and other settings. Such as batch size, total number of epochs
I'm not an expert, but if your PC doesn't have a good graphics card or isn't powerful enough it could take longer, then I repeat I don't use local when I tried it took me like 3 hours but with a very slow PC that overheated a lot
Not every time. Training local is better than cloud if you have good resources use local versions to get speed and prevent from disconnecting
yes I know this but you need to have a suitable PC, with mine I just slow down the process compared to colab because my PC despite being a 500 euro laptop with ryzen 5 and AMD is still too slow
It's a common practice it usually take around 3-4 hours or depending on your PC specs and your settings. And it causes over heating due to 2 reasons
- Your cooling system in not good enough
- It's a resource demanding task. So heating is common
Laptops = over heating
I could almost cook on it when I was trolling locally hahaha
Laptops usually overheat because there is not much space to put that giant liquid or air coolers.
and it is precisely not using the local but only colab that brings me back to republishing this question:
I'm converting a 10 second video on facefusion Colab, I don't install it locally because I've already tried and my PC overheats too much, but after about 2 hours of loading the video, i.e. almost the end, the runtime disconnects because I have 0 computing units, any advice?
You must require a GPU for fine tuning
There is a daily limit of 4 hours and it randomly disconnect if it found no activity by user.
There is a little solution for it but idk it will work
Open developer-settings (in your web-browser) with Ctrl+Shift+I then click on console tab and type this on the console prompt. (for mac press Option+Command+I)
function ConnectButton(){
console.log("Connect pushed");
document.querySelector("#top-toolbar > colab-connectbutton").shadowRoot.querySelector("#connect").click()
}
setInterval(ConnectButton,60000);
thanks I'll try
Can you help me with this problem?
Turn on your internet connection and try again
There is a module maybe for super noise canceling it do requires internet.
@wild crag
Okay, I'll try !!
umm... I tried it and it didn't fix the problem. The internet was normal. Is there any other way?
@latent kettle
Pre-trained models are downloaded or not ?
When I press button Initialize, this screen appears, but none of the buttons are pressed here.
It was running normally until a few days ago, but suddenly it didn't work today
I faced this error once-time and I just connected to internet and unchecked sup2 sup1 and eco in noise section
I can't even press the "start" button there ...
Im talking about the quality of the sound in wokada real time , if are the same quality
Should be the same
Just know that amd is generally less supported in ai but Wokada deiteris fork improved AMD support
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
An index file stores accent of a voice model. If one RVC voice model misses an index file, the voice model would still work but without accent applied.
thanks a lot for help 🙂 i think i will explore the guide too 🙂
Which guide? Although some fork RVC programs may require a voice model to have both pth and index files to work, Applio and W-Okada don't really require that.
the server guide to have some rough idea of what i am doing
That doesn't tell which specific RVC program you're looking for. Let me guess, is it RVC or W-Okada?
Try Applio instead.
do you mean the old mainline based or the latest Applio-based?
older mainline based ( realtime one )
fork wokada is recommended https://rentry.co/forkvoicechangerguide
or there is also fumiama mainline fork though it seems rather experimental
thanks a lot
thank you
what sample rate should I use here? 30, 40, or 48k?
I tend to do all 3 of them sometimes and some of them end up with either a little bit of noise or little distortion. This is an audio that has been only merged with all voicelines I've got. No normalizing, just removing all silence.
32k
ok thnx
if you have the Izotope rx you can resample it to 32k if you want
i normally use audacity to change sample rates
hi can anyone help, I'm new to okada
gpu: radeon 6600
cpu: amd ryzen 5 5600x
whenever I try any voice, or tweak any settings its cutting like crazy
i'll take a peek later, though I am new to it since I heard it just now. Is there any guide/tutorial for it?
Did you follow this guide? https://docs.aihub.gg/w-okada/local/w-okada/#download-amd-intel-and-cpu-on-windows
Last update: April 5, 2025
sadly, I didn't but thank you!
so this is only software avaliable for me?
thank you a lots
hi is someone here?
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
@low shard do I download the VAC Lite? Windows.
if you're looking for realtime voice changer yes
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
be sure to read the 1st link guide
Ahh! Okay. Just making sure, thank you. Hopefully this one wont be as buggy as the last one 
xD
that's an old version of original wokada
also, you shouldn't use original wokada
wokada deiteris fork is more optimized
yup
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
@olive flower may i ask ur pc gpu btw
nice ur good to go, be sure to uninstall original wokada
and uninstall also vb audio cable
So, what I see is my 4090 RTX should handle this xD
Hey guys, i don't want to use Colab as RVC model maker. Please give me local RVC model makers 
ofcourse lmao, wokada deiteris fork is pretty optimized and can run even on a gtx 1050 (will have a terrible experience in games like marvel rivals but you get what i'm saying)
elaborate:
- ur pc gpu
- what u want to do
I understand xD But that is really awesome to know. It's less demanding
I am using NVIDIA GeForce RTX 4060 Ti, i just want some local trainer
chunk basically controls the delay
lower value = fast response IF it has enough power for that
lower value = basically gives the gpu more time to take responses, so less resources and less delay being used
TL;DR: yes
though, the version you're using is highly NOT suggested
it's going to have worse performance
It's less demanding
basically yes
colab is a cloud service only for people with a bad pc
thanks
yw
I used Gradio local trainer, but it doesn't work properly
what?
don't use video tutorials
they are old
use the tutorials i sent you
guys do ya know a online voice changer?
Why the new version of Applion just created me this files? config.json
D_233333333.pth
events.out.tfevents.1746466483.DESKTOP
extract_f0_feature.log
filelist.txt
G_233333333.pth
preprocess.log
train.log
Same problem
elaborate:
- your pc gpu
- what you want to do
G/D_23333333 files are a set of weight when you chose not to save an separate copy every x epochs
to save space
new applio version does not save half of those files.. no extract f0, no preprocess, no train.log
and the tfevents file should be in eval folder
but if you somehow got an old copy from a year ago, then maybe
3.1.0 or 3.2.0?
Hi
I was reading through the guide
I'm a bit confused. There's no specific moment where the guide says when to go to Audacity
In step 1 you start with Spek. In step 2 I can't tell without knowing whether we're still in Spek or in Audacity
Then in step 3 there's a screenshot that clearly shows an Audacity window
Also, am I supposed to extract multiple smaller audio files or one big one at the very end?
I already did, read the guide
how can i get rid off background noises when using W-Okada
show a screenshot of ur wokada
when using the voice changer the mic picks up random noises and i hear glitch sounds like im moaning when using it
in the background
!give-media-perms 1h @broken urchin
enable sup2 and echo and lmk
oh right you're on server, welp
what if you try to put in sens more to the right?
same thing
even if u put it to the max to the right?
yes
Yeah, but when i download that file, it doesn't work fine :(((
@crude flame maybe should make that a bit clearer
elaborate the issue
which file?
what doesn't work
what did you do
RVC1006Nvidia.7z
are u using it on discord or a game
It doen't made me the right files
discord lol
Only this files
it's better to just use applio, which was the 1st option in the guide, more improved and easier to use
same problem with that idk why
welp, you could either try client with sup2 + echo (but will not be able to gain that less delay perk), or use a 3rd party noise suppression program, you could even try discord's noise suppression but might cause some crackle issues
yes, just use applio https://docs.aihub.gg/rvc/local/applio/
Last update: Apr 01, 2024
it's an rvc fork (modified version) which is easier to use
also noobies replied already about that thing #✨│ai-help message #✨│ai-help message
@low shard This is the right link? https://huggingface.co/IAHispano/Applio/blob/main/Compiled/Windows/ApplioV3.2.9.zip
yes
please read carefully the guide, don't skip steps
you shouldn't have to unless it gets flagged
i didn't really have to disable it
which noise suppresion program should i use
Why i don't see any run-applio file
extract the zip
I already did it
Hello, i'm searching a AI model locally/or an app to make cover with a custom voice model cloning technique does this exist ?
i mean u could try nvidia broadcast app or discord voice suppression
@crude flame should we add a suggested list of 3rd party tools for noise suppression in ur opinion
what's ur pc gpu and what do u want to do exactly
there should be a run-applio.bat file
Nvidia 4070 (laptop) or RX6750XT (pc), I want to be able to use my voice to make cover of song i like
I don't have it, i downloaded from the link i sent
show a screenshot of what u see
!give-media-perms 1h @late flicker
you can technically do AI on both, but will have WAY LESS pain on Nvidia since they have better AI support (even generally speaking)
As you got a good PC, you can use RVC locally, you can choose between:
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Applio is suggested
If you want the Windows AMD Applio Guide, its: this
Well thank you for your quick assistance, I will check that out and come back if needed, thanks 👍
the amd gpu does have a bit more of vram which is great, but gonna waste more time trying to make it worth, maybe @simple ore could also give you a better opinion on which one to use since he used both amd and nvidia
did u use winrar/peazip/7-zip to extract the zip?
whats the app i need to use to seperate the voice from a song
winrar
Last update: May 5, 2025
ALso using 7-zip, but not rn
thanks
could you retry? the folder should look like this extracted, maybe retry with 7-zip
yw
I am downloading it from GitHub now, i will try it
Yeah, i have it
Thanks very much
But how to install it, there is applio-install.bat
Someone can help me with this? I believe I asked way too late at night and may no one saw it? But I still found it weird the colab give you a 404 error window. Maybe is an adblock issue or something like that? Or the colab is outdated?
the nvidia laptop is likely much better choice
as long as there are 8GB vram
you wouldn't need that, you would need only the run-applio.bat
yup there are
was amd ai support that painful
But it don't do anything
are u running as admin
nope
what's the absolute path?
C:\ApplioV3.2.9
no, it just wont be as fast
i meant shitty but yeah
there's updated zluda that makes it better, but I dont have amd PC any more to re-build and re-test the changes
I tried everything
Hello,
I'm working on a project where I want an AI to suggest full meals (like lunch or dinner) by combining ingredients from a structured database. The database is divided into categories such as proteins, carbohydrates, vegetables, spices, and sauces. Under carbs you have items like rice, pasta, etc., and the same goes for the other categories. Each ingredient also has attributes, like sugar content, calories, etc. I will have database of all the ingriends, like liver of zebra etc so the database will be very large.
The AI should pick a meal based on the user's input. For example, if the user wants a low-carb option, it should select the best alternative that also makes sense flavor-wise—for instance, curry and ketchup might not be a great match. And if ketchup isn’t available, the AI should reconsider. If it was going to suggest fries with the meal, but there's no ketchup, it should think again and offer a different idea.
What’s the best way to connect an AI to my database? I want quick responses—ideally under 2–3 seconds. I've heard about the User → RAG → AI → User pipeline, but I heard someone mention that RAG is not popular anymore, is that true. I also know that if i interact an AI from Ollama to my database its either hybridversion with RAG or training my data on a model which is called QnA, (im not sure).
Right now the data is stored in Json cause I know to little of whats best to store
I am really beginner in handling databases so dont judge me to hard.
NOTE: It's not a must that the AI has to "think again" if something is out of stock.
Well i successfully installed but how can i make my own model with my voice then ?
idk of any other than nvidia broadcast
step 1) have a problem; step 2) decide to use AI; step 3) have two problems
Yeah I am starting to find that at times. 😂 Though it has been a godsend to me with music.
please check the docs for that https://docs.aihub.gg
Last update: May 5, 2025
same
Can someone help me find a good life au voice changer I need one for a dnd game
why my okada vc so laggy
What's ur PC GPU
I'm guessing realtime voice changer
Show a screenshot of ur wokada
It turns out that I had selected the fastest response time ‘=‘ and my gpu couldn’t handle it
Now is fixed
Set to 198ms
😔 still abit of a delay but it’s fine it’s smooth
Altho.. Which was your GPU?
Just asking.
Also in case of anything we got docs that can help you regarding realtime.
Hi, I've been using Voice Changer for months, but I have a problem, all my voices are choppy, I hear lag.
https://prnt.sc/zyQwnq04nERe my settings
I need help , when they talk in the room they can hear their voice throw my mic any fix?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
You're using the original version of W-Okada, especially the DirectML variant for AMD/Intel GPU, which is outdated. What is your PC GPU?
What is your PC GPU? And which W-Okada version are you using?
Wrong answer, but close enough to the settings according to your GPU.
what am i supposed to say more than that
Can you send the screenshot of your W-Okada?
F0 Det: regular rmvpe
In. Sens: try set it up to -60 dB
Also, is VB-Cable working for you? Because most of people here complained to me VB-Cable won't even work for them.
yes its working its just when some one talk loudly a bit he can hear his voice throw the voice changer from my mic
If you encounter any issue when using VB-Cable, I'd say try to switch to Virtual Audio Cable lite instead. To hear your own W-Okada audio, set Monitor to your main headphone/speaker.
are these two options the same btw?
No, take the CABLE Input (VB-Audio VIrual Cable)
I think
So this is where thing gets really confusing when using VB-Cable. "CABLE Input (VB-Audio Virtual Cable)" is the main input I think. I'm not sure about that "CABLE In 16ch" one.
@knotty moth Can you help me plz? I am trying to run applio-run.bat But nothing happens
AMD RX 6800 16 gb vram
VAC lite only provides one for each input and output.
Use this better W-Okada from this guide instead. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-amd-intel-and-cpu-on-windows
Last update: May 5, 2025
Which version of python i have to use?
I'm not sure how on earth you tried to install Applio. But inside the Applio folder is supposed to look like this. The Applio path can be like "D:\Applio". Installing Applio into C drive can cause several issues.
Also, did you download and install the "compiled" one? Because pre-compiled one is for those who know what they are doing.
I don’t know the gpu from my pc it’s pretty old
Any GPU older than NVIDIA GeForce GTX 10 series is considered old.
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
Then is mine old it is a NVIDIA 370

Anyway, use cloud service instead. This NVIDIA GPU is very old.
I know but how do I use cloud Service
Read the guide there. https://docs.aihub.gg/rvc-voice-changer/cloud/w-okada-kaggle/
Last update: May 5, 2025
if I have a song with 96kHz, then i separate the vocals, lead vocals, and remove the reverb, can I still train it using a 48k pretrain?
Thansk for the help I will give you an update if it works
many 96k tracks are wasteful

pre-compiled is the version for dummies, yet dummies still fk it up
Could someone guide me to make an AI cover with my voice? thanks in advance!
With RVC, you'll have to "train a voice model" with given audio of your voice.
What is your PC GPU?
Geforce RTX 4090
Yeah, this is one of the most powerful NVIDIA GeForce RTX GPU you can train on.
Have the audio file of your song ready, & let's extract the vocals from it with an audio isolation software.

That link basically tells you basics on how to make AI cover. For the actual RVC program, there is. https://docs.aihub.gg/rvc/local/applio/
Last update: Apr 01, 2024
TYTYTY
Y'all are so helpful
Yeah, i am using the compiled one: https://huggingface.co/IAHispano/Applio/blob/main/Compiled/Windows/ApplioV3.2.9.zip
Alright so, I have been using RVC yesterday for the first time on my rtx 5090. It works really good while setting it on server mode, but when the gpu is on full usage, it begins to cut out the voice, delays and crackles. Is the only way to stay under 100% usage or any other technic? Also whats better, Client or Server? I only know that Client has the benefits with echo and Sup1,2
- are you using the version for 50-series as in the guide?
- what kind of game are you running?
- try server mode and wasapi devices for less audio latency (less likely to affect whether to cause voice cuts)
- Yes
- VRChat (in vr with Resolution upscaling and MSAA x4)
- And I was using wasapi in Server Mode
try stop using vr mode, or use a 2nd pc/laptop for voice changer+streaming
I mean it does work in vr, just making issues when it was at 100% by using MSAA x4, when i stopped using it and it was around 60-70%, it worked without any issues. So I guess I have to look that my gpu isnt at 100% usage. But than im asking myself, what about the Tensor Cores? As far as i know, currently the RVC is using CUDA, but will it be able to use the Tensor Cores instead?
why are you pointing at MSAA instead of perhaps DLSS?
the VR mode may be rather demanding like 4K res (or even 8K), but it may also be vrchat's poor optimization alongside the million poly count avatar models
as I said, the best bet is to use 2nd pc/laptop
Dont quiet know what you mean but DLSS doesnt exist for VRChat
I was in a world all by my own
But yeah i guess so
also you'd better watch the power connector status, perhaps it is throttling before melting risk could happen
I got the Asus Rog Astral OC, it has a nice feature where i can see if pins of my power connector are broken but everything is fine ^^
i need help downloading the w-okada ai voice changer
im so confused on where to find the download link in github
No dashboards are active for the current data set.
Probable causes:
You haven’t written any data to your event files.
TensorBoard can’t find your event files.
If you’re new to using TensorBoard, and want to find out how to add data and set up your event files, check out the README and perhaps the TensorBoard tutorial.
If you think TensorBoard is configured properly, please see the section of the README devoted to missing data problems and consider filing an issue on GitHub.
Last reload: May 6, 2025, 3:15:26 PM
Log directory: logs
What is this isue with Applio?
is there a way so that the voice changer doesn't sound like there's a robot or smth everytime i speak like how do you make it smoother?
how i can wnload appilo?
Unload ?
Which gpu do you have and what are you pc specs
rtx 3050
VRAM and RAM
Traceback (most recent call last):
File "C:\Users\pauma\OneDrive\Escritorio\app.py", line 59, in <module>
installation_checker.check_installation()
File "C:\Users\pauma\OneDrive\Escritorio\assets\installation_checker.py", line 26, in check_installation
raise InstallationError(
assets.installation_checker.InstallationError: Installation Error: The current working directory is located in OneDrive. Please move Applio to a different folder.
what is this?
It's clearly mentioned that your current directory for applio is one drive. Move it to other directory.
what should my gain be for an asmr voice?
how i can make this?
Are you using pre compiled version of applio. If yes, do what noobies said. Extract it to any other directory C:\Folder or D:\Folder or any directory
unzip it to C:\folder


