#✨│ai-help
1 messages · Page 236 of 1
who says is is a good idea to unzip 7GB of stuff into OneDrive/Desktop
idk how to change
might as well take a dump in your bedroom
Is there a difference between merging in wokada and merging in rvc? And will there be a loss of quality in the model that will be formed when I merging 1 quality model and 1 poor quality model?
ctrl-x, ctrl-v
You're so funny, get out of the house
what, you need a youtube video that would explain how to move folders to a different location?
yes, there will be a quality loss
merging is tricky tbh
is hard to predict the results
i prefer to use rvc for merging because w-okada can be very buggy sometimes
Don't. Extract Applio into somewhere like "C:\Applio" or "D:\Applio"
rvc hasn't 32k merging but w-okada has
umm how am I supposed to upload a picture over here , I wanna know how to use that voice changer on discord vc's
applio has 32k merging
@oak plank
.
okay
Talk a bit until your name turns light green so you can send your screenshot here. How can you tell me what is your PC GPU? And which W-Okada version are you using?
ik
what is W-okada
Wow, so you pinged an admin for this one.
it will be easy to do that in DM's ig?
W-Okada the realtime voice changer.
For example, model 1 doesn't make whispering sounds very well, but model 2 is good at whispering sounds but can't make subtle sounds, can the merged model make up for the shortcomings of the 2 models?
if you dont mind can I send you the picture of the voice change am using rn
No.
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
interesting
i havent never tried that uh but i dont think it would fix both problems
because the model still has the influence of both models
That's one of the bad questions I've ever seen. For voice models, find one in #1175430844685484042.
oka
so if you try to inference whisper, the model may still use the knowledge of the non-whisper model
but first try merging them tho, it may work good, no one knows
BTW is that okada voice changer free?
If you want to help someone, it's best to only help instead of rating how bad/good their question is 
You download a voice model zip file, extract it to a folder, and copy or move one or two files (.pth and .index) into ".\Applio\logs".
Otherwise they can wait for someone else for help
ok @hallow thistle this is the client am using
W-Okada is free.
so like I can use a minecraft villagr sound though that
I didn't know Applio supports 32k, will there be a noticeable difference in merging? I'll start downloading now
Download and use the better W-Okada from this guide instead. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows
Last update: May 5, 2025
yes I have it , but now how am I supposed to use it on discord?
Also, helping two people at the same time sounds like a bad idea for me. I can't focus many things at once.
I mean I tried using villager sound on discord through checking the soundboard
but it did not work as inteded
it was like 19k ms
Slow down, please. The W-Okada in your screenshot is the original version of W-Okada, which is outdated.
oh I see
iirc there's no native 32k merging but applio managed to do it by doing ... idk
i know there's an explanation on how they made that possible but i cant remember
w-okada dev probably did the same as the applio team
The original one and the one from the link I sent to you are two different W-Okada versions.
so shall I delete the older one?
Yes.
Also, no need to ask me about every step by step on this one. I can only focus on installation, settings and a program needed to be used with Discord.
downloading it is a bit confusing
RuntimeError: An error occurred loading the audio: Error opening 'assets\audios\Grabación_16.m4a': Format not recognised.
What is this?
Use mp3 or wav instead of m4a.
okay
Bro you can read the errors. These are very easy to fix. Read them carefully before sending it

m4a is not supposed format so it says to convert it into any other format
the mac download keeps getting corrupted for me alraagghhtt 😼
Same it says to extract applio in other directory as I told you above. So read errors carefully
The what? Sorry, but did you download the Windows one when you have Mac?
no im downloading the mac one cuz im using a mac
wait i think i know why
its not working
Ohhhh i was downloading the wrong file 😭
Replying to my message where that message was replied to another person is kinda unexpected.
yes but im having the same issue i was using the old w-okada and it's fried asf
Which CPU does your Mac have? Intel or Apple Silicon?
intel
Download the Intel Mac one.
yeah im doing that now thankx
🎉🎉
I think using external GPU with Intel Mac is possible, but you'll need a Thunderbolt-compatible PCIe Riser card, an NVIDIA GeForce RTX GPU and a power supply for this one.
I'm not sure if Detris ever made W-Okada for this especially, other than having Windows running inside Intel Mac in order to use CUDA features from such GPU.
can someone show me how to use mainline?
im using it for training a model
@steel forge
Anyone?
What is the best voice cloner?
I use Applio.
lower chunk number means less delay, but be careful
Welp, don't expect W-Okada to work that smoothly while playing AAA games at same time
Since AI stuff is more power-demanding than videogames
Altho there you have some recommended settings. (you got 3080 so use 192 chunk and 2.7 on extra)
Try with 2.7 or 2.5
As the guide states
extra
Well, you can test with each to see which performs better for you.
where can i learn how to train a melroformer model
@odd shale is there a cloud notebook where u can do it
I don't know anything about separation model making Bruhy, sowwy
Elaborate what you want to do
I downloaded a boy model but idk where to use it to turn on the voice changer
Cuz i bought voice mod and alot of things and I got scammed
So i need a app or smth to import in it the model to be a voice changer
I think you shouldn't have bought voicemod in first place.
In that case you can use deiteris fork.
Its good for adding extra effects to W-Okoda but thats about it lmao
@tranquil radish check this guide and install deiteris fork
I will ask for a refund I hope they accept
Just the site, its relatively straight forward.
Nope, we really don't recommend watching any yt tuts
These get outdated pretty quick
Mhm ok
Yo're welcome buddy.
Is this supported by this server or it's a local one
Local.
Oh so I found also a voice changer from a content creator called duckus he showed a tut u think the on EU sent is the best or cuz u maybe. know this content creator
Duckus? I'm pretty sure he used OG W-Okada on his videos along his crystal voice model
Meh, just stick with deiteris.
Idk tbh but the tut was a year ago
Then it's pretty outdated.
Don't follow yt tuts
Deiteris is the one u sent
Ye ty
Yep.
I am not the best at pcs and stuff so sorry if I am asking alot of questions
Do I need to pay for this voice changer u sent
What? of course not.
it's free
??
And will I need from time to time to change it or
Just look at the guide every so often to see if it has updated, thats about it. It hasn't updated in like 6 months though so you likely don't need to worry for a while.
Mm ok
When i have wokada forked idling on default browser or without anything open It utilizes around 90-100% of the gpu but when i hop on roblox it goes down to a consistent 50-60% utilization... Should i be worried? (I use rest protocol cuz it sound better, not sure if that's the reason)
hey guys, i wanna train my voice for singing but i want good quality.
how should i record my voice to get best quality for training my voice for singing?
for example: recording my voice max 5 min for 3 times with total of 3 audio with 5 min each?
and how much (total_epoch): should i do for training? 50? or 200?
the applio gradio is stuck on installing requirements
it is serviceable
it would work great? or just good 
because I don't think a RTX 4060 8gb will work well
@tranquil radish Are you using weights too? Do I have to pay to download the covers I make? I love using Nanachi but Weights doesn't have a good Neco Arc voice model like TopMediAi does
you dont have to pay to download the covers
Nah see i don't understand in this things 😭
I'm on the app & I can't figure out how to download them
It won't even let me listen to them unless I'm online with some kinda service like wifi
yeah try using the website
Hm ok I might have to remake some covers on the website
nope as long as youre logged in they should be there
Does the website have a better Neco Arc voice model than the app?
TopMediAi is the best I've found
website should have the same models
Awe alright at least I love the Nanachi voice model
Thanks
@craggy bough ah I figured it out I just had to share my cover to make it downloadable
Which do I choose?
It's making it confusing to download the cover
@tranquil radish Maybe you know?
Really I don't 😭
hey guys. i need a little help here. i download UVR5 UI and im a newbie to this. its running and etc, but im lost in the huge quantity of models. if someone has a "manual" hahaha i appreciate
https://github.com/Eddycrack864/UVR5-UI/blob/main/info/docs.md#uvr5-ui-documentation
Here's the list of the best models for
And here's the most huge doc about all models:
https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit?tab=t.0
edit 05.05.25 deton24’s Instrumental and vocal & stems separation & mastering (UVR 5 GUI: VR/MDX-Net/MDX23C/Demucs 1-4, and BS/Mel-Roformer in beta MVSEP-MDX23-Colab/KaraFan/drumsep/LarsNet/SCNet x-minus.pro (uvronline.app)/mvsep.com/ GSEP/Dango.ai/Audioshake/Music.ai) General reading advice | D...
omg haha thanks 130 pages
apollo can run in this system?
the model apollo* for restoration
The idea was to create an appl that has as many models as possible
ty and ur welcome :3
oh gosh, hold on, u are the creator of it?
yep
depends on the dataset size, 4060 may be faster
im your fan, like literally. u did an awesome job creating it, and i appreciate your work and effor to put this out
so many corporations use the models and limited the usage, and u manage to make a nice interface, and compilation all that. thanks
Thanks I really appreciate it 
for UVR5 in kaggle is there a way to change the segment size?
Do I also have to check the "Override segment size"
nah
output
for some roformers results could be weird
Do you recommend what segment size I should use
tbh default
okay
best cpu for voice moddles while playing games or even obs
ryzen series im looking for
X3D or just the X
more cores or more threads help?
Logs and weights
anything based on ur budget
hello
if someone can help me in downloading the app i would be thankful if someone can please add me beacuse i am fasing alot of proplems
Having loads of problems with applio
I did the preprocess as usual and it's telling me I didn't?
I think it's an applio problem cuz I used the NoUI ver and it worked
I'm trying to train a model using MRF-HiFi GAN vocoder in Codename-RVC-Fork-V3.1.4, from pre-processing datasets, pitch extraction, and generating index is doing fine. but why when i try to start the training im facing this error. I tried using different sampling rate and result is the same, now im using HiFi-Gan and its doing fine.
I'm using GTX 1060 4GB.
i want to try using MRF-HiFi GAN vocoder because i'm not satisfied with the quality of the HiFi-Gan.
My GPU is AMD Readeon 7700XT and i have the Alpha Version
Fork wokada recommended for amd gpus
https://rentry.co/ForkVoiceChangerGuide download amd/intel one (dml)
What app, theres hundreds of. Explain what you need
some one gave me this link
https://rentry.co/ForkVoiceChangerGuide
Whats your gpu, whats your problem
nivedia gtx 750 ti
Its too old, will not work well
You need online hosted, follow kaggle link
mmm
can u send me the link
btw what gpu should i get to work well on it
Thats also suggested in the guide, think it was
RTX 2060 or better
or AMD RX 5000-series or better
ok ty
That's my pc will it run the voice changer
And if not what should I get
This one
Someone plz help me with Mangio RVC fork. I did everything right, but the final model is shi*t
Do i have to restart the model?
I somehow can’t put this model in the download model thingy, it’s like not downloading it
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
show the logs for preprocess and extract features and training settings
Give me a moment, I closed everything
I'll try again
In the gradio ?
Or the files?
something is not right with your model's config.json
in colab
if you're using UI colab, then the logs will be in the colab, but settings are in UI
could somebody help me get an audiokable i can use for a voice changer
less unnecessary stuff, more the actual logs
anyway, you got 12 x 3s files, yet your trying to train with batch 15?
Ah
stretching too thin, my dude
I just don't change the basic stuff
I didn't know </3
you dont have enough data, you chose batch size intended for 50+ hours of audio
I didn't choose anything
I just left the default
What should I change it toM
*?
Those at the default settings when you open gradio
I'm not here to argue, I genuinely didn't know
Thank you for letting me know
I lowered it and now it's working
I'll know for next time
you cant just lower it to 1 or 2 and expect anything good from 30s audio source
how do i use the model im training while it trains. Im training locally on applio
how do i install rvc live?
So, you want realtime voice changer for calls/games?
calls yes
RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
What's your PC GPU?
I told you the differences between rvc and Wokada, so you'd need Wokada deiteris fork for realtime, what's your PC GPU?
how do i find out?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
How did you exactly train models without knowing your PC GPU?
GPU 0
NVIDIA GeForce RTX 4070
Driver version: 32.0.15.6603
Driver date: 10/15/2024
DirectX version: 12 (FL 12.1)
Physical location: PCI bus 1, device 0, function 0
Utilization 1%
Dedicated GPU memory 1.8/12.0 GB
Shared GPU memory 0.1/15.9 GB
GPU Memory 2.0/27.9 GB
Nice, you got a desktop?
Or laptop?
desktop
Even better
Wait
So how did you exactly train models till now?
Did you really use colab all this time?
Because colab is just one of the cloud computing service (remote good PC) services made for people with a bad pc
had a laptop
Ohh
Do you want me to tell you also how to train rvc models locally? @static gyro
Or do u just want realtime for now
real time for now
You should really NOT use cloud when you got a good pc
understood
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link, Wokada deiteris fork
404
i dont see download
Oh right we changed the url link just yesterday night
Last update: May 5, 2025
Here
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Yup the bot is up to date now
Any reason why uvr has ensemble all greyed out? i have the latest update and deleted old presets.
quick question, Replay keeps saying "Server not running" been sitting here waiting on that for about 10 minutes now am i missing some setting or?
v7.2.3
Edit: closing the server and reopening it had it starting around 15 minutes later
(I did the "remove local server" option already)
has this started to overtrain yet?
not even close
thought so
next time show all avg_50 charts
where is that
whats the software to make my ai model to sing
?
like this
bet
i have one question after i donwloadet a voice form weights.com it it dident have the index file it only hat a json file
what sample rate is this?
What are you doing
32khz
im good now
Okay
thank you so much really!!!, it worked, yesterday I tried it, but it stopped about halfway as it happened before, today I tried again and I understood that I had to insert and activate this code several times, so every 30 minutes I re-entered it 2 or 3 times to be safe, and the runtime lasted 3 hours and stopped only when the video was finished, thanks
Yw XD
It was just a guess work. I was not sure. But it worked 😆😆
if anyone has the same problem as me, you know this method works
thx
are there any free tools i can use to upscale my photos
to look more realistic
that i cld rn locally also
i have a 3060ti and a ryzen 5 5600
you want to do locally right? either comfyui or real-esrgan or SwinIR
You have to install a custom node named "the ultimate upscaler "
elaborate:
- what browser are u using
- did u give it microphone perms
- what's the exact issue
it exactly tells you why, repository not found
either privated or delated
also, what do you mean in the download model thingy? here we offer help for multiple programs, not just one
but i'm guessing the download tab on applio
Found out why it’s not working, model is empty
that’s why lol
not that the model is empty, it's that the repository is either private or deleted
and the creator left the server
where can i find this?
Mangio RVC fork is abandoned since 2023, use applio
i think i already told you that
where can i find "the ultimate upscaler"
the node he was talkign abt
4gb of ram is extremely outdated, plus that cpu is weak,does this laptop even have a dedicated gpu?
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
how to train my own voice via collab? i always getting some error with training collab
any latest collab link? for training voice
also, the guide is moved to the docs https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
Last update: May 5, 2025
firs, what's your pc gpu?
how to know
second, never use video tutorials for rvc, they are old
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
Colab is a cloud computing service (remote good pc) ONLY for people with weak pcs
it shouldn't be your first option, your first option is to check if your pc is good enough to run it locally
intel r uhd graphic 770
How many rams should I make it +Nivedia gtx 750 ti
a shit that's integrated graphics, please check if you got any other gpus like gpu 0, gpu 1, etc
i have amd thats why cant use local
AMD readon rx 6700
It's a pc but I can edit it but idk what to replace or add @low shard
holy shit that's ancient, yeah you're way below the BARE minimum which is a gtx 900, also nowdays the minimum standard is 8gb of ram for everything
your pc gpu wouldn't even be supported by the program
Mmm
@tranquil radish your only option is cloud computing services (remote good pc)
you want a realtime voice changer for calls right?
i have AMD readon rx 6700 but some one said only nvdia works for local
I just wanna prank my friends so games and stuff
AMD is supported locally
your gpu is supported via Zluda (Cuda emulator) on windows
how to do it is there link to download the working one?
If i buy his would it be good https://abosalahpc.com/product/تجميعة-15000ج/
(That's a computer website)
your only options are:
- pay for an atleast decent pc
- use the kaggle wokada deiteris fork https://www.kaggle.com/code/suneku/voice-changer-public with 30 hours for free weekly, but you need a phone verification
also what games? i fear your pc can't run any online games
There isn't 1 voice changer for calls and games and everything in 1?
can't read arabic, but 8gb of ram with a gtx 900 is the extreme bare minimum to even get supported, even though an rtx 3060 would be suggested if you're on a budget
Mhm
this is applio, an rvc fork, modified version
What's that?!
wokada deiteris fork works for both calls and games, everything that is realtime
if you want inference (use models) on pre-recorded audios, then you'd need another program
Real time means live right?!
Does all look like this
it's not related to your thing dw, it's a modified version of rvc needed for pre-recorded audio
Mmh OK
that's wokada deiteris fork, is that someone's else picture or you got a desktop?
Someone's else Pic
realtime means in live yeah, like using it in a discord vc
Mm
i gave you the options, there's nothing more you can do than that
your pc could have issues even on lowest graphics for games atp
Do u have a pic of the inside design of this voice changer
I am planning to buy a better one
Like
The design of the voice changer the style or they all the same
wdym the style
the picture you sent is how the wokada deiteris fork User Interface looks like
the local guide is https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ for people who got a good pc
Last update: May 5, 2025
Mm
does applio also work for singing?
yes, it's an rvc fork, rvc is Retrieval-based-Voice-Conversion, so speech to speech
I wanna know how kaggle looks like this voice changer
If u have. A pic
the interface is always the same, it's the same program
Oh
Then what's that diffrence
kaggle isn't a program
its a cloud service provider
Mhm
it has it's own user interface for the code cells, all it does is using a remote good pc
and it will run the program's code
This is complex ughhh
AI is more complex than what Chatgpt makes it look like
this is Open Source AI
you can use #1192011222023950368 or #✨│ai-help
no need to dm
nope, this server is english only mostly
it's the most common language on the internet, and we can't really moderate other languages
It's cuz there is a pc I planned to buy it but idk if it will run the voice changer
Mhm ok
what are the specifications
Well i really don't understand but lemme see it again gimme 5 min
I got this info but wait
GPU rx580 nitro plus rgb
Ik it's not enough gimme secs
Rx 580
Core i5 9400f
8 or 16 rams
Idk how many rams exactly
This guy didn't mention it
But it's 16 ddr4
Not sure tho
that's an 8 year old card
it will be the bare minimum for AMD gpus basically to work
So it's bad or what
it's the extremely bare minimum for amd gpus
it means, it's supported, but you won't get a stable experience especially while using it in games
calls it will be decent, but you will basically have to upgrade in some years anyways, it's like you're buying an already old car and expect it to be good for another 8 years
atleast an nvidia rtx card
also
it really depends on your other use cases for the pc
Mm
What does that meab
Mean *
take in consideration that if you're looking for a good experience in the program especially in games, it's better to have n rtx card
but also depends on what other things you use the pc for, like games or other type of usage cases, and also your budget
if you don't have the money, just use kaggle wokada deiteris fork
it's free even though it's on cloud, i think 30 hours are enough weekly for you
Well the RTX card well be kinda expsinve what version should I buy
as i said, it also depends on what other things you use your pc for
Games only
Mine craft pubg roblox free fire
This one in Egypt for 340$ so it's expsinve
oh those are pretty low end games, since they are even on phones
Oh
are you planning to run the wokada on those games?
you should be fine with atleast a gtx 1660 card if you're really short on money and need a pc that will last more, or an rtx 2060 if you got a bit more money and want less delay, just remember that i can't really say much since i don't play those games so i can't guarantee you i have proof of how it runs like roblox on a gtx 1660 card along with the voice changer
just know that you will play games at the lowest graphics while using the voice changer, it's normal to not have more delay in the program
Mhm
Even if the call is on discord but playing rblx will it lag
are you talking about high graphics?
Noo
The voice changer
Like will it lag
should be fine if you close every other program in background and play on lowest graphics
Mhm
@analog obsidian i think a gtx 1660 should be fine to run those games on lowest graphics + discord + voice changer, what do you think?
The proplem the gtx 1660 is 400 dollars in Egypt
Hi everyone - I was wondering how I would go about trying to make a video like this (https://youtu.be/ulYG7StLoww?si=S9hOTlxa782rsBIT) but based on my favourite movie?
as i said, if you don't have money, just use kaggle wokada deiteris fork
This will work on games and discord?!
it's the same program just running on a remote good pc gpu provided by kaggle, ofcourse it will give you only 30 hours weekly for free and you HAVE to verify your phone number
There is none for free forever?
ofcourse not
gpus are expensive
you're lucky there even are services that provide free time
I am talking about the voice changer
Mm
the voice changer alone is open source and free, but kaggle, lets you use their remote good pcs, so ofcourse there's a limit
you are expecting to use pcs that cost thousands to be for free forever to your usage
do you really need more than 30 hours weekly
I really don't know
How much does it cost?!
it's free, 30 hours weekly
What if forever
there's no such service that give you a lifetime access to their gpu
there's other services like google colab which have paid plans, which are depending on their monthly subscription or pay as you go
you can't expect to run ai on cheap hardware or for free
Mmm
Well the pc I am getting is for 15000 egy means 300$ so it's expensive in Egypt
So it's kind of expensive but the prices in Egypt isn't the best
what pc are u getting
what matters are the specifications
Well ik
Well i told u the specifcations
oh that one with rx 580? well i wouldn't really know if it will be good for gaming + voice changer + discord
i can't assure you everything will go fine with low delay
Mmm
ot?
Is low delay means the voice would progress fast or it will take time
delay = the time it takes to run the program
more delay = will take more time to respond, will be slower
less delay = faster time to respond
Mmm good
no what i'm saying is your pc surely won't be fast at all for it, so even if it works i would say it will be slower to reply
just use kaggle dude
That's kaggle dude?
that's not kaggle
your only options are:
- pay for an atleast decent pc
- use the kaggle wokada deiteris fork https://www.kaggle.com/code/suneku/voice-changer-public with 30 hours for free weekly, but you need a phone verification
also what games? i fear your pc can't run any online games
This thing
kaggle runs on cloud
it's on a remote good pc
it doesn't depend on how good is your pc
Dude this is so complex 🙂
local = runs on your pc
your pc right now is bad for local because it's weak
Is kaggle the program or what
cloud = runs on a remote good pc
kaggle will use their own good pc to RUN the code
Ohh
Wokada deiteris fork is the program
Now I got it
Kaggle is just a site that allows you to use their pcs
yes.. it's the realtime voice changer that uses RVC for realtime inference (usage of models)
Ohh ye
And kaggle just makes it run faster
Right?
kaggle is just a cloud computing service, all it does is give you the permission to run code on their own good pc
Ye
If i runned wokada alone it will lag so hard right
Without using kaggle
it wouldn't even run
Dam🙂
your pc right now is below the minimum standards
U mean the old one I told u about or the new one I was gonna buy
Like
THis one
the old one, the new one will work but not really much good and wouldn't suggest it for long time usage since it's already old
Rx 580
Core i5 9400f
8 or 16 rams ( didn't mention)
I do have but
Well it's 30 hours only that's the issue
Cuz idk for how much time I will use jt
https://docs.aihub.gg/rvc-voice-changer/cloud/w-okada-kaggle/ here's the kaggle guide if you want
Last update: May 5, 2025
Is there a paid version for kaggke
30 hours are already alot for free
Ik
for kaggle no, for colab yes, another cloud service, but there's no such thing as lifetime plan, you have to pay monthly subscriptions
Oh sad
why would they make a lifetime plan
no matter the price, it costs them alot to power those gpus
so they would lose money
Mmm
seriously, try to make 30 hours enough rather than paying money for an already old pc
Well see i would already get this pc for games and more my question is just will there be diffrence in the voice changer when I buy this one
Rx 580
Core i5 9400f
8 or 16 rams
the difference is, it won't be fast running games + roblox + discord
Mhmm
Mine?
Do you recommend any graphic card that I should get for a? 2000$ build
Mm
So bro she has an i5 4th gen with 4 rams ddr3 and u want her to get a "rtx 2060"
Like bro
Wth?
the user would want to buy a new whole pc, not just replacing a single component
so i told what gpu would be the good for the games they played + discord + voice changer
also, i suggested a gtx 1660 too
It's going to be good
not really much for games + voice changer + discord
@tranquil radish That's gamerfighter's gpu
Ye ye
Oh
wtf? spy?
Nah that's our friend
I meant gpu
you know right AI takes ALOT of computing? especially realtime
Ye ye
Ima programmer
Like I'm not that good
Btw
Do u recommend the 9400f for the cpu or ehat
Alr
Ty
not really needed to do an argument here for this, everything is explained in the guide https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ along with suggest settings for each gu, and the guide tells it clear that you can't expect much from old gpus especially for games
Last update: May 5, 2025
@tranquil radish speak only english here
Oh sry I didn't say smth bad anyways
Hello guys. I need a TTS system that can run through the command line (not a GUI-based executable). It should support RVC models so i can use those models. Are there any good options available? I would really appreciate any help 🙏
none
RVC is meant for Speech to Speech
not Text To Speech
I thought maybe there are already a work around of it.
The only way is to use Applio (with ui) that basically uses microsoft edge tts api to make text then uses it as an input in RVC, but it won't be emotional at all
So is there are TTS that you can recommend?
and it depends on the internet, unless you use another TTS of your own as a "base"
but no, there's no single way to have a native way to 'transform' sts to tts
this is the only way you could use TTS with RVC (ofcourse, you can use any tts-generated text as input)
but yeah it won't be as good as a tts ai like elevenlabs
Thanks for the help , I am currently gonna try "TTS" library , Let's see if that works then maybe Ill use "TTS_RVC" to do the conversion
THis also has tts but its bad , lol
Let see what happens
In order for me to have a higher speed in the MMVC do I need a better internet?
Or is there another way?
iirc yeah there's cli python packages, but all they do is use microsoft edge tts as an input
unfortunately you can't directly 'convert' the architecture
the quality and emotion will all depend on how good is the TTS base if you're going to use it as an input in rvc
elaborate:
- what's ur pc gpu
- what tutorial link did u follow
- what is the issue
Ahhh... There are better TTS ik , better then eleven labs the problem is my hardware . I will need 14GB v-ram at minimum 😦 . And since they are new there are no quantized model also...
@mossy gull https://docs.aihub.gg/tts/tts-tools/ this could help you too, it's a list of pretty good tts tools
Last update: Dec 12, 2024
My gpu is a Radeon RX 580, I followed the tutorial in the manual of the latest version, the problem is that I would only like to increase the speed at which the voice is generated, with as little delay as possible 🫡
AI requires good hardware yeah, unfortunately the only Open Souce TTS that is better than 11labs at the moment is Nari Dia TTS 1.6b, which is way better at english dialogues, not sure what language you need
close all programs in background, show a screenshot of your wokada and also tell what programs are you using wokada with
i remind you that your pc gpu is pretty old btw
Need eng , but V ram is the prob....
btw my main target is like a robotic voice , So i guess mid level tts will work
a robotic voice? then why don't you just use Applio since as it uses ms edge as a base it won't have any emotion
or do you mean low quality too?
I need smt that will work on CMD , cause I am making a script take the tts output and match the facial expression of a character in one go , just by giving the text prompt for the tts and a image of the character
put game graphics to the MINIMUM, cap fps to 60
check the triangle
use asio for a more complex but less delay guide: https://rentry.co/lessdelayasio
operagx eats more resources for their fancy effects, might want to try chrome or firefox
or, buy a better pc gpu since you can't expect much from a 8 year old card
I would recommend using Wasapi if you are a normal user.
https://rentry.co/LessDelayWasapi
What does ASIO do?
ASIO accesses your audio devices directly, while the driver that you always use on the daily (which is "MME") goes through multiple layers within the Windows audio subsystem, ca...
it's already running at the lowest possible settings in the voice changer quality 😭
extra 0.1 is the worst quality control you can put
ohh, might want to try smt like https://github.com/Atm4x/tts-with-rvc
I tried!
it's basically what Applio does but in a python package
it doesn't work?
F
Is there a restriction placed? It seems I cant send files here
I want to show the output
did you also tried the other things i said above?
!give-media-perms 1h @mossy gull
well you'd generally need to be level 10 to send images
Are there any free AI create song sites where I can create my own model?
I'll see now
could you elaborate more?
what's your pc gpu? it's better to check local first
import sys
from pathlib import Path
from tts_with_rvc import TTS_RVC
script_dir = Path(os.path.dirname(os.path.abspath(sys.argv[0])))
model_path = script_dir / "KittyKI.pth"
index_path = script_dir / "KittyKI.index"
tmp_directory = script_dir / "temp"
output_directory = script_dir
tmp_directory.mkdir(exist_ok=True, parents=True)
tts = TTS_RVC(
model_path=str(model_path),
index_path=str(index_path),
f0_method="rmvpe",
device="cpu",
tmp_directory=str(tmp_directory),
output_directory=str(output_directory)
)
generated = tts(
text="Hello, world! This is a test.",
pitch=0,
index_rate=0.75,
resample_sr=32000,
filter_radius=1,
rms_mix_rate=0.5,
protect=1.33,
is_half=False,
verbose=True
)
print(f"Audio generated successfully and saved to: {generated}")```
This is the script I used
I have tried difffrent f0 methods but its still the same 😭
have you tried with other models too?
yeah
I tried with one of thier recommended models too
same thing
rmvpe is the best one
also, what's your pc gpu?
Ohh , I just treid just in case
1660 super , but as you can see i run that on cpu
is that the prob?
I mean, if you put cuda, it will run faster
just saying for better performance
have you also tried playing with the pitch?
I know that much , I tried that way cause this script will be used in a lot of peoples machines thats why (since a lot of people does not have a GPU)
Yes , as you can hear that the pitch is not the prob, The prob is the way it talks . I mean why tf that accent coming from 😭
oh wait, you didn't set a ms edge tts vocie
it's better to set a tts voice that has the same sex and language as the RVC model one
tts_rvc has it, I bet
yeah you can choose the voice
it's in the repo demo script so
will check rn
I use a different script with Applio
This isnt half bad!
Using "TTS" library
yeah deffo better than before
@mossy gull remember:
- in most cases rmvpe is the best, crepe might be a bit more soft but it's not really robust at all for noise, while fcpe is kinda like rmvpe but less accurate and more soft
- always use a tts edge voice as much similar in language and sex to the rvc one
- always play with the pitch
Thanks bro I will let u know my progress 🙂
yo bro
what setting should i use in uvr 5
for de echo
could i ask u that
do yk lmao
the defualt is 0.3
on mvsep
and i could use uvr 5
but im lazy
i got the site open might as well
3080 10GB good for okada fork?
more than enough
the default settings are fine
but you can try different values though it won't be much different
i used the find model thingy right here #🔍│find-models message but when i clicked on the model it’s just only allowing me to like make the model sing and i can’t actually download it or anything is there a way to download these
nevermind i figured it out, the download button was just very tiny and i didn’t see it
i need help in terms of editing the batch size, data set and etc. i dont know where to find them
Guys I used to use control net fast with xl but right now gen speed is so low how can I fix that ?
Hi, I'm having a problem with my voice changer. It sometimes lags. I downloaded the latest version as recommended.
The problem occurs when the ping and MS speeds spike.
Screenshot lag : https://prnt.sc/vJ48pIxmygA2
Screenchost config: https://prnt.sc/rfWk0eUQ1VXD
AMD RYZEN 6800 16gb my GPU.
I see 2 versions of voice changer, which one should I download? One is vcc and the other is mcvv.
Have you downloaded comfyui
The fork is better
wdym 1 or 2 mb i cant understand*
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
@strange mountain wokada deiteris fork
so MCV right?
Which 2 versions are you looking?
install and use comfyui manager to install required custom nodes in the workflow
yesir im bad english 😭
this mmvc?
ok ty
yea
have u checked https://docs.aihub.gg
Last update: May 5, 2025
that's latest wokada deiteris fork
uninstall vb audio cable, get vac lite from the same guide where you got the program
uncheck sup1
set extra to 2.7
set chunk to 192
check the triangle
the "vcc" one is commonly called original wokada, the "mcvv" oone is commonly called wokada deiteris fork, get the wokada deiteris fork
what's ur pc gpu btw?
Hey bro, I put this in PowerShell: "ForEach($PROCESS in GET-PROCESS audiodg) { $PROCESS.ProcessorAffinity=4; $PROCESS.PriorityClass='High' }"
and it didn't work. It was working fine. Do you know how to fix it?
what's ur pc gpu? what tut link are u using?
RX 6800 16 GB Vram
what tutorial link are you using?
what's also the issue exactly?
Finally, I left some audio for you to listen to by entering the following command:
PowerShell "ForEach($PROCESS in GET-PROCESS audiodg) { $PROCESS.ProcessorAffinity=4; $PROCESS.PriorityClass='High' }"
The modulator started to act poorly, the performance increased, and lag was heard.
let's talk there
hi
hello, if you need help, elaborate
am looking for RVC GUI zip or anything similar to it
RVC GUI is extremely outdated
don't watch youtube tutorials for RVC / Wokada
a 1 year old tutorial is prehistorical in ai field
what's your pc gpu and what you want to do?
oh okay
I can suggest you better programs, tell your pc gpu and what you want to do
just never follow video tutorials for those programs since they get old very easily
@brisk robin You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
link for what?
voice changer
amd
discord or insta
which
amd radeon hd 7000
is there like threads, handles, utilization, speed .............
that's an over decade old gpu
i'm more surprised that people still have that
yea cus my new one is broken after exams ill get new one
your current pc would be extremely too weak
the only option is either using cloud (remote good pc) with limited free time or just wait till u get the new one
so i cant use voice changer?
ohh
what? i asked you a question #✨│ai-help message
just a girl voice changer
your current pc is too weak, i told you your only options #✨│ai-help message
do u want to use cloud with limited time and more harder to setup or just wait till u get a new pc
ok ill wait ty
yw
so there is not voice changer i can use now?
the only way you could right now is via cloud, which has limited time for free and is harder to setup
it would be better u just get a better pc
ai needs a good pc
it can't run on an over decade old pc
okk
do you recommend applio
what's your pc gpu? what you want to do?
there isn't a single program for everything
and ai can't run on any hardware
so what's the best option that i can use
rtx 3050
what's your pc gpu and what you want to do?
if you don't tell me this
I can't help you
nice, if u want realtime voice changer for calls, then just get wokada deiteris fork
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
oh ok ty
yw
by the way when i speak sometimes it comes out in another language sometimes it is distorted 💀
show a screenshot of ur wokada
!give-media-perms 1h @strange mountain
i go to school now can you remove it and after i come back from school can i explain it to you later?
Intel(R) HD Graphics 4000
that's cooked, weak integrated graphics
do you have any other gpus?
so is there is any video of via cloud so i can setup?
alr
no, all videos are outdated
Last update: May 5, 2025
but it's just better to get a better pc
what is this?
he meant the Nvidia or AMD radeon gpu
a guide to read
bro how much time it take to setup?
all you need to do is just read it
no, it's not a 5 seconds program like chatgpt
no, all videos are outdated
a 1 year old youtube tutorial is gonna just cause you more issues with old programs
just read the guide
ok i need to follow all steps?
yes
i can't vc
explain your issue via text here
no worries, just want to know where i can edit the batch size and data base
to fit the voice model
are u making ur model or trying to modify the one from soneone's else
its someone elses
can i send screenshots in here?
the creator put specific batch size numbers and etc under the voice model link
do i have to apply that
?
or is it just something else
that's just an information of how it was trained
it's not a setting for the voice changer or smt
ohhh omg im so slow
i thought i was meant to copy the settings
lmao
thank you boss
lol yw
one last question, does okada run well with AMD?
wokada deiteris fork does, ofcourse it's not as good as nvidia but it's more improved
be sure to NOT use youtube tutorials, those use an old version of original wokada
wdym editing batch? it's just some training configuration
like a cooking recipe after you've cooked it
could you please send the link
yeah i just figured that out lol thanks though
im getting failed to fetch error in the client, how do i fix this
if i wanna train on 40k should i remove the audio/data that don't 20 khz?
which settings i should use here for 5060Ti ?
thank you
im guessing trhis
What's your PC GPU and what you want to do exactly
That's original wokada, don't use it
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
The link got changed to the docs one
Are you sure that person uses Linux? Because your fork W-Okada guide Rentry link has #download-for-nvidia-gpu-on-linux instead of #download-for-nvidia-rtx-5000-series-gpu-on-windows at the end.
im getting this error can someone help?
This could indicate that you didn't select a voice model you uploaded on W-Okada.
Also, install W-Okada into somewhere like C:\MMVCServerSIO or D:\MMVCServerSIO. It doesn't have to be on your desktop folder all the time, as it can cause some issues.
thank you
can you look at that too
wat is that mean
This is a warning from one of packages in W-Okada, telling you one of its code is deprecated and will be replaced by another new one. Although you can simply ignore it, as it less likely has to do with the program.
anyway to run the voice changer on a intel cpu? i have the winnonxdirectml 1.5.3.15 version
uhm in original woka where 192 ms?
What's wrong?

dont run as admin
You use RVC AI Cover Maker? Never heard of this one, but I use Applio.
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
You wanna run W-Okada with only CPU? Does your PC have any GPU?
integrated cpu
gpu*
Running W-Okada with only CPU or an integrated GPU is not recommended.
If you really wanna run W-Okada with only CPU anyway, there's this guide. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
Last update: May 5, 2025
alright thanks
For an actual situation like you wanna use W-Okada with a game, use cloud service instead. There's Kaggle, but you'll have to register with your phone number on this one. https://docs.aihub.gg/rvc-voice-changer/cloud/w-okada-kaggle/
Last update: May 5, 2025
alright ty
wait but its saying that kaggle will use my cpu's gpu aswell is it like a system where it uses both the cloud processing power and my cpu
You confused? Kaggle doesn't use your PC CPU for it to process. It runs on its own server.
ye but on the tutorial its saying to select a gpu
WAIT OH
OHH
nvm
@sacred crescent To find which proper program where you can use RVC voice models, go here. No need to hop into my direct message just for this.
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
I am just looking for a good RVC program to use

Which one are you looking for? The AI cover or realtime voice changer? There are two specific programs.
I got a kanye AI model and just want to use it for lyrics
You can't simply ask for a download link like this. What is your PC GPU?
3070
I extracted and it just came up with a CMD.exe saying AI Models??
Any idea to elaborate?
Like what do I do now?
It said press any key to continue and now nothing is on screen
Are you sure you have downloaded the compiled one? Because the compiled one isn't supposed to crash upon running.
If you wonder what I mean about the compiled one, I mean this link. https://huggingface.co/IAHispano/Applio/tree/main/Compiled
Yeah got the 2.9 one
More like V3.2.9.
Applio is supposed to look like this if successfully launching.
Yeah didn't work like that for me let me try and run it again
It says move the applio folder to spaces without path
so straight into C drive aye?
just moved it and still says the same thing
If you have extracted Applio into your desktop, do not do that. Extract it to somewhere like C:\Applio or D:\Applio.
Okay let me try that
Guys how do I get RVC working
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
So other people can hear it
elaborate more
I did
.
how can I get my RVC working
so everyone can hear me
I think you mean W-Okada the realtime voice changer.
@timber bramble
RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
- what's your pc gpu
- what do you want to do
- what tut link did u use
there's thousands of ai programs
RVC and realtime voice changer (W-Okada) are two different programs.
i can't predict which are u using
Im using
i just hope you aren't using youtube tutorials, because those are old asf for wokada and rvc
what's ur pc gpu? what tutorial link did u use?
if u don't remember the tutorial link, u can share a screenshot of the program
I have a 3080Ti
!give-media-perms 1h @timber bramble
You can type each information in one message at once.
what's ur pc gpu?
This is what im using
that's old original wokada
Oh
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Lemme check hold up
As I expected. You're using the original W-Okada, which is outdated.
