#✨│ai-help
1 messages · Page 256 of 1
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
I think you should do -read
-read

guys does anyone have tried creating realistic pictures? i tried and it wasnt realistic that much. Can somebody help me ?
I meant actually read it lmao
i cant understand english
you're literally typing it
gng i have no clue what the -realtime is telling me to do
js act as if
i have 3 braincells
combined
and explain what i should do
bruh that other kid doesnt help at all he just says random bullshit
guys does anyone have tried creating realistic pictures? i tried and it wasnt realistic that much. Can somebody help me ?
ok so what's your gpu, open your task manager and click preformance
if you're on phone I'm gonna dissapear
i have a 1050ti
nvidia
cool
can somebody help me 😂
Imma send u the download and stuff and help u out on installing it and using
dms ok?
i believe i have an outdated voice changer and i want my friend to install it aswell since he got a pc upgrade, he currently has AMD though and he installed one which is old through a very old yt video so like i wanted to find the lateest one for both him and me
I don't know anything abt image generation, u can keep asking tho :3
I only know the link to the nvidia compatible one so he's kinda out of luck for now
I could help u tho
alr then
-amd
😠
-nut
-manat
do not nut on me
i cant hear the voice changer and dont know if its acc working or not
send a screenshot of it so I can see if you have the mic settings set up correctly
oki
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
maybe if youre lucky 🤷

Idk 😭
Guys, I'm looking for an AI that can take a rulebook with Card Outlines that I created and create hundreds of Card images designed for the strategy and rules I created because this is a tedious task for game design.
i just tried reinstalling UVR too and got no issues, you sure your windows and drivers are up to date?
also are you sure you don't just wanna use cloud which is faster than your pc and easier?
it has finished running.. there's the green check
also, the original wokada collab doesn't work, and the wokada deiteris fork colab does work but only in the paid tier, if you try it in the fre tier you risk your google acc to get banned
tell your operative system and what you want to do
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
@viral mason isn't an helper, tho if he wants to help he should actually help
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
This is a General AI Server, AI has many fields
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
ye nvm
you have to elaborate, AI is a complex and intensive task, and be sure to not use video tutorials since they are old
if you're looking for a 1 click program, open source AI isn't like that, this is a community driven program, not a product from a company
i already said nvm ima just keep the voice extra chunks as u told me before 2.7
when did we talk?
u told me how to install remember
@native cedar did you use an alt acc? if you send me the username i can check our old chat and help u up
the browser voice changer
How do I que a job?
It says I’ve reached my qued job limit
But I haven’t used my weekly yet
that's called wokada deiteris fork
It says so at the top
is it the best version
#✨│ai-help message
chunk: controls the delay, should be higher than the perf value while running
extra: controls a bit quality, higher than 2.7 can cause cutoff issues
wokada deiteris fork b2332 which is the one you have, yes
alr ty
do you need any other help
also wdym u dont have access to it anymore?
jus lost it
I’m confused, can you train one free weekly model or not?
if you're talking about weights.com yes
if you have any issues
ask in their server, not here
Ok
if 40k is the best sample rate for my dataset what settings should i use for exporting in audacity because they dont let you do 40k
how do you want the ss
the folder just has something that says start_https and another folder with alot of stuff in it
I help with stuff I understand
But ye I'm not officially a helper
I think 32k is cooler :3
guys pls i get this whenever i use an RVC voice model RuntimeError: PytorchStreamReader failed reading zip archive: failed finding central directory
start_https.bat is a thing from old original wokada, all video tutorials are outdated
elaborate the rest i asked u
This is a General AI Server, AI has many fields
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
What error you have when you open okada?
what do you want to do? realtime voice changer for calls?
but insta closes
Do you have some external antivirus ?
yea the voice changer for discord and games i play
the version he got is outdated, i remember you're an ex helper but i don't think you're aware of the newer version
never follow yt tuts for that
they all use an over year old original wokada
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Yes I didn’t never use new version I still have previous I’m stuck at 2024 hahah
I have to check all
All new stuff
better check the docs https://docs.aihub.gg
Last update: May 5, 2025
already started a 40k train but i guess i'll try 32k after that 
Meow I want to ask for help
I don't know, I got this answer before, if someone answers, please mark it.
Ask the question
do i click link to github
and download that
No, you read the guide, the download is literally below
i
dont see it
only think i see it tells me to download is nvidia
where is the download
Are you on hugginface now?
hello!! im very new to using the voice changer and im just wondering if there are in guides that will work for mac?
that error means one of the files with the models (.pth) is damaged - incomplete download most likely
Should i reinstall VCC CLIENT?
Or the voice models
depends on where the error happens (see the console output)
I tried multiple voice models they didn’t work ill try to reinstall vcc client
Thank u for the help mate!
i mean show the fking log
probably rmvpe.pt or something, just delete what you have in internal folder and it should redownload
Have you tried with some other voice model ?
Yeah alot
So I think instead of upload .pth you upload the zip
Extract the pth from the zip
And upload only the pth
could be
unless they show the log
although with the new ipad generation @flint vapor has 80% chance to be right
ipad kids scare me
Yes if he send to log we can clearly understand which is the real problem
I think he uploaded the zip
Because it was say failed reading zip archive, but I don’t know which client is he using
pytorch models are .zip too
Oh yes true but I afraid maybe he uploaded directly the zip file
Because idk which client he using and how he is using
Because he said he used many voice models so I think is something about his client or he didn’t upload correct
well the thing is that i have a model for uvr, so i dont know if i would be able to load that into the cloud version
like a custom model, not one of the presets that it alr comes with
@crude flame u should make a voice model of dis guy in yt "Not6ixid " he has many clips of his voice hella deep
how good is it?
Why me?
ur insane yk
ur the best ever creator
ur insane
I'm above average but I'm not that good
how long does it tke to make a model
Depends on my motivation
This last model I made and dumped took like a week
I couldn't get good sibilants so I gave up
But in general a model can take a day to like months
Codename's best model took him 4 months
whos codename
fr?
Drama
making a model is tricky because randomly rvc doesnt wanna learn the dataset/struggles for no apparent reason
Yeah
his models he posted are gone to?
Yeah 😭
He never posted models
oh
But he was good at it
so ur the best creator rn
No
who is
Idk prob lyery
me 
what models u got
I was joking-
I haven't heard many models from people so IDK
it's all subjective tbh
^
sure there is skill involved but most of the time the results are super unpredictable 
u can check them on my weights page
whts ur page
i found a very good corpse husband model from weights
bc I'm cool 
I do but they aren't e girl models
I could copy it if the weights page wasn't shittings itself when I try copying it, just look for Claptrap on weights and find mine
They are now normal girl models
Rip all my models 😭
we need more big buff oily men models
we need super deep hot voice men models
😭
you can have the best dataset in the world, that you spend a week cleaning it
and rvc may still hate it lmao
You are like the third person I've seen that wants guy models
I've been here since like 2023
deep voiced man model pls realistic

best I can do is ss it because weights will not let me copy 😭
German deep man voice
i alys wanted deep voice
Real pls
pls latina german deep voiced eboy trump model 
You are the second person to say that one eboy model i made was good
Hey, sorry if this is a silly question... but does the Deiteris fork of W-Okada start offline after you've DLed the files?
In the mainline release, if you were to try starting it without internet, it'd say something about failing to DL models and just not work.
could be a leftover of the original version, which for some reason, required internet connection (sussy)
mak a corpse model 5 hours of the voice
how do i stop the mubling
for ai voice rvc webbrowser one
cus noice suppresion or wtver is so bad makes it super bad
do the later versions of wokada still have the rvc quality high setting? ik the difference isnt noticeable but intonations sound beter and i have resources for it
also im on an older version of wokada, if i switched from nvidia to amd gpu, would it still work or do i have to switch versions
im looking the code of the original w-okada and it seems what the rvc quality settings does is to repeat the inference two times, so you speak, the model produces a result, then inferences that result to give another output
that is... weird
there's no way to improve a model's quality, but what you can do in the fork is to enable fp32 inference mode, it's hard to explain in simple words what fp32 is but, in theory, it should reduce the amount of artifacts you get in the result
in the latests versions of the original w-okada i don't think you can enable fp32 without manually changing the code
in the fork u can do it in the interface/gui
besides that, increasing the extra to 5s can technically give you the highest quality realtime can offer
but if the model is already bad, it wont make it any better
the model im using is good, i agree that the rvc quality pipeline is a bit weird but words sound more natural and theres like "more emotion" in it.
idk how else to describe it
i guess ill stick to the older version of it for now, does it work with amd gpus?
because its easier for the model to clone its own voice
it works but ive heard its super laggy and choppy
link for future reference?
Last update: May 5, 2025
ty
Hey guys, what does Timbre Leakage actually look like in a spectrogram and why are whispering voices so hard to make D:
timbre leakage is not something you see in the spectogram but something you hear
it's when the voice of the model sounds off and extremely similar to the original audio used for the inference, so let's say you make a model of your fav character, if your model has leakage everytime you try to inference something, the model is going to have a voice similar to the source audio rather than the intended voice
and whispering voices are hard to make because whispering is noise data and rvc doesn't do great with noise
it tries to average the whole dataset into a unique voice, so if u got 50% of normal speech and 50% of whispering, the model is going to try to "blend" the whispering and the normal speech together, making the model sound very weird after training
if you trained your model with the og pretrain and contentvec you can just set the index a value to maybe 0.5~ to fix the pretrain leak
but if you used spin instead, then it cannot be fixed, its a weird inconsistent bug
@analog obsidian interesting, thank you very much, you are doing a great job!

another question, i'm trying to understand how to interpret tensorboard graphs. i've seen you guys chatting (codename;0, Yannenou, noobies and you) about several graphs and clipping.. is there some guidance with examples? not especially about clipping, generally
how do i stop the mubling
for ai voice rvc webbrowser one
cus noice suppresion or wtver is so bad makes it super bad
keep in mind while reading the graphs:
grad norm d = low values and going down
grad norm g = below 300 and going down, i personally noticed if grad g is over 400 the model has always some random problem
d/total = going down
fm = going up (this is the only metric that should go up)
mel = going down (very important)
kl = going down
g/total = going down
thats pin-worthy, isn't it? 🙂
grad clipped and such were testings when grad clipping was a thing in rvc, later it was found it actually never worked as intended
xD
lol. how the turn tables
if you're using the new exp_f0 spin branch, there is a new loss that i cant remember the name lmao, but that also goes up alongside fm i noticed
i mean its not "new" it was always there, just hidden 
every of them are important but imo, keep an eye in the grad norm g since if thats high, the chances of the model having some random problem are increased
adv/g
that!
Adversarial generator
then dont use the noise suppresor
then it mubles like crazy
rip
yes
i have literally no idea what you are talking about, the whole terminology is still new to me, f0 okay, ground frequency - pitch related, grad i guess gradient - norm, normalization? the learning curve is kinda steep i was playing around first of all and now i start digging
these are your grad norms
gradients
they're in the tensorboard
so ideally you want them to go down and chill
with low values like that
if they're always going up and having crazy high values, something is wrong with the dataset, or the batch size may be too low
they're noisy and fluctuate (moving up and going down constantly), thats normal
as long they're not super high everythings gonna be fine
this doesnt tell me anything about the learningprogress itself, right?
they do
i mean yeah, if something is off
none of the metrics can tell u when your model overtrained exactly
or tell u which one is the best epoch
how do you encounter that, going for the top x out of y lowestest values at the given epoch and running several examples through UTMOS?
ow that is not implemented in rvc
ive asked for some validation losses before
there are other stuff more precise than val loss as well that vits used in their paper
Codename's fork has some 😎
i've implemented val/loss in seed-vc, pytorch makes it easy
yea there is already val loss in code's fork and noobies also did his own val loss train.py
it's better than nothing imo
the regular losses are kinda a meme since they dont help at all
@supple glacier that was the intention behind the question
i see i understand now
hmm yea obviously adding validation stuff would be awesome
they're far superior to rvc's current metrics
the rvc metrics dont really tell u anything lmao
Chat is this overtraining?
YES
btw @supple glacier we have tested batch sizes in rvc, we've found that you can use bigger batches in datasets just fine
batch 64 works fine in a 10 min set for example
Why my 30 second e girl dataset sounds bad?
gives u a smoother convergence and the model see less noise through the training
if i save every 100 epochs a checkpoint, does rvc use exactly the trained parameters at the specific epoch, or does it look -+ epochs to the left and right and averages that?
if u got more than 10 mins, yeah use the highest batch ur gpu can handle
thats a good question i wanna know too lol
ik rvc saves the exact parameters at that epoch, allowing to stop the training, and continue training later
about the other thing, 0 clue, noobies probably knows xD
It uses that epochs parameters
Imo doesn't make much sense to avg the epochs before and after
By avg those you can be making the model worse
yea that makes sense to me aswell
lines = checkpoints, circle the epoch i would like to check out the most, right?
i cant, since no checkpoint
lowest point in the graph is usually noise
Save every epoch 
your ssd controller and cells hates that trick
i actually got the best epoch in the moment g/total started to rise forever
the pain truth is
the only way to know which epoch is the best is by hearing them
hah, interesting
Also sometimes for no good reason RVC will just hate the voice you're trying to train and it will never sound good
yeah the loss metrics are really that bad haha, we really need some validation added
Ik from a lot of personal experience
Codename's fork 
Have you tried them yet?
due to the source-speaker maybe or in general?
I have no idea why but it will randomly just not like a voice and no matter what you change it won't be good
I wasted a week trying to get a model RVC didn't like to sound good
im gonna check them real quick
i've encountered a similar issue with a voice, the person itself sounded like a robot..not in a electric way/distorted way, just monotonous, no dynamic/depth.. i mean how do you make a robot make sound more natural haha
This model was the model I was going to try hard on 😭
interesting, i got this exact problem when i tried training a model using the old crepe with custom hop
But nooo
the model was not able to do any pitch changes
that happens when you get out of vram while doing feature extraction
it generates a f0 file with 0 information in there
so weird
indeed
alright, gotta go, thanks for the chat.. just starting with this whole stuff.. lets see where the journey goes... starting work in 4 hours 😦
bye 
8gb of vram problems 😎
You seem promising, I think you'll do great and be great!
hey guys, so I followed the github steps to get OPENVINO with A1111 Stable diffusion, and even watched a video from Intel themselves and... I am getting an issue of my SD using the CPU instead of the GPU
is 56min of dataset too long?
thats a good amount
some would dare say its not enough for realistic results
another question, if your dataset is monotone, will the voice model also be monotone no matter how you say things?
oh so even if you try not to sound monotone, it will just sound weird?
yea
gotcha
why is ma ping so high
What is the best real-time AI voice program currently available that supports custom model uploads?
Does anyone have a tutorial for making a model with applio colab?
chat, does anybody know a community where i could get help w llamaindex
does anyone have the E-women voice file??
I have a 3060 and I handle good rvc and okada
I think there should be an option to only save model weights but G and D files are only saved in the target epoch, which is what I've tweaked
the downside is that it could lose the progress if interrupted
no idea but the testing results still seem unpredictable for me
I'd consider some factors like amount of noise, glitches, ability to handle edge cases, etc.
hey there , where can i find flux kontext dev kaggle notebook ?
i cant simply fix it without any context. tell me whats up
sounds robotic
i made the model in weights
is that normal
hello
is ur profile picture ai
Can i have Flux kontext dev kaggle notebook ?
and is there a kaggle nb collection ?
is it my model
that sounds bad
theres no such thing
nope, thats not really how machine learning works
oh
How do you use AI cover now? Use the simplest method on colab
is it normal for my voice changer have like 1-2 sec delay in dc voice chat?
using the server audio will reduce the delay but it will have echo issues
Yes is normal 😁
hey guys where can i download okada??
-wokada
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Here there is a guide! https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
Last update: May 5, 2025
well that sucks, it not suitable for gaming,
at least i could use it for chatting
yeah thats czz it process on your local machine hardware rather than a server or cloud computer
Question, is the "v.1.5.3.18a onnxgpu-cuda" the old version? Trying to use it for a combine soldier voice model but it is at times a bit choppy and not always smooth so I feel like it's the old original version. Which one is the newest stable original version? Is it the 2.0.78 beta or 2.1.4 alpha?
I got a 2060 RTX, and Windows 11, trying to use it in G'mod.
So theoretically, IF someone have like high end Hardware (like, GPU not sure Which factor, I'm a tech boom💀 ),does it work even better and as fast as real time?
Probably, I don't have a decent GPU. But yes, with a good gpu like RTX 4090 or RTX 5090 you can run games and voice changer at same time.
I suppose RTX 4060 Ti 16 GB could be good enough for marvel rivals with voice changer and some tweaked down settings
I think I'm the best model 
You had an old version of the program, click on the Nvidia windows download
Don't use anything from YouTube, all videos are old
What's your Mac m chip?
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
This is a General AI Server, AI has many fields
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
You should try checking in https://docs.aihub.gg/rvc/resources/dataset-isolation/#uvr-zero-gpu
Last update: May 5, 2025
This is a General AI Server, AI has many fields
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Elaborate:
- your PC GPU
- your operative system
- what you want to do
why do people say ai backwards
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
Whats your PC GPU and operative system first?
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
This is a General AI Server, AI has many fields
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
There's no such thing as 0 delay
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
Yes that's the over year old original wokada, the best one Is wokada deiteris fork b2332, not original wokada
But why is it the best? And what is the difference between the deiteris and latest original wokada?
I noticed the originals are considerably bigger in gigabytes than the deiteris
preformance I believe, as well as a cool formant shifting
I care about quality over performance. I do have good enough of a GPU to run either.
nick prob knows more and I just woke up I'm sleepy as hell
Performance and quality, fork means modified version
Alright I will give it a go. Thanks both of you @viral mason.
I tried 
Even the latest original wokada just has UI changes, not anything related to performance and quality, except some performance for Beatrice, but Beatrice models are lightweight and have worse quality than rvc, which is why nobody uses them
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link wokada deiteris fork btw
Aye cheers. One last thing actually- would I have to uninstall the original before installing this one or not?
Is a model integrated in default okada 😄 but is not so much customizable
Beatrice V2 is a type of Speech To Speech model made by Wok, they are the preset ones too for example, they are lightweight and can run fine even on CPU, but sacrifices quality
quick reccomendation before you do that, take a screenshot or screenrecording of what models you have in case u forget
Only got one haha, from Half Life 2.
I usually have over a hundred models I put into mine if I enjoy the software hehe
Eh personally I just need it for the cool metro cop voice.
oh you're about to be in luck bc I'm making half-life models soon
more like remaking
Oooh, well I think a classic combine soldier would be cool.
I was only able to find a metro cop one, and some Alyx one's that aren't really the same as the classic combine soldiers.
If you remember to do tag me! Especially if you make a pretrain model for it that I can use.
the one that sounds like this right?
I mean you don't have to, tho at least uninstall vb audio cable if you use that, vac lite is better, many users reported that vb audio cable gives issues randomly on windows
Yes true, to me vb audio cable was crack a lot
And vac fixed it
Oh yeah. Where is this? I couldn't find it.
I tried vac lite and couldn't get it to work with voicemod which is why I use that still
just look up combine in the voice models section
I did but I only saw metro cops and the alyx one's.
Let me try again though!
With VB cable is working good to you?
most recent of mine of the hl2 one is that dancing combine soldier
yup
Nice! Sometime i think depends and is not for all same error, to me while using okada it was a bit crackling, but only when i was in a voice chat
interesting
Voicemod is that subscription app right? Is it actually better than wokada? Though the only thing I'd be using it for is the Half Life stuff which I might as well use wokada for, but I am still curious.
I only use voicemod because it allows me to use custom fx without being complicated
I use it with Okada tho
Doesn't that make the delay worse? Or is the Voicemod fx pretty much a pre-applied filter aka an instant conversion to where only the wokada translation speed is noticeable.
Voicemod isn't even ai, it's just vocal effects
And yes it's paid
It gives random free effects iirc
Voicemod doesn't affect the delay on okada from what I have seen
it just puts like a filter over your voice like reverb or a deeper pitch ect
Fair.
Got an issue, note I did not uninstall the original maybe that's why? Here is the error log:
2025-07-09 15:25:01,973 INFO [main] Python: 3.12.7 (tags/v3.12.7:0b05ead, Oct 1 2024, 03:06:41) [MSC v.1941 64 bit (AMD64)]
2025-07-09 15:25:01,974 INFO [main] Voice changer version: b2332 NVIDIA-CUDA
2025-07-09 15:25:01,974 INFO [WeightDownloader] Loading weights.
2025-07-09 15:25:02,423 INFO [Downloader] Verified pretrain/crepe_onnx_tiny.onnx
2025-07-09 15:25:02,429 INFO [Downloader] Verified pretrain/crepe_tiny.pth
2025-07-09 15:25:02,493 INFO [Downloader] Verified pretrain/fcpe.onnx
2025-07-09 15:30:07,088 ERROR [WeightDownloader] Failed to download or verify pretrain/crepe_full.pth
2025-07-09 15:30:07,088 ERROR [WeightDownloader]
NoneType: None
2025-07-09 15:30:07,088 ERROR [WeightDownloader] Failed to download or verify pretrain/content_vec_500.onnx
2025-07-09 15:30:07,088 ERROR [WeightDownloader]
NoneType: None
2025-07-09 15:30:07,089 ERROR [WeightDownloader] Failed to download or verify pretrain/rmvpe.pt
2025-07-09 15:30:07,089 ERROR [WeightDownloader]
NoneType: None
2025-07-09 15:30:07,089 ERROR [WeightDownloader] Failed to download or verify pretrain/rmvpe.onnx
2025-07-09 15:30:07,089 ERROR [WeightDownloader]
NoneType: None
2025-07-09 15:30:07,089 ERROR [WeightDownloader] Failed to download or verify pretrain/fcpe.pt
2025-07-09 15:30:07,089 ERROR [WeightDownloader]
NoneType: None
Traceback (most recent call last):
File "client.py", line 22, in <module>
File "asyncio\runners.py", line 194, in run
File "asyncio\runners.py", line 118, in run
File "asyncio\base_events.py", line 687, in run_until_complete
File "main.py", line 90, in main
File "downloader\WeightDownloader.py", line 88, in downloadWeight
Exceptions.PretrainDownloadException: 'Failed to download pretrain models.'
Last update: May 5, 2025
it happens when u have a slow connection
check that to fix it
Will do.
Clicked on the link (https://drive.google.com/drive/folders/1OFfM9rmxCZkiYjxoK_yzhRbcXpt0TiJ0?usp=drive_link) to download the files manually got this:
Google
404. That’s an error.
The requested URL was not found on this server. That’s all we know.
So maybe the problem is that the site where the prompt downloads them from is down?
@analog obsidian did deiteris delete those files from his drive?
the google drive is just a backup
it might be better you retry with a better connection till he reuploads it on his drivew
because the issue is your wifi is that slow that it goes in timeout for taking too long downloading them
Would you be able to maybe send me a copy from you? I know it's a bit much to ask.
Like a copy of the pretrain models. If you have the default one's that should come with the download.
I'm not on PC rn 😭
Rip. Well guess I'll keep re-trying and waiting for a new back up link to appear.
lighthost is a free alternative
never heard of it
yo its been a long time since ive used applio on kaggle so what the heck is this o - o
no clue
why do i need to log in

you need an account on kaggle to make things lol
yea i know
just use your google acc
dude its from this
its from shirou's
what's that
this is a file browser, just add login/password in the cell that makes it
yeah but like
what kaggle space is that
applio or smth else
i just told you the dude's name :
Small update, I think I got it to install but it only opens up to the web client, I don't see a bat file anywhere to open it locally. I downloaded the 2.74 GB Deiteris':
Download NVIDIA on Windows
The lastest version as of December 7th 2024 is: nvidia-b2332 (click here to download)
Yeah that's completely normal
It opens locally
On a web user interface
Ooh. Is there no way to make it a custom interface?
I don't like it opening on Opera since I have like a billion tabs open.
Nope, it's better you read https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#why-does-it-run-in-a-browser-and-not-its-own-window for more info
Last update: May 5, 2025
You can just copy the url and paste it in another browser
Aye, thanks for the link.
Opera gx can cause issues as some reported, but not sure I don't use it
Fair, not my cup of tea, but I guess it's not a huge deal.
anyone have the voice changer download link?
do u have nvidia gpu?
Last update: May 5, 2025
nvm it's right there
lol yea the link also doesnt work for me
wait what
how to setup voice changer program?
look at that link up there
hows it going btw?
can u do a reupload or dm him?
i dont have contact with him lol
windows and you want a realtime voice changer right?
share a screenshot of ur program settings
!give-media-perms 1h @thin summit
what's your pc gpu? operative system? what do you want to do?
Seems to work all fine for now from preview. I haven't tested the Half Life voice model in-game yet though.
yeah
RTX 3070
Windows 11
I want a voice that sounds ultra realist
wouldn't you wanna sound like Homer
Well, I want it to be realistic that we can laugh/cough without it bugging, you know?
fair
if the dataset had coughing, laughing, ect it would be able to if trained correctly
there's enough simpson to make a reallllllllllllllllllllllly good model I'd think
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link, wokada deiteris fork
if u share a screenshot of ur settings i can check them
RVC has a limitation on that
so realtime voice changer for calls right?
What should be done in this case?
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
wokada deiteris fork
that's if you want a realtime voice changer for calls
it's for sure a great gpu, what do u want to do btw
this is the last version "voice-changer-windows-nvidia-b2332"
?
yes
depends which deepseek? deepseek 671b? you'd need an entire server to run that
save more and buy 5090
sure, u just need over 1k gb of vram lmao https://apxml.com/posts/system-requirements-deepseek-models
i mean ofc the 5090 would be better
for local rvc you don't have to worry at all
you can also do wokada deiteris fork with intensive games without issues
about 3d modelling and image gen, you should be fine, tho i haven't tried the new omnigen2 and chroma that @simple ore talked about
the most intensive thing that you could be worried about is LLMs
you should be able to run fine models with like 70b max iirc parameters, which is great tbh
https://www.databasemart.com/blog/choosing-the-right-gpu-for-popluar-llms-on-ollama#:~:text=Small Models (2B%20%2D%2010B%20parameters,with%2024%2D48GB%20of%20VRAM.
I have finished downloading but I don't see the program. How do I use it?
nevermind about that bit, i confused with 24gb for a sec 😭
yeah google gave me that info when i tried googling th ertx 5080 vram as i forgot 😭
It should be all good 👍
I will test it out tonight and let you know if I have any other issues though.
on rtx 4060 ti 16gb vram with 32gb ram i can run max like 32b parameter models, but they are kinda slow
nope
extra to 2.7, over that that can cause cutoff issues on some models
f0: rmvpe without onnx
output: uninstall vb audio cable it causes issues on windows, get vac lite from th eguide instead
input: microphone
chunk: 128ms
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: May 5, 2025
ofc ai is vram hungry
Aye, I will try those settings, I do have vac lite installed as well.
@timber crow btw not sure how the rtx 50 serie is rn, i hope they fixed the bugs with drivers tho, i heard some complain while others not
about programs, most programs should support the 50 serie, and if they don't you can simply update their pytorch version
I know the reduce delay but how do I disable JIT compilation and Force FP32 mode? Probably gonna find it instructions on the website yes? Or is in the advanced settings tab?
Yhup found it.
elaborate what i asked, also what program? what did u download?
advanced settings tab
The link you gave me
why ur chunk so high
set disable jit compilation on, if you want it to have slightly better performance but slightly takes more on startup
Better quality.
Takes longer to process audio and I have tested it, sounds better.
For me it takes 2 seconds.
but very long time dam'
Cheers.
what link? i just asked ur pc gpu and operative system
you should also need to tell me that
None of these settings actually noticably lower the audio quality though right?
yeah unfortunately the rtx 50 serie didn't start so much well
nope
what should ma advance settings be for rvc website version
what is index?
it's not rvc nor a website
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
It's wokada deiteris fork
anyways,
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: May 5, 2025
wdym the voice changer the advance setings
those advanced settings are suggested for everyone
i explained you the program above, read it, rvc doesn't mean realtime voice changer
"Preprocess completed in 0.00 seconds on 00:00:00 seconds of audio." what does that mean? i'm trying to train a model on applio colab
the trained accent
what's ur dataset file format? how did u make it?
so dis isnt a voice changer?
Should I open it?
its a zip file, the audio itself it's wav with like 11 minutes
it is, but it's not called like that, it's called wokada deiteris fork, RVC has a different meaning, please read it above
voice changer is a too broad term
there's thousands of programs i could simply call "voice changer"
asked the admin
you have to remember people aren't smart
just look at me
it's not letting me generate index also
What program did you use to make the dataset? have you tried re-exporting the wav via audacity and using applio's dataset maker tool?
I was just teaching them
i used audacity actually, and then put the files in a folder and made it a zip file
i've always done it like this on rvc disconnected and i've never had any problem
rvc = ai voice cloning
w-okada = app that runs rvc models in realtime
but i guess it's different with applio colab
rvc = ai voice cloning
STS ai voice cloning btw
/j
you can do it without zipping https://docs.aihub.gg/rvc/cloud/applio-colab/#b-dataset-path
Last update: June 15, 2024
for the w okada web browser version is there a way to stop all the backround air cus it mubles and sup1 and sup2 makes it really bad or do i have to use sup2 and 1
so just the audio itself?
What would you guys say are the best models out of #1175430844685484042
Best as in most realistic sounding ones xD
rough no gif perms 😭
i did so and i got the same message, 0 seconds of audio
what am i doing wrong?
when making the model, which one of these index do i have to zip with the pth?
anyone know the setting solution for a robotic sounding voice?
There is no setting solution for robotic-sounding models, if the model sounds robotic, the fault is on your dataset
Reasons can range from uncleaned noise, sfx, harsh sibilants, overtraining and etc
alr thanks leo ill keep that in mind
hi everyone, can yall help me with something? I'm trying to download applio but it keeps coming without the run_instal.bat
what am I doing wrong? I made sure my antivirus was off
You have to install it from huggingface i think you have installed from github
You welcome! This will have the .bat file 100%
run-applio.bat
it's called wokada deiteris fork
use just sup2 and echo
yes
oh wait, mine has this
but it does not has the instal one
Don't worry if you do run applio it will install
I already applio downloaded, but its not working anymore
use applio's dataset maker @young halo
share a screenshot of ur wokada
!give-media-perms 1h @nocturne helm
Try do again run-applio.bat is the command for open the interface
okii
I did now, thank you!
ill try it on the new one u sent me tho
Sure! Let me know 😄
K 1 sec
its to check ur settings and suggest better ones
@flint vapor it worked! thank you ml ❤️
You welcome! If you need more help you can always text here!
f0: rmvpe without onnx
extra: 2.7
chunk: 200
output: line 1
k
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: May 5, 2025
16xx series can't do fp16 inference, so he has fp32 enabled by default
chunk at 360 is pretty good tbh
Hello, is there anyone here who knows about TTS?
For TTS i recommend you GPT-SoVits
The new version with V2ProPlus
I’m getting an error while fine-tuning with VITS together with Coqui. I wanted to ask for help to solve the error.
what language do you need?
I am currently trying to fine tune Turkish with coqui VITS
Coqui is dead
VITS is morally outdated
XTTS-v2 has Turkish
how good, I can not tell
most new models use some kind of GPT/LLM to generate speech tokens so the output does not sound monotone and robotic
Unfortunately, I cannot use these models because they are paid, but I will take a look at the model you mentioned, thank you very much.
There is a new veersion released in June, V2ProPlus sounds good but the only problem is that languages are very few
If someone need more language yes XTTS is good
xtts is free for personal use
gpt sovits without finetuning was ass and it only supports en, cn, jp
maybe the new model is better, but anyway
Oh yes the new model work very good without fine tuning i saw
so im using a voice rn but its picking up everyones audio and idk how to fix it
use headphones as the system audio to listen and turn down the volume
if you mean irl ppl, tell them to be quiet
no in game lol
so by turning down volume would it be the monitor
or input or out
cause im using voicemod to try input in, but its just picking up all its audio
turn down the ppl's audio volume and anything on the headphones
ohhhh ok
quick question, is the RVC available for Linux?
(cuz im planning to switch to Linux).
what is the best voice changer to create a girl's voice for a video?
it doesn't have to be in real time
Yes it is but you need to install all manually
Which distro you want move on?
hmmm....
Linux Mint ig?
Anyone know how to retrain a voice model I lwky forgot
how do i download w okada for mac?
Why does it not work in discord
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
your PC GPU
your operative system
what you want to do
what tutorial link are you using
a screenshot of the program
@duckus
where is he
Does it work on its own?
then it is on Discord's side.
Have you check if Discor is using the default mice or the voice changer?
Can somebody help me identify what version of RVC I had? I can't seem to post images in here to give a reference, but I downloaded it somtime around december as a recent release. However if I look at any github links here nothing seems to match the timeline
im using w okada and i just am looking for some fun ideas for voices to try out and hoping someone could send me the link to some models
like a girl voice model or anime girl voice model
im very new to it all and was just wondering what you guys considered must haves for doing a bit like fortnite trolling lol
What's up with the tag removal in the voice models search?
Is this to do with the recent Weights changes?
@low shard he stole your joke 
glad it's gone, it attracted yucky pests
Hello, How do I get Ai model training chats?
What does my error mean does anyone know?
Exceptions.VoiceChangerIsNotSelectedException: 'Voice Changer is not selected.'
2025-07-10 00:05:58,057 ERROR [VoiceChangerManager] 'Voice Changer is not selected.'
Traceback (most recent call last):
File "voice_changer\VoiceChangerManager.py", line 212, in change_voice
audio, vol, perf = self.vc.on_request(receivedData)
File "torch\utils_contextlib.py", line 116, in decorate_context
return func(*args, **kwargs)
File "voice_changer\VoiceChangerV2.py", line 159, in on_request
raise VoiceChangerIsNotSelectedException("Voice Changer is not selected.")
Exceptions.VoiceChangerIsNotSelectedException: 'Voice Changer is not selected.'
Pls help
If you wish to continue playing Dank Memer, you need to verify your account.
To Verify:
- Click the button that says "Pass Verification" to visit the Dank Memer Website.
- Read and accept the rules.
- Complete the Captcha
After verifying, you'll be able to play again as normal. You will also receive
100 for passing the verification.
If you have trouble verifying, visit the Support Server for help.
@wise sand
haiiii why is OKADA lowing all the volumes of EVERY other program without permission
when you're using an input or output that is flagged ad 'default communications device' on either playback or recording tab
There are different UI and software that make you use RVC, Can i see a screenshot of the UI?
-ùokada
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ Here there is the guide 😄
Last update: May 5, 2025
I cant add pictures in here for some reason
Oh you can send me in DM 😄
is there any ai to extend a song locally?
hi guys, been out of the loop for 2 years, where do I find the newest models recommended for vocal separation as suggested on the aihub website?
fv4
ohh we using this rather than the local app now?
not really
i was recommended that by other people in this server but for local app i think the equivalent is https://github.com/Eddycrack864/UVR5-UI
https://ultimatevocalremover.com/ this one
Yea I have the beta version of that
Was wondering why I couldn’t find the recommended models anywhere
Turns out it’s like a separate fork?
im not sure but it does have the recommended models
because it's on the creators huggingface page
voc fv4 ckpt and yaml file is here
oh yes I remember finding that last night, no idea how to install custom models like that though
what would be the difference between this and the one by eddycrack?
👨💻 HOW TO DOWNLOAD AND PUT THE MODEL MANUALLY INTO UVR GUI?
- download the
.ckptand.yamlfiles of the model you want on huggingface - open uvr gui
- click settings icon (on the bottom left, right next to the 'start processing' button)
- click "choose advanced menu"
- click "advanced mdx net option"
- click "open models folder"
- drop the
.ckptfile into theMDX_NET_Modelsfolder (when you click "open models folder," you’re already inside theMDX_NET_Modelsfolder.). - once that’s done, go to the
model_datafolder, open themdx_c_configsfolder, and place the.yamlfile there.
thanks a bunch, will try
tell me if the uvr system ask you to set the parameters
alternatively you can clone and install this repository
not sure, i’ve never really compared them (in terms of output quality). i usually just use eddie’s uvr hf when my pc’s running heavy processes
ahh i see that makes sense
btw would you cut out talking from a dataset if you want a purely singing model?
Any updates?
I'm guessing you're talking about either mainline or Applio and yes
Elaborate more
Are you trying to catfish?
That's pretty not bad for Linux beginners
uhhh..... idk....
i just want to know if the RealTime Voice Changer works in Linux
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
idk, just wanna try it
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Lmao
🔥🔥
Jokes aside it's pretty hard for people to help without info, that's why I always ask them
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
What's your PC GPU and operative system? Which program are you really looking to use ?
You mean the removed e girl tag? It wasn't needed
I removed it myself
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Try other models, not all models are good
AI chatbots: ChatGPT, Gemini, Copilot, Perplexity, Grok, and Claude. Out of these seven, which three are the best AI chatbots for research?
Perplexity
May I ask you, why you recommended Perplexity?
Because they introduced also lab mode, and it also create you a scheme or an entire list of the research that perplexity do 😄 , but perplexity is mostly recommended if you have premium version
What about the free version?
Also the free is good but mostly only for normal research
Then in perplexity you can set which ai you prefer to use
Could you recommend another two AI chatbots for research purposes?
ChatGPT and gemini or Claude
Is there any possible way to do inference on mobile (locally) not cloud
Unfortunately local at the moment on phone is not possible
Appreciate the recommendations brother but I only need another two.
Inference too ? I think I have listed about that
Yes
Via termux
Is it possible? Yes, should you do it? No
Crazy I knew that
I'm the creator of the guide lol
Why 
I'm actually the first that ran it on phone btw
Yeah I know that but I was not sure
Sorry i thought he mean the entire gui like on pc
It can take over than 60 secs for inferencing 8 secs
If is for inference yes
Yeah, you can do that
It shows the gui
Oh nice i didn't know it can be done, but i think just inference training for now is impossible
Reasonable, I want to try that
I think ChatGPT and Claude are more precise
I got snapdragon 855 and 6gb RAM using samsung s10e
Thanks for the information brother. May God bless you.
Thank you so much! You too 😄
Training is impossible, at least from the power of the phones they were tested on, it would overhear shortly after
https://discord.com/channels/1159260121998827560/1289538710307602554 I haven't tested it in a while tho 
I would really not suggest it unless you want to do it just for fun or hare cloud
I did it just for fun lol
Yes training require more power and phones atm don't have enough, also inference i saw you said is possible but i think is always recommend to do it on cloud if someone doesn't have on pc, is very slower local on phone i think
Yeah ofc, did it just for the memes 
I want to do it for fun 
So it's basically installing ubantu distribution in termux and than installing RVC on it ?
Kinda
is a 4060 good enough for w okada?
also when i open w okada how do i close the system directory thing without closing okada (the black box that shows what’s happening)
can someone help me to setup the AI voice changer?
yes
depends on ur gpu
and cpu
Yes it is
does anyone know how to fix this issue though? it’s really annoying
i don’t trust myself to mess with things either 😭
yo wtf bro why cant i hear my voice model but it works when i talk in discord
mon isnt working
idk too
How to reduce the chunk without losing quality? 192 ís about 512ms
whats ur gpu
3070 8gb
idk
I've 192 (512.9ms, 24576) and EXTRA: 131072
yes
you don't, you can't, that thing is the command prompt, it's what is actually running wokada, the wokada in oyur bwoser is just the web user interface
vccclient is a text to speech made by the same author of wokada
I hope you're using wokada deiteris fork and not youtube tutorials, right?
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
that's old original wokada from yt tuts, all yt tuts are outdated
uninstall everything
and?
thank you; even if it’s annoying it’s nice to know i’m at least doing it correctly
https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ there is here a tutorial 😄
Last update: May 5, 2025
what's your pc gpu and operative system?
Is a guide
for amd gpu?
let's check first if the gpu is good enough
Yes sure 😄
what's your pc gpu first? there's many AMD ones
Yes there is also an amd version but is recommended to use only if the gpu is ok
what to do now?
alr good enough
read the wokada deiteris fork guide
nick tell me what to do mate
So I've a 3070 8GB
I've Windows 11
I want to use different Girl voices to troll people (Found the settings just the m/s is annoying)
https://www.youtube.com/watch?v=SxdnGxicJOg
https://i.imgur.com/09Pgl8a.png
There is a download link in the guide for amd
im dumb i cant
guessing that you want a realtime voice changer for calls
you need to read it
BUT IM DUMB
that video is outdated, it uses an over year old version of original wokada, and vb audio cable causes issues on windows..
@covert valve
uninstall everything you got
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
read the 1st link, wokada deiteris fork
If you click it will download amd version
nick could you call me?
How to fix it?
you have to read it to understand it, it's open source ai, it's not as easy as using chatgpt, the program is complex and intensive
you have to read it to atleast understand
if you dont read it, you will never understand it
uninstall everything, get wokada deiteris fork from the written guide i sent you
is there a specific part you don't understand well in the guide ?
Where are you from? I can translate it for you and send you translation
sorry but this server is english only
are you AI?
we are not ai 💔
bruh
I will use Translator and send you in DM
I didn't notice the link can you send again?
look i just want to do a setup for realtime voice changer and my gpu is amd
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
here is the link you sent
first of all, uninstall vb audio cable from the windows app settings
delete the old original wokada you got

100 for passing the verification.