#✨│ai-help
1 messages · Page 265 of 1
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay on Windows via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: July 18, 2025
lemme check rq
You wont have 0 delay there will always be some delay based on gpu
Audio server mode then using [windows wasapi] on all cuts down some more delay
ok so can you help me
i think is from here
Advanced setting: crossfade length reducing this redues delay but also redues quality, 0.10 is sweetspot but if u dont csre you can go down to 0.05
...ok?
bro
??
We already gave you 3 things to do for less delay
Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
Advanced Settings -> Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
Reduce the delay on Windows via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: July 18, 2025
yes very good
.
The chunk part
You ask about chunk then ignore what he said about chunk
Ok ignore the cable part dont ignore the rest of his message?
And this 3rd factor
we need someone to distill kimi-k2 into mythomax
yea i tried to install cu128 but when i run rvc mainline it says 120
show me how you did it
There's a cloudflare error on the Weights website
guys what do I do with the MMVC app because every time I start it, no audio at all will come out when monoring, an when recording, no audio either
I actually used the guide in the server and it worked
the voice got smoother and clear but the delays still happen , mostly when a game is also open
also when i reduce the chunk, why does the delay increase
my sound quality lowers whenever i use it through browser,
It's been like that all night long... started again but its not doing anything
i am trying to run tensorvenv however i keep on getting this message.
note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed
I am currently just following the AIHUB guide and I am now stuck here.
I fixed it. If you have numpy 1.19.5, then you want to downgrade your python. I had 3.10 version when numpy works well with 3.9 version.
Hey folks. I'm trying to create a voice model with Elevenlabs for a cartoon character, but I'm gonna want to remove background noise from most of the audio since there's ambience. Is there any specific site/service that's good for that?
(Also, how many samples do I usually need/how many minutes? And do I need a range of emotions?)
Hopefully I won't need to dig for scenes with no ambience lol
@simple ore and yet after like 2 hours i did nothing
again half hour and still nothing
so what driver version do you have?
Of?
video card, adrenalin
okay, i'm gonna update my spare pc, it will take some time
what are the best mel roformer models for isolating vocals from rock and metal?
I have some files where I’ve isolated a single character’s voice from an animation, but there’s still background music, vocal harmonies from the music, clanking sounds, and various other effects mixed in.
What’s the most effective method I could try?
Even with UVR, I couldn’t get a clean result.
bandit plus and melband karaoke (becruily) on mvsep
Is it great?
bandit will get rid of effects, melband karaoke will get rid of the vocal harmonies and music
do mel karaoke first
Hey sorry nick about last time something had came up anyway I need your help with something
@distant idol
i'm still stuck
Thanks, carnage Ill try it
lemme try with updated zluda.py
i used the original
nope, works fine too
try closing Applio window, delete C:\Users\user\AppData\Local\ZLUDA
and re-start training, it will recompile again
works fine for me
wha is this passthru
starting now
something is happening haha
well, you've deleted what it had compiled before, so it starting to compile again
see ya in 30
anyway, I've tested 25.6.1 driver with 6.2.4 hip sdk method on my old 6700xt and it works fine
Yucky scammer (it's gone yay)
Hour later
can you go inside C:\Users\user\AppData\Local\ZLUDA and see the date stamp of the file there
technically you dont have enough compile lines shown yet
there are usually more
right click, properties, see last update date?
okay, so it is just stuck for some reason
so delete the appdata, and try again?
did you install the latest VC++ redist?
where can i check?
I made one
nah i just wanna try out a girl voice
lol
see how it sounds
maybe use it in roleplay games on roblox

.
i cant send screenshot
there
why
my device stuck in Activating the VC screen
Hi, i can't verify at the moment if it is working
Which VC do you have?
Hi guys! I have a problem. I have a pretty good pc (rtx 4070 + ryzen 5 7500f) but any sound model I tried sounds very robotic and the sound quality is bad
same here but with amd gpu
and its not running on my gpu no matter what i do
MMVC
guys i cannot pick harvest i can only pick crepe_tiny and rmvpe_onnx why
русские есть?
hey, I'm new, I've never used rvc before. I'm trying to train a model for myself, but I always get the message "Unfortunately, there is no compatible GPU available to support your training". I've updated the driver to the latest version, downloaded cuda, downloaded torch, updated rvc to the latest version. deleted rvc and reinstalled. deleted the vga driver and reinstalled but it still doesn't work. My computer has: i5 14400f, rtx 5060ti 16gb. (running the latest game ready driver).
which rvc?
5000 series nvidia requires cu128 torch, depends on the software you're trying to use there are different methods
I tried 2, the first one is Retrieval-based-Voice-Conversion-WebUI. the second one is mangio rvc. however both give error "Unfortunately, there is no compatible GPU available to support your training"
anyway, which rvc is better to use? or are they all the same?
both are severely outdated
download, unzip, then open cmd.exe in the folder where you've unzipped it to and do env\python -m pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --upgrade --index-url https://download.pytorch.org/whl/cu128
it work ;D tysm

Can anyone tell me if anyone knows the website and prompt/template that was used for this gif:
https://tenor.com/view/happy-birthday-gif-4446960541527894948
It s been popular on tiktok and I wanna use it too
Please ping me if u can help
- whats your gpu
- did you follow an old yt tutorial
- whats the name of the program you downloaded, full name of the file you downloaded
- screenshot of the program
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
first link
I found it
how long should i train a 30 minute dataset
Question for pretrain. I am using mainline rvc and I filled in everything like the typical. I clicked One-click training and I only got a index file and no pth. I am doing something wrong? I have also loaded the G and D model path too. I have the used the KLM 4 pretrained. I have set the version to v2. I have made sure the sample rate was the same. Everything is in logs folder and I don't see nothing in the weights folder. I don't really understand what I am doing as the guides are very vague
do you expect "One-click training" to do some unbelievable magic instead of doing those steps one by one and verifying each finishes properly?
@viscid bluff
can someone let me know wether Im basically done training a model or if I should train it more, I can send photo of the tensorboard
i said i did everything like what the tutorial says. i created other models but not pretrained
also it says everything has finished which makes make confused
or in terminal it says it's complete.
what are you looking for the pth file?
yes the pth file i don't understand where it is
i thought it'll be in the weights folder after finishing but it's not
doesnt command prompt show a message of where the last saved weights pth file was saved
I clicked one-click training and I also can't send pictures
It says it's in my logs folder which is where the index stuff are
pth files are produced every particular epochs during the training session
I'm only getting index things every time time I click the one-time training
How do I start a training session with a pre trained model?
I thought it was the one-click training button as it is for other models when training it with the default base models
As I said I have done all of that. However I do not get pth file at the end
I did ask you to show this screenshot
what do you have in logs/ your model name?
and in assets/weights?
the chipsterling50v was a model that i have trained before. it's not the same one as the one i am trying to train with the pretrained model
and what is in train.log?
indicates you trained nothing
\
just click train model
how do I train it then I have clicked everything single thing
i have cliekced both train model and onen click training
it is perhaps the pretrain model needs more sounds?
this is why it's confusing me currently
what do you see in the terminal window when you click Train Model?
how many slices are in 0_gt_wavs?
hmm it's saying
FileNotFoundError: [WinError 3] The system cannot find the path specified: 'C:\Users\Local User\Documents\RVC/logs/mute/0_gt_wavs/mute40k.wav'
that will do
are you running it as admin?
nope
but lt worqed flrst tlme
sorry lf am belng dumb thls ls my flrst tlme
try this version that shouldn't have that kind of issue
Last update: July 18, 2025
sorry but can you tell me what to do exactly
I have a 5070 and for some reason my audio sounds bizarre I have heard the audio sound wonderful on worse gpu's
hey you now any vldeos?
that show lt out
@steel forge
for trolling or catfishing
can you help me please?
nope i have no idea
anyone that can help?
cuz lt worqed flrst tlme
second restart lt stayed lqe thls
it says ur missing a file
ban teh dude under me scam btw
you now the dlls needed?
nope
all that ik is thats the error
alr lg l cant use lt
can you help me man
where dld u lnstall lt
wlch verslon
l meant wlch verslon of hte volce changer
can u send lt?
@north lava
for some reason in discord it repeats my normal voice and also the ai voice, how do i fix it?
in game also
what is an index file
guys when i speak into okada it doesnt pick up my voice
like its radio silent
how do i fix that btw i tested and my mic is still working
@low shard
Does training wear down gpu?
hello, im trying to make ai cover of tokai teio singing through patches of violet, but at the chorus she sounds like she ran out of breath, how to tune?
How do it use the json and index file of the voice model i downloaded?
Send screenshot of the program
she might lack of stamina or need some endurance recovery skill you could get from some support card
- make sure the input voice has been isolated & denoised properly
- if the model struggles at that pitch level, try tuning pitch shift or try another better model
Which voice conversion software should I use if I want to preserve emotion, significantly alter the original voice, and interfere with it in real-time on an average model using a Tesla T4 GPU? I only intend to use it for talking, not singing..
if it keeps overheating and doesnt have proper cooling, airflow or thermal paste
esp laptop gpus tend to overheat easily
pth & index file are used
index file isnt mandatory but helps reproducing accent
Do you know where I can find this same server but it's from Brazil?
↑
continuing your discussion, here is guide & download for the deiteris fork
Last update: July 18, 2025
Try getting deiteris one maybe it will fix the issue
Its only 3 gigs and you can create a thread in #1192011222023950368 if you have furthermore issues
Yeah you need to use deiteris one
That versuon didnt work for me too
So i used deiteris
Np
. ↑
sorry but we dont support the og version, only deiteris fork
the fork one is argued to have better quality, which could solve ur problem
I reccomend the nvidia one which is 3gb
Yes hold on
I reccomend going to site and finding this link and click on it to download deiteris fork for nvidia gpu
@untold socket
After download Extract the zip and open file named MMVCServerSIO.exe
np
np
.↑
. |
np
dml is for AMD/intel gpu, and cuda is for Nvidia gpu
yeah
just click this link since you use rtx 3060 this link is good it will automatically start download
brb soon
back
Which voice conversion software should I use if I want to preserve emotion, significantly alter the original voice, and interfere with it in real-time on an average model using a Tesla T4 GPU? I only intend to use it for talking, not singing..
I dont know how tesla t4 gpu's work but i heard its from nvidia right?
Yes
i don't know if it can preserve emotion after the original voice but we have 1 that can change voices though
I’m Guys I am Alex – a creative wizard with 15+ years of experience in design, development, animation, and more. 🧙♂️✨
These days, I’m deep in the world of AI tools and TikTok content creation, helping people grow and monetize their profiles. 📱💸
Whether you’re just starting out or looking to scale fast,
I’m here to share what works and support your journey.
💬 DM me anytime – I’m always happy to help!
What is it?
yo any good alternatives for w okada? or any good forks?
its a voice changer named deiteris fork
Thanks
the site for it is in https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ you can find download link there i reccomend downloading one for nvidia
Last update: July 18, 2025
Can I just git clone?
is deiteris the same as the rvc client?
this is the best one
ty
The best option?
yeah atleast thats what i find best
wait how do i extract it?
it's a .002 and if i change it .zip it doesnt work at all
am i missing summin
nvm i dont like to read
its ok boss
im gonna read the tutorial
i dont wanna piss yall off
thanks boss
are you struggling to find download link?
nice
my bad lol
its okay
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
are you looking to do e girl trolling/catfishing?
are you looking to do e girl trolling/catfishing?
i need help
When i open my start_http file it doesnt load or do anything the first time i did it it worked.
start_http is apart of the old original wokada, you deffo followed an old youtube video tutorial
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
C:\Users\pible\Downloads\Ai Voice Changer\MMVCServerSIO>MMVCServerSIO.exe -p 18888 --https false --content_vec_500 pretrain/checkpoint_best_legacy_500.pt --content_vec_500_onnx pretrain/content_vec_500.onnx --content_vec_500_onnx_on true --hubert_base pretrain/hubert_base.pt --hubert_base_jp pretrain/rinna_hubert_base_jp.pt --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt --nsf_hifigan pretrain/nsf_hifigan/model --crepe_onnx_full pretrain/crepe_onnx_full.onnx --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx --rmvpe pretrain/rmvpe.pt --model_dir model_dir --samples samples.json
this is the link
wait it worked now
sorry i was impatient lol
that one is outdated asf, it uses an over year old version of original wokada, and vb audio cable creates issues on windows
delete the zip, folder and uninstall from windows app settings
also, i saw its about ai girl voice, are you trying to do e girl trolling/catfishing?
no
the one you're using is OUTDATED, don't use it
im just learning to download the voice changer
send me one
what's your pc pgu, operating system, and what do you want to do?
like:
- ai covers
- tts
- e girl trolling/catfishing
- roleplay with realtime voice changer
i just want to have fun with the voice changer i am nvidia gpu 3060 and my operating system is windows
i wanna do some random voices with the thing lol
windows 11, right?
also im guessing voices like spongebob or villagers lol
yeah windows 11 pro and i might do those
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
you can either use wokada deiteris fork or vonovox
they are both the best, the only difference in your case since you got nvidia and are on windows, is the User Interface and that vonovox got paid voice effects like low quality mic
fuckkk
anyone know why the voice model i cloned sounds a bit static and a bit off? It can pick up some stuff and replaces words with other words like how are ya -> brocolli
Hello, to understand more your issue and know how to help, Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
make chunk higher
set extra to 2.7
try 1000
okay,
idk if he was catfishin or not
he didnt specifiy
ohh
sorry didnt notice
😭
Hello, do you need any help?
when I run on kaggle it occasionally stutters (I followed the guide and put everything higher/lower than recommended just to make sure it was not too heavy on the gpu). Besides, on the draft session info, everything seems good except the cpu is at 100-120% (I made sure to use gpu to run the vc). Is something wrong or is it supposed to be a bit stuttering like that?
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
voice changer on kaggle would have some latency, that's internet for ya
well yes that's true because of the internet connection,
but let's check if they are using the updated one and also if they meant latency or other issues like cutoff since they said stuttering
- 3060 rtx but on kaggle I chose T4 x2
- windows 11 latest
- find out why it stutters randomly
- the most recommended one in the guides channel
- I turned it off but I will boot it up if you still want it
3060 rtx
if you got an rtx 3060, you can do it locally, is there a reason why you're using cloud?
everything is perfect when I use the local version so nothing to say
ah yeah, sometimes I want to play gpu intense games
could you please tell the games name, lower the graphics to the lowest 1080p 60fps, then show a screenshot of your local wokada deiteris fork settings ?
just to check if maybe you could do it locally
I have been using it locally since the begining. But I also want to sometimes play heavy games
something like the finals, control, black myth wukong, etc... Those are pretty intense even on low
I tried with the finals, it became unusable when in game
ohh, so you already tried? even tried asio/wasapi with disabled jit compilation?
but yeah you're right they are pretty intense, i was just checking if maybe you could try but if you don't want to, you can just share a screenshot of the kaggle settings and check if there's anything you can adjust
everything execpt asio/wasapi as the noise was interfering too much (my mic is not that bad)
i see, maybe let's check if there's any kaggle settings i could help you adjust?
not a pro but I know some tricks and played around the setting. Either using cloud or run on a different device lol
about asio/wasapi?
i meant other things like disabled jit compilation and the extra, since you said you have noise issues with wasapi/asio and you can't fix that unless you use a 3rd party tool noise reduction
I also did use a different machine so I can play heavy games (it worked). But the other machine spec is kinda meh so I got bit greedy (used to the performance of 3060
tried them all. Shouldn't I keep extra at 2.7s? I want to keep it at the best quality possible so I don't really want to reduce it
alright nvm then, i mean you could keep extra a bit lower for less delay, but i understand you want the best quality
either you can try sacrificing a bit the quality using a lower extra and fcpe as f0, or this might be related to connection issues, since on cloud there's more delay
oh I still want to ask you about kaggle. When one is using it, should the CPU be at 100-120% constantly? Is that normal?
followed everything and pretty sure that everything is set to the GPU
Is that the reason? If not then maybe like the other guy said, the internet introduces some stuttering
Is 24 GB of storage enough for https://github.com/deiteris/voice-changer?
the voice changer is ~5-6GB
- models
Hello lovely peoples! I'm a beginner model maker and wanted to know what sample rate does in applio voice training
other than the RTX 3060 rig, what is that spec?
But for the full thing with dependencies too.
about 5-6GB
you installed the vac trial, not lite, it might be you didn't follow the right guide
what's your pc gpu & operating system? what do you want to do?
fixed it thanks
Do you may want me to check your settings? You can send a screenshot and I can help
nty
nvidia 1650 4gb (gaming laptop)
it should work as bare minimum unless you need lower delay for competitive gaming
hi, which rvc is good now? i just need voice quality and not performance. (rtx 5000)
and how many epochs should i train the model for? my audio file has more than 300 pre-cut wav files. their total duration is 1 hour 17 minutes
thats a very powerful Geforce
Please Elaborate:
- your PC GPU (like yes rtx 50 serie, but which?)
- your operating system
- what you want to do
- what tutorial link are you using (if any)
- a screenshot of the program (if any)
also, be aware that RVC means Retrieval-based-Voice-Conversion, not realtime voice changer
there's no right amount, the only way is using the tensorboard
rtx 5060ti
win 11
i want play game with my friend and share model
no tutorial
no program
i want play game with my friend and share model
so, use realtime voice changer right? not train models? because those are 2 different programs and things
yes. i have trained my model but i dont know what software to use for real time changer. i have trained with applio
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
you can either use wokada deiteris fork or vonovox in your case, the major difference is the user interface, and that vonovox got paid voice effects like low quality mic (like bitcrushing), but you can check their pros and cons if you want
Why is this?:
w-okada voice changer ERROR: Could not find a version that satisfies the requirement faiss-gpu (from versions: none)
ok tysm
Please Elaborate:
- your PC GPU (like yes rtx 50 serie, but which?)
- your operating system
- what you want to do
- what tutorial link are you using (if any)
- a screenshot of the program (if any)
please let me know!
It's google CoLab
could you please elaborate everything i asked?
google colab is just a cloud computing service for people with a bad pc, there's thousands of notebooks
Sure:
- My PC iGPU: Intel Skylate Intel HD 520
- Windows11
- Run back w-okada VCC since I need it for a job and last time I used it was months ago
- I used an old collab link but I lost it so I just googled a new one (Also used github repo for storage instead of google drive)
- For screenshot It's not present right now since I closed the tab and only copied the error. But I will bring it within few minutes.
Also we can't even attach screenshots. But here: https://i.imgur.com/PciCTTN.png
You also want the link of the colab project ?
yes, it would help
thanks for the info, even tho i dont really understand what you mean by for a job, do you need to:
- tts
- ai covers
- e girl trolling/catfishing
- roleplay in games
- roleplay in vc
- etc?
Asking since there's different programs for each task
Need it for voice acting
you sure you need it realtime, and not on pre-recorded audios?
I can either do pre recorded as I used to do, or real time. But ofc I prefer pre recorded.
then you're using the wrong program
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
What else is available then?
Wokada is only for realtime, you'd need Applio instead
also that is an outdated version of original wokada lol
yeah it work great, just sometimes I leave it at the office so I was wondering if cloud is a good option...
Applio colab, read up the guide and let me know!
Well I still would need a vm to test it on like google colab.
it runs on the cloud computing service google colab, it's the same service cloud, just different program
welp...
I used the colab version lol
Yeah, use the applio colab version
No train things
Actually after downloading pretrained models somehow dosn't show under the models list
well I didn't watch it so idk.
i mean the title
and the text
train things = stranger things
it sounds similar
oh wew
Also that pack wouldnt even work on Skylate GPU's
or any intel gpu
it's only for AMD as I see
What really is Sample rate when regarding training with v2?
Ok finally I found the original w-okada colab version
Oh god the original okada
gotta admit I'd only recently figured this; but the og 'Kada always gave me this unexplainable autotune when I talked. Deiteris fixed that 🙏
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
GPU: NVIDIA GeForce RTX 3060
64 bit
Just want to troll in game with it
https://www.youtube.com/watch?v=SxdnGxicJOg&lc=UgzGz88-UBa6zY_UGn54AaABAg
very powerful aaah
Pretty much only Nvidia users (for now) get to use the good ai voice changers
what?
i mean our geforce its too powerful
ah so i cant use it cause i have a 3060?
Vonovox might get availability for amd users at some point, the creator even said he's working on getting a card for amd
no
i mean
its very powerful and thats good
very very good
perfect
🗿
ah i see, ty!
is there a one step way to generate audio using tts and an rvc model
I was looking to use the kokoro tts model
don't use original wokada, you shouldn't even use wokada for pre-recorded audios
use the applio colab, read it
the original wokada colab is broken, and it's not the right program for you, use the applio colab
that video is outdated asf, dont use youtube for realtime voice changers, it uses an over year old version of original wokada, and vb audio cable creates issues on windows
delete the folder, zip and uninstall from windows app settings
Just want to troll in game with it
are you looking to do e girl trolling/catfish?
ive legit edited the kokoro local tts gui code to play the sound to my output device instead of just leaving it as an output file then routing it through to the voice changer
which now that I think about it seems extremely inefficient
no it's not related to that
It dosn't even work '-'
the applio colab works, could you elaborate the issue?
Downloading pre trained models wont show them on list. And even using it's custom path will give runtime error
pre-trained models are only used as a base for training models, are you looking to train models?
oh bru..
Are all models rvcs btw? So I use #1175430844685484042
😭 if you only want to use models on pre-recorded audios, dont touch any pre-trains
not all, but most yes, just be sure they have the RVC tag
duh only 10 results
there are over 10k rvc models, try restarting your discord
might be there's a lot of post forums, you could also try using https://weights.com/models
Can you send me it?
which amd gpu?
which windows version?
and what do you want to do?:
- ai covers
- tts
- e girl trolling/catfishing
- roleplay in vc
- roleplay in games
any updates?
?
It was from the original github repo. But the english one?
yeah, it's outdated, wokada deiteris fork (modified version) b2332 is the one with the best performance at the moment
they should update links on the md file lol
the original one isn't used anymore
I see
This is a General AI Server, AI has many fields, so we can't know your issue
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Hello guys
!give-media-perms 1h @stoic frigate
start the program and used just like today before and the previous days
what program? there could be thousands of AI programs
guys i want to switch from gpt api to my own local model and lower the latency time and the accuracy should be similar to gpt, so i need to finetune a model. which model should i choose for this? like should it be smaller like distilBERT,Tinyllama or some few 8b parm models?
oh you're using original wokada, it's not suggested anymore, you deffo used a old youtube tutorial
delete the zip, folder, and uninstall vb audio cable from windows app settings
vb audio cable also creates issues on windows
Well bruh Applio dosn't even work
How am I supposed to show the models under Voice Models menu ?
send the tutorial link you're using, i feel like you're using the Novision AI Girl voice tutorial
I even set them into: ..\ApplioV3.2.9\rvc\models\kenki
you shouldn't use that, it got less performance and quality, it's outdated and as you see its more bugged
because you need to download the models first
And this?
did you do as in https://docs.aihub.gg/rvc/cloud/applio-colab/#1-upload-voice-model ?
Last update: June 15, 2024
I didn't use it on colab since it dosn't work with Drive
Oh I see. It does install it on /log wow
I installed them into the rvc/models path
Uhh... Isn't the model supposed to use the GPU instead of the CPU?
what? you need to use colab, your gpu is old
you can't do it locally
Not old but not supported (Intel)
This is why we wanted VCC instead
you shouldn't use local, you can just use the Applio colab, original wokada and applio both have colab versions
Had to use rmvpe algo for it.
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
come to the vc
@low shard
I need help with voice changer on colab
I can't VC, it's better you elaborate what I asked
Please Elaborate, those are crucial infos:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
how to add my own voice or i cant?
you'd need to train your own rvc model of your own voice
i hope you didn't come here from yt tuts, they all old
for what?
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
depends on which thing you want to do, there's new programs for different things
yeah don't follow the rvc mangio or rvc gui tutorial you see on youtube, please answer the infos i asked u
uh idk the gpu
windows
ai cover
tutorial outdated
and idk which program
uh idk the gpu
that's the most crucial thing for ai
You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
also I'm guessing windows 11
yeah
um ok but why does gpu matter
why does gpu matter
yes, A LOT, AI is intensive and complex, this one runs on your hardware, it isn't a 1 click company product like chatgpt that runs on a remote good pc cloud server, it's open source and driven by the community
alr, let me know
Hey, how do I run this: https://github.com/deepbeepmeep/Wan2GP, on my laptop which has these specs:
- CPU: Intel CORE i7 12th Gen
- Memory: 16GB
- GPU: NVIDIA GeForce RTX 3050 Ti Laptop GPU (4GB VRAM)
- OS: Windows 11
Hello everyone, I have a question: What is the best program for making AI covers and which is the best for creating a voice model? I'm unsure which one to use, and I want to do everything locally.
It doesn't have to be the best, just the one that people usually use the most.
what's your pc gpu and operating system?
i don't think you can run it
CPU: Ryzen 5 7600 6-Core Processor 3.80 GHz
GPU: Nvidia GeForce RTX 5060 8GB
Memory: 16GB 6000mhz
OS: Windows 11
Really, even though that is already the super optimized version?
I really have to upgrade my current setup
well i didn't try it, but in the readme it says that there are models that go as low as 6gb vram, and you got 4gb vram
https://www.youtube.com/shorts/Y2tzig9aty8
what type of AI is used here?
8gb vram is the bare minimum nowdays, you got half of that
@upper narwhal also let me know for any issues
I think it would be better to use Applio
not sure about rtx 50 serie support on mainline
No problem, I'll test both and see which one gives me the best results. Thanks for the help!
which one gives me the best results
they should give the same results, the major difference is just Applio is easier and a bit more optimized, the quality is the same, because the original rvc devs left it to rot in 2023
Fortunately, my country, Malaysia, already released a laptop with an RTX 5090 GPU that has 24GB of VRAM, but it’s priced at a whopping MYR29,049, which is equal to $6,886.91
But it’s too expensive for me
My current setup costed around, ~MYR5,699
not without a bunch of code changes
shouldn't it be possible with just updating pytorch?
no
you can bump pytorch, but then you need to go and change all torch.load() calls
what are these accounts that just put images
are they stupid
I ain't clicking that shit
mrbeast scams 😭
what if they use it from source? is the main branch updated for that atleast?
do you know what they even do? is it just an image or does it contain like a virus orr
no virus, u just open it, there's an image saying smt like "mr beast is doing twitter giveaway go there"
most pointless scam accounts ever 😭
dont think so?
having their image perms disabled is somehow safer
You know? I created an easy Python package manager for Windows only, that helps manage your Python environment, installs packages, search PyPi, remove packages, etc…
For full details, go here: https://github.com/ROCKYANDTHEPAWPATROL20/PENVCreator/releases
Of course, the release’s description is generated by ChatGPT
(and the rest of it? 🙂
Are you mentioning me? If yes, then the code itself and Markdown all generated by AI
I have recently updated the software: https://github.com/ROCKYANDTHEPAWPATROL20/PENVCreator/releases/tag/penvcreator2.0
hii i wanted to use the w okada voice changer to troll my friends but its stuterring and has a big delay why is that so?
ryzen 7 5800x
rx 6650 xt
windwos 11
32gb ram
troll my friends
are you looking to do e girl trolling/catfishing?
not e girl just generally use a differnet voice i am a girl haha
like a troll voice
share the tutorial link u used
i just installed it with the github informations and by searching the chat
i uninstall and reinstalled it again and it worked for like a couple minutes but now it doesnt again
also there’s a bit of static/noise in my microphone even when i dont talk. can that cause probelms?
@hollow sparrow
yo im here
so ur saying its using your cpu instead of ur gpu?
yea, i selected gpu 0 dont know what that is but i checked task manager and all the powers going to my cpu
u gotta change the cpu on okada to ur gpu
wdym
is okada open?
send a ss of okada
where dms
u can send it here
okadas like the actual voice changer right, sorry if im slow just new
yea
where can i get a new one
type -realtime on bot commands and click the first one
its a guide
oh alr
also what's your pc gpu?
rx 580
hm?
@low shard i needs help im at outside rn
Does anyone have a google colab link for ai cover?
what causes my mic to spike? like there's just occasionally a word that's super sharp whenever i use an ai voice. or is this just a 'me' issue?
look at loss_avg_50 charts instead
whats the best app to merge voice models? Just been using wokada fork to merge. I also assume it’s not possible to merge models with different sample rates right?
hello everyone I just downloaded and started to use W-Okada Voice Changer with the install version of vcclient_win_std_2.0.78-beta.zip ... I own rx9070xt and was hoping to get faster realtime output voice changing. But results were so chopped and unlisteneable. I wonder if its about the version I installed or the Chunk/Extra settings I have... For example only avaliable range to have real time voice is usually Chunk value of 72000 [1.5 sec] with Extra 144000 [3 Sec]
I am using 5080 laptop version, windows 11, i want to reduce delay betwen me speaking and the ai voice changer outputing it
Can u share the link? There's different programs
Do not trust yt tuts, delete the zip folder and uninstall vb audio cable from windows app settings
What's your PC GPU, os and what do you want to do?
Did you first check if your PC GPU is enough?
i said that an hour ago o- o
I use phone,not pc
u gotta use pc to use that
Wokada deiteris fork, and no not different sample rates
oh its colab its a bit difficult to use tht on mobile
I'd suggest weights.com In your case since it's mobile friendly
weights - a bit buggy on mobile :p
Huh? Works fine for me
its buggy for me cuz my phone is literally 3 years old
You encountered bugs? Would be better to report them in discord.gg/weights
What phone do you have?
i thought u knew o - o
breh
You're using outdated original wokada and prolly vb audio cable,
Delete the zip folder and uninstall vb audio cable from windows app settings
What do you want to do?:
- ai covers
- TTS
- e girl trolling/catfishsing
- roleplay in games
- roleplay in vc
What do you want to do?:
- ai covers
- TTS
- roleplay in games
- roleplay in vc
What tutorial link are you using? Share a screenshot of your settings
If ur phone is old and always lags every second, maybe it's just better to change it
eh screw it im risking it 
sir, can i ask if you know how to make the voice less chipmunk ?
wdym?
are u using okada or smt else?
o
well i don't use vonovox @low shard u can do this 😄
roleplay in games and vc
hey, i played a game with realtime voice changer with my friend and it was quite interesting, now i want to try livestreaming with rvc is it possible? for female voice my sister is a vtuber so i can totally ask her to record for about 2-3 hours or can i get her voice on stream to make a quality model? i also asked grok AI and grok said i need both my original voice and my sister's voice as "base" and "target" is it necessary? to train a quality model how long does it take to train data. i will prioritize voice quality and not care about how long it takes to train the model or if it is harmful to the pc. my current pc is i5 14400f, rtx 5060ti 16gb, 32gb ddr5 6000. with my configuration can i train a high quality model? and for my needs which rvc should i use to train the model and which rvc to change voice in real time?
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link wokada deiteris fork, read it
reply me pls
is this still working?
umm i need for my friend
what pc gpu does your friend have?
rx 580
that's bare minimum, is he here in the server? what does he want to do exactly?
also asked grok AI and grok said i need both my original voice and my sister's voice as "base" and "target" is it necessary? Or do I just need to train my sister's voice? For best quality, how much data do I need to prepare?
for discord only
no, you just need to train the voice, chatbots don't know about RVC, use the docs https://docs.aihub,gg
should work, id suggest him to join here if he got any issues
the issue is he can't open that application
tell him to join here, would be better he elaborates why he can't open it
How much data do I need to get the best possible voice quality for livestreaming?
@drifting ether
I'm not on my pc for now
alright
Cant rn, and want it to vc with friends and use female voice (trans)
Ohh, that's great!
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
be sure you're using either wokada deiteris fork or vonovox with vac lite, and not original wokada with vb audio cable
Hello, I establish a small hospital, I wanna making healthcare software using AI. so identify illness with voice. is it possible?
already doing that
using wokada fork
b2332?
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay on Windows via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: July 26, 2025
it would be better u also show a screenshot of ur wokada deiteris fork later
uhh i am on trip rn will be back in likee
4 to t days
u good at that time?
hey i have a question when i use that owakada voice changer thing sometimes i can hear other ppl thru my headset and it changes there voice too
it only happens sometimes
i have to restart my pc to get rid of it or am i doing something wrong?
It means your microphone is picking up your headphones volume, so reduce your headphones volume, move mic further away, move the slider for noise gating under the F0 option further to the right
that's an over year old of original wokada, it's outdated asf
don't follow yt tuts, don't use vb audio cable it creates issues on windows such as crackling
delete the zip, folder and uninstall vb audio cable from windows app settings
please elaborate:
- ur pc gpu
- operating system
- what you want to do?:
- AI Covers
- TTS
- E girl trolling/Catfishing
- Roleplay in call
- Roleplay in games
Hi, sorry to barge in, I'll ask again:
GPU: 3070 Laptop
OS: Win10
What I'm trying to do: I'm trying to get into streaming as a hobby after work. I just dont wish my voice to be recognized, while not making it obvious I am using a voice changer. Not robotic, or too high pitched, just another human. That's the bare mininmum - preferably I have a character I can play as the voice. I am male and it'll also be a younger male.
can someone help me get the latest version i have an nvidia gpu and windows the only versions i see is a year old
dont follow yt tuts they are all old, tell your pc gpu, OS and
are you looking to do e girl trolling/catfishing?
tiktoks i have a 1070 and a i7
tiktoks
like e girl trolling/catfishing catching pred tiktoks?
and what's your OS?
no just singing i have windows 10
just singing? then you're using the wrong program
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
i was gonna use it for normal talking too for background talking in videos
i think what you need is inference (use models) on pre-recorded audios
not realtime voice changing
RVC doesnt mean realtime voice changer
i know that i need to figure out how to use both
you don't need to use both, it's just a waste of time, wokada isn't the right program for what you need to do
are you looking to do e girl trolling/catfishing?
yt are all outdated, never trust yt for RVC nor Wokada, AI moves at sonic speed
im talking about suggested models from the voice models tab
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but limited time):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU
- RVC-AI-Cover-Maker-U Colab & Kaggle: Automatically separates the vocals and instrumentals, converts the voice and mix all together back
Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui colab/kaggle
Easiest cloud: Ilaria rvc zero
Easiest local: Applio
I gave you help for what you need, or do you actually need to use AI for something else as e girl trolling? I think you're confusing programs
no i dont need it for e girl trolling im literally a girl
why do you keep saying that
YouTube uses outdated stuff if you wanna use buggy broken stuff go ahead
A lot of creepy losers with no life come here to download the voice changer for that purpose
It's way too common
oh wow
then you can read the guides I sent you, it will help you understand how to use the program
Hello, Im trying to use weights' AI voice model in RVC AI cover maker, but it seems like there's no way to copy the direct DL link ? I can't right click the DL button. I wanna paste it into the url text field to download instead of having to download the model locally to my laptop then upload it again
I wanna paste it into the url text field
you can't do that, the only way is you have to download it on your laptop then upload it yourself
Oh, but iirc there used to be such a feature, why was it removed ?
I used to be able to copy the direct DL url of model from there
how u fix bad ms
nope, there was never for weights.com, there is (and still is) only for HuggingFace.com (a general AI platform), those are 2 different sites
show a screenshot of your program
!give-media-perms 1h @iron venture
and also tell what u doing, like playing marvel rivals with it
Back then it was called weights.gg I think, but whats the reason for not allowing a direct DL link ?
Back then it was called weights.gg I think
nope, I'm there before even weights.gg was ever a thing, it was never possible, you had to always do the manual download, and you can't do anything about it, they do have things like login so other sites don't just scrape them iirc
Oh, that makes sense, thank you
are you using it in a game? you said you wanted to sing for tiktok live or smt ?
which program are you using it with, that matters a lot
Btw, is there any AI now that can do remix or mashup of input audio(s) ?
im in a discord call with my friends its still high ms
No need to ping me, I saw your message above that you deleted it, and no I'm not aware, i don't think there is one
maybe try like suno, not sure
set f0 to rmvpe without onnx
extra to 1.5
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay on Windows via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: July 26, 2025
Btw, when using RVC AI cover with input song of male voice but model is female singer, am I expected to change something in setting like pitch or just leave them at default ?
dont expect low delay, your gpu is ancient
try playing with the pitch
yea true
Thank you, btw in AI art the strength of the model is controlled by CFG I think ? So in AI audio, how can I strengthen the effect of the singer's model to influence & make the output sound more like them singing ? Is this possible ?
those are 2 completely different AIs lol
lmk
🥲 Sorry, I don't understand the technicality behind these AI stuffs
But is there a way to influence the strength of the AI voice model ?
nope in RVC, the only thing similar to what you said is just the index ratio, where you can control the influence of the trained accent
Oh, the accent, I think I wanna give that a try aswell to see if it gets the effect I want
Is it this option ?
yes
lol idek atp it was better a couple hours ago
you sure that's what you want to do? you were talking about tiktok tts before #✨│ai-help message, this isn't the right program for you, why you using this one?
my friends using it while talking to me im gonna use the same thing
works fine for them
i fixed it
what? that's different from what you asked
What are your personally favorite/ recommended AI voice models ? I might find precious hidden gems I never knew existed before this way
just check the voice models tab
is it bad i wanna just do whatever i want with the client? i dont have a set thing im just trying to have fun with it?
it was even in his name 
16.0 GB (15.4 GB usable) - AMD Ryzen 5 PRO 4650G with Radeon Graphics 3.70 GHz working here ?
@low shard Do you know how to save storage from training. When I train models from applio noui, it fills up my storage automatically
working for what? what do you want to do?:
- ai covers
- tts
- e girl trolling/catfishing
- making rvc models
- use rvc on pre-recorded audios
clean up storage from your drive
be sure to train with save only latest on
leave autobackup cooldown to like 15
Enable cleanup if this is your first time training a model and you're not resuming
I hope it works
I have no idea since I'm using kaggle but ig it doesn't overwrite G & D files in Drive storage
so you let it store in the colab internal storage
What about fast training in applio no ui?

Yo is there anything like this live voice changer but it changes from like English to a different language
This one's a tricky question. Since it involves the math part of AI.
Everytime I study the math formulas that make these functions possible I can never fully comprehend how, lets say writing a prompt is the same as a math function that is shown in a paper...
Or how an image diffusion model turns noise into an image and is also explained by a math function... You can say that I did not grasped all the pieces of this part.
Would any of you guys could give me a better example on how to understand these?
did anyone ever train on f0: FCPE for W-Okada?
I heard FCPE is lightweight for Okada
but i don't know the Quality From FCPE train to FCPE Okada, not yet tried that stuff
who would train on fcpe when rmvpe exists
i already did the all steps but why my mics doesnt turn on
what is the best girl model for vietnamese voice
For a Vietnamese voice model, try find one in #1175430844685484042.
Can someone tell me what the best most accurate ai juice wrld voice model is? Or is there any such thing as premium models that people sell that are alot better?
Can someone tell me what the best most accurate ai juice wrld voice model is?
There are thousands of rvc models, the only way is to listen to the demo and try it yourself
Or is there any such thing as premium models that people sell that are alot better?
Nope, paid models are not allowed
Are you trying to do e girl trolling/catfishsing?
This is a General AI Server, AI has many fields, so we can't know your issue
Please Elaborate:
- your PC GPU
- your operating system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation) - Sharrnah/whispering-ui
Hey, so I would like to convert a recording I made into a different voice for a hard techno song I'm currently making, I have a gtx 1660 super and I'm using Windows 10, what program should I use to convert my voice using another voice model?
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but limited time):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU
- RVC-AI-Cover-Maker-U Colab & Kaggle: Automatically separates the vocals and instrumentals, converts the voice and mix all together back
Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui colab/kaggle
Easiest cloud: Ilaria rvc zero
Easiest local: Applio
Let me know for any issues!
Thanks! I'll give those a try
do you know if there are any "neutral" rap voice models that are available?
I need a rap voice but not of any famous artist
if that's even a thing
Ehhh, RVC AI model need to be trained on a specific voice, you could try searching for less popular ones, or try merging/fusing models in applio (but honestly not many people use that feature)
hey
i was wondering can i run w okada deiteris fork on GTX 1070 gpu
i currently have the RTX 3050, will the GTX 1070 have much worse performance?
You can run W-Okada with GeForce GTX 1070. However, GeForce GTX 1070 would have worse performance if compared to GeForce RTX 3050.
and why is that? the power of the gpus is very similar
the gtx card is a bit stronger in power too
That's an expected question. While some claimed that GTX 1070 would outperform RTX 3050 over gaming at certain situations. The thing is some reported that some GTX 10/16 GPUs couldn't run W-Okada with a graphic demanding game at the same time without lowering the settings.
1070
3050
3050 is 1.5x faster in fp32 and 100x faster in fp16
what voice changer should i use for a 7800 XT?
i've been using w-okada fork for a while now
it kinda sucks? or it just doesn't really give me what i want, the breaths are broken
thanks!
There's a better W-Okada. The old original one is broken.
i think i have this one
you got wokada deiteris fork b2332? (specifically this version since it's the latest)
and vac lite?
you can show a screenshot, so i can also check your settings/advanced settings
show a screenshot of your whole settings & advanced settings
there's nothing new for non-nvidia users (There is Vonovox only for Nvidia users), also i feel likke this might be related to your settings or models
eh
might be the models, i'm just looking for realistic ones
your normal settings are fine, but you can adjust the advanced ones
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Crossfade Lenght: Controls how smoothly the AI stitches different processed parts "chunks" of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
- Reduce the delay on Windows via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
also, try other models, and remember that RVC models got limitations on non speech sounds, such as laughing and screaming
hmm, i'll try that
I am going to try and create visual art (image or video) with the help of prompt and AI
is StableDifussion good enough for it?
I have integrated graphics so it takes time but with StableDiffussion I can generate pack of 20 HD images in 40-50 minutes (some get's blurry or unfocused)
help me out with this one
I have integrated graphics
you shouldn't even do it locally then
https://github.com/rupeshs/fastsdcpu there's this if you want, but it isn't the best quality
i would rather use cloud or get a better pc in your case
hmm I use laptop maybe in a year or two wil get pc
I'll look into cloud then
what models could you recommend that handle non-speech better?
https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#models-to-try , you can also search for models, some might have been trained on things like laughing, but as I said, it's an RVC limitation, it's been like this for 2 years and since the devs left it to rot, it will probably never do an actual realistic laugh since i'm guessing you're looking for that
Last update: July 26, 2025
and i can only use RVC on w-okada?
There's no AI Voice Model that surpassed RVC in quality yet, and yes
there are other STS AI Models like So-VITS-SVC, Beatrice 2.0, but those are lower quality and old, no one really uses them
RVC voice models can be used on W-Okada.
While you could use Beatrice voice model in older W-Okada versions, it was later removed in fork W-Okada, leaving only RVC.
What should I do if I've reached the daily quota for ZeroGPU? (Ilaria RVC)
Do people not train models locally anymore?
Try another different online RVC option like on Kaggle or Google Colab.
Either wait, pay for huggingface pro, or use something else
Ofc they do, Ilaria RVC Zero isn't even for training models
what's your pc gpu, operating system and what do you want to do?
use it locally
Also, some people here train voice model locally on their PC with a GPU. They not all quit training voice model locally entirely.
It was a separate question, I am aware it's not for training models.
I don't have my PC specs with me right now, it'll be out of my reach for a while, but I use Windows
Link to the guide? Is it preferable to do so on cloud or local, or does it depend?
A specific Ilaria RVC fork on Hugging Face can't be used to train a voice model, but the mainline one with full functions of similar name does.
-rvc
I still have to navigate through the categories
These are options where to run RVC online.
there's info in our docs, but it would be better you check your PC GPU because it's crucial for AI training first
Is it preferable to train a model on cloud over general cases, then?
it's suggested to train a model on cloud only if your pc gpu is not good lol, overall cloud got limited free time
there's no difference in the program code, it's the same quality, cloud is used only for GPU Poor users
I see
Thanks
I'm pretty certain my PC can handle the load because I've trained a few models locally before
if you're sure about that, then you can just use Applio from our ai hub docs locally
It is preferred to train a voice model online when you don't have a good GPU (lower than RTX 20 GPU), but online options are usually limited in GPU usage time. RVC on online and local all function exactly the same; the difference is which RVC fork you use.
I'm trying AICoverMaker Kaggle again, and it gives me this prompt: Installing requirements ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
@viscid moss could u check this?
not cloned the repo before running install?
that's the only reason for reqs missing
No, already cloned
Are u sure that u are running the notebook from this link?
https://www.kaggle.com/code/eddycrack864/rvc-ai-cover-maker-ui
- running the latest version notebook (v6)
Yes, I am using the correct link. How do I check if I'm running the latest version Notebook?
Before u press Copy and Edit, choose version 6 of 6
Working on my test
Hmm, that is what I chose before
If u have persistence enable, disable it. Maybe something gets corrupt
Try again with a clean env
This requires internet on, doesn't it?
yep
yeah, which means you need to verify your phone number
Tried to, but it's a lost cause. Apparently, it couldn't verify my number and I would have to go through the hassle of identity verification. Thanks for the help nonetheless
Contact kaggle support about it https://www.kaggle.com/contact
or, just use local since you said you got a good enough pc
Ew why is there no persistence
cuz it's for testing only xd
That's a guaranteed loss of all progress as soon as you leave the space 💔
I select CPU mode at first to test a notebook on Google Colab and Kaggle. 
why does my comfyui zluda take hours to make its first photo it has been on 57% for almost an hour now
Hi everyone, sorry for the text, I'm using a translator, I'm asking for help, I need a person to make a voice model of my friend, his birthday is on August 14th, I was trying to make a voice model and calculated that 150 epochs will be processed after 20 hours.
In general, who wouldn't find it difficult to help me?
Hi, have anyone here worked with ssd_mobilenet_v2_fpnlite_640x640_coco17_tpu-8? I have trained the model and tested it in local env - everything works. After that I converted it to tensorFlow.js format and pushed it to browser (Blazor). The code itself seems to work but the output I'm getting is nonsense. Is there anyone that is willing to help me with this?
CUDA error: CUBLAS_STATUS_INTERNAL_ERROR when calling cublasSgemm( handle, opa, opb, m, n, k, &alpha, a, lda, b, ldb, &beta, c, ldc) can someone help me with this error
you could use Applio and create the voice model yourself, it's completely free and easy to learn
dependson what you're trying to do
Seeking Native English Speaker for Exciting Opportunities!
I am a AI developer.
Are you a native English speaker with excellent communication skills? I'm looking for a native English speaker collaborated me!
I need only language skill not anything.
If you want to collaborate with me we can discuss just now.
what's the scam?
i wanted to upload image here but nvm
I HAVE A GPU Nvidia RTX
Win 10
AI Covers
I want to see if the program is updated or not. I had it a long time ago, but I formatted my PC.
Guys, who knows what is the best offline audio transcribing model for mobile devices?
which?
you said realtime voice changer first, what do you actually want to do?
3060
I want to see if the program is updated or not. I had it a long time ago, but I formatted my PC.
yeah there are different programs
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime. There also updated forks with extra features like Applio.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
depends on what you want to do
Roleplay
so, just to clarify, you want to do roleplay like spongebob in games? because you have been changing the reason previously to ai covers
like spongebob
Or a character from a TV series
The desired goal is to see how far the AI progresses, but I enjoy role-playing with friends or other games like VR C.
alright
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
Not suggested nor maintained, older versions in youtube tuts are even way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
you can use either wokada deiteris fork or vonovox
they have similar performance and quality, u can check more about their pros and cons in their guides
I have tried RTX 3060 12GB which one is best for my card
I don't know why when I use this there is a clipping in the sound and the sound is not good
that's an over year old version of original wokada


i mean, it could be my data