#✨│ai-help
1 messages · Page 271 of 1
crossfade: 0.15
extra time: 3
block size: play with it and be sure it's higher than the gpu delay
be sure to also use rmvpe
there isn't really a good value, it varies alot by hardware and what program you're going to use it with, the only way is by setting it higher than the gpu delay
which script ? sorry i am very new to these stuff
maybe try https://github.com/filipstrand/mflux but depends on ur M processor
any updates?
also i am trying to train a model but it doesnt save sometimes and i do not know what settings to use
is applio the best way to do this
how do I change vocoder
in applio RVC
i'm using google colab public url gradio method
what's your pc gpu and operating system? which guide link are you using?
gpu 4060 8gb, win11,the on on aihub website
be sure to use https://docs.aihub.gg/essentials/how-to-make-voice-models/
Last update: July 17, 2025
other than 3rd step all good
uhhh??
wdym what's the error? u tried applio
how can extract thi
this* sh*
oh my god this really annoying af
@low shard nick can you help me please
+CUDA12.8 Pytorch updated
(Pytorch nightly version)
RTX 5080 test done
Windows / NVIDIA
For RTX 5000 users
thanks to deiteris (https://github.com/deiteris/voice-changer)
cant extract this
use winrar or 7zip to extract just the .zip file and it will do the rest automatically
don't use windows' defualt extractor
thanks really <3
(js copied and pasted bc my dumbass went to the wrong channel) Hello! beginner here. tried training my own model with rvc and it went on for almost a day before stopping because I ran out of memory 😭 are there any alternatives to making a model (besides me freeing up my memory ig.) unfortunately I only have an NVIDIA Quadro p600
not by using windows extraction for sure
7-zip will do
Just use cloud platforms such as colab or kaggle
thank you! so js to confirm, I can make my model in Kaggle? google collab always cuts off my sessions bc of credit limitations
You deffo need to use cloud, here's every option:
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, but easy to use, there's a paid tier):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but more gpu time
If you want the easiest way and for free, is using https://weights.com/ which uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fast and free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
For RVC training, Applio Kaggle gives the most free hours which is good for training,
If you prefer the easy automatic 1 click way, Weights does that and gives 1 model training for free weekly iirc
tysm! I'll poke around a little and see what works best for me
Also, I have 78 voice samples for the training, is that enough? because I'm not sure if I can get more
You're welcome!
If you're talking about quality, all of them provide the same quality for training btw
Since rvc V2 has been the peak of Speech To Speech ai since 2023, and the original devs kinda left it to rot as it seems to be hard to upgrade it iirc
The forks, modified version, like Applio just have a simplified User Interface with other extra features like TTS and performance improvements
But the quality doesn't change
How long would your dataset be in total?
in total it clocks around 9 minutes and 8 seconds
ohhh ok
I'm willing to try/learn anything but easy interface would def help, ty !
You're good, be sure it is high quality tho
It's better to have a 9 minutes high quality dataset than an 1hour of terrible audio,
Both quality and quantity matter, but quality a bit more
great! yeah i'd say the quality is fine lol
ofc ofc
thanks sm again <3
Yw again, hopefully your mindset isn't confused anymore :trolley: /j
Feel free to ask here again for help, or make your own help post in #1192011222023950368 (both are good, even tho Forum is more organized)
Hi, does anyone know how to find a voice model that isn’t based on a specific person? I’m looking for a young Spanish male voice that isn’t famous, just sounds good
my nitro boohooh
it's always confused 💔 that's why I'm always asking stupid questions! takes one to know one 💔 ty again!
Hello, RVC STS AI Voices models need to be trained on something, unless you merge models (basing on trained models, to make a 'unique' one), are you perhaps using any realtime voice changer?
I don't usually use real-time voice changers, but rather I would need to edit the voice in post-production, that is, record everything with my normal voice and in editing change it so that it doesn't sound like mine, but I can't find models.
I want to make my own AI assistant (eventually to turn into a commercial service). I want the responses to come out as human-like as possible. I will probably need datasets to use (or my own?) for fine-tuning the model and using RAG.
How does one get started with this? I understand I'd need to obtain and use any UGC data legally and all that, I just don't know where to get started and what the best way to gather data (and put into .JSONL file format) is.
Yeah, I wouldn't even know what to search for related to my topic
nevermind, this seems to be voice only or some sht
try applio merging tab maybe?
Hi I want to know which version should I download for the real time voice changer for windows I want to download the OKADA voice changer
Hello!
Please Elaborate:
- your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
- your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
- Many users confuse Wokada and RVC, There isn't a program that does everything, there's a program for each thing
- what tutorial link are you using
- a screenshot of the program
chatterbox tts allows you to create synthetic voice variations based on an input voice
This was Voice, RVC, focused, but we are slowly shifting to general ai
Maybe this dataset full of conversations used in movies could help for feeling human like https://www.kaggle.com/datasets/rajathmc/cornell-moviedialog-corpus
- RTX 4070 Super
- Window 10 Pro
- The program that converts my voice to a model in real time either to troll friends or to rp ot whatever
- I am currently here https://huggingface.co/wok000/vcclient000/tree/main
lemme guess, did you use a youtube tutorial that told you to use that link along with vb audiio cable?
all video tutorials are old, that is a version of original wokada
@quaint knot
and vb audio cable creates issues on windows
delete the folder, zip and uninstall from windows app settings
no I used to use it before and I formatted my pc I just went to the github but forgot how to download it again
Ohh I see, unfortunately that one is original wokada and outdated now
what should I download now?
it's not suggested since months
troll
like spongebob or e girl trolling?
lol in general not egirl trolling
I used it to record sometimes instead of my voice
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
you can choose either wokada deiteris fork or vonovox
it's better you read each pros&cons
ooo so the original wokada is not working anymore? and there is a deiteris fork version??
original wokada isn't suggested, because of it's performance, the newer original wokada is just User Interface changes lol
Can I talk here instead?
You can either use wokada deiteris fork (modified version) or vonovox
vonovox is still being worked on and has some potential
I tried vonovox I couldn't download it
id really suggest you to read both vonovox & deiteris fork pros&cons
what was the error?
there was an issue with the bat file like at the end when It was extracting tinkers or whatever it got stuck
maybe try with the latest version
1.50
yup that's what I tried
#🧬│ai-chat message sure, but it is more chaotic than #1192011222023950368 , would u rather here or in #1192011222023950368 ?
Forum
Okie, be sure to elaborate here and we (helpers) will help
lmfao
Check the forums
im gen shocked, that dude was still asking for catfishing and even admitting it after days of continously asking for "e girl"
😭
Sure!
smartest user ever
bro made like 2 post forums and 30 messages asking only for egirl "rp" then directly said he says "rp" only bc catfishers get banned
Lmao?
istg 😭
weird, if you want you could try again and show me the error for me to help you
okie I am really confused this happens on every downloader now I am trying with the Deiteris version and this is what it gets stuck on
wifi?
??
on every installer it always somehow stops working for rmvpe and also content vec
I wonder if my network is blocking it or something
It's possible, could also be slow internet losing connection by the time it gets to the end
You can manually download the files from the drive. Instructions and link:
https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#how-to-fix-failed-to-download-or-verify
not from drive, that one was deleted by deiteris
I reuploaded it in my github repo release
where can I find that?
is it the pretrain folder?
Yes
and what exactly should I do with that folder
You take everything inside the zip and put it into the "pretrain" folder of MMVCServerSIO
ooo I see a file called pretrain in the voicechanger folder
That's the download process that freezes on your end
Yes
You put the files from the zip inside that
okie thank you I will try that
oh god it works
thank you
I can't believe I formatted my pc for this
do you want me to check your settings if u want maybe?
What is this
just a solution for your "record everything with my normal voice and in editing change it so that it doesn't sound like mine, but I can't find models."
to make that I used a 30s voice sample + randomizer to make 10 different voices
Could I have the original audio
Okay, I’m so sorry. I’m new and I didn’t know much about this, I just understand it now
Say, what'd realistically be the best case scenario settings for freq/epoch when training off 48000mHz data (RTX 4090)
how do i download the voice changer?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
i downloaded it, but how do i hear the voice that im using
don't use youtube tutorials, please elaborate as explained above
Full GPU Name: dont got one
Operating System: (e.g., Windows 10)
Detailed Description: i cant hear the voice that im using
Tutorial Used:https://www.youtube.com/watch?v=LX5en3pZJwM
Screenshot: no error message
that tutorial is outdated
delete the folder, zip and uninstall from windows app settings
dont got one
Are you sure?
that is the most important thing
You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
integrated graphics? that's bad
do you have any other gpu?
like gpu 0, gpu 1
oh great, it's a dedicated gpu
ok so what do i do after i uninstall
you're just looking for realtime roleplay in vc/games or e girl trolling?
well im already a girl but im looking for voices to have fun with
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
1st link, get wokada deiteris fork
where is it
the guide
you need to read the guide and get the wokada deiteris fork amd/cpu/intel version
is that all i need
but where do i get that
nope, i was just explaining which version you need
yeah, you can ignore steps related to other things like linux or nvidia tho
ok
it isn't 1 click, be careful, missing a step can literally fuck up your whole audio system
let me know!
can somone help me find a reinforcement learing model tutorial for python, all tutorials i found are outdated
So I’m trying to make a 20-30 min long voice clip for a character I want to make an AI song of, but the character always talks with music behind him. There’s hours and hours of audio but it’s all with music or sound effects. How badly will this affect the final result? I’m trying to only use the cleanest clips but even those have weird noises in the background
use a cleaning tool to remove all music, fx, reverb, ect because any voice model trained must have clean audio
https://huggingface.co/spaces/TheStinger/UVR5_UI
The link you sent can clean the audio? Thank u!
yes!
though there are more options than that, this one I recommend for it being simple and easy to figure out
thank u! i am so new to this lol
these are the models I suggest for a good cleaning (in order)
Vocals FV4 by gabox
MelBand Roformer | De-Reverb by anvuew
UVR-De-Echo-Normal.pth
Mel-Roformer-Denoise-Aufr33
optional -
(use one or the other)
UVR-MDX-NET_Crowd_HQ_1
Mel-Roformer-Crowd-Aufr33-Viperx
just 4 is needed ^^
a lot of the music is like, background music so idk how well it will work but i will def give it a shot once im done editing the thing
thank you so so much🙏
no problem! if u have any other questions dm me or @ me here!
Last question. i want to make a full version of this song, so should i add this singing clip and put it through that file too, or should i train the voice model only on speaking clips
@viral mason
sorry for the questions
it's best to train it only with talking but I don't follow the rules >:3
so far it is 10 mins but i have 2 more seasons to add and im going to try to make it as long as i possibly can
alrighty ^^
what length should i aim for?
it's best to try for at most 30 minutes at possible
ok
but if u can't get more than 10 minutes that's also ok
sorry last question, should i be including breathing in the data set? like if the character takes a breath do i keep that in?
yes breathing helps a lot, although I am not sure about like deep sighs
ok
hello.
how do i know if im using the vocals isolation right 😩
nvm i think im starting to get it
i just ran out of the credits i think
Don't worry it resets after a day
And again there are other options although I'd ask Eddy
Or a mod like Nick
They're both very helpful
da fu does ''failed to fetch'' mean lol
I need a smart free ai tool which can copy an edit's visuals and animations and make a same one for the pics I provide on the same beat drop
in short I just wanna swap this pics in a short edit
Can anyone help please?
Does anyone has a AI that can develop whole Programms?
yes rn what i am doing is first place the audio file in dataset folder, then pre-processing, then extraction of features then creating index and start training
in the whole process i dont understand anything about tensor board thingy also the model is not getting saved till i check the overtraining toggle
Hello, I am brand new here and I wanted to know how people take a voice and make it say whatever they want? Is that a specific ai site?
I'm on MacOS
There are setting, training or simple model on Okada to make my voice feel more nature? I sound like my nose is stuffy. When laugh or Yelling it not nature
is there a software doing more realistic voice convertion than w okada and maybe faster?
Is it necessary to use models trained in the same language as the audio to obtain a good result or does it not influence?
required? Nope, but suggested
you could also lower the index ratio to make sure it uses less of the trained accent
what's your pc gpu? I'm guessing you're on windows 11, and what tutorial link did u use?
Hello, I am brand new here and I wanted to know how people take a voice and make it say whatever they want?
Are you talking about Speech To Speech (STS)
Or Text To Speech (TTS)?
I'm on MacOS
Which version, and M Chip?
have you tried checking https://docs.aihub.gg/rvc/resources/training/#epochs--overtraining ?
Last update: May 5, 2025
I mean you can just use any chatbot to develop code
don't expect too much, overall they just do text prediction
Yeah but it depends on your hardware
Could you please tell me ur pc gpu and operating system, so i could guide you in which tool would be good for ya?
ah the bot is taking a long time
my wokada fork worked fine until it didn't and server audio works but client doesn't
whats happening
can you show a screenshot of ur settings?
sure hold up lemme launch it
it works on server but not client
i reallly need the suppression so i cant really switch
if you want a good noise suppression, use nvidia broadcast app
mic-> broadcast app -> voice changer
what browser are u using? u sure u gave it microphone perms?
client should work
yeah i checked it all but i just decided to use nvidia broadcasst
thanks tho
why doesnt the download on applio work
what are you trying to download and where?
local or colab?
local
the download button and then it comes up with the windows save thing
but when i save the file just isnt there lol
what are you trying to download/
if you had a model trained, it is in the logs
there's nothing to download
what browser is that?
how can i fix that
make a ticket for devs
oh
why cant i send screenshots in this channel i need help
i can just open it in a browser im pretty sure
yeah
#1192011222023950368 try there
i posted a forum
There aren't any up to date one's on yt don't ever use YouTube for anything rvc or real-time vc related but I have recorded one myself for kaggle applio
Anyone who why my comfyui-zluda gives a cuda error when i try to create a image?
what error, what's your gpu/
some cuda malloc error i cant remember 100 percent which cuda error and my gpu is rx 6750xt
MacOS Sonoma 14.2.1 (23C71)
For the context, I'm starting as a content creator and there's a scene in a movie between 2 characters and I want to make them swap their voices, not their dialogue
it seems like Speech To Speech then, u'd need RVC
what M chip do u have?
M1
For Inference (use models) Mac (which doesn't have great support), You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides, probably won't be able to train, make models):
- Cloud (remote good pc, easier and faster than ur PC but limited time):
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
- RVC-AI-Cover-Maker-U Colab & Kaggle: Automatically separates the vocals and instrumentals, converts the voice and mix all together back
Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui colab/kaggle
Easiest cloud: applio colab
Easiest Local: Applio
I'd suggest using cloud, your m chip isn't good
I would love to know is it possible to get it work
i need to see the error
probably running out of vram, especially possible with SDXL's bad VAE file
Oh okay I will take a look what its called a bit later
btw do you know which webui people use for sdxl/flux nowadays? 
comfy mostly

i use sd.next
is still getting updates?
a million of them every week
oh nice, thanks
just one week of updates
perfect
use dev branch to be on the cutting edge

is there a new model yet?
idk what you're talking about, elaborate
last time i used the ai voice model was like 8 months ago has gotten any updates or a refresh?
what ai voice
that's a voice changer
there are two versions of the voice changer that are recommended right now, if you use nvidia u can use both
i recently switched to amd
ah
I am not entirely sure as I use Nvida and more options are available for that lol
still u can look here at the options if u like, the first one should have amd compatibility
!realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
might need Local realtime RVC
these are the local options
you got this man, btw if u need better help u can ask a mod or one of the online helpers
vonovox for now is only nvidia but Wokada Deiteris Fork does have amd although I have no info on that
yes it is, it has older code and is just slower than these new ones
welp time to wip everything of it of this clean pc
you ai guys move too fast
one day on mccv and then on onitrix vi
make sure you screenshot which voices u used
is there a reason for? just asking
oh okay thanks yea befiore i just labeled them thing 1 through 6
worked good untill i forogot that i already made a number 5 then label the other 5
work whre can i find the model i need for my vc
if u still have them downloaded u can just re find them there and make a new folder to put them all in
or redownload them off of either weights or through here in the voice model section
wait it said make sure you hae unistalled all versions of this product
how do i know it wont just blow up
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
i ahd the same issue
you gotta delete that old version and something less moldly
if u remember the names of the voices just look here
https://discord.com/channels/1159260121998827560/1175430844685484042
work
\hlp
my ai keeps runniong away
the sofrware
WHERE DO I FIND THE VOICE CHANGER
ALL I SEEE IS THAT AUDIO CABLE MANE
I tried..
worm
is the one i should be getting deiteris w okada fork?
<@&1159293204038955078> can one help such as me dearly
yes
what's ur pc gpu? i'm guessing ur on windows 11
ojn windows 11 i have arx 9070 xt 16gb
copy the settings I have here if you downloaded deiteris as just a starter for testing quality with not too much delay
proccessor is a ryzen 7 7700x
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
get wokada deiteris fork
you need to read the documentation, it's more than just clicking the link, be careful and let me know for everything!
that's old original wokada, dont use youtube video tutorials
delete the folder, zip and uninstall from windows app settings
You have been warned. That is catfishing
It's still catfishing and illegal
i wanna be venom
finally a person who wants to use the voice changer for a good purpose
which venom btw
is there any way to download a model from elevenlabs?
the movie from alst dance sounds more scary and clearest
you haven't been warned nor anything, just don't help such people tho
yes sir🫡
nope, it uses a completely different technology
11labs is closed source and TTS
RVC is STS
o ok becouse i really liked one model
I didn't see the thing he replied to until a little after I sent that message sorry
It's fine don't worry :D
Unfortunately, you can't download it
you can use it in 11labs website though
like I said earlier, voice model section https://discord.com/channels/1159260121998827560/1175430844685484042
huh? there are over 10k+ of RVC models iirc
no the venom ones
venom is a famous marvel character
the "we are venom" is just a line from the movies, we aren't affilitated with marvel
on which rvc venom model to choose ?
ohh the voice changer
yea
here's the direct link: https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/
Last update: July 30, 2025
you can skip any steps not related to windows amd, like the linux or nvidia ones
be sure to not skip the virtual audio cable, that is for all users
u downloaded vac lite and run setup64.exe and clicked yes?
yes
great
https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#download-amd-intel-and-cpu-on-windows this exact steo is for you!
Last update: July 30, 2025
there is, the exact link i sent is for all amd or intel or cpu users
that one works for amd gpus
you sure? this one opens in the browser
unless u play vrchat you'll never find me >:]
im on amd i downloaded the thing from github made sure to download the right one not sure where to go in the files thougjh every exe i run does nothing
wait nvm
got it working i think
i feel like you're using that over year old version of original wokada with vb audio cable
don't use that
what's ur pc gpu? im guessing ur on windows 11
windows 11 amd and it's the whichever it's called
so I just need help with making an AI, Ive got an idea but I cant get the tools to work with me.
nor do I know what tools I need
amd and it's the whichever it's called
the gpu is the most crucial point actually, you need to be sure to know the name
You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
everything ai related dpeends on your gpu
that's original wokada, not suggested anymore
and some users reported issues with vb audio cable, which is why we don't suggest that either anymore
yeah for amd it's ML
id suggest you to delete the zip, folder and uninstall vb audio cable from windows app settings
whichever works honestly aslong as it works idm
you need to give the full name of your AMD GPU
to be sure which settings are needed, and if it's good enough for whatever task you want to do
I'm not completely sure which it is maybe amd radeon
please do this and tell me
I know that but I'm not at my PC at the moment
which is why I tried to name it off the top of my head
please let me know when you're at your pc, because the settings vary depennding on it, and if it's even good enough
Let me know!
alright I'll ping you when!
hello i have an isuse, few days ago everything was fine but today i tried to test the voice but its not working, no sound is coming out
uncheck sup1, if you got noise isues use sup2
uninstall vb audio cable from windows app settings, many users report it doing randdom issues on windows
get vac lite https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#virtual-audio-cable
Last update: July 30, 2025
okay ill try it
well i reinstalled it and i still cant hear it
you shouldn't reinstall it
yeah i was going to say that
after you install vac lite
set output to line 1
and monitor to headphones to hear urself optionally
[SIO] rconnection failed Error: xhr poll error ????
I'm using windows 10, gpu 9070 xt (in screenshot) everything was working fine a few days ago but all of the sudden today it would not pick up my voice i think, i tried every input and output option and tried other mics
AMD RADEON RX 6550M
which website would you recommend to train an RVC? I am so new to this and im lost in the guide. I have a very good data set, but my computer cannot run a program so it needs to be on the cloud
if theres no good free option, i can probably use someone else's computer but id like to do it on my own if possible :)
A friend who knows some about these things said that its 99% not that because I streamed that to him when i tried to set it up
you can only know that by examining task manager/performance tab and seeing how gpu usage does
I remember clearly the gpu was using 9/12gb vram
show the error dump
I am so sorry me and My friend restarted and I Have to do The whole thing again by The tutorial and then show you what error i get so i Will show on friday okay
But I am really thankful you want to help me
why does it keep saying "trial" in my mic?
you've downloaded shareware virtual cable
?
instead of following the guide like a good boy and downloading v4.70 lite
ight
prec8 it
theres a bunch of stuff in the extracted file.. i press setup64.exe right?
yes
follow the guide
hello, i tried to uninstall the old vb and use vb lite as instructed but my issue still presists, i can't hear myself when using it
im using windows 10
it was working fine a few days ago
but today all of the sudden it stopped working
weirdly enough, it works when using server mode, but client doesnt work?
@low shard
hi guys do u know how i can use the ai voces on discord??? on calls with my friends??
have you tried enabling postthru
where do i find the option to enable it? (btw it works for server mode, just client mode doesnt work)
well im going to be busy for a while now but if anyone has any suggestions or ideas on how to fix my issue i made a forum about it here, it'd be apperciated ❤️ https://discord.com/channels/1159260121998827560/1405280433360343171
everytime i download a model it doesnt work but the original ones work
but models dont
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
Please elaborate
it's a AMD RADEON RX 6550M
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
wokada deiteris fork
last time I downloaded the right one it just kept failing right before downloading
what's ur pc gpu and operating system? are u looking for a realtime voice changer or ai covers?
i'm not sure you used this one, but be sure to have a good connection
yeah my connection is very stable
I was so confused
why it kept doin it
yes, but you gotta elaborate
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
try now
Alright when I'm home I'll try it
macbook pro M1 chip
no idea what that is for gpu
8 i think
and just looking for something to make song covers with a character from a show that doesnt seem to have any voice models
For Inference (use models) Mac (which doesn't have great support), You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides, probably won't be able to train, make models):
- Cloud (remote good pc, easier and faster than ur PC but limited time):
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
- RVC-AI-Cover-Maker-U Colab & Kaggle: Automatically separates the vocals and instrumentals, converts the voice and mix all together back
Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui colab/kaggle
Easiest cloud: applio colab
Easiest Local: Applio
I'd just suggest you to use cloud
weights said my file size was too large
do any of u know why my rvc comes with extra voice cracks?
rvc doesn't mean realtime voice changer
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
how can i install the vc i have an 2080ti and and win 11
I recommend deiteris wokada fork or vonovox. They have guides in the docs which is really easy to follow, just read the pros and cons for both voice changers to see what you'd like best.
https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/
https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
can you look in dm pls
this isnt help but a question. does FP32 make it sound better? im using the w-okada forked and theres a feature where u can force it. should i enable it?
it's a tricky question, but at least, its needed for fp32 models to work properly otherwise they randomly have glitches, or at least my fp32 models were glitching when i enabled fp16 infer in realtime, tho for some reason seems to works fine in local rvc
well i just enabled force f32 and i got 0 glitches so far
i tested it by talking for a bit
didnt hear any so far
yes i mean like, fp32 models glitch when fp16 infer is enabled
i mean what would u say then? is it genuinley preffered?
doesnt change that much honestly
i figured
but for fp32 models i'd rather have it enabled so they dont glitch
i mean i just tried it and i hear a good diffrence but idk if im just crazy
like when i talk it sounds more natural
it sounded good before but it sounds better now
but like i said, idk if thats ME or thats actually the voicemodel
i can't say for sure, all i can say, in regular rvc (not realtime) the results are exactly the same
but realtime pipeline is different so, ye
alright, thanks then!

and also about w-okada updates, did the developers/engineers just stop working on it?
from what i heard AI hub and W-okada its self is literally dead
yes, vonovox and training models using the spin embedder is what is best rn
w-okada is something from the past
hypathetically if i got people to code w-okada for me is there a way to make it any better then it is now
afaik, no, because the whole code is a mess
well IF by a chance theres a way to code it can it be better then it is now?
for that you have to rewrite the entire program
is using super old outdated software of 2017
im planning to hire a team to just rewrite it entirley
would i have to ask for permission?
try to get in contact with wok
hes the original author
is he on discord?
wok5681
his disc
he's here
the code is open source
so in theory, is fine? i guess
welll
id wanna ask some questions
hes the original coder of w-okada?
i thought the original author was gone
😭
like last year i could NEVER find him,
wokada
wokad
woka
wok
nobody knew who he was
ah, i see
well thanks
this is actually gonna help me
he speaks japanese btw

no idea lol
rtx 4080 its on laptop though
i did try using the mango rvc fork main
@low shard i did try using the mango rvc fork main to train models but not sure if its working or not
I don't use mangio so I wouldn't know
what do you use?
are u training a model?
yeah im trying to create my own model
I use applio on kaggle, I train with rmvpe
Mangio rvc fork is abandoned since 2023
wait wut really?
check https://docs.aihub.gg/ you can do it locally with applio
Last update: August 5, 2025
Yes don't use video tutorials
is it applio?
so are we gonna get both the index and pth files?
That is our documentation, you can read here more about how to make models with Applio
ok2
do i have to do the cmd thing if i were on 4080? or is it for 5000
This is only for the Rtz 5000 serie
So nope
Not for your Rtx 4080
ohh i see
yeah sry its kinda confuse me for a sec
also to train new model, which sampling rate is good? 32k 40k 48k?
You need to train the model depending on the sample rate of your dataset
ohh i see okay
hi i get abit confuse im on the last step, do i click train model or the index?
okk
does anyone know how to install pytorch...
ohh i see thanks!
super confused. so i have applio running on a diff computer and i have a dataset but how do i turn that data set into a voice?
the guide doesnt really explain it
are u using it locally or on kaggle?
I got a whole kaggle applio tutorial
hello! i have a question for Deiteris' W Okada Fork Local Realtime Voice Changer...
I have windows NVIDIA RTX 5060, so would that make me download the NVIDIA RTX 5000-series on Windows version?
If that's the case, is there any step by step guide how to download it? because it said i have to download 3 files, while there's a total of 5 files on the link (https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335) It would help me a lot if there's a procedure or video somewhere
But if it's not recommended for me to download the NVIDIA RTX 5000-series on Windows version, then which version would be the best
locally, and the sample rate doesnt match with the 3 options
my sample rate is 22000 or something and the lowest is 32000 something
did i mess up
can't really help with local but for sample rate download audacity#✨│ai-help message
it also said it failed because there wasnt enough data
The file was WAV and it was 25 mins long
does anyone know how to make a person look really creepy with ai
Is anyone proficient with Vonovox? I'm trying to run the program, while also playing Marvel Rivals, and it seems to cut my mic off over and over, or literally any other game, even Peak.
My current specs are 2070 Super, and AMD Ryzen 3700x, I mainly use Vonovox for vtubing. I just want it to work when I'm playing more high intensity games for streaming and offline as well.
I’m going to just keep trying until I get it I guess 😭
why doesnt the voice changer work in discord
Can anyone help me use SDnext properly? My results are really bad I'm trying to upscale + enhance
hi there, do you know how to make a song but using the voice models as the vocals
you can try wokada deiteris fork instead if not sure
marvel rivals may be too demanding for your spec
RTX 3080/4070 is recommended for your typical use case
Ohhh okay, usually Rivals runs at least at 90-100 fps usually for me. It's just so weird
then the voice changer might be underperforming, try adjusting chunk & extra settings
Okay I will do that tomorrow, again thank you for the feedback! Also I LOVE your profile picture
@low shard it still did it it always happens at 170
Hi, I was wondering if anyone could help me to fix my high res, it randomly starts and stops for not apparent reason, sometimes its high, and sometimes its normal.
I downloaded the cuda version since its for AMD if I'm not mistaken, and I have a 7900xtx which should be enough to not have high res.
I use the MaeASMR Model too
downloaded the cuda version
of what?
why do you think 7900xtx supports CUDA?
just realized looking at older posts having high res like me, the version I have is outdated
I followed a 2 months old tutorial that talks about old stuff
I can't send images here... can we go into a post so I can send an error I got trying to install the voice changer ?
lol I can't even create a post
and btw it was "ML-Cuda" not cuda alone
same or nah ?
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
read 1st link
ty
alr it fixed the high res, but how to avoid sizzle though ?
any settings you'd recommend ?
hi, i tried to train a speech model with the freesia speech pretrain with spin embedder and I got only buzzing sound as output. what am i doing wrong?
you need to use spin embedder for inference
unless you did something even more stupid like using a pretrain as voice model
did these settings for training the model https://imgur.com/a/nyJnoMd
@simple ore https://ibb.co/pBdhn9G0 Which settings should I change to make the voice sound better ? I have a good too
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
!give-media-perms 50m @empty bolt
by your model, are you trying to do e girl trolling?
I'm a trans woman....
I'm only on week 3 on estrogen doses so I still have a masculine voice and I'm trying to hide it, is it still catfishing ?
Ohh, it's great to help people like you!
f0: rmvpe without onnx
extra: 2.7
uninstall vb audio cable from windows app settings, get vac lite from the same guide you got wokada deiteris fork, then set output to line 1
uncheck sup1, check sup2 if u got noise issues
Okayyy tysm 😄
also since you're on amd, this triangle will be ur life saver
reallly lol ?
There's also optional settings, want help with them too?
what does it mean
sure !
hover your mouse over it, it will tell you exactly what you need to do lol
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Crossfade Lenght: Controls how smoothly the AI stitches different processed parts "chunks" of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
- Reduce the delay on Windows via the Wasapi / Asio Guide
ohh indeed that's pretty useful
that tutorial is outdated, it uses an over year old version of original wokada and vb audio cable
delete the zip and folder of original wokada
uninstall vb audio cable from windows app settings
Are you trying to do e girl trolling like in the video?
thank youu that's really helpful !
you're welcome
, btw do you need any other help? I gave you image perms in case temporarely you need them
if I do I will let you know 😄
so delete everything basically ?
yes, forget video tutorials
no i am not
there are different programs based on different things, there isn't a program for everything and it also depends on the AMD version (and if your gpu is actually good enough to do what you need to do, since it's bare minimum), are you trying to do ai covers, or tts. or roleplay in games/vc?
alright, have a nice day
oh well I do, for the rmvpe settings without onnx, its grayed out and I can't click on it for some reason
that was a typo, i meant with onnx since you're on AMD
onnx is meant for amd
what games? i fear you might not be able to bc ur gpu is pretty old
okk makes sense, thx
ehhh, id not count too much on that, your gpu is the minimum
you can try on 1080/720p with 30/60fps cap tho
wanna try?
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
1st link
wokada deiteris fork, yes
official is the one you're using
show an entire screenshot of your settings
its weird my model works with tts inside the fork but not with vovonox live. i tested a model from this discord and that works with vovonox
dont crop anything
yeah to check which should be modified and which shouldn't in your case
that doesn't look like applio 3.2.9
also the quality kinda low
!give-media-perms 1h @dusty mortar
applio didnt work with my 5090 so I used the rvc4 fork
that looks like codename's rvc4 fork, it's extremely experimental and not suggested for most users
applio does work with the rtx 50 serie
Last update: August 9, 2025
thank you i will try it again
this looks like the reason why it's not working
it would work only in codename's fork and applio
ah i see
you could either:
- use the applio rtx 50 serie fix, and use the spin custom embedder if you want to train spin models
- retry with codename's fork changing that setting, however the whole thing about the fork is being experimental and there is an high chance it can fuck things up for most general users, that's why it's not in the docs anymore and not suggested
i try the fix first. i did that fix for an old applio version but I forgot about it
it does still work, and iirc in the next major applio release you might even not need to fix it urself as i heard noobies will use the newer pytorch from the start
if u want to train spin models, you can use the "custom" option
where do i get the custom embedder? is it automatically downloaded?
you can either:
- use the rtx 50 serie fix precompiled guide, then download spin manually via selecting custom and getting the file from https://huggingface.co/IAHispano/Applio/tree/main/Resources/embedders/spin
- download applio main branch by git cloning, tho it needs git and would need to skip the precompiled step of the guide and just use the fix part (this will use the main branch which automatically downloads spin, but has some other experimental things and it's not as much stable)
Hey why my voice is like lagging so much?
the message i sent is related to something else
and it's pretty hard to understand the issue since it's a general ai server
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
please provide the info needed for me to help you :D
Okay
i also need help, same issue as yesterday, my voice isn't being picked up no matter what i do, although yesterday when i switched to "server" it worked but today both client and server dont pick up my voice
everything seems fine, i installed the vb lite and its still not working (aside from the server miracle yesterday)
Sometime you should change your default microphone on your browser settings, in your browser settings you have your def microphone?
oh okay
yeah its the default i tried that too
i switched from quest's mic to my other mic, both set to default, and still the same
let me know!
gpu: rx580
extra: 1.0
be sure the game/discord input is line 1
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Crossfade Lenght: Controls how smoothly the AI stitches different processed parts "chunks" of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
- Reduce the delay on Windows via the Wasapi / Asio Guide
GPU Name: AMD Radeon RX 5700 XT
CPU Name: Intel Core i5-7600 @ 3.50GHz
Operating System: Windows 10
Detailed Description: I was trying to set up AI voice and followed exactly what was shown in the tutorial video, but the audio still lags, glitches, and is delayed, and it also cuts out.
Tutorial Used: https://www.youtube.com/watch?v=SxdnGxicJOg&t=617s
Screenshot: its just bugging voice
that tutorial is outdated, it uses an over year old version of original wokada and vb audio cable could caause issues on windows
delete the zip and folder of original wokada
uninstall vb audio cable from windows app settings
Are you trying to do e girl trolling like in the video or are u just trans / doing rp?
I was try to do like finnish thing in rb
And where i can download newest one?
ohh, roleplay?
yee'
Butt where i can download the newest:D
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested WebUI with the best general support for many platforms. GUIDE
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
is there a like tutorial?
if you mean written guide, yeah the hyperlink is up there
if you mean video tutorial? nope, never trust those for realtime voice changers
ooh
they are always old since 2 years, never use them for rvc nor wokada
If you don't understand a step in the written guide, you can ask me 😁
@buoyant warren any updates btw?
How do I upscale + enhance using SDnext? I cant use sd1111 cuz I got an intel arc card sadly- can someone help me with this?
generate an image, send it to image tab (using image ->) button
on image tab re-select sampler, then expand resize tab and add multilier, and pick the resizing method
No like- It's generating higher resolution image but taking away all the details instead of adding
no mattter what i do i cant get the voice change to work both on presets and customs
not 'rescale'
what? I'm sorry im new to this
there is a scale output option in "resize" tab but I thought it's just gonna upscale without any details, that's why I used these
I will try your settings
I told you already to not use that version, delete the zip and folder
And uninstall vb audio cable from windows app settings
Use wokada deiteris fork instead please
the problem is it wont i ghave stable internet and it faikls at 170
At 170 what ?
Where are you from btw?
170mb / 274mb
new york
The latest version as of December 7th 2024 is: dml-b2332 (click here to download)
is this the right one?
so its definetly not an internet issue
Yes
Okay
Start http is from the old original wokada
Don't use video tutorials for realtime voice changers
Delete the zip, folder
And uninstall vb audio cable from windows app settings
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
Please elaborate
https://litter.catbox.moe/dry5y5ny486bi8ys.zip try this temporary link maybe?
It's weird that the HugginFace download server link doesn't work for you
But lmk if this works
You're welcome!
yeah something is defiently weird it still says check internet cinnection near the same mb even though i have stable internet
It's an issue on your end then, try either:
- use a different browser that is up to date
- use a download manager like FDM
- check if your storage is full, free it up a bit
- use a VPN (maybe?)
Because other people just downloaded it and even the HugginFace link was fine, I even tried both temporary one I just sent and that's fine too
do you think mediafire would work
I think it's in your end, if it was server side it wouldn't have just worked for me and the other people I just helped
It would be better if you try the things I told you
everytime it fails it turn sinto this file type
Delete those files, try the things I told you please
aight i got it but i think it doesnt work in discord
and i speak but only hear my voice after like 10 seconds
after tring in chrome then edge i tried firefox an it finally wORKED
Yup, it was on your end then
Well u can unzip it and continue with the guide then
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
I told you to not use that one, and that you need to elaborate, please do that #✨│ai-help message
alright thank you so much
Let me know how it goes
i should deleTYE THE OLD FOLER FOrmj the other one i wa using right?
oops accidental caps
The old original wokada and vb audio cable? Yes
Quick question function call in Ai context is just asking open ai question then based on it respond we excute some code then based on result of this code we ask new question and get final result ?
@low shard
'''2025-08-14 09:44:09,438 INFO [WeightDownloader] Loading weights.
2025-08-14 09:44:09,932 INFO [Downloader] Verified pretrain/crepe_onnx_full.onnx
2025-08-14 09:44:09,936 INFO [Downloader] Verified pretrain/crepe_onnx_tiny.onnx
2025-08-14 09:44:09,989 INFO [Downloader] Verified pretrain/crepe_full.pth
2025-08-14 09:44:09,992 INFO [Downloader] Verified pretrain/crepe_tiny.pth
2025-08-14 09:44:10,310 INFO [Downloader] Verified pretrain/rmvpe.pt
2025-08-14 09:44:10,538 INFO [Downloader] Verified pretrain/fcpe.pt
2025-08-14 09:44:10,563 INFO [Downloader] Verified pretrain/fcpe.onnx
2025-08-14 09:44:11,150 ERROR [WeightDownloader] Failed to download or verify pretrain/content_vec_500.onnx
2025-08-14 09:44:11,150 ERROR [WeightDownloader] 'pretrain/content_vec_500.onnx failed to pass hash verification check. Got 99cbfd8c7be5b32af2d208d8569f5cdd, expected ab288ca5b540a4a15909a40edf875d1e'
NoneType: None
2025-08-14 09:44:11,151 ERROR [WeightDownloader] Failed to download or verify pretrain/rmvpe.onnx
2025-08-14 09:44:11,151 ERROR [WeightDownloader] 'pretrain/rmvpe.onnx failed to pass hash verification check. Got 9461af137f188837d82c21af61f3f570, expected 9c6d7712f84d487ae781b0d7435c269b'
NoneType: None
Traceback (most recent call last):
File "client.py", line 22, in <module>
File "asyncio\runners.py", line 194, in run
File "asyncio\runners.py", line 118, in run
File "asyncio\base_events.py", line 687, in run_until_complete
File "main.py", line 90, in main
File "downloader\WeightDownloader.py", line 88, in downloadWeight
Exceptions.PretrainDownloadException: 'Failed to download pretrain models.'
Press Enter to continue...'''
i down.oaded everythign now this is what shows whne i open it
your connection seems unstable, check https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#how-to-fix-failed-to-download-or-verify to fix it
Last update: July 30, 2025
If you see this error while downloading "pretrain models" within W-Okada terminal, it indicates that your internet was slow and that it was timed out. Follow this guide section for solution. https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#how-to-fix-failed-to-download-or-verify
thats the thing though everything downlaoded
all the downloads finished
So is this solved?
im gonna try his thing he senthopefully al works
What is this supposed to mean?
nvm

mostly i was talking while thinking
what dose mean
Documents\Ai voice\dist\main>main.exe cui --https false --no_cui True
What program are you trying to run at this moment? Sounds like you're trying a realtime voice changer program.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
yes i am trying to get the real time voice changer
What do you use W-Okada the realtime voice changer for? Trolling your friends or catfishing? And what is your PC GPU?
and i use the realtime voice changer client i use it to sound like youtubers to have fun whit my self and friends and my pc windos 10 and gpu i think is 2080 nvidia
Download the better W-Okada version from there and follow this guide. https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows
Last update: July 30, 2025
thank you :)
guestion where do i download it i cant find it for windows ;-;
Let me know if you have finished installing and running the program.
There is for Windows.
thanks :)
Hi i was working on a small project its a simple Text to speech running in directml and requiers Onnx models, is converting a pth model into onnx possible?
Some people wondering how I get one tab on Google Chrome to alive. I have a secret. I used an equalizer extension on a tab to keep it active even if I switched to another tab. The thing is an audio playback sound needs to be present first before you enable the extension on that tab, and then I switch to "Google Colab" link real quick.
Which program are you looking for? And what is your PC GPU?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
I made my own program with some help from an ai agent, right now
am on windows 11
gpu : radeon 7900xtx
suprisingly my TTS works but it only has a en_US-lessac-medium.onnx model working for it so i was wondering if i can convert the models we have here into onnx
I'm not entirely sure what to use for making covers besides weights.gg, u can ask a mod or helper for that
Both "RVC voice model" and RVC program don't make the whole song track like Suno/Udio. Instead, in an RVC program, if you upload an audio file with a voice model applied, you click convert, the program would process and then becomes into what it known as "AI cover" or AI converted vocal.
What do you use RVC for? And what is your PC GPU? In case if you have one so there's a local option.
Basically you're putting a voice overtop an existing voice of an already existing song
@low shard i got it now im not sure how toactivate the voice changer i pres start and it doesnt changte my voice
Since this specific AMD Radeon RX GPU is the mobile one, here's your settings:
Chunk: 128 ms, although you can increase this if your perf number at top left is red and audio is unstable
Extra: 2.7 s
GPU: AMD Radeon RX 6550M
On W-Okada, the green "start" button is where you activate the program to start working.
i did that but the voice didnt change'
If it still not working, make sure to check your microphone and try and wait again.
What do you use W-Okada for? Trolling your friends or catfishing someone? And what is your PC GPU?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
stilll not working
Have you restarted the program?
yeah i just did
Hi, I am new to this server and new to AI. with so many AI models or platforms out there I am trying to figure out which one might be best suited for what I want and I don’t want to waste my money taking on a subscription and have a turn out to be useless to me. Is there anyone here that might be able to have a conversation with me so that I can get the correct one picked out for me that will meet all my needs?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
May I ask, what AI program are you looking and use it for? AI cover or realtime voice changer?
right now, I have been using the free plan for ChatGPT, but only for the conversation and research. I am looking to find a model that can help me do video, including deep fakes and also for static images and also for programming, and I don’t do any of those, so I’m looking for something that can be used from a conversational functionality
I know that ChatGPT on some of its various models one of them is not video and it also won’t do deep fakes, and I know that because I’ve asked it to describe the various models it has and what they can do and what they cannot do
ChatGPT model itself doesn't do deepfake and other video related features. Dall-E and Sora, both from OpenAI, are known to generate image and video respectively. Other alternatives are also possible; some of which are free and open source but need you to set up by yourself, but also some are paid and easy to use.
my situation is that I know little to nothing about AI and besides ChatGPT and mid journey those are the only names I know and I don’t even know what they do how they do it what their limitations are so I’m looking to educate myself on all of the various ones that are out there and figure out what they each do
I am working on a couple of possibilities for doing a few different YouTube channels, and I will need both static images for the creation of memes and also to help me design things like background graphics or even frames for when I insert content. I will also be having some projects where I will need some programs to be created for me and I don’t do programming at all. I wouldn’t be wanting complete impersonation as far as deep fakes are I just would want it to be willing to create the images of the people I would be wanting to use for my memes both static and video and I know that a lot of AI programs have been created or programmed to not allow the usage of tons of public people
do you know the names of the programs or models that people have used to do deep fakes that have not been gutted to refuse to create the visual image of these public or famous people? Part of my problem I don’t even know the names of these various AI models or platforms.
you dont need a deepfake to create a random person's picture
it wouldn’t be just random people I would be wanting to use in my videos or my static memes. It would be political people and entertainment people.
perfect example of what I’m talking about is a while ago like last year I tried to get a Meem created on the concept of wonder woman both as the comic book type of drawing and also the Lynda Carter image along with people like Joe Biden, and other political figures being in circled in her golden Lasso, and instead of creating Joe Biden it kept creating Donald Trump
also, like stated previously, I only know the names and existence of two AI models or platforms like I said I only know ChatGPT and mid journey I don’t know the names of any others to even begin researching what they are and what they can do and what they cannot do
Hello
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
wait @low shard can you give me an alternative to vb cable so im able to try it out?
you'd need to change the over year old version of original wokada too, not just vb audio cable
it's better you elaborate as said in #✨│ai-help message
there's different versions depending on the gpu for example
@simple ore did I lose you? Or did I say something wrong?
i'm busy with other things
oh, I just wasn’t sure if I lost you or something. You have to pardon me. I’m new to this server.
Uhhh what dis she say? I only speak English…LOL
What i have to do now and best settings for?
high perf value, you're running something else other than the voice changer which can affect the settings
are you trying to do e girl trolling? or roleplay in games?
Hi everyone.
I'm currently running a channel in which I'm narrating some webnovel series (currently only 1) And have been looking for a way to change my voice with an app for the girl characters... Been just trying to physically change my voice, it works... but... it hurts lol.
I see that some people showing images of their recordings on Audacity, so that already answered my first question.
My Current Desktop specs are
i5-9400F @2.90
Ram 32 gb
Nvidia RTX 2060, 6gb.
With all these versions, I'm wondering which one would be best suited for my needs.
I'd greatly appreciate the help.
Thankyou
how do i sett it up to fivem
how do i fix the audio cable saying i cant redownload it
hello guys i have an issue where if i use ai Voice changer client demo and link it to discord using VB output and input in discord people hear my original voice,and then the changed voice and people also hear themselfves too so please can anyone help me and tell me how to fix this issue?
setup what? this is a general ai server, please elaborate
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
redownload it? could u please elaborate?
egirl/eboy trolling? or like spongebob?
dont use video tutorials, vb audio cable can give issues on windows, video use an outdated oriignal version of wokada
Please Elaborate:
- your PC GPU (Such as Nvidia RTX 3060, AMD RX 9070 XT..)
- your operating system (Like Windows 10, Windows 11, Linux, MacOS 26..)
- what you want to do? There isn't a program that does everything, there's a program for each thing:
- AI Covers
- Train RVC Models
- E girl trolling
- TTS
- Roleplay in VC
- Roleplay in Games
- etc
- what tutorial link are you using
- a screenshot of the program
thx for letting me know, and yw!
that is catfishing and illegal. You have been warned and won't get help.
what is the index thing for
pls dont catfish its just corny in general
be yourself and have fun w the voice vchanger
in rvc context:
- pth files: contain the voice
- added index files: contain the accent
- metadata.json file: it's just some extra info about the model download link if you downloaded it off weights.com, it's not needed and won't impact the actual model at all
TL;DR:
index = trained accent
i dont get what epochs are and how i can set it
Epochs are set during the models training phase, it's the amount of cycles the training phase goes through. If you are using a model you don't have to bother with it, if you are training a model (which it doesn't sound like it since you'd immediately see the option), it's another thing
they don't mean quality btw
what does "zero gpu worker error" means in ilaria rvc space in huggingface?
what should i do
Radeon 7800xt
my b sent too soon
radeon 7800xt, windows 11, strange tunnel effect present with the voice that randomly appears
ive been told its like speaking into a cup
how do you view the tensorboard in rvc v2 disconnected to see how many epochs you need?
*i need
can you show a screenshot of the issue?
@viscid moss
can you also share a screenshot of the program, and the tutorial link?
rvcidsconnected is outdated #📰│dev-updates message
what's ur pc gpu?
i have a nvida gtx 1660 super
yeah
Can someone help me with an issue still? I am using Vonovox for Vtubing, but with games like Peak, and Marvel Rivals, the changer seems to "break" or glitch to where my mic cuts out with almost every word, just want to know how to fix it.
Currently operating:
RTX 2070 Super
AMD Ryzen 3700x
Mic used: Shure M7
get a better/second gpu
tune game settings down so gpu is free
hi i might need some help regarding how to make ai covers
it can train but it's not suggested
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, but easy to use, there's a paid tier):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but more gpu time
If you want the easiest way and for free, is using https://weights.com/ which uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fast and free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
better to use cloud
Sure!
Could you tell your pc gpu? I'm guessing you're on windows
rtx 4080 and yes im on windows
so ive read the docs, there are 2, how to make ai covers and ai cover makers
are they differents?
so rn im trying the how to make ai cover
so after i convert the music to vocal and instrument, then change some settings on audicity
need to convert it on voiceconversion
after i read the instruction, after i convert it, below it there is training instrustions
after the download output
the how to make ai cover is an essential page explaining how to make it yourself
the aicovermaker fork is an automatic process which simplifies the whole thing
id suggest the 2nd if u want only ai covers


