#✨│ai-help
1 messages · Page 306 of 1
ok
and it shows preprocess completed and it loaded 30 seconds of audio - all good
if tha number of seconds matches your dataset, you go to the next step
bytes means megabytes???
@simple ore is there a chance i can send you the audio files to train the model and youll train it for me?🙏
or you @hallow thistle
Byte simply means byte unit; if it doesn't say MB or GB, it's not megabyte or gigabyte.
can you?
i'm training different things right now and electricity is not cheap
As of now, I don't train a voice model. To request a "model maker" to train one completely for you, there's #1159289738314919936.
What is your PC GPU? And what is this program or which W-Okada version?
is it possible to get a stable real time voice changer for amd gpu
There is Tg Develop W-Okada fork DirectML.
well I tried that But in real-time I’m still getting weird echo/ghost sounds not normal echo but random distorted audio that I didn’t actually say. even when I’m not speaking at all, some sounds still come through.
What is your PC GPU?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
rx 6600
Chunk: around 80 ms
Extra: 2.7 s
GPU: AMD Radeon RX 6600
Pitch detection: rmvpe_onnx
Okay so the sound much stable than before and I tried some random models but idk why its my voice or what but it still sounds very robotic
NVIDIA RTX 4060 TI 8gb vram pc
Windows 11
I need help with my voice, I tested all official models and used some others, but my voice sounds really robotic (W-Okada) (Basically, the same problem of Brave)
Are you using Tg Develop W-Okada fork? If not, better try one. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Last update: November 22, 2025
Alright, thank you for now, but can I return if it still sounds robotic?
Use this settings in your Tg Develop W-Okada:
Chunk: around 60 - 80 ms
Extra: 2.7 s
GPU: NVIDIA GeForce RTX 4060 Ti
Pitch detection: rmvpe
Input: microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: optional, set this to your speakers/headphones to hear the program.
Okay, thank you again
Hey, sorry to bother you, I did everything according to the tutorial but I'm getting no sound output.
can anyone help me?????
what is a good ai for text to speech where you can use the .index and .pth models
Why not just use it live or ecord your voice and out the ai over it
question for me?
Yea
hello, this is a general ai server, please elaborate
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
my voicechanger is a bit laggy and ive changed almost every setting. can anybody help me ?
my bad thx
Full GPU Name: Radeon RX 7800 xt
Operating System: Window 11
Detailed Description: I've fixed my old problem, but I have a new one: I can't select "Echo Cancellation" or "Noise Suppression" in RCV Voice Changer.
Tutorial Used: https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Screenshot: i can't post picture i think i don't have permission
btw is there a way to send code on dc
without it missing up the code
one of friend is teaching me c
soo
yea
applio can be used on a browser as well as on ur pc locally
idk what is needed for minimum tho gpu wise
are you using server mode perhaps?
either send the files, or use discord's formatting codeblock like:
print("Hello World!")
thanks
I would assume they're using server mode as client mode has been reported on both wokada tg fork and deiteris even for me to just not work anymore
Hi, I would like to get a consultation.
Russian Russian models I tried to use, mostly Russian, because I speak Russian and I assume that this should make it more realistic.
The problem is as follows:
When changing the voice in real time, random words are eaten if you speak quickly. Very often, certain letters, r, w, h, n and others are not pronounced. In general, speech and voice using even Russian models are always very far from the minimum acceptable level.
I use Tg Develop.
GPU - RTX 3060
I've read all the guides a thousand times, tried all the settings. The result has always been very far from "realism"
And yes, I'm writing this through a translator.
Thanks in advance for any answers.
quick and simple question, how do i delete a voice model in the slot config?
nvm, i found this https://github.com/w-okada/voice-changer/issues/440
this looks old
all of them are, but what gpu do u have
there's 3 realtime voice changers (wokada)
depending on what gpu ur pc has will depend on which one u should use
It’s for a friend, you mentioned it the last time we talked
One that uses ai voice modules and it’s free
My computer doesn’t have enough space for it
They all do that and are free, what gpu does your friend have?
Sorry it took so long, here
Nvidia GeForce GTX 1660 Super
It's ok, your friend will be able to use wokada deiteris but it won't run too well in games unless the graphics are super low
Should work fine on discord calls and stuff tho
It's the third guide
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
Where would I download it?
It's the blue guide button right there next to wokada deiteris fork
Question, for Lightning Ai i keep getting this popup saying [SIO] reconnection failed error: xhr post error. Any fixes?
watcha using lightning ai for?
For deiteris w okada fork
Ah yes, indeed, my bad.
is there a way to use the rvc in other apps like discord and not the browser 🤔
what I do
Sup guys, i'm running an Nvidia RTX 3090 with Windows 10 and i've been using the newest Applio 3.6.0 version for my trainings(locally) and alot of the models i've trained look like trial and error even when using the same exact dataset, i wonder what i can do to have better and consistent results. Anyone willing to help? I haven't followed any tutorials, just text guides on how to prepare a dataset.
happens with ngrok too
i get this error when i try to install applio on kaggle
No solution found when resolving dependencies:
╰─▶ Because faiss-cpu==1.7.3 has no wheels with a matching Python ABI tag
(e.g., cp312) and you require faiss-cpu==1.7.3, we can conclude that
your requirements are unsatisfiable.
hint: You require CPython 3.12 (`cp312`), but we only found wheels
for `faiss-cpu` (v1.7.3) with the following Python ABI tags: `cp37m`,
`cp38`, `cp39`, `cp310`, `cp311`
any help fixing it?
bc last time it worked fine
same just started
@lone ruin here
uhhh
I suppose the same method as in Colab will be required: forcing the Python version.
but then again applio on colab works just fine
I meant that the Python version isn't forced in the Kaggle notebook, but it is in Colab (if you don't understand me, it's the translator's fault).
ohhh
!apt update -y
!apt install -y python3.11 python3.11-distutils python3.11-dev portaudio19-dev
!update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.11 2
!update-alternatives --set python3 /usr/bin/python3.11
from sys import path
path.append('/usr/local/lib/python3.11/dist-packages')
yeah it's just kaggle's very annoying to use
for me
have to make sure everything went smoothly because i recently just lost all data of the model i was training
kaggle's literally looks like this
i had that on
Ah yes, that happens if you refresh the page. You could say you lose the storage "state." It is fixed if you try to execute a cell and then stop it; that will stop the training but save your files
it's just i forgot to save the version of the notebook i was training the model on
u got unlucky then 
ykw i'm gonna start over while also trying to fix the current problem i have rn
pasting this onto it solved the issue
!apt install -y python3.11 python3.11-distutils python3.11-dev portaudio19-dev psmisc
!update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.11 2
!update-alternatives --set python3 /usr/bin/python3.11
where'd ya paste it
here
just replace whatever this is with the one above
I need a screenshot of exactly where bc I'm slow
!uv pip install -q -r /kaggle/working/program_ml/requirements.txt --extra-index-url https://download.pytorch.org/whl/cu128 --index-strategy unsafe-best-match --system
%cd /kaggle/working/program_ml```
->
my eyes aughhh
from IPython.display import clear_output
rot_47 = lambda encoded_text: "".join(
[
(
chr(
(ord(c) - (ord("a") if c.islower() else ord("A")) - 47) % 26
+ (ord("a") if c.islower() else ord("A"))
)
if c.isalpha()
else c
)
for c in encoded_text
]
)
new_name = rot_47("kmjbmvh_hg")
findme = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/Dqlitvb/qurwg-mtnqvlmz.oqb", "rot_13"))
uioawhd = rot_47(codecs.decode("pbbxa://oqbpcj.kwu/QIPqaxivw/Ixxtqw.oqb", "rot_13"))
!git clone --depth 1 $uioawhd $new_name --branch 3.6.0
clear_output()
!pip install uv
!apt update -y
!apt install -y python3.11 python3.11-distutils python3.11-dev portaudio19-dev psmisc
!update-alternatives --install /usr/bin/python3 python3 /usr/bin/python3.11 2
!update-alternatives --set python3 /usr/bin/python3.11
!uv pip install -q -r /kaggle/working/program_ml/requirements.txt --extra-index-url https://download.pytorch.org/whl/cu128 --index-strategy unsafe-best-match --system
%cd /kaggle/working/program_ml
!python core.py "prerequisites" --models "True" --exe "True" --pretraineds_hifigan "True" > /dev/null 2>&1
!sudo curl -fsSL https://raw.githubusercontent.com/filebrowser/get/master/get.sh | sudo bash
!filebrowser config init
!filebrowser config set --auth.method=noauth
!filebrowser users add "applio" "applio123456" --perm.admin
clear_output()
print("Finished")```
I cant open applio on google cola mb for some reason
idk if i should buy google colab pro
ew no paying
go to kaggle
Oh it has a kaggle?
How do i access the kaggle
No i mean kaggle for applio
@alpine lotus what browser should I use then that comes prebuilt with a good ad blocker
any browser could work + ublock origin
Yes yes
chrome only allows ublock lite due to manifest v3 migration
I can't hear myself with the AI voice.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
and again read the "how to ask" first
Does anyone know which AI is used for ai Cat dancing videos?
what does this mean
sos, i need help, is there anybody can support me? i don't know how to use w okada voice changer😭
Hi ! I want to run Tensorboard but the console says "the path is not found", can you help please ?
I've put it inside the folder "Applio"
-rvc
Hello, friends! I haven't worked with speech conversion models in a long time, and the last time I did, Apollo was the best option. Now I need to create a model, and I decided to ask if there is anything better or if Apollo is still relevant.
Hi again, I can't find my index files, it says "succesfully created" but I can't find it on logs, can someone knows why ?
My ms is so completly high & the voice is delayed, im in the right version for my gpu so im confused why
can somebody help me ? My ref time is at 12k and the voices are very laggy
literally my same problem
my res is at 30k ms
yeah mine sometimes too, but i cant find anywere a solution
im on the right gpu too and i reduced the index extra and chunk but it hasnt seemed to work
i put everything on the lowest to minimize performance and its still terrible
What gpu do u have?
Same for u @runic ermine
Amd RX 7700 XT
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
It's the second guide
okay thanks and this should help to 100% ?
Not sure, I'm just giving you the best voice changer for ur gpu to see if it helps
ah okay, is there any type of tut video ? or just this guide ?
Just the guide, but all u do for this one is just run mmvcserversio
It launches in browser
okay, the current one i use is this one MMVCServerSIO_win_onnxdirectML-cuda_v.1.5.3.18a.zip
is there any differnce ?
Oh yeah that's uh, old
Yea it's not brand new but it's much more optimized and up to date code wise and has extra features like voice effects u can apply for free
Pretty cool stuff
ah okay nice thanks. This guide seems complicated but ill try
but what is this for ?
Like I said all u gotta do is just download the two things, the voice changer and vac lite (just vb cable but better for windows)
And open mmvcserversio
For the voice changer
Then for vac lite run setupx64
Not complicated at all
okay ill try and ill text u if i need help yeah ?
Sure! I won't be too busy today
where can i download this?
how do i know when its good?
chatgpt is saying that the sound or it wont be any better or good
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
-dt
can I get help about\ [Voice Changer] Pipeline is not initialized.
[Voice Changer] Waiting generate pipeline...
Test each time it saves an epoch like if u have it where it saves every 10 test around like 50-170
Same for save every 5
hey i followed every step but no voice is coming out my mic and even though the app
What steps?
If u got the voice changer off a YouTube video it's outdated no matter if the video is like a couple months old
What gpu do u have
any idea on how to fix high "res" delay in the voice changer
are you able to let me know whats best for my gpu
i believe that is my gpu right?
I wouldn't know for sure what gpu ur pc has in particular but if that's the one u have I'd say use wokada tg fork
Just download these two, the voice changer, and vac lite
ok perfect, do i need to uninstall the current version of the voice changer i have right now?
perfect thanks so much
You're welcome! If u have any questions just ask me or some of the mods or hekpers
Yeah any yt video uses outdated voice changers, you should use wokada tg fork the download is here
#✨│ai-help message
how do i set it up
i downloaded both
For vac lite run setupx64
And wokada tg fork run mmvcserversio
i cant seem to find anything named wokada
do you reccomend any settings within the voice changer?
Mmvcserversio is an exe file you're supposed to run within the wokada tg fork folder, and for vac lite did you run setupx64
yes iv done both and im in a website
Tbh it depends on your PC itself, for chunk I'd say somewhere in the 300s is ok for amd and extra time at 2.7
the website one is real time yeah sorry i was confused
That's ok, I've got work soon so idk if I'll be able to keep helping rn
oh ok all good
is it the same steps for the model?
To put a model into it just press the plus button on the left side, download them from herehttps://discord.com/channels/1159260121998827560/1175430844685484042
yup i did it thanks
You're welcome
is this real time like in discord
Yea it should work on discord if you have the mic settings set up correctly
whaat should be my outpuy
voice changer:
input: headset/headphone microphone
output: line 1
other programs:
input: line 1
output: headphones
i went to discord and it keep spamming the same thing
There's a thing on the right side that would with be green, red, or yellow
When it's running
If it's green it's good
If not change your settings around till it's good
are u reffring to performnce stats
I believe so
do i have to click anything extra
becuase no audio is being played rn
There's a way to hear yourself with vac lite but I cannot remember as I don't use the normal setup most people use
alr but when im in discord i connects to line 1 and nothing is being played
Have you pressed start server in the real-time voice changer?
yes and it keep reapting the same thing over and over for some reason
ive got it to work but its verry laggy
what should be my audio driver
Virtual Audio Cable lite or Realtek HD Audio?
Intel/Realtek High Definition Audio is an integrated sound card; Virtual Audio Cable is a third-party software that gives virtual audio line and works similar to "Stereo Mix". When you mention "audio driver", it sounds more like the HD Audio. 
How do I fix Failed to Load rmvpe. fallback to rmvpe_onnx (using deiteris kaggle)
what are ephochs
hey i have a lot of lags with voices idk how to fix it
Hey, I have a question. My friend downloaded something from this site, is it trustworthy?
https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip
number of epochs = number of times the model looked thru your dataset (audio)
DOES ANYONE HAVE GOOD SETTINNGS FOR A GIRL
anyone know any tips to clear up the voice glitching some or having the audio randomly chirp? The microphone is good and my mic is quiet, if there are settings to fix this can someone help?
why can i hear m y on voice on the voicechanger but the one i put
hey
So I download this model
" Deep male voice " or whatever it.was uploaded by razer I think?
The voice always come so bad and different from the Sample he sent I don't really understand the settings of it
Like
Batch Size:** 6
Dataset:** 30 Minutes
Hop Length:** 32
Pretrain:** Titan
Sample Rate:** 32k
Like are these?
I can't really see then on
applio
@viral mason i want nvidia one
do i use vonovox?
anyone know how to use the rvc voice changer
didn't I already help u?
ye
how i download it
yes but nothing is coming out of my mic
what are the most accurate girl voices?
why are u like this 
do u have the settings right?
yes exactly
my fucking cmd is acting up
im not the admin on this pc btw
rtx 4060 ti
@simple ore
ok i got it to work
i just had to put in
py -m pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
hey i made it work but not it just sound like a robot instead of girl
Oh, what’s the newer method to my question then?
why u being a girl 
lol its really robit sounding
I downloaded w okada and im trying to figure it out, just unsure about it
why does the voice get mechanic or u can hear it´s ai sometimes? how can I fix that
Sorry to Interrupt, But I have another request, Can you do Elmer Fudd voice by Billy West please, once again, I found on weights the Looney Tunes Back in Action video game for PS2, but it doesn’t sound good enough for the actual PS2 and GCN version, It’s sound like a DS version. So can you do it with the RVC please. Here’s the video and photo that I showed you https://share.icloud.com/photos/0fbAgqlVIY3jexXB5TzRWCRbA https://share.icloud.com/photos/072eIwsjQlL2IqWcEhojGzyHw
what is the best rvc to use?
Nobody helped me 
why do u have a photo of epstein 😭
epstein.jpg
i got this output on 50 epochs using refinegan does anyone know the reason for this? im guessing this is a pretty common issue because i did the usual setup for training a model but i clicked refinegan
im guessing it has to do with the pretrain but im not sure
set the embedder when you’re converting the audio to spin v2 instead of content vec in advanced settings
in the inference
i was asking why but okay
still sounds the same
show me your settings
wait i just checked and they only have the 32k pretrain for refinegan and not the 40k
that might be it
lol
why are u still using refinegan
that shi old
cuz its cool for higher frequencies
hifi older
refinegan sucks ass
i mean its not that bad
so is it better to just use hifi and spinv2 then
ofc
if hifi is still objectivley the best why does applio say "HiFi-GAN: Default option, compatible with all clients.
MRF HiFi-GAN: Higher fidelity, Applio-only.
RefineGAN: Superior audio quality, Applio-only."
don’t listen to it
155 is og and 150 is refinegan
og has better pronunciation and can yell better but refinegan has better quality
hmmm
i like og more
i got an official acapella of evil jordan at 40k and its a pretty whispery song
use og
okay will do
happy to help
lowkey ill do both at the same time
Thanks bro
bro why mine a bit too delay for like 2 sec im using von models
please first mention which voice changer version you're using and your GPU spec or colab/kaggle notebook you're using
AMD rtx and king von or juice wrld
idk what is kaggle
new to this thing
voice changer version not your juice wrld model
oh how do ik if it supports me
also please be more specific, like RX 6600 or RTX 3060?
is that exactly yours or perhaps higher?
mine wait lemme double check i
no doubt i js check it, it is rs 6600
I still don't know which version you're using, but I'd recommend tg-develop fork as listed in the following
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
the deiteris fork would still work for you despite being a bit older, but still better than the original wokada
oky ty
What rvc model would you recommend me for the natural feminine voice? 🙂
Not looking for some anime or 2 minute dataset training.
Quite new in this rvc world, willing to explore 🙂
I am practicing trans voice lessons, and I noticed that it works better if you make tone softer? 😅😅
Maybe a bit weird, but that's what I have noticed
Hmm. Wokada is bad?
how to cvonnect the voice changer with discord
what should I do if my voice doesn't appear in the game?
What is your PC GPU? And did you follow any tutorial or guide before here?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Not all W-Okada voice changer versions are bad, though the original W-Okada version (like v.1.5.3.18a) is often not recommended because it's an old and outdated version.
ty
More like "AMD Radeon RX 6600", an AMD GPU. When you say you're new to all things, it's best to learn along. AMD doesn't make RTX GPU; AMD makes Radeon, while NVIDIA makes their RTX GPU, although there are some very rare cases where GPU got mixed up as something like "GeForce Radeon RTX 9060 XT" from their respective manufacturer. Kaggle is a website that hosts Jupyter Notebook where you run Python-related software on their site.
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
It's the second guide
T-T
Ion need the guide
I've done this so many times
I main linux
I reinstall every week or so
I get how to use github
hi
T-T
This won't hel0
Wdym
what should i change?
I need to use docker
640?
Also @viral mason it's cuz I'm epic
Yeah
okay i did it
yes it has
U told me u use all AMD but alr, there's a Linux build I believe in here somewhere
https://github.com/tg-develop/voice-changer/releases/tag/b2397
uh a little hold on js really delayed like very
Is this the main page
It is yes
do i change anything to do with the in and out or tune?
I need a hear it
Ok
I actually had the wrong link I just changed it to the actual main page
I'm for sure this time
I'll check when I come back from the restaurant
Okiii
This is just the straight up link to the windows x64 CPU varient wth
What?
is it buns?
Wdym?
vram okay wait
Ig once u get home u can scroll through the guide bc it has multiple different links to use for downloading the different versions of tg fork
is this it
bro its so buns istg
so what can i do?
bru
i see okay
oh
so it has nothing to do with the sets right
so what should i use then
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
they all work for nvidia
I know
alr let me read this hold on
on gpu i have the options cpu, gpu0, gpu1, gpu2, gpu3, but i have no way to select my gpu, how do i fix this?
it says gpu (dml) then the options i said
my gpu is amd
How many GPUs you got
Oh
so is the 4060 bad for that or is it sufficent
Open task manager or poor man's btop
?
It works but it could be the outdated launcher
i see
Get vonovox or another one
im getting vonovox
which one?
i got both
okay
okay
1 sec its downloading rn
for nvidia u need the 001 and 002 zip
I already said it in vc
👍
It's not titled Nvidia btw
It's titled cuda
ik, but it's in case some ppl aren't aware of what that means
With 7z
Okay but some people aren't aware that Nvidia version is called cuda
what about now?
Open task manager
that's why I said to download those two for Nvidia lol
bc it says cuda and most ppl won't know what cuda means
You said download nvidia
It's titled cuda
How are they going to understand
R u dumb
i did but what do i do now
I am yea
Okiii
okay they both extracting right now @supple cloak
They need to extract as 1
It's part 1 and part 2 which are done in the 7z menu
should i screenshare im confused
this is pretty easy to understand I think "for nvidia u need the 001 and 002 zip" I said for nvidia which means if u use nvidia get those two zip files
which vc? i cant do it here
Yes
But if you look at the screenshot
It says cuda
i cant send photos here tho
And not nvidia
Oh
which is why I said which two zips to download
they're the only ones with 001 and 002

Yeah
But still you didn't say that before
watch screenshare
whats going on
where is that im lowky slow i apologise
i heard
where do i add the model
OH
its loaded now
start server?
oh
okay
okay thank u btw
cya
Tryna help this dude through vc
not everyone has a strong or even a weak understanding of computers
why is my voicechanger so laggy it doesnt record my voice pratically and when it does it comes with a bunch of cuts and the voice sounds broken
also takes a lot to give the output
i have the chunk at 832 and it takes double the time its supposed + it keeps cutting and its impossible to understand what its saying
did u get the voice changer off a youtube tutorial?
...
u should get wokada tg fork, here's the amd download
@viral mason can u help me
yea
then that'll work
I do yea
I have a 507ti nvidia card and these settings work fine with it and sound great, try them
nah, only ones that won't work are any refinegan models
there's only 4 or 5 of those tho here
for input should i use my virtual cable or just mic?
normal one
regular headset mic for input and output as virtual cable
and opposite for using it on discord ect
virtual cable for mic in games/discord and output just whatever headphones ur using
i see #
let me test it out one second
BRAZIL SERVER PLEASE
forgot to ask u what do i put as monitor
device
should be ur headphones
I don't use the monitor setting since I have a different setp from normal use
for sum reasonm i cant hear myself
bru
it dont work
can someone
help
?
is there a rvc google colab link thingy so i can make an ai cover
i cant find a website that works
u can use applio, btw kaggle is much better than collab since it gives 30 hours free
instead of like 2-4
-rvc
no idea what that even is tbh
i have a voice recorded back then
u would need pth and index file to make it work
.index files always started with the prefix “added”
✅ Cached memberlist for server has been updated.
This server has a total of 8883 registered .fmbot members.
uh
used to yea
in current applio I haven't seen that
in applio that prefix was removed for simplicity
how do i put my voice model to applio
i read the tutorial but didnt understand anything
i already got a voice model and a index file but i cant import it in there
How can I create voice dialogues with realistic voice tones? For example, how can I use a shy voice? Elevenlabs has very few character models.
Heya, I have an M2 Macbook Pro and I've been wanting to use RVC. I found the original wokada but trying to figure it out has been awful. Someone linked me to this server saying i could get help for mac and wokada
can someone tell me how to get a better version of rvc?
Does Tg Develop have any tools to deal with the "robotization" of voice?
the wha
wha?
That's an awkward question. Tg Develop's W-Okada doesn't have any known filter that could fix the audio in realtime. Instead, the audio sounding robotic has to have being your settings (extra being set to 0.5 s, while the recommended extra is 2.7 s) or the voice model itself.
As what the other moderator warned you in #1192011222023950368, not even one time, the #1159289738314919936 is the only proper channel to request anyone to train a voice model as what you really expected. You have stated to not spamming your requests in other channels, like #1450361836468834325 message, yet you still make the same mistake while you learn nothing after all. To get a voice model, you should learn to train one by yourself, otherwise just don't complain about your requests elsewhere.
No idea, though I assume it's simply an RVC voice model file (pth) which would still work in any RVC fork. 
Try Virtual Audio Cable lite instead of VB-Cable one.
Look up "AI Hub Brazil" on Google, if the Discord server exists, it exists.
By the way, @minor tundra, if you need more help about W-Okada, you can ping @ helper role, and don't always expect or force those non-helper members to help you about that.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
RVC (Retrieval-based Voice Conversion) and W-Okada are two different programs. Tg Develop's W-Okada fork (b2364) is the only known version to work with Apple Silicon Mac; the b2377 and newer version removed support for Mac. https://github.com/tg-develop/voice-changer/releases/download/b2364/voice-changer-macos-arm64-cpu.tar.gz Applio RVC fork never has any of their version compiled for Mac, there being only Linux and Windows.
i need help with voice chanager i set up the aubio virtual hooked up to discord but it doesnt work
@tame oracle help me pls
please read the help guidelines above before asking, and i can't help rn as im going to bed, im very sorry, but you can ping the helper role and wait and someone will help you

Goodnight fellow femboy 
<@&1159293204038955078> hey, can anyone tell me how to stop hearing myself when using the voice changer client demo? When I click start, everything works perfectly but I don't wanna hear myself
stop using monitor on okada
that's how you avoid hearing yourself
yeah okay, i'm dumb thank you sm
and if you don't mind- i hear some kind of background noises, it's very minimal but still annoying. any setting that i could tweak to mitigate that or is it just purely my mic?
turn on sup2 on okada
alr man thank you, that seemed to do the trick
<@&1159293204038955078> kinda came across another issue, my voice plays twice cause it's taking input from the sound coming from my headphones too for some reason. So if there's music playing in the background it picks that up as well. Any help is appreciated
Actually nvm it isn't picking music but it picks up other peoples voices and my own as well and plays that

What is your PC GPU? And did you follow any tutorial video or guide before about W-Okada voice changer?
"Voice changer client demo" sounds like an old version of W-Okada. What is your PC GPU?
RTX 3050
That's what I saw on github, I'm not sure maybe I got the versions wrong
Try Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397
@unborn briar i can't help u but they can cuz im just a model maker not a staff
Who is "they"? 
u and the staff 🗿
@unborn briar use #✨│ai-help for help about W-Okada realtime voice changer instead of expecting a non-helper member to help you in DM.
he's using the outdated one 
Embed fail.
guys how can i clone my voice
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
embed fail.
What voice changer do u have, there are 3 (and then the outdated demo client that u shouldn't be using if u got it off YouTube or smth)
Oh yeah I figured it out I had to do system not client I’m pretty sure
well i just logged back on from last night and it doesnt work at all for my mic
Yo! I have a subscription to Kling AI and ive been trying to generate text to video ai generated videos but even though the quality is very good, the video just seems to not be following my prompt and now im scared of using more credits. Who can help?
what gpu do u have
amd
take these, for the first one run mmvcservsio, for the second one run setupx64
https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip
and here's the guide if u need it
https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Last update: November 22, 2025
tysmm
what does index do in a voice model
i am trying to use pretrain Snowie 40 or v3.1 40 and i get this every time even tho my dataset is 40khz
why is this happening?
Hey guys, I’ve recently downloaded RVC and trying to make my own voice model, but it doesn’t sound as great as the online models like kits, audimee etc etc
Wondering if I’ve screwed up some settings? I have 30 mins of fairly clean audio and trained it at 500 epochs
Is this too much?
Snowie is a very old pretrain
what does that mean for the process?
500 is a bit much, most models sound good around 100-300
which russian pretrain should i use for singing (in Serbian)
Yeah I’ve tried at 50-300 as well but the voice just doesn’t sound like me and it’s not pronouncing some things right
It sounds like it’s got a cold
I’ve uploaded the same data set to kits.ai and that worked way better
So there must be some setting or something I’m doing wrong
maybe when doing intrfrireance try change index file settings
U should just use og pretrain, legacy core, or use refinegan if u want. Any other pretrains aren't made properly and cause harmonic distortions and issues in general with how it sounds
but how to get that Serbian/Russian accent then on those pretrains?
Can I send you screenshots?
Kits.ai is a very old and honestly bad site for making ai models, have u tried applio?
yupp
No, but kits has worked fairly well
I’m using RVC because that’s supposed to be the best right?
I downloaded it from GitHub
-rvc
Try applio, it's where all good models come from as of recent time
It can be used online or on ur pc locally
But aren’t all voice models using RVC ?
Like they’re all built on top of rvc
That’s what ChatGPT is telling
Me
They use rvc yes but some things create models better than other things
Some have bad old code
Some keep it up to date
And Improve
Plus applio is completely free
So use aplio to train and then put into rvc?
Apologies if I’m not understanding. I haven’t slept much and have been trying to fix this all day
Yup! The guide to it is right here if you'd like to read up on it
It's ok man, get some rest soon so u don't fall asleep mid training lol
Would you guys consider yourself experts on rvc
Training and such
I need to get this to work
And am willing to pay
Obviously I will read the guide
could you help me out, I m using Applio NoUI on colab
should i use custom pretrain at all and which one for Russian dataset that i wanna train?
i am not, i am still struggling all the time
Sent u dm
I'm not sure what pretrain would work better, but u should be using Kaggle to train using applio as it gives 30 hours to free users per week
i tried and yeah it's better BUT it alwayes gives me error when i open op the RVC link
That's odd
it says something about ports and servers etc
this thing, and this also happened on every collab and other sites..
@viral mason
Did u insert your Ngrok token
yupp
Hmmmm
i even tried to go to another Wifi
Have you tried switching it to local tunnel instead of Ngrok?
That's odd
never had those problems year ago
maybe try asking one of the helpers or mods, I wouldn't really be smart enough to tell u exacttly what is causing it
thanks g
@tame oracle do you maybe know ?
@wide oasis ?
i need help with my voice changer, when i test my mic the result just comes in and all i hear is the voice cutting off and it comes back for a fraction of a second but never a whole sentence idk if its my settings or smth
what settings do u have on the voice changer?
like chunk and extra time
256 chunk and 16348 chunk
what do you mean program
like my specs
i cant send screenshots here
yeah ok i sent in dms
Intel(R) Iris(R) Xe Graphics
what why me saying some people is my brother not online thats too cruel
i need some help bad i have been stuck on thsi for over 2 hour's
hello can you help?
stuck with what
So I’ve been trying to follow this tutorial https://www.youtube.com/watch?v=tWd1SMRXRMw, it's to do with the https://huggingface.co/wok000/vcclient000/tree/main because I haven’t got a GPU for my computer. It should work like normal but for some reason it’s not working properly. I can show you where I’m struggling for it to work?
this is ollld
if u don't have a gpu to ur pc idk what to tell ya
LOL OK thanks
sorry i cannot help u :<
How do i fix my Vonovox? I hear my voice first then the model trailing behind it
do u have hte mic settings setup properly?
think so. the input is the mic and the output is line 1 on Vonovox then for discord its line 1 as the input and output the headphones. and the model keeps on like breaking up
Hey there, I downloaded the latest version of W-Okada voice changer, but whenever I start any voice it hits me with this:
[Voice Changer] Pipeline is not initialized.
[Voice Changer] Waiting generate pipeline...
i keep getting this error
probably downloaded some oudated af code
Is it possible to make a rvc sound real sounding?? I have cpu only sadly.. surface pro 8, rip.. do u think I could maybe install a GPU on an emulator? Ava max rvc is what I am going for.
whats the best epoches for a 8 min audio
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
If your PC doesn't have any GPU, your PC won't output any image to monitor, unless there's an "integrated GPU", which still won't be ideal for the voice changer anyway. Instead, try online option.
Try change block size.
Intel Iris Xe is an integrated GPU, often found in 11th generation Intel Core CPU system and newer. If you attempt run any W-Okada version (like the DirectML variant) with an integrated GPU, the program would perform at the fraction of your PC CPU, so unfortunately. Running W-Okada with only CPU is possible, performs a bit step up from an iGPU, though not always recommended to run as some serious task and along a game. 
The ngrok website itself either fail or got blocked access to elsewhere. Make sure to reset your ngrok token just in case if it fixes.
No. 
In RVC voice model, an index file stores accent of that voice model, typically found alongside pth file, although some voice models might not include index from start.
Is it AMD Radeon RX or AMD Radeon? 
That's more like a short-term solution rather than a detailed one. Make sure to read help guidelines before start asking, rather than telling your story of using a "program".
Well you’re def more expert than me 😄
hey if i have like a intergrated gpu in my cpu which is intel
which download shall i do
oooooh thanks a lot for clarifying
although for my pc it makes the voice sound wobbly if that makes sense
maybe cuz I'm using an intel iGPU (HD Graphics 530) or killing my CPU
Intel HD and UHD Graphics are the Intel integrated GPU, typically found in most Intel Core CPU systems. You sure running W-Okada DirectML with an integrated GPU would give better audio quality than one with a dedicated GPU (like Intel Arc B series)?
this isn't so much an AI question, but I have an issue obtaining audio samples for doing inference. Since youtube clamped down on copyrights, the app i used to use for downloading videos no longer works.
How can I get audio samples off of yt now?
See #✨│ai-help message and #✨│ai-help message.
i mean yeah i am and the quality is nicer when the index is disabled
it's choppy
Hii i need help for voice changer i download one here but idk what do do after

What the hell?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Never say you don't know about your PC GPU, in this situation. To check your PC GPU, open Task Manager, go to Performance tab.
Idk what version to pick, the 001 cude or the 002
Are you looking at wokada tg fork?
yee
If u have Nvidia u need both if u have amd get the one below that, it says amd
oh ok
the 002 doesnt let me extrat it ;/
Hmmm
prob help if i could send pics its just a white file and not extactlbe like the 001
Put it in the same folder with 001 and that might work, like don't try to extract just go ahead and run mmvcserversio
I had the same issue where it wouldn't extract
Upon trying to insert my .json file into the index section for Realtime Voice Changer Client, it says the following:
" Extension of file should be the following: index, bin "
Am I doing something wrong?
keeps saying select an audio input device but I have my mic selected ;/
Do u have both input and output selected?
weird i had to switch it off deafault
Weird
Btw if u wanna this in games u need either VB cable or vac lite
The second one is more recommended
then whats a good chunk size and extra proccesing and ill be good to go
i already have that set up
Try for chunk 122.7
And for extra time 2.7
for games?
just seems like itll be hard to respond on a 3 second delay
I have a 5070ti and I really only use the voice changer in vrchat so I wouldn't know for regular flat screen gaming
i use a 4070
That's similar
Probably will work fine, but u can test it tho
Different chink sizes
just move it up to more?
Lowering it would reduce the amount of time it takes for the voice to come out
But lowering it too much would cause it to be choppy and bad
So u gotta find a middle ground between the delay and it sounding good
female voices use around 11-13 pitch?
Actually it's a bit lower than that
Depends on what female voice it is but most are good at like pitch 3-12
Unless you sound like corpse husband
Lol
Oh :(
i cant hear myself on discord
Epochs aren't important tbh unless you're training a model, but even then steps are more important
Did u set the voice settings up correctly
why is it so laggy
what batch size should i do for 34 minutes of 41.1 Khz audio
when i try do download a few selected voice models it says on hugging face "Invalid username or password"
after i trinkered with hugging face configs now it says "Repository not found"
Its this one im trying to download
it lags alot and voices are not so good like the showed ones
my app is just blank
how do i fix this? its been happening to me and my friend
im not getting any sound from virtual aduio cable
What app
Voicemod? Or wokada voice changer
Those are two different things completely
i think wokada
the huggingface repo has been either deleted or set private
both of the og wokada and deiteris fork have bug on deleting the model slot, try directly deleting the slot folder in model_dir
or just try tg-develop fork or vonovox as listed in the following
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
you should also try one of the recommendations as just above^
Last update: November 22, 2025
any reason there is too many audio stuff?
also does make difference related to tis
That's not too many u just downloaded VB cable so it added extra stuff
okok
yeah why an apple falls downwards not upwards?
you haven't quite explained the problem tho
do you mean the vb-cable is not working for voice changer?
sorry man just curious
am not good with tis stuff
its working dw
why tf does okada look like that?
Like what
Ty
What pretrain models should I use if I select "refinegan" vocoder and "contentvec" embedder, for 40k sample rate?
If there are no any, should I use "hifigan" + "contentvec" or "hifigan" + "spin-v2"? Can't find any info about standart 40k pretrain.
(Applio 3.6.0)
that option is only available for 32k
okay, then what embedder should be used for Applio\rvc\models\pretraineds\hifi-gan\f0[G/D]40k.pth pretrain?
contentvec is the default one
but you can also try spin-v2
Thank you
It's a bit weird that if I select refinegan it doesn't even say that no pretrain available and that it actually does scratch-training
just keep 32k sample rate and "pretrained" checked but not the custom pretrain
to change the sample rate, you have to delete the current preprocessed model and start over from preprocess stage
well I have 48k sources and target files too, 32k will be too much loss of voice details I think (usual speech)
it will automatically resample to the target 32k, but for better quality you'd better resample it first by yourself
as said, there's no refinegan for 48k yet
that's why if you select the 48k option it will train from scratch
yes I got it, thank you
even if you try to do that, note that no one has figured out some optimal configurations for 48k refinegan at this moment.
right now someone is attempting for 40k #🔊│ai-development message
No access to the link
I got an error occurred during voice conversion and the console said pipeline is not initialized, im using
GPU: NVIDIA GeForce RTX 3050 6GB Laptop GPU
OS: Windows 11 24h2
I followed the instruction at https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/. How do i fix that?
@finite bay
alright
Use #✨│ai-help or #1192011222023950368 for help about an AI program. There's no reason to go in someone's direct message for such thing.
Like who?
With audio mode server and WASAPI selected in W-Okada, the sample rate generally locks and works fine at 48000 Hz (or in a very rare case, 96000 Hz) because of "shared mode"; when you set sample rate to any number (like 44100 in your screenshot), the program will fail because of mismatch sample rate. It's an issue in almost every program that primarily uses WASAPI in shared mode not just the voice changer.
the program doesnt fail tho
i have been using this settings for days
alsooo it doesnt switch to anything else
like its stuck at 44100
it reverts any change i make when i start the server
It looks like the sample rate from both your speakers and microphone all being set to "44100 Hz", so you might not encounter any issue when using WASAPI in W-Okada even if the program's sample rate is 44100 Hz, although 48000 Hz is often more preferred over 44100 Hz because 48000 Hz gives a bit better audio quality.
Does anyone know if there’s a new fork for okada
this is set to 48000 tho 😭
nvm it, if it works dont touch it ig

lol
mine doesnt connect with discord?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Your question sounds too simple but that doesn't refer to any specific program.
I watched the tutorial and understood everything, but somehow the voice changer client just won't connect. I have no idea why.
W-Okada and voice changer are the same program. Whatever. What is your PC GPU?
I have a 4060 and I’ve been using this voice changer for like 2 years, is vonovox better?
hey can someone help me
No idea, as I've never seen any benchmark between Tg Develop's W-Okada fork (b2397) and Vonovox on the same PC specs nor I've ever tested them myself, but some people have been saying Vonovox can give better audio quality than other W-Okada versions and that's really it.
Make sure to read help guidelines before start asking anything.
when i try to launch mmvcserversio it doesnt work it just says "analyzing...done" but its not launched
What is your PC GPU? And did you follow any tutorial or guide before?
Nvidia Geforce RTX 3050
i was following the tutorial of "novision" but it doesnt work
Try Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397
whats that
A better version of MMVCServerSIO, which the name is basically the same as W-Okada or voice changer as you called.
thank you so much, is it free?
i have a really bad one bc im still waiting for my new one to come
asus GTX 1060
Just say one.
Sure, with NVIDIA GeForce GTX 1060, it will still work with W-Okada though far below than RTX 20 series GPU.
For a newer W-Okada version, see #✨│ai-help message.
hey do u have a yt tutorial i can follow with the ai u sent me?
https://cdn.discordapp.com/attachments/1159290139609137264/1446171264489357322/image.png?ex=694a15b2&is=6948c432&hm=55c76522e40a0231c9e94c1f8c5a3a81764d07c3e7caa6c5fb89cf3905787989& https://cdn.discordapp.com/attachments/1159290139609137264/1451594734962217062/image.png?ex=694a0a32&is=6948b8b2&hm=de90fe156223b06829c074adc1bb7ea04e0af73765ce984fc149e38accf37229&
it worked thanks
Oh its okay, thank u so much for the help tho
how to install
Make sure to read help guideline before start asking anything.
rx 6600 xt
Try Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip
Last update: November 22, 2025
do i need to run as admin?
No.
That's an awkward question, and here's your settings:
Chunk: around 70 - 100 ms
Extra: 2.7 s
GPU: AMD Radeon RX 6600 XT
Pitch detection (F0): rmvpe_onnx
also should i use gpu or cpu for processing unit

..?
In Tg Develop W-Okada, the processing unit is basically "GPU", so when I say GPU you should set it to your GPU not CPU.
oki
why is the voice so laggy
Here's your settings:
Chunk: around 256 ms, always check the perf number at top right
Extra: 2.7 s
GPU: NVIDIA GeForce GTX 1060
Pitch detection: rmvpe
Do you remember when I asked you about using the b2332 a month ago? There's Tg Develop's W-Okada fork (b2397) to try, but I doubt if you ever found out about it. 
i used them and they worked fine, but suddenly my voice started to repeat and lag
Check your perf number at the top right of voice changer. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/#finding-my-own-settings-for-chunk-size-and-extra-processing-time
Last update: November 22, 2025
on w-okada its not picking up my voice even though i gave it permission, my input microphone works fine as i tested, what do you think happened
okay what do i do with the perf number..
its like 193 ms of 141
Take time to read the guide instead of expecting me to provide an answer. Increase chunk number up until your perf number is green.
What is your PC GPU? And did you follow any tutorial or guide before about voice changer?
i followed the tutorial by easeus and AIsearch, my gpu is 5060ti
Try Tg Develop W-Okada. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397 The original W-Okada version didn't compile to work with GeForce RTX 50 series GPU.
oh, i used the download one from huggingface, is that the original one?
This https://huggingface.co/wok000/vcclient000/tree/main is the original W-Okada; the original W-Okada all outdated and now superseded by newer voice changer versions made by different authors.
okay, thank you
i have an intel cpu and it wants me to download "voice-changer-macos-amd64-cpu.tar.gz" but i dont see it on the downloads area
That's not the correct one. The correct ones are https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001 and https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.002.
The "macos" means macOS, an operating system of Apple Mac, while "windows" is simply Windows. Your PC is not Mac.
when i open it winrar says "unexpected end of archive"
If i train a voice model using weights.com (partner of ai hub), after training does it gives the model .pth and index file once its trained? so i can run on w-okada?
hey so u recommended me tg develop w Okada fork and im just wondering how do i extract both files like the cuda zip 001 and 002 cuz when i click on them ii cant extract them
AMD Radeon RX 6650 XT, Windows 10, I start a voice and it says [Voice Changer] Pipeline is not initialized. [Voice Changer] Waiting generate pipeline...
how to be able to hear myself in discord
there was this on RVC app i used
it came up when i searched like MMVC
and it had squigly lines
resemlbing audio
i cant find it anymore , does any1 know what im talking ab
Yes, but that's probably an old one what gpu do u have
rtx 4060
im looking for it as its rlly good
U should use Vonovox it's the current best at the moment for Nvidia gpus
The guide is the first one
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
To set it up just run setup, then after that run start
nice, ty but dyk the one i was talking ab at first?


