#✨│ai-help
Well, it sounds like you didn't set things up correctly in the GUI.
I set pitch and index ratio, selected the correct GPU, set up everything to use WASAPI, selected RMVPE_onnx, etc. Not sure what else there is to set up.
You might also want the best known working settings, so here they are:
Chunk: around 70 - 110 ms
Extra: 2.7 s
GPU: AMD Radeon RX 6950 XT
Pitch detection (F0): rmvpe_onnx
Input: your microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: optional, you can route this to your speakers/headphones to hear the program.
An index file stores the accent of a model and typically comes alongside the pth file, which is the actual voice model file. It is commonly used in Retrieval-based Voice Conversion (RVC) programs (like the Applio RVC fork). In W-Okada, however, the "Index Ratio" should be set to 0, as the index is not needed in the voice changer; with any other value the program would still work, but it would also use more GPU resources.
ah, so the index file won't actually add an accent to the voice in W-Okada? That makes more sense then.
anyone has vbcable audio driver file. their own site not reachable atm
-vbcable
<@&1159293204038955078>
we do not suggest vb audio cable for realtime voice changers on windows
what's your exact pc gpu and os?
It's been a while since I used rvc, which one still works with rmvpe?
Can you help me as well?
Hey guys, I'm trying to set up Applio for my AMD GPU so I can train voice models locally. The issue is that it's not showing my GPU: "Unfortunately, there is no compatible GPU available to support your training." And when I run it, it says: "An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file."
GPU: AMD Radeon RX 6900 XT
System: Windows 11
And I'm following this tutorial: https://docs.aihub.gg/rvc/local/applio/#amd-on-windows-precompiled-fix
They all do, use refinegan it's the best new pretrain and the only one recommended to be used other than the original
Refinegan is spinv2 and is only available on the newest branch of applio
I downloaded VAC, i can hear but it wont register my voice. what do i do?
GPU: 3070ti Windows 11. Which version of okada should I download and from where?
u should use vonovox
Who can send me all their RVC files, please? (via gofile)
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
yo i got rx 5500 xt
r5 5600
windows 11
16gb ram
which one should i download for good voice?
Thank you
GPU 3070 TI Windows 11 I tried to download Gawr Gura voice model but it keeps showing invalid password and username. Why is that?
what
wherever you're getting the voice from should not have a password or anything
definitely a scam site
get all voices from here ^^
https://discord.com/channels/1159260121998827560/1175430844685484042
Oh okay! Thank you
btw here is the guide for vonovox
https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
Last update: November 21, 2025
I will go through it. Thanks
he has been asking the same crap since before, and he said it rtx 50-series
bro does not need deiteris if he has a 50 series
he either means a 50 series card or literally 5000 GPUs

uh, what am i doing wrong
Bro either still can't get over it, not reviewing his queries or having a dementia.
See Tg Develop W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/latest
help :c
For voice models, there's #1175430844685484042. If Hugging Face is blocked in your country, consider using a VPN or try again later. If you mean something else, please be specific.
nvm
If you can't "hear" yourself on W-Okada, the issue won't be the Virtual Audio Cable software itself. Rather, it has something to do with the audio settings in the voice changer. https://cdn.discordapp.com/attachments/1159290139609137264/1446171264489357322/image.png?ex=69330372&is=6931b1f2&hm=756cdc25641223097fca41fe2b520109fc818e740befb445d10c8e5e56c83e06&
I need help with the voice
i turn the voice changer on, my mic is set on the right stuff. but i cant hear it
i have vb audio
@viral mason how to get vonovox audio to discord
i cant upload images
I can hear myself in the voice changer
Use helper role instead of one specific user.
!howtoask
oh well can u help me
<@&1159293204038955078> . I set my input on discord to. Line 1. And I cant hear anything at all
You should also read the help guidelines before asking anything.
Don't be in such a hurry. I'm trying to provide an answer.
ok mb
In Vonovox, there are only input and output. You can set "output" to your speakers/headphones to hear the program, although then it won't output any audio into Line 1 at the same time.
Virtual Audio Cable and VB-Cable are two different programs. I don't use VB-Cable, but in my screenshot it's Virtual Audio Cable.
my vonovox dosnt look like urs
Either because mine uses light mode or because my version is v1.6.9. Check again.
check dms
You can send an image on here now. 
By the way, Vonovox and "W-Okada" are two different programs, but they function similarly (both are realtime voice changers). Don't confuse the two.
This is the actual link to Vonovox. https://github.com/dr87/Vonovox
so i download this right
Don't be in such a hurry. The "W-Okada" in your screenshot is an outdated one. Vonovox is preferred, although there's another, newer W-Okada fork that has a friendlier UI than Vonovox.
Which one should i get then
Vonovox.
ok im running the setup to download the requirements
almost 10 mins and its still downloading the requirements dang.
If your internet is slow, it's kind of expected.
Nevermind, i just need this to work in discord and wtv game I want
Here's your Vonovox settings:
Block size: 0.10
Extra time: 2.00
Crossfade duration: 0.15
how can I enable fp16 on an old fork of applio, need it for an experiment
hey would anyone be able to help with this?
ah dam can't post pictures yet
basically i cant seem to select my gpu, it just has options like cpu, gpu0, gpu1, gpu2, ect
it doesnt have a drop down menu like ive seen in the videos
im using a amd 9070xt btw on windows
I smell an old voice changer
ah, i did just notice the program name is actually title "voice changer client demo"
so that might have something to do with it lol
Since u have amd I'd say download wokada tg fork ^^
It's that second guide the one right under Vonovox
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
Vonovox: A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows, and without cloud options. GUIDE
Wokada TG fork: A personal fork (modified version) of Wokada Deiteris Fork; it adds some Quality of Life improvements like support for Spin Embedder and Audio Effects. Don't expect too much from it, since the creator originally made it as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
There are multiple versions on the download page for it and you'll also need vac lite (it is like vb cable)
I'm heading to sleep so if ya need more help ask a mod or helper, I'll be able to help when I wake up tho
real quick whats a vb cable?
i havent taken a look at it yet sorry
It's alright, you only need vac lite plus the voice changer anyway
i assume by your response its explained in the link u gave
chillin, thanks gamer
And if u get confused that's why there are mods and helpers :3
Np man, gn
Not yet lol
When will Weights bring back text to speech?
can someone help me with the voice changer ?
i did everything that i saw in youtube but still its not working
like i hear sudden unrecognised sound so im asking for help
how to find best pitch and formant shift settings?
because everything sound terrible to me
so i am on ubuntu 24.04 can anyone help me setting it up?
deiteris W okada fork
local
can anyone help me?
plz
i need help
PLZ HELP MEE
DUDE IT DOESN'T WORK
omg bro
this program is broken
only windows user can have stuff like this
...
@obtuse summit Did you make sure to install the linux version?
i am on linux rn..
and yes i installed it
Gpu nvidia and cpu won't work
i can't use the second cause i don't have another gpu
i have an nvidia gpu
both files installed?
aight, I got nothing then
Idk linux 
oh, and to answer your other question in chat
Models are usually trained on one default language, but they can still do other languages
but some sounds unique to certain languages will cause clashes when speaking
there do be some models that are trained in italian tho, but you can also try english ones and see if they work for you
@obtuse summit https://discord.com/channels/1159260121998827560/1364589802921660527
Check this one out
thx i am trying it now
thx
it worked
so
it works but i can't get the model to work
nvm imma give up
I downloaded MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a, but there is no RVC v2. Where can I download it?
ask in the official server
what is ur gpu and os (windows, linux ect)
that's super duper old, what gpu do u have and what is your os
RX 580 2048sp, win11
since u have an amd card I'd say use wokada tg fork
the guide is the second one , it's right under vonovox
-rt
!howtoask
fine
Hi!
My GPU is : RTX 5070
My operating system is windows 11
I cannot hear anything from the voicechanger, seems that it doesn't work
Tutorial Used: https://www.youtube.com/watch?v=SxdnGxicJOg&t=124s
If somebody can help me 🙂
you have
an outdated software :D
since ur gpu is super good I'd say use vonovox as it's the best voice changer currently for nvidia
it's the first guide
-rt
and u will need these two downloads to use it
https://github.com/dr87/Vonovox/archive/refs/tags/v1.6.9.zip
@warm marlin
so it's not the same software ?
it is, it's better and new
like way better
it's a realtime ai voice changer but not old and outdated
my voice is pretty laggy on an amd rx 7700xt and im on windows 11
and do you have any youtube video to recommend?
like a tutorial
!howtoask
my voice is very laggy
rx 7700 xt
Windows 11
please read the help guidelines above before asking
No, all YouTube tutorials are outdated
so uh what's your PC GPU?@twin spruce
uh no what 😭
i suggest u don't use the voice changer while gaming on high intensive games
i want to sound like sponge boob
does roblox count?
roblox is fine as long the graphics quality is low or medium
ok
ima send the guide for the voice changer one sec
but what the app for the voice changer is
Last update: September 6, 2025
here's the guide to how to use it
ok
also you can find the spongebob models on https://discord.com/channels/1159260121998827560/1175430844685484042
thanks :3
np :p
that one isn't really recommended as much, I'd suggest wokada tg fork
I switched from deiteris to tg fork and tg fork I say is better
the hell is that o- o
-rt
second one :p
eugh spin embedder
it supports content vec models goofy
doesn't need spin
switching models on it is almost instant
dude the girl has a 1650 do u expect its gonna handle the modified version of okada?
What version should i chose
I didn't know what gpu they had my bad 😭
stick with deiteris, the one balbal first gave u
Deiteris fork?
ye
that's why u gotta read the message first 😔
lmao
voice-changer-windows-amd64-cuda.zip.001
1.95 GB
Dec 7, 2024
voice-changer-windows-amd64-cuda.zip.002
613 MB
Dec 7, 2024
voice-changer-windows-amd64-dml.zip
What is the difference between these 3? @quasi condor
I have an RTX 4060 and a win 11
where and what to download?
theres too much steps for my brain to comprehend
for RVC v2
u should use Vonovox since ur gpu is good enough to handle it ^^
https://github.com/dr87/Vonovox/archive/refs/tags/v1.6.9.zip
https://software.muzychenko.net/freeware/vac470lite.zip
the guide is here in case u need help and u can ask me too
https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
Last update: November 21, 2025
for vac lite, run setup64
is it better than the one I downloaded? (for a female voice)
it's currently the best of the three up to date ones
if u got one off yt it's old and outdated by over a year
does it change in real time too?
so I just download it and it already works?
did u read the guide at all?
sorry uh im cleaning the house one sec
for vonovox run setup, then run start
for vac lite run setupx64
click this one
ok
have u also installed the virtual cable?
I'm sorry, I just don't understand English well.
what language do u speak
Rus
o
(in Russian) he asked, did you read the guide?
no
translate this to russian for them
for vac lite run setupx64
for vonovox run setup, then run start
u gotta install the virtual cable too its on the guide
aight
thx
it said that i had to use a command on the terminal too
here's vac lite btw https://software.muzychenko.net/freeware/vac470lite.zip
@simple helm
(in Russian) He said that for Vac Lite, run Setup x64.
For Vonovox, run Setup, then run Start.
Show me what the interface should look like, otherwise I don't understand it well.
When I press Start, it shows this message (Error: Cannot start voice conversion - No GPU available. You can still access settings and configuration.) but I can't find where to select the graphics card.
ffmpeg not found, how to fix?
please run the /howtoask command in here 
/howtoask ffmpeg not found, how to fix?
nono just run the command, dont type anything else lol
Where can i generate images
My gpu = rtx 5060
Operating system = win 11
Cant hear anything frm the voicechanger
Tut used = https://www.youtube.com/watch?v=SxdnGxicJOg&t=356s
Would appreciate if anyone could help me out
if u used a tutorial u have an outdated voice changerrr
since ur gpu is peak u should get vonovox
https://github.com/dr87/Vonovox/archive/refs/tags/v1.6.9.zip
Is it normal for the voice changer to glitch sometimes or is it usually a software or GPU thing
depends on what ur doing, gpu, and what voice changer u have
Oh so it could be a voice character bug?
Alr ty ill try this out
Have you tried switching to different voices? Btw if you got the voice changer off YouTube it's outdated
Annnnd, what gpu do u have?
I have the most recent one from the website, I haven't tried multiple voices yet tho
Im not home rn, but i think i have AMD Ryzen 7000
what?
Voice changer isn't outputting my audio. At first when I open the voice changer it works with my headphones, right. But when I try using line or cable to connect it to my game it doesn't work. I have an amd and am using the deiteris w-okada changer btw
Ah sorry, I just mean it wasn't from a youtube video, I just got everything from the website using local and deiteris okada
@tame oracle can you please help me with the voice modulation software? It distorted in my editing software, and it won't even launch, it just gives me an error message @tame oracle
Someone will come and help you when they have time; no one is obligated to monitor the channel 24/7.
you mean the worst possible one??
vonovox is awful
anything you try to do sounds like its in 10 bits and it runs 100% GPU usage constantly
Its good if you use nvidia
welp, i guess thats unlucky
i dont have an nvidia so i wouldnt either way
...so why are you saying its good then if you have no idea?
deiteris is the definitive real time voicechanger
everyone i know whos used vonovox has said its really good
its why I said unlucky because it seems like a problem only youre dealing with
theyre gaslighting you then
ive heard theirs, it sounds good
youll see how horrible it sounds when you use vonovox if you get an nvidia
then theyre not using it lol
whatever you say man
plus on top of sounding terrible it runs even the best nvidia gpu to 100% usage constantly
case in point, vono is so unoptimized its not even funny
u must not know how to use it correctly
the tutorial is half a year old, but it still uses the old version of original wokada 18a, which uses a PyTorch version from before the release of RTX 50-series gpus and doesn't support them
your RTX 2060 is enough for the voice changer alone but don't expect decent gaming performance
otherwise lower the game graphics settings and cap fps to 30 or 60, also please check the voice changer settings as well
btw you can also try tg-develop fork https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Last update: November 22, 2025
That’s what I use cause unlike vonomid it’s actually optimized and I can game and run background processes without any issue
The only way to use it right is to uninstall it
Its honestly unusable
Hello everyone, does anyone know how to render voices using a ready-made voice model in the form of index and pth files? I mean NOT in real time
applio
thx
does anyone know best setting for deiteris's w-okada fork voice changer that work best in mac mini m2?
i try every setting but the result still not good, is it have anything to do with the model/my microphone?
I tried to use deiteris w-okada rvc and I have it open, but I couldn't connect to it through localhost, so I used my local IP instead, and now I only see the background of the website. I'm using opensuse tumbleweed. How do I fix it?
and how do it works please 🙂 ?.
does anyone know a good way to integrate an ai player on minecraft java
im using the mineflayer but its just a bot
and i am needing to use an external llm api to control it
but its TOO DUMB
can't even make a simple house of dirt
anyone know why my settings is on gpu 0 but in task manager it says its using a large amount of cpu which is causing the ai to lag
im on a radeon7800xt
which file should i be downloading
<@&1159293204038955078>]
DMR v2 seems to be detected as 40k in applio but it's a 32k pretrain, is there a way to fix this?
I haven't really played around with AI bot mods on minecraft java, but what LLM are you currently using?
that seems a funny experiment tho
hey i need help
hello, could you please elaborate?
i currently set deepseek-v3.1-terminus
!howtoask
so i set up everything from a youtube video and i cannot hear my voice on anything
i have vega 7 amd igpu 2gb vram (i know its ass but i wanna test it)
windows11
here is the vid https://www.youtube.com/watch?v=SxdnGxicJOg
on hugging face, where is it to create a voice
can someone help me i didnt find anything on the document
I downloaded the W-Okada voice changer from https://huggingface.co/wok000/vcclient000/tree/main is this a safe site or virus filled shit
4080
Good to know thanks
u should get vonovox since ur gpu is good enough to handle it
https://github.com/dr87/Vonovox/archive/refs/tags/v1.6.9.zip
for vac lite, run setupx64
and vonovox run setup, then after that is done run start
To completely uninstall the old one do I just delete all the files and I'm good?
yea, keep the voice models tho if they're good, just anything in that wokada folder delete with trashcan
What's the easy way to install applio?
there is any video that teach how to configure Vonovox?
nope, nobody has made any tutorials, the default settings are usually fine
i'd like to put a higher delay on voice output to improve the quality, is it possible on Vonovox?
it is yes but delay is bad for conversations
you would do that by increasing block size
I wouldn't know
why couldnt i get any help?
<@&1159293204038955078>
i have a radeon 7800xt will it work yes or no
im currently trying to use it and its shit
would love sum type of feedback from the helpers
please read the help guidelines before asking
!howtoask
Bro please just answer my question
Does it work with Radeon cards or no
I’m not doing that entire thing
It’s a simple yes or no
does what work with Radeon cards?
The entire software. When I speak it sounds like shit. Not fully pronouncing the words I’m saying and also using 100% of my cpu. I have a ryzen 7 9800x3d and a Radeon 7800xt
so, voice changer then, yes it does work with Radeon cards
So then what could be my issue? I watched a video of someone using it on Roblox and the ai is fully catching everything they are saying. I have a blue yeti mic which should have no problem catching my voice. It cuts my sentences short or just doesn’t even pronounce what I’m saying at all. I can say “hi, how are you doing” the ai will try to say the same thing but fails miserably
so you said you watched a video, did you follow a video tutorial as well?
Yes
OK well fuck me and my question then
second guide
Impatient people are so annoying:p
Good Morning, does anyone know the best website or program for making songs with AI voice?
I'd say use applio
It's pretty good
You'll have to separate the vocal and instrumental track tho with uvr
I see, perhaps you can help me with this problem in that case
Welcome to my world
"F:\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\cuda\__init__.py:209: UserWarning:
NVIDIA GeForce RTX 5050 with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90.
If you want to use the NVIDIA GeForce RTX 5050 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/"
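For anyone hitting the same warning: it means the installed PyTorch build was not compiled for the GPU's compute capability (sm_120 here). Below is a rough sketch of that comparison, not PyTorch's actual logic. `is_arch_supported` is a hypothetical helper, and the simple membership test is an assumption that ignores compatibility between nearby architectures. The optional part at the end uses `torch.cuda.get_device_capability()` and `torch.cuda.get_arch_list()` and only runs if PyTorch is installed.

```python
from typing import List, Tuple


def is_arch_supported(capability: Tuple[int, int], arch_list: List[str]) -> bool:
    """Hypothetical helper: True if 'sm_<major><minor>' is in the build's arch list."""
    major, minor = capability
    return f"sm_{major}{minor}" in arch_list


# Arch list copied from the warning text above.
build_archs = ["sm_50", "sm_60", "sm_61", "sm_70", "sm_75", "sm_80", "sm_86", "sm_90"]

print(is_arch_supported((12, 0), build_archs))  # RTX 5050 (sm_120): False
print(is_arch_supported((8, 6), build_archs))   # an sm_86 card: True

# Optional: run the same comparison against the local PyTorch install.
try:
    import torch

    if torch.cuda.is_available():
        cap = torch.cuda.get_device_capability(0)
        print(cap, is_arch_supported(cap, torch.cuda.get_arch_list()))
    else:
        print("No CUDA device visible to PyTorch")
except ImportError:
    print("PyTorch is not installed in this environment")
```

If the local check prints False, the warning itself points to https://pytorch.org/get-started/locally/ for picking a build that matches the GPU.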
I know nothing about errors as I use applio on Kaggle, a browser and not locally
Maybe a mod or helper could help if they know more than I do
I feel like that would be better Could you tell me how?
Using it via kaggle?
Yap
Sure, give me a minute to get on my pc
Thank you very much!
Np!
I'm on my pc now, if u would like to move to dms or just talk here either is fine
Yeah thanks you
^
|
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
What is the best model for real time voice changer? The pinned link in #🔍│find-models is dead
i have my chunk on 48000 (1 sec) and extra on 131040 (2.73 sec) but it takes like 30 seconds to catch up on discord. what should i do?
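The numbers in the message above can be sanity-checked with a small conversion. This is a sketch that assumes a 48 kHz sample rate, which is what "48000 (1 sec)" implies; the function names are made up for illustration and are not part of any voice changer.

```python
SAMPLE_RATE_HZ = 48_000  # assumption: implied by "48000 (1 sec)"

def samples_to_ms(samples: int, sample_rate_hz: int = SAMPLE_RATE_HZ) -> float:
    """Convert a buffer length in samples to milliseconds."""
    return samples * 1000 / sample_rate_hz

def ms_to_samples(ms: float, sample_rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Convert a duration in milliseconds to a whole number of samples."""
    return round(ms * sample_rate_hz / 1000)

print(samples_to_ms(48_000))   # 1000.0 ms chunk
print(samples_to_ms(131_040))  # 2730.0 ms extra
print(ms_to_samples(100))      # 4800 samples for a 100 ms chunk
```

A 1000 ms chunk means roughly a full second of audio is collected before each conversion pass, so a chunk in the 70 - 110 ms range suggested earlier in this channel would be about 3360 to 5280 samples at this rate.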
there isn't just one "best" model for it
please read the help guidelines before asking
!howtoask
can someone lmk what the best working version of w-okada works for amd
im not too up to date with anything anymore 😭
Don't use w okada anymore
Go for vonovox instead
does that support amd now?
Guides for Programs that use RVC Models in Realtime for Calls/Games
yeah it only works for nvidia
hello
someone here ? @viral mason
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
as it said, you should explain the problem first
I could scold you either way, but I won't. The AMD Radeon RX 7800 XT will work with the Deiteris/Tg Develop W-Okada fork. The problem is that the version you got from a YouTube tutorial video is outdated, and its DirectML variant is also bugged. 
No excuse. If you need help with something, explain your issue, the program you want to use or have tried, and your PC specs (like GPU). You can't simply say "hello" in a help channel and expect someone to answer or ask you what's wrong.
is refinegan in any applio local build?
it gives me "typeerror failed to fetch" whenever i try to use okada whys it doing that
re-run the cell or try ngrok/another tunnel
use reply instead of pinging
I said another tunnel otherwise
it's just that ngrok has limited requests/min to allow but you can still run applio anyway
it gives me "typeerror failed to fetch" whenever i try to use okada whys it doing that
I have a ryzen 5 5700 rx 6800 oc 16gb and 32gb ram 1tb ssd and 3tb hdd can i run this on marvel rivals
1440p max settings
hi
horizon also doesn't work with filebrowser
what is the best ai to use for helping you study for school?
Why do my exported models in Applio have robotic voices? I configured the training with 1200 epochs and 10 good quality audio tracks. Am I doing something wrong?
I'm using the version 3.6.0 of Applio
Can you please tell me what to do?
the sound is not output for some reason
I even reinstalled the cable.
There is no sound output to the discord, who can help with private messages?
When u trained your model did u check the overtraining detector box? I usually do that
My model isn't that good but the result is acceptable
Who here is Russian and can help?
Something needs to be changed
I'm not Russian, no idea about your language
yes, it was disabled, i trying other things to see if improve it
Oh damn, I always enable it
Try to enable it and use a smaller epoch to see how it work like maybe 100-200 Then u can continue to train for higher epoch level
Ok
Thx man
i think the other problem that i'm having is this, i'm trying to train Brazilian voices, trying to figure out how to add the Nanashi pretrained model
ahhh idk about those, i use hifi gan in default
👨🏭
whats its better?
what is the best realtime voicechanger
W-Okada voice changer. The user you replied to mentioned "Vonovox", a complete alternative to W-Okada that also uses RVC voice models but with a different interface.
i wan't very clean interface and ultra realtime voice changer
Make sure to read the help guidelines before going outside the Discord server and looking for an "outdated" W-Okada version from YouTube tutorial videos.
Want, wantn't or wasn't?
By the way, what are your PC's GPU and operating system? A voice changer program often performs better with a discrete GPU.
The best AI to help you study for school is you. There are also some services that offer similar uses, but they won't be on the same level as you. 
does anyone know why the delay with the app is so long for me? the audios really choppy and takes ages to go through the delay literally says 200 seconds, my specs aren't that bad i have a mid range pc with a ryzen 5 5500x and a intel arc a380
Could anyone tell me why when I use the Realtime Voice Changer Client it doesn't change my voice at all?
<@&1159293204038955078>
Like even if I raise the pitch it does nothing..
Try Tg Develop W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip
Last update: November 22, 2025
Make sure to read the help guidelines before asking anything. What is your PC's GPU? And did you follow any tutorial or guide before this?
I have a 5090 and Ryzen 7 9800x3d and yeah I followed like 3 tutorials online and downloaded multiple versions but every time my voice just stays the same.
Try Vonovox. https://docs.aihub.gg/realtime-voice-changer/local/vonovox/ https://github.com/dr87/Vonovox/releases/tag/v1.6.9
Last update: November 21, 2025
RUN SETUP.BAT AGAIN TO USE SWIFT!!!!
1.6.9
Small Algo update for Swift-F0
Swift-F0 has been added as a pitch extractor option. More info here
https://github.com/lars76/swift-f0/tree/main
https://git...
Vonovox is a complete alternative to all other W-Okada versions. Both use RVC voice models, but Vonovox has a whole different UI.
You're welcome. 
Why wasn't w-okada working for me specifically though?
The W-Okada version you used might be the original, outdated one. There's the Tg Develop W-Okada fork (b2397), which is better than the original W-Okada, although Vonovox is often preferred for audio quality.
oh ok
The original W-Okada wasn't built to work properly with NVIDIA GeForce RTX 50 GPUs, so this kind of thing happens.
ohh that makes sense now
because it did come out before the 50 series came out.
Now when I talk it stutters really bad 😭
wait
Sooo how to make your voice model be able to whisper
Cuz whenever I try to whisper its just meh
But I can do high deep evil voice
all of my voices sound the same
hi guys is there a free alternative for virtual audio cable ? it keeps saying trial again and again
my audio is very grainy
Does anyone know how do i increase delay time on Vonovox? To improve voice quality
how do i fix my shi having a lisp
What's the best audio upscaling
Hello, I wanna ask which realtime AI voice conversation I should get
Specs:
16 GB of RAM
CPU: AMD Ryzen 5 8645HS
GPU: NVIDIA GeForce RTX 3050 - 6 GB
Can anybody link me to a version that's smooth and not choppy? I heard RVC is good
What voicechanger is the best and free?
Depends on ur gpu
You should try out Vonovox, it's the current best and works only with Nvidia
Isn't Vonovox for high end rtxs
I heard 3050 is basically "the bare minimum"
I mean it could work, but if u don't want the risk of it not working too well u could also try wokada tg fork
It's what I currently use while Vonovox's current beta gets an update to a stable version
I have a 1660 super with ryzen 7 5600x 8 cores with 32gb ddr4 ram
At best you'd be able to use w-Okada deiteris unless you use your AMD card which u could use wokada tg fork
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
Where can I download that?
Wokada tg fork is the best of the two I mentioned, you can download them from the guide that is associated with each one
whenever i say something 1 second later it repeats itself how do i fix
Mmmmm I need a 100% confirmation that 3050 is good, you said Vonovox would work but it works only with NVIDIA which is what I have
Can you give me a link to both? And if anyone here has a clear answer about 3050 that would be awesome, I haven't actually bought the laptop with these specs yet but I am going to soon, that's why I am mostly just asking
@hollow mantle
do you know how i can fix this
whenever i say something 1 second later it repeats itself
It's supposed to do that
Sure! Just one sec
You press stop
ohh i get it
Do they at least have spec requirements I can read? Maybe 3050 is good enough
Vonovox: https://github.com/dr87/Vonovox/releases/tag/v1.6.9
https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
Wokada tg fork:
https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001
https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/
Last update: November 21, 2025
Last update: September 6, 2025
I put the guides with the downloads
Alrighty ty, any voice model from #1175430844685484042 would work?
Any of them work yea, only ones that won't work are the refinegan models I made with spinv2 recently
Homer, glados, and bugs bunny
Those only work in the most recent beta of Vonovox
Gotcha, and Vonovox does work, the minimum is far lower than I expected
Oh this looks kinda hard, is there a YouTube tutorial on how to set everything up?
What was the minimum?
All u do is run setup, and then run start
For Vonovox at least
The settings that are best are default actually, but if u want less delay decrease block size
Btw do u have vac lite or vb cable
U need a virtual audio cable to use this in games
i dont know how to find the anvuew v2 for uvr
like what yaml and cpk I need to used
it gives me "typeerror failed to fetch" whenever i try to use okada whys it doing that
ive done all the settings right but it just isnt working
please read the help guidelines before asking
please read the help guidelines before asking
!howtoask
even though a 3050 laptop 4 GB can still run the voice changer alone, I'd rather suggest one with 8 GB vram, or better 12 GB, to be able to run some modern games
im getting an error "Voice Changer is not selected." while using MMVC okada web client
how do i use the voice changer
I don't know what's wrong with my w odaka app ( RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.)
wait i need to download gtx 5000? right guys?
either downgrade to an RTX 40-series gpu or get vonovox/tg-develop fork as listed below
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
Those are the specs I can currently afford. You said 3050 4gb can still run, it's 6GB.
Are you saying Vonovox can run with these specs just fine with at least discord involved? I am not planning on playing huge modern games atm
isn't the 6 GB variant desktop one?
discord shouldn't take noticeable GPU load unlike games, and the same could be said for vtube studio
Not sure if it's a variant one, I just wanna ensure I don't sound like this lmao https://m.youtube.com/watch?v=TR8Tdfj6_-c
Something smooth and actually sounds normal
The model is a variant with better components for mid-tier gaming
it's more likely the model quality
This voice is from RVC Okada, the person in this video has a rtx 4060 ti16 GB and a i10100f
It's probably an ASMR model and that's what made it not as good
If you have NVIDIA GeForce RTX GPU, there's Vonovox. 
I know, I am planning on using it and thankfully the requirements are below with what I am going for based on the doc https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
There's no NVIDIA GeForce "GTX" 50; there's RTX 50. For GeForce RTX 50 GPU, there are Tg Develop's W-Okada fork and Vonovox.
Hello, I have a question for you. Are you able to create your own game scripts? I need a script for the game Case Paradise. I'll send you a link to the game, but I can't find the script anywhere. It's not a very popular game. I'm interested in either item spawning or duplication. Thanks in advance.
there could be some good enough ones, it's more of the dataset source quality and how properly the model maker cleaned & prepared it
Do you have a recommendation on one that's good?


If you could find one, there are like over thousands of voice models.
For Tg Develop's W-Okada fork, double-click MMVCServerSIO.exe. Aside from that, make sure to read the help guidelines before asking anything more than a simple query.
at least try listening to the samples, otherwise you'll have to try the model
What is your PC's GPU? Did you follow any tutorial or guide before this? I ask because the original, outdated W-Okada version is often the one found in YouTube tutorial videos.

Heyy y’all, my friend want to do music with ai voice model, and he’s asking for a pc setup to do that can someone help me please ?🙏🏽
!howtoask
Oops sorry
To check his PC's specs, tell him to open Task Manager, go to the Performance tab, and look for GPU 0 or GPU 1. If you mean what the best specs are for a PC to run "Applio RVC fork" locally, simply say so in your query.
He have no computer yet 😅
i have a 4070ti and i followed a tutorial yeah
Try Vonovox. https://github.com/dr87/Vonovox/releases/tag/v1.6.9
So is it simply that? You can ask something like "what are the best PC specs (like GPU) to run Applio RVC locally?". For a decent PC that can also play some games and run AI tasks, a tenth-generation Intel Core and an NVIDIA GeForce RTX 3060 16GB can be enough, although there are newer Intel Core / Core Ultra + RTX 40 or RTX 50 options if your friend wants more performance and the budget allows. 
I've only hinted at potential PC specs; you can also do some research on the best PC for your needs, which doesn't always have to match my suggestion exactly.
Thanks Lucy, I appreciate
You're welcome. 
Hello, I have a question.
I've read the guide and my wokada works perfectly without any issues. Is it possible to connect wokada from my PC to my smartphone because I want to play games on it and use the voice changer.
Sorry if my English isn't very good because I'm using a translator.
Thank you
whats a good epoch amount for training a voice model?
-rvc
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
If ur gonna train a model use Kaggle applio or use local applio
i need some help with a error with applio it says: RuntimeError: Failed to import transformers.models.hubert.modeling_hubert because of the following error
operator torchvision::nms does not exist
i have rtx 3050 laptop gpu
i downloaded the precompiled 3.6.0 applio
i got a .index file, a .pth file, and a .npy file, what do i do with them?
it seems like a mismatch of some libraries
model.pth and the added index are used; .npy is a temp file
how do i use the files tho where do i put them
depends on the application you're trying to use
idk man i wanna put drake vocals on a lil uzi vert song
I just downloaded the latest cuda version of w-okada. I have a 5060 ti, and it says it's sm_130. But it says "The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90 compute_37." I don't know what PyTorch is, how it relates to w-okada, or how to fix this, any help?
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
1st two links
if u got ur voice changer from a youtube tut or if the links say demo client they're super old
yeah i got it from w-okadas github hugging face
possibly you did not unzip it properly, i'll give it a test in a sec
I'd say choose between vonovox or wokada tg fork
just gave it a quick test with inference, looks fine to me
do you know any programs that are more mainstream?
i cant find anything online about either of those 2
i used winrar. maybe if i try with 7-Zip?
The two I recommended are the only two that are good, just read the guide provided for the one you want to try.
Anything online will be either really outdated and old or pay to use
For Vonovox after you extract the file run setup, then run start
For wokada tg fork after extraction just run mmvcserversio
maybe, make sure you have enough free space
same error
i have enough free space
maybe its because a wrong cuda version of torch?
the compiled version comes with all required libraries and python
im in windows 10 21H2
yes
it usually does not cause issues, but who knows...
y'all is there a site to generate ai voices for free im not paying weights for this it used to be free
or train them whatever
so i need to uninstall?
what about on mobile sorry?
I see that it's PC only
soo do u know anyy
i checked my storage and i only have the drivers
so im trying vonovox now since the other one is outdated but it says model not properly initialized and its just warming up the voice conversion
you can try
what do i do guys?
What gpu do u have?
It only works with Nvidia:3
Ok 👍
so what should i do?
Try closing Vonovox and reopening it to see if it happens again
i did that already
is there like a tut for ai vc on mac
using Miniconda Prompt to activate the conda env of the precompiled applio and using this command "pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118"
fixed the problem, maybe you want to add this command to the guide if anyone have this problem too
i dunno why would you go with cu118, but okay
also you did not need to activate, you could just run env\python -m pip uninstall ... / env\python -m pip install
oh okay thank you
I got the site thing to work but it wont pick up the mic to discord
hello
I need help to find TTS or model for Wukong and Pigsy which use for "what kind of (X) is this/what kind of awesome is this" i found some website but it was paid https://www.reddi[.]com/r/HelpMeFind/comments/1lyv7ff/help_me_find_the_tts_for_the_what_kind_of_x_is/
that cuda version is old, only okay for GTX 10-series
then why these error happens with the cuda version of the precompiled zip?
i typed the command to see the cuda version and it shows cuda 13.0
i havent seen your gpu but I suppose GTX 10-series?
the latest cuda 13 needs 20-series/newer
rtx 3050 laptop gpu
can someone help me to make a chat bot?
that i can host for free :D
i have 6gb vram tho and 2060 gpu
:/
okada

saying "okada" doesn't mean anything or prompt us to do something, if you need help finding a voice changer for you id suggest you read the help guidelines and ask
Hi ! Hope ur all going great ! 😃
I wonder how to make a voice model, I have some minutes of a voice I like (worms voice lmao) and I would like to speech some text using this. How can I create a model please ?
Me? 
Hahaha 😂
you dont need a voice model for tts, you can just use a zero-shot tts that uses 10 seconds of audio
I'm still going to create a model I think. Because I already have excerpts from the voice of the Worms, and a good bunch of lines that I would like to create with the voice
how to use voice-models?
wait how can i train a model in russian?
when i choose gpu
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
oh right
How big is Microsoft Visual C++ Redistributable Package
Megabytes.
Try Vonovox instead of Deiteris W-Okada (b2332). That b2332 build wasn't made to run with GeForce RTX 50 series GPUs. https://github.com/dr87/Vonovox/releases/tag/v1.6.9
If you'd like to stay with W-Okada for easier interface, there's Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397
oh okay, then i download this 3 files?
voice-changer-windows-amd64-cuda.zip.001 and voice-changer-windows-amd64-cuda.zip.002. Not voice-changer-windows-amd64-cpu.zip.
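Most people just open the .001 file with 7-Zip once both parts are in the same folder. As a rough sketch of what is going on, and assuming the parts are plain byte-wise splits (which is how 7-Zip-style .001/.002 volumes usually work), they can be joined back into one archive like this. The helper `join_parts` and the demo file names are hypothetical.

```python
import glob
import shutil
import tempfile
from pathlib import Path

def join_parts(first_part: Path, output: Path) -> list[Path]:
    """Concatenate name.001, name.002, ... into `output`, in numeric order."""
    if first_part.suffix != ".001":
        raise ValueError("expected the first part to end in .001")
    base = first_part.name[: -len(".001")]
    pattern = glob.escape(base) + ".[0-9][0-9][0-9]"
    parts = sorted(first_part.parent.glob(pattern))
    if not parts:
        raise FileNotFoundError("no parts found next to " + str(first_part))
    # Refuse to join if a part is missing (e.g. .001 and .003 but no .002).
    expected = [f"{base}.{i:03d}" for i in range(1, len(parts) + 1)]
    if [p.name for p in parts] != expected:
        raise FileNotFoundError("parts are not numbered consecutively from .001")
    with output.open("wb") as out:
        for part in parts:
            with part.open("rb") as src:
                shutil.copyfileobj(src, out)
    return parts

# Tiny demo with stand-in files instead of the real download.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    (folder / "demo.zip.001").write_bytes(b"first half, ")
    (folder / "demo.zip.002").write_bytes(b"second half")
    used = join_parts(folder / "demo.zip.001", folder / "demo.zip")
    joined = (folder / "demo.zip").read_bytes()
print(len(used), joined)
```

If only one of the two parts was downloaded, extraction will fail, which is why both cuda parts are needed and the cpu zip is a separate build.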
Hi is there any video for creat a rvc voice model?

If you're looking for a YouTube tutorial video, those sometimes give outdated information, so I'd rather not recommend one. Instead, there's a guide for "Applio RVC fork".
-rvc
Ty
okay thankss
I'm using Applio and I am getting a weird robotic background noise on a model that used to work perfectly.
- Same model, same settings as before.
I even tried to redo an old recoding I had with the same setting and got a very different and much worse result.
all I did was sleep? and I woke up to my Applio just acting up
When you say all you did was sleep and Applio RVC started acting up, it sounds like some creepypasta storyline where the program changes its settings while you're unaware of it. 
I wish it was that but no, SAME settings SAME everything and all of a sudden the output is trash
Make sure to use the latest version of Applio RVC, review and redo your exact steps, and use your headphones to listen closely again.
Using 3.6.0
Headphones are full blast in my skull
I even wore the same clothes I was wearing yesterday and still it sounds so weird
If that issue has a possible explanation, it could be that the voice model itself is low quality, that the audio has background noise, or that you experienced something like the "Mandela effect", where you think you remember the initial generated audio sounding good, but in reality it sounded terrible the more you listened to it.
If you have these audio files (both the first and the last one) saved to your drive, you can send them here to confirm the issue actually persists.
Yes, the Applio RVC fork is preferred. The mainline RVC GUI (like RVC1006AMD_Intel.7z), however, is outdated.
Check out this one. https://docs.aihub.gg/rvc/local/applio/#amd-on-windows-precompiled-fix
Last update: August 9, 2025
i know this was tomorrow but i used the first one (applio) to train an alphabet lore model
Wdym it was tomorrow
beacuse it shows right closer on your username
it says "Ontem às 14:20" sorry if its portuguese
Idk what 14 o clock is as I use the 12 hour clock in America
But I think as long as you used applio for training a model it's probably fine
my w-okada voice changer is bad with my amd rx 7800xt 16gb vram
Did you get it off a YouTube tutorial
Idk what direct ml is so I'm assuming that's super old and outdated
Guides for Programs that use RVC Models in Realtime for Calls/Games
Yep, follow the guide and download the correct version for amd, I wouldn't know what the AMD version is but someone like Namari could
oke
Since I use Nvidia I only know the correct files to download for Nvidia
If u have questions on how to use it tho I know what to do there
hey yall, i have a question, i'm very new to the ai-voice changer scene, and i'm interested in potentially using w okada's voice changer to start streaming using my OC robot's voice, since she uses an old text-to-speech engine as her voice. What would be the best way to learn how to train my own voice model on clips I can generate using the old TTS engine?
I've no idea at all where to even get started with learning how to train my own model so any help would be very much appreciated! thank you :)
"Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?"
BRO
WHAT IS THIS ERROR
MY FILES ARE ALREADY SMALL AND ITS STILL SAYING THIS
STUPID COLAB
AND IT STOPS AT STOP TRAINING
I CANT TRAIN IT AGAIN
AND IM USING APPLIO TO MAKE IT AND ITS ALREADY SLICED
BUT ITS STILL MAKING ERROR
im still getting the SAME ERROR GRAAAAAAAAAH
-rt
it's the second guide
I don't have amd so I wouldn't know exactly what files to download but they are probably in the guide
and will this help me to train my own model to use in that wokada fork you suggested i use?
there's actually a seperate thing for that
wokada tg fork is only the voice changer
-rvc
ah okay, that makes sense
applio is the current most used voice cloning software
and the applio docs have info about training an rvc model for use within wokada?
And it ignored me
sorry i'm asking so many questions i'm jumping in at the deep end lmao
okay, i changed the "118" to "121" and now works fine
yes
the model u make with the audio can be used in both wokada and other places that use rvc
like for covers on weights or in applio itself
that's perfect thank you so much 
you're welcome ^^
btw if u have questions u can ask me, if I can't answer them maybe a mod or helper can
I have gotten this before too, idk how to fix it or what causes it
Could I self-host a small LLM? I have an AMD Ryzen 7 5700G, a Radeon RX 6600 XT with 8 GB of VRAM, and 16 gigabytes of DDR4
@viral mason i just installed tg develops what settings do i do?
or is there a tut?
did u run MMVCServerSIO?
do u have vac lite?
it's a virtual audio cable that would connect the voice changer with other software like discord or games in general
not yet but i will be downloading
Is there any reverse search tool
I got some girl dming me stuff on insta
Cant tell if she is using a fake id or not
Tried google lens but ntg came up
TinEye is a reverse image search tool
Tried but it didnt work
i really dont know anymore then
Welp nvm it ig
just use kobold
is the easiest way
LM studio
but given such limited vram, it'd be able to run < 10b llms
im using w-okada fork and for some reason, when I have my input/output set correctly, it occasionally plays on my browser tab and not through my vb cable
ok i fixed by selecting server and then client
how do i make voice changer work in dc?
anytime i select a different voice model, i get "An error occured during voice conversion..." but it still works, is there a reason or anything I should do to fix this?
2025-12-09 19:03:44,930 INFO [RVCr2] Initialized.
2025-12-09 19:03:44,951 ERROR [VoiceChangerManager] The expanded size of the tensor (9600) must match the existing size (7431) at non-singleton dimension 0. Target sizes: [9600]. Tensor sizes: [7431]
Traceback (most recent call last):
  File "voice_changer\VoiceChangerManager.py", line 212, in change_voice
    audio, vol, perf = self.vc.on_request(receivedData)
  File "torch\utils\_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "voice_changer\VoiceChangerV2.py", line 162, in on_request
    result, vol = self.process_audio(audio_in)
  File "voice_changer\VoiceChangerV2.py", line 152, in process_audio
    self.sola_buffer[:] = audio[block_size : block_size + self.crossfade_frame]
RuntimeError: The expanded size of the tensor (9600) must match the existing size (7431) at non-singleton dimension 0. Target sizes: [9600]. Tensor sizes: [7431]
hi guys, i installed vb audio virtual cable for my voice changer bc i saw that in a tutorial but now i cant hear any sound from my computer
Hello. Im using kaggle cloud RVC, and when I start server, this is the error I get.
2025-12-10 01:27:41,266 ERROR [VoiceChangerManager] Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same
what audio cable are you using? like vb
i was able to fix it
forgot to say
but
my thing is hella laggy
like
the voice changer has a pretty high delay and its always cutting like the sound
im using the best settings for it
dm me
can smb dm me to help me figure out why my voice changer has such a delay
YOOO IM INTERESTED SIGN ME UP MY GUY
lightning.ai also seems bugged idk what's wrong please help
it works fine
wdym
I cant post images here to show
but on lightning.ai it says
[SIO] rconnection failed Error: xhr post error
over and over again
on kaggle it says
2025-12-10 01:27:41,266 ERROR [VoiceChangerManager] Input type (torch.cuda.FloatTensor) and weight type (torch.FloatTensor) should be the same
i changed nothing in the code
are you trying to open applio?
or wokada
it is an RVC inference application, but the latest version (3.6.0) has a realtime feature. sorry, it and the cloud wokada are outside my knowledge tho.
Rice Cooker....
my mic isn't being picked up at all, i have it set to my regular mic and the output to my virtual cable
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Make sure to read the help guidelines before asking about the W-Okada realtime voice changer. Check your microphone. What is your PC's GPU? And did you follow any tutorial or guide before asking?
Sorry I fixed it thanks
Just using old models
Switched up to other ones and they worked fine
That doesn't answer anything.
Fine
I checked my microphone, was being picked up. Have rtx 3050ti laptop
And I followed a guide yes but it's fine now
I'm trying to get at the deeper issue, not just a short-term fix like the one you described.
If you have an NVIDIA GeForce RTX GPU, try Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397 If you followed a tutorial from YouTube, it likely told you to use an outdated version (v.1.5.3.18a).
I'll reinstall tomorrow and see if the older models work then thanks
I think the guy said that if you have amd to download a different github version (I did) but now I'm forgetting if he meant cpu or gpu
I have ryzen 5
AMD Ryzen is a CPU, while AMD Radeon RX is a dedicated GPU. Your laptop has an RTX 3050 Ti, so it's better to use a version compiled to run on NVIDIA GPUs. The "DirectML" variant of W-Okada can also work with NVIDIA GPUs, not just AMD/Intel, but it's not always recommended.
Gotcha
I hear there's another version of the software (or different software?)
What's the big difference
There's Vonovox, a complete alternative to the W-Okada versions. The UIs and settings differ a lot, though both use RVC voice models and work similarly.
Applio is an RVC (Retrieval-based Voice Conversion) fork. It isn't a realtime voice changer the way W-Okada is; Applio is used to make AI covers and to train RVC voice models.
Either way, RVC doesn't always mean realtime voice changer.
Now, delete your current MMVCServerSIO (voice changer) Kaggle notebook and try following this guide. https://docs.aihub.gg/realtime-voice-changer/cloud/tg-develops-w-okada-fork-kaggle/
Last update: September 6, 2025
Kaggle, Google Colab or you run locally?
8 GB of VRAM is a bit small by today's standards, although running a smaller LLM is possible if the model is optimized to use under about 7.9 GB of VRAM and, especially, if it works with Radeon RX GPUs. If you want more VRAM, the NVIDIA GeForce RTX 3060 12GB is a starting point.
If possible, you can ping the Helpers role to call other helpers to look into your issue. When you rant in caps lock or accuse people of ignoring you, others will either wait for you to calm down or not talk to you at all.
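A back-of-the-envelope sketch of that VRAM math in Python. It uses the common rule of thumb that model weights take roughly (parameters × bits per parameter ÷ 8) bytes. The function names and the 1.5 GB overhead figure are assumptions for illustration only; real usage depends on the runtime, context length, and quantization format.

```python
def estimate_weight_vram_gb(params_billions: float, bits_per_param: int) -> float:
    """Approximate memory for the model weights alone, in GB (10^9 bytes)."""
    # 1 billion parameters at 8 bits each is about 1 GB.
    return params_billions * bits_per_param / 8


def fits_in_vram(params_billions: float, bits_per_param: int,
                 vram_gb: float, overhead_gb: float = 1.5) -> bool:
    """Rough check: weights plus an ASSUMED fixed overhead must fit in VRAM."""
    needed = estimate_weight_vram_gb(params_billions, bits_per_param) + overhead_gb
    return needed <= vram_gb


if __name__ == "__main__":
    # Hypothetical model sizes checked against an 8 GB card.
    for size_b, bits in [(3, 16), (7, 16), (7, 8), (7, 4)]:
        weights = estimate_weight_vram_gb(size_b, bits)
        verdict = "fits" if fits_in_vram(size_b, bits, vram_gb=8.0) else "too big"
        print(f"{size_b}B @ {bits}-bit: ~{weights:.1f} GB weights -> {verdict}")
```

By this estimate, quantized models in the single-digit-billion range are the realistic target for an 8 GB card, which lines up with the "< 10b" comment earlier in the chat.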
I'd prefer to stay away from Nvidia where possible largely because I don't like the company and because I use Linux
Nvidia, famously and historically, provided rather poor driver support for Linux
Your opinion, your choice. 
While I use a Linux distro for some things, like trying to compile FFmpeg from source, I still struggle to get the compiler going without having to run sudo apt-get or install the required packages (like libfdk-aac and libx265). I'm more interested in NVIDIA GPUs, but if you can find an AMD Radeon RX that's comparable to a specific high-performance NVIDIA GPU, you could go with that; the same goes for Intel.
I see
Debian, I take it?
I once tried Debian in Oracle VirtualBox, but I forgot to select the GUI install and connect to the internet during installation, so it came up as a text-only terminal. For more consumer-friendly use there's Ubuntu, while Debian leans a bit more professional. There are other Linux distros available, but Debian and Ubuntu are the ones I know.
I feel like those mistakes regarding Debian are on you
Yes, these were my mistakes. 
Yeah
I can definitely still recommend Debian though
nah it's true that the Nvidia driver support is worse than AMD one in linux
ubuntu and arch/derivatives aren't bad either
Arch itself isn't bad either, if a little complicated
I use Arch myself
im lost. where to train a model, applio kaggle gave a model format (512, 256, 1) when it shouldve been (256,256) or smth
it's more likely a mismatch between the sample rate used in preprocessing and the one the pretrain and/or the vocoder (hifigan or refinegan) expects.
to be sure, show a screenshot of the configuration (ask me for image perms first).
i used 40k sample and refinegan
read the pretrain description, it is 32k
so you should use 32k in the preprocess
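A rough sketch of the check described above, in Python. The function name `check_sample_rates` and the label table are made up for illustration and are not part of Applio. The only thing taken from the chat is that the preprocess sample rate has to match the pretrain's (32k here, not 40k).

```python
# Hypothetical helper, not an Applio API: compares the sample rate chosen
# for preprocessing with the one the pretrain was trained at.
SAMPLE_RATE_LABELS = {"32k": 32000, "40k": 40000, "48k": 48000}


def check_sample_rates(preprocess_label: str, pretrain_label: str) -> bool:
    """Return True when both labels map to the same sample rate in Hz."""
    preprocess_hz = SAMPLE_RATE_LABELS[preprocess_label]
    pretrain_hz = SAMPLE_RATE_LABELS[pretrain_label]
    return preprocess_hz == pretrain_hz


if __name__ == "__main__":
    # The situation from the chat: 40k preprocessing with a 32k pretrain.
    for preprocess in ("40k", "32k"):
        ok = check_sample_rates(preprocess, "32k")
        status = "ok" if ok else "mismatch, redo preprocessing at the pretrain's rate"
        print(f"preprocess={preprocess} pretrain=32k -> {status}")
```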
ohhhh that makes sense thennn, thanksss
but what if i dont use applio for conversion? can i use the refine?
and is there much difference using hifi-gan instead of refineGAN?
AMD Radeon(TM) Vega 8 Graphics
Windows 11, Laptop
im trying to use the realtime voice changer, but the outputting audio keeps getting cut
i used this tutorial https://youtu.be/SxdnGxicJOg?si=9EA6ujmKQbLToCIN
Chunk: 128
Extra: 16384
This AMD Radeon GPU is an integrated GPU, often found in AMD Ryzen systems. A mobile dedicated GPU would be something like an AMD Radeon RX xxxxM. Is there any other GPU in your laptop?
If your laptop only has an integrated GPU, it's better to go for an online option like Kaggle for much better voice changer performance. Getting any W-Okada DirectML version to work with an integrated GPU would perform at a fraction of your AMD Ryzen CPU's speed, or in rare cases fall back to CPU-only.
Oh alright thanks it seems it's better to try out kaggle
Damn thanks
I'm sorry, do you know how can we import weights models into applio ?
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
👋
What's the best way to generate celebrity AI voices with Play HT? Are there any good GitHub repo with audio files or something to train the models or should I only find YouTube clips to upload? Thanks in advance 🙂
Why does it seem like every voice sounds terrible even tho the audio from the creator sounds amazing, i have tried almost everything
👋
is vonovox better than w-okada
could someone help me with tg develop w-okada? when i launch the program this error happens
Could not load library with AVX512 support due to:
ModuleNotFoundError("No module named 'faiss.swigfaiss_avx512'")
2025-12-10 17:29:44,181 INFO undefined undefined undefined undefined [loader] Loading faiss with AVX2 support.
2025-12-10 17:29:44,199 INFO undefined undefined undefined undefined [loader] Successfully loaded faiss with AVX2 support.
2025-12-10 17:29:44,203 INFO undefined undefined undefined undefined [init] Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes. This is only an error if you're trying to use GPU Faiss.
Yes, it's Nvidia only tho
Is it possible for it to save my model and settings? Even after I stop running it and start it again
-colab
i have a 5070
I want to create my own ai cover but I don't know where to start. I prepared mp3 files of the vocals I obtained to train but I am lost on which ai sites to use and how to utilize them without a GPU.
i need help, vonovox isnt opening for me
i clicked the start.bat
i already ran the setup.bat
nvm, but i tried opening a model and it wont give any audio when i talk
whats the problem
the advanced settings if u prob want it
hi i installed the thingy for the nvidia model right and i used winrar extracted it then i try open start.bat it says some stuff then i click run the other option is to extract all? and then a black prompt screen pops up for a sec then shuts off
Does anyone have basic models for 100m-200m parameters, for code/Ukrainian language testing? I need a basic model to further train and use for my own purposes
why applio's notebook for kaggle is on github and the page doesn't exist?
You sure that's the correct way to access the .ipynb file on GitHub?
idk, i just clicked the link in the -kaggle command
Who made it?
A Kaggle ipynb file is supposed to be imported on the Kaggle website. You wouldn't open the notebook file directly on GitHub unless you know what you're going to code inside the ipynb file.
My voice seems to cut off a little at the end like if i say "Hello guys" it sounds like hello guyz or so and that makes it sound like ai
Using Vonovox btw.
how did u make it give audio bruh
mine doesnt give audio when i talk
help
I just selected my input device and output device and press start 😭
same bruh
show ur settings
vonovox settings
Send a ss of your settings
You selected a model right?
ye
can u show the voice setting if u scroll down
Where are you using it in or nowhere for now?
tf u mean
as output set your headset or whatever ur using to hear stuff
u cant hear whatevers going to virtual cable
if u want to hear yourself then u can use an app thats included in virtual cable, its called audio repeater (MME) and there u select wave in the line 1 and wave out ur headset or whatever
put virtual cable as output and select it as input where u wanna talk
