#✨│ai-help
oh alright
my laptop dont got a dedicated gpu
i cant run rvc models
is rvc v2 gonna work on w okada's voice changer
either buy a better laptop with dedicated gpu or use cloud services with limited free gpu time
you can't do much about it
AI needs powerful resources
It's not really simple 1 click all free
does 16 gb ram help?
what about beatrice v2 how do i download models for it
it works on cpu too and is faster right
i downloaded, right after i deleted it, my mic started going static why is it happening?
It's ass
oh alr
Nobody makes Beatrice models anymore, is ur PC able to run wokada deiteris
i didnt test
i dont got a dedicated gpu
O
If you have the money you could get a 3060 possibly
That would run any of the 3 voice changers just fine for normal use
i got a laptop can it be inserted now?
Oh.. a laptop
Yeah laptops are not good for running any kind of ai
Not locally at least, you'd have to use Kaggle or something
is there something i can do
im not an expert
can laptops have gpu added in them
or do i upgrade to 16 gb ram then it will work
Either try out the online alternative on kaggle or try and save up money for a desktop PC with an Nvidia GPU around 20-30 series
That's really your only two options since I'm not sure if laptops can have powerful gpus like that added
oh alright
will kaggle allow me to use this real time voice changer on discord calls?
I have no idea as I have always used the local version and don't even know how to use the version online
I would ask Nick he's pretty smart and knows more about it
doesn't matter
no one trains beatrice v2 models
it's worse than rvc v2
oh alright
@low shard
Oh hey look, I summoned him
how do i get my mic back to how it was? it used to have no static at all, after i fix it ill re-download again.
what gpu do u have?
you downloaded an outdated voice changer since earlier u said u got it off youtube
so I wanna get u a better one
intel
No idea, Intel isn't made to run this so it struggles horribly with it. I only know how to properly set up wokada and have no idea how to fix any weird issues with it
Maybe try reinstalling it
Hello, I am running Wan2.2 Animate locally using ComfyUI and having an issue: when I click run, it processes a little and then gets stuck. Is there anyone who can help me with this?
Hello, hello, good afternoon, how are you? It so happens that I've been away from Colab for quite some time, and I decided to come back, but I don't think I'm up to date with the updates. When I go to my last saved settings page, I get this error.
/bin/bash: line 1: ./MMVCServerSIO: No such file or directory
WARNING:pyngrok.process.ngrok:t=2025-11-20T18:34:27+0000 lvl=warn msg="Stopping forwarder" name=http-18888-4eba4a04-6a04-49ac-9a4c-68c715b272ef acceptErr="failed to accept connection: Listener closed"
Does anyone know how I can fix this?
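The `No such file or directory` line above means the shell could not find `./MMVCServerSIO` in the current working directory, which usually points at a download or extraction step that did not run (or an outdated notebook). A minimal sketch for checking this from a notebook cell, assuming the binary is expected in the current directory as the error message suggests; the helper name `describe_binary` is made up for illustration:

```python
import os

def describe_binary(path):
    """Report why a program at `path` might fail to launch."""
    if not os.path.exists(path):
        return "missing"          # the download/extract step likely never ran
    if not os.path.isfile(path):
        return "not a file"       # e.g. a folder with the same name
    if not os.access(path, os.X_OK):
        return "not executable"   # on Linux this usually needs chmod +x
    return "ok"

# Path taken from the error message; adjust if the notebook uses another folder.
print(describe_binary("./MMVCServerSIO"))
```

If this prints `missing`, re-running the notebook's install cell or switching to a currently maintained notebook is the more likely fix than anything ngrok-related.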
well, i deleted everything, i mean everything related to the app thingy, and my mic kept doing the static
I set up tg wokada fork for the first time but using audio effects causes the cmd to crash and thus losing connection. Theres no error message either. I cant find anything about it
yes, with a limited free gpu time, and you need to verify your account and phone number
mm that's weird
what's your pc gpu, os and the version you used?
yo nick how you been
I didn't play much with voice effects on that program but like I don't remember having any crashes
good, wbu?
I am using the latest version b2397 from 3 weeks ago
RTX 3060Ti, Windows
It works without any effects, it's only when I use effects that it crashes after clicking start. I tested on server mode with windows wasapi, as well as client mode which uses mme
good good ty
ohh that version used to be pre-release, now it has been changed to a normal release
I just tried it with an echo effect on the output and I haven't gotten any issues
what specific effect did you try?
I tested on output with equalizer (crashed), I tested chorus (crashed), then tested a random effect for input to see if it's just that, but it crashed as well
So im guessing every effect is likely to crash
i can try with an earlier build but doubt itd change anything
It works now. I ran a windows update and that seemed to have fixed it
I dont know the correlation between that maybe theres something in the code that fetches something but ya
I tried with the latest one you said, I just didn't know it was now a normal release
That's pretty weird, may I know the exact windows build and update so I can say it in the docs?
And btw are you aware of Vonovox?
how do i use the voice models on a mp3 or wav file
im using vonovox and it just never finishes "warming up voice conversion" i've already closed and reopened it a few times
I havent been in the game since around the time I had to step down as a helper. I knew about it at the time but nothing else, why?
The windows update part is just speculation but it did conveniently start to work again after updating
https://support.microsoft.com/en-us/topic/october-14-2025-kb5066791-os-builds-19044-6456-and-19045-6456-657e5143-6c5d-4401-8efa-1641ca93c051 this was the last update i had before updating to the most recent from november 11
What's your PC GPU and OS?
What's your PC GPU and OS?
me or him ?
4070 windows 11
Just talking about Vonovox because I thought you didn't know about it, since it does have some performance updates
Ehhh, not sure whether to add the windows update part then
Both of you, it's an important info
AMD Ryzen 5 7600X 6-Core Processor
oh
Windows 11 or 10 right?
yeah
Be sure your windows is up to date, check for GPU drivers
You have the Nvidia App?
yeah
alr, downloading it rn
Get Applio, an RVC fork: https://docs.aihub.gg/rvc/local/applio/
Last update: August 9, 2025
ok and what does that do
The Nvidia server is pretty useful
You can use RVC (Retrieval-based-Voice-Conversion) models from #1175430844685484042 on audio files, like ai covers
oh
so whats the difference between that and a site that does it for me
will it sound better if it runs on my pc
Isn't that what you asked? You wanted to use the rvc voice models on audio files
yeah
This is a local program, it runs on your PC power
It won't be as easy as using like weights.com, but it doesn't rely on a server and you can control it however you want
A site like weights.com is of course easier, but they use the same program at the end of the day
ok thanks
ill just use the website for now
i was just wondering
if the ai quality would be better running on my system instead of the website
i think its more for speed than quality
are you up to date with all of the variants? is that the current favorite still? i havent looked into the voice changer in a while, for example tg released after i left so thats why im interested atm
but i saw it was really complicated and whats up with the codes
i saw many people on youtube doing it with google colab, can i use it instead? i didnt see kaggle tutorials
whats better
Kaggle gives 30 hours a week, Google Colab will give 4 hours a day at max
That fixed it, is there anything else I need to know about vonovox?
What site are you talking about btw?
Yw and well, pretty much every setting is explained in the ai hub docs, be sure to play with the pitch and try an extra time like 4.5
The docs didnt include anything about smart sine, is that important?
is the performance same and the download method too
I believe so yes
There isn't much new tbh, but wokada tg develop fork is only QOL and UI, the developer doesn't work on performance related things
When I select the L4 or A100 GPU on the Colab site, I get the error "An error occurred during voice conversion. Check command line window for more details." What's the solution?
Check command line window for more details
Hi all! is it possible to request help to get feedback from a study about AI?
can someone help me download the voice changer im trying to troll but its not working
What gpu do u have?
i dont know how to see that i want some help on a call ive never done this before
Just open your task manager and click the performance tab on it
As long as you're not using a laptop you should have a decent gpu
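Besides the Task Manager route, a small script can report whether an NVIDIA GPU is visible at all. This is only a sketch: it assumes the NVIDIA driver is installed (which provides the `nvidia-smi` tool), and the function name `list_nvidia_gpus` is made up here. An empty result means no NVIDIA GPU was found, which is the case for Intel or AMD integrated graphics.

```python
import shutil
import subprocess

def list_nvidia_gpus():
    """Return GPU names reported by nvidia-smi, or [] if none are visible."""
    if shutil.which("nvidia-smi") is None:
        return []  # tool not installed: no NVIDIA driver, so likely no NVIDIA GPU
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True, text=True, timeout=10, check=True,
        )
    except (subprocess.SubprocessError, OSError):
        return []  # driver present but not responding
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]

gpus = list_nvidia_gpus()
print(gpus if gpus else "No NVIDIA GPU detected")
```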
intel arc?
all it says is intel(R)
You're cooked 
why
how
Leave it on, it's a smart algorithm that doesn't process noise at all, as said in the 1.6.5 changelog:
One of the most useful features I have developed thus far. This is a replacement for the old VAD and takes its spot as the checkbox. This is a smart algorithm that will disable the SINE generator's ability to translate noise to speech. This means most noise like isolated static and random noises can no longer be mapped to false speech and generate static, NSF generator will not process them. However, extremely loud and consistent noises like slamming on a surface may still trigger false positives , however most normal noise will be stopped. I will continue to improve this, but the first version is very effective so far.
Also, the docs have just been updated on that part
Anyone has tried any good ai humanizers?
Ai humanizers are just LLMs rephrasing
I wouldn't trust AI Detectors
because it can never be a human
it is integrated one which won't work
ok i already found a different one that will work with the models
Hi, Can anyone help me?
I'm looking for singer models, but not artists or celebrities, rather singers who have trained their voices and have good quality, similar to the AUDIMEE models.
then stop subscribing colab pro
Check command line window for more details
also you haven't explained the problem and which colab page being used
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
NVM it was a scam you have to, pay money
???
they were like ya this is free this is sooo free and it wasnt btw if i had the right gpu i would have this voice changer
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows and without cloud options. GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
is this good i dont understand what does it mean
it doesn't say what the numbers are for ur amd card
weird
did u click the right one
i went to task manager and performance
ill send full screen ss
hmm
I guess u could try wokada deiteris
first is vac lite which is a virtual cable u will need to use it in games second is wokada deiteris (the voice changer)
oh
will it work?
cuz w okada voice chnager didnt
and whats the difference
is quality same and stuff
it should
better quality than what you would find on yt
bc the stuff on yt is all outdated
is it the same as the one i downloaded
why would this work and not the one i downloaded
idk what the one u have currently is
the normal one
there is no normal one just different versions
anything to do with beatrice means it's super old
what
i downloaded the latest one
wait ill send ss where i downloaded
from where
i followed this video and used the link in description https://youtu.be/SxdnGxicJOg?si=HYeg7KXvsrEMvJ2J
I will go over every single step needed to fully understand and use this powerful AI-voice changer. U can use it for trolling people or whatever purpose floats your boat.
💾VOICE CHANGER DOWNLOAD (GITHUB): https://github.com/w-okada/voice-changer/blob/v.2/docs_i18n/README_en.md
💾VIRTUAL AUDIO CABLE DOWNLOAD: https://vb-audio.com/Cable/in...
but on the link they gave i downloaded the latest one
still the code to that is outdated compared to the current stuff
idk what that is
how would that one work does it not require a gpu? and is it that different
whaa
if u don't have a gpu ur cooked
i already told u everything before did you forget
i dont have it
I have a very bad memory
?
my specs
is that not a gpu?
🤷♀️
tbh no clue at this point
vro
no
unless you wanna buy a pc/laptop with a dedicated gpu
i cant until atleast 3 years so yea
my dumbass didnt know gpu stuff
and just bought this
should have spend a little more and got one with dedicated gpu
(Removed this due to nvm)
does anyone know some settings to change to make their background vocals less glitchy on rvc
?
dont expect to run locally with just integrated gpu
and 8 gb ram is too low for most modern applications under win 11, unless you switch to some lightweight linux distro
i can upgrade to 16 gb ram then it will work?
if you arent using it while playing games and are fine with a massive delay + stutters
then yes
will it work if im in dc voice call
i dont want too much delay or too much stutters
I'm having trouble getting the MMVC server to open on Mac. I'm using a Mac mini M4, but it says it may contain malware that will harm my Mac, and only offers Move to Trash or Done
Know what ?
expect a delay of at least a few seconds
alright
i have downloaded the w okada voice changer latest version
using this tutorial
I will go over every single step needed to fully understand and use this powerful AI-voice changer. U can use it for trolling people or whatever purpose floats your boat.
💾VOICE CHANGER DOWNLOAD (GITHUB): https://github.com/w-okada/voice-changer/blob/v.2/docs_i18n/README_en.md
💾VIRTUAL AUDIO CABLE DOWNLOAD: https://vb-audio.com/Cable/in...
its not working rn so your saying it will work on 16 gb ram?
thanks
its keeps telling me to use online hosts like kaggle
my pc specs are 8 gb ram 5500 u amd ryzen 5 lenovo slim idea pad 1 no dedicated gpu
is it going to work or should i use kaggle
i downloaded the original one and it wasnt working, is deiteris gonna work
then use an online host
hi, what are some good prompts to generate video game music?
but i got the recommended stuff
the things recomended in the ss
is it the same
why do you keep asking the same questions over and over
read what it says
and decide for yourself
do you want the bare minimum
then chatgpt it
ok
specifically sci fi type music, they all sound generic lol
last thing, i downloaded the vb audio cable but the guide says vac lite. should i delete the one i have and download the one the guide says? does it make a difference
the same volume can not be used as both the source and destination 
trying the 5k series download it said extract the zip folder and it will do the rest automatically but it's not doing that
???
that's the error it gave me
0 context to this
when i tried to extract the zip
What zip
^^^^^^^
well i can't post screenshots in here so i guess i will be super specific
real time voice changer 5k series fork
Ooh cool, same as mine
yeee one sec let me show you what it's telling me
Download all 3 files, then extract the .zip file, it will automatically extract ALL 3 FILES into one. Then open the MMVCServerSIO folder and run MMVCServerSIO.exe (or called MMVCServerSIO if you don't have extensions activated).
U should really use Vonovox tho tbh, it's way better than both wokada deiteris and the tg fork
ah is that something new
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows and without cloud options. GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
First guide has it
ok cool i will do that instead then ty
Np! If u need help or have questions I'll be here until I fall asleep
Which could be at any time
yeye i'll check out the guide, should be pretty straightforward i imagine, but yea the other one said extract one folder and it will do the rest but that was not the case lol
This one is just one folder, on the GitHub page it's just a zip called source code, and then when u extract it run setup.bat then start.bat
The layout of it is very different tho from wokada
But it functions the same just way better
oh nice this one can use two gpu's as well
This is ONLY For users with 2 GPUs in the same system, you must do the following:
Open NVIDIA Control Panel
Go to Manage 3D Settings > Program Settings Tab
Add python.exe from the Vonovox runtime folder (runtime\python.exe)
Set both settings "CUDA - GPUs" and "OpenGL rendering GPU" to the GPU you want to use for conversion
Yea ur 5070 is perfect for it
Btw I seem pretty chill would you wanna be friends?
Oh shit you're also in the Film Actors Guild?
I was just about to ask if u had vrchat
Ted is awesome
Haven't really been showing up to the acting things recently because of work or just not being home when it happens
yeee same i reformatted my pc for my new monitor and wanted to get the voice changer to troll with on my xbox
can route it into console with the remote play thing lol
yeee lol
What are u going to use btw, I'm guessing maybe Winnie the Poo?
not sure i started playing overwatch again so might do something on there
yeye true
i also have 50 different avatars on vrchat that i swap into so i could get those voices again and mess around
I have vrchat + so I have a lotttt of Avis and voices for most
How???
but not going to leak the secrets xd
O
not in public anyway but it's a routing thing
Fair
can be used on phone calls which is not the best thing for scammers to have access to lol
U must be a super genius
only sometimes 
oh this uses asio audio as well what the helly
lots of settings on here
I'm too slow to even figure out sometimes how long to cook the chicken nuggies
-# I am 19 btw
Full screen the app, it is full of settings to mess with, and some you shouldn't
Ok, I'm trying to figure out how this works. I managed to figure out how to disable my Mac's automatic malware check and I opened the files, but it just keeps popping up numbers and trying to connect to a server in my terminal? Any advice on what to do?
?
Yea I have to show pictures to explain it
Maybe tomorrow I’ll try again i need my sleep
what settings would max out the quality i see block size and extra time and some advanced settings
Tbh just put crossfade at 0.15
And uhhh
Extra at 4.0
I think that's possible
Is it?
Ok for block size I'd say keep where it is for now
kk
Anddddd did u figure out the mic settings and where the pth and index files go
One of mine perhaps? 👀
cassidy from overwatch lol
I haven't made any ow models sooo def not mine lol
Btw for female models if you're a dude pitches around 3-10 maybe 12 at the most extreme is best
Try starting at 3 then go up
yea going up by 3 keeps it in the same key as the base pitch etc
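For a feel of what those pitch numbers do, here is a small sketch. It assumes the pitch setting is measured in semitones (the usual convention in RVC-style tools, but not stated in this chat); under that assumption each step multiplies the voice frequency by 2^(1/12), so +12 is exactly one octave up. The function name is made up for illustration.

```python
def semitones_to_ratio(semitones):
    """Frequency multiplier for a pitch shift given in semitones."""
    return 2.0 ** (semitones / 12.0)

# Values mentioned in the chat: 0 (no shift), 3, 10 and 12 (one octave).
for shift in (0, 3, 10, 12):
    print(f"+{shift} semitones -> x{semitones_to_ratio(shift):.3f}")
```

Starting low and stepping up, as suggested above, keeps the shift as small as the target voice allows.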
Huge tip to make it more believable, try to replicate how the character speaks, like if they have an accent or something
Sounding monotone will make it monotone etc
Be expressive
yeee i have been using the wokada stuff for a while, just didn't know about this new stuff, ty for the advice though
No problem:3
I have ppl use it after I help them and they say it sounds robotic, and either they're using an old crusty e-girl model or they're speaking like a regular person into the mic
No life at all 😞
Anyways I'm going to bed now was nice to meet you, can't wait to possibly see u in VRC sometime, my user is the same as my display name here
Google colab is good ?
you don’t install it, it’s on https://aistudio.google.com
Kaggle or google cloud good ?
Both Google Colab and Kaggle have their own trade-offs, depends on your use cases.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
for instant conversation realtime ?
Be more specific on which you're looking for (e.g. W-Okada, RVC).
E.g. ?
Is this the only app? I'm asking out of curiosity and I feel like an idiot.
@hallow thistle
The "e.g." in a paragraph stands for the Latin phrase "exempli gratia", which simply means "for example" in English. If you have no idea what any of these words mean, you can pick up an English dictionary.
But when you ask too many questions about the same thing (either Colab or W-Okada), I'm not sure what to answer.
you dont seem to be fluent in english
By the way, this is Google Colab. The word "notebook" refers to a type of document that contains blocks of code intended to run on a Jupyter Notebook-based website, which is what Colab uses. The notebook in my screenshot is not W-Okada or RVC; it's Music Source Separation Training (MSST), a kind of "Ultimate Vocal Remover", a distinct program used to separate audio into stems.
Oh okey
I am Turkish
I am a Turkish voice female package producer
We use the Fivem role-playing game
The site gives the same error, what is the reason?

from the application on the website
paid version
Not the correct answer. I mean like, which tutorial or guide did you get this Colab from or were you searching on Google and found one?
The hell. I literally asked you the question, and you still didn't give a proper answer.
yes
There's a Turkish YouTuber who does this, but he hasn't said anything about it. Do you know the solution?
For translating from Turkish to English, I'd suggest DeepL Translator, or Google Translate for a quicker translation. Please don't answer with something too short, since that makes things more confusing.
If you got a Colab notebook link from a tutorial video on YouTube, it's outdated. Try this one. https://colab.research.google.com/github/tg-develop/voice-changer/blob/master-custom/Colab_RealtimeVoiceChanger.ipynb
So what is your suggestion?
Is it a definitive solution?
Whatever you asked.
ok
I trust you
There's a guide on how to use "Tg Develop's W-Okada Colab". The guide is all English; if you still don't understand English you can use a translator right away. https://docs.aihub.gg/realtime-voice-changer/cloud/tg-develops-w-okada-fork-colab/
Last update: September 6, 2025
ok my bad
Is this a good setting overall?
@hallow thistle
You sure that's Tg Develop W-Okada Colab? If so, it is supposed to look like in my screenshot. In your screen, it's Deiteris W-Okada, which is a different one. https://cdn.discordapp.com/attachments/1439159243776331899/1439241897498513440/image.png?ex=6921b6f8&is=69206578&hm=971798b80329a7eeac4f6f5a0ddef6fee9e312785cb27a3087bcb3e69cda2fc7&
While both the Tg Develop and Deiteris W-Okada forks would work, the Tg Develop fork generally has more recent features; Deiteris W-Okada, while stable, got its latest update last year.
With "T4" GPU selected, here's your settings:
Chunk: around 80 - 100 ms
Extra: 2.7 s (above 2.8 s might be possible)
GPU: Tesla T4
Pitch Detection (F0): rmvpe
Input: microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: this one is optional; you can set this to your speakers/headphones to hear the program.
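To get a rough sense of what the Chunk and Extra values above mean in audio terms, durations can be converted to sample counts. This is a sketch only: the 48000 Hz sample rate is an assumption for illustration (the chat does not say which rate the program uses), and `ms_to_samples` is a made-up helper name.

```python
ASSUMED_SAMPLE_RATE_HZ = 48000  # assumption; check the program's audio settings

def ms_to_samples(duration_ms, sample_rate_hz=ASSUMED_SAMPLE_RATE_HZ):
    """Number of audio samples covered by a duration in milliseconds."""
    return round(sample_rate_hz * duration_ms / 1000)

# Chunk of 80-100 ms and Extra of 2.7 s, as suggested for the T4.
for label, ms in (("chunk low", 80), ("chunk high", 100), ("extra", 2700)):
    print(f"{label}: {ms} ms = {ms_to_samples(ms)} samples")
```

A smaller chunk means fewer samples per processing step and lower delay, at the cost of more work for the GPU.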
Oh, so you have the more powerful and expensive "NVIDIA A100" selected in Colab. While that one is good for training large AI models, it's more powerful than necessary for smaller ones like W-Okada and RVC. The T4 is still enough for those and eats fewer compute units than the A100, unless you're trying to flex your money that way.
oh okey
Why does this voice sound different from before, a bit robotic?
Additionally, you can go to "Advanced settings" in the program and try setting "crossfade overlap" to 0.15. Otherwise, if the audio is still robotic, it can be the voice model itself, so there's not much you can do about it aside from switching to another model.
Yes, of course. Click "close" to close the setting popup.
okıey
@hallow thistle hello what does the sample rate do in vonovox ?
Hi may i ask to not ping just one helper, if you need to please just ping the helper role and when one of us can help you we will
guys what is the best quality female voice model?
there isnt any
search for egirl voices or f4m voices
on weights.gg
Ew
What gpu do you have?
I dunno if your GPU is gonna be able to run anything of the voice changers but you can try
Vonovox can only run on Nvidia, but wokada deiteris and the wokada tg fork could possibly work with it
But still u could try Vonovox to see if it even works
Crossfade at 0.15
Extra at 4.0
Block size just depends on your gpu but default is usually fine
can realtek microphones work on this program?
And always keep the pitch at 0 unless it's a female model, then pitches 3-10, maybe 12, are good
that depends on my voice
i think
Any headphones work as long as the mic isn't really bad
nah dude the thing is
Yea but most models work best at 0 if they're male and you're male
the microphone isnt with usb
Wdym
it has jacks
Like the round thing
ok ill try
Lemme know if it works
what bro
Why are you looking for those kinda voices?
do i start start.bat or setup.bat then start.bat
Setup.bat then start.bat
ok cool
@viral mason
its saying a new release of pip is available 25.2 -> 25.3
do i wait
Should download that version then, unless it's doing it for you
its keep spamming that after a big chunk of texts
nevermind i got it
U can download Vonovox since u have a 30 series Nvidia card, just go to that first guide right there and download vac lite and vonovox off GitHub, all u need should be there
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows and without cloud options. GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
alr thanks
is there possibly a video tut ?
Nah it's not that complicated to download, for Vonovox just run setup.bat then start.bat and vac lite run setupx64
all good thanks
Np!
hey guys can someone whos rly into ai videos help me
any one work with livekit ?
i followed a video and used the same code but i get this error each time i speak
can anyone help me
line1 INFO received job request {"job_id":"","room":""}
line2 INFO process initialized {"pid":696263}
line3 ERROR unhandled
{"job_id":"AJ_cE2ZfCy7vvqq","room_id":"RM_X36Vc72HeCj6"}
Traceback:
line4 File "/backend/agent.py", line 32
async with model.start(room=ctx.room) as session:
AttributeError: 'RealtimeModel' object has no attribute 'start'
Nvidia Encoder is supported.
Nvidia Decoder is supported.
ERROR unhandled exception {"job_id":"AJ_cE2ZfCy7vvqq","room_id":"RM_X36Vc72HeCj6"}
Traceback:
File "/backend/agent.py", line 32
async with model.start(room=ctx.room) as session:
AttributeError: 'RealtimeModel' object has no attribute 'start'
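The traceback above says the `RealtimeModel` object has no `start` method at all, which commonly happens when a tutorial was written for a different version of a library than the one installed. A library-agnostic way to investigate is to list what the object actually exposes before calling anything. The class below is a hypothetical stand-in for illustration only, NOT the real LiveKit class, and the helper names are made up:

```python
class RealtimeModelStandIn:
    """Hypothetical stand-in object; the real class has a different API."""
    def session(self):
        return "session object"

def public_methods(obj):
    """Sorted names of the callable, non-underscore attributes of obj."""
    return sorted(
        name for name in dir(obj)
        if not name.startswith("_") and callable(getattr(obj, name))
    )

def call_if_present(obj, method_name, *args, **kwargs):
    """Call obj.method_name(...) or fail with a message listing what exists."""
    method = getattr(obj, method_name, None)
    if method is None:
        raise AttributeError(
            f"{type(obj).__name__} has no '{method_name}'; "
            f"available: {public_methods(obj)}"
        )
    return method(*args, **kwargs)

model = RealtimeModelStandIn()
print(public_methods(model))
```

Running `public_methods` on the real object, then comparing against the documentation for the installed version, shows which call the old tutorial's `model.start(...)` needs to be replaced with.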
what is livekit?
Anyone know if weights paid version gives you unlimited hours of audio recordings as well as longer audio files to make. Like 30 minutes audio files?
yes, i watched a video on youtube but it's a little bit old
don't pay :(
Question I have a Mac what file should I download? The one that says 3 months ago or the 2.14- alpha zip
I still dunno what livekit is
GUYS whats the best ai image generator
Hello I am looking for an RVC or an alternative for people with an integrated gpu
What gpu do u have? I'm not sure I understand
if you got an integrated GPU like Intel UHD, it's not really that good for Local AI Tasks
I mean it could run on CPU but very slowly
did you check if you have any dedicated gpu?
I do have one but my girlfriend doesn't and im trying to find an alternative for her
she can either:
- use your PC
- buy a dedicated GPU
- use cloud: a good remote PC
About cloud, there are different services with limited free gpu time:
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colab (max 4 hours daily of a T4 16gb gpu, not guaranteed on the free tier, but easy to use; there's a paid tier):
- Kaggle (a bit harder to use and needs a phone number, but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb; free only):
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Applio (UI)
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but more gpu time
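As a quick sanity check on "more gpu time", the two free quotas quoted above can be put on the same weekly scale. This sketch uses only the numbers from this chat (4 hours a day at most on Colab, 30 hours a week on Kaggle); both are upper bounds that the services may change, and the helper names are made up.

```python
# Quotas as quoted in the chat; treat them as upper bounds, not guarantees.
COLAB_MAX_HOURS_PER_DAY = 4
KAGGLE_HOURS_PER_WEEK = 30
DAYS_PER_WEEK = 7

def colab_weekly_ceiling(hours_per_day=COLAB_MAX_HOURS_PER_DAY):
    """Best-case weekly Colab hours if the daily maximum were granted every day."""
    return hours_per_day * DAYS_PER_WEEK

def sessions_per_week(weekly_hours, session_hours):
    """How many full sessions of a given length fit in a weekly quota."""
    return int(weekly_hours // session_hours)

print("Colab best case per week:", colab_weekly_ceiling())
print("Kaggle per week:", KAGGLE_HOURS_PER_WEEK)
print("2-hour sessions on Kaggle:", sessions_per_week(KAGGLE_HOURS_PER_WEEK, 2))
```

Even in Colab's best case the weekly totals are close, so the practical difference is that Kaggle's hours are a fixed allowance while Colab's are not guaranteed.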
If you want the easiest way and for free, is using https://weights.com/ which uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
Let me know if you didn't understand something
and nice Alastor pfp lol
Ok so I got this far and now I’m completely lost. I have my mic on but I can’t seem to record a voice? Also when I tried just putting in a pre-recorded test clip, it was just my voice. Anyone got any tips and tricks to help me out? Thanks
that's so old
I can see dust on it
Oh is it an out dated model is that why it’s not working ?
So what free soft ware would you recommend then ?
i have a problem when i have everything set up my ai voice is laggy but on basic models its working with no issues (RX 6600 XT)
depends, I think I remember ur username do u have a nvidia or amd gpu?
there are no basic models that come with any of the voice changers, what u have is the original wokada which is super old and outdated
u should try out wokada deiteris
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
lol nope a filthy Mac user with an m4 Mac min
Are paid monthly subscriptions worth it for those who don’t have ultra end GPU’s to vocal train?
I’m currently running an RTX 2080 Ti, probably meh for today’s standards
Ok np thanks for the attempt
just like, use kaggle
it's free and has 30 hours a week for free users
Interesting, how would I implement it with RVC for training?
all u really need to do is make an account with kaggle, phone verify and do this to set up the kaggle applio space for model training
this doesn't really explain in detail how to train but more of just what to do
kinda messy
That’s really cool!
I wish there a more in depth about it within Kaggle.
I understand Applio, just the whole Kaggle I need more learning.
However I appreciate you sharing, extremely helpful.
Thanks!
np I'm currently not mentally well enough to properly explain how to use it in depth ^^
I could show you how to use it but not explain
Has any one else experienced the RVC voice sounding good when listening to it via monitor but when it comes out of the other people’s speaker it sounds horribly like ai/robot? and where you able to resolve it?
where i can download rvc?
there's still Gradios? I mean unfortunately that Weights needs pay now and that Astra Labs already gone so there's no create AI Covers in easier way anymore ☹️😭😢
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
Which is colab for making AI Covers the most among these?
my voice changer is more using my cpu than my gpu
switch it from cpu to gpu mode
can anyone pls answer me which is colab for AI Covers and I hope it's free even if it's limited thanks
the one that says "AI cover maker"

Yeah but I mean there's another colab for AI covers as well because probably some other Gradios are better
So, you're telling me you know about that one and that there is better ones?

Yeah hehe
But Idk because I'm not familiar with newer/modern/current Gradios or Colabs yet
neither am i, but i do know that Kaggle has been said to be better than colab mainly for more free GPU time
-kaggle
Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification
by IAHispano
Kaggle
by Hina
Kaggle
by Hina & Deiteris
Kaggle
by Eddy, ArisDev & Nick088
Kaggle
by Eddy
Kaggle
by Shirou & ArisDev
Kaggle
by Shirou
Kaggle
Yeah thanks but I think that's only works for laptops or PCs especially Windows
Kaggle is in a way a upgraded better version of Google Colab, you can only use Colab or Kaggle on a PC
Yeah but I only have a phone and tablet so idk if that Kaggle works for mobile or Android

sorry, don't know
Oh okay, I think I just use Gradio or just make a "How Would__sing" videos that aren't AI Covers that using original version of the songs then or I will use Kaggle when I have a laptop if that's not work for mobile

sounds like you got a plan
Yep hehe
NVIDIA GeForce GT 730 is a decade-old GPU from 2014 with just 1 GB of VRAM (some variants have 2 GB). Trying to get either W-Okada or Vonovox to work with that GPU would likely either fail outright or fall back to running on your PC's CPU instead. Several members here have tried running on GeForce 700-series GPUs, which is far from ideal. Besides that, neither guide lists the GeForce 700 series as a minimum.
AMD Ryzen 9 9950X3D is a CPU released in early 2025, more than a decade newer than the GT 730. While it is possible to run W-Okada on the CPU alone, it would perform much slower than on a dedicated GPU like the AMD Radeon RX 9000 series or NVIDIA GeForce RTX 50 series.
To learn about the term "sample rate", look it up on Google. The sample rate is a number like 44100, 48000, 96000 or 192000, typically expressed in hertz (Hz), although some express it in kilohertz (kHz; 44.1 kHz, 48.0 kHz). It indicates how many times per second the original analog signal is sampled for its digital representation; a higher number captures more of the signal, while a lower one means lower quality. The sample rate setting is not just in Vonovox; it is also found in other audio software, audio files, and even your speaker/microphone settings.
that's a terrible setup, overkill cpu and too weak gpu that has no resell value. sounds like you could exchange it with 5800X3D + RTX 3060 12 GB / 4060 Ti 16 GB setup (prob in used market pricing) which is more capable to run voice changer and also for gaming
For a recommended sample rate, either 44100 or 48000 will work, but the latter is preferred. 96000 and 192000 are typically used in professional audio, where quality matters more than it does for general listening.
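To make the sample rate numbers concrete, here is a small Python sketch. The function names are made up for illustration; it only does the arithmetic of how many samples a clip holds at a given rate and the highest frequency that rate can represent (half the rate).

```python
def samples_in(duration_s: float, sample_rate_hz: int) -> int:
    """Number of audio samples captured in duration_s seconds."""
    return round(duration_s * sample_rate_hz)


def nyquist_hz(sample_rate_hz: int) -> float:
    """Highest frequency a sample rate can represent (half the rate)."""
    return sample_rate_hz / 2


for rate in (44100, 48000, 96000, 192000):
    print(rate, "Hz:", samples_in(1.0, rate), "samples per second,",
          "frequencies up to", nyquist_hz(rate), "Hz")
```

Human hearing tops out at roughly 20 kHz, which is why 44100 and 48000 are already enough for calls and general listening.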
Is there a free ai thats good at naturally removing objects
Google Gemini's Nano Banana can also do that, though you can try this one and spot the result.
I need a ai where I can circle the thing I wanna remove. I have a ai picture that has too many hands and I wanna show it which one to remove
stable diffusion/flux inpainting could work but gemini nano banana or flux kontext could do better to preserve untouched details
just tell which part to edit/remove in a prompt
if not sure, crop to the part to edit, process it, and put it back on the original image
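The crop, process, and paste-back idea can be sketched in plain Python. This is only an illustration under loud assumptions: a nested list of numbers stands in for the image, and `edit_region` is a hypothetical placeholder for whatever AI edit you run on the cropped part.

```python
def crop(image, left, top, width, height):
    """Copy a rectangular region out of the image (a list of pixel rows)."""
    return [row[left:left + width] for row in image[top:top + height]]


def paste(image, patch, left, top):
    """Return a copy of image with patch written back at (left, top)."""
    result = [row[:] for row in image]
    for dy, patch_row in enumerate(patch):
        result[top + dy][left:left + len(patch_row)] = patch_row
    return result


def edit_region(patch):
    """Hypothetical stand-in for the AI edit: here it just blanks the pixels."""
    return [[0 for _ in row] for row in patch]


image = [[1] * 4 for _ in range(4)]          # tiny 4x4 "image"
patch = crop(image, left=1, top=1, width=2, height=2)
edited = paste(image, edit_region(patch), left=1, top=1)
for row in edited:
    print(row)
```

Only the cropped 2x2 area changes; every pixel outside it is carried over from the original untouched, which is the point of cropping first.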
just installed the wokada , its not working . like no output or input coming .
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
- Operating System: (e.g., Windows 11)
- Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
NVIDIA RTX 5050 , windows 11 , just tried wokada voice changer it has delay in voice too
well you have a NVIDIA 50 series card, you should use vonovox
-rt
yeah have installed
it
how do i use it ?
didn't you just say you have W-Okada installed?
added the voice models
W-Okada and Vonovox are different, what does your UI look like? Did you get it from a yt video?
What is your question? We'll see if i can answer it
You mean text to speech?
W-Okada is a real time local voice changer program
Okay, and how many minutes of voice can it generate in one go
yes i did it first . it was not working . then installed vonovox
well i honestly don't know much about TTS stuff, i apologize
yes from yt video
Does it work
i cant share pic here
W-Okada doesn't have a limit other than how strong your GPU is, it's not a TTS program either
Okay,
ok, well anything from a yt video is outdated
What a waste of time. Soab
it will have delay for processing of your voice and depending on how good your GPU is
downloaded vo. from here
Ok, and im sorry, your issue was what again?
yes , how can i run it on vo
you told me i got rtx 50 series so should use vonovox
like to know how that works
ok, so you downloaded it, run "setup.bat" before anything
i did it and added voice model .
Okay, so now you should just need to start the voice changer and you will have to mess with the settings as things vary
Last update: November 21, 2025
guide here too
have downloaded and installed precompiled setup
need to setup the audio
you installed the virtual cable right?
? The virtual audio cable is for routing the output of the voice changer to discord and games and such
yes have installed it
voice change is not working
input and output should be our devices right
Input should be your Mic and output for now should be your headphones/speakers while we get it working
ahh
2sounds at time , how do i fix that one
2 sounds at a time? Never heard of that, what are your settings?
yes my orginal sound and model voice

ur settings, what are they?
come to support vc,will stream
its default one
see
Okay, say something again
you hear ?
Yea, ive never seen that before, i honestly don't know
sounds like your chunk might be a bit low

i apologize as i feel im not being able to do much
how do i change it
in Vonovox its called block
ahhh
under voice settings
turn it up by like 2-3
maybe a little more, sounds better to me
ahh
definitely better, seems you just need to mess with the block size and extra to work well with your GPU
is it working on idscord ?
If ur talking right now then i don't hear it but i was before yea
ahm
which input device should i select in discord
Discord input should be the virtual cable output
how do i turn on off the my own voice
mean whenever i start am hearing it like echo
idk what that is, Vonovox doesn't have any kind of voice passthru so that's what i mean when i say idk
im so very sorry
never seen that before
ahh okie
voice model voice is am getting
in echo
i understand that, i just don't know what it causing that as it shouldn't be happening
any alternative
The only other thing would be W-Okada, either the Deiteris or the Tg Develop fork, but Vonovox should really be working fine on your system, and I can't guarantee the others won't do the same if you try them
its working but thing is echo inapp
like when we testing audiio in discord
weird, so when you use it not on Discord its clear?
no no i
i mean inapp sound
in app sound? Im sorry im not understanding something here, might be cuz its past midnight
ur saying the actual voice changer?
yes , inappp voice
in wokada these is an option turn it off
? The way you're saying that is like a order, a command, im sorry im tired, can you elaborate more or worst case you may have to wait for another helper
when i talk i hear it from inapp . like an acho
well im usless for that then cuz idk, im very sorry

thank u for understanding, have a good day
lol diff id

!howtoask
Make sure to read the help guidelines before asking anything further, but if you see this as a waste of time, the issue might be on your end rather than with the program itself.
Let's keep things simple. Words alone can't always convey what you're seeing, so if you send some visual examples like I do, such as screenshots of the Vonovox UI, that would make it much easier to communicate. Otherwise, you can expect a very long conversation with little progress.
What are your settings on Vonovox? Because Vonovox looks like this.
can someone help me with why my real time voice changer sounds like its having voice cracks all the time
Make sure to read the help guidelines before asking anything about the W-Okada realtime voice changer.
nvidia Geforce rtx 4060 ti, windows 11, when i speak it just sounds like im having a ton of voice cracks on real time voice changer
Did you follow any tutorial or guide before this?
Rtx 5070, Ryzen 7 7800x3d, the voice changer works in cpu mode but doesn't with my gpu. This is the error code
: NVIDIA GeForce RTX 5070 with CUDA capability sm_120 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 sm_86 sm_90 compute_37.If you want to use the NVIDIA GeForce RTX 5070 GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/
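What this error says can be checked mechanically: the GPU reports architecture `sm_120`, and that value is missing from the list the installed PyTorch build was compiled for. Below is a minimal sketch that parses the message text itself (the helper names are made up for illustration). On a machine with PyTorch installed, `torch.cuda.get_device_capability()` and `torch.cuda.get_arch_list()` should report the same two pieces of information directly.

```python
import re
from typing import Optional, Set


def gpu_arch(message: str) -> Optional[str]:
    """Extract the GPU's own architecture, e.g. 'sm_120', from the error text."""
    match = re.search(r"CUDA capability (sm_\d+)", message)
    return match.group(1) if match else None


def supported_archs(message: str) -> Set[str]:
    """Collect every sm_XX listed after 'supports CUDA capabilities'."""
    _, _, tail = message.partition("supports CUDA capabilities")
    return set(re.findall(r"sm_\d+", tail))


ERROR = (
    "NVIDIA GeForce RTX 5070 with CUDA capability sm_120 is not compatible "
    "with the current PyTorch installation. The current PyTorch install "
    "supports CUDA capabilities sm_37 sm_50 sm_60 sm_61 sm_70 sm_75 sm_80 "
    "sm_86 sm_90 compute_37."
)

arch = gpu_arch(ERROR)
print(arch, "supported by this PyTorch build:", arch in supported_archs(ERROR))
```

Since the program bundles its own PyTorch, the practical fix is a build that ships a PyTorch compiled for that architecture, not a settings change.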
no i just got the link from a friend and copied his settings
b2332, b2377 or v.1.5.3.18a? I need you to be more specific.
v.1.5.3.18a
A properly working W-Okada voice changer shouldn't need you to make complex tweaks in the program yourself. Instead, try Vonovox, which is also made to support GeForce RTX 50 series GPUs. https://github.com/dr87/Vonovox/releases/tag/v1.6.9 https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
RUN SETUP.BAT AGAIN TO USE SWIFT!!!!
1.6.9
Small Algo update for Swift-F0
Swift-F0 has been added a a pitch extractor option. More info here
https://github.com/lars76/swift-f0/tree/main
https://git...
Okay perfect! tysm
That one is the original version of W-Okada and is outdated. Try Tg Develop's W-Okada fork (b2397). https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397 b2397 is now marked as the latest version on GitHub, while b2377 is a bit behind.
Let me know if you have any issue using Vonovox, a complete alternative to W-Okada. 
Okay thank you ill try it and see if it works
Will do tysm! 
the files that it says to download for nvidia can't be unzipped
use 7-zip or winrar, not the built-in windows one
You have to download two files, as the zip file split into two pieces.
And make sure to use 7-Zip or WinRAR instead of the built-in archive extractor.
it sent me to rvc voice changer website is that supposed to happenn http://127.0.0.1:18888/
It's completely normal for this specific W-Okada version.
oh okay perfect. how do i make it so i can hear myself on this version?
Oh tysm, is it normal for it to sound very raspy?
it looks exactly like this
everything was on default
no changes made,and fixed it now
Here's some more settings if you haven't yet:
Chunk: around 60 - 90 ms
Extra: 2.7 s
GPU: NVIDIA GeForce RTX 4060 Ti
Pitch extraction (F0): rmvpe
what voice changer should i get for my gtx1660
https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397
witch one?
both?
Yes, two of them.
okiiiii tym!
Both zip files have to be in the same folder, and use WinRAR or 7-Zip to open or extract the ".zip.001" one.
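If extraction still fails, a quick way to confirm that every piece of the split archive sits in the same folder is a check like the one below. It is a minimal sketch: the archive name `example.zip` and the helper name are hypothetical, and the demo runs in a temporary folder with only the first part present.

```python
import tempfile
from pathlib import Path


def missing_parts(folder: Path, base_name: str, expected_parts: int) -> list:
    """Return the part names (base.001, base.002, ...) not found in folder."""
    expected = [f"{base_name}.{i:03d}" for i in range(1, expected_parts + 1)]
    return [name for name in expected if not (folder / name).is_file()]


# Demo: only the first of two parts has been "downloaded".
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    (folder / "example.zip.001").touch()
    demo_missing = missing_parts(folder, "example.zip", 2)

print("Missing parts:", demo_missing)
```

An empty list means all parts are present, and opening the `.001` file with 7-Zip or WinRAR should then pick up the rest on its own.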
its not letting me extract
it let me extract now
weird
@hallow thistle what next
Chunk: around 160 - 180 ms
Extra: 2.7 s
GPU: NVIDIA GeForce GTX 1660
Pitch extraction (F0): rmvpe (not rmvpe_onnx)
Input: microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: optional; you can set this to your speakers/headphones to hear the program.
If you have set the audio processing to "server" mode, the "WASAPI" audio driver is preferred over "MME". Otherwise, switch back to "client" mode for much simpler settings and the ability to select noise reduction features.
In "server" mode with WASAPI, the sample rate in W-Okada has to be 48000 Hz, and your speakers/headphones have to be set to 48000 Hz too.
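For a feel of what the chunk values mean, here is the plain arithmetic in Python, assuming the 48000 Hz sample rate mentioned for WASAPI. The function name is made up, and the program itself may round chunk sizes differently.

```python
def chunk_samples(chunk_ms: float, sample_rate_hz: int = 48000) -> int:
    """Number of audio samples contained in one chunk of chunk_ms milliseconds."""
    return round(chunk_ms / 1000 * sample_rate_hz)


for ms in (60, 90, 160, 180):
    print(f"{ms} ms chunk at 48000 Hz = {chunk_samples(ms)} samples")
```

A bigger chunk gives the GPU more audio to process at once, which is easier on a weaker card such as the GTX 1660 but adds delay before the converted voice comes out.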
problem still
that one
If W-Okada has issues on Opera/Opera GX, try another web browser.
Always send a full screenshot of your W-Okada window; when you send only the error message, I can't see your current settings.
on google too
I said "48000 Hz", but your sample rate in W-Okada is still set to "44100 Hz". Click the red "stop server" button, change it, and try again.
That's how WASAPI's "shared mode" works, and the same applies to a lot of audio software. You can't mix different sample rates in the same audio system. The other mode, "exclusive mode" (in WASAPI, and also ASIO), can use any sample rate, but it mutes other programs' audio and leaves W-Okada as the only program able to output audio, which I don't think is ideal for Discord or any other program that shares the same audio system.
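The shared-mode rule boils down to "every device must match the program's sample rate". Here is a tiny sketch of that check in Python; the device names and rates in the dictionary are made-up example values, not read from any real system.

```python
def mismatched_devices(program_rate_hz: int, device_rates_hz: dict) -> dict:
    """Return the devices whose sample rate differs from the program's rate."""
    return {name: rate for name, rate in device_rates_hz.items()
            if rate != program_rate_hz}


# Example values only: check your real ones in the Windows sound settings.
devices = {
    "microphone": 48000,
    "Line 1 (Virtual Audio Cable)": 48000,
    "headphones": 44100,
}
print(mismatched_devices(48000, devices))
```

Any device listed in the result needs its sample rate changed in the Windows sound settings before WASAPI in server mode will work.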
i want it for discord tho
so what should i put there?
Sounds good now. Always check perf number in the performance stats to make sure it stays green.
I don't understand how to download and install this
!howtoask
Make sure to read the help guidelines before asking anything, instead of posting a small rant.
To find your PC's GPU name, open Task Manager, go to the Performance tab, find GPU 0 or GPU 1, and click either one to reveal its full name on the right side.
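For NVIDIA cards there is also a command-line route: the driver's `nvidia-smi` tool can print the GPU name. The sketch below wraps it in Python and returns an empty list when the tool isn't available; it won't find AMD or Intel GPUs, so Task Manager remains the general method.

```python
import shutil
import subprocess


def nvidia_gpu_names() -> list:
    """Return NVIDIA GPU names reported by nvidia-smi, or [] if unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []  # no NVIDIA driver tools on this machine
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True, text=True, check=True, timeout=10,
        )
    except (subprocess.SubprocessError, OSError):
        return []
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


print(nvidia_gpu_names() or "No NVIDIA GPU found via nvidia-smi")
```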
@hallow thistle hiii
the voice output from vonovox is stuttering
it was working fine before
You shouldn't ask a simple question and then immediately ping like this. For a general question about Vonovox or W-Okada, you can simply explain your issue; there's no need to mention/ping me as if I'm the only helper you're familiar with.
That doesn't mean anything. As the other helper told you earlier, you can ping the @ helper role if necessary.
!howtoask
arent u a helper
Make sure to read the help guidelines before asking anything.
ok
ive read it
rtx 4070 super
windows 11
vonovox voice output stuttering
no error message
<@&1159293204038955078>
You can send your Vonovox screenshot to here now.
I'm more familiar with W-Okada, so for anything about Vonovox I have to rely on the guide as my base knowledge.
i have that one too
should i use that to see if the problem persists?
For W-Okada, I'd suggest the Tg Develop W-Okada fork over the "original" one from YouTube tutorials.
are you running a game too meanwhile using vonovox?
it happens regardless
of whether i run a game or no
it used to run perfect before
even when i ran a game with it
do you get like delay or does it have issues with specific voice models?
huh?
all models have the same type of stuttering
there is an update tho
it prompted me to update
ofcourse update vonovox to the latest version
windows too
and ofcourse the gpu drivers too
okay
its fixed
by setting it to 3
didnt evne have to update
you deffo should btw
it's better to avoid any bugs and get the best
Ok
whats this
That is wokada tg fork
Any advice on how to train an RVC voice model? How many epochs should I use for a dataset of 15-40 minutes in general?
There is no specific epoch count for any dataset length, usually a model is fine after around 100-200
Just check each saved epoch with a short sample
what to put for a male
and female?
if you are a male and using a male voice for most of them keep pitch at 0
if u are a female using a male voice model tune the pitch down some
and reverse it for other way around
female voices are usually good around 3-10 pitch wise
try going up till it sounds good
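Assuming the pitch setting counts semitones (12 per octave), each step multiplies the voice's frequency by the twelfth root of 2. A short sketch of that arithmetic, with made-up function names:

```python
def pitch_ratio(semitones: float) -> float:
    """Frequency multiplier for a pitch shift given in semitones."""
    return 2 ** (semitones / 12)


def shifted_hz(base_hz: float, semitones: float) -> float:
    """Frequency after shifting base_hz by the given number of semitones."""
    return base_hz * pitch_ratio(semitones)


for semitones in (0, 3, 10, 12, -12):
    print(f"{semitones:+d} semitones -> frequency x {pitch_ratio(semitones):.3f}")
```

So +12 doubles the frequency (one octave up), -12 halves it, and 0 leaves the voice where it is.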
Formant Shift?
nope, pitch
i meant what about it
nope just changing the pitch will be fine
but if it sound bad still its voice problem?
most likely either the voice model is old and not really made well, or it could be ur mic and or background noise/people talking in your background getting picked up by the mic
Can't send pic here so Idk how to describe my issue but on kaggle the applio import keeps loading and is taking way longer than2-3 minutes this is ridiculous idk wth it isn't working for me.....
-colab
@strong nova What is your GPU?
@tame oracle Nvidia Geforce Rtx 3070
great that's more than enough for Vonovox
-rt
Thank you
No problem
why would u use that awful website :<
help
cant send picture
https://youtu.be/8Hq4PEZ30oY?si=v6tBH6DyUiCtjKJZ how to do this
Try installing Applio manually
ai voice changer for intel?
Intel Arc A/B, Intel Arc Graphics or Intel UHD Graphics? These are Intel GPUs, with the first one being a dedicated GPU while the others are integrated.
not quite sure but i think its an integrated
Processor Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz 2.50 GHz
Graphics Card Intel(R) HD Graphics 4000 (32 MB)
That laptop must be hella old, as its Intel Core CPU is third generation, though that's still a small step up from my laptop, which has a second-generation Intel Core i3.
does any1 know why my voice changer is not working
just randomly stopped 5 minutes ago
i have everything set perfectly
it just doesnt talk or anything
There is a newer version of this one, as well as an alternative. Tg Develop's W-Okada fork (b2397) and Vonovox, a complete alternative to W-Okada, are both known to work with NVIDIA GeForce RTX 50 series GPUs.
Vonovox can give better audio quality, although its UI may be less familiar than the W-Okada ones. https://github.com/dr87/Vonovox/releases/tag/v1.6.9 https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
Or opt for Tg Develop's W-Okada fork for the more familiar W-Okada interface. https://github.com/tg-develop/voice-changer/releases/tag/b2397 https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
VB-Cable can be confusing to set up, and you may end up with no audio coming out of the program, while Virtual Audio Cable Lite is much simpler to set up, as it has only one virtual line to use. https://software.muzychenko.net/freeware/vac470lite.zip
does anyone know why my voice changer is cutting out every time i talk
Are AMD GPU’s supported for training?
so vonovox is better than okada? and also does the same models work with vonovox
as if i were to be on okada
Both Vonovox and all W-Okada versions support RVC voice models. I haven't seen any benchmark comparing Vonovox and the Tg Develop W-Okada fork on the same system, but many people prefer Vonovox for its audio quality; that's as much as I know.
can i just drag and drop the model folder from rvc to vonovox or do i have to add it 1 by 1 manually
It is possible to train RVC voice models on an AMD Radeon RX GPU, especially with the Applio RVC fork (and the same goes for Stable Diffusion), but only specific versions of those programs will work, since many mainline programs made for NVIDIA GPUs often perform poorly or don't work at all on AMD GPUs.
RVC doesn't mean "realtime voice changer"; it stands for "Retrieval-based Voice Conversion". As for importing by dragging an RVC model, either from MMVCServerSIO's model folder or directly onto the Vonovox window, that doesn't sound like something either Vonovox or W-Okada supports.
so if i already have a ai model on rvc i cant use that model with vonovox?
I mostly focus on settings, but when you have this many questions and little progress, the Vonovox guide doc is the place to look. https://docs.aihub.gg/realtime-voice-changer/local/vonovox/
thats why im asking bc im confused, it says add .pth but rvc okada is using .safetensors so i dont see a rvc model when i try to add it

There's no need to make this more complicated than necessary. Both the Deiteris and Tg Develop W-Okada forks convert a .pth file into a .safetensors file by default, which is completely normal for those versions. To add a model to Vonovox, simply upload the original .pth file rather than the converted copy that is already inside W-Okada.
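If you're not sure where your original .pth files ended up, a folder search like this one can list them. It is only a sketch: the helper name is made up, and the demo uses a temporary folder with placeholder files, so point it at your own models folder instead.

```python
import tempfile
from pathlib import Path


def find_pth_models(folder: Path) -> list:
    """Return every .pth file under folder, searching subfolders too."""
    return sorted(p for p in folder.rglob("*.pth") if p.is_file())


# Demo with empty placeholder files instead of real models.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "voice.pth").touch()
    (root / "voice.safetensors").touch()
    found_names = [p.name for p in find_pth_models(root)]

print(found_names)
```

Only the .pth file is listed; the .safetensors copy made by W-Okada is skipped, and that .pth file is the one to upload to Vonovox.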
okay i see i got it
sucks i lost most of my original uploaded models but i have my most recent
For voice models, check #1175430844685484042.
nah i make my own bro i usually just delete them
i only care for my recent one so its fine
thx though im ab to test this is and i think from what im hearing already the quality is better
i do feel like rvc fake laughing
is better
could just be the model but idk
Hello everyone! I’ve been using the Hina Modified Realtime Voice Changer Client on Colab for a long time, but recently I’ve been having a problem where every model gets stuck at 96–99% during loading, even when using GPU and even with models that used to work perfectly.
I deleted my old notebook and re-uploaded it, but the issue still happens.
Could someone please share the most up-to-date and stable version of the Realtime Voice Changer Client, or let me know if there’s a newer working Colab version?
I would really appreciate any help. Thank you!
Can't I sing with my own voice? Can I only sing certain sounds?
If you have Google Colab Pro tier, you can run this voice changer notebook; the free tier otherwise can cause issues running either Deiteris or Tg Develop W-Okada fork on Colab. https://colab.research.google.com/github/tg-develop/voice-changer/blob/master-custom/Colab_RealtimeVoiceChanger.ipynb https://docs.aihub.gg/realtime-voice-changer/cloud/tg-develops-w-okada-fork-colab/
Last update: September 6, 2025
Make sure to read help guidelines before start asking anything about RVC or W-Okada.
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
- English Only: Please keep all conversations in English.
- Don't Ask To Ask.
I couldn't do it again. My problem is that I have an audio file that I want to model with RVC GUI, but the ready-made audio files have pth and index extensions, and since my audio file doesn't have them, I can't add them. How do I add them?
What is your PC's GPU? And which RVC fork are you trying to run? When you say "I couldn't do it again", it suggests you didn't read the help guidelines on how to explain your issue.
Without those details, it's hard to identify the issue and a potential solution, if any.
RTX 5060 Ti 16gb vram
Simple as that. The original, mainline RVC GUI, while stable in a few scenarios, won't work with NVIDIA GeForce RTX 50 series GPUs. The Applio RVC fork, which is superior to the mainline RVC, can work with RTX 50. Download Applio from Hugging Face. https://huggingface.co/IAHispano/Applio/tree/main/Compiled/Windows https://docs.aihub.gg/rvc/local/applio
-rvc
Attempting to run the original RVC-GUI with an RTX 50 would likely fall back to your PC's CPU instead.
Your initial query was confusing, so that's my best guess at what you meant. If you meant something else, let me know.
I have a question: how long does this last?
I usually used a file from Google Drive that I opened and it configured itself automatically, and I could use the voice for about 4–5 hours. So, approximately how long can I use the website?
Usually 4 hours a day for "free users". The question is, again, "how can you run W-Okada notebook on Colab with free tier"?
I’m just running the notebook normally on Colab Free. I open the link, connect to the runtime, and it works for me. I’m not using anything special.
Beware of random disconnections while running W-Okada on Colab with the free tier; some people here have had their Google accounts terminated for using the service on Colab. For more runtime and better protection against random disconnections, Google Colab Pro or the "100, 500 compute units" option is preferable to the free tier. Kaggle is another free option that works similarly to Google Colab and provides 30 hours a week with "T4 x 2" enabled within a notebook.
guys i installed deiteris-w-okada-fork for linux as per the guide
i even downloaded port audio as well
my question is
which mic do i use there in linux
i tried to run the https and it didnt do anything, then i just tried to open the voice changer itself, it opened but didnt show anything
Is there a Colab where I can create a model from scratch?
its stuck like this, what do i do?
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Google Colab notebooks by: IA Hispano; Hina; Eddy (2); Deiteris & Hina; Shiro & Eddy; Nick088 (2); Jarredou & Makidanye
help me @hallow thistle

There is a newer version of Deiteris W-Okada, developed by a different author. Tg Develop W-Okada (b2397) has more recent features than the Deiteris (b2397) one. I'm more familiar with Microsoft Windows, so for anything about Linux distros I have to rely on the guide as my base knowledge.
@hallow thistle which one i choose here
If you know how to download files or use commands in Linux, download these three archive parts from GitHub. https://github.com/tg-develop/voice-changer/releases/tag/b2397 Use WinRAR, 7-Zip, or terminal commands to combine the parts, starting from the ".tar.gz.aa" one, and then extract the result.
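If it helps, here's a rough Python sketch of the combine-and-extract step. The file names are placeholders I made up (check the real asset names on the release page), and it assumes the parts are plain byte splits of a single .tar.gz, which is what the ".aa" style suffix suggests.

```python
import glob
import shutil
import tarfile


def join_parts(prefix, joined):
    """Concatenate prefix.aa, prefix.ab, ... (in sorted order) into one file."""
    # Only match two-letter suffixes such as .aa, .ab, .ac
    parts = sorted(glob.glob(glob.escape(prefix) + ".[a-z][a-z]"))
    if not parts:
        raise FileNotFoundError(f"no parts found for {prefix}")
    with open(joined, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return parts


def extract(joined, dest):
    """Unpack the joined .tar.gz archive into the dest directory."""
    with tarfile.open(joined, "r:gz") as tar:
        tar.extractall(dest)


# Example call (hypothetical file names, adjust to what you downloaded):
# join_parts("voice-changer.tar.gz", "voice-changer.tar.gz")
# extract("voice-changer.tar.gz", "voice-changer")
```

The parts have to be joined in alphabetical order (.aa, then .ab, then .ac), otherwise the archive will be corrupt, which is why the sketch sorts them first.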
Did you not pay attention to my statement?
i did all these steps bro
my question is which mic i choose
More like "HD-Audio Generic: ALC257 Analog (hw:2,0)". That one says "analog", so it must be the microphone you plugged into your PC. The Windows equivalent is something like "Microphone (Realtek High Definition Audio)", which has a friendlier name.
what about virtual audio cable
currently im using my headphone blackwire
but for output

I'm not sure about the "virtual audio line" in Linux, as I don't see "portaudio" or something related in your screenshot.
so should it show port audio in voice changeR?
@hallow thistle Can you check your DM ?
@lavish isle Instead of hopping into my DMs, ask here. For the best working RVC Colab notebook to train a voice model, there's Applio RVC. https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb https://docs.aihub.gg/rvc/cloud/applio-colab/ The no-UI one is also possible. https://colab.research.google.com/github/IAHispano/Applio/blob/main/assets/Applio_NoUI.ipynb
anyone who helps me fix me my virtual audio cable in linux? i can pay for the help
I can't tell what you've done in W-Okada, since you've only said that you followed all the steps from the guide. I think you might need to configure PulseAudio or PipeWire for use with PortAudio, since those are the tools hinted at as a potential solution that could work similarly to Virtual Audio Cable or VB-Cable on Windows.
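A rough sketch of what that could look like, assuming `pactl` is installed (it comes with the PulseAudio client tools and also works through pipewire-pulse). The names `VoiceChangerSink` and `VoiceChangerMic` are labels I made up, and whether the voice changer actually lists the sink depends on how its PortAudio build sees your audio server, so treat this as a starting point rather than a confirmed fix.

```python
import shutil
import subprocess


def virtual_cable_commands(sink="VoiceChangerSink", mic="VoiceChangerMic"):
    """Build pactl commands for a null sink plus a mic-like source fed by it."""
    return [
        # Virtual output device: the voice changer would send audio here
        ["pactl", "load-module", "module-null-sink",
         f"sink_name={sink}",
         f"sink_properties=device.description={sink}"],
        # Expose the sink's monitor as an input device for apps like Discord
        ["pactl", "load-module", "module-remap-source",
         f"master={sink}.monitor",
         f"source_name={mic}",
         f"source_properties=device.description={mic}"],
    ]


def create_virtual_cable(sink="VoiceChangerSink", mic="VoiceChangerMic"):
    """Run the commands; raises if pactl is missing or a command fails."""
    if shutil.which("pactl") is None:
        raise RuntimeError("pactl not found; install your distro's PulseAudio client tools")
    for cmd in virtual_cable_commands(sink, mic):
        subprocess.run(cmd, check=True)
```

The idea is to pick the sink as the output in the voice changer and the remapped source as the input device in Discord. The modules are gone after the audio server restarts unless you add them to its config.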
when i change on discord to line 1 nobody hears me why??
i need help
voice changer wont even open
when i try to start it up it opens the command prompt like usual but then it says a bunch of stuff and closes itself and doesnt even start the vc
@hallow thistle
So, I've been playing games on a 3090 (primary card, for games) and using RVC (Vonovox) on a 2070 (secondary card, for the voice changer).
I've noticed that RVC, despite running on a separate GPU in the same PC, has performance issues: the sound becomes choppy when the game puts a heavy load on the GPU or CPU, even though the game runs on the 3090 and the 3090 ONLY.
Can you help me with this? Thanks.
why does it say trial in a dc call
the other person hears trial every 5-10 secs with the ai voice
either pay for it or uninstall it then install vac lite
oooh it says trial cuz its not the lite one?
You have the most outdated voice changer ever
Unless that's not the issue and I'm stupid
can u guys answer my question too while ur here 😛
^
What gpu do u have
And did u get the voice changer from a YouTube tutorial
Btw if u have Intel you're cooked
nvidia
how do i delete the previous one again 😅
ive been using it since july and its been working




