#✨│ai-help
1 messages · Page 311 of 1
Well I do want to Train My own voice
Talking seems too hard
Or use Audio
To upload in local voice changer to change the voice to appear different character
For videos
Your queries still confusing.
Paid request and commission aren't allowed in this server, so you can keep money in your wallet/bank for something better.
Ok,
Here's my query
I want RVC voice changer for locally
That has option to any upload Voice models
And ways to Train models (by myself)
And you can suggest me 2 or 3 voice changers or rvc's as well
Realtime Voice changer isn't necessary
That has Both TTs and Audio upload to convert

How am I supposed to decipher these? Do you mean something like an RVC (retrieval-based voice conversion) that could train a voice model and also do AI cover (inference) at the time? Because "Applio RVC" literally has both features I mentioned, with "TTS" a feature in Applio RVC. The retrieval-based voice conversion and realtime voice changer, while both use the initials "RVC", are not the same thing.
-rvc
When I took a break from typing for a few seconds, it doesn't always mean I'm done and giving up, and in this case, I simply didn't know what to respond.
I fixed I just had to adjust the settings a bit

Sup guys, have a good new year! I'm new on this server, thanks for having me here. My goal here is to seek learning, share knowledge in artificial intelligence, meet people, contribute to open source, and help people with problems with artificial intelligence.
I start learning Artificial intelligence. I have some knowledge in Python, databases, cloud and etc. I looking for some good recommendations and advise for study. If you ask me "which area of artificial intelligence you prefer?", I would say: AI applicability and Machine learning.
It's pleasure to be here, and nice to meet you all!
Hi, what would be the best program to emulate an anime voice?
is vonovox the better version of w-okada or and is there really a difference?
When you say default voices what do you mean
You're most likely using an old program if it came with voices pre installed
i am using Voice changer Client demo from github
Yup that's old
its hard to use i got it from a youtube video
What gpu do you have
Should switch to wokada tg fork, since u use AMD use this download
I'm assuming you have a virtual cable already from the YouTube video
VB cable
i dont think so, i have voicemeeter does it work ?
ty
It could yea
In case not tho I'll get you one
i installed the program on my hdd should i install it on my ssd ?
Hello, i have a problem with the installation of Okada in the guide, I have a message who say "pkg_ressources is deprecated as an API [...] The pkg ressources package is slated for removal as early..." I need to instal something ? I have a link with this message
Idk what that means
Please help guys, its been 2 years and i am not able to make a good RVC for my voice.
Can somebody help me to create a rvc for my voice?
In case people wonder: I am currently waiting for staff to reply to a question about the rules I had, due to namari's moderation decision. In the mean time I'll randomly answer recent help forum posts with what I know. (might as well do something useful.) -- I have not ever set up an environment where I can train models in due to lack of having an expensive GPU and I do rather do things locally, but I can answer stuff when it comes to using some of the existing AI tools. I apologize if my attempts to help are a bother and you rather have an official helper attempting to help you.
how much of an impact do transients make on your model
like for example if im training an acapella and the artist agressively compresses his vocals would it turn out way worse than lets say i had the same acapella but with no compression
i got u
Last update: August 7, 2025
this is the easiest way for beginners
kaggle is better
ye but its harder for new people to learn
does Vonovox work better than Okada
for real time v oice
also , i cant find setup.bat in the folder for Vonovox
can u hel.p plz

- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Tg Develop's W-Okada fork and Vonovox can be "competitive" because while Vonovox can give better audio quality than other W-Okada versions, Vonovox can be less familar for most people while W-Okada still remains easier to get around.
Not always. While Applio RVC can be harder to get around than those online websites that use "RVC" (Google Colab and Kaggle don't count, though they are known for running Applio RVC on cloud), Applio RVC is better at voice model audio quality where some other online websites (like Kits.ai) don't offer the same level as the former.
its just different kinds of hard drives, an HDD is a spinning disc drive and a SSD is a much faster solid state drive
HDD stands for hard disk drive, a more bulky and heavier drive that has spinny disks, storing data on each platters. SSD refers to solid state drive, this stores data similar to a hard drive but uses flash chips instead of spinny disk.
how do i fine tune the model to sound really good
its cutting out and not clear sometimes
when speakin in Vovonox
sometimes grainy
If you install an operating system (like Microsoft Windows) on SSD, it will be faster than a spinny hard drive because SSD has no moving part to pick one data, whereas hard drive has moving parts where it has to pick data from a platter.
See my message.
ive went through all the docs lmfao

I have a 4090
Windows 11
Voice sounds grainy / doesnt pronunciate things that good sometimes
When you mention "how do I finetune the model on Vonovox" after saying "does Vonovox work better than W-Okada", it felt like some steps being skipped, so I'm not sure what to answer.
well ive gotten better luck with Vonovox so im using Vonovox
I went through this guide
You can send your Vonovox screenshot to here now.
Most of the time people couldn't open those .zip.001, .zip.002 files because they either used the built-in Windows archive extractor instead of WinRAR or 7-Zip, confusing ".zip.001" file as simply ending with ".zip" because they had "Hide extensions for known file types" enabled, or didn't read the guide as reference. To much the headache, I had to provide some solutions for those paradoxical problems
this is the model im usiong
are u saying i didnt extract it right? im confused
I'm talking about Tg Develop's W-Okada fork, not Vonovox. If you have attempted to extract the Tg Develop one before, either one from my statement could be the cause.
oh no im talking about Vonovox
its sounding grainy on Vonovox im not using W-Okada anymore. Unless you think that would work better?

I'm more familar with W-Okada voice changer. For Vonovox, I never actually used it as the main voice changer, though I open the program as a visual guide/reference where words alone couldn't imagine.
Okay, ill try the fork.
given my specs, i would get the AMD64-cuda.zip.001 or .002??
Now that I open both W-Okada fork and Vonovox. On your Vonovox, what about the "Block size"?
"voice-changer-windows-amd64-cuda.zip.001" and "voice-changer-windows-amd64-cuda.zip.001" are split zips of a single zip file, so you "must" download both.
This is b2397 W-Okada.
why is one of the zips a .002 f ile
or is that normal
Ok, now that I predicted the common potential issue.
derp
With "Hide extensions for known file types" disabled, these two files should look like this.
there we go
now i extract in that folderr
Open the .zip.001 one and extract to somewhere like "C:\Users\cybor\Downloads\MMVCServerSIO".
ok got it running
my processing unit should be my gpu correct?
Yes.
ok its all setup, now how would i fine tune?
this is what my performance looks like
The term "finetuning" is commonly used in training an AI voice model (like Stable Diffusion), and it's rare for someone to take "simple settings" as "finetuning" on W-Okada.
so its hopeless to keep trying this model?
I'm trying to provide more answer, so don't hurry.
On W-Okada:
Chunk: around 40 - 60 ms
Extra: 2.7 s
GPU: NVIDIA GeForce RTX 4090
Pitch detection: rmvpe
Input: microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: your headphones/speakers, this one is optional
is virtual audio cable needed
ive just been using my mic
oh wait nvm
sorry
heres my comparisons
hello, does anyone have a guide on how to use RVC on discord mic on linux?

that's a better explanation
i like da spinny disks even tho SSDs are better
Well idk how to sound my voice good without being sound ugly yk what I mean
Does anyone have any experience with German-language voice models for inferencing? Do they get German pronunciation, vowels, consonants, etc. correct?
I suspect the answer is "yes" but I just want to discuss this a little
Hello chat.
I just wanted to ask nicley after 10 months if there is any new update or delevopment for live voice changer. I used "Modified Okada (b2332)" but i got told by AI there is something new and better named "Applio" ? That even needs newer models and can better work with emotional speaking like laughing and similar?
Searching this 2 lol
I tried models from like 1-2 years ago and they were really bad from quality.
But the model i mixed together like over 1 year ago did sounded pretty good with english and german talking.
But i think with current new builds and development there can be even better stuff? I just got back into it and want to try some new stuff out myself
Applio RVC (RVC stands for retrieval-based voice conversion) is not a realtime voice changer, though there's a realtime mode in version 3.6.0. Applio RVC is typically used for simple AI cover (inference) and even train an RVC voice model.
There's a newer, continuous W-Okada fork version developed by Tg Develop, the latest version is b2397. There's another voice changer named Vonovox, a complete alternative to other W-Okada versions. Vonovox can be competitive at audio quality but its interface and settings can be less familar.
What are you looking for? W-Okada voice changer or Applio RVC?
so Gemini again told me half baked truths?
It was like saying Applio is New and improved successor to Okada in terms of Realtime conversations....
And that even my old models would probably sound a bit better in it or i should make a new Version 3 model ?
Are this forks you named focused on real time conversation and better in quality then applio?
I not mind if they look bad from the menu or need some setup.
Important for me is quality and anything that helps with the old problem from 1 year ago with emotional talking breaking the models and similar.
I have a 4090 so i can train myself (even if i never did that before).
So what would be the best direction for me for my goal ?
What? You can't always rely on Google Gemini for anything, what Gemini says to you might be of outdated information surrounding Google search results. By the way, the latest version of RVC voice model is still RVC v2; the RVC v3 is a hypothetical one that hasn't yet been a thing.
Yes i know that many stuff is outdated or not always total correct from gemini and similar chat ai's.
Reason i got back too this chat and asking the experienced humans how much of what gemini told me is true or not.
Because the talk about applio and how gemini made it sound hyped me a lot
This is Applio RVC v.3.6.0. You can see one tab labeled "realtime" there. https://cdn.discordapp.com/attachments/1159290139609137264/1455545707204182016/image.png?ex=695dafd3&is=695c5e53&hm=533bce1d646a77e23915599c99c404eb81538a5f5d6e7ddc47093a40ce45a33f&
For actual realtime voice changer, it's best to go for W-Okada voice changer fork or Vonovox, where Applio RVC's realtime mode might not always be reliable for realtime task.
For training a voice model, Applio RVC is preferred. Other voice changers I mentioned can only do inferencing (convert your voice in realtime using RVC voice model).
-rvc
You can see https://docs.aihub.gg for more information and guide docs about related programs.
Last update: August 5, 2025
When i asked gemini about it with what you told me it really like told me the same you just did.
Interesting
Okay Applio is the model maker and newer Okada fork and Vonovox are the software to use the model.
Can i expect far better quality when i make a new model with Applio and then use one of the newer okada forks?
My model is like 1-2 years old and mixed from 2 different voices so i have something of my own that is not just someone else voice. My goal is to create something new that is my own not just simple copy pasta.
Is Applio good in mixing 2 data sets of 2 different voices? Gemini said it can do that.
Sure, Applio RVC can do "voice models mixing into one. Aside from original pretrain voice models, Applio RVC also lets you to use any custom pretrain model from third-party sources for training a model, although the voice model result is often questionable by the nature of those custom pretrains, where a few models trained with custom pretrains might work as intended while few others fail or ended up lower quality.
While I use Gemini for some parts, I wouldn't go for something like "what does Namari mean about their statement?" and then show them chatbot with some screenshots. While I'm more focus on some basic settings and inference parts that generally accessible for most people, for more information about training a voice model, you can ask some fellow members who have pink "model maker" role here, where they might know how to achieve a better working model intended.
that sound really great! And i think with my 4090 training wouldnt not need that long?
I remember 1 years ago training myself sounded hyper complex... Is it with applio now much easier and user friendly?
ohh i did some.... commissions back then when it was a "thing" on this discord too some of this Master model maker and... some did what they promised and... some not really...
But i assume since then and with how training now works i maybe better with going the "train myself " route now
Yes, NVIDIA GeForce RTX 4090 is so far above than the minimum/recommended GPU stated on official Applio RVC website, so any RVC task (even training) will be super fast than some entry-level to mid-tier ones with 8 GB VRAM. While paid request and commission were a thing here, today they are not allowed, only free ones acceptable. Some time around when paid requests weren't allowed, the model maker role channel went missing, until December 2025, the similar channel has finally returned as #📤│model-maker-role.
Well i can approve some... individuals tried to "scam" other had a lot of passion and it was fun working with them
Real time voice changer I would say so W-Okada right?
That is really great to hear. I think i will use this week and work on my nano banana projects and this applio + Voxvox software. I want too see how much i can improve my old stuff with the newer ways
W-Okada is realtime voice changer. What is your PC GPU? For better working audio quality, it has to happen with settings, the program version you use and a supposed GPU.
i think it was a good decision to remove the paid commission system. It has to much risk for this interesting community and what it works on.
I personally a still shocked how crazy fast image and video AI gets pushed by all the big companies.
But when it comes to voice.... maybe only "text to speech" is a thing but realtime voice changing with AI ???
I mean it would be a big useful thing for movie companies and similar. But the development it that area.... is idk... it feels very very slow compared to chat,video,image ai development.
Nvidia GeForce 360ti
Do you mean "NVIDIA GeForce RTX 3060 Ti"? The "NVIDIA GeForce 360 Ti" doesn't seem to exist as actual GPU. For better working W-Okada, try Tg Develop's fork. https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001 https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.002 There's the guide. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Last update: November 22, 2025
Bro how to upload download rvc in applio . I'm confusing plz help
Do you mean "to download a voice model" and put them into Applio RVC? Simply, copy a Hugging Face link from a thread from #1175430844685484042, go to Applio RVC, paste it in "model link", click "download model", go to "inference" tab, refresh model list and there you go.
"Drop files" option there is possible, but you'd have to extract files (.pth, .index) from the zip before uploading them "one by one".
Yes sorry my autocorrect did some BS
But idk if it work yk just searching for a real time voice changer without problems yk
yes i do, thats why i said “i like da spinny disks even tho SSDs are better”
hey weight
for some reason when i open teh fork it opens terminal then closes immediately
ah its because i moved it out of the server file
is there anyway i can make a shortcut on my desktop so i dont have to go into the oflder
Their name is Namari lol
@viscid moss hey bro. Can U DM me?
Hi! I have a question:
I’ve had Realtime Voice Changer for the past year or two, and I haven’t updated it yet. Could someone tell me where I can go to download the latest version? the version I'm on issss- v.1.5.3.18a [onnxgpu-cuda]
this is so old, what gpu does your pc have?
Currently NVIDIA GeForce GTX 1080
am needing to upgrade it
hm
I used to have a similar gpu so u may be able to run this but idk 
https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001
u need both
download them then extract the 001 zip, then drag the 002 zip into the folder for 001 and run mmvcserversio
Can someone help me and voice a small text in a ai voice of dottore from genshin impact? I've spent the whole day trying to figure this out... if it's not difficult for someone, please help me
thevoicemodels.com when i try to download a model it says
you must gave the @AI Caylagi role to download
how can i get the role
how do i hear myself to hear what it sounds like im new to this
For RVC voice models, see #1175430844685484042 and Weights.com.
On W-Okada voice changer, this might give you some ideas. https://cdn.discordapp.com/attachments/1159290139609137264/1456249974525526185/image.png?ex=695eee39&is=695d9cb9&hm=855eb487bc4b36c0b98020ee4b2f524a0500e4d3e0764487e0a5f04e825471fb&
how do i fix rvc voice models not working
What is your PC GPU? And did you follow any tutorial or guide before?
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
Older versions like b2332 and even v.1.5.3.18a weren't compiled for NVIDIA GeForce RTX 50 series GPU, so better try Tg Develop's W-Okada fork (b2397) or Vonovox (1.6.9) where they are compiled to work with RTX 50.
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
how do i find the donwload
This is Tg Develop voice changer. https://github.com/tg-develop/voice-changer/releases/tag/b2397
ok
On GitHub, click download both "voice-changer-windows-amd64-cuda.zip.001" and "voice-changer-windows-amd64-cuda.zip.002".
i download both
These two files have to be in the same folder. Use 7-Zip or WinRAR to open and extract .zip.001 one.
Go inside MMVCServerSIO folder, and try double click the program of the same name.
don't use og wokada, use what u have in the picture u just sent
how do i make it sound better
use your gpu instead of cpu, switch to rmvpe (not onnx) extra at 2.7 and chunk around 122.7
The original W-Okada (v.1.5.3.18a) once supported Beatrice voice model alongside RVC, while fork W-Okada (like b2397) has Beatrice support "removed", there's only RVC voice model.
i put it on gpu when i speak says this
did u set up your microphone settings and have a voice installed
you're missing the virtual cable
Did you not install Virtual Audio Cable from start?
i have it
This might give you an idea, for second time. https://cdn.discordapp.com/attachments/1159290139609137264/1456249974525526185/image.png?ex=695eee39&is=695d9cb9&hm=855eb487bc4b36c0b98020ee4b2f524a0500e4d3e0764487e0a5f04e825471fb&
For Virtual Audio Cable lite, download from this link. https://software.muzychenko.net/freeware/vac470lite.zip
Click stop server, try set Extra number to 2.7 s and sample rate to 48000.
whats the best dealy
the smallest amount
Around 110 ms might work.
different voice changer? i can only seem to find w-okadas beta from 2 years ago
aight and also the voice is stuck for me i delete the last one i had and it stays the same aft i put a new one
That one is b2397 W-Okada fork.
wokada tg fork is more simple
through the link on the github
what gpu do u have tho, there is an amd version and an nvidia version
nvidia
Which NVIDIA GeForce RTX?
exract the first one then place the second one in the folder made by it
4070

that says amd64 though
thx for the help
they're labled weird
they all say amd when they aren't for it, I had the same thought
also how do i turn this on
AMD64 is another name for x86-64 or x64, a CPU architecture used in most Intel and AMD CPUs.
it won't work unless you use client mode which just doesn't work at all anymore
yea client mode quit working since november 2025
If you set to "server" audio mode on the voice changer, these three options will gray out; if you switch to "client" audio mode, these three options will available. It's a quirk to all W-Okada versions, not just this specific version.
thx
thank you
why does ts sound so ass when i tab out 💔
On your W-Okada voice changer, here's your settings:
Chunk: around 60 ms
Extra: 2.7 s
GPU: NVIDIA GeForce RTX 4070
Pitch detecting: rmvpe
what settings do u have, and what games/software are u trying to use it with
thank you
trying out games rn, it gets a bit laggy
lower your game graphics to the lowest settings to get better performance while using it
Try increase chunk number up, until the perf number in "Performance Stats" section is green and stable.
German language progress report: I tried a voice model, Heinz Erhardt, from voice-models.com and it sounded pretty good to my German-learner ears... it seems to get the German fricatives and sibilants correct (ich, auch, acht, Schlange, etc.)
Is there any particular reason why VAC Virtual Audio Cable is recommended? There are other softwares which do the same thing. Is VAC the best? Is it the easiest to use?
just need help with voice model stuff my things are always called out for voic changer]
Is there a good help / setup guide for VAC Virtual Audio Cable?
— now, i'm not an expert, but —
you literally just install it
and you can later set it to act as your input gateway for VC inside other applications
Is there anyone who can make a voice model?
Ru
yo
wdym what
idek\
u tryna find it too?
yep
😛
whats a good chunk to use nvidia geforce rtx 5050
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
where is the download link twin
@visual nimbus @royal talon you, both of you, make sure to read help guidelines before start asking. What is y'all PC GPU?
uh how do i check
wait nvm
it says gpu 0 and gpu 1
gpu 0 is nvidia and gpu 1 is amd
nvidia geforce rtx 3060 ti-
Click on GPU 0 to reveal the full name of NVIDIA GeForce RTX/GTX on the right side.
nvidia geforce gtx 1650
Try Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
Download links: https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001 and https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.002
Last update: November 22, 2025
See #✨│ai-help message.
whats a good chunk to use nvidia geforce rtx 5050 cuz the chunks i been useing sound super ass
erm oki
is this only the audio cable?
i do this one?
do i click both links?
For downloads, download both voice-changer-windows-amd64-cuda.zip.001 and voice-changer-windows-amd64-cuda.zip.002. The docs.aihub.gg one is a guide doc about W-Okada.
This link https://software.muzychenko.net/freeware/vac470lite.zip is the download for Virtual Audio Cable.
im confused because when i download voice-changer-windows-amd64-cuda.zip.001 it doesn't download in a zip
.zip.001 and .zip.002 are split parts of a single zip file. https://cdn.discordapp.com/attachments/1159290139609137264/1457951470505955562/image.png?ex=695f301d&is=695dde9d&hm=0785a4a4002ce97446aaf0a4f148530595ac06f52649c6b2fb2e7abc7cd4e9e7& If you have "Hide extensions for known file types" enabled, the .zip.001 one would simply show just .zip, making things confusing.
Use WinRAR or 7-Zip to open the .zip.001.
o okay
There's no exact best working "chunk (or chunk size)" value in W-Okada voice changer. The chunk value can be of any number, as long your GPU can handle, see #1458165876263223457 message for a bit more information.
What is your PC GPU? And did you follow an tutorial or guide before?
now it wont even let me download it....... um
Check if your internet is fast enough and any antivirus program that might still scanning your PC.
yes i followed a guide but my gpu is 4060 ti and cpu is 12400F i5
Follow this #✨│ai-help message.
ok
nope its just not working it ways couldn't download - download error
:I
ummm
oh now its working
.
nvmmm
You can send a screenshot to here now, otherwise I won't be able to identify the issue from words alone.
sorry nvm its now working

is this what it is supposed to look like?
ok thanks i’ll try
The usual behavior would look something like in my screenshot, one folder of full size and two batch files all intact. If you see one folder of halfed size and one batch file there, it can indicate that the program might load only .zip.001 while the other part (.zip.002) missing.
so i get .zip.001 and .zip.002?
Another potential cause is that your WinRAR is under "evaluation copy (unlicensed)" mode. If .zip.001 and .zip.002 are in different folders you should put them in the same download folder. If you didn't download .zip.002 you must download it, then follow the earlier step.
oki
so like this?
This is good now.
then what do i do?

"Extracting a zip file" is one of the most basics anyone can do. In WinRAR, click "Extract To", set "destination path" to somewhere like "C:\Users\your username\Downloads\MMVCServerSIO" or "D:\MMVCServerSIO" and then click "ok".
Click on the first button.
ok
okay so i did it but it made a windows sound but nothing popped up @hallow thistle
how do ik if it extracted
?
Double click on "MMVCServerSIO" program again and try screenshot it.
That's inside "MMVCServerSIO" folder within the zip file.
If you extracted the files to a folder outside the zip, it should look like this.
i dont see anything out of it
hi so im on linux but i wanted a voice changer but also my hardware sucks so maybe idk an online voice changer(free since im broke)

What is your PC GPU? If there any.
rx 570
is it this ?
Yes.
While "AMD Radeon" driver is known to work well with Linux distros, some have observed, this "AMD Radeon RX 570" might likely struggle with W-Okada voice changer even on Linux. For a cloud service, see https://docs.aihub.gg/realtime-voice-changer/cloud/tg-develops-w-okada-fork-kaggle/.
Last update: September 6, 2025
Double click the program.
soo it will js be the same process as windows?
is this what i am supposed to see
This is Tg Develop's W-Okada (b2397). It's completely normal for this specific version to look like this.
oki
if I wanted to get rid of my current installation of RVC, and get a newer more updated one, how would I do that?
pretty sure my current app is very outdated, and quite broken.
now how do i put a model in?
Ehh, kinda, but some additional steps are required for Linux, like installing portaudio in your terminal for a "virtual cable" to work on Linux similar to "Virtual Audio Cable or VB-Cable" in Microsoft Windows.
ye i saw that
ty
You're welcome. 
W-Okada voice changer or Applio RVC? Generally, completely delete the older folder while extract the newer one (from zip) into the same folder is the only acceptable workflow steps. When you extract the newer one overlapping the older one, there's a less chance the program still works although mostly fail to even run.
i dont have any yet where do i download them? because i look in the voice models but i never see the download links...
it was W-Okada, does deleting the folder it was initially extracted into delete it fully?
For how to download a voice model from #1175430844685484042, go to #1175430844685484042, go to a post/thread there, spot a link (usually being Hugging Face) and then click it to download a file. https://cdn.discordapp.com/attachments/1159290139609137264/1456902989422923959/image.png?ex=695f5424&is=695e02a4&hm=76558fc8b12998acbc1077d022ef01918c2eaf5625dbcfd46b22be643506f541&
can some go in dms with me and tell me if my voice sounds real (ai) ill send audio clip thanks
If you completely delete (without going into Recycle Bin) the whole MMVCServerSIO folder, all files should gone with no trace.
No. 
i need one opinion plsss
If you go into one's direct message, you're potentially limiting those knowledge for just me and you. In #✨│ai-help is better for such query.
can i post a audio clip of me there tho
im so confused none of them have a download option......
oop i meant to reply to this
Read again. If there's no "download" button alongside a URL link, it doesn't mean you can't download a file.
If you see this message, click "Continue to download".
yea i dont see any of that all i see is a link to weights.com?
You can send your file here until your username turns blue.
hi
i ran the third cell in kaggle voicechanger and its stuck on /kaggle/working/MMVServerSIO for like 10 minutes is that normal or do i procced to the next step or what
i cant send pics so i cant send the exact thing lol
or nvm im js dumb it doesnt show more than that i needed to procced to the next step
hi
my voice changer isnt working for some reason like its not changing
i dont have image perms so i cant send images
wait wdym
"If you go into one's direct message, you're potentially limiting those knowledge for just me and you" then this doesnt matter... bruh
oo ur smart
there are no models in list, but they downloaded as i check in the path
wait for the admins im too dumb for this
.
im pretty sure i did everything correctly
god why is linux so complicated it worked perfectly fine on windows
its with kaggle btw not local
now i cant even start i get this when i start
Error
An error occurred during voice conversion. Check command line window for more details.
idk where im even supposed to check cuz im on cloud
changing from tesla p100 to cpu worked
but now i cant hear my voice
On Kaggle, use "T4 x 2" instead of P100.
What is your PC GPU? You seem to be using the older W-Okada voice changer version.
ill try it
Talk a bit more, so when your username turns blue here you'll be able to attach a file (like an image).
ohh ok thanks
can u pls give me the permission tho
I'm a Helper, not a moderator, so I can't give you a permission.
ok allg thx
wait no way
finally
Your username now turned blue. 
lol
4060
Try follow this #✨│ai-help message message.
@hallow thistle
Referring to: #1458165876263223457 message
Because his windows username corrupted
and because he claimed that more problems were happening on his system.
I mean his appdata local folder has changed location on the disk outside of his control
His disk is broken
both you ans the moderator have been unhelpful; you made the voice client work for one day, maybe two or three if lucky, but his pc is dying in all likelihood. First you need to get the foundation in proper order (his pc) after that you can work in getting the voice changer to work. Ignoring the foundation will screw things up.
Like what is this comment anyway? #1458165876263223457 message
the OS doesn't matter, it's his username and other issues.
Still not done trying to get me demoted yet? The original poster got helped as intended, so it's just over for you now. 
can i get help pls
The v.1.5.3.18a and b2397 are two different versions. Check again if you have downloaded Tg Develop's W-Okada fork (b2397) instead of the v.1.5.3.18a one.
Where the heck did you get that idea? I only question the the way you seem to react and your actions, and the rules.
I asked the staff to glance at the situation and was waiting for a reply using the ticket system, that's it
i downloaded the right one the v.1.5.3.18a one
I didn't tell you to download and use v.1.5.3.18a. I told you to download https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001 and https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.002.
o shit my bad gng
I asked for clarity specifically. I wondered what I was and wasn't allowed to do or post; If you have seen my ticket as a staff member, then you should know.
You're just doing your job, I assume, so it really isn't about you.
yall i keep getting an error but its pretty long so idk if i should sent it here(yes very dumb lol)
ye i cant even send it even if i want lmao
When you cry about what I did one more time, for real though, you would never be a good "Helper" as me, and the moderation might be considered for your actions, since past few days. 
chatgpt fixed it lmao
OpenAI ChatGPT can be useful for few parts, in this case it's a bit better.
uh so the sound is still not working even with t4
maybe im testing it wrong?
i prolly am tbf
also why am i still pink i have almost 200 messages bro
Try talk a bit until your usename turns blue.
uh ive been talking for a while
well
do u know the solution to my problem
also i can send imgurs
or whatever they r called
im pretty sure i set voice cable wrong cuz its not in my microphone list
On your voice changer, these settings might work:
Chunk: around 64.0 ms
Extra: 2.7 s
GPU: T4
Pitch extraction: rmvpe
Input: microphone
Output: a virtual audio line in Linux
Monitor: your speakers/headphones
i dont have a virtual audio line even tho i downloaded one in the tut
from the guide i meant
i js have 2 microphones(a monitor and a normal one idk why i had them since i installed linux) and my speaker
ohh monitor of is the virtual one
im so dumb
how do i even test it cuz its not working when i record using serverio analyzer(idk what that is so maybe im wrong)
This thread https://discord.com/channels/1159260121998827560/1442151712390512701 might be your answer on how to achieve that virtual audio line. Because while some people here use Linux distros for specific tasks, it's rare for some to actually get W-Okada voice changer to work on Linux like in Windows, where it would be a new knowledge for everyone.
ty ill try it out
this did smthn cuz now firefox keeps crashing lmao
i love when my voice changer ruins my browser and audio
i deleted the conf file and now i have the same issue but working browser yay
what do i do here
If you have extracted the MMVCServerSIO onto your desktop folder, better don't do that. Instead, try extract to somewhere like "D:\MMVCServerSIO".
Also make sure your NVIDIA GeForce driver stays up to date.
Can anyone help with perfecting for w okada voice changer
Gain in and out value
I use E girl rvc model. think thats best english voice model
what model is used for this ? https://www.tiktok.com/@peterpat15/video/7580899860405095710?is_from_webapp=1&sender_device=pc
can some one help me get the voice working in discord because its not
i extracted the zip001 with the zip002 in the same folder, that worked, i tried opening the application inside but it just closes immediately, what do i do?
What is your PC GPU? Are you following Tg Develop W-Okada fork guide?
What is your PC GPU? And did you follow any tutorial or guide before? Make sure to read help guidelines before start asking.
nvidia rtx 3050, tried a few guides and tried some stuff in this channel when i ran into problems. im welcome to start fresh though because i dont really know what im doing lol
In my screenshot, this Tg Develop's W-Okada fork (b2397). https://cdn.discordapp.com/attachments/1159290139609137264/1457950706500763809/image.png?ex=695f2f67&is=695ddde7&hm=62f9be1ebec429bbe87fecf29cc942ea136b815f2b569be2a934f53cdfe5accd& If you're referring to this one, you're looking for right one. Review your steps: both voice-changer-windows-amd64-cuda.zip.001 and voice-changer-windows-amd64-cuda.zip.002 have to be in the same folder, use WinRAR or 7-Zip to open the .zip.001 one, try extract files to somewhere like "D:\MMVCServerSIO", and if you extracted the program files on your desktop, better don't do that.
i extracted it to the same folder as the 001 + 002 was in, is that not ok then
what about it?
Go inside MMVCServerSIO folder, try double click on MMVCServerSIO.exe.
yes thats what i said originally, when i open it, it just closes
The program won't open has to happen before when you extract the files. An incomplete zip file, where inside the zip file, there's one folder of halfed size and a single batch file present, generally indicates a bad sign where .zip.002 might be missing. Meanwhile, the complete one (where .zip.002 is present alongside .zip.001) has a folder of full size and two batch files present there.
yes i have the version with both
What do you mean by "both"?
i have 2 batch files present
Try open MMVCServerSIO.exe within CMD, and copy the error message from terminal if there any.
For much better identifying the issue, talk a bit in here until your username turns blue so you would be able to send your screenshot and any attachment in here.
no theres no error
using vonovox , my delay starts off at like 700ms then builds up to 1200ms over time, anyone know how i can keep it at 700ms constantly?
whenever i try it it does this
Guys, if it's written in the guide epoch 300, what does that mean? and other stuff
(pitch extraction: rmvpe
steps: 21k
batch size: 6
pretrain: legacy core v1.5 / 40k)
any idea why it doesn't work? I used old okada before and these input and output worked fine for me.
(I use online method)
This is not a guide, this is what it was trained on
The information is for viewing the models settings used when it was trained
i got vonovox, extracted it, ran the setup.bat but now when i do the start.bat it just says: C:\Users\fart\Downloads\Vonovox169>runtime\python.exe launcher.py
Initializing CPU limits: 4 cores, 4 threads [ID:8259]
PyTorch thread limits set to 4
CPU affinity set to 4 cores (0-3) out of 16 available (total/4 = 4)
C:\Users\fart\Downloads\Vonovox169>pause
Press any key to continue . . .
"\fart" 😭 💔
what is the best free tool that i can use to faceswap a video
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE
For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
the second guide would be perfect for you then
Wokada Tg-Develop Fork?
yes
what server settings would be best for a 4070?
My voice doesnt go through even tho i Selected the right input the Vol indicator is at 0 all the time
is it on client mode or server mode?
Client mode
Ill let you know if it worked
[Voice Changer] Pipeline is not initialized.
[Voice Changer] Waiting generate pipeline...
[Voice Changer] ex: could not broadcast input array from shape (1024,8) into shape (32768,8)
[Voice Changer] Pipeline is not initialized.
[Voice Changer] Waiting generate pipeline...
[Voice Changer] ex: could not broadcast input array from shape (1024,8) into shape (32768,8)
Im getting this error code
after switching to server u set your mic settings back up right?
I guess so, gonna be honest i dont really know what im doing but it should be right i followed a tut
Rx 570
damn
hahahaha its good tho i can play everything i want to
So u think its bc of the gpu?
most likely
Is there any good alternative Voice changer, im lowkey desperate for that free content
<@&1159293204038955078> can you guys give me line in software download link
do u just need the virtual audio cable or
yea that thing
1 sec
thank you soo much my lovely friend
not any good free ones, I can give u the one I use but I doubt it'll work with ur gpu
the one I use is free tho, it's wokada tg fork
Imma try it out thanks tho!
yo im using vonovox i can hear my model fine when im using headphones but when i put it on cable audio my voice also overlaps
fixed after a restart, i had passthrough on
FP16+TF32
What are the training benefits of this?
@tame oracle
hi
can u help me with something
when i open the realtime voice changer
it says to wait for web server perma
it worked just fine some time ago but now it wont open
I'm not the person you pinged but I am somewhat good at helping with the voice changer, what gpu do you have?
thank you
And did you get the voice changer off a YouTube tutorial
ye
It's outdated then
:(
Pretty much anything they show off on YouTube isn't the good stuff that's up to date code wise
I'll get the link for the one I use
Also do you have a virtual audio cable like VB cable?
yeye
thank you very much friend
You're welcome! To run it just start mmvcserversio.exe
Btw make sure to switch to server mode because client mode hasn't worked since November 2025
okey

my apologies
was not at my pc
glad you got help tho
no problems cutie
Upload a Voice Model can no longer be used in Weights.com?
wdym?
Did you get it working or need anymore help? :3
Why can't you upload trained models to the page anymore on Weights.com?
I'm still kinda confused, do you mean models trained on appio?
im trying to figure it out
how to be as smooth as possible
Weights removed the ability to upload your own voice model, as of yesterday
fr?
greedy fuckin company
guys now what to do
@hallow thistle
BROOO MY TWIN MY GLORIOUS KING HELP ME
TWIN
i have to go to school by 8:30 am say fast!!!
its 7:44am now!!
That's the "file explorer" instance coming from Applio RVC notebook on Kaggle, it's not the actual Applio RVC interface in this case.
im dumb
how to open rvc interface
mine looks like this
?
read the chats
Check on your Kaggle terminal, there's a URL link if you run the last code cell; if you use ngrok, there's ngrok; if you use "Gradio + Localtunned" there's Gradio.
Yes, it's in the "Applio Public URL" part, not a local IP address consisting of numbers.
That's Tensorboard, still not actual Applio RVC.
wah😭💔✌
how to get applio RVC now
twinn

now clicking applio public url got me here
Twin are you einstein
are you tesla?
you solved it
💔✌😭
@hallow thistle gng ngl u got a L here✌ still thanks for ur help
This is getting too confusing, because all links in your terminal are the same one. If you replace if Tunnel = "Ngrok" to Tunnel = "Gradio + LocalTunnel", mine would look like this on Kaggle.
nvm its solved
Sorry, I'm too slow to even type one message.
its okkk its okk twin we are not perfect
the modern day einstein
💔✌🥀
how to add voice models
help me twins
Legend is that @hallow thistle is still typing
u doing covers?
Recently, there's a user here who trying to help people of his own will and getting me demoted from as a staff member because I clarified to him once to not enter an old thread that has long solved, if you see me no longer have Helper and staff role, it can mean the judgement has been decided I was wrong on that one.
idk just 4 fun
bruh wthhh
its not that deep bruhh
now help me add voice models twin
To this day, that one person has been spamming in Modmail account talking about my actions as if did something too far terrorism to this server, as well as trying to drag another mod as if neither staff member available at the time.
See #✨│ai-help message.
u adding them to applio?
ya ya
yup thankss
Could someone please do a vocal swap for me? I'm having a little trouble as I haven't worked with RVC since 2024.
I need help transforming the vocals from the song "Fate of Ophelia" by Taylor Swift into the vocals of Tobias Forge (Papa Emeritus) for a musical project. Any kind soul to help me?
Sorry for my bad English.
What is your PC GPU? There's Applio RVC.
it doesnt take my gpu power right it uses cloud right of kaggle?
damn its fast afff😭🙏🙏 when I used to use rvc ui locally mine used to take 2 minutes for 40sec
Anything on Kaggle always use resources from Kaggle and related cloud servers.
RX 5600 XT (6GB)
its solved thanks
DAMMNNNN ts took 10 sec to convert so fast!!! i sang ts btw
Applio RVC can be quirky when you put an output path to a non-existent directory (like /results/Me Too male part Yurizono Seia.wav rather than /asset/audios/Me Too male part Yurizono Seia.wav), where the Applio RVC might show successful message on frontend (GUI part), it doesn't actually create anything aside from throwing up an error in backend (terminal part). 
is ts good or not twin😭✌💔
applio giving me trust issues now
I'm at work, so I don't have any headphones to listen. 

do u have other good song samples to test ur model
ts not my model
just doing inference
o
did i sing good😭✌
that was u?
It didn't artifact much so I'd say yea
I graduated already
whenever i try it it does this
If you don't mind answering, what should I get then? If I have a Nvidia Rtx graphics card?
Resberry pie
Which NVIDIA GeForce RTX?
Rtx 4060
Tg Develop's W-Okada fork. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/ https://github.com/tg-develop/voice-changer/releases/tag/b2397
@pearl anvil you'll need these two zips for wokada tg fork
https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001
Crazy
Did you follow any tutorial or guide before? And what is your PC GPU?
heyyy help me gng
since u are not a model maker
tell someone to help me make a model
help me
i did everything correctly,and when i try in discord it doesnt get my voice,nothing comes out
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 4060 8gb vram desktop) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message.
To maintain a legal, safe & ethical community, we will NOT provide help for:
- ANY illegal activities.
- NSFW/Porn.
Requests for these topics may be ignored, not helped and result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
- Don't Ask To Ask.
guys im training a model
From what I understand from the guide, I should prioritize using the pth closest to the top and with the most steps, right?
I'm looking at it through TensorBoard; I can't send image
“Guys, how do I train it? I need help
-kaggle
Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification
by IAHispano
Kaggle
by Hina
Kaggle
by Hina & Deiteris
Kaggle
by Eddy, ArisDev & Nick088
Kaggle
by Eddy
Kaggle
by Shirou & ArisDev
Kaggle
by Shirou
Kaggle
My laptop gpu is NVIDIA RTX 1050 windows 11. I am trying to use the ai voice changer wokada but whenever I use it when in game or record it lags or glitches, sometimes voice is not picked up by the changer. Some people have told me about upgrading my system if that’s the case, can someone suggest me a budget cheap build that can run the ai voice changer better than my laptop or maybe can I send the options that I was looking at and let me know if that would work. I don’t want to upgrade and then find out that it’s not better than my old laptop.
I don’t know much English. If possible, could you please give me a specific video tutorial on training RVCI don’t know much English. If possible, could you please give me a specific video tutorial on training RVC
What is currently best in terms of quality realtime voice changer?
Any recommendations for what I should use for RVC interference if I have an AMD GPU (RX 7900 GRE) and I'm on Linux or an Apple Silicon Mac?
Anything local with balanced buffer works best
so what would it be?
Nope
Just test each one until it sounds good to you
With a sample, tensorboard is pretty much useless to look at for training voice models
Vonovox, but that's Nvidia only
And for both Nvidia and amd there's wokada tg fork
do you have a link for it?
I will only be sending a link for one of them if I know what gpu u have
Since I don't wanna send something that won't work
Soo what gpu does your PC have
Sry it's rtx 3090
ur good, what are you looking to do with the voice changer btw :p
Probably gonna just use it from time to time when hanging out with my friends for shits and giggles
so vonovox is the current best for nvidia, would you like that one?
if ye then u need this and this
https://github.com/dr87/Vonovox/archive/refs/tags/v1.6.9.zip
@warped wave
not currently no, if you'd want to just record it using obs or smth tho that could work
What is the second link for?
If it's for virtual canble then I have it installed
It's a virtual cable, vac lite
can sm1 help me, when i run "start_http.bat" it closes
Any kind soul ?
hey david could u help rq?
what you're using is old what gpu do u have
uhh
an rtx 3060 may be able to do what u want properly
NVIDA GeForce RTX 3050 Ti Laptop
do u have a newer one i can use??
U can try wokada tg fork, idk how well laptops will run this stuff
oh ok do u have the link?
u need these twohttps://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-cuda.zip.001
extract the first one then put the second one in the folder for the first one
ok ok
bc the second one can't be extracted but u still need it
if u want
alright
ones its done can i dm u and show u? like a video
its my first time using rvc so i dont really know what to do
I guess? I can't call tho
okay ill send u a video dw
ok i got them all downloaded
check dms
So is that the only thing that matters ? The graphics card ? What about the intel core ? Like i5 or i7? Is there any other component i should consider for the upgrade for my purposes ?
intel cannot run ai like at all
it's bad
only amd and nvidia can run realtime stuff like that
anything that's like equivilent to an rtx 30 series and up for amd is recommended
so for nvidia 30 series and up
10 series CAN work but not well
yes and i got a gtx 1650
pls help my ai is not opening
very helpful
can anyone help me? anyone
hey broo
help me
should we use UVR compulsory??
@hallow thistle
you know twin?
what?
UVR
I use the google colab of uvr
whatever but since i got the song now how to mix it and make a dataset
bro you here🙏😭✌
Is this a virus
what
in what
Intel Arc Axxx/Bxxx (dedicated GPU) can, while other integrated Intel GPUs don't. Intel Core/Core Ultra are the CPU.
Newer Intel Core Ultra CPUs, while offer integrated Arc GPU for most models, they generally can run smaller AI tasks (like some camera background removal) and won't be the same level as dedicated Arc. There's an Intel NPU as well but it's primary for offloading some tasks from CPU and GPU (like noise suppression from microphone audio) and still not optimized for high-computing AI task like a voice changer either way.
hmm ok, ty for the info
hi, how can i get this to run on my gpu instead of cpu ? i have an amd rx 6700 xt
W-Okada voice changer (MMVCServerSIO) or Applio RVC (retrieval-based voice conversion)? Generally, both programs are free and open-source, use RVC voice models, though they are of different purposes.
Try Tg Develop's W-Okada fork, especially its DirectML variant. https://github.com/tg-develop/voice-changer/releases/download/b2397/voice-changer-windows-amd64-dml.zip https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
i can use it in game ?
Yes, given input/output audio settings look like this in your game.
can u help me download and help setting up vrc??
Are you following Tg Develop's W-Okada fork? On this GitHub repository https://github.com/tg-develop/voice-changer/releases/tag/b2397, there are two files named voice-changer-windows-amd64-cuda.zip.001 and voice-changer-windows-amd64-cuda.zip.002 there, click both links to download them.
can we go to dms and u ca help me there??
Check the voice changer guide for more information. https://docs.aihub.gg/realtime-voice-changer/local/tg-develops-w-okada-fork/
These two files have to be in the same download folder Use WinRAR or 7-Zip to open .zip.001 one.
No, stay here. Going into one's direct message generally limits those knowledge only just for you and me, and if it's nothing personal I wouldn't do that. 
w-okada has been having a TON of errors and getting worse lately. i have NO IDEA HOW, considering it's all LOCAL, i do not understand how anything could have changed, but it observably has. does anyone know if there's a reason why?
You have been asking for W-Okada voice changer before. Is b2397 W-Okada still not working or something?
@hallow thistle their downloaded what do i do now??
Use WinRAR or 7-Zip to open .zip.001 one.
openwith option right?
well client mode stopped working out of the blue last time i asked, (which still BAFFLES me because nothing in my files changed), but having downloaded a new version of both forks, the server version is now stuttering immensely, and seems to be acting like there's a harsh noise cancellation function cutting off my sentences. last time you helpfully advised me to make sure the sample rate was good, which helped at the time, but now it's getting worse regardless, the performance is worse as well, and it feels to me like the quality is just degrading due to some factor i cannot comprehend.
Either double click or right click menu will work.
ok im on the thing
what do i do now
Is your PC GPU still AMD Radeon RX 7800 XT? On your W-Okada, try these settings:
Chunk: around 100 ms
Extra: 2.7 s
GPU: your GPU
Pitch extraction: rmvpe_onnx
@hallow thistle
these are and always have been my settings. like, i've been using this software with ease for months without changing anything. no new model, no new hardware, no new software, things just started not working. i'd love to find a solution but i still have to wonder WHAT happened, why, how.
Calm down, and don't be hurry. You can send your screenshot to here to make sure anything there is alright.
sorry, didn't mean to come off as upset at you or anything, i'm calm, it's just frustrating and confusing you know? just wanted to explain that i am doing everything pretty by the books here.
Extract files to somewhere like "D:\MMVCServerSIO". https://cdn.discordapp.com/attachments/1159290139609137264/1458384995956818122/image.png?ex=69616c9e&is=69601b1e&hm=fdb4027ce1e885a726458bbf157fb69d2de4f5b38380a3d06d8e057020b9381d&
i'll let you help the newbie get set up, i can troubleshoot this on my own fairly well, i just was wondering if you knew how a software could be altered to function differently without me downloading anything new. like, how did client mode break? i don't comprehend it.
okk im extracting it noww
no explanation for client mode
its done extracting and i extracted it to the example
what do I do noww
so everyone is confused? alright, at least i'm not alone lol
Go inside MMVCServerSIO folder, there's a program of the same there, double click the program to launch the voice changer.
okay
This is not Windows Explorer.
oh
The actual folder should look like this.
Wait for a bit.
Chunk: around 120 ms
Extra: 2.7 s
GPU: NVIDIA GeForce RTX 3050 Ti Mobile
Pitch extraction: rmvpe
Input: microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: your speakers/headphones, this one is optional.
Yo what’s the current best fully offline, local voice model trainer? I wanna make a very high quality model but i dont want the audio to be leaked since it’s all unreleased stuff.
Applio RVC.
is there a tutorial for it?
-rvc
Check out AI Hub docs and Applio official website.
ok im done now how do i put models in
ok thanj you
oh ok wait
look
You're welcome. 
i dont have line 1 VIrtual audio cable
and i use voicemeeter output
how can i
uh
Either Voicemeeter or VB-Cable might work, though I often use "Virtual Audio Cable lite" because it simply offers one single virtual line for input/output at the time. Voicemeeter/VB-Cable seem to offer many virtual lines than necessary.
This is the download link for Virtual Audio Cabe. https://software.muzychenko.net/freeware/vac470lite.zip
just to be clear none of the files that i use in the local version of applio have any way of being found online right?
I don't know, there are many instances of Voicemeeter's virtual lines in your screenshot which is confusing, so I'd not using it.
What does this mean if online cloud (like Google Colab) doesn't use repositories from GitHub? Both locally and online ones generally use roughly identical or same code repos from GitHub, but designed to work on different environments.
@hallow thistle
Extract these files into "C:\Users\your username\Downloads\vac470lite".
ok done
If you mean something like this: when you run Applio locally, user data (like voice models and dataset audio files) would still stay in that folder, your files won't go or being shared anywhere outside your PC. Same goes to online cloud services but the data there might be removed all after the session ended.
i extracted it
Go inside "vac470lite" folder, double click on "setup64.exe" to install Virtual Audio Cable lite.
do i have to restart?
Generally, I am not supposed to tell every step to members even if the guide docs exist, although I still do it anyway if the poster asked for it.
Right click on your speakers/headphones, set it to as "default device". Same goes to "Recording" tab with microphone.
No, I mean the physical device, not a virtual cable.
do i set this
on default?
@hallow thistle
you there
uh @hallow thistle
@hallow thistle hellooo
how do i fix "restart session?'
wait i figured it out but when i talk it keeps glitching and it doesn't sound like the voice i want
and i cant hear anything when i go to youtube
it keeps cutting out
Traceback (most recent call last):
File "F:\2\Applio-3.6.0\app.py", line 6, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'
help me

Try remove your current Applio RVC folder, re-download the zip from this https://huggingface.co/IAHispano/Applio/blob/main/Compiled/Windows/ApplioV3.6.0.zip link, and re-extract again.
You're welcome. 
I’ve downloaded it, but I don’t know how to install and configure it yet, because it’s quite different from the videos I’ve watched
is there a guide or tips for noise reduction in .wav files?
I have Audacity and I know how to use its noise reduction but I wonder if there are other recommended methods?
-rvc
See AI Hub docs and Applio website for guide docs.
Important question - does RVC have to work harder to do inferencing on FLAC files, instead of .WAV files?
can't we use rvc on roblox?
heyy bro
Do i need headphones for it to work
How can I fix my voice when it keeps cutting out
What software are you using for PTH with Index? Can you share your settings? My voice keeps cutting out
i mean dude u can just put vac lite on voicemeeter
that's what i did :p
See #📤│model-maker-role.
Without a headphone or speaker, you won't be hearing anything from a desktop PC, unless it's a laptop with integrated speakers. 
What is your PC GPU? W-Okada realtime voice changer or Applio RVC? And did you follow any tutorial or guide before?
how to upload a model to hugginface
so people can download
You didn't register a Hugging Face account? You should've done that sooner.
i already have an account but tell me how
to add models so people can download
On Hugging Face.
@hallow thistle
select model?
or data
which one?
@hallow thistle
bruhh✌😭
don't ghost me bruhh
Generally, I have no idea what to put in "License" part, so please stop asking me further. 
ok but you know how to upload a model right??
or no
bro giving help suggestion to helper✌
just openrail
Ty for the information my laptop is Windows 11 Lenovo
What is your laptop GPU?
How do I check for that
Open Task Managaer, go to Performance tab, check if there any GPU 0 or GPU 1.
Hi I keep having a problem with the Voice changer I seem to have fix it then it comes back.
What happens is it change my voice fine but then I get a echo of the voice again the same as if I play sounds on the PC it will hear that changed too.
I have bluetooth headphones so it shouldn't be speaker echo and I have tried turning the monitor sound off.
Has anyone had anything like this before I don’t know what is going wrong. Thanks
can someone help I dont even know how to adjust it and when i try adjusting it t js sounds fake 😭
Ew
Choose something better
Can't believe I tried to help you
I'm very disappointed
is there an rvc realtime voice changer app i can use for free?
nvm!
-# i actually downloaded it just for your cyn voice lol hi
lol haiii
I have a secret one I haven't posted that has no effects, you'd need to add an autotune effect and pitch it up some with effects tho in a seperate program like fl studio :3
I use it personally as it's better than the current public ones
cool!
ye it's fun :3
hi, is the guide about the deiteris fork still up to date?
https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/
tbh wokada tg fork is a better choice now, it is better from my experience switching from deiteris to it
ValueError: could not broadcast input array from shape (4468,2) into shape (5056,2)
How to fix
oh okay thanks, I'll definitely try that one out



