#✨│ai-help
1 messages · Page 281 of 1
hi so i wanted to get into ai art creation but idk were to start (i mean what program to use)
I'm training a voice model rn I just need to know which pretrained model I should use
Is there a place where I can hear what the pretrained models sound like?
idk anything but il throw it in here: cant u just put the pre model into W-okade or another rvc
just use the default or legacy core
beatsforge is for drums
Can I import the voice model I want into Voice Changer?
I'm referring to the ones that are in the models channel here on the server
yup
Oki doki
unzip the model and make a new folder to put all of the ones u download in, makes it simpler to keep track of everyone of them
The default? In Applio on the training tab it doesn't have the pre-trained model drop down. So in the guide it says to instead download a pre-trained model there
Oh you mean the pre-trained in the gdocs file gotcha
that's weird
just train it without choosing a custom one
uncheck this for original pretrain
My data set isn't that long wouldn't it have quality issues?
Applio
Local yeah
ok yea just do batch 2 for that since small dataset
yea
Do I just send it like this then?
The ai docs says 15 if you're new, and total epochs "go for an arbitrarily large value like 1000"
Or for "saving every epoch" do I set it at 1
Do I change anything else?
anything past like 700 is overtrained as hell
even like 700 is probably overtrained
5 is better tbh unless u got a whole lot of room on ur pc for every single epoch
since in the end you'll delete them all besides the one that turns out the best
So save every 5 and change total to 700?
And how about any of the other check marks?
nah u can keep it 1000 bc you won't need it to go that high anyways
just have it high so u don't have it too low and miss the peak of your model
all seems good
but yes every 5
save every 5
Okay, already changed it. Batch size 2, save every 5, total 1000. I'll go and generate index then
good to go, make the index then hit train
if u need any samples to test for singing or talking just lemme know I got a bunch
Okay, thank you! I'll let you know how it turns out!
still kinda low on talking ones especially for female models
can anyone help me
that sucks
yh
if yes u downloaded something ancient and super outdated
so uh what gpu do u have bc u need a new voice changer
oh ; (
nvidia, amd or intel
i waited 1 hr bec of internet
i tried both whichever worked i used
intel
I know you're a skeleton but use your brain
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
download that third one
read the guide and download vac lite
and the voice changer, gotta scroll down a bit for the download link to it
thrid one?
"Wokada Deiteris Fork
Most suggested WebUI with the best general support for many platforms. GUIDE"
that one ^
thanks
whcih one btw
which wha
for what
u don't need an extra browser just download it 😭https://github.com/deiteris/voice-changer/releases/download/b2332/voice-changer-windows-amd64-dml.zip
here
thanks gn
So it finished suspiciously fast so I looked into the cmd...
yikes
probably glitched
not even a 30 second dataset finishes in 6 seconds
So idk why it's looking through the 5.7 when I have the 6.2 installed
Also that yeah
goodnight skeleton man
Do I just relaunch the thing?
maybe? idk how the local one works as I use cloud
Okay, I'll try this again
I'd maybe ask Noobies as they're pretty smart with this coding stuff unlike me
On second thought... I'll just make a thread. That'll be safer
u still here
yup
any updated guides on GPT-SoVITS? or, like, something better for TTS?
u know how to fix pipeline is not inzitialzed
you';re trying to train on iGPU
you need to change the visible device in .bat
u know how to fix pipeline is not inzalited
I'm having issues with Kaggle Applio not working, I get a connection error every time I click anything what is this
select a voice model
check the kaggle log
if you can access the UI, there should be no connection error
so likely the process did stop while you're stumbling around your UI
this issue followed me even after deleting an account and making a new one
either kaggle hates me or there's an issue with the current applio
i did
show a screenshot
wher would that be?
kaggle screen with the cell running (spinner spinning) and the log underneath
ah
that looks old
u gave me that
seems to be working fine now, I'll update u if it disconnects again
Whats the best gain tune index and everything if u have a deep voice and ur gonna troll ur friend with a girl voice
How exactly do you change the name of the uploaded models in Vonovox? There's no "ok" button and closing it doesnt save
that's not allowed here, get outta here weirdo, read the rules next time
change the file name on your pc
hey sorry im abit lost but how'd i create a tts ai speaker with a custom voice model in the thread?
if you mean rvc models, it can only be used on existing input voice audio or the one from a TTS output
alternatively, you can try Chatterbox, a zero shot TTS, with input text and the voice's reference audio #📰│dev-updates message
Ok thanks, does voice reference significantly change how the audio will sound like?
Do I end it here?
your model blew up
So... Bad?
check what you got < 20k steps
Wait how do I do that?
So do I just restart applio and not end training?
ctrl-c in the terminal window, then start it again
your trained models should be in the logs/model_name folder
I think I did this wrong, it just closed the window and error
I got the trained models... Do i just pick one in random and that's <20k and see how it sounds?
the output should sound like the voice in reference audio
So okay, I got my model how do I test it in tts?
rvc only needs input audio, so you would consider using output from any TTS
Which tts do you recommend? I tried checking out local fish audio but either I'm blind or I can't find the directions on installation
not random, try latest and go back
Which tts do you recommend?
that depends on the language
voice cloning?
Can't I use the rvc model for realtime tts? Or do I have to do the cloning thing those tts programs provided?
sry but does applio work just fine? i have trouble installing chatterbox
Does anyone have any mobile apps or websites for uvr5? (Ultimate vocal remover v5) i usually use hugginface space for uvr4 but i want to extract with v5 and i cant find anything
there’s no mobile app since phones aren’t super powerful, but you can try other cloud methods https://docs.aihub.gg/rvc/resources/dataset-isolation/#vocal-isolation--cleaning
Last update: August 18, 2025
you could also suggest @viscid moss to add the model to his ZeroGPU hf space if it doesn’t have it
does this server have LLM loras?
its like you train a hugging face language model to have a personality of a character, its much more effective than promoting and R.A.G
is that the site where i can download models from? https://rvc-models.com/
Is there any way to reduce cutting out while using the voice changer in w-okada deiteris?
we aren’t affiliated with them, u can use #1175430844685484042 or https://weights.com/models
installing chatterbox is a rite of passage
use python 3.11 or 3.12
based on the version there are slightly different steps required
Wym with uvr4 and v5?
🤔
BRO CHILLL like almost everyone in this server use the ai voice changer to troll there friends
what voice changer is best rn? the og one or the fork?
where do i download the program that let's you change your voice, the github and website is too complicated?
Voicemod → Easiest, free (with some paid voices), just install and go. Works well with Discord, games, etc.
voicemod.net
If you just want something that works immediately, then go with voicemod
so i can use the models on voice mod?
Not really
Voicemod doesn’t let you load or train custom AI models like RVC
this tg okada fork zip 2nd part is corrupted 
how do you know?
because i just downloaded it?
Ew
Shush e-girl thing
It's not meant for that specifically, if u wanna be a cool character like Goku or whatever that's fine
Creep
Voice keep doing weird stuff like alot of mini freeze
Rx 6800 and mid cpu like not bad
Ping gets very high (20K ms) after 2 min of talking .Any idea
Probably bad settings, what voice changer are you using and what's your gpu
Question, did you download it off a YouTube video
If yes it's super outdated and old
Like a year old
Is that Nvidia, Amd, or Intel
That's for training models
ooh
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
Best one with amd support ?
U should download wokada deiteris
The tg wokada also supports amd but I have no experience at all with it
So I've got no idea what's different about it
Yup
Download links are there once you scroll a bit, also download vac lite if you haven't already, it's at the beginning of the guide
Okay let me see
Hum
Cant find VAC
lite
Oh wait
i already have it
Oh
Warned as high danger malware and has been deleted
when i tried to open smth
I think i just extracted the wrong way
mmmhh
weirdd.
I want to try it on TS but idk how to. It wont work
Team speak ?
yes
Burh
the voice changing thing
I am playing gta rp and the server uses it
they dont use it. they use a plugin for TS. It works very well
nah
Do you have VCB thing
yes
yes
And it dont work ?
Did you turn it on ?
xdd
@viral mason Less freeze but MS is still very very high
Like rn im at 8K ms
How do I find generic RVC models?
yo anybody know a good way to convert an image and an audio file to a video of the person talking? best if locally, and applicable to virtual characters (example: https://files.catbox.moe/hrud1f.png)
What does this mean
Are you trying to be an e-girl or something 🙁
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
is there better alternative to vac?
Im using w-okada for voice change and I it was good for me when used on pc for vrcahat with less then 1 sec delay. now when I use vr head set I have like 8 sec delay what can I do to fix it?
Most likely, your gpu has to work harder for vr mode, so less computing power can be spared for your vcc. U can try reducing graphics on ur vr headset, tho with 8s, u probably need a gpu upgrade to get it to acceptable levels
can somebody help me with my problem really quick
why does the voice changer sound all glitchy and stuttering
I was having that issue too
It fixed itself tho
yeah, i just clicked again and it was working normally hehe
Yeah ive been uzing zerogpt on hugginface the one that site links to, but it doesnt go to uvr5, only uvr4
On the hugginface space, there is no “vocals fv5 by gabox” it only has fv1 - fv4
Wait lemme check
!howtoask
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.

What is your PC GPU? Did you follow any tutorial video before? And which W-Okada version are you using?
Oh, ya u are right. Model is already released on UVR5 UI but my pull request has not being merged
Meanwhile u can use it on Colab/Kaggle/Lightning.ai or locally
till it gets merged
VB-Cable is another virtual audio line program, but I don't recommend using it. Virtual Audio Cable lite is the free version of Virtual Audio Cable.
Can i do this online and on mobile?
Both, but for a better experience/performance PC
Alr thx ill try it out
Ur welcome, lemme note ur user
I'll ping u when done (On HF)
Preciate it
There's no known generic voice models here. What you see in #1175430844685484042 are mostly of famous people and fictional character voices, usually fan made.
I want to admit something, i have 0 idea how to use SD at all, i think i do but i actually don't, as prompting the way i did with NovelAI is not yielding similar result, i realize that maybe the models don't go well with prose / natural language and prefer tag-styled prompts, and i have been following trends on 4chan and civitai without knowing much, which was a big mistake on mmy part especially when i chose to use SwarmUI because it's new to me.
I set up old SD before and i think reason i mainly avoided using it for long is because of the gen speed, but with SwarmUI, i was able to gen faster than before even on 1280x832 res, but i still get bad results..
Also it doesn't help that youtube guides i feel like expect you to know more about it and often feel like a technical overview of the software instead of being helpful for user newbies.
I wanted to ai gen anime/furry art in western art style but they don't go very well in end result unlike novelai, i tried artist LORAs and Mol Keun model from civitai and i got very horrendous results that i don't think inpainting/segmenting will fix.
- Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
- Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
Tell your:
- Full GPU Name: (e.g.,
NVIDIA RTX 3060) - Operating System: (e.g.,
Windows 11) - Detailed Description: What were you trying to do and what went wrong?
- Tutorial Used: Link to the guide you were following.
- Screenshot: A picture of the full error message is very helpful.
To maintain a lega, safe & ethical community, we will NOT provide help for:
- (E girl, as an example) catfishing/trolling, scamming, impersonation.
- NSFW/Porn.
- Any illegal activities.
Requests for these topics will be ignored and may result in moderation action.
- Be Polite & Patient: Our helpers are volunteers. You may ping the
Helpersrole once. - English Only: Please keep all conversations in English.
I have rtx 4080 I followed a youtube guide that was really good. And I can't check right now the version but I download the latest about 2 weeks ago
GPU: RTX 4050 6GB
Operating system: Windows 11
|I have been trying to use the Deiteris' W Okada Fork real time voice changer following this doc: https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#help
The issue i am running into is that it doesn't seem to either pick up any audio from my mic OR is unable to convert it OR isn't able to generate any output. There can be even more possibilities but i can't 100% confirm any. Though if it helps, I have tried three different mics which all worked on their own but didn't work when i ran them through the realtime voicechanger. I used both VB-audiocable and the VAC provided in the doc itself while making sure the other one is deleted and restarting my device before and after the un/installation.
I did get this in the cmd which i'm not aware of what it means due to the lack of my knowledge and vocab in this area. Hope it's helpful:
2025-09-17 18:04:24.9061699 [W:onnxruntime:, transformer_memcpy.cc:74 onnxruntime::MemcpyTransformer::ApplyImpl] 2 Memcpy nodes are added to the graph main_graph for CUDAExecutionProvider. It might have negative impact on performance (including unable to run CUDA graph). Set session_options.log_severity_level=1 to see the detail logs before this message.
If anyone here knows anything about this, i'll be grateful. (sorry if the message is too long, i tried to make it well-documented so it won't be a hard time to understand my issue. If needed more info, i will provide.)
videos could have editing magic, if the reality isn't that good, it's not good
I'd recommend one of the local solutions here #1393389200862089240 message
voice changer is realtime, for recording you'd want the best quality as possible which could be achieved using the normal non-realtime RVC on pre-recorded audio, i.e. https://docs.aihub.gg/rvc/local/applio/
Last update: August 9, 2025
if you're using RTX 5000-series, you should try either of these:
Last update: September 6, 2025
Last update: September 6, 2025
Its a RTX 4050 6GB
I have downloaded the correct one from the doc as well
so I should use a fork of WOkada?
the guide would explain it
ok tnx I'll try when I get home
is there any error message when you seek through the terminal?
the one that i provided was the only one, rest seems to be normal logs
2025-09-17 18:04:24.9061699 [W:onnxruntime:, transformer_memcpy.cc:74 onnxruntime::MemcpyTransformer::ApplyImpl] 2 Memcpy nodes are added to the graph main_graph for CUDAExecutionProvider. It might have negative impact on performance (including unable to run CUDA graph). Set session_options.log_severity_level=1 to see the detail logs before this message.
i've checked my volume and things as well and they seem fine since the mic works without any problem if used directly
now check the voice changer settings, esp the audio input/output devices
to show the screenshot here, you need to first ask me for image perms
I do have an ss saved for this, can i have image perms to post it here?
!give-media-perms 20m @old hawk
yep
did everything the doc said, change my playback and recording to the default ones and restart my device
how is the perf meter and have you tried adjusting the volume & try few other models?
I did try to tweak with the volume things but still had 0 result, as for perf meter im not sure whats that
the perf meter is located at just left side of the model pic, it should show how much ms and dB
if it outputs something it should show above -90 dB
It does move but it has no effect whatsoever to any sound, like it moves in the same pattern indefinetly
it stays at -90db
for diagnostic purpose, first try stop the voice changer, then click passthru button & change it to red, and it should output your unprocessed voice from the mic
if it seems okay, then you can change the passthru back to green
oh damn, i will try that right now
nope, still no output
with passthru
I have allowed the site my mic permissions and have checked my mic permissions in windows settings as well
perhaps problem with the speaker?
what if you try passing the virtual cable output and do mic test in discord?
that is what i did actually
i tried both monitoring and passing it through discord
while monitoring it from discord settings
all you did with VAC instead of vb-cable?
lemme try that
is this correct
if it may error out, try SR 48k
then if it sounds too noisy, you'd probably need some external noise suppression
it seems to work now, thanks a lot!
i had spent like all my day today trying to look for the issue
thank you again
np that's good
could someone help me out?
i'd like to make good ai gens image but prompting doesn't work out and not sure if it's the models, lora, etc
different model types require different prompting methods
guys, can i ask if i need transcript to train a model ?
gpt-sovits?
Codename's fork
you don't seem to understand what rvc model architecture is supposed to be, please go read the docs https://docs.aihub.gg/essentials/whats-rvc/
Last update: August 9, 2025
I got the other fork to work like the guide shows, but is there a recommend settings I should use like how many chunks and which F0 detector? also what else can I do for less delay when in VR without VR I have almost no delay with good quality. ty
Fork just means Modified Version in the open source tech field
If you're talking about Codename's RVC Fork 4/3, it's not in the ai hub docs, its **experimental **and really not suggested especially to people who aren't developers, as pointed out before in previous cases by other people like Lyery (which seems to be codename's friend) & Noobies (An Applio Developer): #🔥│model-maker-chat message #🔥│model-maker-chat message
Fork means Modified Version in the open source tech field
Could you please elaborate your PC GPU, Operating System and the tutorial/download link you're using?
I'm guessing you're talking about an ai realtime voice changer (be aware that this is a general server about ai, so i'm just guessing), be sure to play on lowest graphics 1080p 60fps cap
oh, okay
Yeah that's the reason it's not in the ai hub docs, you seem to be confusing about the way RVC models are trained, you don't need to train them on text, RVC are Speech-To-Speech models, checking more the Applio and AI Hub Docs would help you understand better :)
If you were also thinking of a transcript you have to read out loud to train a model off your voice, there isn't a standard one that is universally better
I hope you understand, and for any issues let us know here :D
i mean, i used to train my model without transcript for ages, but i found out my friend had transcript in his data folder so i just curious
xd i just want to improve my model quality
I used what the mod told me from the guide they linked and like I said I have 4080 win 11
Ohh, I see it now, you were looking for a transcript to read out loud to train the model lol, the reason why the other helper asking if it was a GPT-SoVITS model is because transcripts are used in TTS training lol
I mean anyone can just make their own little text to read, there is no universally better transcript to read to make an RVC model of your voice better tho
There are multiple versions and programs for ai realtime voice changer for calls/games, could you please link the exact one you're using currently?
https://github.com/deiteris/voice-changer/releases this got the windows one
any machine voice will fork up my model, i heard this from a wise man with a panda pfp
but thank you for your time tho
I've said before you'd want gpt-sovits to utilize the transcript, but it is TTS, which is far different from rvc

btw it is not really a transcript if without proper timestamp
that's wokada deiteris fork, are you sure you want to use that one? the last update was december 7th 2024, vonovox would give better performance on your nvidia windows
it has timestamp. just i wonder i can reduce the misspelling issue
and improve accent

you don't quite get the point
I'll try that also just downloaded what I was told.
any machine voice will fork up my model
I wasn't talking about training your model with a robotic tts, I said that the other helper might have confused your request with a TTS Model (GPT-SoVITS) training request, since this is a General AI Server, not an RVC Server anymore
It was a miscommunication on both ends, but simply to put it: You don't need to read a specific transcript to increase your RVC model quality :)
i heard this from a wise man with a panda pfp
I'm not sure who you're talking about, but I'm guessing the ex staffer Razer, idk if he's also your friend with the transcript that you were talking about since I don't remember him saying that, but usually we never suggested a specific transcript that has some words to increase quality, I hope you get what I mean
btw rvc actually uses an embedder model which could affect the pronunciation. currently by default it is contentvec, and another best option so far is spin (v2). you can try it in the latest applio.
note that the model inference should be done using the same embedder model as the one used in training the model, otherwise the output voice might sound gibberish.
no, he doesn't tell me about using transcript to improve
he just told me artificial voice will make my model sound worse
i tried spin v2 but it doesn't have any pretrain that can handle Vietnamese. only v1 that have Legacy-core pretrain
using an output of TTS with voice cloning to produce more audio of the target voice
that generates artificial audio
you may train RVC model on that, but results wont be natural / bad
that
i mean if you have no other choices that's the only way
i got expressive data like suggestion. but when i got the data from my friend, it contains transcript so i got confused. that's about it
Ohh I see
About the transcript thingy, nope there's no specific standard one that will improve your RVC Model training
but what is the different between training via applio and fork
If you need urgent help, please checkout our AI Hub Docs or ask for help here following the [Guidelines](#1402790586028789830 message)
no different right ?
you might be interested on finetuning the spin model that might involve dataset with transcript (cmiiw), but might need around several hours length, you can go ask @simple ore for further explanation
I'm guessing with "fork" you mean again Codename's Fork 4 or 3
Codename's fork got more experimental options, if you mess up with them especially if you aren't a developer, you could blow up your model just like it happened previously to other users (this was also apart of a convo I shared the link to you previously)
there are versions of hifigan that use PPG, but it is done automatically using Whisper ASR
interesting
only TTS/ASR models are trained on audio+transcript
since i trained 3 in 32k but i haven't run into errors, or i might have ran into but i don't know if it's an error
ah got it,
i think it's the case, then i will drop the transcript aside

ASR training takes noisy speech with music/sound effects or other stuff so it can learn to extract the actual speech
TTS are training on clean audio so it could reproduce the speech in a required voice
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
the 1st link is vonovox
the 2nd link is wokada tg-develop fork
the 3rd link is wokada deiteris fork
RVC uses speech features extracted from audio, so it does not really care about what's being said or language
vonovox is much more suggested for your windows nvidia setup, because it's in active development unlike the other 2 i mentioned
I mean it's your choice what you want to use, I'm just making you aware, if you'd rather to still use wokada deiteris fork I can help you with any of the 3 programs
oh but i'm curious if RVC learn about speech feature but it failed to pronounce certain words
is it because the data don't contain that certain spelling ?
for the non-english language ffs?
or any specific example, like being unable to pronounce strong "R"?
ye, non-english language
idk how to give example since it's my language 
"you're having pain but can't tell where the pain point is?"
i mean i know but i can't explain, reeeeeeeeeee. but i can say i do sound like American try to speak Vietnamese
yo what up anybody know a good "base" TTS model to generate the initial TTS to then convert to RVC? Needs to support german, for english ive already found kokoro which does all i want.
how i find here russian voice model
yea it's the accent but you were talking about mispronunciation yet couldn't mention some specific example words/phonemes
ah, take this word as example "ngủ"
ủa. ả. these has the ? on their head
pretty much sound pretty wrong
like they try to add the r to the end of the word

sorry I don't know vietnamese and I thought it sounds rather complicated
ah ye, no, it just sounds incorrect, that's all

but spin somehow reduce the problem
Does anyone here used Twilio before?
how to upload image
You need to be level 5 or request mods for temperory perms
Search with keyword "Russian" in #1175430844685484042 or use Weights bot in #🔍│find-models.
Go to ai hub
My bad this is ai hub I thought I was in the Vonovox chat, I'm too sleepy for this
You don't need to force yourself to do like me. 
yea only you vietnamese know how to pronounce
Hello everyone. I'm here with a request for help with a RVC error that pops up in CMD. I've trained a test model (no problem), but can't even test it due to weights unpickler error and I tried a few solutions from forum and chatGPT (mainly to explain basic concepts and such).
GPU: GTX 1650 4GB VRAM
The RVC I used: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
I run this in mini conda
Other things I have:
- Python 3.10
- pytorch 2.7.1 with GPU support (I had version 2.8.0 CPU)
and somehow managed to fit the requrirements of all the other packages.
This is the error I encountered (also no .index file gets generated and I assume it should be generated after I click on Train Feature Index). Nothing happens when I click it.:
CMD output on pastebin: https://pastebin.com/k3L1a2KE
they need to fix their damn code on here because since this update this keeps happening
nothing even shows up here to show what went wrong is just gives you the middle finger and stays silent
That's mainline/original RVC, tbh if you want it easier, self contained and with more often updates, I'd suggest the RVC Applio Fork tbh
Original/Mainline's last update was 10 months ago currently
Last update: August 9, 2025
Thank you. I've already download Applio after reading through the first few docs pages. I've dealt with such problems before with ComfyUI nodes, but RVC is something new for me.
mainline works without errors up to pytorch 2.5
after 2.6 those errors can be fixed by editing some files
Also, is there a general consensus on Pytorch version and such? Oh, rihgt, I saw downgrading to pytorch 2.5 works, but I didn't want to do that due to the requirements because I didn't want to start another environment and do everything anew.
applio uses 2.7.1
that's perfect, also, may I ask if .index file is a requirement for a model or not? I get conflicting opinions when I search
is a requirement if you want to apply to the model maker role
for casual usage not
thats where the accent of the speaker is stored
when I used the mainline it didn't generate the index file and the train index option didn't work as I mentioned, so I assume it's just broken
besides that old version of torch, mainline also needs a specific old gradio version to work fine and a old matplotlib as well
this
So the file you sent has all the stable requirements? Thank you.
yup
it's been a horror for me
I'll be dreaming of banbilions of requirements to make VRC work.
yeah mainline havent got a real update since 2023 due to the author giving up on the project
applio is literally the same as mainline, just with updated packages
but i always recommend mainline because it's the original thing
Thanks for help. I'll get back to trying and reading the resources for Applio. I'll have to get the basic understanding of what all the packages do.
Mainline didn't work for me, I guess.
no need to install them manually
applio has an installer that does that automatically*
uggh, that's the idea I always got. With ComfyUI update it needed a git so I was installing and trying to troubleshoot manually. AI is just a tool, but yeah, thanks. Otherwise I'd still be lost.

did Nick get into murder drones recently or is he changing for halloween
Hi What page for create a song with IA artist
go to https://discord.com/channels/1159260121998827560/1175430844685484042 and look up caseoh
No, I'm a man.
I just want a generic voice to use, not specific to any character
I guess just look around but all voices are based one someonehttps://discord.com/channels/1159260121998827560/1175430844685484042
You can still get a voice actor that is not famous and has a pretty generic voice
Hi, back again. I've got a quick question. Where are weights actually saved? I can't see it in the docs or am I just blind? If Applio is the same branch, then the model should be loaded from assets/weights folder (the voice model) same for .index file
under logs/modelname
yup, found it, but in a much later chapter of the guide
so that someone could make me an RVC model, because I can't do it myself
give me time
tyy
do it yourself it's not hard 
how to do that ?
what gpu do u have, Nvidia, AMD, Intel
Nvidia
cool! there' three options u could download but if u want the best quality I'd use vonovox it's the first option
-rt
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
i will install it
pls wait for me it will take 15 minutes at least.
im new to this
that's fine, I expect it to take time because there's some stuff to read and download yknow
the easiest way would be through kaggle
https://dashboard.ngrok.com
https://www.kaggle.com
I'm making a video on how to do it right now
nvm can't make the video it literally doesn't work rn I'm gonna jump off a building
second part
we talked a bit in #💻|programming (not about helping but you get it) lol, I hope it's all fine now
Oh right halloween will come soon
I got into murder drones since recently
ye it's cool
I didn't think about this for halloween lol
some days before halloween hazbin hotel season 2 drops tho
I heard about that yea
just felt like changing a bit since some thought i was a bot and they say it was also bc of the "by Weights" 😭
I may make a model of Alastor but not gonna post it because I don't wanna get shot in an alleyway for it
I think I got it. Yeah, at least I got Applio working. 1 epoch took me 20 min.

Are you asking about websites to create images?
u know anything about this btw Nick
you can ask that in #1159289738314919936 if you want someone else to voluntarily do it for you for free
usually I just teach someone how to do it, bc I ain't making models for ppl
It would be nice if it could be done because it seems complicated to me
I showed you a video, it's easy 😭
@viral mason I deleted the video of your issue because you leaked your Ngrok Token 😭
Have you tried using using Applio's Dataset creator instead in Kaggle? Instead of doing that whole thing uploading manually the dataset
You can make a post in #1159289738314919936 , but be aware that this is done by few people who do it for free (paid comms aren't allowed) voluntary, so no one is forced to make the rvc model for you if you get what I mean
No need to say that, we have the AI Hub Docs and helpers to help you in the help channels
Last update: August 5, 2025
No, because I make my datasets myself and don't want them altered more than they already become from post normalization
I haven't ever used that option before and don't know what it does or how to use it
Also oops
About the token thing
It doesn't alterate it at all, it just automates the uploading dataset process and makes it easier:
- You check the option
- You just upload the dataset audio file
- You give it a name
- It automatically uploads it and uses it
So it's just an easier way to upload datasets correctly,
unfortunately without giving your datasets any parasites /j :D
@viral mason
i did it
is this saying cuda not avaieble ok?
is it ok to say this or just visual
like vonovox will work fine
nope, it wont use your GPU, which is very bad
you said you got an Nvidia GPU, whats its full name?
Hm weird
Oh, I'll try this then
Hopefully that'll work, I'll try it in an hour or so
are you sure? you checked in task manager?
You can check your pc gpu on Windows via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
yes im sure
i checked
bro i bough the pc myself
Do you have the nvidia app? Please check for any windows or gpu drivers updates
please check for windows or gpu drivers update and let us know
Hmm
That's not task manager
i checked
task manager aswell
i use desktop
not laptop
so i can see my gpu aswell
Great!
Can you delete the runtime folder
and re-run the setup.bat
be sure it's in a folder without spaces or weird characters
and to not run it as admin
Btw did I download vac lite as well, it was in the guide
how is it running
in another program
i closed it
it was pythomn
python
my bad.
Lol
do you also have https://aka.ms/vs/16/release/vc_redist.x64.exe ?
But is it the same version or an older one?
go into windows apps, check for Visual C++ 2015-2022, uninstall it, try to install the one I gave
oh ok
done
after you did that, delete the vonovox folder, redownload and re-run setup.bat
Uhhh
u sure its not a false detection
ok how i setup
vonovox then
to see if it work good
im very new to ts
so its confusing
nvm
ts dont work
it says
no gpu avaible
crazy
Set your mic settings as so
Input: your headset or headphone mic
Output: vac lite
it wont even detect
Shit
btw have you checked if you have a pytorch version 2.7.1 and have GPU available next to it?
I'd highly recommend joining the Vonovox discord server since you shouldn't be having this issue, the creator can see if he can figure the issue out with you himself
I can't send the link here because this server has a thing to avoid promoting stuff
in the environment where your python.exe is installed type python -c "import torch; print(torch.__version__)"
if you have e.g. pytorch2.8.0+CPU then it'll only ever run CPU and always write GPU not available or something similar
okat
nope i didnt check but it shouldve installed by itself when i did setup.exe
I think this is alright too, it recognizes my GPU. but if it write CPU next to it, then you've got a CPU version of pytorch installed
oh
so do this in the environment where the python is installed and where you run your instance of that program
He's using Vonovox, not applio or mainline/original RVC
do you perhaps have any anti viruses? you should maybe try reinstalling with them off or with an exception
nope i disabled all
I thought it's normal creating an environment. either you use a ddefault one, or create one or use mini conda or something siilar to manage environments, at least that's the hell I went through
How does the folder structure look like? Could you send me a link for Vonovox?
this should be it I guess https://github.com/dr87/Vonovox
yup that should be it
it setups python in the runtime folder
@paper flare what if you go in the runtime folder, write cmd at the top of the file explorer path, then type python -c "import torch; print(torch.__version__)"
one second
im confused
so what i do?
open "runtime" folder
on the top path bar, click cmd, then write the command I told you
I opened, but i dont understand what is path bar
nvm i did it
@low shard
it says cpu
@stiff idol you were right
mystery
how i solve it
if you have applio v3.5.0 installed already you can try realtime there
whats that.
nvm
how did you install it?
vonovox
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 10.5M 100 10.5M 0 0 18.3M 0 --:--:-- --:--:-- --:--:-- 18.3M
Extracting
Creating directories
Installing pip...
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 2098k 100 2098k 0 0 16.4M 0 --:--:-- --:--:-- --:--:-- 16.6M
Collecting pip
Using cached pip-25.2-py3-none-any.whl.metadata (4.7 kB)
Using cached pip-25.2-py3-none-any.whl (1.8 MB)
Installing collected packages: pip
WARNING: The scripts pip.exe, pip3.12.exe and pip3.exe are installed in 'X:\vonovox\Vonovox-1.6.9\runtime\Scripts' which is not on PATH.
Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.
Successfully installed pip-25.2
Installing Packages
Looking in indexes: https://download.pytorch.org/whl/cu128
Collecting torch
Downloading https://download.pytorch.org/whl/cu128/torch-2.8.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (29 kB)
Collecting torchvision
Downloading https://download.pytorch.org/whl/cu128/torchvision-0.23.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (6.3 kB)
Collecting torchaudio
Downloading https://download.pytorch.org/whl/cu128/torchaudio-2.8.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (7.4 kB)```
yes i did setup.bat
idk
cu128
it looks like the right cu128 version
want me to screenshare
for better assist
hold on
if you got to windows explorer and paste %APPDATA% into the path, it should open Appdata/Roaming
is there python folder?
like that, press enter, is there Python311 or Python312 folder?
okay, give me a sec
oh ok
sort by name lol
The ai voice fusion on Applio collab doesn't seem to work
okay, lets try this
delete runtime folder
open cmd.exe in Vonovox path
where setup.exe is, then type setup.bat >log.txt and press enter
one sec
let it install
done i deleted
how do i open cmd in
vonovox path
i only see powershell when i reclick
with windows explorer in that folder, type cmd.exe in the path
oh ok
i cant run vonovox setup it's getting blocked by smart app control 😭
let it run
sorry for bieng little slow
i cant find this
@low shard how do I kill myself
i did it
don't 😭
@simple ore is Applio Kaggle having issues for training for everyone?
just wait until it is done
sorry to disturb via ping, I see you're busy rn too
I dont think so?
o
damn
@shy spruce
yes
i type this in the cmd
somehow it fails to install main requirements
yea his windows installation does not want to correctly install pip packages. it was giving him cpu torch even though he specified cu118
but then the next one (pytorch ringformer?) installs 2.8.0 cpu
his windows has something weird going on
scroll up
well it finished but it still gave error
ok sec
i can try to package up the full installation including the runtime
I think it is the issue on downloading the runtime?
so it then tries to use system python and fails
it keeps installing different cpu versions of torch, even when manually using cu118
give me a bit and I will package the whole thing including the runtime and upload it to my HF
attach the log.txt
wdym
log.txt did create a file in setup folder
The "connection errored out" happens when you click anything on the ngrok webui, right?
yea almost any time I click anything there
gimmie 2 mins to test on my end
I tried switching diff tokens and same issue
@paper flare are you using stock windows from microsoft? or a 3rd party download / modified windows
ive noticed this issue so much with people who have modified versions of windows
@simple ore I can confirm that whatever you click on the ngrok tunnel of the Applio Kaggle, gives you an error, without logs about why this happens 😭
what do you see in kaggle?
cell still running?
log showing anything weird?
I'm guessing you meant to reply to me lol
um
no i cracked windows cuz didnt want to pay for activation
the cell is still running

get.activated.win does not break python
bassically force activation for windows
nope, the log doesn't change at all
maybe try set PYTHONNOUSERSITE=1
if its the big one everyone uses from github, it wont break it
but some people download an "optimized windows"
before running setup.bat
nope didnt do that
at all
i didnt do these debloat optimized windows
stuff
i'm uploading a full version with the runtime, give me a little
thank you so much
send me ngrok token to use 🙂
I do remember seeing a convo in vonovox's server about 3rd party windows versions causing issues like cuda not being available and some curl weird certificate stuff, this user also seems to be on normal windows and having certificate issues too iirc #1417388806507597854 message
yea, but my install is standard, its using embedded python directly from python with normal pip
im not sure why some users get this issue
i think the best i can do is host the full package with the runtime
Yeah, it seems to be some random issue, pretty weird
https://dashboard.ngrok.com u can make an account here :3
Welcome to ngrok! Please log in.
yea ive seen it only happen to 2-3 times. but im doing everything by the book so theres not much I can do to solve it
the best thing would be hosting the full package option on HF
this will prob take you more time to do releases if you're planning to do that each update, but great 🔥
i'm too lazy to make accounts for things I'm not gonna use
yea thats fine, on releases I can just upload the full package option. I'll prob just write a script to do it
with hf cli
ill wait you 😄
yea lets say like 30 mins and I got it for you , my upload is not the best no fiber 
i have no fiber aswell i feel you 
how did it use 120 requests in a minute if i barely clicked something
I've only clicked settings and tried to save precision
I was checking the network tab, and yeah same issue
if i hard refresh the app after like a minute of getting connection errored out, it seems to work all fine
@viral mason can u try this too
no, same error immediately
what does hard refresh mean
on windows ctrl+shift+r, it doesn't use cache
what page tho
I think adding 'realtime' ui broke gradio lol
the ngrok link
weird
176 .js files
oh, makes sense then 😭
@simple ore @viral mason this fixed it for me:
- open the link and click something to get the error
- hard refresh instantly after getting the error
- wait 1 minute exactly, then hard refresh again
how
should i also uncheck the disable cache or just 3.4.0?
just replace the 5 with 4?
3.5.0 -> 3.4.0 here
ew light mode
first try enabling cache
see if that helps
if not try 3.4.0 and see if the number of requests in ngrok log is less
wtf it's broken even if i use 3.4.0
hard reset method it is then
idk if it breaks anything, i just tested basic check options
I guess potentially the UI for kaggle/ngrok can be cut down with only required sections visible
that perhaps may lower the number of requests
hows it goin
guessing about 5 more mins
yipee
sorry to ask again, hows it going {: since its been 5 minutes
yea im just guessing the progress bar is not correct, it looks ~80%
ill ping you as soon as its ready
its in my screen so i wont forget lol
tyyy
yipee
im installing
just download and run, dont install setup since this has the entire runtime already
running setup again might mess up your runtime
okay
ill delete old one then
lets gooo
YES
i will just upload full versions with my releases
YEEE
w
what is
cpu affinity btw
it just uses less cores because a few versions ago we figured out using 100% of the cores causes lag for no performance gain
oh ok\
why does it say extension of file should be following pth onnx
Any way that i can get a voicechanger?? 🥹
@shy spruce
it crashes afterwards
oh shit i set quality very low lemme rec again
Guides for Programs that use RVC Models in Realtime for Calls/Games
A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE
A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE
Most suggested WebUI with the best general support for many platforms. GUIDE
For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons
These options are not recommended for use.
Not suggested, older versions in youtube tuts are even way worse. GUIDE
The program is worse compared to the ones above, and much less updated. GUIDE
you sure that's your real mic? it doesn't have a real label just the default driver name and its at 96000kz
yea but im a little busy this sec
uh
look
there is no other
inputs
look
thats my mic
which mic is this thats its 96000hz? can you turn it to 48000?
make sure you have https://aka.ms/vs/16/release/vc_redist.x64.exe installed
O
WAIT
I FOUND ISSUE
LOOK
when i try run it as admin
look what it says
system cannot find path
you shouldnt be running it as admin, it looks for a different python installation when you do
make sure you have that microsoft package
i got it
already
here
see
idk why vonovox dont work for me

maybe try a non onedrive folder?
how
put your damn vonovox installation to somewhere else like D:\
its not onedrive tho
not in the desktop folder
didnt fix the issue
and you didn't show the error messages (and run the program within cmd)
none told me
to do that
:{
relax
idfk how to run it with cmd
im not tech guy like yall
good excuse

