#✨│ai-help
1 messages · Page 250 of 1
Yep that tutorial uses an over year old version of original wokada, and the settings he gave were completely fucked
Uninstall vb audio cable
Delete that old original wokada
Forget everything you get from those videos
Those are outdated info since a long time
never trust YouTube tuts for rvc and wokada
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
read the 1st link, wokada deiteris fork has way better quality and performance
Forget everything you got off that YouTube tutorial, it was just wasted time
Through applio yes, but it won't be expressive
Because it uses Microsoft edge tts
It has good quality and multi lingual but it's plain for reading text
So what do you suggest me to do if i want to clone a voice and train it?
If you ONLY want an expressive tts, don't use RVC, use eleven labs or fish speech or f5
RVC is not meant for expressive TTS at all
ok thank you
Yw
@lusty nest any updates?
following the guide right now, all is good so far
alr lmk for any issues
everything is sorted, thanks for the help!
on wokada, you can optionally use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: May 5, 2025
do you may want me to check your settings?
Nono it's okay, I got the delay down to almost nothing and the stability is really good
alright, have a nice day
Can anyone tell me how I can use this exact same AI model that this website is using, offline?
There is some text with two links:
This generator was made using the text-to-image-plugin.
The brain button feature uses the ai-text-plugin.
The bold part is the link and for the "brain button" it says it's a plugin that uses stable diffusion, so if I get stable diffusion offline, woudl that work the exact same way as this website?
it uses stable diffusion https://github.com/CompVis/stable-diffusion
this is outdated asf
dont even bother using that model locally
use flux.1-dev or sd3.5 instead
Which UVR model reduces the audio quality least? To prevent it from being lossy and turning into 4 bit lmao
Appreciate the tip!! Thanks!
my friends are saying that my voice is crackin up in discord
these are my settings
cant upload
any fixes please?
why did you set extra to 5?
setting extra to anything over 2.7 can cause cutoff issues
for better voice quality
extra slightly controls voice quality, if a model sucks, it sucks
you should put it to 2.7, else you risk cracks and cutoff issues
put it to 2.7, check if it worked, if it didnt, check https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#discord-crackle-fix
alr done ...anything else that seems wrong?
nope, if the issue still persists you can trey the link i sent you
both of em are off already
anything else?
does the issue still persist?
yea sadly
Last update: May 5, 2025
already tried this
can you try other models too?
Last update: May 5, 2025
tried like 3 of em ..can you suggest the best ones ? for like egirl voices kind of like E-woman
gave you the link above
E-woman is wild 😭
why are people focused so much on e girl voices btw
in what way?
im saying that im shocked on why many want e girl voices lol
just want to troll my friends ..my normal voice is just well less pitched you can say for a female
anyways, is the cracking still happening?
even with the suggested models?
downloaded that one...3 files init ....pth,json,index
which one should I upload
in rvc context:
- pth files: contain the voice
- added index files: contain the accent
- metadata.sjon file: it's just some extra info about the model download link if you downloaded it off weights.com, it's not needed and won't impact the actual model at all
alright
sounds good to me ..dont have my friends online...will definetly let you know
Thansks
rmvpe is fine right?
yes
so everything else is fine right?
yea
Is this any different from "vcclient_win_cuda_2.0.78-beta.zip"??
Or is it the same
vcclient_win_cuda_2.0.78-beta.zip
you shouldn't use that one
that is original wokada
not suggested anymore
Belle uses wokada deiteris fork which is way better
@normal oar lemme guess, you used a youtube tutorial?
Exactly
all video tutorials are outdated, delete everything you got off it
im guessing you have vb audio cable, which is known for causing issues on windows
Does it contain malware
No, it just has way worse quality, way worse performance and much more unstable
I've been using it for couple days now
you shouldn't
I see, glad no malware 🙏
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
read up the 1st link
Alrighty
wokada deiteris fork
there's no updated video
and prob will never be, because ai changes at sonic speed
what's your pc gpu btw?
4060
alr u will be all fine
Mhm
Why is fork better than the orginal??
it has better performance improvements
and advanced settings to have better stability and quality
just forget eevrything you see on yt tuts, as they are pretty old
they prob told you also to use crepe lol
Got it, its completely safe right?
Ikr, i use rmvpe
yes its literally FOSS, Free and Open Source software, you can check the code yourself and do whatever you want with it
plus the deiteris fork has more protection over the models
Last update: May 5, 2025
alr yw
Can I run it on docker?
Wokada isn't really meant to run it on docker and there are no guides about it
Nvm then
so the problem is not fixed...talked to my friends and it is just cutting off
I recorded it to show you
I am so done finding fixes tbh but I am still gonna try
lets rule out some stuffs....my mic is decent and my internet is more than decent ...and this is exactly how I sound
Guys, can anyone help me make a voice for a YouTuber?
elaborate:
- your pc gpu
- your operative system
- what you want to do
wtf? show again a screenshot of ur current wokada, are u sure ur microphone is decent enough?
It's exactly the same ....I can't send you now cause I am out of my home but I'll tag the last image sent
I have the apple airpods for real
Also the voice is perfect when I monitor it
5070
64 operative
YouTuber akvi4
so, it sounds perfect on monitor, but horrible when recording or using it on discord?
and you already tried all solutions?
Exactly
Im guessing you mean windows 11 x64?
well, be aware that it will be very complex to train a model, firs tof all you need applio (rvc fork):
How to (unofficially) use Applio for RTX 50 serie cards
Follow to download it as said it in https://docs.aihub.gg/rvc/local/applio/
After you extracted the precompiled, go to the path in Windows explorer, write "CMD" and press enter, then in CMD write env\python -m pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --upgrade --index-url https://download.pytorch.org/whl/cu128
If you get any already satisfied requirement issue, run env\python -m pip uninstall torch torchvision torchaudio then the command said above
After that, you can check: https://docs.aihub.gg/essentials/how-to-make-voice-models/
Last update: Apr 01, 2024
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
Yea as far as I know
@noble hound I have deleted your post, you can make models yourself but NOT REQUEST them
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- Suggested Models for Realtime Voice Changing (Wokada)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- make it yourself with our docs guides
- Be aware that we don't allow any model request, so don't fall for any "pay me 20 dollars and i will make the model for you" dm
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
Honestly, I would like someone to do it for me, I'm too stupid to do it myself
you can't, we don't allow anyone to do commissions, either do it yourself or search if someone already made it
we have removed model requests
@noble hound are you still looking to make it yourself? if not, you can check the sites above to see if anyone has made them already
hey ive been using deiteris' okada fork and ive got it well and running. but is there any tips to help increase the accuracy of the voice? sometimes what i say doesent get correctly interpreted by the model. (dont have an accent). e.g dog gets interpreted to doll by the model
Maybe it's something related to your settings or the model you're using wasn't properly trained or got a short/bad dataset. Also, you can try imitating the voice itself.
Ok
I think I'm seeing a sign of over-training, should I keep it running?
(10 min dataset, training with MAINLINE on Kaggle)
@pastel oak maybe you got any idea?
Are these the settings you have rn when you recorded
at 2.4k steps - fat chance, unless you runing batch 64 on a tiny set 🙂
testing on different epochs may somewhat surprise you
try another model
if you trained the model, try using longer dataset with better articulation
CAn I please get some help?
elaborate:
- your pc gpu
- what you want to do
- your operative system
- whats the issue
- what tutorial link did you follow
why is it so laggy
can someone help me the client is loading but ntohing pops up
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Does anyone have any advice regarding flux gym? their isnt alot of info on the internet about how many training epochs and repeat images you should do. Im training a Lora for a realistic women.
like whenver is ay something it comes out like 10 seconds later and im breaking up
don't expect reasonable performance with cpu mode or too weak spec
i have really good specs i think its my settings or smth
gpus older than RTX 20-series and RX 580 are not good
or even igpu
I need settings help as well..
can i run the client in the background smoothly with RTX 3060Ti?
What is the default for volume in and out in okada? 100%? Does this make the sound neither loud nor small?
@low shard helpp
Yes please help
how do you run the w-okada voice changer on linux, the guide tells you to how to combine and extract the two archives but what do i have to run locally
the guide should be explanatory enough
otherwise try troubleshotting by yourself or ask chatgpt
the one that will sound fine for you
anyway 100% is the safe one to avoid some distortion due to audio clipping
why does my VC client thing say wait web server... and doesnt do anything
but in 100% and out100% so loud
bruh system volume
set it to less than 100% and it will never be clipping
Hey, what is the best cost-efficient API model at the moment?
/downloads
Hey! I am getting no output
It doesn't seem to work
Getting 0 input, 0 output
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
We are a general ai server, not just voices
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
This is a general AI server, we aren't rvc focused
Wdym? If you're asking to advertise your videos, this isn't the right place
There isn't a best setting, just play with it, it doesn't affect quality
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
Wdym with client?
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
We are a general ai server, you need to elaborate
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
For what? There's literally thousands of AIs
Show a screenshot or screenrecording
Headphones issue, solved it 👍 thanks
What videos no mate I’m trying to make a discord bot and need actual help how do I get the help I need
So I have been running in cicles trying to figure out on my own, what am I supposed to do about pytorch? I'm trying to set up Okada for the first time.
I have a RTX 5070 so there seems to be a bit of a snag, but I don't know what to do about it
a decent alternative is vonovox, it already supports 50-series cards as well
Last update: June 2, 2025
No way to use Okada?
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Close enough
wokada deiteris fork has way better performance and quality
original wokada is old asf lol
You sold me
tho, vonovox is a similar fork which may have slightly some improvements for nvidia
so it might slightly be better but im not sure, i haven't used it, most people use wokada deiteris fork
So which one do I grab then, deiteris or vonovox?
unfortunately RTX 50-series doesn't have backward compatibility to the older pytorch versions used by the older voice changer as well
nor the current deiteris wokada
nor even the og 2.x beta
https://i.imgur.com/0FTJXqx.png downloaded, ran setup, what do I do now?
start it
Try launcher.py
Just had to wait actually

okada link
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
Elaborate:
- your PC GPU
- your operative system
- what you want to do
rvc
rx 580
livestream
what's your operative system? windows?
11
rvc
livestream
RVC is not for realtime
RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.
Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)
Do you need RVC or Wokada?
They do 2 different things
wokada
btw, be aware that your pc gpu is bare minimum, you won't be able to use the voice changer in games like marvel rivals
and will play every game at lowest settings
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
not for games
1st link, read wokada deiteris fork
don't follow youtube tuts
they are all old and use an over year old version of wokada
does anyone has a working collab link for training RVC V2 model?
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
what? you shouldn't go download it off github, if you used any video tutorials, they are old
delete everything you got
read the guide I sent you
missing steps could lead you to fuck up your whole audio system
my pc has gtx 1650 and windows 11 , I want to train a voice model on cloud
I mean, your pc gpu could technically train locally, but it would be slower and limited so yeah
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, not much hours for training, but easy to use, there's a paid tier):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
which one would be better
you are on the wrong place
you got an amd gpu
@pseudo steppe get https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-amd-intel-and-cpu-on-windows
Last update: May 5, 2025
Read carefully the guide
Which microphone would you recommend me to get for ai voice? (edited)
It sounds bad on my headset ones
How do i train ai to duplicate my friend voice? (i already got his voice like 3 mins)
Elaborate:
- your PC GPU
- your operative system
- what you want to do
Hello, I have a problem with the application. I would like to prevent myself from being heard in the background. Is this possible?
change sr to 48000
How do I remove my voice in the background when I speak please?
doesn't work
use wasapi, not asio
wasapi used
is that og wok or the fork??
og
then download the fork ez fix
Yo im using otaka voice changer and i tested some models but they all sound kinda chopped even with good settings. I mean like they stutter. How can i fix that typa stuff?
but its basically with all of the ones ive tried
do they sound choppy only when playing games?
no its alltime
tell me ur gpu and which wokada version u have
are you using vac lite or vb audio cable?
vb audio
its like 2.0.78-beta
and i have gtx 1650 but is it only gpu dependent
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
uninstall that
install fork version
uninstall vb audio cable
install vac lite
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
wokada deiteris fork
don't trust video tutorials
the version you got is original wokada

in this day and age
yeah just saw someone using it #1387831485901574195 message
Ruby who wanted help also used it from a video tutorial
installing the fork and replacing vb audio cable with vac lites fixes every weird error
yes but people are attached to original wokada
they don't care enough to check this channel 
ping everyone 🔥
do i have to change any setting in vac or js keep it how it is?
dont change anything and be sure you're downloading vac lite, not the trial

which one do i download? ( nvidia gpu)
amd64 cuda
devs loving using super long unnecessary names for their shit
Last update: May 5, 2025
amd64 = x64 bit cpu
cuda = nvidia
here is the compiled version for nvidia https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip (non 50xx series)
i js downloaded the cuda thing, do i delete that again and download this or wt
gtx 1650 , window and i want to make my friend cover a meme video
they're the same thing, the difference is that the huggingface link already merged the two cuda zips
oh so i need 2 cuda?
your pc gpu could train locally, but its not suggested and slow
Train (make) RVC Models on cloud:
- Prepare the Dataset
- Setup RVC:
Choose a cloud way to use RVC,
- Google Colabs (max 4 hours of daily T4 16gb gpu not granted for free, not much hours for training, but easy to use, there's a paid tier):
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, either T4x2 16gb each or P100 16gb, only free):
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly, Free Studios run 24/7 but require restart every 4 hours. There's a paid tier):
- Be sure to know about the tensorboard
Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC
RVC Inference (use models) on pre-recorded audio on Cloud
You can use either:
- Weights.com: Easiest Possible Ever Automatic
- Ilaria RVC Zero: Fastest free on cloud
- Applio UI Colab: RVC Fork with some extra features like TTS
- RVC AI Cover Maker UI: Automatically Separates the Vocals and Instrumentals, converts the voice and mixes them back
yeah, the cuda version are 2 zips that needs to be merged
thank u
the github has a gif that teaches u how to merge the zips, but if u dont wanna do that just download it from huggingface
^
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
which one do i have to use ( vac lite)
uninstall vb audio cable from windows app settings
did you perchance install voice.ai?
nop
ValueError: Unable to detect audio format for file extension: .opus
what does that mean?
i've been trying to make a cover several times and i always get an error
get vac lite here https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#virtual-audio-cable
Last update: May 5, 2025
uninstall vb audio cable
i did
did you follow both steps?
if you miss this, you can fuck up your whole audio system
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
@cosmic knoll the file is setup64. not setup64a
My GPU is Intel HD Graphics 520, my operative system is windows, i want to do an ai cover, this the link:https://colab.research.google.com/github/Eddycrack864/RVC-AI-Cover-Maker-UI/blob/main/assets/RVCAICoverMakerUI.ipynb#scrollTo=sDo4MkcLcN5l
and the tunnel is gradio
Anyone able to help me with background noise for the voice changer?
which one do i select?? i can hear the ai voice but it doesnt do it ingame
did you upload a video with the .opus format?
line 1
and set game input to line 1
also its better u show full settings
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
why i cant send screenshoot
I have a problem
for some reason my voicemod is not working
and I connected evrething right
hi i have a voice changer and when i used to add models normally but now when i add models it says PermissionError: [WinError 5] Access is denied: 'model_dir\1' what should i do
are you talkinga bout okada or voicemod
did you click 'start'? 😛
why the voice changer(okada) doesnt work in valorant, anyone knows how to fix that?
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
!give-media-perms 1h @upper goblet
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
do not use .opus as input
how do i run rvc on mac?
poorly
😭
that ik
whats the best way? i have trained an model with rvc web ui now i wanna use that with mac
hey, just wondering as a freelancer starting out, what the best automations are to sell to businesses. I have good knowledge of n8n, html, css, and im decent with js and make
Anuew dereverb or MDX23C dereverb for inference vocals
https://huggingface.co/spaces/r3gm/RVC_HF
Help, it's not working anymore
for some reason on weights i cant create
Hiya
My Okada is very breathy
And
I can't for the life of me get Voicemeeter to release sound out of B channel
No matter what.
Any ideas?
Further, what about seemingly random and painful "Skipping"
I cant say "hello hi there" without getting at least 4 cut pops
on an rtx 4090
post a screenshot of your okada here
pok
this is not the voice changer version we know so far, perhaps too old or a random fork?
Last update: May 5, 2025
just the recommended one
can w-okada damage gpu?
Or maybe I'm seeing smt wrong.
Let me try again, fresh.
U were correct.
I had the wrong fork somehow.
Thank u sirs.
Btw, Server or Client?
(I really don't understand why there are two of those)
And which audio driver do u all use the most?
wasapi, etc
Yes, I have been up for 40 hours. yes I should apologize.
I am sorry.
And I am grateful to u
well so you stumbled upon a strange fork I've never heard of
and ye I think you should consider the recommended fork as written in our guide here
Can I somehow send u a vc to ask
What u think
Or just hang in supp vc
Uhm I dont know what banned word I used
Let me try again
If anyone wants to spend 60 seconds in vc with a noob
To help
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
elaborate
you dont seem to have wokada deiteris fork,get it from the guide
Better than main?
oh nonono
lemme guess, you followed some youtube tutorial?
all video tutorials are old
!give-media-perms 1h @brittle wing
I'm using a random voice I found
Sounds fantastic on the site
Sounds... similar when I do it but
It's missing a lot of "Depth" and reality.
Sounds like a breathy mess

poke
set rmvpe without onnx, uninstall vb audio cable it can cause random issues on windows, get vac lite https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#virtual-audio-cable
Last update: May 5, 2025
set extra to 2.7
are you using it in games? what other programs do you have in background
Basically none
And the PC is stronk.
I'll do the other stuff u said.
uno momento por favore
unless you manuallu overclock your gpu, which that already shortens its life span, and your pc isn't overheating 24/7 with bad airflow, no
what damages gpus is extremely hot temperatures
are you sure no other program is in background? you should have less perf
you should close as many programs as possible just keep open the voice changer, discord and whatever program you're going to use wokada in
on wokada, you can optionally use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: May 5, 2025
Ok, I think I'm set. Wanna talk to me for 10 seconds in support chat?
To tell me what u think
which temp would be over heating for 2060
I can't VC, but you can set monitor to headphones to hear yourself
No I did that ofc.
It's just, u do this a lot
U can "sniff out" any artifacts.
If any.
I think it's decent tbh.
all the other thing you can do is just try other models, not all models are good
plus, there's no such thing as a perfect model
voice models got limitations
they can't laugh or scream well for example
so rvc is 100% detectable
that seems to be case for all models
none of em can laugh

just dont get like over 85 celsius degrees, anyways you shouldn't have any issues unless you do overclocks, play super intensive games and have bad airflow
yep, @brittle wing RVC is detectable ofc, there's nothing perfect
@brittle wing do you need any other help?
No sir. Not trying to trick anyone. Just enjoying the software.
It's fun.
it reaches 83 degrees when playing demanding games like rdr or fh5
alr do you need any other help?
No sir, thank you.
shouldn't be too much of an issue, @simple ore what do u think?
"Maximum manfuacturers operating temperature for the RTX 2060 is 88 degrees."
I'd worry if it was constantly >80C
idle or while playing demanding games
during load, obviously
yeah it does stay 82-83 while playing rdr
what is memory junction temp
try using GPU-Z to check the performance metrics
or HWinfo as said above
what's the normal fan rpm for gpu's
~1500RPM on load, 0 on idle
mine is at 2000 rpm tho
it depends on the AIB model
go check some review spec for yours
is zero rpm actually a thing
casue mine always spins
not sure but it has been since RTX 30-series
I knew a TUF gaming RTX 4070 has zero rpm and its temp stays below 60 in full load
while average gpus stay around 70
Yo does anybody know if UVR or some other vocal separating software has a model that can separate a reference / similar voice vs other voices? Like a male and female talking in a podcast and isolate them. I know like resemblyzer can sorta do it kinda
Nvm I found a doc that tells me some stuff
I have a question, what is the minimum graphics card required to create AI models locally?
assuming for RVC, at least RTX 20-series
Im having some issues with the W-Okada RT, Using a vitrual cable to try and run it in games/discord. When I listen in on the cable trough sound manager I hear the changer but discord and games seem to either not pick it up or just go with unchanged voice.
can sm1 help?
how can u search for rvc of some specific character voice? like aizen
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
@lucid veldt in #1175430844685484042 or #🔍│find-models
What search? Is it #1175430844685484042 or Weights.com?
i can hear myself in my headset
the one for okada i cant find aizen some how
Do you mean "can't"? Is it about W-Okada the realtime voice changer and virtual cable?
i mean i can hear myself
with the voicechanger
idk how to fix it
its kinda annoying
It's annoying indeed as you use the "original" version of W-Okada from YouTube tutorials, which are outdated. Which W-Okada version are you using? What is your PC GPU?
wtf is W-Okada?
W-Okada the realtime voicechanger. A program that converts your voice into AI voice in realtime.
Weird and crazy. You can send your "program" screenshot here so I can identify it for you.
No.
than this is going to be kinda hard
I thought users with green name can send an image here and in #1192011222023950368 ? Did this server changed or something?
nah bro, i just think that i have turned one of the things on
That's not the correct answer.
It tells me to Select "none" for your monitor device.
i already did that
so yeaaahh
If your "voicechanger" looks like this, this is "W-Okada". However, the "original" version opens its own window. I use the better version, which it opens in browser. https://cdn.discordapp.com/attachments/1159290139609137264/1386272484352856104/image.png
ill just wait for sm1 there know smt about this voice changer ig that is my only choise
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Just managed to get it working I think, Needed different audio cable
Thanks for checking up on it
be sure ur using vac lite and wokada deiteris fork, not from yt tus
That didn't answer some of his questions, but close enough of getting W-Okada to work by yourself. 
my okada broke, it seems to not hear anything from my mic but instead its translating other ppls voice
I think I remember you asking about this before, or maybe I just recalled a different individual.
I think we have talked about "index damaging the CPU" thing before, but the messages I replied long ago were deleted after I went away, thinking it gonna sound like a bad idea to ask about some basic ahh question. 
I don't think it does?
okada just kinda
is a voice changer
voicemod doesn't damage gpu
which is better crepe_full or rmvpe
Rmvpe.
Crepe is an old F0 pitch extractor, used in RVC. Rmvpe is newer.
the real answer to this is to choose the f0 estimator used to train the model
most models use rmvpe
with crepe_full i can even sing with it without them noticing its voice changer
when is rmvpe v2
in terms of f0 estimation precision, it's pretty close to rmvpe
there are some moments where it can be better than rmvpe
but for the most part, rmvpe is better
However, there's an exception. If a model is trained with "crepe", use crepe on W-Okada. The RVC voice models that trained with crepe are usually from 2023 and considered old. 
i recall crepe being decent for singing
What
but using crepe in a rmvpe model may be a bad idea, the model learned the sibilants and breaths using rmvpe knowledge
rmvpe zeroes the breaths and sibilants, but crepe doesn't
it gives them a value, a value rmvpe never gave to them
I've never seen an actual rmvpe V2 being released anywhere. But I just see this in a GitHub repository.
rmvpe v2 doesnt exist namari u cant be serious 

the thing here, crepe in terms of precision, isn't trash, it does have its limitations
but can work
this is the reason why u dont wanna use crepe in a rmvpe model, besides perfomance issues
let's produce it
if singing works fine for him using crepe it's because the thing isn't trash and horrible
its quite decent
All I see is "RVC v2, rmvpe" but not "rmvpe v2" in #1175430844685484042 
for 2018 standards ofc, rmvpe is better than crepe anyway
however, f0s have nothing to do with realism, thats just placebo
and for inference, the result in sound is the same
your rmvpe model is gonna sound like a rmvpe model no matter what u use
well, teeeechnically the inference f0 matters
because that is the input (in addition to phonemes from feature extractor)

I dont use index tho


Heya! 🫡
I am trying to rollback from LatentSync 1.6 to 1.5 but I need the LatentSync_v1_5.ckpt and it is nowhere to be found now 😦 anyone have a clue to where to get it please? 1.6 doesnt work anymore on my setup (3060ti) but 1.5 was fine for 8sec in 720p, I shouldnt have try to update 😦
whym does not work properly?
there's always mainline, just download the compiled version and return to 2023 glory days
idk if i can ping but i need this to work 😭
can someone help me with w okawa colab
its keep on saying
ERROR: Could not find a version that satisfies the requirement faiss-gpu (from versions: none)
ERROR: No matching distribution found for faiss-gpu
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 261.0/261.0 kB 15.2 MB/s eta 0:00:00
Preparing metadata (pyproject.toml) ... done
Building wheel for pyworld (pyproject.toml) ... done
Installing dependencies from requirements.txt...
ERROR: Ignored the following versions that require a different python version: 1.21.2 Requires-Python >=3.7,<3.11; 1.21.3 Requires-Python >=3.7,<3.11; 1.21.4 Requires-Python >=3.7,<3.11; 1.21.5 Requires-Python >=3.7,<3.11; 1.21.6 Requires-Python >=3.7,<3.11
ERROR: Could not find a version that satisfies the requirement onnxruntime-gpu==1.13.1 (from versions: 1.15.0, 1.15.1, 1.16.0, 1.16.1, 1.16.2, 1.16.3, 1.17.0, 1.17.1, 1.18.0, 1.18.1, 1.19.0, 1.19.2, 1.20.0, 1.20.1, 1.20.2, 1.21.0, 1.21.1, 1.22.0)
ERROR: No matching distribution found for onnxruntime-gpu==1.13.1
Successfully installed all packages!
Can anytone help me wiht okada it sounds so ahh even tho i got a 5070
@acoustic scarab was there an update becuase my rvc just dosent open anymore
it runs the command prompt but nothing happens
highly unlikely
whys it broken then
i think im just gonna reinstall it, do i get cuda 1, cuda 2 or dml?
i dont remember what they do differently
i can't say unless you provide a log or any other output that might be analyzed
cuda for nvidia gpu, dml for amd/intel, but honestly i've never heard of a choice between cuda 1 and cuda 2
cuda 1 and 2
both
oh
it's a divided zip archive
is the file that huge?
i guess so
I am training a LoRA in fluxgym of a real life person, training for face and body. Using a 58 training images. Would love some advice on how many epochs and repeats I should be using, and how many total training steps I should be aiming for. Im pretty concerned about wasting my time training such a big data set for the LoRA to just come out over or under baked.
theres an issue
its not possible to use cuda 2
its not a zip file
what file archiver do you use
put them in the same dir
and use extract to on the first one
7zip should extract both
it still says the second one isnt a zip archive
the thing is 7zip should've considered all of archive parts as an entire archive so you don't have to extract the remaining parts(.002 in your case)
yea but they separated them in download
what was in the archive then
just the rvc
try it
ohh you mean
yea
ok yea that dosent work
does the same thing as the vrc i just deleted
it initalizes then nothing happens
does the console stay open after initialization
yes
take a screenshot of it and send it
where exactly did you get the archive
from the offical github
gimme the link
i've never used deiteris fork and can't say what could've gone wrong, you can try the default w-okada from here https://huggingface.co/wok000/vcclient000/tree/main, latest stable build https://huggingface.co/wok000/vcclient000/blob/main/MMVCServerSIO_win_onnxgpu-cuda_v.1.5.3.18a.zip
yea but isnt deiteris fork wayy better
https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip
download this instead, it's the fork but with the two zips already merged
ah ok
this download did the same thing
just initalizes and then nothing happens
that is indeed weird
uh
what gpu?
geforce rtx 2060
the cmd should output a localhost link hmm
anybody have any tips on how to cancel echos from VC in Discord when I use RVCC under server mode? The only solution I have rn is to lower chat volume
switch to client and enable echo in w-okada
and maybe enable sup2
okay i was hoping to keep server because it works a lot smoother and faster
maybe if you wait maybe 5 minutes it'll fix by itself? weird issue, never seen that before
you could try downloading a third party noise supressor, then using that as your input
how is waiting 5 minutes going to help
so by doing that you can keep server mode
it may be stuck
too much for the cpu to load maybe?
i mean it used to work fine
is this problem also present in the original version?
switch to vonovox 👍
(most of the time
)
good idea
whats that
Last update: June 2, 2025
a new voice changer made by a known dev of the community
is it more optimized
faster than w-okada
yeah
way way way more optimized
and doesn't run in your browser
thats like exactly what i need
if something goes wrong you should join their discord server tho
wait where do i download it
oh its twards the bottom
vonovox discord server is in their github
any issues regarding the app go there and notify the dev about it
kk
deiteris wokada is still okay for beginners
but for those having RTX 50-series I'd rather suggest trying vonovox
Does anyone know a good base fnf cromatic for RVC?
i have an AMD Radeon RX 7800 XT gpu, is this good enough to train a voice model on my pc and if so what should I use to do so?
why does every single female rvc model sound so dookie without being merged into oblivion 🥀
you can patch Applio to use Zluda emulation and run it on your card
would it be better to run it locally or on the cloud
on 7800 better locally
still pretty good with 16 GB vram and zluda
ok so i may be dumb but i downloaded applio and extracted it to a folder but i'm not seeing a run-install.bat file, but i do see run-applio.bat, do you know why this may be?
can someone help me with w okada its really laggy
open a help thread
if you're using Adrenalin driver older than 25.5.1 there's a guide https://docs.applio.org/applio/getting-started/installation#amd-gpu-support-windows
for newer driver there are changes
For Adrenalin 25.5.1 driver or newer
- Download a compiled version of Applio v3.2.9, unzip to the desired folder.
- Download HIP SDK 6.2.4, install the components except the video driver at the bottom
- Add
C:\Program Files\AMD\ROCm\6.2\binto your Path environment variable. - Using command line from the Applio folder run
env\python -m pip uninstall torch torchvision torchaudio
env\python -m pip install torch torchvision torchaudio --upgrade --index-url https://download.pytorch.org/whl/cu118 - download these into Applio folder
https://github.com/IAHispano/Applio/blob/main/assets/zluda/patch-zluda-hip62.bat
https://github.com/IAHispano/Applio/blob/main/assets/zluda/run-applio-amd.bat - replace rvc/lib/zluda.py with the following
import torch
if torch.cuda.is_available() and torch.cuda.get_device_name().endswith("[ZLUDA]"):
# disabling unsupported cudnn
torch.backends.cudnn.enabled = False
torch.backends.cuda.enable_flash_sdp(False)
torch.backends.cuda.enable_math_sdp(True)
torch.backends.cuda.enable_mem_efficient_sdp(False)
- run
patch-zluda-hip62.bat - run
run-applio-amd.batto start Applio
i'm currently on adrenalin 24.10.34, would you recommend just following the old guide or updating real quick and doing the new version?
that's not a right version, probably some non-whql beta
soo update then
yeah, should be better with latest zluda
im gonna be honest this branch is pretty awful can someone just fix okada
need help
id rather not use some offbrand version of okada with no features
@knotty moth do you have any idea how to fix this?
Which UVR model is best to minimize bitrate dropping?
What does merging models do
combines the timbre of each models together
what does that exactly do
create a new voice
does it work better? 
honestly i have no idea, if done right it should be ok
i'd rather not do it lol
i havent used that feature much to create new voices
it does have more uses than just creating new voices but thats a bit more advanced
like merging two different training runs
you use applio
or like
the voice changer app? 
uhno
i thought theres an option in wokada itself
there is one in the og version, not sure if the fork kept that feature
i personally only use rvc's merging
click options and find it, should be there
the results wont be always so great
most of the time is ehh, weird
basically its all random

...Which model is Gabox's voc_fv4?
80% of the time the new voice sucks/ is mid
Sometimes you can get a good one but you'll need to be lucky
by avoiding to convert to lossy formats in any process
Hi! I need help with b3223 nvidia version w-okada thats fork, and today i wanted to open the voice changer, oh noes! The voice changer doesnt open! the cmd says this "json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)" and includes a screenshot. When i press enter to continue, the cmd closes and nothing happens.
I dont know why but i think it could be something about model_dir file, because recently i got a new model
Nevermind it was the model_Dir file, i deleted it and it got fixed
Hey. Which program for generating ai videos based on text? Something free and good like Rvc Gui, please.
easiest is something like comfyui + t2v model of your choice
or t2i model to get a starter image + i2v model to make a video
Okay... Thank you Noobies. One more question I'm writing long texts, and make free audiobooks possible would be cool i mean text to cloned voice. @simple ore
What's the best pretrain for anime Japanese speaking characters?
I know now. Google text to speech and next step using rvc gui.
Hey, anyone here with experience in LLM memory systems, fact consolidation, or contradiction detection? I'm working on a project that handles reinforced semantic memory and dynamic fact resolution.
what software should I use for voice cloning RTC on win pc
yeah same happened with me too
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Sir Nick, can i ask if i want to improve a real-time model, i should focus on the data right?
because people somehow still spot i'm using AI right away
RVC models are 100% detectable, there's no such thing as a perfect model, RVC has limitations for like laughing and screaming btw

are you trying to train your own model, or use one made by someone else?
i can also suggest better wokada settings if u show a screenshot
if you're like vtubing, you shouldn't laugh or scream lol
be sure to get as much high quality clean data as you can, and to use the tensorboard
Aye, i guess it's time to learn how to imitate voice
I use tensorboard ah

Just my data is from podcast
They have pretty much monotone voice
Like someone is telling stories
there are plenty of tts options with zero shot voice cloning
Longer dataset = more realistic results
Expressive data also helps
anyone knows how to fix my problem?
i have a model with only a model and no index. at the beginning it worked well but now the voice is cut when i try to speak. my other model with index is working
This is a General AI Server, we won't be focused on voices anymore
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
How many db should I put the microphone volume into the voice changer? And what is the voice changer input and output volume basic without amplification?
somebody helpp
helPppp
First part of your question, input volume is subjective, theres no right or wrong, every case is different
Just make sure your normal voice going into the voice changer is not too quiet and not fucking up your ears (turn on passthru to hear your normal voice coming out the voice changer to test this)
elaborate all i asked
it is likely some model slot .json got bricked
MMVCServerSIO\model_dir\
adding to that, in case its not clear, delete the model_dir folder like the person you quoted before that already said himself @pseudo steppe
Make sure to save the models inside model_dir in case you deleted them off your pc
I need an exact figure. I'm going to put in a microphone volume of -6db
There is no exact figure because nothing is right or wrong, its just your mic volume
As long as your mic is clear going into the voice changer you have the most optimal
But isn't the sound distorted if you turn down or turn up the input or output volume?
vol in: controls how loud your normal microphone is going into the voice changer. You can leave at default 100%
vol out controls how loud your changed voice sounds like going out the virtual cable. This you can also leave at default 100% unless you prefer a louder voice, I would go for max 150-200% personally but everyones different
Some models are trained quieter or louder than others too, that is a factor
A quick self test can be to use discord self hearing and you determine if the voice coming out is adequate to your liking or not. The main point is there is no right or wrong, so we can not suggest you a setting because its all preference
anything works as long the input audio is not distorted
actually the thing is it for working fine from months but suddenly from today only this error is coming
4060
windows 11
discord
So if the sound is too loud, would reducing the output volume less damage the sound quality than the input volume?
if the volume is too loud the model may sound robotic
u can try deleting the model_dir folder to fix it, might be a bad model, but u will lose ur model
thx it is resolved now
yw
why??
because by turning the volume too high you're also boosting the natural artifacting rvc has
so they become more easily audible
what is natural artifacting rvc?
model randomly sounding robotic
I do noise treatment on the microphone and it sounds the cleanest at a high volume. Microphone noise ratio snr
as long everything sounds fine, don't worry much about it
and i use sample rate 48000hz and bit depth 24bit is it good?
and buffer size and max latency(vb cable) which value is good?
i use 2048 buffer size and 7048 max latency
I have around 1 hour of dataset. 
Let me train it again

hi i am trying to make my own voice moddle for W-okada but im to dumb to get Applio to work even though i have the guide of the logs coud someone pls help me?
the program is working but im to dumb to use it
I want to clone a voice
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
Elaborate:
- your PC GPU
- your operative system
- what you want to do
- what tutorial link are you using
- a screenshot of the program
i have a 4060
i use windows 11
i wanted to make a voice model of my frend
i am using the totorial from the Ai hub logs
like i said everything works or at least i think it doas im just to dumb to use it
have you tried reading https://docs.aihub.gg/essentials/how-to-make-voice-models/ ?
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
thx
help does anyone have an ai humanizer
I start the first process in RVC v2 Disconnected colab but I get an error, what should I do?
outdated af, tries to download pip 22 from 3 years ago
-colab
Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
by IA Hispano
Google Colab
by Hina
Google Colab
by Eddy
Google Colab
by Eddy
Google Colab
by Deiteris & Hina
Google Colab
by Shiro & Eddy
Google Colab
by Nick088
Google Colab
by Nick088
Google Colab
by Jarredou & Makidanye
Google Colab
So how can I fix this problem?
very unlikely to fix, since it is probably using python 3.9 code while colab is now 3.11
What web browser works best for the RVCC Wokada Deiteris Fork voice changer? I currently use Opera GX and have some issues with it spiking and lagging when set to recommended settings for my GPU
Okay, I solved the problem thanks to ChatGPT.
scroll down until you find the section where you set the batch size value then click advanced settings
if you truncate the silence of your dataset using audacity, yes
if not, use automatic
i truncated in audacity
how many minutes this dataset have?
it didn't get all of it so i made the gain threshold a lil lower and i manually deleted some
you're not supposed to remove every silence, i personally left 0.35 of silence and let the automatic slicer do its job
i also got an xlr cable for my mic and my golly does it sound soo much better than usb 😭
oh yeah not all of it is gone
i just did it to remove like
keyboard noises
nice
it has some silence left
batch size 8 may be ok, but if the results come out bad, you could try batch 4
How do I uninstall the original version of W-Okada? I dont see a uninstall file and idk if i want to just delete the folder cuz idk what else it installed on my pc
what does batch size do again?
as long is not over 0.35s of silence you'll be fine
amount of slices the model learn every step
don't think it is
just delete the folder
oki
oh and by "bad" u mean like overtrained or like robotic or smth
also should i turn on overtraining detector
model randomly having issues like voice glitching/breaking randomly, weird artifacting
set it to auto
auto = if dataset is over 1 hour, uses kmeans instead
kmeans is intended to be used with big datasets to speed up index creation
for anything lower than 1 hour, use faiss imo
kmeans learn less thats bc it's faster
oh ok
over 200 hours
I wanted to try morphing my voice for an animated short project - I set up "Retrieval-based-Voice-Conversion-WebUI" in an virtual environment, but when I try to get any of these voices working, I have no success in making them show up in the gradio GUI
https://huggingface.co/QuickWick/Music-AI-Voices/tree/main
I tried looking for a how to and asking ChatGPT, but most documentation is in Chinese and ChatGPT is suggesting nonsensical "solutions" - help would be really appreciated
I used "python tools/download_models.py" and "Inferencing voice:" in the GUI is still empty.
oh mb, it's over 1 hour in mainline
it automatically does that when you have ~2500(?) slices
the setting is for 200000 slices
so it does not matter for normal model
12.5 years to train fr
and nobody makes indexes for a pretrain
wait so it's a different thing vs what is in mainline?
i already clicked generate index
dont think much about it, it's one of those complex settings that were originally hidden from mortals like us lol
😭 damn
it is entirely useless for anyone
nice, i always thought 1 hour sets used that kmeans thing
ok so i'm gonna play spider-man 2 and let it train for a bit
hopefully my pc doesn't 💥
it automatically narrows the set if you got more than ~2500 slices.. yes, it is kmeans or whatever faiss method
it is automatic thing by faiss
gotcha, so confusing but i think i got it lol
well, for 200k samples it uses sklearn.cluster.MiniBatchKMeans
oh great i got an error
?
for 2500-200k samples it uses internal faiss minibatch, same kmeans
makes sense lol
you forgot to extract the features
oh 💀

so i extract features and start training then
what if you hide the kmeans option? so we don't use it by accident 
ye
send ss
you forgot to slice?
not the features... the whole thing
pointed the input to C:\training\file.wav


