#✨│ai-help
theres no way you noticed it was a fnf related audio with just what I showed
Are these the newest graphs & how do I read them?
im pretty sure i surpassed the overtraining, should I then press the stop training button?
ok so i will press it
oh no
theres a problem
the buttons seem to not be working for some reason
stop it on kaggle side
actually, it seems to have already stopped on kaggle and tensorboard
really strange that it just got visually stuck and kept going on applio
1000 epochs lol
yeah, i really wasnt sure what value i realistically should put, so i did what the docs asked
would you believe me if i said i have like 3 lowest points in my graph?
should it be the lowest point visually or by the "value"?
i dont think you can really use a chart for such model.. just test some epochs
uh, sorry? i know its a really limiting dataset and all, but aint i supposed to look for "the lowest point" and stuff? so should i go for the epochs with the lowest points and see which one i prefer?
lowest point info is outdated, in this type of ai the only way to know if your model is doing fine is hearing it
the graphs are more useful when training pretrains
i mean it can show you a good area on where to start listening
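A rough sketch of using the graph only to pick "where to start listening": it lists the epochs sitting at local dips of a loss curve so they can be auditioned by ear. The function name and the loss values are made up for illustration, and nothing here reads real TensorBoard logs.

```python
def candidate_epochs(losses, top_k=3):
    """Return up to top_k epochs (1-based) that sit at local minima of the curve."""
    minima = []
    for i in range(1, len(losses) - 1):
        # A dip: lower than the previous epoch and not higher than the next one.
        if losses[i] < losses[i - 1] and losses[i] <= losses[i + 1]:
            minima.append((losses[i], i + 1))
    minima.sort()  # lowest loss first
    return [epoch for _, epoch in minima[:top_k]]

# Hypothetical per-epoch loss values, not taken from any real training run.
example = [30.1, 27.4, 28.0, 26.2, 26.9, 25.8, 26.5, 27.0]
print(candidate_epochs(example))
```

The output is only a shortlist; the final pick still comes from listening to each candidate.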
hi i have a 9070 what should my settings be for voice changer
Currently, there is a lot of traffic on the website, and that's why things are taking time, but it will be working fine soon
I use these settings, see if they're good on your end
Mines amd however
same settings probably work :p
Thxx
hello where can i find the onnx models
they should be in the same selection menu
in find models there alt pth models
do you know how to fix it not detect my mic at all
like when i talk it doesnt change my voice or anything
<@&1159293140440723499>
<@&1159293204038955078>
I'm not sure, I'll be here when I wake up tho ^^
i watched the duckus vid
send me link
i figured it out i was on the wrong version
please dont mass ping
sorry
I need help with my voice changer
everytime i talk, for example I say "hello"
its like "Hel-oo"
its weird
<@&1159293140440723499> <@&1159293204038955078>
what do you mean
I just sent this
I just need help with my voice changer
I mean don't do like this^
any old video tutorial won't work at all
do you know any updated tutorials?
Last update: May 5, 2025
btw you typed faster than me 
seems like not enough vram, what is your GPU?
rx550
that's too weak, at least 6 GB cards as bare minimum or 8 GB as recommended
so RX 580 might be bare minimum
can use cpu mode though it will be slower
It wont open anymore ever since I switched to gpu mode
^close some other programs or reboot the system
Can anyone tell me where can I download Love and Deepspace AI voice model ?
what's an example of an extremely good model? ("good" being able to replicate target voice without sibilants with a decent range) like just one that is actually maxed out in quality for rvc's capabilities - can be a model of anyone, i just want to test an actual higher quality model
the less latency? more better?
don't spam ping roles
plus moderators aren't always helpers too
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- make it yourself with our docs guides
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
I'm guessing you're talking about the realtime voice changer, wokada deiteris fork that you got from our written guides and not some old ass youtube tut right?
if so, yes
u can also elaborate more for better help
what's ur pc cpu?
Have a girl voice
I7-3770
elaborate more
pre-recorded audios? realtime?
if realtime, where? discord vc? or in games?
absolutely not your pc is extremely too weak
😭
But I can’t find any of love and deepspace characters voice models in all of this website
you can either:
- buy a better pc
- use cloud (remote good pc) with limited time, and you need a good internet for this
what u want to do?
if there isn't any, you can make it yourself, we don't allow model requests here
also remember to maybe use all types of names that the characters could be possible named as, and all the sites i told you
yeah, what build you gonna get btw?
so like i can tell you if its good enough
like if you upgrade to rx580 thats just bare minimum and will be shitty in games
Idk yet but probably a rtx 3060 and a i9-12100k
ohh happy early bday
Thanks
nice, just know you won't like play marvel rivals at max settings while using the voice changer
you will need to put the game settings to low so you get as little delay as possible in the voice changer
I dont like playing games at max settings on any game
I like medium to low
So i should be good
Thanks
come back here if u get a good pc or want to use cloud instead
dont use video tuts tho
they pretty old
@peak stag @brittle wing don't use video tuts, that one is over a year old
tell ur pc gpu and what u want to do
Can you tell me how to make my own?
what's ur pc gpu?
i have a gaming pc and 4060 rtx i want to use a girl voice changer and it sometimes becomes choppy
okay, what do you want to do?
alright you edited the message
you seem to want realtime in discord vc or games
you followed a yt tut right?
you shouldn't
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
I don’t know . I use MacBook Air
yea
what does this mean
that's good enough, but for gaming and most regular task, you just need decent affordable cpu like i5 12600 or ryzen 5 5600
it's a bot message linking the written guides, read only the first one
there's no up to date video tutorial, so read carefully please
macbooks aren't that great for ai
RVC doesn't support training models on mac
the best thing is for you to either buy a good pc with a nvidia gpu, or use cloud (remote good pc) which is kinda harder to setup and have limited free time
which u wanna do?
alright thanks ill do this later
yw and lmk, remember to uninstall all things you got off that yt tut first
vb audio cable too, it's not good for windows
but this guide also says use virtual audio cable
ye i was talking about fork vc
vb audio cable is just one virtual audio cable, "virtual audio cable" is a category of programs, not just one program
the guide tells you to use vac lite, which is a better VAC (Virtual Audio Cable) for windows
Really? 😭 Thank you for answering my question
when i say uninstall everything from yt tut, i really mean everything
yeah, wanna try cloud?
alright
and i don't follow youtube tutorial i follow the guide
do u want to show a screenshot of it so i can check ur settings and give u advice
No
alright then, i guess you will just hope someone makes that model one day then
just remember that requests aren't allowed here, so don't fall into paid commissions, we dont encourage those nor want you to get scammed
Okay thanks a lot
yw
take a look at existing models in https://discord.com/channels/1159260121998827560/1175430844685484042 and weights.com
other than the Nvidia gpu spec requirement or the cloud solutions, you'd have to learn a lot to do model making
yeah it's not a 1 click button thingy to do
I found only love and deepspace Caleb voice model .I want other characters of love and Deepspace too 😭
as said above, we dont support paid requests/commissions
Okay thank you
For https://www.kaggle.com/code/suneku/voice-changer-public on the cloud, why does the guide recommend tesla t4 x2 and not the p100? You can only use one of the tesla gpus
it seems like the t4 is closer to a 3050 while the p100 is closer to a 3070
P100 is a pascal GPU (same arch as GTX 10-series) that is less supported for AI workloads
the raw performance isn't the right way to compare in this case
A support thread would say otherwise
From two different people:
"P100- This is the go-to acceleration method for most use cases and notebooks. It is pretty useful for most purposes and is integrated with Tensorflow and PyTorch well but does not use distributed computing
T4- This is also a good GPU but this needs to be used with distributed computing, else one may not use this to the fullest extent."
"GPU T4 x2: The T4 GPU is designed for deep learning and machine learning workloads. It provides fast processing power and supports high-performance computing tasks.
GPU P100: The P100 GPU is also optimized for deep learning tasks and provides high-performance computing capabilities. It offers faster processing speeds and larger memory capacity compared to previous GPU models."
you're missing the point of tensor cores, which are included in V100, RTX 20-series, and newer generations
and most AI workloads benefit or may require tensor cores
in this case, T4 is turing arch which has tensor cores
i cant find anything about tensor cores in the p100, does that mean it lacks them?
Volta (V100) and Turing (RTX 20-series and T4) are the first to introduce tensor cores
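A small sketch of the same point in terms of CUDA compute capability: Pascal (P100) is 6.0, Volta (V100) is 7.0 and Turing (T4) is 7.5, and tensor cores first show up at 7.0. The helper name is made up, and the rule is only a rule of thumb (GTX 16-series cards report 7.5 but ship without tensor cores).

```python
# Compute capability (major, minor) for the GPUs discussed above.
COMPUTE_CAPABILITY = {
    "Tesla P100": (6, 0),  # Pascal
    "Tesla V100": (7, 0),  # Volta
    "Tesla T4": (7, 5),    # Turing
}

def has_tensor_cores(capability):
    """Rule of thumb: tensor cores arrived with compute capability 7.0 (Volta)."""
    return capability >= (7, 0)

for name, cap in COMPUTE_CAPABILITY.items():
    print(name, cap, "tensor cores" if has_tensor_cores(cap) else "no tensor cores")
```

On a machine with PyTorch and an NVIDIA GPU, `torch.cuda.get_device_capability()` returns the same kind of `(major, minor)` tuple for the active device.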
Oh I see
Thanks for clarifying
im trying to train a model using codename's applio fork, but i keep getting this permission denied error:
PermissionError: [Errno 13] Permission denied: b'C:\\Users\\admin\\Desktop\\Codename-RVC-Fork-V3.1.6-rev2\\Codename-RVC-Fork-V3.1.6-rev2\\logs\\test\\eval\\events.out.tfevents.1750078534.HoneyBadgerG6X.21556.0'.
ive tried running as admin and turning off readonly and giving the folder full permissions, but to no avail, the file exists, but it thinks it doesnt. if anyone could help, that would be great and would be saving me hours of anger 
brol... like.. you need 50 more folders down the line
sure that will help
/s
it is not at the 260 character limit for a path, but seriously
extract the thing into C:\rvc or d:\rvc
and use a normal user, not admin
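A quick sketch of how much path headroom the shorter folder buys, assuming the classic Windows `MAX_PATH` limit of 260 characters. The `C:\rvc` path is a hypothetical result of re-extracting as suggested, and this does not claim path length was what caused the `PermissionError`.

```python
MAX_PATH = 260  # classic Windows path limit, counting the terminating NUL

def path_headroom(path):
    """Characters left before a path reaches the classic MAX_PATH limit."""
    return (MAX_PATH - 1) - len(path)

original = (r"C:\Users\admin\Desktop\Codename-RVC-Fork-V3.1.6-rev2"
            r"\Codename-RVC-Fork-V3.1.6-rev2\logs\test\eval"
            r"\events.out.tfevents.1750078534.HoneyBadgerG6X.21556.0")
# Hypothetical location after extracting into a short top-level folder.
shorter = r"C:\rvc\logs\test\eval\events.out.tfevents.1750078534.HoneyBadgerG6X.21556.0"

print(len(original), path_headroom(original))
print(len(shorter), path_headroom(shorter))
```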
yea that worked, its very early in the morning, i assumed it was that but it worked fine for one model and didnt the next, but thank you regardless lol 
Hello, someone can help me a little? what is the best rvc to use in online games? i have a RTX4090, i saw the Wokada Deiteris Fork peoples saying is the best, but doing a test with it, the Wokada use 90, 100% my GPU, its impossible play using this, or i can do something ???
unfortunately there's not much that can be done about gpu scheduling between games and cuda compute
you can try switching this off
hello can i have the guide for realtime voicechanger?
?
do you have a guide for this?
the gpu scheduling is needed for frame generation but ye it can somewhat cause performance issue
i dont have performance issues if i just using the Wokada Fork, the real problem its because he use 90 100% of the GPU, and its impossible running this with the game, the game consume 60% of the GPU
turn up chunk if it is too low
yea, i doing some tests here, if i use the low chunk i got instant voice with 40ms but my gpu use 100%
if i increase the ms to 120ms, i got 50% of GPU usage, but the voice its not so good like 40ms
😐
the stable one (less than 100% usage) is usually preferable
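Rough arithmetic behind the chunk tradeoff, as a sketch: a bigger chunk means more audio per inference call and fewer calls per second, which is one reason GPU usage drops. The 48000 Hz sample rate is an assumption, the function name is made up, and real GPU load will not scale exactly with the call count.

```python
def chunk_stats(chunk_ms, sample_rate=48000):
    """Return (samples per chunk, inference calls per second) for a chunk length."""
    samples = sample_rate * chunk_ms // 1000
    calls_per_second = 1000 / chunk_ms
    return samples, calls_per_second

for chunk_ms in (40, 120):
    samples, calls = chunk_stats(chunk_ms)
    print(f"{chunk_ms} ms chunk: {samples} samples, {calls:.1f} calls per second")
```

Going from 40 ms to 120 ms cuts the number of calls per second to a third, at the cost of at least 80 ms more delay before the converted audio comes out.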
yea, ill test with the game now, GTAV more specific, about the audio: Client or Server, which is better?
server mode with wasapi devices has less latency
i using FlexASIO, its better???
ASIO is even better if it works for you
yea, its working on FlexASIO format in client with SA 48000
is https://3d.hunyuan.tencent.com/ safe?
safe for what?
it is a tool to make 3d assets from ai generated images and texts..
good luck registering there though
safe to download from and use the models without getting a virus?
dunno, i dont have QQ account or chinese phone number to try
Hey, I have no idea where to ask this, but I'm trying to download okada voice changer, I'm on the Github page and there is supposed to be a place with a grid of stuff to download and its just not there for me
(I might be extrermly dumb but I'm pretty sure i'm in the right place)
what's your gpu?
Uh what's that
video card
open this
does anyone know a good anime lora with bold outlines like in the anime panty & stocking, which is available on civitai?
ew 
task manager is always a thing
that too, but it seems we are dealing with a special case
get the one from here, not the og repo
https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
no no its fine I got it dw
wdym?
omg repo :3
smh not that repo
what the sigma

"(I might be extrermly dumb but I'm pretty sure i'm in the right place)"
ohhh a special case
got it
how create rvc models? i can't find
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- make it yourself with our docs guides
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
did u first try searching in the sites i told u?
i want to create with my voice
what's ur pc gpu
just use Kaggle applio
how i can download applio?
what's your gpu?
4GB?
yes
I just wanted to share the results of different choosen epochs of my voice model, but im not sure where I can do that
also, if I post a model on #1175430844685484042 , and Im not sure which epoch.pth to share, could I maybe share a drive link on the post to a folder with all the different .pth files I downloaded? (it would be 9 different pth files for one model).
heyy i have a question about weights ai, i might purchase their subscription and have a couple of queries
can i train 2 models on a subscription?
and is there a hindi feature on tts?
you can make as many voice models as you want on weights without the need of a subscription, and if you want to do a "premium training" (a better sounding model), you can get a free one by getting a 5 day streak on the site (making at least one creation, like a cover, per day).
Guys when I Download a voice model which pitch should I use on it
it doesn't say which settings are best for each model when I download it from #1175430844685484042
is it a male or female model, and is your voice high pitched or low pitched
change the pitch until it sounds like the model
Oh okay
Is there anything else that I need to change other than the pitch
my Voice is mid pitch not high not low its in the middle
Its Adele's model
not really, just experiment with every model
most models are good at pitch 0
Perfect got it appreciate you 🙂
np ^^
NVIDIA
Full name
NVIDIA Geforce RTX 4050 Laptop GPU
Download virtual cable and the first nvidia link you see
the nvidia-b2332 one
And follow rest of the guide, write a message here if something doesnt work
Ignore mac, linux, online steps ofc
💀
im a bot mate
ok
thanks for spoon feeding me im sorry lol
alr what do i do with the file now
oh
😭
i got it
downloading rn
it says all weights are loaded
nothing loaded
what do i do
hello?
i did this and it takes me to a website not an app
what interview?
i got some shit to attend
and i need ts to work
why cant they just make ts an app
wait do i have to use the voice changer in the website?
Why is the graph so weird?
is this normal
It's adorable
is there a way to lower the lag
i have 400 ping
4000 total
@viral mason
?
look at the ping
its super laggy
yo omfg ts is so hard to use
do i really have to use a web browser??
onx is bad why are u using onx rmvpe
i dont know
nobody helps me
im stuck
they tell me to read
like wtf??
im an idiot
same lol
pretty please?
sure yea! my dms are open
if you dont want to use the web browser you can download vonovox (nvidia only version of w-okada. you have a nvidia gpu so you are fine here) and use that instead
im getting so many things told to me, but i dont know where to get any of this
everything is in our docs https://docs.aihub.gg
you need #outdated-model-maker-role before posting models there
delete the folder where you extracted the voice changer, thats pretty much it :/ ...
im not sure which installation of real time voice changer your talking about but pretty sure that it has the python and installed libraries within the folder so everything is in there, you dont have to delete anything else, just the folder of installation, couldnt be any simpler
- close programs and background processes that may degrade the performance
- switch to high performance instead of power saving profile (assuming you're not running on the laptop battery)
- use an external cooler in case it overheats and throttles
yea, ik
when i have that, can i do the thing i said?
i have the same problem but still with it. and i dont understand why its lagging, even if GPU cant reach 10% of using
like, if the GPU was using 100% i would understand the lag
no, post only one link with the best epoch in your opinion
cooked settings?
Alguien me ayuda ("Can someone help me?")
anyone know how to fix constant voice cracking
Today morning I woke up to this error and I can't fix it
click reload
(Sorry to interrupt, but...) If I have more of a trivial question, can I ask it here?
I did it 7 times and the 8th time fixed it 😂
@unkempt turtle
how do i download the voice changer
anyone know how to fix constant voice cracking
Probably should ask the question lol
I can help you out in a couple hours probably
Try using suppression
Its a local webui, its only to display everything. Its not a website. Read the guide it mentions that in faq
To delete it fully delete the folder thats all
Crackles?
Ask instead of asking to ask
Send screenshot of the full voice changer
Guys what is the best woman sounding model there is at the moment ?
what is gemini doing
QApplication.setAttribute(Qt.ApplicationAttribute.AA_แล้วไงต่ออ่ะลScreenResolution)
QApplication.setAttribute(Qt.ApplicationAttribute.AA_EnableHighDpiScaling)
where did it get AA_แล้วไงต่ออ่ะลScreenResolution
bruh pls fix mainline
on colab i can't use it it still keep saying this site can't be reached
Last update: Oct 23, 2024
how do i use kaggle
Hey guys, i'm testing out different models and I noticed that some models take higher performance hit (requiring bit higher delay) than others, am i wrong to assume that it is because of higher sample rate? and if it is the case can it somehow be reduced, so that i dont have to increase the delay
perhaps, or it takes a bit more memory usage
if you mean RAM then it's the same from the looks of it
I saw you got helped. Next time don't ping moderators, moderators dont have to be helpers
just wait for a helper next time
I'm glad the ability to @ all helpers instead of a specific user is a thing
I'd be nervous to do it
this server is english only, speak in english and elaborate
what's ur pc gpu? which tutorial link did u use? be sure to not use video tutorials, they are old
you can either use this channel for casual questions or #1192011222023950368
elaborate, what's ur pc gpu, what do u want to do, what tutorial link did u use?
be sure to not use video tutorials
lol the tag
it won't be fixed, the creator of the colab has been busy for months
Plus applio is more suggested
hey why does the http file for the voice changer only work once, I cant open the menu again
it's another cloud service provider, the other helper literally linked you the guide to read
also, what's ur pc gpu and what do u want to do?
elaborate:
- ur pc gpu
- what do u want to do
- the tutorial link you're using
if you're using a youtube video tutorial, you're cooked, because those are old
amd 6700xt
im trying to setup the voice changer
https://www.youtube.com/watch?v=SxdnGxicJOg
the tutorial is 2 weeks old?
i just checked the tutorial, the version that he uses is over a year old...
ah ok so i did something wrong
this is the same shit u find in 1 year old plus tutorial lmao
the thing you did wrong is using video tutorials
absolutely uninstall everything off that youtube tutorial
forget it even existed
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
read up the 1st link
wokada deiteris fork is way better than the over 1 year old version of original wokada in that video tutorial
there's no up to date video tutorial
ok thank you
u can check ur gpu with task manager btw
ez
he already said his gpu
remember to uninstall everything before using the new written guide, meaning vb audio cable and the old original wokada
then read the guide and lmk
why did you rejoin in 2024 did you just take a break?
u da goat i got it working
want me to check your settings?
i cant post images
!give-media-perms 30m @digital perch
see the console window
this?
says this when i try to put my mic on input
what browser are u using
firefox
gpu: ur amd gpu
chunk: 100ms
extra: 2.7
try chrome or any other browser, be sure to give microphone perms
ill try that now thanks
this is why ur my goat, ai voice changer is working 100% no errors thank you bro
be sure to put
input: microphone
output: line 1
output: headphones (to hear urself, optional)
u can also optionally set force fp32 mode on in advanced settings for kinda better quality but kinda more delay
be sure in the discord vc or game settings to put the input as the line 1
do u need any other thing like optionally find a way to lower a bit the delay, even tho its kinda more hard
or is it all fine?
don't use onnx
which one should i use then?
Is Hina_Mod_AICoverGen a broken project at Google Colab? Does anybody know?
I suppose both should work for Nvidia gpus without issues
perhaps slight performance difference
I heard onnx of any of the models is worse, honestly not sure tho as that was said a while ago and I forgot who told me that
I need to change the voice in an mp3 file with Hina mod for Colab. It worked perfectly 2 months ago((
???
what are you trying to do, explain in words that make sense please
I might be able to help
yes
Ok
I have an mp3 file with my voice in russian. I need to change this voice to another voice model. I have the pth and index files for it. I used Hina at Google Colab for this task constantly two months ago.
Applio doesn't match my expectation because of bad pronunciation of some ukrainian letters (sounds). Hina was perfect.
But now it doesn't work, it fails with errors:
Traceback (most recent call last):
  File "/content/Hina_RVC/src/webui.py", line 8, in <module>
    import gradio as gr
ModuleNotFoundError: No module named 'gradio'
OR
Timer: 00:09:36
Traceback (most recent call last):
  File "/content/HRVC/src/webui.py", line 10, in <module>
    from main import song_cover_pipeline
  File "/content/HRVC/src/main.py", line 22, in <module>
    from rvc import Config, load_hubert, get_vc, rvc_infer
  File "/content/HRVC/src/rvc.py", line 5, in <module>
    from fairseq import checkpoint_utils
  File "/usr/local/lib/python3.11/dist-packages/fairseq/__init__.py", line 20, in <module>
    from fairseq.distributed import utils as distributed_utils
  File "/usr/local/lib/python3.11/dist-packages/fairseq/distributed/__init__.py", line 7, in <module>
    from .fully_sharded_data_parallel import (
  File "/usr/local/lib/python3.11/dist-packages/fairseq/distributed/fully_sharded_data_parallel.py", line 10, in <module>
    from fairseq.dataclass.configs import DistributedTrainingConfig
  File "/usr/local/lib/python3.11/dist-packages/fairseq/dataclass/__init__.py", line 6, in <module>
    from .configs import FairseqDataclass
  File "/usr/local/lib/python3.11/dist-packages/fairseq/dataclass/configs.py", line 1104, in <module>
    @dataclass
     ^^^^^^^^^
  File "/usr/lib/python3.11/dataclasses.py", line 1232, in dataclass
    return wrap(cls)
           ^^^^^^^^^
  File "/usr/lib/python3.11/dataclasses.py", line 1222, in wrap
    return _process_class(cls, init, repr, eq, order, unsafe_hash,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/dataclasses.py", line 958, in _process_class
    cls_fields.append(_get_field(cls, name, type, kw_only))
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/lib/python3.11/dataclasses.py", line 815, in _get_field
    raise ValueError(f'mutable default {type(f.default)} for field '
ValueError: mutable default <class 'fairseq.dataclass.configs.CommonConfig'> for field common is not allowed: use default_factory
Timer: 00:09:37
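The last line of that traceback is a Python 3.11 behavior change: dataclasses now reject any unhashable object as a field default, and a regular dataclass instance is unhashable. A minimal sketch of the pattern and the `default_factory` form the error message asks for, using a stand-in `CommonConfig` rather than fairseq's real class:

```python
from dataclasses import dataclass, field

@dataclass
class CommonConfig:  # stand-in for illustration, not fairseq's actual class
    seed: int = 1

def build_broken():
    # Same pattern as in the traceback: a dataclass instance used as a default.
    @dataclass
    class BrokenConfig:
        common: CommonConfig = CommonConfig()
    return BrokenConfig

@dataclass
class FixedConfig:
    # default_factory builds a fresh CommonConfig for every FixedConfig instance.
    common: CommonConfig = field(default_factory=CommonConfig)

try:
    build_broken()
    print("broken pattern accepted (Python older than 3.11)")
except ValueError as exc:
    print("broken pattern rejected:", exc)

a, b = FixedConfig(), FixedConfig()
a.common.seed = 42
print(a.common.seed, b.common.seed)
```

This only illustrates why the import fails on the notebook's Python 3.11; it is not a patch for the Hina notebook itself.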
it may not work because most google collab stuff is dying
I really only know of weights.gg, applio, and mainline that could be used to do what you want
but Applio works right now. What's the difference?
it's up to date with its code, Hina might not be
rvc isn't ever perfect, you know
people mess up words all the time no matter the language even when speaking normally
Last time Hina helped me to fix this issue
so it's kinda natural
are they in this server?
is Hina in this server
Sure
it's a yes or no, do you know if they are???
I'm not sure if it was Hina. But man or woman helped me
I'm sorry I need a specialist
you can probably get better help from one of the helpers than me
there's two online rn, see if they're available
it may be contentvec limitation, or depend on the language, or try articulating better
Does anyone know how to set up something like AI-Dungeon but on my computer?
where is the download link??
Is it possible to do a mommy voice without using Okada Voice Changer?
yes, its called doing vocal training
using more than 2.7 can cause cutting issues
don't say that, he's on amd, onnx is specifically for non nvidia gpus
BRUHHHH
rmvpe_onnx, dw
it's broken and replaced, it will NEVER come back, check the #📰│dev-updates message
@unkempt vale what's ur pc gpu? what do u want to do?
Hina has been busy for months
in the guide, you have to read it
Is it possible to do a mommy voice in weights?
idk where is download link
nope, they dont offer realtime voice changing
but for normal inference you can
What would be the best pitch for me to put in?
for male +12-ish for female 0
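For reference, a tiny sketch of what those numbers mean, assuming the pitch setting counts semitones: each semitone multiplies the frequency by 2^(1/12), so +12 is exactly one octave up. The function name is made up for illustration.

```python
def pitch_ratio(semitones):
    """Frequency multiplier for a shift of the given number of semitones."""
    return 2 ** (semitones / 12)

for shift in (0, 8, 12):
    print(f"{shift:+d} semitones -> x{pitch_ratio(shift):.3f}")
```

A shift of +12 or -12 keeps every note on the same note name, so it stays in key, while something like +8 moves the melody to a different key.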
Thanks bro
np brosidion
December 7th 2024 is the latest version?
hi guys im tryna make a song w a female voice but because im a male, I need to transpose the voice up by 8 for it to be in her range so she can actually have energy while singing but that puts her vocals out of key with the beat, is there any way around this? I cant pitch up the beat cause it sounds like some garbage
how do guys make songs w female ai models in their range?
what's ur pc gpu? what do u want to do?
try just playing with the pitch
I wasn't ever told that lol
i asked you, what's your pc gpu? what do you want to do?
for calls right?
then yeah, this is the latest wokada deiteris fork for nvidia gpus
be sure to read all of https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
i think its said in https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/ along with convert to onnx
onnx is slightly worse in quality iirc but has better performance for amd
I downloaded it, just run the file?
no, be sure to read the whole guide
you need VAC lite too, it's in the virtual audio cable part of the guide
please dont skip steps, else it wont work
What should I do then?
i downloaded this
you can skip the mac/amd/intel/rtx 50 series steps, but you need to follow the rest of the guide
if you dont read it, it wont work ofc
Does anyone know an alternative to Weights images? He's hungry and overweight.
Do I just run MMVCServerSIO and set it up? Is there anything I should do when installing?
what?
Yeah, you should read the guide and do the Virtual Audio Cable part
it isn't a 1 click program
no, you need to do the virtual audio cable part
A VAC (Virtual Audio Cable) makes a fake audio device, used to re-route the audio of different programs
In Wokada context, it's used to get the output of wokada as the input in other programs
On weights.gg, you have to wait 1000 people to be able to take an image or train a model, I've still been waiting for 3 days
if you dont do it, it wont work in other programs
yes i know i have vb audio cable
and i use audio interface do i need to use server?? I know the client is better
I have the fork by deiteris, and I've been working with it for months, it just happened this morning
anyone know GOOD ai voice changers
no, vb audio cable is bad, it may give random issues on windows as users reported
uninstall it, forget everything from wokada youtube video tutorials
get vac lite from the guide, you need to read the guide to use the program
is it wokada deiteris fork b2332? also what browser do u use? what's ur pc gpu? does the issue still persist?
elaborate:
- ur pc gpu
- what u want to do? realtime for calls? pre-recorded?
rtx 4060 ti
for a stream and to troll people in game i want it to sound as realistic as possible yk i tried odaka but im having issues with it
I just asked to question because I needed to be sure I could make a less of technical question here, before I do that and someone goes like "yOu AreNt SuPpOsed To bE asKiNg thIS HeRe" or something like that.
All good, you can ask anything here
yk i tried odaka but im having issues with it
youtube tutorials are old
forget everything you get about it in a video tutorial
also, no there is no such thing as completely perfect way
can you help me its really choppy and i cant seem to set the playback
there's limits, ai can't laugh well for example
like i dont hear the voice coming through on the other side
delete everything you get off videos
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
read the 1st link, wokada deiteris fork
alright but can you also tell me how to output the audio
wdym output the audio
like i mean i dont hear it at all on the other end
say i want to use the vc on discord right how do i make it go through discord
you need to uninstall everything off youtube
then read the guide
the guide explains it
about voice models, where can i find beatrice?
Do I need a Laptop or a desktop for voice changer?
Yes it's that version and the problem still persists, and my pc gpu is: RTX 4060 (8GB)
I use Google Chrome
RTX 3050 (sorry for answering too late)
Why is training those RVC voice models so slow
i had no issues when i used it that high
What would yall say is the best realtime voice changer
Also take into account I have a RTX 2070 so no very high end ones.
Beatrice v2 models are Faster and more Lightweight than RVC v2 models, but have Less Quality and there are Less Publicly Available models
TL;DR: you shouldn't use Beatrice models, DO NOT USE VIDEO TUTORIALS FOR RVC AND WOKADA
They are old
What's your PC GPU and what do you want to do
Realtime? Of course, that's the bare minimum to even run cloud, a phone doesn't have a virtual audio cable (VAC) which is needed for it to work in other programs
Or do you mean pre-recorded?
What did you do specifically that caused this issue? Also try other browsers like firefox
Sure I think realtime
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU
- RVC-AI-Cover-Maker-UI Colab: Automatically separates the vocals and instrumentals, converts the voice and mixes it all back together
Easiest possible (automatically separates vocals & instrumentals) : weights.com & rvc-ai-cover-maker-ui
easiest cloud: Ilaria rvc zero
easiest local: Applio
Alright, you got at least any type of PC? Would be the best to have a good one
AI is an intensive task, don't expect it to be fast
It might on some models, but sure you can do whatever
I want to buy a PC but I don't know if I should get a laptop or a desktop
Hmm, so what's the matter? It takes three days on average to train such models, a waste of time
Also why does the chart look so weird
WOKADA deiteris fork
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link
3 days?
read it
Yes
Depends on the specs, generally speaking a desktop is better if it has a good GPU
rvc models take max like overnight
3 days? What's your PC GPU
some can be finished in like 5 hours depending on gpu
Hmm yes
Colab
What GPU is the best so I can get it?
Fork? I have the natural WOKADA onnxgpu-cuda where can I find fork?
What's your budget?
Hmm so I'm getting bad GPUs in Colab, got it
ah, that makes sense
I gave you the link, it's the 1st, read it
ok thx
Are you sure? Colab has 4 hours max of GPU for free daily
a amd 6700xt gpu is faster than colab
I don't have one yet
Are you using the free tier? What GPU are you using
Also, will a 2070 be able to run decently smooth on it?
Tesla T4
Yeah, just set game graphics to the lowest if u gonna game
Mmk, appreciate the help! Have a good day!!!
Then it's hard to suggest to you, I can just tell you to get a pc with an RTX 5090 (that component alone costs over 2k USD) if you want the best lol 😭
You need a budget because the top of the top is expensive yk
i mean a a100 would be better but yk
Oh yeah of course you can also suggest ai specified GPUs that cost over 10k dollars
This is why you need a budget to make a PC, there's no other limit than your money
Can't I get one for like $150?
An entire PC for 150$?
Yea
Nothing actually it just happened, I woke up to the app being no avail
Hell no, generally speaking it's probably gonna have low end specs, prob gonna be some crappy shit, at max it could run cloud (meaning using a cloud computing site for using a remote good PC with limited free time)
Are you gonna buy a PC just for realtime voice changing?
So $2,000?
Nopee there's more therefore
Yea I also want like real-time video
What about Dell Optiplex 9020 Desktop Computer PC
Hi which version should i get? https://huggingface.co/wok000/vcclient000/tree/main
I have a rtx 4060
where's RVC v2
that sounds nothing more than a raspberry pi retro rig or a low budget phone
Is that an insult orrr
unless you meant 1.5k budget
Wait is budget money?
did you stop the training and then resume later?
can someone help me plz?? why my Res is around 40000
Yes several times
It is cause i ran out of gpu
Also someone said this graph is outdated
two possibilities here:
you resumed the training with the wrong batch size, let's say you started the training with batch size 8 but then resumed with batch 16
or
the graph bugged because you stopped training before completing an epoch, in this case, it is a harmless visual bug (technically it is not a bug)
and yes that graph is outdated
I always trained w 8
I know & how do I seriously identify the real lowest point?
the lowest point of the graph is not your best epoch, that is outdated information
Really then how do I know?
you save every 10 or every 5 epochs then hear them until you find the best sounding one
So basically i can train as many as i want now
Im using a pretrain
well it's more complex than that but to make them simple, train 200 epochs and maybe save every 10 and hear them
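The "save every N epochs, then listen" advice above can be sketched as a short list of checkpoints to audition. This is only an illustration: the function name is made up, and it assumes the trainer writes a checkpoint exactly at every multiple of the save frequency, which may not match a given tool.

```python
def checkpoints_to_audition(total_epochs: int, save_every: int) -> list[int]:
    """Return the epoch numbers at which a checkpoint would be saved.

    Hypothetical helper: assumes one checkpoint at every multiple of
    `save_every`, up to and including `total_epochs`.
    """
    if total_epochs <= 0 or save_every <= 0:
        raise ValueError("total_epochs and save_every must be positive")
    return list(range(save_every, total_epochs + 1, save_every))


# 200 epochs, saving every 10, as suggested in the chat.
candidates = checkpoints_to_audition(200, 10)
print(len(candidates), "checkpoints to listen to:", candidates)
```

The list only tells you which files exist; picking the best one still comes down to listening to each.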
in my case, today i trained a model for 100 epochs because the dataset was big and i know the model will converge before 100 epochs
Does the pretrain guarantee that the model is gonna finish faster?
yes, always train with a pretrain, if you dont use one the model will sound like a robot
So nowadays its recommended
I mean it can give lazy people an area to listen to epochs in
Yes but im using one for kor. language
Original is mostly english
the model learns the new phonemes regardless of the language of the pretrain
it learns them in like 5 epochs
Yes but I have preferences
also it's a known fact klm does not produce good speech results
but if klm makes u happy go for it
How do I calculate the epoch per dataset minutes
Is there a dataset length and epoch correlation
yeah
So 30 minutes.
hmm maybe try 200 epochs and save every 10
Will it be 500 epochs
no thats too much
Thats what im doin but im at 300
sure thats fine
you know the model is overtrained when after a certain point the model starts to sound very robotic
You can check tensor board
To prevent overfitting
The graph is messy and outdated
What do you mean by outdated
there is not too much info in your dataset and you're using a pretrain model, which already contains a lot of information, you dont want to replace the pretrain knowledge with the knowledge of your 30 min dataset, the pretrain is trained with over 50 hours worth of data, you dont want to lose that
by training 500 epochs you're telling the model to forget the pretrain
thats bad
Okay then tell me how many epochs for 30 minutes.
200 and save every 10
batch 8
Then stop and do inference
stop and try the model
My saving frequency is every 10 batch size 8
no
200 is enough?
yes
Most of the models are trained on 200-250 epochs
I have the 200 epoch weights saved
Also you can resume training dw
Yes but my 30 minutes dataset is clean and high quality
Im still training going for 350
bro, it's simple, if you wanna train a super high amount of epochs, do it
its your model
Not a problem. Just try your model
Okay and
It will ruin your model
But you say 200 is enough do you mean thats cause of the pretrain
More epochs is not = to good quality
Hmm sometimes it is
well i already explained everything, you can ignore that and train the way you like, it's fine
Is it cause of the pretrain
Everyone uses a pretrain to make the training faster. Training a model without a pretrain would take days
Cause what i noticed with this pretrain is it starts sounding good at 100 already
I remember
It took longer before
Yeah you have 2 ways
- Trust us
- Look at tensor board
she's using mainline graphs so they wont help
She*
sorry, edited my message
So I suggest you to stop training and try the model
You say 200 is enough cause i used a big pretrain?
Well you can try and find it out
Uh ok
you have resumed training with different batch size than before
No i didnt
It was always 8
another possibility is you lost a tfevents file for the second training session
Could be
assuming you have really run three/more training sessions
@brittle wing how many steps per epoch you got?
How do I understand that.
Formula
4345 steps
And
okay, so 90 steps per epoch, okay
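For anyone asking about the formula: a rough sketch of how steps, epochs and batch size relate. The chat only gives 4345 total steps, 90 steps per epoch and batch size 8. The 720-segment dataset below is a hypothetical figure chosen so the numbers line up, and rounding up the last partial batch is an assumption about the trainer.

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """One step per batch; assumes the last partial batch still counts."""
    return math.ceil(num_segments / batch_size)


def epochs_logged(total_steps: int, steps_in_one_epoch: int) -> float:
    """How many epochs a given step count corresponds to."""
    return total_steps / steps_in_one_epoch


# Hypothetical dataset of 720 audio segments, batch size 8 as in the chat.
per_epoch = steps_per_epoch(720, 8)
print("steps per epoch:", per_epoch)

# 4345 logged steps at 90 steps per epoch.
print("epochs covered by the log:", round(epochs_logged(4345, 90), 1))
```

If the epoch count derived from the log is far below the epochs actually trained, that would fit the earlier guess that a log file from one of the sessions is missing.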
WOW WAIT
By listening?
I know
can you like not do it from a phone with tiny screen?
did it save the actual model, not g/d
I don't have another device
The pth yes
okay then
you can only find out by running a bunch of tests and listening which one is good and which one is bad
Okay.
450e seems unnecessarily too much
What if it sound normal at 350
How do you know?
Im all ears
Some ppl train 16 mins on 500
Too much
how do you know?
You. Can. Only. Know. By. Listening. The. Results.
Okay
My GPU is 10000000000 /j
Still seeking for help. We already told you 
i dmed emojikage and got no response, is there anyone else i can contact to help? i want to edit the source code and run the gpu-enabled deiteris fork locally, but due to a lot of technical reasons that ppl might be able to help with, i cant
i want to continue the deiteris fork project but forking the repo doesnt let you build gpu compatible versions for some reason
I'm not sure that's entirely true... where is the data showing that it performs poorly in speech tasks?
Are there any models from Google that are faster and smarter than gpt 4.1 mini?
you don't
if you want to use for gaming, you'd want to upgrade your gpu vram
even many modern games demand more than 8 GB vram
If what you're saying is true, then any models using the og pretrained model shouldn't sound robotic. but clearly, most of the models currently uploaded don't meet that standard, which means they aren't uniformly high quality. that suggests the bigger issue lies in the dataset you're using. I just don't want the blame to keep being shifted entirely onto the pretrained model.
even if you train a model without using any pretrained models at all, the results will still vary depending on your dataset. sound is extremely subjective; what sounds good or bad can differ from person to person. so there's no absolute answer. on top of that, everyone uses different hardware, which makes it even harder to define what something "sounds like."
all of the models I've created were trained using KLM. If the pretrained model were solely responsible for robotic artifacts, then all of my models should consistently show the same issues. but unfortunately, the differences always depend on the dataset I'm training with. even as the person who created KLM, I can't definitively say where the problem lies. so seeing people confidently claim what's "wrong" as if it's a fact feels a bit frustrating.
the raspiness of the voice changer keeps coming back every now and then, what's the cause of this? do i need to find a better voice model or do i reinstall VAC lite?
whym? it runs locally
Guys, can i ask if there is a big difference between 32k and 48k

I mean the quality
for speech use 32k
Ah okay, because i saw on vonovox, it has 48k option only

I have no idea if i use 32k, it gonna be worse or not
Thank you. Let me try training in 32k
I'm not sure whether it re-samples audio or it is just the audio output setting
you can use any model, I think
for voice changer and most practical use cases that don't demand audio quality/fidelity, 32k is enough
for model making, keep in mind of this
normal speech recordings usually don't have content above 16 kHz.. sometimes not even above 12 kHz
thus x2 SR for the model: 32k and 24k
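The "x2" rule above is the usual Nyquist reasoning: to capture content up to some frequency, the sample rate has to be at least twice that frequency. A minimal sketch, with a made-up function name, using the two figures from the chat:

```python
def min_sample_rate_hz(max_content_hz: float) -> float:
    """Lowest sample rate that can represent content up to max_content_hz.

    Nyquist criterion: the sample rate must be at least twice the highest
    frequency present in the recording.
    """
    if max_content_hz <= 0:
        raise ValueError("max_content_hz must be positive")
    return 2 * max_content_hz


for top_hz in (16_000, 12_000):
    print(f"content up to {top_hz} Hz -> needs at least {min_sample_rate_hz(top_hz):.0f} Hz")
```

So a 48k model mostly spends capacity on a range that typical speech recordings don't fill, which is why 32k is suggested for speech.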
I tried googling it's specs and it seems to not have an integrated GPU, so no
None of this, this is original WOKADA
You want a realtime voice changer for calls?
What's your PC GPU? What do you want to do?
elaborate:
- your PC GPU
- what you want to do
- the issue
- what tutorial link are you using
Show your WOKADA settings
there is also a version for spin embedder
but not so recommended unless you're interested on testing and contributing
im trying to find the name for this ai txt to speech model name can someone dm me so i can dm them the mp3 ??
its a very common one
Sounds overtrained at 350, should i go for 300
Yes tell em
Okay and it was right
Like 350 epochs sounded robotic as hell
Now 300 are okay but moderate robotic
So 250 maybe will do
Can i DM someone so they tell me the best checkpoint?
Ah, i tried using this. But the outcome is a bit, not as good
Since i heard breaking and cracking, a bit robotic too

I will try KLM 6.3 for spin
if you don't find it good using default pretrain as well, more likely the dataset issue
I mean, default is fine, just default is mainly English, so when it comes to Chinese or Vietnamese. It's a bit
Mispronounce ?
Like can't spell certain words
With spin, that issue has decreased

But still exist
How come the 200 epochs model checkpoint sounds the best 😭
Just asking
Should I go for that one...
do you guys know where to get voice models for german females?
they sound robotic because ppl overtrain their models training it for like 500 epochs
i can run another test run today
but i said that because ive seen a couple of people claiming klm made their speech models sound a bit metallic
as for me i also remember experiencing that, but i can confirm that today
Look at what I asked pls
no
How come my 200 epochs checkpoint sounds the best
we told you and you did not listen
i literally told you yesterday why training 500e is a bad idea
today you're seeing what happens when u overtrain the model and are surprised

But the 200 epochs chkpt is the best.
Well after listening yesss also it was today morning in my case...
Timezone
ah I see I see.
I checked the checkpoints below 350 & I understand
yup gonna ping you after the comparison is done
Okay I'm gonna choose the 200 epochs checkpoint and shut up
And apply for model maker like...it's time
good luck!
THANKS
because i dont find like german female voices in the voice models category.. could someone help me?
There's no german tag so good luck
how to use weights.gg?
Are you using it on mobile or PC?
Yes you can, what are you trying to do?
how can i get the model and index for the voice changer?
because I saw that you can search for models and I got one interesting, so I want to use it but I do not know how 😦
When you click on a voice model next to the use button should be three dots, click them and there's a download button
If u need any more help finding it I could send you a video of what to do
Thank you so much I got it. Appreciate it 🙂
You're welcome
how do i fix my mic either lagging or talking with 5-10 sec delay?
i don't understand allat chunks extras stuff
There's always going to be a small delay, but chunk at 360.0 is pretty good and extra at 2.7
i have a problem though, my chunks don't show me values like 360, but stuff like 38400 [0.8 sec]
same with extra
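A value like "38400 [0.8 sec]" looks like a chunk size in audio samples with its duration next to it. A small sketch of the conversion, assuming a 48000 Hz sample rate (an inference from 38400 / 0.8, not something stated in the chat):

```python
def chunk_seconds(chunk_samples: int, sample_rate_hz: int = 48_000) -> float:
    """Convert a chunk size in samples to seconds.

    The 48 kHz default is an assumption inferred from the
    "38400 [0.8 sec]" label quoted in the chat.
    """
    if sample_rate_hz <= 0:
        raise ValueError("sample_rate_hz must be positive")
    return chunk_samples / sample_rate_hz


print(f"38400 samples = {chunk_seconds(38400):.2f} s of audio per chunk")
```

Each chunk has to be filled before it can be converted, so a larger chunk means a longer minimum delay, and processing time comes on top of that.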
lagging actually got fixed, now it's just a delay
i've seen some people that have it adjusted either without delay or just like 1 sec delay, which isn't too bad
uninstall what you've got and instead download this: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
Last update: May 5, 2025
I think you're using an old version
Download the Lyery
The one he sent
should i use the official or deiteris fork?
the official is slower
fork is faster but it opens in your browser (which is actually better because w-okada GUI uses more resources)
could we go to DMs? i have few questions and need to upload a screenshot, which i can't do here
sorry i don't accept dms, but please try to read the guide, everything you need to know it's there, trust me
is a 5 min read
it's about downloading, i'm using AMD, should i use the ones that say DML or cuda in the end?
i got confused on that question
@low shard can you please give clay image perms pls?
is your gpu nvidia or amd?
AMD
download DML
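The choice being made here can be written as a tiny lookup. This is only an illustration of the rule of thumb from the exchange above (AMD gets DML, and the "cuda" suffix is taken to be the NVIDIA build). The dictionary and function are hypothetical and not part of any installer.

```python
# Rule of thumb from the chat: the download's suffix depends on GPU vendor.
# Only the two vendors discussed are covered; anything else is left open.
BUILD_BY_VENDOR = {
    "nvidia": "cuda",
    "amd": "dml",
}


def pick_build(gpu_vendor: str) -> str:
    """Return the download suffix for a GPU vendor (hypothetical helper)."""
    key = gpu_vendor.strip().lower()
    if key not in BUILD_BY_VENDOR:
        raise ValueError(f"no suggestion for vendor {gpu_vendor!r}, check the guide")
    return BUILD_BY_VENDOR[key]


print(pick_build("AMD"))     # dml
print(pick_build("Nvidia"))  # cuda
```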
alright, got it, thank you ❤️
if i'll have any more questions that won't be in the guide, i'll make sure to tell you ( if you don't mind, ofc )
sure no prob, i'll be here
if i dont answer im busy
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.com/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- make it yourself with our docs guides
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
okay, it's fine. The quality is improving.
Alr
!give-media-perms 1h @fluid briar
there is an interesting pretrain for chinese though

it exists ?
noice
anybody know how to fixx when i load a backup and get this error
well, you did not load it it seems
i did
do i load it after starting applio or before
i did it before
and it said it was successful
but let me try again with klm6.1v3 first, since Seoul Streaming Station trained it in Korean, i think it might help me out
those are languages used there
but isn't it already 476? .___. i thought we don't need to care about config
default is 109
how did you find this lol?
Noobies, can u help someone who's kinda like annoying me about some issue I have no idea how to fix, it's @solid vortex they desperately need help
I'll let them explain
now that you mention it
he pretty much know a lot
but i have no idea how he knows
some goof came to Applio's github and demanded to add a revolutionary new architecture.
turns out it was just a pretrain with a lot of chinese
did not even use chinese embedder, I think?
but sir, it's 476 by default
have you tried using their rvc version?
lmao


