#help with getting started
1 messages · Page 1 of 1 (latest)
oh okay is there a tutorial or some special way i should uninstall wokada ? i just uninstalled vbcable through the installer app
im mtf trans and just want to sound more feminine so i dont get slurs thrown at me every time i turn on my mic in a game lol
that's great then, I just asked because some have been caught doing catfishing, which is not allowed and can lead to illegal stuff
yeah im not surprised
you just delete the old original wokada folder and uninstall the vb audio cable from the windows app settings
-realtime
Guides for Programs that use RVC Models in Realtime for Calls/Games
Most suggested. GUIDE
ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE
Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated
1st link, here you will get wokada deiteris fork and VAC Lite
ofcourse we don't help and promote ai for bad purposes lol, you should be shocked how many think it's a "flex" 😭 (ofc they get banned)
oh wow this is a good step by step ty!! i will follow and let you know if i run into any issues~~
they think catfishing is a flex? LMAO
alr, lmk
yeah 💀
wait do i need to restart after installing vac
You don't have to
ok i got it! do i just choose any of the models in the voice-models channel
does the higher epoch ones mean its more accurate?
nope
uhh it's hard to explain the concept of epochs, but, no, more doesn't always equal better
there are multiple factors really
they put the epoch amount in the title because for some reason, it is mandatory to do so (it is against the server rules not to put the number of epochs), idk why such rule exists lmao
epochs are a unit of measuring the training cycles of the AI model
basically the amount of times the model went over its dataset and learned from it
they don't mean how good is the model, it's just an info provided on how they trained the model by the model maker
More ≠ better
Less ≠ better
There's no way to determinate how good the RVC model is until you try it out or listen to the audio samples if there are
ohhhh okay let me try a few then !
its pretty good for the most part but sometimes its kinda crackly and i cant tell if it's my mic or if its the model
Show a screenshot of your wokada
share an audio so we can hear whats the issue
on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:
- Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Reduce the delay via: https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#reduce-more-delay
Last update: July 11, 2025
What time is it for u
It's 2 am here 😭
oh i havent used it in discord yet i just did the recording from the wokada app itself
ok ill send over a recording in a bit
same 😄
oh okay
wait sorry dumb question but is it not better if it's trained more?
oh wait that one sounds so much more natural !! how do i download it?
model was probably trained using cringe asmr mommy type data
nothing can be done, the model is fcked up
i dont share my models, but a rvc model should sound like that
they have the "rvc" tag
models have an easier life if they're cloning things similar to their voices
male to male conversions = almost perfect
male to female = weird
if they sound like a smoker is because the model is overfitted (having a strong bias to do X sound, in rvc models case) with that particular sound
and nop, this can't be fixed (well it can be fixed if the model maker retrains the dataset, adding good diverse data and removing trash data)
ohhhhhh thats so interesting im super new to this world so good to know
is it hard to make your own model?
nope but to be honest, it can be a bit annoying and time consuming
making models of your favorite game character is for the most part, very easy, because voice lines sound good and are already cleaned by the dev team, so you really dont have to do anything besides clicking the training button
in the other hand, realtime models can be a bit challenging if you just started making models, since you gotta do the audio cleaning process yourself, and for realistic results rvc needs a lot of data (minimum 2 hours)
it also needs to be good data, for example, asmr or soft talk data is horrible for models, but natural speech is perfect
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
you need a gpu that at least have 8 gb of vram
sure! good luck, any questions dont be afraid of asking them in the #✨│ai-help channel, im most of the time there helping new model makers
