#Refining/adjusting voice models
can you share a screenshot of the entire program settings you're using?
yes
I will be able to provide screenshots in about 4 hours. Like I said, I am using wokada, but I did see that the guide you posted had vonovox as an option, and I do have a 5070 Ti. How does vonovox stack up against wokada?
Currently I have formant shift set to 0 but I have been playing with it to see if it comes out better.
uninstall vb audio cable
get vac lite https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#virtual-audio-cable
Last update: July 30, 2025
set extra to 2.7 if you don't want to risk any cutoff issues on some models (it's a bug present in the wokada deiteris fork)
set chunk to 200
on the wokada deiteris fork, you can **optionally** use more advanced settings for extra benefits:
- Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
- Advanced Settings -> Disable JIT compilation: off for faster program loading, on for slightly better performance (10-15 ms, Nvidia only)
- Advanced Settings -> Crossfade Length: Controls how smoothly the AI stitches the processed parts ("chunks") of your voice back together. 0.1 for the fastest voice, 0.15 for improved quality at the cost of ~50 ms more delay
- Reduce the delay on Windows via the WASAPI / ASIO guide
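To picture what the crossfade setting does, here is a rough sketch in Python. It is not the fork's actual code: the function names, the 48 kHz sample rate, and the squared-cosine fade curve are all assumptions for illustration. It only shows the general idea of blending the end of one chunk into the start of the next, and why going from 0.1 to 0.15 costs about 50 ms if the value is read as seconds.

```python
import math

SAMPLE_RATE = 48_000  # assumed sample rate, not taken from the program


def crossfade_samples(crossfade_seconds: float, sample_rate: int) -> int:
    """Number of samples covered by a crossfade window of the given length."""
    return round(crossfade_seconds * sample_rate)


def stitch(prev_chunk, next_chunk, overlap):
    """Join two chunks, blending the last `overlap` samples of prev_chunk
    with the first `overlap` samples of next_chunk (hypothetical helper)."""
    if overlap == 0:
        return list(prev_chunk) + list(next_chunk)
    if overlap > min(len(prev_chunk), len(next_chunk)):
        raise ValueError("overlap is longer than one of the chunks")
    head = list(prev_chunk[:-overlap])   # untouched part of the old chunk
    tail = list(next_chunk[overlap:])    # untouched part of the new chunk
    start = len(prev_chunk) - overlap
    blended = []
    for i in range(overlap):
        # Squared-cosine fade: the two weights always sum to 1,
        # so a steady signal keeps its level across the seam.
        t = (i + 0.5) / overlap
        fade_in = math.sin(0.5 * math.pi * t) ** 2
        fade_out = 1.0 - fade_in
        blended.append(prev_chunk[start + i] * fade_out + next_chunk[i] * fade_in)
    return head + blended + tail


fast = crossfade_samples(0.10, SAMPLE_RATE)
smooth = crossfade_samples(0.15, SAMPLE_RATE)
extra_delay_ms = (smooth - fast) * 1000 / SAMPLE_RATE
print(f"0.10 -> {fast} samples, 0.15 -> {smooth} samples, "
      f"difference = {extra_delay_ms:.0f} ms")
```

A longer window gives the blend more room to hide the seam between chunks, but the program has to hold on to more audio before it can output it, which is where the added delay comes from.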
I'll try that now. I was playing with vonovox too and trying out some of those settings, and I think I'm dialing into what I like.
Would you have any suggestions on getting an audio clip of a laugh to convert right? Even not in real time, just a recording if possible, or is that purely dependent on the trained model?
any suggestions on getting an audio clip of a laugh to convert right
Nope, RVC can't always do a super realistic laugh. If the model was trained on some specific laughs it might help, but RVC is limited on non-speech sounds.
@hallow hazel soo, is this solved?
Sorry yes!