#hear myself

1 messages · Page 1 of 1 (latest)

quaint moat
#

why tf can i hear myself?

fickle flicker
fickle flicker
#

Any progress? The original one and the better one are not the same.

quaint moat
#

Cant i use the orginal?

fickle flicker
#

Don't use the orignal one. It's outdated. Otherwise you can stay for original W-Okada for much slower performance and bugged RVC voice model selecting.

quaint moat
#

than help me get the other one

fickle flicker
quaint moat
#

U want me to downbload this thing?

#

or this?

fickle flicker
# quaint moat

Oh mate, I just highlighted the text where to click in my screenshot. It's in "Download NVIDIA on Windows" part, not "NVIDIA RTX 5000-series" which is for NVIDIA GeForce RTX 50 series GPU.

quaint moat
#

uwu

fickle flicker
quaint moat
#

?

fickle flicker
fickle flicker
quaint moat
fickle flicker
# quaint moat

Go into MMVCServerSIO, and you'll see a .exe file named "MMVCServerSIO". That's the actual W-Okada program. Double click on the program to run.

quaint moat
#

time to wait...

#

what now?

fickle flicker
#

F0 Det: select rmvpe
GPU: NVIDIA GeForce RTX 3060
Chunk: aronud 60 - 90 ms
Extra should always be 2.7 s
Input: your microphone
Output: if you have installed VAC lite, select "Line 1 (Virtual Audio Cable)"
If you wanna hear what W-Okada is outputting, you can set Monitor to your main speakers/headphones.

quaint moat
#

Chunk: aronud 60 - 90 ms
This do so my sound quality is so bad 😢

fickle flicker
#

Sometimes, I'm at lost when someone asks me for every step by step, because that would make a person won't be able to think by themselves but relying other people for everything. bocchi

fickle flicker
#

If you still hear low quality audio even if you set Extra to 2.7 s, or you just haven't upload any voice model there, try a better model from #1175430844685484042.

quaint moat
#

try to take it fully down

#

and try to talk

fickle flicker
#

For example, if you say some words into W-Okada, but then W-Okada outputs your converted voice 6 seconds after, it's considered a delay.

quaint moat
#

ik it make the delay, but how more u get, how better is the quality of the audio there will get out

grave apex
#

in higher chunks the model retains more context from the audio, so the result is more stable

fickle flicker
fickle flicker
grave apex
#

extra iirc are extra chunks to help retain context even more, so the quality increases

#

i cant remember 100% how extra works

#

but has to be something related to context, rvc is context based so the more u have, the better the results

fickle flicker
#

If you confused about something about W-Okada, you can ask. You didn't have to tell me I was saying nonsense all the time and go do some more research. YuukaErm

grave apex
#

2.7s = keeps 2.7s of prior context

#

max is 5s

fickle flicker
#

While it's possible to set extra more than 2.7 s up to 5 s, it does improve the quality more but also result in audio cutting off a lot, so 2.7 s is best overalls. Extra does make delay when number is higher, but it also depends on your GPU. Some reported that extra 2.7 s caused delay and even unstable audio for their GTX 10 series GPUs. There's another guide about fork W-Okada all detailed there. https://rentry.co/ForkVoiceChangerGuide#settings

solid spindle
#

as beyond it doesn't give noticable quality gain against the performance load