# How to hear myself


lucid python
#

GPU: AMD Radeon RX 5700 XT.

Operating system: Windows 11.

I can't hear what I say at all. I followed the W-Okada guide on Discord as best I could and I'm still not able to hear my audio.

calm knoll
lucid python
#

still wont work

#

and monitor is my headphones

calm knoll
#

What about your full screenshot?

lucid python
calm knoll
#

The program is working; look at the performance stats in the top-left corner of the screen. What do you mean you can't hear the program?

lucid python
#

it just doesn't work, how can i show you

#

is it supposed to be working by the stats?

#

i switched to server and now i can hear it but it has bad quality

calm knoll
#

Set F0 to rmvpe_onnx, not something else.

lucid python
#

it works in server kinda, client doesn't tho, what's the difference between server and client

#

do you have any advice to make it sound good or should i test it myself

#

it sounds a bit robotic and i get a little bit of echo

lucid python
#

can i keep both versions or do i have to delete the current one

calm knoll
#

Extract the newer one to a different folder and keep the older one, just in case the newer one fails.

lucid python
#

I'll test it in a few hours, if it goes well I'll just mark this as solved

#

By the way, does a better mic improve the voice quality?

calm knoll
#

That's one myth about the voice changer. A better microphone doesn't improve the quality of the generated voice audio, though a better-working microphone can sometimes suppress a bit of the background noise that reaches it, regardless of the microphone's price.

lucid python
#

So it's more important to look for noise-suppressing microphones with alright quality instead of a mic with insane quality?

#

Though the latter may also have nice noise suppression

calm knoll
#

A physical microphone doesn't have a noise suppression feature of its own; noise suppression and echo cancellation are usually done through software-level settings. Some slightly more expensive dedicated microphones have a built-in AI noise-removal processing unit, but as I said, these devices cost more than the non-AI ones.

lucid python
#

Do you have any advice for people with noisy mics? I feel like my mic doesn't have good noise suppression

#

Or how should I manage this issue

calm knoll
#

By the way, audio sounding robotic sometimes has to do with the voice model itself, even if the extra value is set to the recommended 2.7 s on the W-Okada fork. That doesn't always mean the voice model was badly trained, though I once heard that "low" dataset audio in RVC voice model training might be the issue.

lucid python
#

Oh maybe, but if that's the issue I'll end up using others, right now I just downloaded one for testing, but ty, I'll try the newer version in some hours though client probably still won't work

#

Is server better than client or what's the difference?

calm knoll
#

Audio modes in the voice changer: "Client" is simpler and lets you use the voice changer's noise suppression and echo cancellation options, although it generally has higher audio latency because it mainly uses the older "MME" audio API. "Server" is more complex but flexible and lets you pick any sample rate and audio API (MME, WASAPI, ASIO), although the program sometimes fails if it is set to WASAPI with a mismatched sample rate.

#

When you set the audio mode to "server", these noise/echo suppression options will be greyed out and unavailable. This is a known quirk in every W-Okada version.

lucid python
#

should i add both the index and the pth if a model has both

rigid meteor
#

The index changes how the voice's speech sounds and how much this blends with your own voice.
It is optional.
Index files tend to be large, and if you're going to use them you may need to adjust the chunk size to compensate, since it takes more time to process the voice with the index enabled.
Only the pth file is required.

lucid python
#

it runs well but now when i talk it cuts off the last second of the phrase, or sounds robotic at the end

#

a lot of the time thats happening

#

nvm its robotic

rigid meteor
#

If you changed the extra, chunk size or crossfade, you should swap your processor once to cpu, and then back to gpu0 to fix a bug

#

that should help with lagginess, choppiness and huge amount of processor usage

#

for robotic sounds... idk, it could be the model, it could be a filter, it could be software
you might be able to fix some of it by speaking more loudly into the microphone or adjusting the input volume
you can try to fix some of it by adjusting extra or crossfade, but don't forget to swap processor (cpu and then back to gpu)

#

it might help to lower protect to 0.30 or lower

#

but do also check your microphone settings; it's possible it's caused by some setting.

calm knoll
calm knoll
# lucid python nvm its robotic

There are some simple workarounds, if you still care about 100% perfect audio quality. Click "stop server" first, go to advanced settings, set "Crossfade overlap" to 0.15 and set "Force fp32".
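
For context on what "Crossfade overlap" controls: the converted audio comes out in chunks, and neighbouring chunks are blended over a short overlap so the seam is less audible. A minimal Python sketch of that idea, assuming a cosine-squared fade and a 48 kHz stream (the function name, the fade curve and the sample rate are illustrative assumptions, not taken from W-Okada's code):

```python
import math

def crossfade(prev_tail, next_head):
    """Blend two equally long overlapping segments into one."""
    if len(prev_tail) != len(next_head):
        raise ValueError("overlap segments must have the same length")
    n = len(prev_tail)
    out = []
    for i in range(n):
        # Fade-in weight rises from near 0 to near 1 across the overlap.
        fade_in = math.sin(0.5 * math.pi * (i + 0.5) / n) ** 2
        fade_out = 1.0 - fade_in
        out.append(prev_tail[i] * fade_out + next_head[i] * fade_in)
    return out

SAMPLE_RATE = 48000       # assumed stream rate, not a W-Okada default
OVERLAP_SECONDS = 0.15    # the value suggested above
overlap_samples = round(SAMPLE_RATE * OVERLAP_SECONDS)
print(overlap_samples)    # 7200 samples are blended at each seam
```

A longer overlap gives the blend more samples to hide a mismatch between two chunks, at the cost of a bit more processing per chunk.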

lucid python
#

Ty I'll try this, by the way does internet connection matter at all?

lucid python
calm knoll
# lucid python Ty I'll try this, by the way does internet connection matter at all?

When you run the W-Okada voice changer locally, an internet connection is **required** to download some files, while most of the actual functions (like converting your voice in realtime) generally don't need internet. The online cloud options (like Colab and Kaggle), however, require internet for everything, not just for the voice changer that runs within the service.

#

As I implied, the internet has nothing to do with the audio quality. Aside from simple settings and the RVC voice model, how do you know if something else has to do with the audio quality in the voice changer?

lucid python
#

If so, maybe the mic in my headphones is kinda mid

#

Should I record and send it here?

#

Like an .mp3, .wav or wtv

calm knoll
#

Still believing like that?

lucid python
#

Wdym

calm knoll
#

Do you still think your microphone is the issue in all of this? We've had a discussion earlier.

lucid python
#

Yes I remember, I thought it kinda influenced the input but if you say so I'll stop focusing on my mic, though should I send a test audio? Reading whatever?

#

Maybe I'm tripping and it's not that bad but im not sure I'm not an expert

rigid meteor
#

It could help to hear what exactly you mean by robotic noise.
The voice changer should only be reacting to voice though, not to noise. I'm not sure how your microphone by itself would be causing robotic sound. It could be caused by background noises instead, and I noticed some lower quality models might put out robotic-like sound with certain sounds, or at random intervals when using the index.

#

Some people speak really softly; whispering I mean. Some models cannot handle whispers and then produce robotic-like sound in their place. That is why I mentioned that speaking louder might help.

dense pulsar
#

a better microphone improves the quality of the model due to the input being resampled to 16k

if your microphone sounds muddy and bad at that sr then the model will struggle and have different problems like bad pronunciation, robotic sound, etc

but if your microphone is clean and high quality then the model (more specifically, the embedder) is going to have an easier job translating your voice to the model's voice, giving better results

some models are just bad and robotic tho, that cannot be fixed
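
To put a number on the 16k point: a signal sampled at 16 kHz can only carry frequencies up to half that rate (the Nyquist limit), so whatever the microphone captures above 8 kHz never reaches the model. A quick sketch of the arithmetic, assuming a 48 kHz microphone stream (the 48 kHz figure is only an assumption for illustration):

```python
def nyquist_hz(sample_rate_hz):
    """Highest frequency a given sample rate can represent."""
    return sample_rate_hz / 2

MIC_RATE = 48000      # assumed microphone rate
MODEL_RATE = 16000    # rate the input is resampled to, per the message above

print(nyquist_hz(MIC_RATE))    # 24000.0
print(nyquist_hz(MODEL_RATE))  # 8000.0
print(MIC_RATE // MODEL_RATE)  # 3 microphone samples per resampled sample
```

In other words, the part of the microphone's signal that matters here is the band below about 8 kHz.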

#

ah and the index file is just a file (xD) that stores the accent found in the dataset

index at 0 makes the model use the accent/pronunciation of the source audio (your voice)

if you set a value higher than that the model is going to blend the index file pronunciation into the result

#

but in realtime i've heard it screws up pronunciation most of the time, it's more useful in non-realtime inference
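
The blending described above can be pictured as a weighted average between the features taken from your own voice and the ones looked up from the index. A minimal sketch, assuming plain Python lists stand in for feature vectors (the function and variable names are hypothetical, and the real voice changer works on much larger tensors):

```python
def blend_features(source_feats, index_feats, index_ratio):
    """Mix source features with index features; ratio 0 keeps the source only."""
    if not 0.0 <= index_ratio <= 1.0:
        raise ValueError("index_ratio must be between 0 and 1")
    return [
        index_ratio * idx + (1.0 - index_ratio) * src
        for src, idx in zip(source_feats, index_feats)
    ]

source = [0.0, 2.0]   # made-up features from your own voice
index = [1.0, 4.0]    # made-up features retrieved from the index file

print(blend_features(source, index, 0.0))  # [0.0, 2.0] -> source pronunciation only
print(blend_features(source, index, 0.5))  # [0.5, 3.0] -> halfway blend
```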

lucid python
rigid meteor
#

It does sound like an issue with the model; for some reason it seems to have trouble determining the vocal pitch.
Can you try a different voice model and see if you still get this odd effect?

lucid python
#

This is the only girl voice I've seen that is good

#

I feel like I found the solution, partly at least

#

I have to talk more clearly and calmly

#

If I speak too quick or too high or low it glitches

lucid python
#

Not too loud just not whispering or speaking unclearly

rigid meteor
#

One tip would be: try to sit in front of the microphone, with your mouth I mean,
and figure out from what angle the sound enters it best; some microphones have an odd angle. The back side will be less loud than the front.
You should be able to be heard clearly at about a fist's distance (2 to 6 inches).

With a headset, keep the microphone very close to your cheek, at the corner of your mouth.
Also, try adjusting the microphone volume or gain if you can (on the hardware), if not you can increase the input volume on the voice changer, or use effects like the audio compressor to force it to take your voice in louder.

lucid python
#

What are voice effects

#

They're useful?

#

I thought they were a random feature

rigid meteor
#

Personally I prefer to use OBS's filters, which do the same thing.
Voice Effects are basically adjustments to the audio received or sent.
Gain, for example, makes you louder (or less loud if you use negative decibels).
A limiter prevents your loudness from going over a certain value.
A compressor effect is basically a conditional effect, which causes audio to be pushed less loud or louder if audio is detected at a specific decibel level.
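
A rough Python sketch of what these three effects do to the signal, under simplifying assumptions: samples are floats in the range -1.0 to 1.0, the limiter is a plain hard clip (the crudest form of limiting), and the compressor is shown only as its static level curve without attack or release. All function names are made up for illustration and are not the voice changer's or OBS's API.

```python
def db_to_linear(db):
    """Convert a decibel value to a linear amplitude factor."""
    return 10 ** (db / 20)

def apply_gain(samples, gain_db):
    """Positive dB makes the audio louder, negative dB makes it quieter."""
    factor = db_to_linear(gain_db)
    return [s * factor for s in samples]

def hard_limit(samples, ceiling_db):
    """Keep every sample within the ceiling (0 dB means full scale)."""
    ceiling = db_to_linear(ceiling_db)
    return [max(-ceiling, min(ceiling, s)) for s in samples]

def compress_level(level_db, threshold_db, ratio):
    """Levels above the threshold are scaled down by the ratio; others pass through."""
    if level_db <= threshold_db:
        return level_db
    return threshold_db + (level_db - threshold_db) / ratio

print(hard_limit([2.0, -2.0, 0.1], 0.0))   # [1.0, -1.0, 0.1]
print(compress_level(-10.0, -20.0, 4.0))   # -17.5 (10 dB over becomes 2.5 dB over)
print(compress_level(-30.0, -20.0, 4.0))   # -30.0 (below threshold, untouched)
```

The "conditional" part of a compressor is the threshold check: nothing happens to audio below it, and only the part above it is turned down.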

#

In the past these types of 'effects' were only done by people through expensive hardware, or through expensive VST plugins.

#

or by programmers, if they know their way around complicated tools that adjust the audio driver directly (well almost)

lucid python
#

Are these placed like along with the chunk etc settings or where are they located visually

#

Within the menu

rigid meteor
#

Scroll down

calm knoll
lucid python
#

where should i put this, output or input

calm knoll
#

Output.

lucid python
#

is there any recommended settings for compressor

calm knoll
#

While I know how the "compressor" effect works in audio engineering, I'm not sure how to explain this one.

lucid python
#

also how can i remove the effect, it won't let me

#

just in case

calm knoll
lucid python
#

it's like merging with the ui

calm knoll
lucid python
#

oh yh fixed now

#

is this a common issue, or why can't the ai say hello? are some specific words just harder?

calm knoll
lucid python
#

my bad

rigid meteor
#

If you feel like you sound too soft, you can also use an expander, but be a bit careful where you place the threshold when using both of these. Your audio only needs to be somewhat more equal, not completely equalized. If everything has the same loudness then you won't sound natural.
EQ can help with things as well. Human voices have by default higher midtones than low and high ones; you can bring them closer to each other by adding or removing some volume (decibels).

#

A good Equalizer and Compressor setting will make you sound as if you're talking in a studio, but you'll need to find the correct settings for each voice. Unfortunately Audio Effects are shared, which is a flaw in tg-develop's design.

rigid meteor
warped mulch
#

can someone help i got a good gpu

#

someone help

calm knoll
lucid python
calm knoll