#✨│ai-help

1 messages · Page 279 of 1

sand forge
#

thats my mic

#

right

#

its done?

prime badge
#

With Deiteris' W Okada Fork or Vonovox is there any way to slow down the voice?

viral mason
#

@simple ore I have this error in the kaggle applio space, is there a new link or method to get it working

#

currently using the 3.4.0 branch

#

I remember someone saying to update applio but I don't remember who

simple ore
#

looks like you cloned the latest code?

#

not 3.4.0 branch

viral mason
#

I have, I've used the method you said to use

#

it's the third option right?

#

I've tried it twice now and it gives that error at the third cell

simple ore
#

this seems right

#

and should be cloning the branch without realtime

viral mason
#

which is that

simple ore
#

anyway, make a new cell

#

paste !sudo apt-get update && sudo apt-get install -y python3.11-dev portaudio19-dev then run it

#

that should fix the error with portaudio

viral mason
#

should that cell be before the first second or third cell

#

or does the order not matter with that

shrewd jay
#

What rvc should I use?

viral mason
simple ore
#

make sure the notebook clones 3.4.0

viral mason
simple ore
#

yes

shrewd jay
simple ore
#

that's not for you

viral mason
#

bro probably replied to the wrong message

viral mason
# simple ore yes

thx it worked, if I need to do this again in the future is there a way to easily upload the version of the notebook onto github and just find it the same way I do the usual kaggle applio notebook for importing it

shrewd jay
#

Yall got any good Vonovox settings..?

simple ore
#

(until we switch the version)

#

it stays unchanged in Kaggle in your account and you dont need to edit it

viral mason
#

can't keep training once it's full

simple ore
#

can probably nuke the model name in the logs

viral mason
#

only option is for downloading

#

unless it's something to do with code

whole mica
#

I am looking for an AI program that can convert my 3D renderings into an icon style while still making them look like my 3D model. I tried Midjourney, but when it comes to icons, I can't get anything decent out of it. ChatGPT comes closer, but I have over 100 icons to generate, and at some point ChatGPT always deviates from the style, usually after 2-3 generations. Do you know of any software that can consistently generate the same style for me? Where I can specify a style with images and my 3D model, and then it is converted for me?

half phoenix
#

i need a rly good vc for a girl im jus gonna troll to my frnds (ive got a rly bad accent )

#

oh im sorry i didnt read the rules sorry

#

np

viral mason
#

The number of these people grows each day

half phoenix
rapid bison
#

uhh i need help

#

i got error when i launch voice changer

#

i wanted to send a photo

#

butt

#

okeey sir

tropic mountain
#

my voicechanger cuts out while im talking how do i fix it

fluid lion
#

How much "faster" would a 5090 be vs a 4090 for rvc? Would it be a noticeable difference in ping/delay i can run it at

viral mason
simple ore
#

using training UI at the bottom

viral mason
#

I don't know what that means

devout bone
#

For some reason I have all the input and output right but I still can’t hear the voice

nocturne mesa
#

I can't hear my mic

#

Like my mic doesn't work

#

when i use the virtual audio cable

south verge
#

hey how can i make the delay a bit shorter?

tawdry needle
#

hello does anyone know how to make rvc work on 50 series ?

plain zinc
#

Do you know why it doesn't detect my voice or why I can't hear it? Yesterday the program was fine.

strange wraith
#

is there a video tut ?

rancid pine
#

Are these normal graphs while training or should I restart altogether?

simple ore
#

look at avg 50 charts instead

oak edge
#

oiii @simple ore

#

i'm training a 1hours 40 mins data set, it's been 345 epochs Y_Y

#

how much can I push more

simple ore
#

it is likely done already

#

check the charts

#

test the saved models

oak edge
simple ore
#

then expand two avg_50 sections and collapse others

oak edge
simple ore
#

what abouy loss_avg_50?

oak edge
oak edge
simple ore
oak edge
#

lemme send

oak edge
simple ore
#

something happened after 30k steps

#

probably not worth looking at those

#

so take the model saved around that step and give it a test

#

fp16 is not quite stable and it exploded for you

#

you can probably use a much smaller dataset.. 30-40 minutes

oak edge
#

yt_nails using gdrive backup

simple ore
#

something got messed up

oak edge
simple ore
#

perhaps the precision got switched mid-way

oak edge
simple ore
#

like you started training with applio 3.3.0, then the discriminator got fixed in 3.4.0 and default precision was set to fp16

#

in this case you need to restart

#

sorry about that, you better off using 3.4.0 anyway

oak edge
simple ore
#

but give the model from ~30k steps a try

oak edge
#

28649pepecry2 each session gives me around 20 epochs

oak edge
#

cat_huh cuz im not sure what you mean by that 30k, and what's before and after 30k

simple ore
#

the model name has a number of steps

#

your model got messed up after ~35k steps

#

most likely you did train at fp32 at the start, then we had 3.4.0 released and you resumed with fp16

#

then you also resumed after 58k steps with a wrong batch size

low shard
low shard
low shard
low shard
#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
south verge
low shard
oak edge
livid hollow
#

can anyone solve it :--


cpt = torch.load(modelPath, map_location="cpu")
2025-09-08 11:17:20,560 ERROR [MMVC_Rest_Fileuploader] 'config'
Traceback (most recent call last):
File "restapi/MMVC_Rest_Fileuploader.py", line 67, in post_load_model
File "voice_changer/VoiceChangerManager.py", line 118, in load_model
slotInfo = RVCModelSlotGenerator.load_model(params)
File "voice_changer/RVC/RVCModelSlotGenerator.py", line 42, in load_model
slotInfo = cls._setInfoByPytorch(modelPath, slotInfo)
File "voice_changer/RVC/RVCModelSlotGenerator.py", line 58, in _setInfoByPytorch
config_len = len(cpt["config"])
KeyError: 'config'

simple ore
livid hollow
#

Please, can you share the correct link for the RVC model I can upload?

simple ore
#

whatever you've downloaded is probably some old SVC model or something

#

how would I know that?

#

its like you went to a junk yard, ripped out a carburator from a 1960s car and now trying to plug it into a modern car with fuel injectors

livid hollow
#

I just want to use basic voice conversion, like any voice.

hallow thistle
#

I just found out on Saturday that you can listen to your "microphone" or "Line 1 (Virtual Audio Cable)" on VLC. Can be useful if you wanna hear what "W-Okada with output set to Line 1" is outputting, although the VLC can add up the delay for the audio you selected to hear but not really affect its actual "input" signal overall. cat_vibecat_vibeCatpls

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
#

That one from other guy was being sarcastic for sure. SVC and RVC are two whole different voice model architectures. SVC (so-vits-svc) is the old one from 2023, while RVC (Retrieval-based Voice Conversion) is the newer one and currently widely used. Also, SVC voice model cannot be used in an RVC program.

livid hollow
livid hollow
prime badge
#

I am using RVC b2332 Nvidia-Cuda, how do you slow the voice down? When I speak i can hear it in the monitor speaking faster then what I speak

hallow thistle
#

In this context, RVC voice model can only be used in Deiteris fork W-Okada, even as locally and online, because any voice model other than RVC was removed from that W-Okada.

hallow thistle
prime badge
#

Not delay from when I start to speak but slow down the speaking between words

livid hollow
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
hallow thistle
prime badge
#

If I say a sentence in my normal speaking speed I can hear it in the monitor speaking faster then I speak which leads it to sound weird

livid hollow
#

my goal is to use it on call centers... I've tried elevnlabs but there's no real time voice conversation option....

and its for one of my projects and not for catfishing or other spammy thing

prime badge
#

I find myself have to purposely speak slower

livid hollow
hallow thistle
livid hollow
marble cedar
#

Does anyone know the best set of resources or guides for hosting a huge model on a Google Cloud Compute Engine VM with a T4 GPU

hallow thistle
livid hollow
hallow thistle
hallow thistle
livid hollow
# hallow thistle What is your client's PC GPU?

As I’ve said, I’m using a cloud-based GPU, so that’s not a problem. Currently, I’m on the Kaggle free tier (T4) and at least I’ve got an interface for real-time, which I wasn’t getting with direct RunPod GPUs. But with Deiteris’ W Okada Fork on Kaggle it worked — the problem I’m facing now is a KeyError: ‘config’

hallow thistle
balmy tulip
#

if im using kaggle for w-okada do i have to rerun the cells every time i want to use the voice changer?

livid hollow
# hallow thistle I've never used Runpod service for myself. I only use Google Colab and Kaggle fo...

Not the fork version — I was using the old RVC repo which is discontinued. With that, I could only do basic voice conversion (uploading audio and getting output), not real-time.

Yesterday I found out about this server and tried the W-Okada fork, which surprisingly looks like it supports real-time.

But now I’m stuck on one error… someone mentioned I’m using RVC-based models, but I’m not sure what other types of models I should actually be using?

hallow thistle
hallow thistle
livid hollow
hallow thistle
#

These are RVC voice models. Their file extension is .pth, and its file size should always around 55MB. Most of RVC voice models often come with an index file, which is where it stores accent of that voice model.

hallow thistle
#

Instead, you might wanna find "voice models" in #1175430844685484042. The thing is there is no known "generic" voice model there; most of which are of characters and famous people.

#

Typical pth file size for a normal RVC voice model is around 55MB; any size that greater (> 60MB) or less (< 40MB) than this doesn't sound right.

livid hollow
#

well well well that's some help I was expecting.
thank you very much WBN 🙏

livid hollow
hallow thistle
livid hollow
hallow thistle
#

Thank you, and have a good day. anime_pray

tawdry needle
low shard
tawdry needle
#

i used this one for nvidia

#

i have a 5070

#

it opens in browser and my mic sound only work in it when i pick the server option with windows direct

#

client doesn't work

low shard
tawdry needle
#

and even after it work it has too many glitchs

tawdry needle
low shard
low shard
# tawdry needle then what is the best version i could use ?

This is a General AI Server, the program you need depends on what you want to do

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime. There also updated forks with extra features like Applio.

Wokada = uses RVC for realtime inference.

Vonovox = Another Realtime Voice Changer based on RVC, with similar quality and performance to wokada deiteris fork but other perks

#

Wokada isn't RVC

#

and RVC isnt realtime voice changing

tawdry needle
#

i sent you the pictures in dms

tawdry needle
low shard
knotty moth
#

you can first tell me if you need to show the screenshot

livid hollow
low shard
tawdry needle
plain zinc
viral mason
#

rvc is many different things lol

compact sorrel
#

anyone know why wokada gives me over 11000ms ?

low shard
#

show a screenshot of your settings, elaborate more since there are different versions

low shard
#

Done!

compact sorrel
#

capped myself on 60 fps roblox and still 17000 ms

low shard
# compact sorrel

from the interface, that looks like a 2 year old outdated original wokada, did you use a youtube video tutorial?

compact sorrel
#

could u provide me a link to a newer version?

low shard
#

delete the folder

#

also, if you have vb audio cable

compact sorrel
#

ok

compact sorrel
low shard
#

uninstall it from windows app settings

compact sorrel
#

kk

low shard
low shard
#

I'm guessing you're on windows 10/11

compact sorrel
#

11

low shard
#

to check if it's also good enough

compact sorrel
#

is that good enough?

low shard
#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

you can use either wokada deiteris fork, or tg-develop's fork (which is just a deiteris fork but with some Qaulity Of Life additions, don't expect too much from it ofc)
You can check each pros&cons in the docs :D

craggy dune
#

Just wondering, im testing some models and even though i have the monitor set to none, i can still hear myself play back. How do you turn that off?

low shard
#

!give-media-perms 1h @craggy dune

craggy dune
compact sorrel
craggy dune
low shard
# craggy dune

always check the Triangle, it's very important for non nvidia gpus
turn on echo, lower your headphones volume and keep your microphone far

low shard
craggy dune
#

but i have nothing selected for monitor and i can hear myself through my headphones

low shard
craggy dune
low shard
craggy dune
#

well it just sounds like monitor audio, to hear playback

#

but i dont want that

low shard
low shard
# craggy dune

also, be aware that extra over 2.7 could cause cutoff issues in some cases

craggy dune
#

I just dont want to hear myself

#

and what i was lead to believe monitor lets you hear your audio playback

#

which is what im hearing

lethal iron
#

Does anyobody have a up to date tutorial how to install comfyui-zluda on My rx 6750xt

low shard
craggy dune
#

When i say hi, i hear hi back with the voice changer once

low shard
# craggy dune What is vac light

vac lite, it's the one you're using right now as Line 1, used to get the output of the voice changer as input in other programs
you sure you did this?

#

can you show a screenshot of the recording and playback tab?

craggy dune
#

perfect

low shard
#

But glad to hear it's fixed now

craggy dune
#

and also my line 1 cable settings are set up properly

#

so yeah idk

#

weird bug

sand tree
#

Hello! Is there anythig I can do about weird pronounciation issues? For example, when i try to say words like "going" it pronounces as "gohee" as if it cant produce the "ing" sound

craggy dune
tacit verge
#

why my voice lagging and cutting out

low shard
low shard
shy gazelle
#

why when I turn on the downloaded voice, the res parameter increases to 8000, resulting in a huge delay

dry coyote
#

all i hear in my RVC is mumbling

#

i use deiteris's form

#

fork

wanton rapids
low shard
viral mason
# wanton rapids

All u do is download it, download vac lite, extract both, install vac lite on your PC then restart it (if needed), run the setup file in vonovox, then once that's done run the start file in vonovox

low shard
# dry coyote i use deiteris's form

There's no RVC deiteris' fork, rvc doesn't mean realtime voice changer

RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime. There also updated forks with extra features like Applio.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

Vonovox = Another Realtime Voice Changer based on RVC, with similar quality and performance to wokada deiteris fork but other perks

Maybe you meant wokada deiteris' fork?

low shard
knotty moth
#

for Nvidia gpus only

low shard
#

-# downloads it on Intel Mac trolley

viral mason
knotty moth
viral mason
#

Yeah, I wish those yt tutorials that are several years old would get taken down

#

Tbh all of them should be taken down especially since a lottttt use them specifically for the one thing (egrill shit) that 85% of the people that join here for want

wanton rapids
wanton rapids
#

It feels odd that a voice changer is only 6.6 mb

#

I wouldve used W-Okada but vonovox seems a bit better

viral mason
low shard
#

@viral mason @wanton rapids You don't need to install python externally/globally, it comes bundled with Vonovox

viral mason
#

that information will be helpful with the next time someone asks about that

#

ty Nick

wanton rapids
#

Thanks

#

Btw uh theres 3 different setup files in vac lite

#

Which one do i run

#

Setup, setup 64 or setup64a

low shard
#

It's written in the guide, and the next step will be crucial, be sure to follow it :)

low shard
wanton rapids
# low shard

Ah i see, theres line 1 in both input and output

#

I switch back to normal?

low shard
wanton rapids
#

Oki

#

Done

#

In vonovox do i run setup? The windows batch file

low shard
wanton rapids
#

alr it opened cmd idk when to expect the finish

low shard
wanton rapids
#

holy

#

3.5 gb

#

yea now it feels real

#

the models usually sound normal only in english/japanese?

#

i mean i see that most of them are english/japanese

ancient trout
#

Hi guys I’d like to create an AI that plays perfectly any time of SHOOT EM UP game but idk where to start and know nothing about programming.

Is there any tools that can help me achieve it with no programming skills ?

Thx in advance 🙂

fallow lava
#

Guys, what is the recommended specs for streaming realtime with Deiteris' W Okada Fork?

#

If I were to shop for a GPU now, which would you recommend?

low shard
wanton rapids
#

i just download a model and thats it?

#

i think i gotta set the virtual cable as the input device actually

low shard
low shard
wanton rapids
low shard
wanton rapids
#

vonovox output is set to line 1 just so i can hear the voice changer myself right?

low shard
low shard
rancid pine
#

Is contentvec or spin better for non-English RVC models? I'm looking for a Chinese and Japanese pretrained model, please give me recommendations

simple ore
rancid pine
#

Shucks

simple ore
#

spin-v2 is what we got now

obtuse kestrel
#

hello, do we still need to install this "python 3.12.7" to our windows, i use fork okada

low shard
wanton rapids
#

@low shard sorry if im bothering, do you know any good female voice models? the one i downloaded sounds kind of weird when i use it

viral mason
#

What are you using those for hmmmmmmmmm? 🙂

wanton rapids
#

no one has to know

#

but basically, even the ones meant for english sound like they were made for japanese or smth

#

idk if im using it wrong

viral mason
#

Do you have an accent by chance?

wanton rapids
viral mason
#

The voice could be trying to copy that too

subtle gate
#

how to fix lag and cutting out

wanton rapids
#

a bit of a slavic accent but its not anything crazy

viral mason
subtle gate
#

it is over 5 sec

viral mason
#

Might help

wanton rapids
#

like i said, the accent is there but u can barely hear it

#

i dont think it should present a problem

viral mason
wanton rapids
#

maybe its the pitch?

viral mason
viral mason
#

Usually 3 is good

viral mason
#

Oop

wanton rapids
#

i have a bit of a deep voice 💔

#

11.7 to be specific

viral mason
#

Hmm

#

Just mess with the pitch until it sounds like the model, btw it really helps to actually put effort in maybe making your actual voice a bit higher pitched when talking

wanton rapids
viral mason
#

I change my voice a little in how I talk with some models like if I use the battle droid compared to just a normal person

wanton rapids
#

either way i dont think makima is it

#

💔

viral mason
#

Dang

#

Well try finding or u could make a model that fits better

wanton rapids
#

yea i need a better voice

wanton rapids
viral mason
#

Uhhh probably not regular female models but u could check my weights page if u wanna browse through everything

wanton rapids
#

ill try it

viral mason
#

I mean u could look, I kinda have to use the bathroom rn 🙏

wanton rapids
#

ah

#

how do i find ur page

hexed osprey
#

Can someone tell me the setup for wokada with my 3060 ti and AMD Ryzen 5 3600 6 core? thanks

viral mason
low shard
timid talon
#

can i ask for help?

lusty whale
#

is it just me or did wokada recently start bugging where your not able to hear yourself nor hear anything coming out of your mic to others

narrow wraith
#

How can I get the voice of a character?

hallow thistle
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
#

You can ask anything, it doesn't have to be "can I ask a question?". Also, make sure to read help guidelines before start asking.

short oracle
#

hii i would like to ask how to reduce rtc latency

low shard
short oracle
#

i use deiteris wokada fork

low shard
#

!give-media-perms 1h @short oracle

short oracle
#

its loading D:

low shard
short oracle
#

while using it

low shard
# short oracle i am streaming game

Play on lowest graphics 1080/720p 60/30fps
the gpu prioritizes the game since it's rendering something on the screen, so you have to give more resources to the voice changer

low shard
# short oracle 5080

lowest graphics 1080p 60fps cap is usually suggested for everyone, because this way it can give more resources to the voice changer to have less delay
The GPU is doing 2 very intensive tasks at the same time, if you get what I mean
I mean you're free to experiment to see, but playing on like 4k max graphics is going to take more resources which can higher up the delay, ofc this can depend by the game too

low shard
# short oracle

on wokada deiteris fork, you can **optionally **use more advanced settings for benefits:

  • Advanced Settings -> Force FP32 mode: on (THIS IS OFF BY DEFAULT! Turning this on improves stability. Increases VRAM usage by 200 MB)
  • Advanced Settings -> Disable JIT compilation: off for faster loading speed of the program, on for slightly better performance (10-15 ms) for Nvidia only)
  • Advanced Settings -> Crossfade Lenght: Controls how smoothly the AI stitches different processed parts "chunks" of your voice back together. 0.1 for fastest voice, 0.15 for improved quality but increases delay by ~50 ms
  • Reduce the delay on Windows via the Wasapi / Asio Guide
#

You could also give Vonovox a try, it still gets performance updates, you can cehck its pros&cons if it fits your needs btw

short oracle
short oracle
#

i would never doo that, (totaly not streaming metal gear delta on 4k)

low shard
#

They are in the Advanced settings section, and in the docs

short oracle
#

okayy thank youu

low shard
low shard
short oracle
low shard
short oracle
#

ahh i see

agile relic
#

I'm having a problem installing it

agile relic
#

okay I fixed it myself

brittle wing
#

hi new here - I'm looking for a voice changer to do agent voiceline soundboard ingame - can anyone tell me which one can I use?

oak edge
#

@low shard

#

can you explain how these live voice changers work?

hallow thistle
#

Similar to RVC that converts an audio with an AI voice model, but this one converts and processes audio in realtime.

agile relic
#

@hallow thistle I'm having trouble getting the cable output VBCABLE to get the voice changer to sound in it and have it working.
Is there any guide link or information available to get this fixed?

#

I'd appreciate even a copy paste link from a previous question of this type as well

hallow thistle
agile relic
#

I was using some youtuber's tutorial who told to use VB-Cable and the comments on the video really said it was working, I'll try the one you suggested, do I uninstall vb cable?

hallow thistle
agile relic
#

I annalyzed in virustotal the (vac) and is this a false positive?

  • Win32.Troj.Undef.a
#

Virtual Audio Cable exe program

hallow thistle
hallow thistle
#

And what is your PC GPU?

agile relic
agile relic
#

I can get it in text wait

hallow thistle
agile relic
#

System Information
Current Date/Time: Wednesday, September 10, 2025, 3:07:51 PM
Computer Name: DESKTOP-9BIQ8BE
Operating System: Windows 10 Pro 64-bit (10.0, Build 19045)
Language: English (Regional Setting: English)
System Manufacturer: LENOVO
System Model: 82H8
BIOS: GGCN29WW
Processor: 11th Gen Intel(R) Core(TM) i3-1115G4 @ 3.00GHz (4 CPUs), ~3.0GHz
Memory: 8192MB RAM
Page file: 11621MB used, 8657MB available
DirectX Version: DirectX 12

#

Name: Intel(R) UHD Graphics
Manufacturer: Intel Corporation
Chip Type: Intel(R) UHD Graphics Family
DAC Type: Internal
Device Type: Full Display Device
Approx. Total Memory: 4123 MB
Display Memory (VRAM): 128 MB
Shared Memory: 3995 MB
DirectX Features
DirectDraw Acceleration: Enabled
Direct3D Acceleration: Enabled
AGP Texture Acceleration: Enabled

#

Can it run in cpu (since no dedicated gpu bc laptop)

hallow thistle
agile relic
#

I can run it in the cpu

#

I think

hallow thistle
#

Yes, you can run W-Okada with only CPU but it will be really slow as hell; also not recommended.

agile relic
#

the cpu is kinda good I'd say for being a budget laptop

hallow thistle
agile relic
#

400$ laptop, scam?

knotty moth
agile relic
#

it will run on the cpu just fine don't worry

#

my cpu never failed me

hallow thistle
hallow thistle
agile relic
#

do I run setup64 or setup exe for vac

hallow thistle
agile relic
#

"I am an advanced user" ?

knotty moth
#

doesn't it?

agile relic
#

in 192

hallow thistle
#

Also, don't expect to run the program with another game at the same time. You know right?

agile relic
#

also when I installed the mmvcserver it wouldnt work and I had to manually create some folders and empty jsons that were missing idk if this has to do with anything

hallow thistle
#

What the hell?

agile relic
#

my laptop is the best

hallow thistle
#

I don't trust you for this, seriously.

agile relic
#

I got the cable installed Imma try to figure the rest

#

I think I should reset my pc

#

restart

hallow thistle
agile relic
#

nah I'll be fine, you'll see

hallow thistle
#

No.

low shard
#

AI is intensive, it's not easy to run locally nor a site like chatgpt

hallow thistle
#

Any W-Okada variant, like NVIDIA and AMD/Intel (DirectML), that made for Windows can work with only CPU; the AMD/Intel one is somehow known to work with an integrated GPU like Intel UHD Graphics while NVIDIA one doesn't.

agile relic
#

the console keeps saying
Pipeline is not initialized.
Waiting generate pipeline...

hallow thistle
#

You can't be overconfident on this one. hibikidepressed

agile relic
#

I still dont get volume in the virtual audio cable

hallow thistle
#

Deiteris' fork W-Okada should look like this. If you still use the version that has many files in the same folder, you'd better stop use that version.

knotty moth
#

(first ask me for image perms to show the screenshot)

agile relic
#

I was using the latest nvidia 18a something

agile relic
#

"Waiting for the operation to complete..."

#

it loaded

knotty moth
agile relic
#

I dont understand

knotty moth
#

same, I don't understand why you don't understand on it

agile relic
#

well the ping says over 11000ms and it keeps raising

knotty moth
#

so you've got the problem, eh

agile relic
#

I can change the settings

knotty moth
#

@agile relic

agile relic
#

yes I downloaded this and I'm trying it rn

#

the one with 15k ping

hallow thistle
#

v.1.5.3.18a and b2332 are not the same one.

knotty moth
agile relic
#

"Download AMD, INTEL and CPU on Windows
The latest version as of December 7th 2024 is: dml-b2332 (click here to download)"

#

should I just stream

hallow thistle
#

Struggling huh, that's unfortunate, but here's the settings you can try:
Chunk: 300 ms
Extra: 2.7 s
GPU: CPU

#

F0: rmvpe_onnx

#

Now expect it where its audio being delayed more with higher audio quality at the same time.

agile relic
#

Is Line 1 supposed to never register anything it's just my microphone's volume bars that increase, but the Line 1 Virtual Audio Cable is not getting any signal

hallow thistle
#

Input: your microphone
Output: Line 1 (Virtual Audio Cable)
Monitor: your speakers/headphones

agile relic
#

Yeah and it doesn't work

#

"
Input: Microphone (TRM-10 Audio Device)
Output: Line 1 (Virtual Audio Cable)
Monitor: Headphones (Realtek(R) Audio)
"

#

just forget it, ig my pc is just dumb

hallow thistle
agile relic
#

I'll just stick to voiceweave or some shi

#

that one did work for me and it was just 200ms delay

hallow thistle
#

You should've give up expecting your laptop to be great the first time I told you to go for an online option lol.

agile relic
#

oh you meant this when you said online

hallow thistle
#

I gave you the other options, and you just ignored them. Now it's your choice.

agile relic
#

p2w

knotty moth
knotty moth
#

well whatever you'd experience is your own responsibility GoldshipShrug

viral mason
#

Never heard of voicewave, def worse than any rvc program if it doesn't use rvc

agile relic
#

it has the option to use dedicated gpu for quality or simply your cpu but some sound dogshit others are passable for the most part

#

"some" almost every single one

agile relic
oak edge
#

@viral mason gang you know anything about realtime voice changing apps?

viral mason
oak edge
viral mason
#

That is the only flaw

viral mason
#

No, you can only put 8 models in the voice changer at a time

#

Dr87 is working on that tho

#

Eventually

ocean bluff
#

I used ai to make me look like a chad in my selfie 🗿

oak edge
viral mason
#

Good for you 🧍

viral mason
oak edge
viral mason
#

It has nothing to do with training man it's you just put models in it that were already finished

oak edge
#

im confused

ocean bluff
#

I also used AI to create a relistic guy pooping in a bucket

oak edge
#

like the models im trainin in applio rvc?

ocean bluff
oak edge
viral mason
#

Like, any models, the ones in the voice models section, the one's on weights, any of them

viral mason
oak edge
viral mason
#

Epochs aren't important on the voice changer because it's just using the model you have live

#

Instead of putting like a silly song in it to make it sing

ocean bluff
#

I made s relistic poop

#

It so stinky

oak edge
ocean bluff
#

so shmelly and wet

viral mason
#

Btw u have to have vac lite downloaded

#

It should be in the guide

oak edge
oak edge
#

oh it's just virtual audio cable

#

i already have it

ocean bluff
#

diareaa

viral mason
ocean bluff
#

I wonder why no one talks to me

#

Cause im so nonchalant and my farts are so stinky

hallow thistle
viral mason
#

Shh don't interact with it

hallow thistle
hallow thistle
oak edge
hallow thistle
#

Deiteris fork W-Okada has more friendly GUI, while Vonovox is more of professional.

oak edge
#

but whatever gives me best quality im ready to learn

naive dune
#

Has anyone ever used vibe voice?

brittle bobcat
#

will Kraggle ever be fixed? Does it have any relation to this server? its on aihub

latent kettle
naive dune
latent kettle
#

1.5B is bad. 7B (large) is good. For English. You can add multiple speakers and create podcasts

latent kettle
naive dune
#

Im basically trying to create or clone a character voice and trying to get realistic emotions

#

Idk if vibevoice is best for that

latent kettle
viral mason
naive dune
#

Im using it for my manga and stuff

latent kettle
latent kettle
naive dune
#

thx

low shard
brittle bobcat
#

it's the one on aihub that links to it. should i link it?

latent kettle
low shard
naive dune
latent kettle
naive dune
#

Okay

latent kettle
naive dune
#

looks like it, ill update u on what happends

latent kettle
merry forge
#

i js tried training my first model on rvc but i didnt get the pth, how do i get it?

latent kettle
#

Check logs folder 📂

merry forge
latent kettle
latent kettle
#

Check in assets\weights

#

@merry forge

merry forge
latent kettle
worn jungle
#

How to use voice model?

merry forge
#

if u mean the save small model

lusty whale
#

does anybody know why your not able to hear yourself or anything come out from wokada

#

or is it just me

brittle wing
#

o. o

#

hey

knotty moth
timber heart
#

For the Kaggle Applio notebook, what's the format for the dataset file path?

sonic bear
#

im new to this ai stuff i made a song but i want it to sound real as possible like ynw melly voice can i send it to someone to help

timber heart
#

Ah man, now I feel like a fool; I had leftover datasets from a year ago, I just forgot about how to do that part lol.

#

Thanks man.

viral mason
hexed osprey
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
low shard
patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

You can choose either wokada deiteris fork or vonovox

#

Read each pros&cons

worn jungle
simple ore
latent kettle
timber heart
#

Thanks for that as well.

elfin nebula
#

So I already trained my voice model to have a neutral voice, but how do you give it a wide range of voices that emulate emotion?

mortal shell
#

best rvc model

#

wth ai not working

agile lynx
#

not the ai heliping

#

the people is gonna help you not I

#

ai*

viral mason
#

can we give homeless guy the award for most useless help message in this chat

twilit jasper
#

Hey! I'd like to upscale and interpolate a video from compressed and grained 1080p 30fps to clean 8K 60/120fps. The video is a gas giant in space, so with a lot of black and a lot of big compression blocks.
I tried multiple models (OpenProteus, Real-ESRGAN and another I can't find in my mind) but every models has an issue.
Open Proteus has a blurred result for great colors, the keyframes of the video are pretty visible (even more when slowing it down) and the compression was not perfectly concealed.
The other model I can't remember has good details but a bit too much contrast, the real-looking video almost became like an anime, and the keyframes was no longer so much visible, but the compression was looking blocky and weirdly colored.
And Real-ESRGAN is very slow (It's not a problem for me if the result is good, I can keep my computer running for a few days if needed) and has wayy to much contrast, it's anime-looking (But I feel like it's typical Real-ESRGAN, almost every photos I upscale with it has too much contrast)

Can you recommend me a good model for upscaling compressed videos?

#

(I don't need interpolation in first time, I can do it after upscaling and most interpolation models are pretty good)

compact sorrel
#

how to train models/voices?

naive dune
sly valve
#

I’ve made sentences where I say gibberish, and Speech To Text translates it to what it thinks I’m saying. What channel should I share it on?

burnt shell
#

hey guys so i installed a voice and when i talk it just like freezes mid talks

midnight fern
#

Does anyone have a working link for okadas voice changer? Bc my pc isn’t too good for the program

viral mason
broken urchin
#

hey i was wondering

#

what's the difference between one big dataset file for RVC voice training

#

like one big one with 90 minutes

#

or the same one but its just split in 50+ different audio files

#

what's the difference?

#

does it produce better or worse results or

viral mason
midnight fern
#

Is there any links?

viral mason
broken urchin
#

even if the audio isn't split

dim peak
#

how to i change epochs on rvc

viral mason
frail bolt
#

Hi, i am searching for an AI to sort my mail, for example Gmail.

simple ore
#

no

compact trellis
#

guys when i downloaded the main voice changer and i try open “http_start.bat” it doesn’t work for me what do i do?

knotty moth
knotty moth
willow lintel
#

Anyone know what app would be suitable for a voicechanger in mac i5? Ive been trying so hard but no results. It would be great if someone also gave me a tutor

latent kettle
naive dune
latent kettle
#

What error?

naive dune
#

let me send it

latent kettle
#

If it says unable to build whell, install visual studio 14+

#

Than you have to download Microsoft visual studio and install some frameworks

naive dune
#

ok

latent kettle
naive dune
#

25.1.1 im upgrading it right now

latent kettle
#

Python --version

#

Try 3.10 or 3.11

naive dune
#

okay

#

Its the same

knotty moth
# naive dune

try using python 3.11 instead of 3.13 you're having

knotty moth
naive dune
#

YESYESYESYSEYESYYESYES

#

ITS WORKING

#

ITS INSTALLING

#

I set it to path

naive dune
marsh raven
#

@knotty moth are you able to help figure out VAC its not being detected even in discord

broken urchin
latent kettle
naive dune
#

So I used chat gpt to help me 💀

#

And it worked eventually

#

Also for some reason it’s using my cpu now which is weird so I gotta change it to use my gpu

simple ore
#

and for gpu use you need to run torch install with cuda

willow lintel
#

Anyone know what app would be suitable for a voicechanger in mac i5? Ive been trying so hard but no results. It would be great if someone also gave me a tutor

short torrent
#

I set deteris w-okada fork for cuda and everything and the voice changer seems to work fine in all of the tests but whenever I talk over discord it shows that there's no sound? I have a GTX 1070 Ti I know it's not the best but when I test my mic within discord settings it sounds perfectly fine and everything runs smoothly, it's only when I try talking in a voice call or within a game where there's zero sound being heard or indicated
There were a few threads about this issue but none had an answer so I was wondering if anyone experienced knows why this is happening?

weary socket
#

anyone knows if the training script from applio is different from the original rvcwebui?

jovial spire
#

question lol 1-Does anyone here use voice.ai and 2- have people been getting the "No network connection, voice initialization failed. Please try again #3769" error? I can't find any way to see if their servers are down :/

weary socket
#

I wonder if using applio instead could improve my model

#

Also I wonder what happens if I train directly using multiple voice sources, in contrast to using model blender(average the weights) of multiple models

viral mason
#

If u use an Nvidia GPU u should try out both but if u have some other one like AMD or Intel you could use the first link/guide

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

jovial spire
latent kettle
latent kettle
simple ore
#

nothing, if you dont need chinese it should be fine

latent kettle
simple ore
#

with python 3.11

#

you can install as is, you just need to clone the repo, update pip setuptools wheel packages

#

then install numpy standalone, then you can use pip install .

#

for python 3.12 you need to modify pyproject.toml and change numpy version to ==1.26.4 instead of the existing stupid requirement

#

then it installs just fine

latent kettle
#

I see, thank you again cute_chickheart

pine smelt
#

Dunno if anybody has experience with this, but is there a reason why in the tg-dev fork I cannot select GPU under Processing Unit? My only option is CPU

simple ore
pine smelt
#

In deiteris I can and did select my 4090 no problem

#

but here I seem unable to do so

#

It's just not an option in the select field

simple ore
#

you got the right fork?

pine smelt
#

Uh, I think so but I might have not?

simple ore
#

what did you download?

pine smelt
#

Lemme check, perhaps I picked the wrong one from the release page?

#

Oh damn that's it, sorry for being such a noob

simple ore
#

Download NVIDIA on Windows

Download all of the voice-changer-windows-amd64-cuda files. This will include multiple files, likely ending in .zip.001, .zip.002, etc.
pine smelt
#

your name nickname would be more appropriate for me in this case damn

#

sorry for wasting your time!

quiet hatch
#

Agentic AI Framework (agenticaiframework) is a next-generation Python SDK for building agentic applications with advanced orchestration, monitoring, multimodal capabilities, and enterprise-grade scalability.
It offers a modular, extensible architecture for creating intelligent agents that can interact, reason, and execute tasks across multiple domains — from simple automation to complex multi-agent ecosystems.

https://github.com/isathish/agenticaiframework

short torrent
#

everything is up to date & i'm using the right virt cable

simple ore
#

may want to use push to talk in discord

short torrent
knotty moth
weary socket
simple ore
weary socket
#

Is there some out of the box mixture of export for this purpose?

simple ore
pine smelt
jovial spire
#

Hey is there a ticket system? I have a mod question

hidden hinge
#

few questions im quite new to whole real time voice changer thingy... so how much quality is lost if you speak in language model wasn't trained on, what is good few voices i could get to play around with them, and what are the best settings for mmcv ?

viral mason
#

if u got the voice changer from a yt video it's probably over a year old btw, and for ur question pretty sure if using a different language on a model trained on something specific doesn't change quality

low shard
viral mason
hidden hinge
viral mason
#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

it's the best realtime voice changer out there (rvc doesn't stand for that) but if that doesn't work just download deiteris

hidden hinge
viral mason
#

and if u need to switch to deiteris use these

viral mason
hidden hinge
viral mason
hidden hinge
#

also dumb question how do you eject file?

viral mason
nova willow
#

does anyone know good ai voice cloning stuff that I could use for voice modding that is low gpu usage

viral mason
nova willow
#

I am bored

hidden hinge
viral mason
hidden hinge
viral mason
nova willow
#

I want to use it for horror games

#

since I can't afford a voice actor

viral mason
#

make sure your voice settings are
input: regular mic (headset, headphones ect)
output: vac lite
@hidden hinge

viral mason
hidden hinge
nova willow
#

I got a 3050

hidden hinge
nova willow
#

from nvidia

#

as I said I am poor

viral mason
# nova willow from nvidia

that should work fine with either of these, there's a guide for both but f u want good quality go with Vonovox, there's a guide here, it's what the other person is using too

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

hidden hinge
#

yee it doesn't use much of my vram so you should be able to run it on 3050

nova willow
#

what is the one that has the least amount of artifacting

azure patio
#

guys im lost what rvc is the best to use on web?

viral mason
nova willow
#

I don't like when I hear ai voice artifacting

#

sorry if I am being harsh

viral mason
#

ur ok

#

use Vonovox, it's got a lot of settings to prevent that kinda stuff

viral mason
#

rvc doesn't mean realtime voice changer btw lol

azure patio
azure patio
lethal iron
#

which tutorial do you guys use for comfyui zluda windows 10

true iron
#

hi, can you guys help me? I try to find good RVC for voice change

viral mason
nova willow
#

also since I am on the topic of game development and I can't hire a 2d artist/don't wanna dox people, what should I use to make realistic images (the game I am making is like welcome to the game)

viral mason
#

Nvidia, amd, intel

viral mason
nova willow
#

I like the game so I wanted to make my own version of it on unreal

#

not drawing anyone

viral mason
#

That's interesting

nova willow
#

I have reasons to do stuff I wanna do

#

since I am a gamedev

#

and I believe that fiction is fiction

#

and shouldn't go into the real world

viral mason
#

Y'know you could put in the lore that an ai is making the art so you have an excuse for ai art

nova willow
viral mason
#

Just a suggestion of course

viral mason
#

3D humans orr

nova willow
#

and I want to have the same type of person so I don't want it to have overlap and look like a completely different person

sonic garden
#

-colab

patent trellisBOT
# sonic garden -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

nova willow
#

its web browser based

viral mason
#

Ah ok, I'm not sure tbh what you could use

#

I'd ask around

nova willow
#

ty

#

I just don't like using real humans in works of fiction so that is my reasoning

#

espesically of this topic

viral mason
#

I don't really understand it but no judgement from me 🤷‍♀️

nova willow
#

so you can understand why I DON'T want to include real humans in this

#

also the creator themselves use pictures of real women and a lot of people didn't like that

#

and I agree with them

hidden hinge
nova willow
hidden hinge
jaunty walrus
#

Anyone got this working on NixOS?

#

Even with FHS I couldn't get Deiteris' fork working:

ImportError: /home/xnefas/Documents/AI/MMVCServerSIO/_internal/libstdc++.so.6: version `GLIBCXX_3.4.29' not found (required by /home/xnefas/Documents/AI/MMVCServerSIO/_internal/onnxruntime/capi/onnxruntime_pybind11_state.so)
nova willow
#

I spent 5 bucks

#

and it has NOTHING

simple ore
#

sudo apt-get dist-upgrade

upbeat thunder
#

hello

balmy trench
#

Hello, everyone. Where can I try seedream v4 edit? I couldn't find it on Hugging Face.

upbeat thunder
#

bruh how yall doin this

weary socket
#

I got following message after running the training script: "Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?" I do have sliced the audio, and roughly 400 audio files in the sliced directory. what could I miss?

hallow thistle
weary socket
#

oh, my f0 directory is empty...

#

I thought the extract.py would do the feature extraction

hallow thistle
rare flint
#

where's my RMVPE

hallow thistle
# rare flint where's my RMVPE

"rmvpe" is only available in NVIDIA variant of W-Okada, and it doesn't exist in DirectML of it. Judging by your screenshot, it seems like you're using the "original" version of W-Okada DirectML.

knotty moth
dry coyote
#

i went through a few forums

#

and the reason why its outputting mumbling is because i was using a spin model which deiteris's fork of wokada doesnt support

#

neither does the base version

#

but apparently i can implement a SPIN embedder

#

but idk how to do that

hallow thistle
#

I can implement a SPIN embedder
but I don't know how to do that
Really?

dry coyote
#

did u check the pull req

simple ore
#

there's a new fork with that already

dry coyote
simple ore
#

duh

#

yes

hallow thistle
dry coyote
simple ore
#

there's spin-v2 now

dry coyote
#

its just a fork of wokada

simple ore
# dry coyote its just a fork of wokada

"Wokada Tg-Develop's fork since it's a fork of the Deiteris' Fork, containing the performance improvements, has an improved Web User Interface, supports Spin Embedder Models, and has Audio Effects."

hallow thistle
simple ore
#

read the link above

hallow thistle
dry coyote
#

ah i see it now

hallow thistle
#

You didn't see my link?

dry coyote
#

i dont read i just download whatever

#

no hablas engles

hallow thistle
#

That's an issue.

dry coyote
#

its whateverrr

hallow thistle
#

Still.

dry coyote
#

damn this fork is big download

#

3gb

#

ts gonna take like 20 minutes

hallow thistle
#

Almost every AI program is as big as that in GB. This specific fork W-Okada, as you'd imagine, would pack with many features in there. Deiteris fork, however, certain packages and features were removed or missing when those didn't need for most users.

#

If your current internet is slow, it's gonna hard to get it done lol.

dry coyote
#

😼

hallow thistle
dry coyote
#

oh btw is it worth it to get a second gpu for just wokada

#

and if so whats the best one to use for primarily for it

#

in terms of value

hallow thistle
#

My internet is around 120Mbps, while not the fastest, it can download files in gigabytes in not hours but rather minutes.

knotty moth
#

4060 would benefit AV1 streaming

dry coyote
dry coyote
knotty moth
dry coyote
#

😭

dry coyote
hallow thistle
# dry coyote oh btw is it worth it to get a second gpu for just wokada

NVIDIA GeForce RTX 2060 is the minimum for most tasks, RTX 3060 is in between, and RTX 4060 is faster, depending on your budget. RTX 50 is the newest, but requires a specific fork W-Okada in order to work, which the Tg Develops one is not yet known to have the version for this GPU.

rare flint
#

uhh? i tried downloading the new one

hallow thistle
# dry coyote wdym

AV1 is a video codec. You might not need to know now, but only needed if you want to encode a video with higher quality possible some day.

hallow thistle
rare flint
hallow thistle
rare flint
#

hold on i think it's working

knotty moth
rare flint
harsh oriole
#

Good day!
I need help about the changer it does not apply to discord...

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
harsh oriole
hallow thistle
#

What is your PC GPU? Did you follow any tutorial before this?

jaunty walrus
hidden hinge
keen badger
#

Unsupported post request. Object with ID 'xxxx' does not exist, cannot be loaded due to missing permissions, or does not support this operation. Please read the Graph API documentation at

GUYS HOW CAN I FİX THİS, WHATSAPP CHATBOT VİA FACEBOOK DEVELOPERS

limber tangle
#

-colab

patent trellisBOT
# limber tangle -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

boreal mango
#

hi! how would i go on about training a voice model?

#

i wanna use it for narration

#

literally have no time to read

low shard
# boreal mango literally have no time to read

RVC is STS (Speech-To-Speech), not TTS (Text-To-Speech) natively, unless you use another TTS to make an audio output to use as an input in rvc (which is what for example the Applio RVC fork does automatically with Edge TTS, which is multilingual and good but emotionless)

boreal mango
oak edge
#

yooo can someone help me out with formant shifting? (about how it works and how to use) for context i have male voice and trying to apply that voice for female portion of the song

low shard
#

So I got an AMD Radeon 860M gpu, should I run a voice model on cloud?
yeah, your gpu isn't the best for local AI

i'm guessing you want a realtime voice changer for calls/games (not pre-recorded audios) and are atleast on windows 10/11

#

-realtime

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

you can use wokada deiteris fork or tg develop modified version

low shard
#

u could just use a TTS alone

boreal mango
copper ferry
#

Hi, I need help with RVC.
I believe my RVC is not functioning as intended. It does not change my voice at all. the delay is also extremely inaccurate. It's my own voice no matter what I change. It also does not let me hear myself unless I use pass through option.
The GUI of it is also very different with what I've seen from others.

#

v 1.5.3.16a onnxdirectML-cuda

#

I also have an AMD GPU, so if I should download a newer/different version of it or do anything to get it to work please tell me.

viral mason
#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

use the first link

#

download the amd version

humble vortex
low shard
sour sun
#

Hi, is the Spin pretrain supposed to sound like garbled mumbling, even when I'm using the new LegacyCore spin pretrain and have selected the Spin feature extraction?

simple ore
analog obsidian
#

i'd not use that pretrain because it got trained when the discriminators were broken

simple ore
#

also there are two of them now (spin and spin-v2)

sour sun
#

Hmm lemme check

#

I renamed it a bit for ease of use but I downloaded them pretty recently, unless there's another page I didn't see 🤔