#✨│ai-help

1 messages · Page 281 of 1

viral mason
#

do you have two possibly like this in your task manager u can check, just click preformance

#

I saw someone said they only have amd but didn't know they had nvidia

stiff dome
#

hi so i wanted to get into ai art creation but idk were to start (i mean what program to use)

elfin nebula
#

I'm training a voice model rn I just need to know which pretrained model I should use

#

Is there a place where I can hear what the pretrained models sound like?

stiff dome
viral mason
#

beatsforge is for drums

stark lion
#

Can I import the voice model I want into Voice Changer?
I'm referring to the ones that are in the models channel here on the server

stark lion
#

Oki doki

viral mason
#

unzip the model and make a new folder to put all of the ones u download in, makes it simpler to keep track of everyone of them

elfin nebula
#

Oh you mean the pre-trained in the gdocs file gotcha

viral mason
#

just train it without choosing a custom one

#

uncheck this for original pretrain

elfin nebula
#

My data set isn't that long wouldn't it have quality issues?

viral mason
#

if the entire dataset has consistent quality nah

#

how long is it?

elfin nebula
#

3 mins

#

Did the thing with audacity too

viral mason
#

do batch 2

#

actually what are u training on kaggle or smth else

elfin nebula
#

Applio

viral mason
#

no I mean

#

like applio on wha

#

is it local?

elfin nebula
#

Local yeah

viral mason
#

ok yea just do batch 2 for that since small dataset

elfin nebula
#

Ngl idk what that means

#

Do you mean the batch size option? Change this to 2?

elfin nebula
#

Do I just send it like this then?

#

The ai docs says 15 if you're new, and total epochs "go for an arbitrarily large value like 1000"

#

Or for "saving every epoch" do I set it at 1

elfin nebula
viral mason
#

even like 700 is probably overtrained

viral mason
#

since in the end you'll delete them all besides the one that turns out the best

elfin nebula
#

So save every 5 and change total to 700?

elfin nebula
viral mason
#

just have it high so u don't have it too low and miss the peak of your model

viral mason
#

but yes every 5

#

save every 5

elfin nebula
#

Okay, already changed it. Batch size 2, save every 5, total 1000. I'll go and generate index then

viral mason
#

good to go, make the index then hit train

#

if u need any samples to test for singing or talking just lemme know I got a bunch

elfin nebula
#

Okay, thank you! I'll let you know how it turns out!

viral mason
#

still kinda low on talking ones especially for female models

ashen bane
#

can anyone help me

viral mason
#

with

#

?

ashen bane
#

i can't show u

#

ss

viral mason
#

that sucks

ashen bane
#

initializing and its done

#

but the voice changer doesn't lauch

viral mason
#

what voice changer

#

there's like 3

#

did u use a yt tutorial to download it btw

viral mason
#

if yes u downloaded something ancient and super outdated

#

so uh what gpu do u have bc u need a new voice changer

ashen bane
#

oh ; (

viral mason
#

nvidia, amd or intel

ashen bane
#

i waited 1 hr bec of internet

ashen bane
viral mason
ashen bane
viral mason
#

I know you're a skeleton but use your brain

viral mason
#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

download that third one

#

read the guide and download vac lite

#

and the voice changer, gotta scroll down a bit for the download link to it

ashen bane
#

thrid one?

viral mason
#

"Wokada Deiteris Fork
Most suggested WebUI with the best general support for many platforms. GUIDE"

#

that one ^

viral mason
#

np!

#

if u have more questions or need more help just lemme know here

ashen bane
#

whcih one btw

viral mason
#

which wha

ashen bane
#

kaggle

#

lighting

#

google

viral mason
#

for what

#

here

elfin nebula
#

So it finished suspiciously fast so I looked into the cmd...

viral mason
#

probably glitched

#

not even a 30 second dataset finishes in 6 seconds

elfin nebula
#

So idk why it's looking through the 5.7 when I have the 6.2 installed

elfin nebula
viral mason
elfin nebula
#

Do I just relaunch the thing?

viral mason
#

maybe? idk how the local one works as I use cloud

elfin nebula
#

Okay, I'll try this again

viral mason
#

I'd maybe ask Noobies as they're pretty smart with this coding stuff unlike me

elfin nebula
#

On second thought... I'll just make a thread. That'll be safer

viral mason
#

yup

wheat plover
#

any updated guides on GPT-SoVITS? or, like, something better for TTS?

ashen bane
simple ore
#

you need to change the visible device in .bat

ashen bane
viral mason
#

I'm having issues with Kaggle Applio not working, I get a connection error every time I click anything what is this

simple ore
simple ore
#

if you can access the UI, there should be no connection error

#

so likely the process did stop while you're stumbling around your UI

viral mason
#

this issue followed me even after deleting an account and making a new one

#

either kaggle hates me or there's an issue with the current applio

ashen bane
simple ore
viral mason
simple ore
viral mason
#

ah

ashen bane
viral mason
ashen bane
viral mason
#

o

#

should be fine then

#

I'm used to it being in dark mode

viral mason
ashen bane
#

which voice changer i use?

signal lion
#

Whats the best gain tune index and everything if u have a deep voice and ur gonna troll ur friend with a girl voice

thick latch
#

How exactly do you change the name of the uploaded models in Vonovox? There's no "ok" button and closing it doesnt save

viral mason
viral mason
green bramble
#

hey sorry im abit lost but how'd i create a tts ai speaker with a custom voice model in the thread?

knotty moth
green bramble
elfin nebula
#

Do I end it here?

simple ore
#

your model blew up

elfin nebula
#

So... Bad?

simple ore
#

check what you got < 20k steps

elfin nebula
#

Wait how do I do that?

elfin nebula
simple ore
#

ctrl-c in the terminal window, then start it again

#

your trained models should be in the logs/model_name folder

elfin nebula
#

I got the trained models... Do i just pick one in random and that's <20k and see how it sounds?

knotty moth
elfin nebula
#

So okay, I got my model how do I test it in tts?

knotty moth
elfin nebula
simple ore
elfin nebula
simple ore
elfin nebula
#

I just need it in english

#

Local is preferable

simple ore
elfin nebula
green bramble
dense agate
#

Does anyone have any mobile apps or websites for uvr5? (Ultimate vocal remover v5) i usually use hugginface space for uvr4 but i want to extract with v5 and i cant find anything

low shard
#

you could also suggest @viscid moss to add the model to his ZeroGPU hf space if it doesn’t have it

brittle wing
#

does this server have LLM loras?

#

its like you train a hugging face language model to have a personality of a character, its much more effective than promoting and R.A.G

severe violet
short torrent
#

Is there any way to reduce cutting out while using the voice changer in w-okada deiteris?

low shard
simple ore
#

use python 3.11 or 3.12

#

based on the version there are slightly different steps required

signal lion
somber cobalt
#

what voice changer is best rn? the og one or the fork?

arctic urchin
#

where do i download the program that let's you change your voice, the github and website is too complicated?

viscid topaz
arctic urchin
#

so i can use the models on voice mod?

viscid topaz
#

Voicemod doesn’t let you load or train custom AI models like RVC

somber cobalt
#

this tg okada fork zip 2nd part is corrupted cat_seriously

somber cobalt
viral mason
#

It's not meant for that specifically, if u wanna be a cool character like Goku or whatever that's fine

#

Creep

simple ore
arctic moat
#

Voice keep doing weird stuff like alot of mini freeze
Rx 6800 and mid cpu like not bad
Ping gets very high (20K ms) after 2 min of talking .Any idea

viral mason
arctic moat
#

Told ya rx 6800

#

And for the voice its like

#

Lemme check

viral mason
#

Question, did you download it off a YouTube video

#

If yes it's super outdated and old

#

Like a year old

viral mason
arctic moat
#

AMD

#

Rn im tryning to download

#

applio

#

Idk if its bettea

viral mason
#

That's for training models

arctic moat
#

ooh

viral mason
#

Although it has recently added realtime it's not very functional

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

arctic moat
#

Best one with amd support ?

viral mason
#

U should download wokada deiteris

arctic moat
#

fork ?

#

Okay

viral mason
#

The tg wokada also supports amd but I have no experience at all with it

#

So I've got no idea what's different about it

viral mason
#

Download links are there once you scroll a bit, also download vac lite if you haven't already, it's at the beginning of the guide

arctic moat
#

Hum

#

Cant find VAC

#

lite

#

Oh wait

#

i already have it

#

Oh

#

Warned as high danger malware and has been deleted

arctic moat
#

I think i just extracted the wrong way

#

mmmhh

#

weirdd.

thorny swift
#

I want to try it on TS but idk how to. It wont work

arctic moat
#

Team speak ?

thorny swift
#

yes

arctic moat
#

Burh

thorny swift
#

the voice changing thing

arctic moat
#

Who still speak on TS

#

I mean there is you

#

So 1

thorny swift
#

I am playing gta rp and the server uses it

arctic moat
#

Wait

#

They dont have a discord ?

thorny swift
#

they dont use it. they use a plugin for TS. It works very well

arctic moat
#

Just change server its better xdddd

#

But

thorny swift
#

nah

arctic moat
#

Do you have VCB thing

thorny swift
#

yes

arctic moat
#

Well

#

idk

#

xddd

#

Did you selected VCB in the settings ?

#

For the mic ?

thorny swift
#

yes

arctic moat
#

And it dont work ?

#

Did you turn it on ?

#

xdd

#

@viral mason Less freeze but MS is still very very high

#

Like rn im at 8K ms

cyan copper
#

How do I find generic RVC models?

crisp nova
#

yo anybody know a good way to convert an image and an audio file to a video of the person talking? best if locally, and applicable to virtual characters (example: https://files.catbox.moe/hrud1f.png)

viral mason
#

Are you trying to be an e-girl or something 🙁

sonic garden
#

-colab

patent trellisBOT
# sonic garden -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

severe violet
#

is there better alternative to vac?

simple ore
#

Virtual Audio Cable v4.70 Lite

#

the only one

muted zodiac
#

Im using w-okada for voice change and I it was good for me when used on pc for vrcahat with less then 1 sec delay. now when I use vr head set I have like 8 sec delay what can I do to fix it?

forest vector
cloud finch
#

-applio

#

-help

#

-please?

torn quiver
#

can somebody help me with my problem really quick

torn quiver
#

why does the voice changer sound all glitchy and stuttering

young halo
#

i just opened the link, wtf

#

nvm, its working now

viral mason
#

It fixed itself tho

young halo
dense agate
dense agate
hallow thistle
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
hallow thistle
viscid moss
#

Meanwhile u can use it on Colab/Kaggle/Lightning.ai or locally

#

till it gets merged

hallow thistle
dense agate
viscid moss
dense agate
viscid moss
#

I'll ping u when done (On HF)

dense agate
hallow thistle
wind egret
#

I want to admit something, i have 0 idea how to use SD at all, i think i do but i actually don't, as prompting the way i did with NovelAI is not yielding similar result, i realize that maybe the models don't go well with prose / natural language and prefer tag-styled prompts, and i have been following trends on 4chan and civitai without knowing much, which was a big mistake on mmy part especially when i chose to use SwarmUI because it's new to me.

I set up old SD before and i think reason i mainly avoided using it for long is because of the gen speed, but with SwarmUI, i was able to gen faster than before even on 1280x832 res, but i still get bad results..

Also it doesn't help that youtube guides i feel like expect you to know more about it and often feel like a technical overview of the software instead of being helpful for user newbies.

#

I wanted to ai gen anime/furry art in western art style but they don't go very well in end result unlike novelai, i tried artist LORAs and Mol Keun model from civitai and i got very horrendous results that i don't think inpainting/segmenting will fix.

low shard
#

could you please elaborate more?

#

!howtoask

patent trellisBOT
# low shard !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 3060)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message is very helpful.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a lega, safe & ethical community, we will NOT provide help for:

  • (E girl, as an example) catfishing/trolling, scamming, impersonation.
  • NSFW/Porn.
  • Any illegal activities.
    Requests for these topics will be ignored and may result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
muted zodiac
short hazel
#

Guys

#

Whats best setting for voice changer to like record voice and edit?

old hawk
#

GPU: RTX 4050 6GB
Operating system: Windows 11
|I have been trying to use the Deiteris' W Okada Fork real time voice changer following this doc: https://docs.aihub.gg/realtime-voice-changer/local/deiteris-w-okada-fork/#help

The issue i am running into is that it doesn't seem to either pick up any audio from my mic OR is unable to convert it OR isn't able to generate any output. There can be even more possibilities but i can't 100% confirm any. Though if it helps, I have tried three different mics which all worked on their own but didn't work when i ran them through the realtime voicechanger. I used both VB-audiocable and the VAC provided in the doc itself while making sure the other one is deleted and restarting my device before and after the un/installation.

I did get this in the cmd which i'm not aware of what it means due to the lack of my knowledge and vocab in this area. Hope it's helpful:

2025-09-17 18:04:24.9061699 [W:onnxruntime:, transformer_memcpy.cc:74 onnxruntime::MemcpyTransformer::ApplyImpl] 2 Memcpy nodes are added to the graph main_graph for CUDAExecutionProvider. It might have negative impact on performance (including unable to run CUDA graph). Set session_options.log_severity_level=1 to see the detail logs before this message.

If anyone here knows anything about this, i'll be grateful. (sorry if the message is too long, i tried to make it well-documented so it won't be a hard time to understand my issue. If needed more info, i will provide.)

knotty moth
knotty moth
old hawk
#

I have downloaded the correct one from the doc as well

muted zodiac
knotty moth
muted zodiac
#

ok tnx I'll try when I get home

knotty moth
old hawk
# knotty moth is there any error message when you seek through the terminal?

the one that i provided was the only one, rest seems to be normal logs

2025-09-17 18:04:24.9061699 [W:onnxruntime:, transformer_memcpy.cc:74 onnxruntime::MemcpyTransformer::ApplyImpl] 2 Memcpy nodes are added to the graph main_graph for CUDAExecutionProvider. It might have negative impact on performance (including unable to run CUDA graph). Set session_options.log_severity_level=1 to see the detail logs before this message.
#

i've checked my volume and things as well and they seem fine since the mic works without any problem if used directly

knotty moth
#

to show the screenshot here, you need to first ask me for image perms

old hawk
knotty moth
old hawk
#

I have checked and they are the correct ones

knotty moth
old hawk
#

did everything the doc said, change my playback and recording to the default ones and restart my device

knotty moth
# old hawk yep

how is the perf meter and have you tried adjusting the volume & try few other models?

old hawk
#

I did try to tweak with the volume things but still had 0 result, as for perf meter im not sure whats that

knotty moth
#

if it outputs something it should show above -90 dB

old hawk
#

It does move but it has no effect whatsoever to any sound, like it moves in the same pattern indefinetly

old hawk
knotty moth
# old hawk it stays at -90db

for diagnostic purpose, first try stop the voice changer, then click passthru button & change it to red, and it should output your unprocessed voice from the mic

#

if it seems okay, then you can change the passthru back to green

old hawk
old hawk
#

with passthru

#

I have allowed the site my mic permissions and have checked my mic permissions in windows settings as well

knotty moth
#

what if you try passing the virtual cable output and do mic test in discord?

old hawk
#

i tried both monitoring and passing it through discord

#

while monitoring it from discord settings

knotty moth
old hawk
#

VAC first

#

then VB

knotty moth
# old hawk

then how about try audio server mode and select wasapi devices?

old hawk
#

is this correct

knotty moth
#

then if it sounds too noisy, you'd probably need some external noise suppression

old hawk
#

it seems to work now, thanks a lot!

#

i had spent like all my day today trying to look for the issue

#

thank you again

knotty moth
#

np that's good

wind egret
#

i'd like to make good ai gens image but prompting doesn't work out and not sure if it's the models, lora, etc

simple ore
golden walrus
#

guys, can i ask if i need transcript to train a model ?

knotty moth
golden walrus
#

i'm using fork

golden walrus
#

Codename's fork

knotty moth
golden walrus
#

oh, found it

#

let me read

muted zodiac
#

I got the other fork to work like the guide shows, but is there a recommend settings I should use like how many chunks and which F0 detector? also what else can I do for less delay when in VR without VR I have almost no delay with good quality. ty

low shard
low shard
low shard
# golden walrus <:POPOcat:1024965312539525130> oh, okay

Yeah that's the reason it's not in the ai hub docs, you seem to be confusing about the way RVC models are trained, you don't need to train them on text, RVC are Speech-To-Speech models, checking more the Applio and AI Hub Docs would help you understand better :)

If you were also thinking of a transcript you have to read out loud to train a model off your voice, there isn't a standard one that is universally better

#

I hope you understand, and for any issues let us know here :D

golden walrus
#

xd i just want to improve my model quality

muted zodiac
low shard
low shard
muted zodiac
golden walrus
#

but thank you for your time tho

knotty moth
golden walrus
knotty moth
#

btw it is not really a transcript if without proper timestamp

low shard
golden walrus
#

it has timestamp. just i wonder i can reduce the misspelling issue

#

and improve accent

knotty moth
golden walrus
#

well, i have to admit, i don't

#

i just need to know rvc don't need transcript

muted zodiac
low shard
# golden walrus any machine voice will fork up my model, i heard this from a wise man with a pan...

any machine voice will fork up my model
I wasn't talking about training your model with a robotic tts, I said that the other helper might have confused your request with a TTS Model (GPT-SoVITS) training request, since this is a General AI Server, not an RVC Server anymore

It was a miscommunication on both ends, but simply to put it: You don't need to read a specific transcript to increase your RVC model quality :)

i heard this from a wise man with a panda pfp
I'm not sure who you're talking about, but I'm guessing the ex staffer Razer, idk if he's also your friend with the transcript that you were talking about since I don't remember him saying that, but usually we never suggested a specific transcript that has some words to increase quality, I hope you get what I mean

knotty moth
# golden walrus well, i have to admit, i don't

btw rvc actually uses an embedder model which could affect the pronunciation. currently by default it is contentvec, and another best option so far is spin (v2). you can try it in the latest applio.
note that the model inference should be done using the same embedder model as the one used in training the model, otherwise the output voice might sound gibberish.

golden walrus
#

he just told me artificial voice will make my model sound worse

golden walrus
simple ore
#

using an output of TTS with voice cloning to produce more audio of the target voice

#

that generates artificial audio

#

you may train RVC model on that, but results wont be natural / bad

golden walrus
#

that

simple ore
#

i mean if you have no other choices that's the only way

golden walrus
#

i got expressive data like suggestion. but when i got the data from my friend, it contains transcript so i got confused. that's about it

low shard
golden walrus
#

but what is the different between training via applio and fork

patent trellisBOT
golden walrus
#

SCpeak no different right ?

knotty moth
low shard
simple ore
#

there are versions of hifigan that use PPG, but it is done automatically using Whisper ASR

simple ore
#

only TTS/ASR models are trained on audio+transcript

golden walrus
#

since i trained 3 in 32k but i haven't run into errors, or i might have ran into but i don't know if it's an error

golden walrus
#

i think it's the case, then i will drop the transcript aside

simple ore
#

ASR training takes noisy speech with music/sound effects or other stuff so it can learn to extract the actual speech

#

TTS are training on clean audio so it could reproduce the speech in a required voice

patent trellisBOT
# low shard -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

low shard
#

the 1st link is vonovox

#

the 2nd link is wokada tg-develop fork

#

the 3rd link is wokada deiteris fork

simple ore
#

RVC uses speech features extracted from audio, so it does not really care about what's being said or language

low shard
#

vonovox is much more suggested for your windows nvidia setup, because it's in active development unlike the other 2 i mentioned
I mean it's your choice what you want to use, I'm just making you aware, if you'd rather to still use wokada deiteris fork I can help you with any of the 3 programs

golden walrus
#

oh but i'm curious if RVC learn about speech feature but it failed to pronounce certain words

#

is it because the data don't contain that certain spelling ?

knotty moth
golden walrus
#

idk how to give example since it's my language SCpeak

knotty moth
golden walrus
#

i mean i know but i can't explain, reeeeeeeeeee. but i can say i do sound like American try to speak Vietnamese

crisp nova
#

yo what up anybody know a good "base" TTS model to generate the initial TTS to then convert to RVC? Needs to support german, for english ive already found kokoro which does all i want.

sharp sphinx
#

how i find here russian voice model

knotty moth
golden walrus
#

ủa. ả. these has the ? on their head

#

pretty much sound pretty wrong

#

like they try to add the r to the end of the word

knotty moth
golden walrus
#

ah ye, no, it just sounds incorrect, that's all

#

but spin somehow reduce the problem

leaden island
#

Does anyone here used Twilio before?

slow siren
#

how to upload image

viscid topaz
slow siren
#

not there

#

in vc

hallow thistle
viral mason
#

My bad this is ai hub I thought I was in the Vonovox chat, I'm too sleepy for this

hallow thistle
knotty moth
stiff idol
#

Hello everyone. I'm here with a request for help with a RVC error that pops up in CMD. I've trained a test model (no problem), but can't even test it due to weights unpickler error and I tried a few solutions from forum and chatGPT (mainly to explain basic concepts and such).
GPU: GTX 1650 4GB VRAM
The RVC I used: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
I run this in mini conda
Other things I have:

  • Python 3.10
  • pytorch 2.7.1 with GPU support (I had version 2.8.0 CPU)
    and somehow managed to fit the requrirements of all the other packages.

This is the error I encountered (also no .index file gets generated and I assume it should be generated after I click on Train Feature Index). Nothing happens when I click it.:
CMD output on pastebin: https://pastebin.com/k3L1a2KE

viral mason
#

they need to fix their damn code on here because since this update this keeps happening

#

nothing even shows up here to show what went wrong is just gives you the middle finger and stays silent

low shard
stiff idol
#

Thank you. I've already download Applio after reading through the first few docs pages. I've dealt with such problems before with ComfyUI nodes, but RVC is something new for me.

analog obsidian
#

mainline works without errors up to pytorch 2.5

#

after 2.6 those errors can be fixed by editing some files

stiff idol
#

Also, is there a general consensus on Pytorch version and such? Oh, rihgt, I saw downgrading to pytorch 2.5 works, but I didn't want to do that due to the requirements because I didn't want to start another environment and do everything anew.

analog obsidian
#

applio uses 2.7.1

stiff idol
#

that's perfect, also, may I ask if .index file is a requirement for a model or not? I get conflicting opinions when I search

analog obsidian
#

is a requirement if you want to apply to the model maker role

#

for casual usage not

#

thats where the accent of the speaker is stored

stiff idol
#

when I used the mainline it didn't generate the index file and the train index option didn't work as I mentioned, so I assume it's just broken

analog obsidian
#

besides that old version of torch, mainline also needs a specific old gradio version to work fine and a old matplotlib as well

#

this

stiff idol
analog obsidian
#

yup

stiff idol
#

it's been a horror for me

#

I'll be dreaming of banbilions of requirements to make VRC work.

analog obsidian
#

yeah mainline havent got a real update since 2023 due to the author giving up on the project

#

applio is literally the same as mainline, just with updated packages

#

but i always recommend mainline because it's the original thing

stiff idol
#

Thanks for help. I'll get back to trying and reading the resources for Applio. I'll have to get the basic understanding of what all the packages do.

Mainline didn't work for me, I guess.

analog obsidian
#

no need to install them manually

#

applio has an installer that does that automatically*

stiff idol
#

uggh, that's the idea I always got. With ComfyUI update it needed a git so I was installing and trying to troubleshoot manually. AI is just a tool, but yeah, thanks. Otherwise I'd still be lost.

analog obsidian
viral mason
#

did Nick get into murder drones recently or is he changing for halloween

ruby nimbus
#

Hi What page for create a song with IA artist

fleet nest
#

How to find the caseoh one?

#

So i can make a cover

cyan copper
#

I just want a generic voice to use, not specific to any character

viral mason
#

I guess just look around but all voices are based one someonehttps://discord.com/channels/1159260121998827560/1175430844685484042

cyan copper
#

You can still get a voice actor that is not famous and has a pretty generic voice

stiff idol
#

Hi, back again. I've got a quick question. Where are weights actually saved? I can't see it in the docs or am I just blind? If Applio is the same branch, then the model should be loaded from assets/weights folder (the voice model) same for .index file

stiff idol
paper flare
#

how to get a voice changer @viral mason

upbeat imp
#

so that someone could make me an RVC model, because I can't do it myself

paper flare
#

he didnt answer me

#

😭

viral mason
paper flare
viral mason
viral mason
paper flare
viral mason
# paper flare Nvidia

cool! there' three options u could download but if u want the best quality I'd use vonovox it's the first option

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

paper flare
#

pls wait for me it will take 15 minutes at least.

#

im new to this

viral mason
viral mason
#

I'm making a video on how to do it right now

#

nvm can't make the video it literally doesn't work rn I'm gonna jump off a building

low shard
viral mason
#

this error is making me suicidal

#

is there any code fix to it

low shard
#

I got into murder drones since recently

viral mason
#

ye it's cool

low shard
#

I didn't think about this for halloween lol

#

some days before halloween hazbin hotel season 2 drops tho

viral mason
#

I heard about that yea

low shard
#

just felt like changing a bit since some thought i was a bot and they say it was also bc of the "by Weights" 😭

viral mason
#

I may make a model of Alastor but not gonna post it because I don't wanna get shot in an alleyway for it

stiff idol
low shard
viral mason
#

u know anything about this btw Nick

low shard
viral mason
#

usually I just teach someone how to do it, bc I ain't making models for ppl

upbeat imp
viral mason
upbeat imp
#

i am stupid

#

xd

low shard
#

@viral mason I deleted the video of your issue because you leaked your Ngrok Token 😭
Have you tried using using Applio's Dataset creator instead in Kaggle? Instead of doing that whole thing uploading manually the dataset

low shard
low shard
viral mason
#

I haven't ever used that option before and don't know what it does or how to use it

#

Also oops

#

About the token thing

low shard
paper flare
#

@viral mason

#

i did it

#

is this saying cuda not avaieble ok?

#

is it ok to say this or just visual

#

like vonovox will work fine

low shard
paper flare
#

how i make it use my gpu

low shard
viral mason
#

Hm weird

paper flare
#

3080ti

#

thats the gpu

viral mason
#

Hopefully that'll work, I'll try it in an hour or so

low shard
# paper flare 3080ti

are you sure? you checked in task manager?

You can check your pc gpu on Windows via:

ctrl+shift+esc (task manager) -> Performance tab -> GPU

paper flare
#

i checked

#

bro i bough the pc myself

low shard
paper flare
#

yes

#

i have

low shard
paper flare
#

latest nvidia drivers

viral mason
#

Hmm

paper flare
#

latest windows

viral mason
#

That's not task manager

paper flare
#

task manager aswell

#

i use desktop

#

not laptop

#

so i can see my gpu aswell

viral mason
#

Weirdd

#

Idk why it won't accept your gpu then

#

Are you in the Vonovox discord?

paper flare
#

idk vonovox

low shard
#

Can you delete the runtime folder

#

and re-run the setup.bat

#

be sure it's in a folder without spaces or weird characters

#

and to not run it as admin

viral mason
paper flare
paper flare
#

in another program

#

i closed it

#

it was pythomn

#

python

#

my bad.

viral mason
#

Lol

paper flare
#

bruh

#

wym cuda not avaieble

#

i have cuda

#

in my gpu

paper flare
#

idk

#

lemme go install

paper flare
#

it says another version already installed

#

so yes.

#

i have

viral mason
#

But is it the same version or an older one?

paper flare
#

how i check

low shard
# paper flare

go into windows apps, check for Visual C++ 2015-2022, uninstall it, try to install the one I gave

paper flare
#

woah

#

wat i do?

low shard
# paper flare done

after you did that, delete the vonovox folder, redownload and re-run setup.bat

paper flare
#

okat

#

@viral mason

#

💔

#

still saying thi

viral mason
#

Uhhh

paper flare
#

u sure its not a false detection

viral mason
#

You may have to switch to wokada deiteris at this point

#

It's probably false

paper flare
#

ok how i setup

#

vonovox then

#

to see if it work good

#

im very new to ts

#

so its confusing

#

nvm

#

ts dont work

#

it says

#

no gpu avaible

#

crazy

viral mason
paper flare
#

it wont even detect

viral mason
paper flare
#

my gpu

#

💔

stiff idol
viral mason
# paper flare it wont even detect

I'd highly recommend joining the Vonovox discord server since you shouldn't be having this issue, the creator can see if he can figure the issue out with you himself

#

I can't send the link here because this server has a thing to avoid promoting stuff

stiff idol
#

in the environment where your python.exe is installed type python -c "import torch; print(torch.__version__)"

#

if you have e.g. pytorch2.8.0+CPU then it'll only ever run CPU and always write GPU not available or something similar

paper flare
stiff idol
#

I think this is alright too, it recognizes my GPU. but if it write CPU next to it, then you've got a CPU version of pytorch installed

paper flare
#

oh

stiff idol
#

so do this in the environment where the python is installed and where you run your instance of that program

paper flare
#

ill check

#

where i check?

low shard
paper flare
low shard
# paper flare

do you perhaps have any anti viruses? you should maybe try reinstalling with them off or with an exception

stiff idol
#

How does the folder structure look like? Could you send me a link for Vonovox?

low shard
#

it setups python in the runtime folder

#

@paper flare what if you go in the runtime folder, write cmd at the top of the file explorer path, then type python -c "import torch; print(torch.__version__)"

low shard
# paper flare

open "runtime" folder
on the top path bar, click cmd, then write the command I told you

paper flare
#

I opened, but i dont understand what is path bar

#

nvm i did it

#

@low shard

#

it says cpu

#

@stiff idol you were right

simple ore
paper flare
simple ore
#

if you have applio v3.5.0 installed already you can try realtime there

simple ore
#

nvm

simple ore
paper flare
simple ore
#
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 10.5M  100 10.5M    0     0  18.3M      0 --:--:-- --:--:-- --:--:-- 18.3M
Extracting
Creating directories
Installing pip...
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 2098k  100 2098k    0     0  16.4M      0 --:--:-- --:--:-- --:--:-- 16.6M
Collecting pip
  Using cached pip-25.2-py3-none-any.whl.metadata (4.7 kB)
Using cached pip-25.2-py3-none-any.whl (1.8 MB)
Installing collected packages: pip
  WARNING: The scripts pip.exe, pip3.12.exe and pip3.exe are installed in 'X:\vonovox\Vonovox-1.6.9\runtime\Scripts' which is not on PATH.
  Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location.
Successfully installed pip-25.2
Installing Packages
Looking in indexes: https://download.pytorch.org/whl/cu128
Collecting torch
  Downloading https://download.pytorch.org/whl/cu128/torch-2.8.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (29 kB)
Collecting torchvision
  Downloading https://download.pytorch.org/whl/cu128/torchvision-0.23.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (6.3 kB)
Collecting torchaudio
  Downloading https://download.pytorch.org/whl/cu128/torchaudio-2.8.0%2Bcu128-cp312-cp312-win_amd64.whl.metadata (7.4 kB)```
paper flare
#

yes i did setup.bat

simple ore
#

does it say this?

paper flare
#

idk

simple ore
#

cu128

paper flare
#

ill check

simple ore
#

it looks like the right cu128 version

paper flare
#

for better assist

simple ore
#

hold on

paper flare
#

if you have enough time of course

#

im at #vc1 channel

simple ore
#

if you got to windows explorer and paste %APPDATA% into the path, it should open Appdata/Roaming

#

is there python folder?

paper flare
#

this ye

simple ore
#

like that, press enter, is there Python311 or Python312 folder?

paper flare
#

nope

#

i think not

simple ore
#

okay, give me a sec

paper flare
#

oh ok

simple ore
#

after setup is done, go to runtime/lib/site-packages

#

and see what you got

paper flare
#

@simple ore there

simple ore
#

sort by name lol

paper flare
#

oh sec

#

there u go

torpid granite
#

The ai voice fusion on Applio collab doesn't seem to work

simple ore
#

delete runtime folder

#

open cmd.exe in Vonovox path

#

where setup.exe is, then type setup.bat >log.txt and press enter

paper flare
#

one sec

simple ore
#

let it install

paper flare
#

done i deleted

#

how do i open cmd in

#

vonovox path

#

i only see powershell when i reclick

simple ore
#

with windows explorer in that folder, type cmd.exe in the path

paper flare
#

oh ok

willow abyss
#

i cant run vonovox setup it's getting blocked by smart app control 😭

simple ore
paper flare
#

my explorer and urs

#

different or what

#

@simple ore

simple ore
#

let it run

paper flare
#

sorry for bieng little slow

paper flare
simple ore
#

click there

paper flare
#

yes

#

in the pc icon or

#

downloads

#

or maybe vonovox

viral mason
#

@low shard how do I kill myself

paper flare
#

i did it

simple ore
#

and type over that

paper flare
#

ok sec

#

its doing it

#

what i do?

low shard
simple ore
low shard
#

sorry to disturb via ping, I see you're busy rn too

paper flare
#

ok ill js wait

#

till it finish

simple ore
#

something is wrong

paper flare
#

o

paper flare
simple ore
paper flare
#

i though it was my issue

simple ore
#

can you try again without >log.txt

#

and screenshot the full error where it happens?

paper flare
#

setup.bat

#

like this

#

?

simple ore
#

yes

paper flare
#

i type this in the cmd

simple ore
#

somehow it fails to install main requirements

shy spruce
#

yea his windows installation does not want to correctly install pip packages. it was giving him cpu torch even though he specified cu118

paper flare
simple ore
#

but then the next one (pytorch ringformer?) installs 2.8.0 cpu

paper flare
#

scary red text

shy spruce
#

his windows has something weird going on

paper flare
simple ore
paper flare
#

well it finished but it still gave error

paper flare
shy spruce
#

i can try to package up the full installation including the runtime

paper flare
simple ore
#

I think it is the issue on downloading the runtime?

#

so it then tries to use system python and fails

shy spruce
#

it keeps installing different cpu versions of torch, even when manually using cu118

#

give me a bit and I will package the whole thing including the runtime and upload it to my HF

simple ore
paper flare
simple ore
#

log.txt did create a file in setup folder

paper flare
#

o

#

so what i do now?

#

thats my runtime rn

simple ore
low shard
paper flare
#

there u go

viral mason
low shard
viral mason
#

I tried switching diff tokens and same issue

shy spruce
#

@paper flare are you using stock windows from microsoft? or a 3rd party download / modified windows

#

ive noticed this issue so much with people who have modified versions of windows

low shard
simple ore
#

cell still running?

#

log showing anything weird?

low shard
paper flare
#

no i cracked windows cuz didnt want to pay for activation

low shard
paper flare
simple ore
#

get.activated.win does not break python

paper flare
#

bassically force activation for windows

low shard
simple ore
#

maybe try set PYTHONNOUSERSITE=1

shy spruce
#

if its the big one everyone uses from github, it wont break it

#

but some people download an "optimized windows"

simple ore
#

before running setup.bat

paper flare
#

at all

#

i didnt do these debloat optimized windows

#

stuff

shy spruce
#

i'm uploading a full version with the runtime, give me a little

paper flare
#

thank you so much

simple ore
low shard
shy spruce
#

im not sure why some users get this issue

#

i think the best i can do is host the full package with the runtime

low shard
viral mason
shy spruce
#

yea ive seen it only happen to 2-3 times. but im doing everything by the book so theres not much I can do to solve it

#

the best thing would be hosting the full package option on HF

low shard
simple ore
shy spruce
#

yea thats fine, on releases I can just upload the full package option. I'll prob just write a script to do it

#

with hf cli

shy spruce
paper flare
simple ore
low shard
# simple ore

how did it use 120 requests in a minute if i barely clicked something

simple ore
#

I've only clicked settings and tried to save precision

low shard
#

I was checking the network tab, and yeah same issue

low shard
# simple ore

if i hard refresh the app after like a minute of getting connection errored out, it seems to work all fine

#

@viral mason can u try this too

simple ore
#

no, same error immediately

viral mason
low shard
viral mason
simple ore
#

I think adding 'realtime' ui broke gradio lol

low shard
low shard
simple ore
#

176 .js files

low shard
simple ore
#

okay.. in case caching is disabled

#

F12 and uncheck

#

that may cache all that junk

low shard
#

@simple ore @viral mason this fixed it for me:

  1. open the link and click something to get the error
  2. hard refresh instantly after getting the error
  3. wait 1 minute exactly, then hard refresh again
simple ore
#

try v3.4.0 branch

#

see if that makes any difference

#

3.5.0 is realtime

viral mason
low shard
simple ore
viral mason
#

just replace the 5 with 4?

low shard
viral mason
simple ore
#

first try enabling cache

#

see if that helps

#

if not try 3.4.0 and see if the number of requests in ngrok log is less

low shard
#

so ima try 3.4.0

low shard
viral mason
low shard
simple ore
#

that perhaps may lower the number of requests

shy spruce
paper flare
paper flare
shy spruce
#

ill ping you as soon as its ready

#

its in my screen so i wont forget lol

paper flare
paper flare
#

im installing

shy spruce
#

running setup again might mess up your runtime

paper flare
#

ill delete old one then

shy spruce
paper flare
#

YES

shy spruce
#

i will just upload full versions with my releases

paper flare
#

w

#

what is

#

cpu affinity btw

shy spruce
# paper flare cpu affinity btw

it just uses less cores because a few versions ago we figured out using 100% of the cores causes lag for no performance gain

frigid echo
#

why does it say extension of file should be following pth onnx

azure vortex
#

Any way that i can get a voicechanger?? 🥹

paper flare
#

@shy spruce

#

it crashes afterwards

#

oh shit i set quality very low lemme rec again

paper flare
#

-rc

#

-rt

patent trellisBOT
# paper flare -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Deiteris Fork, with extra features, but supported only for Nvidia GPUs on Windows. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Most suggested WebUI with the best general support for many platforms. GUIDE

⚔️ Wokada Deiteris Fork vs Vonovox

For Windows Nvidia, Both Wokada Deiteris fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Deiteris Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

paper flare
#

omg

#

u online

#

yes

shy spruce
# paper flare

you sure that's your real mic? it doesn't have a real label just the default driver name and its at 96000kz

#

yea but im a little busy this sec

paper flare
#

look

#

there is no other

#

inputs

#

look

#

thats my mic

shy spruce
#

which mic is this thats its 96000hz? can you turn it to 48000?

paper flare
#

ok sec

#

is this

#

right

#

done

#

its 48000 now

#

still crashes

shy spruce
paper flare
#

O

#

WAIT

#

I FOUND ISSUE

#

LOOK

#

when i try run it as admin

#

look what it says

#

system cannot find path

shy spruce
#

you shouldnt be running it as admin, it looks for a different python installation when you do

#

make sure you have that microsoft package

paper flare
#

already

#

here

#

see

#

idk why vonovox dont work for me

analog obsidian
#

maybe try a non onedrive folder?

paper flare
knotty moth
paper flare
#

its not onedrive tho

knotty moth
#

not in the desktop folder

paper flare
#

oh ok

#

sec

paper flare
knotty moth
paper flare
#

to do that

#

:{

#

relax

#

idfk how to run it with cmd

#

im not tech guy like yall

knotty moth
paper flare
#

with cmd