#✨│ai-help


boreal creek
#

try launching Replay from PowerShell after setting $env:CUDA_LAUNCH_BLOCKING="1"
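For anyone who would rather do this from a script than from PowerShell, here is a minimal sketch of the same idea in Python. It copies the current environment, sets CUDA_LAUNCH_BLOCKING to "1" (which makes CUDA kernel launches synchronous, so an error is reported at the call that caused it), and starts a program with that environment. The helper names are made up for illustration and the executable path is a placeholder, not Replay's real install location.

```python
import os
import subprocess


def blocking_cuda_env(base_env=None):
    """Return a copy of the environment with CUDA_LAUNCH_BLOCKING set to "1"."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_LAUNCH_BLOCKING"] = "1"
    return env


def launch(exe_path):
    """Start a program with the modified environment (exe_path is a placeholder)."""
    return subprocess.Popen([exe_path], env=blocking_cuda_env())


# Hypothetical usage; replace the path with wherever the app is installed:
# launch(r"C:\path\to\Replay.exe")
```

This does not fix the launch failure by itself; it only makes the error message point at the operation that actually failed.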

#

GPT 5.5 says: If they want the fastest practical fixes, I’d say:

Reduce batch size to 1.
Lower resolution/model size.
Restart PC.
Update NVIDIA Studio or Game Ready driver.
Reinstall Replay/PyTorch environment if it still fails.
Try WSL2/Linux if Windows keeps producing CUDA launch failures.

#

it could be a lack of VRAM, or maybe you are using too much audio?

simple ore
#

your theory is weird

#

the goal of pretrain training is to make a model capable of predicting what a spectrogram would look like for a given speaker, phoneme, and pitch

#

a properly trained pretrain should be able to take (phonemes + pitch) extracted from the original audio, plus the speaker vector, and reconstruct something close enough to the original audio

#

and it should be able to make a good guess of how a different unseen audio would sound if it was said by a known speaker

#

when you finetune a pretrain using a new speaker it realigns into predicting how any input would sound with the new speaker's voice

#

chatgpt's take

#

generally what happens is that the model overfits and loses the ability to predict anything but the content of the finetuning dataset

gray dagger
#

moving replay to another drive seemed to do the trick

uneven tendon
#

the program works fine with the default voices, but whenever I upload a custom voice and swap to it, it doesn't work and then the client freezes. does anyone know this issue? no error in cmd, it just ends with...
[Voice Changer] Loading index...
Try loading... model_dir\5\added_IVF256_Flat_nprobe_1_CyreneAidenDawnHSR_v2.index

Version:
MMVCServerSIO_win_onnxdirectML-cuda_v.1.5.3.18a.zip

viral mason
#

default voices only come with very old versions of the voice changer

fringe heron
#

Predicting b to b right but not a to b reliably

low shard
#

This is a General AI Discord Server, please elaborate:

  • your pc gpu
  • your pc os
  • what are you trying to do: TTS, AI Covers, E Girl Trolling / Catfishing or Roleplay
  • the tutorial link used
viral mason
#

What voice model are you using, what gpu do you have? (Nvidia or AMD) and what are you wanting to do with each

small sorrel
#

do u have to upload your model to hugging face

low shard
#

Do you really need to use docker?

hallow thistle
placid cradle
#

weights Can't you use it?

light ermine
#

chat i need a repo id which supports text generation

#

where do i find it

#

im completely new to ai stuff btw

low shard
cosmic ravine
#

what voice changer do i use

hallow thistle
cosmic ravine
hallow thistle
cosmic ravine
#

how to get vonovox

hallow thistle
stark ether
#

Hello, how do I make an audio clip with the Bad Bunny 2022 UVST AI?

hallow thistle
cosmic ravine
hallow thistle
cosmic ravine
#

oke

#

ty

hallow thistle
#

Use WinRAR or 7-Zip to extract the zip content. Inside Vonovox folder, double click on start.bat to launch the program.

hallow thistle
cosmic ravine
#

i will be back when it done extract

devout dome
#

hey is there a way to setup playback on vonovox?

cosmic ravine
#

what do i do\

devout dome
low shard
low shard
devout dome
#

no no i used a random vid to pass the time while it opened

#

the tutorals i used were from him its working fine now i just dunno how to setup playback for the vc

#

like rn im using the 17_11 vonovox beta

low shard
devout dome
#

i meant "here" but it was one Your_Local_Worm sent on the discord.
Nvidia 5070
Win 11
Roleplay/jokin with friends

#

sorry* im still a lil tired so i keep messing up my words

north drift
#

Guys, can you help me pls?
[Voice Changer] warming up... generating sola buffer.
got this thing and nothing coming after

low shard
viral mason
#

This is much better looking, good job

low shard
viral mason
#

Lol

pliant stream
#

That does look good

low shard
# pliant stream That does look good

thx, btw I was a mod in weights too and don't remember you there (I was more active in AI HUB so maybe it's that), but also thanks for the recent help

#

I think that Weights role might be deleted one day 😭

viral mason
#

Maybe could keep it as nostalgia

low shard
pliant stream
#

I honestly dont remember who gave it to me but somebody did

viral mason
low shard
viral mason
#

Have they fixed the issue with the login thing where new users cannot use it or do you know

pliant stream
viral mason
#

What's moonwake?

pliant stream
low shard
low shard
low shard
pliant stream
#

It's going alright, i was told not to talk about DreamTavern so i cant say much about it on that topic lol

low shard
#

maybe let's not clutter the help channel, my bad for talking here

pliant stream
#

Yeah, fair enough lol.

viral mason
mild ridge
#

Are Vonovox and the Wokada forks web UI only, or can they be run locally like the original Wokada? I tried both and they launched the web UI.

low shard
mild ridge
#
  • Goal : Educational purpose / Roleplay
  • Specific Issue: Having major audio issues with Vonovox: random high pitches and audio not being picked up properly (choppy).
  • Full GPU Name:RTX 4070ti 12GB
  • Operating System: Windows11
  • Tutorial Link used: None / Previous knowledge.
cosmic ginkgo
#

for the voice models that are just a model, what am i supposed to input as the index for RVC?

cosmic ginkgo
#

will it work like that?

fringe heron
#

yes

cosmic ginkgo
#

oki ill give it a try ty :3

fringe heron
viral mason
cosmic ginkgo
#

what does the index affect?

cosmic ginkgo
#

i usually have the index setting turned all the way down on every model i use anyway

#

i had another question, is there any way to increase the quality of the voice other than the chunk?

fringe heron
cosmic ginkgo
#

upping the chunk makes the voice sound better but it takes soo long to translate

viral mason
#

if u have Nvidia u should use vonovox, it's the current best for realtime

fringe heron
cosmic ginkgo
#

i have nvidia

fringe heron
cosmic ginkgo
#

i think i use onnxdirectml

fringe heron
#

no that wouldnt sound great

cosmic ginkgo
#

cuda

viral mason
#

where did you download the voice changer from btw

cosmic ginkgo
fringe heron
cosmic ginkgo
#

*500

cosmic ginkgo
viral mason
cosmic ginkgo
#

why is vonovox better?

fringe heron
cosmic ginkgo
#

i think i get it

#

so i can just use the model and it will convert my voice anyway

#

like the index doesnt help much with that

cosmic ginkgo
#

?

#

okay cool

fringe heron
viral mason
cosmic ginkgo
viral mason
cosmic ginkgo
viral mason
fringe heron
viral mason
#

and for vonovox just run the start file

fringe heron
#

@viral mason

#

may i ask sum

viral mason
#

wassup

cosmic ginkgo
fringe heron
#

so since you have the model maker tag: i've always made my models by splitting the dataset into clips of 3 to 5 sec. is it bullshit to split them into 300 or 400 ms clips to "optimize for small chunks" when going realtime?

#

never really tried that

viral mason
#

the virtual cable

cosmic ginkgo
viral mason
viral mason
fringe heron
viral mason
#

oh

#

I haven't had a dataset over 2 hours

#

so I wouldn't know

fringe heron
#

yea but in principle

#

never tried small chunks?

#

always used the applio 1 sec min to 5?

cosmic ginkgo
#

is there anything i need to worry about, like having something installed or my pc specs, before i launch vonovox? sorry im not that well versed v.v

cosmic ginkgo
#

i got a warning about not having smth installed in the cmd as it launched

fringe heron
#

post it

cosmic ginkgo
#

the cmd went away soon after, ill try launching again

#

Warning: psutil not available, cannot set CPU affinity

fringe heron
#

wait

cosmic ginkgo
#

oki

fringe heron
#

you ran setup.bat right?

cosmic ginkgo
#

WAIT

#

i might be just stupid

#

i was trying to use the launcher

#

not start.bat

#

LMAO

fringe heron
#

its alright

cosmic ginkgo
#

i feel like a hacker rn

fringe heron
#

:)

#

@cosmic ginkgo hope you will find good sounding models for your voice

cosmic ginkgo
#

tyty

#

ill lyk when i get it working haha

fringe heron
#

you will need it

fringe heron
#

have fun

cosmic ginkgo
#

before u go

fringe heron
#

yk?

cosmic ginkgo
#

can i pretty much just import all the models ive been using straight over to this?

fringe heron
#

yes

cosmic ginkgo
#

okay sweet

fringe heron
#

even ping (atleast me)

cosmic ginkgo
#

okay ty :3

#

well in that case

#

for the vac

#

does this mean its going to become my default audio device?

fringe heron
#

yes

#

wait ill show you how to fix

cosmic ginkgo
#

okok

fringe heron
#

so you need to go here, click on audio in the control panel

#

then simply select the audio device you had before, or the one you always use, and click set default (for me it says "predefinito", which is Italian)

cosmic ginkgo
#

oh i seee

#

then just click my headphones and then set default

fringe heron
#

yes

cosmic ginkgo
#

will i need to do the same for my mic?

fringe heron
#

yes

cosmic ginkgo
#

okay sweet ty sm

fringe heron
#

np

cosmic ginkgo
#

also

#

is there an option for audio playback on vonovox

fringe heron
#

to hear the model output?

viral mason
#

Sorry for disappearing btw I had to go to the store

cosmic ginkgo
#

its oki hahah

fringe heron
cosmic ginkgo
#

oh yea i see

viral mason
#

Then you cannot use it in games tho nobody but you could hear it

fringe heron
#

yea indeed, its playback

cosmic ginkgo
#

oh ok i see

fringe heron
#

just change it later to your virtual cable

cosmic ginkgo
#

so there is no way to do both? bc on my old one i was able to

hardy yew
#

just do this for your virtual audio cable

fringe heron
#

input of the cable as the output, and in game just use the output of the virtual cable

hardy yew
#

and you will hear it

viral mason
fringe heron
viral mason
#

Then set output to line 1 in vonovox

#

Should work as intended

cosmic ginkgo
#

and then the effects i shouldnt have to mess with right?

fringe heron
#

maybe

cosmic ginkgo
#

but the backend and sample rate etc is there any best option?

fringe heron
#

i recommend keeping it as is

#

but

cosmic ginkgo
#

ok cool cool

viral mason
#

48k is fine, most default settings

#

Block size and pitch is all you'll need to change

fringe heron
#

for sample rate, use the model's output rate, or if you use the upscaler use 48khz

cosmic ginkgo
#

upscaler?

fringe heron
#

yea

#

there is a audio upscaler in vonovox

#

can turn anything up to 48khz

cosmic ginkgo
#

oh i see i see

fringe heron
#

(i recommend not really using it)

cosmic ginkgo
#

sweet

#

what is block size?

fringe heron
#

the chunk

cosmic ginkgo
#

ohhhh ok

#

and formant?

#

is there any dial for what it will pick up from my input?

viral mason
#

Just a pitch shifter, only for fun

fringe heron
#

yes

cosmic ginkgo
cosmic ginkgo
#

like a um

#

idk what u call it

viral mason
#

Pitch is for making it sound correct for your voice, formant is a pitch shifter

cosmic ginkgo
#

filter

#

ohhh okay

#

thats helpful

viral mason
#

Male to female 3-12 female to male same but negative

#

Male to male 0

#

Same female

#

Depends on if you're a guy or girl and the model being female or male

#

That's for pitch
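To put numbers on those pitch values, here is a small sketch, assuming the pitch setting is measured in semitones (the usual unit for this kind of control, but an assumption here since the chat doesn't name it). In 12-tone equal temperament a shift of n semitones multiplies frequency by 2^(n/12), so +12 doubles the pitch and -12 halves it. The function name is made up for illustration.

```python
def semitones_to_ratio(semitones):
    """Frequency ratio for a shift of `semitones` in 12-tone equal temperament."""
    return 2.0 ** (semitones / 12.0)


# Values in the range suggested above: 0 for same-sex conversion,
# positive for male to female, negative for female to male.
for shift in (-12, -3, 0, 3, 12):
    print(f"{shift:+d} semitones -> frequency x{semitones_to_ratio(shift):.3f}")
```

So the suggested 3 to 12 range for male to female corresponds to raising the pitch by roughly 19% up to a full octave.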

cosmic ginkgo
#

okay sweet

viral mason
#

Block size (chunk) should be fine at 0.30 or 0.35
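A rough sketch of what those block sizes mean in practice, assuming the block size is expressed in seconds and using the 48k sample rate mentioned earlier. The function names are made up for illustration, and the time the model needs to convert each block comes on top of the buffering delay computed here.

```python
def chunk_samples(chunk_seconds, sample_rate):
    """Number of audio samples collected per block."""
    return round(chunk_seconds * sample_rate)


def buffering_delay_ms(chunk_seconds):
    """Minimum delay added just by waiting for one full block of audio."""
    return chunk_seconds * 1000.0


for block in (0.30, 0.35):
    samples = chunk_samples(block, 48000)
    print(f"block {block:.2f}s -> {samples} samples, "
          f"at least {buffering_delay_ms(block):.0f} ms of delay")
```

A larger block gives the model more context per conversion, at the cost of a longer wait before you hear the result.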

cosmic ginkgo
#

is higher better quality? i assume

fringe heron
#

ye

cosmic ginkgo
#

sweeet

fringe heron
#

till a point tho

cosmic ginkgo
#

yea

fringe heron
#

for me some models higher would make it sound a bit odd

cosmic ginkgo
#

oh really?

fringe heron
#

but for me its always a pain to get them reliable on my voice

viral mason
fringe heron
#

or perceptually bad audio

viral mason
viral mason
#

Do you have a noisy environment or people talking/tv in the background

fringe heron
#

nope

viral mason
#

Odd

#

A fan?

fringe heron
#

no

#

most likely is my voice the problem

viral mason
#

Anything that could cause it

cosmic ginkgo
#

sometimes mine picks up my keyboard

viral mason
#

Same

viral mason
fringe heron
#

my physical voice

cosmic ginkgo
#

doesnt help i SMACK my keys ig

fringe heron
#

at infer as source

viral mason
#

You made a voice model of yourself?

fringe heron
#

no

#

what i mean

#

is that with any voice model, if i convert my voice with it and speak naturally, it sometimes ends up making weird sounds

#

unless the target is very similar to me

viral mason
#

That shouldn't happen

#

Is your voice uhh

fringe heron
#

(btw i am not a native english speaker and my english could be a bit broken) but using same-accent speaker targets and getting the same results is ass

viral mason
#

It could be picking up the accent yea

#

All models will have it

fringe heron
#

its the damn text encoder

#

i need to do more testings

cosmic ginkgo
#

i know mine does that

#

i have an australian accent so

#

some words sound really odd

fringe heron
#

bro i feel your pain

#

thats why i am trying to find a fix for this

cosmic ginkgo
#

and also if i laugh it always sounds like demons

fringe heron
#

and i might have found it

cosmic ginkgo
#

rlly?

fringe heron
fringe heron
cosmic ginkgo
#

yea haha

cosmic ginkgo
fringe heron
#

and requires quite some work

viral mason
cosmic ginkgo
#

my best bet has just been to adjust my voice when i use the rvcs and have my mic closer

fringe heron
#

i mean in theory its easy but requires a lot of work and i dont know how reliable it is right now

cosmic ginkgo
viral mason
fringe heron
viral mason
#

Yea as long as it's in there should be fine

fringe heron
#

but for me (from what i've looked into so far) the problem is that finetuning a pretrained breaks what makes the model "speaker invariant"

viral mason
#

?

fringe heron
#

you know how the model works?

#

here i wrote sum useful

viral mason
fringe heron
#

oh well

analog obsidian
#

perhaps it's not the te but contentvec

fringe heron
viral mason
#

I hope real spin is implemented to applio soon

fringe heron
#

even if contentvec is not perfect

#

the content is pretty much the same

#

and if you finetune the TE on the target then when sum else uses it that person is not the target

#

if it goes out of distribution the decoder struggles, or the output latent comes out weird because the model is not sure how it should be

#

i will test this better soon

proven hill
#

i always trained “vanilla” and got similar results to the “improved” versions

cosmic ginkgo
#

@fringe heron @viral mason ty sm for helping me out im having so much fun playing around with it :3333

viral mason
analog obsidian
analog obsidian
analog obsidian
#

so it behaves like a worse contentvec lol
ok to be fair, not worse, but the quality of the feature extraction is not as good as the original spin

fringe heron
#

btw i also realized this: you said you tried freezing the TextEnc but didnt notice improvements. maybe you kept the posterior enc free to update, which may have given the model an easy path to become "speaker variant" again, so you saw no improvements. although, thinking more about it, freezing it would probably only cause more problems.

raven bluff
#

where do i get the voicechanger

opaque pond
#

Please can I get a link to set up Vonovox? I’m on an Nvidia 4060 with 8 GB VRAM. What’s the best setup for a realtime voice changer?

viral mason
patent trellisBOT
# low shard !help-template
AI HUB | Technical Support Desk
📋 Required Help Template

To receive assistance, you must provide your system details. Copy and paste the block below into your reply and fill it out.

⚠️ NO INFO = NO HELP

👉 Fill this form:
- Goal (e.g., TTS, AI Covers, Roleplay):
- Specific Issue:
- Full GPU Name:
- Operating System:
- Tutorial Link used:
📍 Quick Checklist Before Asking

Check Docs: Many fixes are in the AI Hub Docs.
Be Specific: Say "RTX 3060 12GB", not just "NVIDIA".
English Only: Keep all discussions in English.

⚖️ Community Guidelines

• No assistance for NSFW/Porn or ANY Illegal Activities.
• Read the [Full Guidelines](#1402790586028789830 message).

viral mason
#

we don't need 3

hallow thistle
vale harbor
#

Is Vonovox having issues? It lags when I play games, and I'm using a 16 GB RTX 5070 Ti.

#

My internet connection is pretty good; the lag only goes away when I close Vonovox.

undone dune
#
  • Goal (e.g., TTS, AI Covers, Roleplay): just wanna sound like anime characters
  • Specific Issue: dont know what to download i wanna download voicechanger
  • Full GPU Name: Nvidia GeForce RTX 5070
  • Operating System: win 11
  • Tutorial Link used: https://www.youtube.com/watch?v=81KYc8AAmus
hallow thistle
hallow thistle
hallow thistle
ebon basin
#

any1 got a mobile voice changer that’s not bad

hallow thistle
lilac moss
#
  • Goal (e.g., TTS, AI Covers, Roleplay): Extended Pretraining of base rvc v2
  • Specific Issue: Not really sure how to do this, any guidance? Do I just plug in the D and G models into applio? A CLI option for this would be great..
  • Full GPU Name: A100 80GB
  • Operating System: Linux (specifically debian)
  • Tutorial Link used: N/A
misty trellis
#

Any way to get tts working in silly tavern?

#

The docs for it is a joke 🤣

misty trellis
#

Goal, get good tts be it local or using providers like eleven lab
Silly tavern and marinara engine has little to nothing guided and etc on tts
Gpu is rx6900xt 16gb vram
Os is windows 11
None

low shard
#

@shut yoke don’t promote

misty trellis
low shard
low shard
#

I wasn't talking about you

misty trellis
#

Oh my bad I saw original message deleted so I thought that was it 🤣

low shard
low shard
lilac moss
simple ore
#

you can finetune using a single voice.. or using 100+

#

if you dont use D/G weights and train using 100+ voices, it is called 'creating a pretrain'... nobody has been successful at creating a good one from scratch yet

lilac moss
simple ore
lilac moss
lilac moss
#

good words abt these pretrains

#

still, could you lmk how to do it?

#

docs aren't very helpful this time

simple ore
#

do you have 100+ different voices with a wide range of phonemes in their respective datasets?

#

with equally good audio quality?

#

if yes, then you can re-tune the og pretrain

lilac moss
#

yes

lilac moss
hallow thistle
#

Challenge accepted?

simple ore
#

and use this new g + original d

lilac moss
simple ore
#

you need to run this script and match the number of speakers in the prepared dataset

#

Applio can only adjust when you try from scratch

lilac moss
simple ore
#

you use the extended g and original d as a custom pretrain

#

if you're training locally, you can simply run a .bat file

#

env\python rvc\train\train.py VCTK_32k_SP1024 1 20 rvc\models\pretraineds\hifi-gan\f0G32k_emb129.pth rvc\models\pretraineds\hifi-gan\f0D32k.pth 0 16 32000 False True False False 5 False "HiFi-GAN" False

#

like this

lilac moss
simple ore
#

yes

lilac moss
simple ore
#
model_name = sys.argv[1]
save_every_epoch = int(sys.argv[2])
total_epoch = int(sys.argv[3])
pretrainG = sys.argv[4]
pretrainD = sys.argv[5]
gpus = sys.argv[6]
batch_size = int(sys.argv[7])
sample_rate = int(sys.argv[8])
save_only_latest = strtobool(sys.argv[9])
save_every_weights = strtobool(sys.argv[10])
cache_data_in_gpu = strtobool(sys.argv[11])
overtraining_detector = strtobool(sys.argv[12])
overtraining_threshold = int(sys.argv[13])
cleanup = strtobool(sys.argv[14])
vocoder = sys.argv[15]
checkpointing = strtobool(sys.argv[16])
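To make the positional command easier to read, here is a small sketch that pairs each value from the example command above with the argument name it lands in, following the sys.argv order in the excerpt. The helper names (`label_args`, `build_command`) are made up for illustration and are not part of Applio; the values are copied from the command posted above.

```python
# Argument names in the order train.py reads them from sys.argv[1:].
ARG_NAMES = [
    "model_name", "save_every_epoch", "total_epoch", "pretrainG", "pretrainD",
    "gpus", "batch_size", "sample_rate", "save_only_latest",
    "save_every_weights", "cache_data_in_gpu", "overtraining_detector",
    "overtraining_threshold", "cleanup", "vocoder", "checkpointing",
]

# Values taken from the example command in the chat.
EXAMPLE_VALUES = [
    "VCTK_32k_SP1024", "1", "20",
    r"rvc\models\pretraineds\hifi-gan\f0G32k_emb129.pth",
    r"rvc\models\pretraineds\hifi-gan\f0D32k.pth",
    "0", "16", "32000", "False", "True", "False", "False", "5", "False",
    "HiFi-GAN", "False",
]


def label_args(values):
    """Pair each positional value with its argument name."""
    if len(values) != len(ARG_NAMES):
        raise ValueError(f"expected {len(ARG_NAMES)} values, got {len(values)}")
    return dict(zip(ARG_NAMES, values))


def build_command(values, python=r"env\python", script=r"rvc\train\train.py"):
    """Build the argument list for subprocess.run (paths as in the chat example)."""
    return [python, script, *values]


for name, value in label_args(EXAMPLE_VALUES).items():
    print(f"{name:24} = {value}")
```

Read this way, the example saves every epoch, trains for 20 epochs with batch size 16 at 32000 Hz on GPU 0, and passes the extended G together with the original D as the custom pretrain.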
lilac moss
#

tysm! i was genuinely writing my own rvc train implementation and it was hurting my head so bad haha 😭😭

short panther
#

Do you have huge differences in AI use in your teams? Some are eager to learn new things, some are still about to explore ChatGPT... how to bridge the gap?

low shard
simple ore
#

normal users do not finetune datasets with 100+ speakers

#

if you use less than 110 you'll be fine 🙂

low shard
simple root
#

Can anyone guide me thoroughly on using Flow

lilac moss
simple ore
#

but if you want to, go ahead

hallow thistle
#

I just wanted to share this. There's a strategy where, if your laptop doesn't have a dedicated GPU, you use an online service like Kaggle or Google Colab, so in theory the voice changer should work perfectly. In practice, when I do this on my 2012 Dell laptop with a second-gen Intel Core i3 CPU, the audio still stutters because the CPU has to process the audio stream and other programs at the same time, and it is usually unbearable in actual use.

misty trellis
#

Goal, get good tts be it local or using providers like eleven lab, fish audio for example
to get Silly tavern and marinara engine with working tts
Gpu is rx6900xt 16gb vram
Os is windows 11
None

hallow thistle
#

So, if I actually have to run the voice changer, at this point I'd buy a new PC with a GPU instead.

#

I don't feel like I need help about this one, I already know how things work, even if I rarely run the voice changer for anything other than as a dummy program.

signal jetty
#

Hello can I post a repo to help me?

opaque pond
#

I did everything I saw online right but I’m still getting bad lag issues

opaque pond
hallow thistle
karmic trellis
#

Hi. I have been using a voice model for a while now (with vonovox beta, without any index file, because there was no index file attached to the voice model when i downloaded it), but its quality is sometimes not the best, is there anything i could do to make it better? Like training the voice more myself, enhancing it in any way? It sounds robotic sometimes. I am absolutely willing to spend time on making it better or to learn new things i dont know how to do now, I just want to ehance it somehow

hallow thistle
#

Why roleplay?

hallow thistle
#

Did you follow a guide or tutorial before?

viral mason
#

-rt

patent trellisBOT
# viral mason -rt
🔊 Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11, unlike other options that have wider support, and it has no cloud options.

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project.

• Applio Realtime

A Realtime Voice Changer with similar performance to Vonovox & Wokada Tg-Develop Fork, with extra features.

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore.

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested; the older versions shown in YouTube tutorials are even worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

lusty bear
#

hello, how can i fix my voice chopping off when using a model? it sounds good but every half a second it bugs. i have an rx6600 and a nice maono mic so idk whats causing this issue

viral mason
lime nova
viral mason
#

How do people literally disappear after asking for help, they must not be in a hurry

fringe heron
lusty bear
viral mason
#

W beans

lusty bear
#

anyways

#

im using

#

voice changer client

viral mason
#

that one is super outdated, what gpu do u have? (Nvidia or AMD)?

lusty bear
#

amd

#

rx6600

#

8gb

viral mason
#

k one sec I'll get u the downloads to the newest thing

lusty bear
#

both of these>

#

?

viral mason
#

yup

#

first one is the voice changer, second one is a virtual cable that connects it to games or discord ect

lusty bear
#

is VBcable bad?

viral mason
#

I actually have to go somewhere rn so I will not be available consistently. for quick setup, run setup64 for vac lite, and for tg fork run mmvcserversio

viral mason
lusty bear
#

alright

#

thanks

#

imma follow you up when i set it up

viral mason
proven hill
viral mason
#

let me hear it :3

proven hill
lusty bear
viral mason
#

👍

#

alr bro no rush

lusty bear
viral mason
#

did you close the command prompt?

lusty bear
#

no

viral mason
#

hmm

#

what are your microphone settings like?

#

input should be your regular mic and output should be line 1

lusty bear
#

command prompt says pipeline not initialized

#

yeah

#

they are correct

viral mason
#

do u have a model installed?

lusty bear
#

i have kanye

viral mason
#

hmm

#

my tg fork looks like this, any differences besides models and gpu?

lusty bear
#

do i use server

#

or client?

viral mason
#

client doesn't work for me at all but try both

lusty bear
#

still getting the same pipeline error

#

on both

#

[VoiceChangerManager] 'Pipeline is not initialized.'

fringe crag
#

hello

#

if i ask general questions to an offline model, how do i know that i have the right model for my build? Is there some sort of expected response time? For example, if i ask how much 1+1 is, should i get an answer in 1 sec or in 1 minute? How can i know this?

viral mason
outer sparrow
#

Hey plz where is the site to download the vc

low shard
low shard
#

!help-template

lusty bear
low shard
fringe crag
low shard
low shard
fringe crag
#

ok, thanks

shy mica
#

ai help me

#

i need to get gf

#

now

#

even ai cant help

proven hill
misty trellis
proven hill
lusty bear
#

maybe this one doesnt like me lol

viral mason
#

It's not as good as tg fork tho

lusty bear
#

idk

#

pipeline is a curse

low shard
molten spire
#

Is this the right channel to ask for pretrain model recommendations?

lusty bear
#

i had some

#

updates

#

the model is selected

#

doesnt have any special characters

low shard
# lusty bear updates

did you update everything, select an RVC Model before starting the server, and make sure there are no weird characters in the name?

low shard
#

could you send a screen recording of you starting the program?

lusty bear
#

like when i get the error or?

low shard
lusty bear
#

no

low shard
lusty bear
low shard
# lusty bear

have you checked if the same issue appears on every model?

If so, have you tried re-downloading wokada tg develop?

lusty bear
#

oh, one thing

#

the sample rate auto changes to 44100 when i start the server even when i set it to 48000

low shard
vale harbor
#

Is this lag normal when I open a game? It didn't used to happen to me before.

#

Full GPU Name: rtx 5070 ti 16 gb

  • Operating System: windows 11 pro
viral mason
#

hi again

#

have you tried turning the graphics of the game down?

vale harbor
vale harbor
viral mason
#

I don't think the cable is the issue

#

try turning down the graphics in game to see if it helps performance

vale harbor
viral mason
viral mason
vale harbor
vale harbor
viral mason
# vale harbor why

they're filters all they do is change how it can sound, no quality improvements or anything

vale harbor
viral mason
#

no, they're more to help with background noise. as for the eq, it modifies sound by increasing or decreasing the loudness of specific frequencies to improve clarity, or to alter the tone to be warmer, brighter, or less harsh.

#

that last part I took from google

vale harbor
viral mason
#

they could help then yes

#

I only really use the noise gate sometimes

vale harbor
viral mason
#

why would you do that?

bronze pier
#

are there any google colabs for training voice models that still work? or is Applio the only way

viral mason
graceful scaffold
#

hi guyssss

viral mason
#

heloo

graceful scaffold
#

whatcha talkin about/what can i help with :3

#

woah, ai master :O

#

have you mastered the arts of the shoggoth?

viral mason
#

hmmm? what's that? :3

graceful scaffold
#

anyway sad to see weights shut down last month, i recorded the final minutes

viral mason
#

at least replay is fully functional without needing an account now, only good thing weights ever made was that

hallow thistle
vale harbor
hallow thistle
vale harbor
hallow thistle
hallow thistle
vale harbor
graceful scaffold
#

question for yall

#

whats the max time on a google colab instance or whatever?

#

is it till i close the tab orrrr

viral mason
#

like 4 hours I think

#

kaggle gives 30 hours per week for free users ❤️

#

much better

#

and easier to monitor

hallow thistle
silent grove
#

It sucks that I have all these rvc voices saved but no program to make covers with them

#

All around just sucks

hallow thistle
#

There's Applio RVC.

silent grove
#

Never heard of it

#

It any good?

silent grove
#

Ui is a bit messy and cluttered but yea

#

I'll use this!

#

Thx daddy

silent grove
#

I mainly relied on weights

#

Like 2024 weights

viral mason
#

it was made by weights actually and works the same

silent grove
#

Back when that shit was super community driven

#

Ahh

#

Got a link?

viral mason
#

lemme look rq

silent grove
#

YOU FOOL

viral mason
#

u go to weights and the download is at the bottom

silent grove
#

Ah mb

#

You not-fool

hallow thistle
silent grove
#

Thx Mommy and daddy

#

Now go fight about custody or whatever

hallow thistle
#

For real though, Applio RVC can be a bit harder to use than Replay, but Applio RVC can also be used to train a voice model in one instance.

viral mason
#

very true

#

if he is just looking for a quick cover application replay is nice and requires very little skill to use

graceful scaffold
#

and see if its any better than aicovergen or an RVC webui

graceful scaffold
#

cant post images :<

#

nvm

#

THIS FAT PROGRAM LOL

#

btw guys i recommend "Mem Reduct" its pretty good and can free up RAM

hallow thistle
#

You still haven't told about your laptop specs.

simple raptor
#
  • Goal (e.g., TTS, AI Covers, Roleplay): LoRA training - Generation of reference images
  • Specific Issue: Can't get QwenVL custom node working
  • Full GPU Name: NVidia GeForce RTX 4080
  • Operating System: Windows 11
  • Tutorial Link used: https://www.youtube.com/watch?v=WRaOsu9TDEM&t=160s

I have tried for days to setup/configure my comfyui environment to be able to run QwenVL custom node and models. Running the workflow in the tutorial (approx. 20:34 in timeline) I get missing packages (accelerate) and incompatible package versions (torch) as well as not being able to find the correct GGUF models. I would appreciate assistance in getting this sorted. I've been using CoPilot and 'he' has been doing my head in going overboard with possibilities BUT he has identified that my biggest issue is package versions given I am trying to keep my environments as clean as possible. Can someone provide a 'pip freeze' of their environment, that would go a long way to helping? Thanks Steve

PS - can anyone identify the 3 x '..._Sub' nodes used in the ZIT, Qwen2512 and Flux Klein groups?
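Since the sticking point above is package versions, here is a minimal sketch for dumping the versions that matter from inside the ComfyUI Python environment, so two environments can be compared line by line. It only uses the standard library; the package names passed in at the bottom are just the ones named in the message (`torch`, `accelerate`), and the helper name `installed_versions` is made up for illustration.

```python
from importlib import metadata


def installed_versions(names):
    """Return {package: version string, or None if not installed}."""
    report = {}
    for name in names:
        try:
            report[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # Package is absent from the environment this script runs in.
            report[name] = None
    return report


if __name__ == "__main__":
    # Package list is an assumption: only the two mentioned in the message.
    for pkg, ver in installed_versions(["torch", "accelerate"]).items():
        print(f"{pkg}: {ver or 'MISSING'}")
```

Run it with the same Python interpreter ComfyUI uses, otherwise it reports on a different environment.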

graceful scaffold
viral mason
#

never heard of it

graceful scaffold
viral mason
#

:3

graceful scaffold
#

ima go 2 bed

hallow thistle
patent trellisBOT
#
🔊 Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features. It only supports Nvidia GPUs on Windows 10/11, unlike other options that have wider support, and it has no cloud options.

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork that adds some Quality of Life improvements like supporting Spin Embedder and Audio Effects. Don't expect too much from it since the creator originally made it as a personal project.

• Applio Realtime

A Realtime Voice Changer with similar performance to Vonovox & Wokada Tg-Develop Fork, with extra features.

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore.

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

if you're looking for the voice changer use vonovox for Nvidia gpus but for AMD u can't use Vonovox so use Wokada tg fork

buoyant thistle
#

good comms

#

lowkey got an old ass rig

viral mason
#

what is ur gpu?

buoyant thistle
#

1080 gtx

#

and i7 7700k i think

viral mason
#

oof

#

yea that gpu is kinda iffy for Vonovox

#

probably use Nvidia tg fork instead

buoyant thistle
#

i just deleted the original wokada cuz it says its not good

viral mason
buoyant thistle
#

finally got rid of it :P

#

after like 3 years

viral mason
#

🎉

#

what do u use it for btw, playing as Darth Vader or Goku or smth cool like that?

buoyant thistle
#

ya just some fun stuff, nothing professional

viral mason
#

thank god

#

so many weirdos join here to play as "girls"

#

everyone knows that means they wanna scam ppl

buoyant thistle
#

they want the method

viral mason
#

and we don't allow that here

buoyant thistle
#

oki so u recommend nvidia tg fork

viral mason
#

yea for ur gpu I'd say use that one

#

I'll get the links

buoyant thistle
#

i am upgrading to a 5070 ti and ryzen 7 9800x3d soon tho, im assuming i wouldnt use that nvidia tg fork?

viral mason
#

ooooh

#

I use the same setup I believe

#

get Vonovox once u get that new setup

buoyant thistle
#

doesnt matter if i have an amd cpu?

#

just use the gpu instead on it?

bronze pier
#

for Applio online tensorboard, what do i put for dataset path? i'm running on the public url since the private one doesnt work

viral mason
viral mason
#

colab? locally on pc? Kaggle?

bronze pier
#

I downloaded Applio and uploaded colab to my google drive and ran it there

viral mason
#

I am confused

buoyant thistle
bronze pier
#

im running this file on google colab

#

its in the Assets folder when i downloaded Applio

viral mason
#

wokada tg fork right?

buoyant thistle
#

yes

#

nvidia tg fork

viral mason
#

extract the first one, then place the second one in the folder for the first one. third link is a virtual audio cable that is used like VB cable to connect the voice changer to discord or games etc.

buoyant thistle
#

o ya i got the vb cable

#

oki fanks

viral mason
#

np!

#

if u have trouble or questions just ping me

buoyant thistle
devout dome
#

hey i keep seeing people talk about "crossfade" and "extra time" but i only see block size. is there any reason why?

#
  • Goal (e.g., TTS, AI Covers, Roleplay): Roleplay
  • Specific Issue: i keep seeing people talk about "crossfade" and "extra time" but i only see block size. is there any reason why?
  • Full GPU Name: RTX 5070 12GB
  • Operating System: Win 11
  • Tutorial Link used: None
viral mason
#

So those don't matter

devout dome
#

ahh i see

#

oh about block size is there like an area i should keep it at for it to not have too big a delay while still sounding ok

#

im using your teto model atm

devout dome
viral mason
#

More or less than that it kinda starts sounding bad

#

0.25 is also ok

#

I personally find 0.35 good as well
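To put rough numbers on those block sizes, here is a minimal sketch. It assumes the block size is given in seconds and that audio runs at 48 kHz; both are assumptions, the chat doesn't say which units or sample rate are in use. A block-based changer has to collect a full block before it can convert it, so the block length is a floor on the added delay, not the whole delay.

```python
def block_samples(block_seconds, sample_rate=48000):
    """Number of audio samples in one block (sample_rate is an assumed default)."""
    return round(block_seconds * sample_rate)


def min_added_delay_ms(block_seconds):
    """Lower bound on added delay: one full block must be buffered first."""
    return block_seconds * 1000.0


if __name__ == "__main__":
    for size in (0.25, 0.35):
        print(size, block_samples(size), "samples,",
              f"at least {min_added_delay_ms(size):.0f} ms")
```

Model inference time and the virtual audio cable add on top of this floor.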

devout dome
#

is this the same way while gaming?

viral mason
#

Ya

devout dome
#

aww i guess i cant post gifs yet ;-;

viral mason
#

Nope lol, gotta yap to level up

#

You're very welcome! If u have more questions or need help or anything u can message me here :3

devout dome
devout dome
#

@viral mason
sorry for bothering but I was looking at your GLaDOS model and I saw a fx file for fl studio. is there a way for me to get effects to work on vonovox so i could use it in real time or is that not possible rn?

low shard
frosty wolf
#

got a new pc, and it works now.

#

took a while

#

and im a girl now

hallow thistle
low shard
frosty wolf
#

trans

#

the message i replied to from myself was two years ago

hardy yew
#

Anyway, definitely possible

#

Not sure what he does (he's probably sleeping now and will respond later) but I guess 2 virtual audio cables would do the job

#

Unless FL studio ships with something like that built-in

#

Definitely higher delay than without FL in between

#

But opens up room for a ton of effects

#

And glados sounds great with that autotune

viral mason
#

I can't help showing the setup atm as I'm already in bed and turned my computer off

#

It also requires that VB cable doesn't kill itself upon use

#

Tried the same setup I have on my pc on my laptop and the one on laptop explodes and fizzles out

#

VB cable is strange

hardy yew
#

Ooh not asleep after all!

viral mason
#

Not yet

#

Getting there tho

hardy yew
#

If VB cable is a pain then paid VAC offers up to 256 cables so xD might be worth it for someone that needs it often

#

It's not expensive IIRC

viral mason
#

Icky

#

No money for my setup that's bad for business

#

It must stay free for the people

#

The uh, two paid voice effects don't count btw uhmm 👀

#

They're for personal use on models

hardy yew
#

Too late

#

xD

viral mason
#

Hehe beat you to it

#

But ye the autotune and ultrapitch are basically to make the voices sound more accurate

#

Autotune for Glados and Cyn and ultrapitch for anything droid related

hardy yew
#

Yee I remember the glados example was awesome

viral mason
#

:3

#

I really should remake some of those droids

#

They're funny

#

The b1 is hell to go through over an hour of audio from just 1 game

#

I want more of General Grievous in games but nobody ever focuses on clone wars era anymore

viral mason
viral mason
#

Seems I've talked too much, I tend to do that at night

#

Goodnight all

lusty bear
#

tysm!

low shard
lavish python
#

my gpu is 5070 12gb, will using average models increase much input latency on games like valorant

lavish python
#

is it that much latency to consider?

pulsar token
#

a little bit yeah, but I doubt it will be noticeable if u put fps limit

lavish python
#

on average models

lavish python
#

this one for example

#

the game consumes only 20-30% gpu usage

#

is input latency affected if i use in game?

hallow thistle
#

Applio RVC, Vonovox or W-Okada voice changer?

lavish python
#

wokada

hallow thistle
#

Which version is your W-Okada?

lavish python
#

v.2.2.2-beta

viral mason
#

Old and outdated as hell

lavish python
#

for example

hallow thistle
#

Vonovox and Tg Develop's voice changer (b2397) are the only known voice changers that work with the NVIDIA GeForce RTX 50 series.

viral mason
# lavish python for example

U should switch to Vonovox, with your gpu it'll run much better and it's the current best real-time voice changer

lavish python
#

oh alr

lavish python
#

i need an answer for this

hallow thistle
#

That's basically another RVC voice model. Almost every RVC model works pretty much the same. The audio latency depends on more than just the settings.

lavish python
#

yeah i gonna research it deeper later

#

but before that

#

i have to know

#

using voice changer will increase input latency or not?

viral mason
#

Like Namari said it's just another voice model, doesn't affect how the voice changer would work

lavish python
#

i dont mind audio latency

viral mason
lavish python
#

input latency from mouse i mean

hallow thistle
lavish python
#

i thought using high graphics pushes gpu usage further and makes latency higher

#

so using voice changer would be the same way

#

voice changer also pushes gpu usage further

viral mason
#

Don't use high graphics when using a voice changer and gaming

#

It's not good for either program

hardy yew
lavish python
#

valorant usage on gpu is only 20-30%

#

so it's totally fine to use a voice changer alongside it

#

is this right?

hardy yew
#

Most likely yeah, if your usage is up to 30% then you definitely already limit framerate

#

Should be good

viral mason
#

Cap fps to 60

#

Should be fine then

hardy yew
#

But anyway, just run the voicechanger and if you still have a bit headroom it's all good

#

If not, lower the settings or further limit the framerate

#

Anything that will lower the load where it struggles

lavish python
#

oh thank yall for answer

#

i found the gpu sometimes jumps up to 50% in combat

#

any way to limit gpu usage of voice changer?

#

so i wont get laggy on combat

hallow thistle
#

Make sure to try Vonovox.

devout dome
lavish python
bronze pier
#

Whats the easiest way to train a voice model besides Applio?

viral mason
viral mason
stable basalt
#

What is the best free rvc to use rn

viral mason
#

Also what gpu do you have (Nvidia or AMD)

#

And what are u wanting to use it for?

stable basalt
#

Nvidia

stable basalt
viral mason
#

Aw :c

stable basalt
#

With friends

viral mason
stable basalt
#

What

hallow thistle
devout dome
viral mason
#

Ye I saw this art on insta and thought it looked very beautiful

#

The full image is very nice

stable basalt
#

What is the best free rvc to use rn

viral mason
stable basalt
#

What statement

devout dome
#

for me its usually vocaloid stuff

viral mason
stable basalt
#

Is this any of your business?

viral mason
viral mason
devout dome
stable basalt
#

I am not

viral mason
#

Then explain

stable basalt
#

Was gonna use markiplier voice

viral mason
#

Oh

#

Ok you're fine then

hallow thistle
#

You could say that.

stable basalt
#

Can i get image perms

#

To send here

viral mason
#

I'm not a mod sadly so I cannot help with that

#

But the best Nvidia voice changer to use for free is Vonovox

devout dome
stable basalt
#

Got download link?

viral mason
#

So we just go like eww to whoever makes the model and move on

#

If I was a mod I'd delete the links to all e-girl models immediately

#

If the weirdos need them so bad go find them yourself

viral mason
devout dome
#

i say technical e-girl because arent there some that are quite literally just a female voice

viral mason
#

That's different

devout dome
#

like i think callie? idk

viral mason
#

They are actually cool unlike "whispery mommy girl voices"

hallow thistle
devout dome
#

it was when local said if he could he would remove the other e-girl voices

#

and i more than likely didnt understand what that truly meant cuz isnt an e-girl just a girl on the internet

#

but then he went further to explain that he meant the "whispery mommy" shit

#

which tbh is understandable

viral mason
#

Second image taken from a post here in voice models

#

First one is cool and very interesting, second one makes my skin crawl

devout dome
#

i think i might get it

#

maybe

hallow thistle
#

There's a difference between an "E-girl" voice model that basically sounds like an actual woman and those funny Vocaloid/UTAU voicebanks. Hope this clears up a bit.

viral mason
#

While they might both be girls one is definite and real and the other is bait

devout dome
hallow thistle
viral mason
#

Who is this shark girl

devout dome
#

hold on lemme find

viral mason
#

What is ZZZ

#

Did someone fall asleep

hallow thistle
devout dome
#

really? oh you know that does make sense now that i think about it

delicate cradle
#

yea

devout dome
#

all the vids of people going in cs lobbies or other games with a "e-girl" voice changer

delicate cradle
#

we used to have a commissions channel and some people would like center their "shop" around egirl voices

#

the old server owner also made a egirl voice

viral mason
#

Ick

delicate cradle
#

a now very bad and not properly made pretrain was made to make them better

delicate cradle
viral mason
#

Find a screenshot

#

Prove this vile claim

devout dome
#

mmm ok i was just curious cuz it reminded me that egirl voices are looked down upon

delicate cradle
viral mason
#

They are all the same too, no personality

#

Just freaks

devout dome
#

well one more question before I go that isnt a moral one

#

is it better to use models with a high amount of epochs or low (high = >150, low = <=100)? i dont really know where it truly is high or low so im just airballin it

viral mason
#

O

#

Higher or lower doesn't equal better

devout dome
#

what does it mean?

viral mason
#

Just how many times the model saw the dataset until it sounded good to whoever made it

devout dome
#

oh thats actually pretty cool

viral mason
#

A model could sound better at 100 but somehow get worse at 130

#

And then better again at 150

#

It's funny
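The point about epochs can be sketched as: rate each saved checkpoint by ear and keep the best one, instead of assuming the highest epoch wins. The scores below are made-up placeholders that mirror the 100 / 130 / 150 example, and `best_checkpoint` is a hypothetical helper, not part of any RVC tool.

```python
def best_checkpoint(scores):
    """Pick the epoch with the highest rating; on ties, prefer the earlier epoch.

    scores: {epoch_number: listening rating}, ratings are whatever scale you use.
    """
    # sorted() walks epochs in ascending order and max() keeps the first
    # maximum it sees, so an earlier epoch wins a tie.
    return max(sorted(scores), key=lambda epoch: scores[epoch])


if __name__ == "__main__":
    # Hypothetical ratings: good at 100, worse at 130, better again at 150.
    ratings = {100: 7, 130: 5, 150: 8}
    print("keep epoch", best_checkpoint(ratings))
```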

devout dome
#

ohh like that ai walking experiment where that starts at gen 1 until whatever and starts walking but then could just start flopping outta nowhere

#

ok well ty again :D

viral mason
#

Of course:D

#

Once I start my pc up I'll show my setup btw

#

I'm eating breakfast

bronze pier
#

when I click run-applio.bat how long should it take for the window to show up in the browser?

viral mason
#

Idk I use applio on kaggle

devout dome
viral mason
#

Ah school, trauma is already flashing through my head like war

#

I'll get to business then before I start having a mental breakdown

devout dome
devout dome
#

lol

bronze pier
viral mason
#

make sure to use that ping if you need help quickly

bronze pier
#

ah thank u

stable basalt
#

@viral mason is there a tutorial for it

viral mason
#

no all tutorials on yt are outdated and do not use the new stuff

#

for vonovox just run start

#

and vac lite run setup64 then install driver

night condor
#

heyyy

viral mason
night condor
#

yehh'

viral mason
#

what with?

rugged thicket
#

Hey, can I ask about co-founders here?

night condor
viral mason
#

oh sorry no

#

I don't, maybe someone else does tho

#

@soft karma what in particular are you looking for in rvc?

#

I saw ur messages in ai chat

soft karma
#

yeah, just asking tbh. I don't know if anything really changed or if is there another model like it
it's just fun to hear specific characters singing or saying something i want

#

i've heard the TTS space is getting a lot, but not sure if there's a better alternative for what rvc does

viral mason
#

the only new improvements really are applio being what is used for training, two new good pretrains (legacy core 1.5 and 1.6) plus the pabp which is also decent, and for realtime Vonovox is the best (Nvidia only)

frail ledge
frail ledge
soft karma
frail ledge
#

@viral mason Can you help me? I can't view the channel.

viral mason
viral mason
frail ledge
viral mason
soft karma
viral mason
#

Applio is the best for that

#

-rvc

patent trellisBOT
viral mason
#

it can be run either locally on pc or in the browser, like on kaggle or google colab

soft karma
#

yeah, i remember using it in the past. Even training some models. Just wanted to ask if anything better released since then

viral mason
#

nah Applio is still the main one

#

other stuff exists probably but unsure of it

rapid echo
#

how do i fix Applio port issues? i just get "Failed to launch on port 6969" etc
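One thing worth ruling out with an error like that is another process already holding the port. A minimal sketch using only the Python standard library; the helper names and the search range are made up for illustration, and this only diagnoses the port, it doesn't change how Applio launches.

```python
import socket


def port_is_free(port, host="127.0.0.1"):
    """True if we can bind host:port right now, False if something holds it."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            return False
        return True


def first_free_port(start=6969, tries=20, host="127.0.0.1"):
    """Return the first free port in [start, start + tries), or None."""
    for port in range(start, start + tries):
        if port_is_free(port, host):
            return port
    return None


if __name__ == "__main__":
    print("6969 free:", port_is_free(6969))
    print("first free from 6969:", first_free_port())
```

If 6969 shows as taken, a leftover Applio window or another app is likely still running on it.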

delicate cradle