#✨│ai-help

1 messages · Page 313 of 1

knotty moth
#

both files on yours seem at the Downloads, so let them both there, and then do extract

nocturne marsh
#

I did that

#

and my files page doesn't look like that for sure

hallow thistle
nocturne marsh
#

and they weren't in the same folder

acoustic bluff
hallow thistle
acoustic bluff
#

i set to it 48000 hz

#

when I click start server

#

it goes back to 41000 hz

knotty moth
#

tg-develop fork may have a bug to show back to 44.1k for the server mode

nocturne marsh
#

oh wait I'm in

#

it took a few minutes to load

knotty moth
acoustic bluff
#

i did

#

it doesnt work

#

tried everything

#

same error

candid crater
#

what does it mean select audio inut

#

input

nocturne marsh
#

is adding an index important or only thr path file?

hallow thistle
#

When more than 2 people asking for voice changer in here at the same time, it makes things messy for me to provide an answer.

candid crater
#

Sorry.

hallow thistle
# acoustic bluff it doesnt work

Close or Ctrl + C the voice changer's terminal, go to your MMVCServerSIO folder, open stored_setting.json file with Notepad, edit every sample rate to "48000", click save, and relaunch the program again.

hallow thistle
acoustic bluff
#

still getting the same error

#

sample rate is set and staying on 48000hz now tho

#

does the same thing on both

hallow thistle
acoustic bluff
#

if i do passthrough on it works for both 44100hz and 48000hz

candid crater
#

My turn?

acoustic bluff
#

I'll get chrome and try on chrome now but I don't think it's a browser issue

#

am I missing files?

#
    result, vol = self.process_audio(audio_in)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "voice_changer\VoiceChangerV2.py", line 133, in process_audio
    audio, vol = self.vcmodel.inference(audio_in)
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "voice_changer\RVC\RVCr2.py", line 226, in inference
    raise PipelineNotInitializedException()
Exceptions.PipelineNotInitializedException: 'Pipeline is not initialized.'```
nocturne marsh
acoustic bluff
nocturne marsh
#

I'll ask someone else

#

while you're busy

#

the voice sounds pretty laggy, how can I fix it to become more stable?

candid crater
#

fish

nocturne marsh
#

then imma wait

candid crater
#

What do i do here

nocturne marsh
acoustic bluff
nocturne marsh
#

look at the input device tab

acoustic bluff
nocturne marsh
#

on the right side

nocturne marsh
#

I have the same

candid crater
nocturne marsh
#

then you should be good

hallow thistle
#

I observed for myself, the program worked for me even my laptop is old and has no GPU, of course the perf number is over 30000 ms which is crazy. I don't see any "pipeline not initialized" error so far.

nocturne marsh
#

@acoustic bluff

#

go to _internal file

nocturne marsh
#

and scroll down to where you see a list of folders

#

thats where it should be

nocturne marsh
candid crater
nocturne marsh
#

I had the same thing so I just did server

candid crater
nocturne marsh
#

oh shoot

#

yeah I had the same issue with client

#

so we would be on the same page

hallow thistle
#

The "pipeline not initialized" error sometimes has to happen with a non-RVC voice model (like a Beatrice voice model) being loaded to W-Okada voice changer fork (which only supports RVC voice model), your GPU's driver is outdated or something else.

candid crater
#

dam

acoustic bluff
nocturne marsh
nocturne marsh
hallow thistle
#

The question is: why the program works for me the first try while y'all keep struggling?

candid crater
acoustic bluff
#

I got models from that channel

#

already

nocturne marsh
nocturne marsh
candid crater
#

alr

hallow thistle
nocturne marsh
#

it happens when you start server

#

I was using RMVPE ONX

#

do I not?

candid crater
#

it dosent work i switch still

nocturne marsh
#

if I knew I would help man

#

you sure you can't use server

candid crater
candid crater
hallow thistle
nocturne marsh
candid crater
#

No

hallow thistle
candid crater
nocturne marsh
#

the hell why isn't the settings saving

#

nevermind it was just the sample rate

#

keeps going back to the default after I change it to 48000

#

weirdo

hallow thistle
nocturne marsh
#

sounds laggy a little and robotic

grim ravine
#

hello. i did everything correctly i think but i hear my changed voice how do i close it

hallow thistle
hallow thistle
hallow thistle
nocturne marsh
#

ah I changed the pitch, a little better now

acoustic bluff
#

do you think getting spin helps

#

idk rip i'll just use main outdated one

#

this ain't working

nocturne marsh
#

sounds so robotic

#

HUH

#

bro I refreshed my page

#

and I can't use server anymore

#

the hell

#

oh

#

tf

#

it's normal now

candid crater
#

/:

nocturne marsh
#

why can't I enable noise suppresiuon?

candid crater
#

idk what to do anymore bro

twin rain
#

i cannot find my gpu on processing unit tap

#

how i can fix it ?

nocturne marsh
#

my voice sounds so robotic, a little choppy and so weird, any way to fix it to make it more realistic?

grim ravine
#

its not letting me use gpu i can only use cpu why?

celest violet
#

after that? just unzip normal??

#

.

#

or before unzip i must do any? @hallow thistle

hallow thistle
hallow thistle
celest violet
#

in that folder, appear MMVCServerSIO + 2 file Force GPU Clocks

#

after that?

hallow thistle
celest violet
#

ok

#

rmvpe auto download?

knotty moth
celest violet
#

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'set_sampling_rate'

#

why this appear on black window

merry forge
#

everytime i upload a voice

#

it dont sound right

#

it sounds nothing like the audio samples people post

#

so idk whats going on

viscid scarab
#

Help me , I have voices in my head

royal vapor
#

I'm trying to get the best quality and least delay using RVC Client. I've seen the guide showing recommended settings for xx50/xx60/xx60TI/xx70 cards etc.. Does it also matter which series these are? 2060, 3060, 4060? Could I get a 2060 card and get the same result as a 3060 or 4060? The card is not being used for anything else

serene swallow
#

Im using 1.7 and it doesnt have the settings like in ur pic

#

Could it be that the author of Vonovox removed features instead of adding in the latest update?

proud wagon
#

any help to humanize ai text

lunar trout
#

does anyone know how to fix the thing where when your using your mic
all of your audio sounds like your under water

brave axle
#

it seems to work but it doesnt detect my mic at all even though i put my correct microphone, and every single app hears me besides okada

knotty moth
proud wagon
knotty moth
knotty moth
#

sorry but we couldn't support for such abusive purposes

proud wagon
#

ur talking like im a med student

knotty moth
daring hatch
#

Repxic: can someone help
seiso💖: you're incompetent

knotty moth
inland field
#

ModuleNotFoundError: No module named 'pyngrok'
""COLAB""

nocturne mural
inland field
knotty moth
nocturne mural
inland field
#

Yes
full error
/content/voice-changer/server

ModuleNotFoundError Traceback (most recent call last)
/tmp/ipython-input-341872215.py in <cell line: 0>()
22 get_ipython().run_line_magic('cd', '/content/voice-changer/server')
23
---> 24 from pyngrok import conf, ngrok
25 MyConfig = conf.PyngrokConfig()
26 MyConfig.auth_token = Token

ModuleNotFoundError: No module named 'pyngrok'


NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.

To view examples of installing some common dependencies, click the
"Open Examples" button below.

#

I think solution may be by adding !pip install pyngrok
to notebook

nocturne mural
nocturne mural
#

-realtime

patent trellisBOT
# nocturne mural -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE

⚔️ Wokada Tg-Develop Fork vs Vonovox

For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

inland field
#

Could I run it locally with amd r7 200?

nocturne mural
#

AMD is not very good for running RVC properly, but I don’t think you’ll get much out of only 2 GB of vram.

#

Try that notebook; it might work for you since it keeps the same interface as the original w-okada, just with some changes.

hallow thistle
patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
hallow thistle
hallow thistle
#

While the guide of Deiteris' W-Okada lists several recommended chunk and extra settings for known GPUs there, these "max settings" sound more of "known average" settings rather than the absolute one, and while most use "extra" value at 2.7 s, the actual chunk value may be raised over supposed chunk value in a real scenario if run along with a graphic-demanding game.

white shadow
#

anyone else having issues with hugging face

#

nvm we good ig

rapid sable
#

hey i keep getting an startup error about web server

#

any fix?

warm oriole
#

Any good Bangla (bangali) TTS out there?

#

???

rocky isle
#

Do you have to do this for Vonovox as well or is it meant to be done only for W-Okada?

“1 - Open Task Manager, click "Details"

2 - Right-click audiodg.exe and set the priority to "High"

3 - Right-click again and choose "Set affinity" then select only CPU 2.”

lunar trout
#

does anyone know how to fix the thing where when your using your mic
all of your audio sounds like your under water

lavish maple
#

Runway Gen-4.5

#

just donwload that on ur computer and run it

#

its the best one on the market apperenrtly so u will have to tweak and overclock ur systems if u want max of it

#

i was wrong

#

its licesned so its not open source anymore or it wasnt ever

#

so jsut use their website to us eit on cloud

#

ur gpu is well enough tbh

viral mason
viral mason
#

I don't understand what u mean

rocky isle
# viral mason I don't understand what u mean

I have screenshots showing the process actually had 1.9 GB of data in the Working Set (RAM) initially. Then it dropped to 300 MB.

Since the app didn't crash, that missing 1.6 GB of data had to go somewhere. It wasn't just 'empty reserved' space because it was literally sitting in RAM a few minutes prior. Windows definitely PAGED it out.

The thing is, audio still sounds the same and delay hasn’t changed so I don’t know what’s going on here.

viral mason
rocky isle
#

Yes, but the same thing also happens with vonovox and Deiteris W Okada

viral mason
#

that's odd

rocky isle
#

with all versions basically lol

viral mason
#

but that's weird that they all do that

rocky isle
#

yeah i’m using vonovox. Shouldn’t audio be impacted when some of it is running on your ssd instead of your ram though? cause this is happening to me and it still sounds the same lol

viral mason
#

I wouldn't know as I don't know the impacts of using it on different stuff

#

I keep whatever voice changer I use in my downloads

limber tangle
#

What settings do you recommend for RVC? My dataset is studio quality, but the resulting product has excessive metallic artifacts in the letter “s.” (t de esser)

viral mason
limber tangle
viral mason
#

personally using the 32k version has gotten good results every time

#

klm 4 is kinda old if I am remembering correctly

limber tangle
#

hmm

limber tangle
viral mason
#

it has many different versions, 32k, 40k, and 48k, as well as spinv2 I think and also Refinegan

limber tangle
#

Does it work properly for Turkish too?

viral mason
#

I wouldn't know as I speak english but you can try ^^

limber tangle
#

Well thank you for your help I'll give it a try

viral mason
alpine lotus
#

so uhhh gradio just killswitches itself when you try to open the public url on colab

#

idk if it's a me issue but this didn't happen when i was about to train a model on colab earlier

robust mirage
#

guys what voice changer should i take? i have rtx 4060 laptop gpu, should i get local or cloud instead?

alpine lotus
brave axle
#

i just followed the guide and it still cant hear me at all

elder kelp
#

im tryna use the L voice, but how I configure the settings and get the best ones?

viral mason
viral mason
# elder kelp no?

what version are u using, it'll be easier to help if I know what one u have

#

the only 3 that tare recommended that are up to date is wokada deiteris, wokada tg fork, and the best currently vonovox

crimson depot
#

@viscid moss

tame oracle
#

-rt

patent trellisBOT
# tame oracle -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE

⚔️ Wokada Tg-Develop Fork vs Vonovox

For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

hallow thistle
hallow thistle
#

Also, your account "pronouns" part is questionable, by the way. karinthink

hallow thistle
rapid sable
#

im using RVC and im having trouble selecting models

hallow thistle
rapid sable
hallow thistle
#

W-Okada realtime voice changer or Applio RVC (retrieval-based voice conversion)? The initials RVC doesn't always mean realtime voice changer.

hallow thistle
frank juniper
#

hello everyone, can someone please connect me with an Ai training job or any given programming task I would really appreciate 🥺

wind orbit
#

Does someone have problems with odaka aswell? Yesterday it was perfect and now those high pitches and cut offs are there and I changed nothing

hallow thistle
# crimson depot I already DM

If it's a general knowledge, you can talk in here. It doesn't have always be a single moderator who knows about UVR5, unless you were trying to contact Eddy for his UVR5 fork bug fixes.

crimson depot
hallow thistle
wind orbit
hallow thistle
wind orbit
wind orbit
#

The dml one doesn’t work to me somehow

#

I tried it from huggingface and it just opens the cmd and closes right after

#

It says something abt windows pkg and then writes a huge chunk of commands and then just closes

hallow thistle
#

What I sent to you is a different (b2397) version to what you're using (likely being v.1.5.3.18a).

wind orbit
#

leemme try

hallow thistle
#

The issue with v.1.5.3.18a W-Okada, especially its DirectML variant has to happen with its outdated and buggy code, where the program only uses CPU even if your AMD Radeon RX GPU is present.

wind orbit
#

ohh and that explains the high pitches?

#

because with the voice im using there shouldnt be any, its one of the best teached ones

hallow thistle
#

No idea which "high pitches" refer to, but if you mean those background noises coming to your microphone it's likely the cause, or when the program is so laggy that the audio is pitching up and down unexpectedly.

wind orbit
hallow thistle
#

Still, though. TetoShrug

sweet osprey
#

where is the AI HUB FR

#

@hallow thistle

wind orbit
#

the one u sent me

#

can u send me the settings for noise suppression? i cant change it somehow

#

like the noise 1 and 2 doesnt wanna check

hallow thistle
# sweet osprey where is the AI HUB FR

AI Hub FR or AI Hub France no longer exists. What you're here now is "AI Hub by Weights" Discord server, which is the second iteration of previous AI Hub that was taken down in 2023.

hallow thistle
wind orbit
#

okay tysm

#

it works so good

hallow thistle
#

You're welcome. Make sure to set extra number to 2.7 s for more audio quality, though always check performance number at top right screen if there any delay. Tokipeace

robust mirage
viscid scarab
#

Help me

#

I need some help

hallow thistle
wild kindle
#

what program can be used to generate the images that are listed in the ai-images section?

dry turret
#

I have a question, it's my first time using w-okada and with each voice that I use, and listen back to the recording of my voice, the voice is always so choppy and cuts out mid word. Any idea how this can be fixed?

tulip onyx
#

I need someone to tell me if there's an alternative to Hugging Face to remove noise, echoes, etc., because the page isn't working when I try to access it.

spare vale
#

I think this is the right place to ask. Any models I upload myself have super high res, never gets below like 3000 ms, and when I first use the models i uploaded, they start off at like 100ms and work just fine until it quickly skyrockets in ms. i've uploaded even the most downloaded models in this server and I never get anywhere close to lower, even using the quickest settings. Using the premade models I usually get around res 200ms. Please help. (i'd send pictures but i cant)

hallow thistle
hallow thistle
spare vale
hallow thistle
viral cradle
#

u got one for mac os?

#

hello

#

?

hallow thistle
viral cradle
#

macBook air

hallow thistle
#

To check your Mac's CPU/chip, click Apple icon at top left corner screen, navigate "About This Mac".

viral cradle
#

its a applke m1

#

*apple m1

viral cradle
#

wont work

#

tells me this may harm device when i try terminal

runic cape
#

who can help me to get the Server URL

sturdy nimbus
#

how to fix the delay

flint hazel
#

Can someone help uh my gpu is rtx 2060 super so my chunk size is not going lower than 2400 but guides say it should atleast go down to 128 this is my version vcclient_win_cuda_2.0.78-beta

flint hazel
#

💔

viral mason
#

You should switch to wokada tg fork

flint hazel
#

I thought it was the latest and they discontinued it feck chatgpt

viral mason
#

Lemme get u the links

flint hazel
viral mason
flint hazel
#

true true

#

tho is it the reason why the Chunk count is so high?

viral mason
flint hazel
#

ya sure it will run on my gpu..?

viral mason
#

You can try 🤷‍♀️

flint hazel
#

💔💔

viral mason
#

Idk why it's needed but it is

#

And for the virtual audio cable just extract and run setup64

#

Then to run wokada tg fork run mmvcserversio

#

It's an exe file

flint hazel
#

so ya mean after extracting the 001.zip and put the 002 zip in the 001.

flint hazel
flint hazel
viral mason
flint hazel
#

aa..I see..

flint hazel
#

ty for the help

viral mason
#

You're welcome! If u need more help just @ me here or if u need to send pictures and can't here just ask to dm me ^^

flint hazel
#

oki

tame oracle
#

!howtoask

patent trellisBOT
# tame oracle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
wide perch
#

Using RVC with virtual audio cable and the output isn't producing my voice at all.

#

What can I do to fix this?

viral mason
#

Do you mean the real-time voice changer

wide perch
#

When I start it, the vb doesn't work at all

#

Don't hear my voice even though I have Listen to Device enabled

viral mason
#

What input and output settings do u have set?

#

<@&1159293140440723499> kill him

proven mortarBOT
viral mason
#

Uh oh

wide perch
viral mason
manic parrot
#

which graph should i follow?

wide perch
viral mason
viral mason
wide perch
#

the vb output mic just doesn't work, dont know what to do

manic parrot
viral mason
#

Oki doki

wide perch
viral mason
#

I'll get the link

#

To set this up just download it, extract the zip file and run setup64

#

Not as admin btw just regular

wide perch
#

the vb mic doesn't even show the green bars when I talk, so it's literately not picking up my voice at all

viral mason
#

if so it's outdated

wide perch
#

is there a new rvc that I'm not aware about?

viral mason
viral mason
viral mason
wide perch
#

So I shouldn't be using the standard one then?

viral mason
#

what gpu do u have?

wide perch
viral mason
#

ooh nice

wide perch
#

thanks dude

#

So out of the 3 which do you recommend?

viral mason
#

that one is peak, I have the 5070 ti

wide perch
#

siick

viral mason
#

vonovox also has a ton of features that might confuse u

wide perch
#

alr thanks ill give it a try

viral mason
#

let me get u the link so u don't download the wrong one

wide perch
#

thanks

viral mason
#

all u gotta do is just download both, extract the 001 zip and put the 002 zip into the folder for the 001 zip since 002 cannot be extracted but is needed for whatever reason

manic parrot
#

I have google colab pro, can i train 2 models at once

#

l4

viral mason
#

kaggle is better tbh, 30 hours free

#

can't train more than one model at a time but still, for free 30 hours, and colab at most will give u maybe 4 for free

#

per account

wide perch
#

never used web ones before

viral mason
#

the interface shows up on web browser yes

#

but it's still on local

#

runs from your pc gpu and the task manager

manic parrot
wide perch
#

LETS GOOO

#

Thank you so much

viral mason
viral mason
#

like how to use it or anything

wide perch
viral mason
#

alr, hope it works well for u in games and stuff

wide perch
#

hope so too

wide perch
#

Just wondering if it is possible to use my GPU since it would run better

wide perch
# viral mason

No yeah I did change it but clicking Start Server does nothing

#

The performance stats don't move

viral mason
#

odd

#

did u change from client to server?

#

client does not work

wide perch
#

ooh

#

thats probably why lol

#

its working

lament prawn
neat patio
#

Can I get some proformence tips (specs rx 580 8gb
I3 10100)

lethal dirge
#

Hi everyone, could you help me download the program to create voices for my characters, please?

torpid spire
#

ive got a 4060 gpu and when i open start http bat it doesnt do anything js opens then closes

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
hallow thistle
# torpid spire ive got a 4060 gpu and when i open start http bat it doesnt do anything js opens...

Last update: November 22, 2025

GitHub

Not all pretrains are downloaded anymore when starting the application for the first time. Instead there is a download-manager inside of the advanced settings, to download different models (RMVPE a...

torpid spire
#

thats amd @hallow thistle im nvida

lethal dirge
#

Could you please send me the download link for my PC?

#

please

hallow thistle
hallow thistle
lethal dirge
#

Or rather, what processor or what version of Windows do I use?

hallow thistle
#

No joke question. To check your PC CPU and GPU, open Task Manager, go to Performance tab, see CPU, GPU 0 or GPU 1 on the left panel.

lethal dirge
#

Could you please send me the download link for my PC?

hallow thistle
hallow thistle
#

Why should you ask more? That's a download link.

lethal dirge
hallow thistle
#

It's not that hard to click that download link. Shigureshrug

viral mason
hallow thistle
# neat patio Can I get some proformence tips (specs rx 580 8gb I3 10100)

W-Okada voice changer or Applio RVC (non-realtime)? While the voice changer could work with AMD Radeon RX 5xx GPU, it would likely struggle when with higher settings (Extra 2.7 s on bXXX W-Okada fork). Applio RVC (retrieval-based voice conversion) can work with AMD GPU but with some tweaks.

hallow thistle
viral mason
#

interesting

hallow thistle
# manic parrot I have google colab pro, can i train 2 models at once

With Colab Pro, while this tier allows you to run few more notebook instances on the same account, training 2 models at the time (if they on separate instances and use same GPU) sounds impractical or impossible, because resources would be shared and distributed across notebook instances, leading to slower performance for both and potentially drain compute units faster. For full performance, running only one instance is always better.

candid urchin
#

What is the most stable real-time software at the moment?

#

I’m currently using Applio v3.6.0 to train a voice model. Could anyone recommend the best configuration settings for achieving optimal results?

manic parrot
#

i got this error while starting training in applio

unborn oracle
#

hello everybody, I have never ever did anything AI related other than using AI, I wanna get into it and learn about it and models and stuff, I dont know where to begin or how to, all I know is that you need a decent pc, I have a 4080 super and a ryzen 7 7800x3d, anyone who has tips please share and thanks (;<->;)
I use windows 11

hollow matrix
#

guys can someone help , when i hit the start.http nothinf happpens....

hallow thistle
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
cosmic epoch
hallow thistle
cosmic epoch
hallow thistle
#

W-Okada can't train a voice model, only inference, which means converting your vocal in realtime using RVC voice model.

cosmic epoch
hallow thistle
#

W-Okada is a free and open source program, much like RVC (retrieval-based voice conversion; like Applio RVC fork).

hollow matrix
hallow thistle
cosmic epoch
hallow thistle
#

How am I supposed to answer that?

cosmic epoch
hallow thistle
hallow thistle
#

Generally, I do not like to tell little trivias (like your queries) whose common information can be accessed either in #1159513888199540817 or Google, and if you do research and understand a bit on why W-Okada voice changer is like that, it would be great.

hollow matrix
cosmic epoch
#

@hallow thistle Also, is it impossible to to upload models to weights? because when i click on "train model" the bottom that says "upload model" dissappeared.

cosmic epoch
hallow thistle
cosmic epoch
hallow thistle
#

Applio RVC is the only best known working RVC fork. Some other RVC implementations are either outdated, no longer maintained.

thick knot
#

w-okada don't work, why

#

@pseudo depot pro help

hallow thistle
hallow thistle
#

Yes, that one is more useful, just that when you were overlooking it.

hallow thistle
#

Download both of them.

thick knot
hallow thistle
#

Simply, these are split zips of a single zip file.

#

Why why? You use WinRAR or 7-Zip to open the .zip.001 one.

pseudo depot
thick knot
#

and for

pseudo depot
#

but I don't know anything about the AI models that are created here, I am specialized in another area of software engineering

thick knot
hallow thistle
pseudo depot
viral mason
cosmic epoch
viral mason
#

It's very sad to see them remove useful features, like downloading the outputs of ai vocals and now uploading good models

serene swallow
wild bobcat
#

ugh

#

this sucks

royal vapor
#

I'm trying to download tg-devlop's version but i get that the 002 rar file is damaged... I've tried downloading it several times. Anyone experienced this?

viral mason
#

And run mmvcserversio

#

You can't unzip it but it's still needed

royal vapor
#

so directly into the MMVCServerSIO folder?

viral mason
#

Just drag and drop into the folder of 001, then u can run mmvcserversio so wokada tg fork starts

robust halo
#

PyInstaller\loader\pyimod02_importers.py:378: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.

fair crag
#

Hello

lethal basin
#

your local worm

viral mason
#

that increases the index, the index basically just controls the model's "accent" it's not needed to be moved past 0 tho ^^

nocturne patrol
#

is there a tutorial? I'm on PC windows and i have a GTX 1070?

nocturne patrol
alpine lotus
alpine lotus
#

i wonder why...

nova abyss
viral mason
#

I use a 5070ti and I can get pretty low, all the way to 122.7 for chunk

viral mason
nova abyss
viral mason
nova abyss
#

the lower the chunk the faster it comes out but the higher the chunk the slower it comes out?

worthy helm
#

why does it sound like there is a delay\

silent stratus
#

anyone know whats goin on with the applio notebooks rn?

#

this what i just got in kaggle:

Save the link for later, this will take a while...
Traceback (most recent call last):
File "/kaggle/working/program_ml/app.py", line 30, in <module>
from tabs.inference.inference import inference_tab
File "/kaggle/working/program_ml/tabs/inference/inference.py", line 9, in <module>
from core import (
File "/kaggle/working/program_ml/core.py", line 20, in <module>
from rvc.lib.tools.model_download import model_download_pipeline
File "/kaggle/working/program_ml/rvc/lib/tools/model_download.py", line 14, in <module>
from rvc.lib.utils import format_title
File "/kaggle/working/program_ml/rvc/lib/utils.py", line 9, in <module>
import wget
ModuleNotFoundError: No module named 'wget'

viral mason
hallow thistle
#

@viral mason By the way, aside from sending these people links, how much do you know about the voice changer? W_

hallow thistle
nova abyss
hallow thistle
#

I have so some mixed thoughts about whether if Weights.com still considered good or no, as much as people here especially Local_Worm keep shitting on the site as if the website itself is some political material.

#

As much as I felt guilty for didn't actually pay for the site whereas I once won a prize for a free Weights subscription, the Weights and related websites (Voyages) are only good for "draft" creations, especially AI covers, inference and some model trainings. The most parts could be done through separate softwares like UVR5 and Applio RVC, but this workflow is more complex than the Weights website itself, to which I just think some lazy people wouldn't even do this.

ashen minnow
hallow thistle
#

What is your PC GPU? And did you follow any tutorial before?

ashen minnow
#

when i try to upload any model it's says it's missing toml file

hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
ashen minnow
#

every model what i dowland it's only have 2 files and default from vcc have a lot more

hallow thistle
ashen minnow
#

intel core i3-9100F and yes i used tutorial

hallow thistle
ashen minnow
#

Nvidia Geforce Gtx 1650

hallow thistle
# ashen minnow Nvidia Geforce Gtx 1650

Last update: November 22, 2025

GitHub

Not all pretrains are downloaded anymore when starting the application for the first time. Instead there is a download-manager inside of the advanced settings, to download different models (RMVPE a...

ashen minnow
hallow thistle
languid turtle
#

@hallow thistle

#

what is the best version

#

of tg okada

#

the most faster

#

and realistic

hallow thistle
#

While the voice changer might work with NVIDIA GeForce GTX 980 Ti, the entire GTX 9xx GPU series is slightly older than GTX 10xx and the more recommended RTX 20xx, so you don't always expect it to work fast whereas the program might struggle with that GPU.

languid turtle
hallow thistle
# languid turtle Is there a more optimized version?

Oh. There's Vonovox, a complete voice changer alternative, but this specific software is known to work best with RTX 20xx, so even if you get GTX 980 Ti to work it would likely struggle either way. So, "W-Okada voice changer fork" is your only hope to run locally. For full performance otherwise, consider for a more recent PC with better GPU (like GeForce RTX 3060) or an online website instead.

hallow thistle
languid turtle
hallow thistle
#

The audio quality has to happen with chunk/extra settings and the RVC voice model itself, not just specific versions.

distant hamlet
swift thunder
#

Is Gradio down?

wind yacht
#

Yo

#

How do you fix the res ms thing skyrocketting when you're using the voice changer in a VC

#

Like when I'm not speaking it's like res 14 ms, and when I do speak it launches up to 500ms and sometimes even above 1k, making my voice cut badly

#

I'm running a gtx 1650

viral mason
#

I don't know anything about fixing errors caused by the person's computer or if they're missing anything like python, I also cannot help at all with the cloud versions because I know nothing about setting them up

#

But besides that stuff I mainly know that if they're not using wokada deiteris fork, wokada tg fork, or Vonovox they are using an outdated software

#

Probably stuff I'm missing I just woke up

cobalt prairie
#

guys

#

How can i fix the constant cutting in my voice

#

with the chat bot

#

i mean voice bot

viral mason
#

???

cobalt prairie
#

Like the voice is cutting

#

i verified and no external sound exept my voice is

#

like hearing

viral mason
#

I'm very confused, could you explain what you mean by voice bot

outer valley
#

@distant hamlet i post it here because in general i cant post images.
This is my tensorboard of my first tries to train. With different settings (because me dum and a bit confused about applio settings).

And i have the feeling i get something wrong.

#

Thank you a lot.
Maybe this is a bit dumb to ask. But what applio settings would you use for a dataset of 1h 13 min that is with a sample rate of 48k and very diverse with normal talk and many emotional talk and noises.

#

I used BS-Roformer-Viperx-1297 in UVR5 to remove the background music. And in RX 10 i used this module chain (screenshot) to go over all the files. They raw audio data had background music and other noises and personally i hear a lot of this room/reverb in it a lot. So i wonder if that workflow i did is a good way to go. @distant hamlet

distant hamlet
distant hamlet
outer valley
# distant hamlet I guess since your dataset is so large, it works out. Maybe batch size of 10-12?

The model of the orange line that was trained for 8h with batch size of 8. Was bad but better then the other ones with batch sizes of 16-22 that only trained for around 2h.

Epochs are something i wondering even after reading about it what it really means when you lets say train for 400-1000 epochs. More is not better i got that because of overtraining. But for that there is tensorboard and even in applio a setting to stop training after a xx number of not doing and better process.
But doesnt that not mean i could like always go for more epochs and just need to find and pick the best epoch or does a high epoch setting alone affects quality? No right?

distant hamlet
distant hamlet
outer valley
# distant hamlet chain looks fine. how bad is the original audio tho? And are they all pieced aud...

I think if you fine with a little wall of text....

I have a voice model that i created with merging 2 models in Okada.
I really like that model and that sounds really good but it is old. Like from 2023 and 2024 and it has this high pitch and ss/zz/hiss problem that it makes this weird ai voice sounds.

Since back then i wanted to fix that model but i used already existing ones. Sadly there was some sort of wipe back then? Not sure i was never much active in AIhub.
So i was not able to find the orginal uploader of the 2 models.
And since then try to find better base models. When there was still "commission section" i got some models made from the "Master model makers" but... to be honest they not reached the quality of this one random model i got before the wipe or what ever happened to that time.

They had to much hall/reverb or to muddy or had the same hiss/zz problem.

So tried to merge around with different models i thought could maybe help to what i want. I only could get a small upgrade from that.

In short now in 2026 i hoped there is better ways to go for my goal with like "pretrains" and using my 4090 to just do it myself now.

Back then i got told trying to make a model with emotional data is not a good idea and not works out well.
Now at least "gemini" kind of tried to tell me that changed or would be possible.

So i got raw data from youtube from the voice i wanted with like many different emotional sounds and talking and a lot of normal talking gemini said a 13 min of emotional stuff and around 30-40 min of talking would work.

So i used UVR5 and RX 10 with was i showed even manual cut A LOT of stuff UVR5 missed in RX10 and cut to long silent parts between talking. But only were i felt the pause was too long.
So the dataset is a big mix of different sources of the same voice.

viscid scarab
#

help me

outer valley
# distant hamlet I usually find de-reverbing from MVSEP (or other AI models) to be sufficient. RX...

So gemini was telling me it would be good to "normalize" audio to bring it all to the same lvl ?
Like laughing or scream would otherwise be to loud and whisper to quiet for training. Because i was very unhappy with the first try of training and checking the dataset i felt like... it still had to much of strong reverb and this noises of for example a PC being near the mic.
So the 2.0 cleaning gemini suggested "loudness control"

outer valley
dreamy seal
#

if i visit the website the execution instantly got terminated

#

would be just like this after it finished lanching and it got terminated

distant hamlet
distant hamlet
outer valley
distant hamlet
dreamy seal
#

why is this paused now? how do i use it again?

outer valley
#

the 2.0 cleaned dataset i trained with 16 batch size.
Should i try go down to 8 and normal training settings?

Was my idea of a lot of emotional audio in the dataset maybe a wrong idea? Emotional talk i mean like :

  • funny weird mouth noises that for example some vtuber do.
  • whispering
  • burst of laughing or other noises

i manual cleaned and removed what i felt could be to bad from quality or maybe not work well. But maybe it is still contain to problematic sounds?

There is a mix of talk from like 2022-2023 and from 2025.
So there is a little difference from for example accent of the voice. But my goal is a own voice with 2 different voices so i not care for accent.

It only would be a problem if that means using audio from different times of the person voices with different accent and maybe a small change in audio from maybe different mic setup hurt the training a lot.

outer valley
worn lodge
#

sup guys, so, Im trying to run qwen image with the upscaler fine tune. I got a rtx 3060 12gb + 32gb of ddr4 ram. Now, I tried both the quant4 and quant8, nothing, im always hitting OMM, like Im probably doing something wrong, because what I wanna do is: I run the model on the ram, and then I offload some layers to the gpu, this works fine in text models, but vision models... idk, they just dont wanna work. Any recommendations?

serene swallow
#

-vonovox

#

-help

#

!vonovox

#

Bro. Can someone give me this disc's vonovox repository

hallow thistle
hallow thistle
#

Assume you have NVIDIA GPU since you never stated anything about it just demanding for the voice changer.

serene swallow
#

Why do you sound aggressive about it

#

It was just a question bro

#

You dont have to reply if you dont want to

hallow thistle
hallow thistle
# dreamy seal if i visit the website the execution instantly got terminated

For "free" Colab users, Google Colab can disconnect your current runtime (especially the one with Web UI) anytime, which is an expected behavior. With Colab Pro, you could run Applio RVC UI or any Web UI notebook on Colab without a problem. This is not a bug or defect that persists with Applio RVC UI Colab notebook, it also happens with certain notebooks that use Gradio, ngrok or other Web UIs like "W-Okada voice changer" as well.

#

The same issue** doesn't** happen with Applio RVC no-UI because this specific notebook doesn't use Gradio or any web UI codebase, certain features/commands are separated into code cells like this.

loud warren
#

This file was blocked because files like this from the internet arent safe

#

how do i fix

#

@hallow thistle

hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
loud warren
weak sorrel
#

Full GPU Name: AMD Radeon RX 7900 XT
Operating System: Windows 11
Detailed Description: I use the basic W Okada client but wanted to see if i can maybe use something that can work better for AMD but ended up making it worse lol just wanna know if i messed up or if my system is cooked and keep with what ive been using (when i say worse i mean my system lags and my ms is INSAINE i cant get it to lower at all even at chunk at 800ms)
Tutorial Used: https://github.com/deiteris/voice-changer?tab=readme-ov-file

hallow thistle
weak sorrel
#

okay

#

its laggy still it keeps being "close" to the perf but when i raise it it will only keep going up

weak sorrel
#

so if i match 208 i would need 250 and etc

#

oh?

#

i clicked the download link it said as recommended

viral mason
#

are you using wokada deiteris possibly?

weak sorrel
#

yea

viral mason
#

oh nvm ur good

weak sorrel
#

oh did i fuck up

hallow thistle
#

What do you mean you're running the older version like v.1.5.3.18a?

weak sorrel
#

i wont lie im not sure where to check to see what verison i am on

hallow thistle
weak sorrel
hallow thistle
#

On top right of voice changer interface, it's obvious.

weak sorrel
hallow thistle
# weak sorrel

You set GPU to AMD Radeon(TM) which is an integrated GPU, not the mentioned AMD Radeon RX 7900 XT which is a dedicated GPU.

weak sorrel
#

OOPs

#

yes

#

i kept switching to see what would work

#

like better

hallow thistle
#

Simply, a dedicated GPU is better than an integrated one.

weak sorrel
#

i saw that too yee

#

im just trying everything mbmb

#

this is the ms im at with the right gpu^^

hallow thistle
#

Set F0 det to "rmvpe_onnx".

weak sorrel
#

we at 260ms now

#

any advance settings i should try?

hallow thistle
hardy hare
#

went through the normal process of setup with colab applio, and when i open the public url the colab "completes" the task, making gradio unusable... is this a bug?

wind yacht
#

both chatgpt/gemini can guide you through that

hardy hare
#

well. trying to work with chatgpt on this but i'm not exactly getting very far. i don't really code things, i just like using colab because it's usually simple..

hallow thistle
wind yacht
wind yacht
#

ill get an answer for u

hallow thistle
wind yacht
#

youd be surprised what ai can do

hallow thistle
#

Ok, AI bro.

hardy hare
#

according to gpt at least, the colab is cancelling the gradio public link upon opening it.

in as simple terms as i can explain, i click start server, i wait for the public url, i click the url, gradio says a session isn't open, and coming back to colab it displays as "complete" with a check mark.

atp gpt gave me a bunch of code to add so i probably already need to refresh anyway. idk why i even tried to see if it could help at all.

#

again, i don't code, and whenever colabs have bugs like this i'm usually at the mercy of other people coming out with new ones or something.

#

i've been using the same one (afaik) for months without issue.

#

it's uh.. linked to the applio colab guide.

hallow thistle
#

Try delete your current Applio RVC notebook, and then re-import the link https://github.com/IAHispano/Applio/blob/main/assets/Applio_Kaggle.ipynb to your account again.

hardy hare
#

alright, tried that, didn't seem to make any difference, but i also don't know if i deleted it properly. it ran as if it was still in.

#

i could try a different google account maybe?

#

i have spares, i'm a free user and i hate time limits

#

i swapped accounts and it worked, so... i have no idea why it happened to that account in particular

#

either way i guess i'm all good now.

outer valley
#

Is there a good tool for voice/speaker seperation ? Like if i want to remove TTS or 2-3 other speakers and only want one specific voice? I used WhisperX with a little script but i was wondering if there is a more easy way or maybe even better working then what i use.

flat trail
#

why is it so delayed or sometimies i aint even hearing it

patent latch
#

The tensorboard extension is already loaded. To reload it, use:
%reload_ext tensorboard
Reusing TensorBoard on port 6006 (pid 3062), started 0:02:05 ago. (Use '!kill 3062' to kill it.)
Ngrok URL: https://hiltless-marbly-brianne.ngrok-free.dev
WARNING:ngrok.tunnel_ext:error connecting to upstream error=Connection refused (os error 111)
An error occurred connecting to Discord: Could not find Discord installed and running on this machine.

i'm getting an error like this, what's the reason?

magic lynx
#
/content/Applio
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.

huh?

#

same here

/content/Applio

No wav file found.
/usr/local/lib/python3.11/dist-packages/torch/utils/data/dataloader.py:626: UserWarning: This DataLoader will create 4 worker processes in total. Our suggested max number of worker in current system is 2, which is smaller than what this DataLoader is going to create. Please be aware that excessive worker creation might get DataLoader running slow or even freeze, lower the worker number to avoid potential slowness/freeze if necessary.
  warnings.warn(
Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.

random forge
#

how to addapt this ai voice?

carmine glade
hallow thistle
random forge
hallow thistle
#

No, but like is it about Applio RVC (non-realtime) or W-Okada voice changer?

#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
cosmic epoch
#

@hallow thistle is there a version of replay that works?

hallow thistle
cosmic epoch
hallow thistle
cosmic epoch
# hallow thistle

two questions: 1, can this app be used on PC? and 2, is it possible to train rvc models with it?

hallow thistle
#

UVR5 doesn't use RVC voice model. It's a program that separates audio into stems.

cosmic epoch
hallow thistle
cosmic epoch
hallow thistle
#

UVR5 is not your typical AI cover maker program, the what.

jaunty shale
#

in an online method of Tg Develop's W-Okada (Kaggle), the option to use server is greyed out, is there a reason why?

viral mason
viscid scarab
#

I need some help

real depot
#

Hello everyone.
I’m testing inference in Applio with models trained by me using SPYN V2 as the embedder.
I’ve noticed that if I leave everything at the default, like the index at 0.75, the voice sounds very close to the original, but the pronunciation of the phonemes becomes a bit strange.
I also notice this with models trained with cvec: in Applio I need to tweak things to try to make it sound better, while in Replay, for example, it already sounds perfect, with correct pronunciation.
I’ve heard comments that Applio doesn’t seem to be very good for inference, but it’s the only one I know with support for SPYN V2.
Has anyone experienced this and has any suggestions?
If I lower the index to a value below 0.75, for example 0.3, the pronunciation gets better, but it loses some of the voice characteristics.

analog obsidian
real depot
#

Although there is a difference, with SPYN V2 and the index at 0.75 in Applio the voice sounds very close to the natural one, but both with SPYN V2 and with cvec I need to make adjustments, whereas in Replay I just run inference.
Well, at least that’s been my experience with inference using both.
Replay still doesn’t have support for SPYN V2 to compare.

analog obsidian
#

maybe they're not really using the index file

#

bc index files will always have some negative impact in the pronunciation, it's not normal to have perfect flawless pronunciation with them

real depot
real depot
analog obsidian
swift yarrow
#

i think its a simple problem but i cant seem to find a solution too

#

please ping me when ur here so i let u know what i need help w

craggy bough
swift yarrow
#

okay, i have a problem where im on intel gpu so i downloded the vc client win std beta, i extracted open file opened dist and i opend the start_http in the dist folder and it didnt load the voice changer all it said in the terminal was
C:\Users####\OneDrive\Desktop\ai voice changer\dist>main.exe cui --https false --no_cui True so i need help loading the voice changer

#

@craggy bough

craggy bough
swift yarrow
#

i got it from here

#

bruh

#

not that

#

i got it from there

craggy bough
swift yarrow
#

and

#

Intel(R) Core(TM) i3-1005G1 CPU

#

is that reccomended to download vc client

craggy bough
swift yarrow
#

graphics

craggy bough
viral mason
#

they're so cooked

swift yarrow
#

where that at

craggy bough
craggy bough
swift yarrow
#

i js followed a tut

#

turns out its outdated

viral mason
#

ew don't help them anymore

swift yarrow
#

what

craggy bough
#

and its for people with proper gpus

viral mason
swift yarrow
craggy bough
craggy bough
austere stream
#

i swear this happens every time i use kaggle and i forget how i got around it all the other times

sudden violet
#

where do i get the zip files for these?

gusty ferry
#

why is it not letting me upload any voices

hallow thistle
# sudden violet where do i get the zip files for these?

Are you sure you wanna use this specific W-Okada version that made to use Beatrice voice models especially? Because Beatrice model is rare in "AI Hub by Weights" here. "RVC (retrieval-based voice conversion)" models are more common since they give significant better audio quality, especially RVC v2.

hallow thistle
hallow thistle
hardy yew
#

HI! I have a question regarding training RVC models, more precisely the choice of sample rate. So far I've been mostly sticking to legacy core 1.5 48kHz pretrain so that was my dataset's sample rate, but I wonder if there's a chance the models would train better on lower frequency samples (with appropriate pretrain, e.g. 32/40kHz legacy core).
Is sticking to 48kHz alright, or should I rather choose a minimum sample rate that fits the frequency spectrum of my dataset input? I haven't checked it so far so perhaps my input samples don't actually utilize the entire 24kHz frequency range

viral mason
#

I can see the dust on it

#

What gpu do u have

analog obsidian
#

and its just bad for high sample rate, too many problems, like worse breaths and esses

sudden violet
#

but i dont know what version to download

viral mason
#

you should use wokada tg fork, I'll get you the links

sudden violet
#

thank you brooooooooooo

sudden violet
#

ok

viral mason
#

and the other two are for wokada tg fork

sudden violet
#

ok they're all downloading

#

whyd u send 2?

viral mason
#

for vac lite run setup 64 after extraction and for wokada tg fork after u download both extract 001 then place 002 into the folder of 001 since it cannot be extracted but needs it anyways

viral mason
sudden violet
#

so its making me reinstall vb cable?

viral mason
#

no? it's a different one that works the same but doesn't cause issues on windows

#

vb cable isn't as recommended since it causes weird issues on windows sometimes

sudden violet
#

ok its done on vac4lite

#

oh ok

#

then i open which one first?

viral mason
#

?

sudden violet
#

0.001?

#

or the 0.002?

viral mason
#

extract 001 and then place 002zip file into the folder of 001

#

after u do that just run mmvcserversio in the folder of 001

sudden violet
#

wait theyre the same thing no?

viral mason
sudden violet
#

oh ok

#

where did u find this new version from because i still look here for the main things

viral mason
#

oh that's really old

#

if you do this

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE

⚔️ Wokada Tg-Develop Fork vs Vonovox

For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

viral mason
#

then u can find the three main voice changers used now

#

I would recommend you vonovox since your gpu can definitely handle it but it's currently in a beta stage adding new things and I'd rather give u the newest version of it than the current version that has less features than the beta

sudden violet
#

how do i pick that?

viral mason
sudden violet
#

the vonovox

viral mason
#

if you'd rather use that I can give you the download

#

you'd need to delete the one I just gave u tho to save on space

sudden violet
#

whats the difference

viral mason
sudden violet
#

oh ok

viral mason
#

just run setup first then run start

#

they're .bat files

sudden violet
#

do i extract it

viral mason
#

yup

sudden violet
#

ok so how is it the best?

viral mason
#

it produces more natural speech, has a lot of different features to block out bg noise messing with the model so weird sounds don't come out or give random voice cracks

#

there's an optional paid effects like adding reverb or a 8bit effect although you can easily bypass that by using a DAW like fl studio or just using voicemod

#

it's mainly to support the creator which is always nice

sudden violet
#

so its the best version of the three?

#

whats the third one?

viral mason
#

the third one is a little outdated compared to the othe two, it's wokada deiteris fork

#

basically the original wokada but made more up to date code wise and has a few quality of life features, tho the other two have them as well

sudden violet
#

oh okkkkkkkkkk

#

i like the web version that they made for the rvc one

#

easier

viral mason
#

so wokada tg fork?

#

that one runs on a web browser

#

vonovox doesn't tho

sudden violet
#

yeah

#

so waht do i do when its done

viral mason
#

depends, which one are you going to use

sudden violet
#

vonoxo

#

i was waiting for the setup to finish

viral mason
#

alrighty, after the setup is done run the file called start

sudden violet
#

ok it worked thanks so much gangyyy

viral mason
#

you're welcome!

#

if u need more help ask me here

#

btw to import a model just download one from here, extract and press one of the empty slots and insert the .pth file

#

and underneath is a box that says index I believe

#

import the index file for the same model in that

sudden violet
#

on vonovox

#

?

viral mason
#

yep

magic lynx
#

im finishing revamping applio notebook ||which wont be even accepted but idgaf, they dont want autotune infer fixed||
and i wonder which badge set is the best
i think 2, 3* or 4
* - uses website colors

fringe swan
#

How do I train a voice for free

#

@magic lynx can you help me

shadow sparrow
viscid scarab
#

Guys I need some help

#

Help me

hallow thistle
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask
❓ How to Ask for Help
✅ Before You Ask!
  1. Check Docs & Guides: Your answer may already be in the AI Hub Docs or the https://discord.com/channels/1159260121998827560/1159513888199540817 channel.
  2. Search the https://discord.com/channels/1159260121998827560/1192011222023950368 : Look for existing posts that solve your issue. Do not invade someone else's post.
📝 How to ask?

Tell your:

  • Full GPU Name: (e.g., NVIDIA RTX 4060 8gb vram desktop)
  • Operating System: (e.g., Windows 11)
  • Detailed Description: What were you trying to do and what went wrong?
  • Tutorial Used: Link to the guide you were following.
  • Screenshot: A picture of the full error message.
🚫 Prohibited Topics (We Will NOT Help With These)

To maintain a legal, safe & ethical community, we will NOT provide help for:

  • ANY illegal activities.
  • NSFW/Porn.
    Requests for these topics may be ignored, not helped and result in moderation action.
<:matsuripray:1159685390156967936> Community Expectations
  • Be Polite & Patient: Our helpers are volunteers. You may ping the Helpers role once.
  • English Only: Please keep all conversations in English.
  • Don't Ask To Ask.
viral mason
hasty solstice
#

im here

tame oracle
#

okay

#

what voice changersing

#

*u using

hasty solstice
#

Voice Changer Client Demo

tame oracle
#

Dats old as balls

#

-rt

patent trellisBOT
# tame oracle -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE

⚔️ Wokada Tg-Develop Fork vs Vonovox

For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

tame oracle
#

first guide is vonovox

hasty solstice
tame oracle
#

is the best of those three

hasty solstice
#

awsh man i cant send media

#

but does it say

#

For Windows
Download this: VAC Lite (Virtual-Audio-Cable by Muzychenko)??

tame oracle
#

yes

#

that is the recomended one

hasty solstice
#

is there a step by step on how to set this up completely?

hasty solstice
#

thank uu!

#

i see it now

tame oracle
#

yess ofc

hasty solstice
#

ill see about what i could do and hopefully itll stop making me sound like a robot

tame oracle
#

im sure it will, its a lot better code wise

hasty solstice
#

@tame oracle ive gotten far to the point where the files is already exported and i installed the "setup64", what now?

hasty solstice
#

Do i open that?

#

i think so

tame oracle
#

yes

hasty solstice
#

its opened now

#

its just

#

wait

#

is this it?

hallow thistle
#

Words alone won't always imagine as image. For better understanding, there's screenshot.

hasty solstice
#

i just now got image perms

#

hold on

hallow thistle
#

That's not the actual program, it's "help" doc for Virtual Audio Cable.

hasty solstice
#

where do i find that?

hallow thistle
#

This is the actual control panel for Virtual Audio Cable.

hasty solstice
#

OHHHH

hallow thistle
#

No, you don't need to set anything in this control panel.

hasty solstice
#

okay so im basically set?

hasty solstice
#

All of them are now checked

#

is there an option where like

#

okay im going to sound so slow but like possibly add the models?? 😭

hallow thistle
#

This is Tg Develop's W-Okada fork.

#

This is where you upload a voice model to the voice changer.

hasty solstice
#

so i have to go to that website?

hallow thistle
#

No way, you're looking for Vonovox, not the W-Okada fork.

#

By the way, if you wish to stay for Vonovox, I can help you a bit about Vonovox. But if you'd like to try that one W-Okada fork, the easier interface, I can send you links.

cedar sky
#

Hi guys, Ive spend all dat yesterday to try to find cheap api to kling models video provider. Does anyone know the workflow or any api or apy way to get kling video generation price less than 0.5$ per 5 sec? 🙏

spark citrus
#

-colab

patent trellisBOT
# spark citrus -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

viscid lily
chrome geode
#

So I've got the "generating buffers" thing goingon and I can't find a single way to fix this on discord or online

#

Anyone have any actual assistance with this?

sturdy oracle
#

Hey guys
I'm new here
I joined because I don't have a good idea of how gen ai models work, but find their use of copyrighted materials problematic
I want to create software that efficiently and quickly poisons Gen AI through encoding music, audiobooks, images etc
Does anyone here have an idea of how I could go about doing that?

shy torrent
#

Hey guys any quick help I installed via 'W-Okada Fork Guide' but whenever I try a model my voice never changes anyone has a fix ? GPU 5070TI

viral mason
#

which guide did you use?

shy torrent
#

this one

#

downloaded the 5000 series one

viral mason
#

hmm I'd switch to either vonovox or wokada tg fork tbh

#

both are better than deiteris

#

-rt

patent trellisBOT
# viral mason -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Vonovox

A Realtime Voice Changer with similar performance to Wokada Tg-Develop Fork, with extra features, but it supports only Nvidia GPUs on Windows 10/11 unlike other options that have wider support. and without cloud options GUIDE

• Wokada Tg-Develop Fork

A personal fork (modified version) of Wokada Deiteris Fork, it just adds some Quality of Life improvements to it like supporting Spin Embedder and Audio Effects. Don't expect too much about it since the creator made it originally as a personal project. GUIDE

• Wokada Deiteris Fork

Deiteris' fork (modified version) of wokada that doesn't get updates anymore. GUIDE

⚔️ Wokada Tg-Develop Fork vs Vonovox

For Windows Nvidia, Both Wokada Tg-Develop fork and Vonovox have similar performance & quality. Users should read the pros and cons for both and choose based on their differences, such as UI and Vonovox's paid effects.
Read Wokada Tg-Develop Fork Pros&Cons & Vonovox Pros&Cons

⛔ Outdated/Discouraged

These options are not recommended for use.

• Original Wokada

Not suggested, older versions in youtube tuts are even way worse. GUIDE

• RVC GUI Mainline Realtime

The program is worse compared to the ones above, and much less updated. GUIDE

shy torrent
#

ill try rn

viral mason
#

vonovox is currently the best of the three but I personally use tg fork for the fast model swapping time

#

if u do use tg fork u will need the 001 zip and 002 zip

viral mason
#

alrighty, all u need is just to download the current version 1.6.9 and run setup first, the run start

shy torrent
#

ty bro

viral mason
#

np!

shy torrent
#

also i've seen people talking abt malware inside of w-okada soft were there triping or what ?

viral mason
#

if it isn't from here it most likely is a scam

#

a lot of people are scummy and try and trick people with versions of old wokada that have viruses

viral mason
frigid stone
#

Is there a standardized regular speaking audio sample for both male and female voices that can be used for testing inferences?
I'm looking for something that covers a good amount of speaking sounds just to know if a model works well

brittle wing
#

yo anyone ?

#

i wanna learn as much as i can about aiii

viral mason
#

what version of wokada are u using?

#

also u cannot upload it if it's a json file sadly

frigid stone
viral mason
#

what gpu do u have btw, since ur using a really old version of the voice changer u should upgrade

viral mason
#

oki doki

abstract comet
#

What’s this new model you speak of?
I’d be really interested to learn about it

abstract comet
#

Specifically for singing models

abstract comet