#✨│ai-help

1 messages · Page 245 of 1

dark ginkgo
#

so flux dev would work on it then you are saying

dark ginkgo
#

@simple ore I dont see these options on mine

#

also, when ever I try to change the sd vae to ae, it changes back

#

as in?

#

yes, but it also needs the sd_text_encoder in the options

#

it wont let me switch

#

also looiking it up, I have not found any infor regarding automatic 1111 being compatible. Everyone say to use forge or comyfui but I dont have any tutorial online to setup on a1111

#

I changed it to that already but also still no text encoder option

#

like I cant find this in the list of options

#

can you verify you are indeed using a1111 and not forge? a1111 has been abandoned from what I read so far with zero updates

#

as of 2024

#

furthermore, it said flux is incomaptible with a1111 and had no plan to update

red imp
#

can anyone help me, this keeps popping up when I try to use the MMVCServerSIO in the folder, I am on a m1 silicon mac

red imp
#

i switched to a updated os and it worked

#

but now for some reason when i try to select an output and monitor for the audio inside the voice changer, nothing shows up

#

idk how to use it

#

ohh ok thanks

#

it works

dark ginkgo
#

@simple ore So I just downloaded this, used the default what it comes with. Already having issues

#

SD.next, never heard of it until today

#

I have yet to touch anything

#

like it just finished downloading

#

I'm doing my usual prompt test for flux, 1024/1024

#

photograph of a red apple on wooden table, red apple, wooden table, high quality, professional photograph, dark background,

thick aurora
#

RVC v2 disconnected is working bro!

simple ore
#

there's a good community there

light latch
#

Hi, can someone help me? I know how to look for AI cover models, but I don't have the link to make the voice work in an audio? I used Google Collab, has anything changed? Does anyone have a new link? Please answer.

elder coral
#

is this a clean model

elder coral
quasi gyro
#

Did a new klm model drop or am I seeing things? Cause I saw that the thread was updated

quasi gyro
quasi gyro
simple ore
#

if you use a custom embedder option you can use any version

#

or applio exp branch

#

or the fork

modern surge
#

-colab

patent trellisBOT
# modern surge -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

modern surge
#

I need a model to separate the drums of a song

#

Just the drums

#

Does anyone knows how to?

#

-uvr

patent trellisBOT
quasi gyro
jovial kraken
#

I mean the RVC version by W-okala

latent kettle
peak path
#
import torch
from pipeline import build_audiosep, separate_audio

device = torch.device('cpu')

model = build_audiosep(
      config_yaml='config/audiosep_base.yaml',
      checkpoint_path=str(models[0][1]),
      device=device)

audio_file = 'zand.wav'
text = 'water drops'
output_file='separated_audio.wav'

# AudioSep processes the audio at 32 kHz sampling rate
separate_audio(model, audio_file, text, output_file, device)
simple ore
# peak path

depending on the version of torch, it is either a warning, or it simply did not load the weights at all. All torch.load calls need to be provided with weights_only=True

hazy eagle
#

i use firefox how to fix this

#

everytime i choose my microphone on client that pops up

simple ore
#

or change the mic settings in the sound contol panel

hazy eagle
#

i tried this on many browsers but chrome gives best quality, i cant test it on firefox because of that error

knotty moth
#

beside that the built-in noise suppression (Sup2) can only be used in client mode

quiet axle
#

hi was wondering what crossfade setting u guys usin, or just using the default?

quiet axle
#

oh thanks, btw i was wondering. So i just reinstalled my windows cause of a certain problems. For some reason my vc doesnt sound like it used to

#

got some advice you reckon i could do?

#

i feel like the voice seems to abruptly end on a sentence more so then usual

elder coral
#

how to make my index file have added_

#

because it doesn't have it

knotty moth
quiet axle
#

which protocol is better sio or rest? whats the different anyway

knotty moth
quiet axle
#

oh i see, thanks for information

simple ore
elder coral
knotty moth
elder coral
#

yes

knotty moth
#

and it works in any rvc applications

simple ore
#

very old apps called index files like 'added_IVF1406_Flat_nprobe_1_modelname_v2.index' for no good reason

knotty moth
#

the mainline and older versions name it like that, also spawns total_fea.npy and trained_*.index

summer reef
#

how to create models

#

pls help me

elder coral
summer reef
patent trellisBOT
# elder coral -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

elder coral
#

here it is

#

ask the helpers first to find videos on how to make models like that

atomic goblet
#

I was thinking of getting a dedicated gpu for the voice changer, does anyone think a 5060 Ti would be good to handle that?

remote steeple
#

how to make voice changer sound not robotic

simple ore
atomic goblet
simple ore
atomic goblet
#

3090 Ti

simple ore
#

you can find like 3050 used for $100-150

#

just need to make sure you have a free 8x pcie slot and 2 slots for the card itself

atomic goblet
#

Hmm, that can work. I can try to find one then

forest jolt
#

hi whats the latest fork or program for rvc gui
i tried to use rvc ai cover maker and that its not sounding as good as rvc gui

hazy eagle
sleek sand
#

I need to make a Russian girl's voice out of my male one. which model should I use and how should I configure it? I use AI-Voice Changer

undone abyss
#

My command prompt closes automatically after opening start.bat anyone know a fix? I've downloaded PyTorch and Python

undone abyss
#

Windows 11 I have an intel gpu

latent kettle
#

Applio, w-okada?

undone abyss
#

w okada

latent kettle
#

Have you downloaded correct version from guide?

undone abyss
#

I believe so, yes

#

i can try to check which one i downlaoded

latent kettle
#

-realtime

patent trellisBOT
# latent kettle -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

undone abyss
#

thank you

undone abyss
#

and install this?

latent kettle
#

Probably if it is wrong version

undone abyss
#

okay thank you

#

what do i do with the file you gave me

#

@latent kettle

#

do i extract

#

or like

#

what do i do

latent kettle
undone abyss
#

OMG ITS DOWNLOADING THANK YOU SO MUCH

#

youre the best @latent kettle

#

appreiciate it

latent kettle
#

Yw glad to help you anime_giveheart

undone abyss
#

@latent kettle one last question, how do i add a voice to the voice changer. I can't find a tutorial on YouTube to help me because the tutorials show w okada via app and im on a website

latent kettle
#

@undone abyss

viral mason
undone abyss
undone abyss
#

i appreiciate both of you guys' help

viral mason
#

no problem ^^

#

my dms are always open if anyone has questions about setting up the okada voice changer

undone abyss
#

@viral mason I just dmed you and discord took away my permissions to dm, do you know why?

viral mason
#

I couldn't message you either

undone abyss
#

i think someone went on my account and blocked ppl

#

i reset my cookie'

#

idk what happened it was weird

#

i texted you

#

and then discord logged me out of everything the second i did

#

luckily i reset the cookie in time before stuff happened

#

but that was rlly weird

viral mason
#

ok do I can't dm u for some reason but

#

I'll send the pictures here instead

undone abyss
latent kettle
undone abyss
#

idk what happened

#

that was weirddd

latent kettle
#

I mean w-okada for intel

viral mason
#

I can't send pictures

latent kettle
#

The one you was using before, it was designed for Nvidia

viral mason
#

I'm gonna hang myself

undone abyss
undone abyss
undone abyss
#

that makes sense

latent kettle
#

Yep

viral mason
#

I need image perms?

latent kettle
#

Delete that

viral mason
#

or somthing

latent kettle
viral mason
#

wtf is wrong with discord rn then

undone abyss
#

discord being weird

latent kettle
viral mason
#

ugh

undone abyss
#

w okada is delayed when i talk, anyone know how to fix?

#

and wghat settings to put?

viral mason
undone abyss
#

okayokay

#

how do i make it

#

sound better

viral mason
latent kettle
viral mason
undone abyss
#

thank you

#

what would be good settings for intel gpu?

latent kettle
#

@undone abyss

undone abyss
#

top one?

viral mason
#

also this IF IT LETS ME SEND I SWEAR TO GOD

#

finally

#

jesus h christ

latent kettle
undone abyss
#

i cant reopen mmvcserveriosio it says failed to excecute script 'client' due to unhandled exception

#

nvm i got it

#

ill look at setting explination now @latent kettle

undone abyss
#

thank you

viral mason
#

I'll try helping if I can but I'm about to burst into a ball of angry juice

undone abyss
undone abyss
viral mason
#

kaggle doesn't work

#

it's fucking dead

full marsh
#

same kaggle showing firbase problem

viral mason
#

at least emojis work

full marsh
#

colab dead now kaggle ..nice

viral mason
#

hold on

#

it loaded for me

#

kaggle

viral mason
#

it's not saving tho

full marsh
viral mason
#

:(

#

is there a way around or nah

full marsh
#

it loads for me too but won't work

#

no unfortunately until the creators themselves fix the issue

viral mason
#

do they fix it quickly usually orrrr

#

bc I kinda need to keep training the model I'm working on

full marsh
full marsh
#

see if it's fixed after few hours or wait for the notebook creators fr a bettr solution

viral mason
#

first discord breaks now Kaggle

#

sadge

full marsh
#

u can always train locally if u have gd hardware

viral mason
#

I don't like local training, it messed with my vr

full marsh
#

i only use local fr inference training takes toll on my pc

viral mason
#

that's fair

#

kaggle was having issues like not too long ago was it not?

#

is someone attacking them or

viral mason
full marsh
#

u can get free immediate only CPU tho

#

i'm waiting too

viral mason
#

bad update

#

this better not stick

full marsh
tough fiber
#

its stuck like this is this collab broken ? i used this 1 hour ago but now

tough fiber
#

now works

#

idk what happened i guess its collab thing

#

servers maybe

full marsh
#

Finalllyyyy

winter dew
snow vine
#

i need a lil help

#

ho do you make ur voice not sound glitchy

crystal pine
#

How do you make modles??

astral jungle
#

RVC V2 DISCONNECTED has been active 💔

fair prism
#

what AI stem do you guys use

#

is UVR still the most high quality

terse halo
#

Does anyone know how to use the voicechanger?

#

do have a version for ryzen?

viscid moss
viscid moss
#

patent trellisBOT
# viscid moss -rt
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

viscid moss
#

First link

terse halo
viscid moss
fair prism
#

how do you know which is the best pretrain

#

should Ijust use titna

viscid moss
#

Do it after removing instruments

umbral breach
#

Hey there, is the 9070XT capable of running inference or training on windows? Ive been trying to get it running on Applio with Zluda with no luck. Or should I just give up? lol

viscid moss
#

Reverb, then backvocals and finally denoise

simple ore
#

unfortunately by default applio install comes with python 3.10 and the rock requires python 3.11 or 3.12 (applio does not not work with 3.12)

simple ore
#

to use python 3.11 with Applio you basically need to nuke env folder and make a fresh venv using separately installed python3.11 and use pip install -r requirements.txt

#

without conda

#

or maybe change the install script to use python3.11 conda

#

that's another possibility

#

once applio is installed, install the experimental wheels

umbral breach
simple ore
umbral breach
simple ore
#

but the method from the applio ticket should work

umbral breach
#

oh okay, so it is possible its just me missing something...
I'll give it a go, thanks for the help

terse halo
#

I tried to download it yesterday but I found it very confusing and it gave an error, can anyone help me?

terse halo
latent kettle
#

Okay

latent kettle
patent trellisBOT
# latent kettle -realtime
💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

latent kettle
#

The first guide

#

Most suggested

terse halo
#

What I'm confused about is that there is a MB and a 2GB version

terse halo
latent kettle
#

After that you need virtual cable.

#

Virtual cable to connect your games or discord with Voice Changer

terse halo
latent kettle
#

Do You want real-time voice changer?

terse halo
latent kettle
#

@terse halo

terse halo
#

After I extract, where do I click?

latent kettle
terse halo
#

Ok

#

It's already downloading, and then I'll come back to ask, ok?

terse halo
#

and now?

latent kettle
#

Okay good

#

Download virtual cable

terse halo
#

link?

terse halo
#

Is it normal to say that there is a virus?

latent kettle
#

I think it's better to ignore it

terse halo
#

ok, it's downloading

#

and now?

#

@latent kettle

latent kettle
#

Okay start and give mic permissions

terse halo
#

done

#

and now?

#

can i close this?

latent kettle
#

Noo

#

Don't do that

#

It will not work then

#

@terse halo

terse halo
#

I haven't closed it yet

#

I already added the voice, how do I configure it so that the voice is synchronized?

terse halo
#

with my voice

terse halo
#

donwloand

latent kettle
#

After installing the Virtual Cable, it changes your default audio system. Click Yes when it asks you to open the audio device settings (or press WIN+R, type "mmsys.cpl" if you closed it already), and change your Recording and Playback devices back to your usual devices. Same for communications device aswell (right click -> set as default communication device)

terse halo
#

like go back to original?

#

@latent kettle

latent kettle
#

You must select virtual cable as output in voice changer and your mic as input in voice changer

latent kettle
#

Yep

terse halo
terse halo
latent kettle
#

Check ?

terse halo
#

this thing

#

@latent kettle

ocean holly
#

I got a question about w okada, or well, AI voice changers in general (hopefuly this is the right channel) Is there a software, app, or a general way to let the voice changer pick up my voice, while my IRL surroundings are playing like normal? for example, I use the voice in a vc, and theres a knock on my door, people can hear the knock from my mic, but also the AI voice.

final path
#

Hey guys I need someone to guide me to properly train a model, I'm a bit familiar with the process but I can still use some help

viscid moss
#

leave it as default

olive bear
#

tes

vapid gorge
#

Una imagen de dios en el espacio

winter dew
viscid moss
#

Those are outdated

viscid moss
#

models works for 48, but I'm not completely sure

#

Check it on a spectogram

scarlet fulcrum
#

how to change or fix delay?

viscid moss
#

try batch 8

viscid moss
#

I've 0 clue about those options tbh

#

ye

#

Not exactly

analog obsidian
#

yeah the description of the batch size is wrong
the number depends in how big the dataset is
since u got 1 hour i'd recommend 8 or 16, with 8 being safer here

knotty moth
#

vram is just a constraint, obviously not affecting the results

analog obsidian
#

20 minutes or less = batch 4
30 minutes and above = 8

knotty moth
#

not only that, it also depends on the dataset diversity

#

if it's too diverse, you might better split and train separate datasets

analog obsidian
knotty moth
#

it's still okay, unless you have multiple sources having different quality

analog obsidian
#

thats fine

#

still wrong

knotty moth
#

the safe bet is to use a single source, or try normalizing each source

analog obsidian
#

the only really bad thing for datasets is inconsistent quality and whispering

#

8, 6 is too low for 1 hour

#

i train big datasets

#

so i kinda know what it's best for them

analog obsidian
#

^ repeated words, very monotone speech, no expressions

knotty moth
analog obsidian
#

just use 8

#

for anything above 2 hours you may wanna try batch 16

#

and for 5 hours 32 is good

#

the more data u add, the more realistic the output, just sayin

analog obsidian
knotty moth
analog obsidian
#

yeah basically

#

another tip for best results: use spin, single scale loss

knotty moth
knotty moth
#

but well bf16 as well as fp16 allow using AMP

analog obsidian
#

so applio has this new branch named f0_spin, it introduced two game changing stuff: a new embedder, and they brought back the original's rvc way to calculate mel
spin handles breaths better than cvec (the default embedder)
back then applio dev added a new way to calculate mel named multi-scale, which is great but not intended to be used in rvc/hifigan, so it was found that adds ringing to the models due to that single scale was brought back, using it should give you a model with very little ringing/no ringing at all

analog obsidian
#

rvc > train > train.py
multiscale_mel_loss = False

disabling multiscale mel slightly reduces the vocal range of your model so remember that in case you wanna train a singing dataset

#

rvc > models > pretraineds > custom

#

yt_nails i forgot it wasn't named custom pretraineds anymore lmao

#

if this is too complicated for u, you can just ignore it anyway

#

yeah actually it's somewhat easy, finetuning in rvc is not really a hard task

#

uh weird, redownload the zip

analog obsidian
#

should give a less robotic model and better breaths

#

website? you mean colab?

#

dont run it as admin

#

just double click it

knotty moth
analog obsidian
#

if they dont like it they can use the old stuff

#

just because u dont like it doesnt means is bad

#

it's actually a great update

#

literally what rvc-boss intended to do back then

#

a new embedder

#

no lmao that aint a virus relax

#

training is so light on the gpu

#

u can literally play games while training

#

xD

#

u gotta need python 3.10 or 3.11 tho

#

this more advanced approach do need a few gigs of space tho

knotty moth
#

why being paranoid of it if you have good cooling system

analog obsidian
#

wait

#

its downloading the pretrains

#

dont close the cmd

knotty moth
#

I assume you don't play with any overclocking yet, so it should be safe, even in the furmark stress test

analog obsidian
#

did u open it as admin again?

knotty moth
#

well, have you done manual install with the latest torch and cuda 12.8?

#

the current compiled release one only works on RTX 40-series/older

analog obsidian
#

i think they already added support for 50xx in the branch

#

its fine bro

#

ignore the error

#

librosa being cringe for no reason

#

open the url in ur browser

analog obsidian
#

well there u go
congrats u installed applio yay

#

🏆

#

lmaooo

#

enjoy super fast training speeds now

#

yea

#

just train locally, it's better

#

kaggle is piss bad

#

u want a tutorial on how to use applio locally

#

?

#

ok so u did the two steps, applio is installed
download the pretrain i gave u, place where i told u to place it
place ur datasets inside the assets > datasets folder (or you can do it somewhere else, it doesnt matter lol)

knotty moth
#

at this point you might better sell your 5090 to any folks wanting it so bad and knowing what to do

analog obsidian
knotty moth
#

that was way overpriced, recently it has quite dropped

analog obsidian
#

then manually place the location of the dataset like this

#

yeah

#

use auto slice if you haven't truncated the silence of your dataset

#

yea but like

#

your audio still has silence

#

rvc kinda hates that

#

so use auto slicer

#

yes

#

multiscale thing?

#

more natural results

knotty moth
analog obsidian
#

so like simple words
applio by default added a thing that boosts your dataset voice range at the cost of ringing (a static sound while singing high notes)

#

if u dont want that

#

u can disable it

#

inside your model's log folder

#

everything should be there, index, g, d, ur epochs

#

graphs

#

and finally, save every 10 epochs if u wanna save some disk space

void holly
#

anyone know how remilia bandxz makes his voice like that?

crude flame
analog obsidian
analog obsidian
astral void
#

@left sentinel

forest jolt
#

how to de harmony a track with uvr5?

manic olive
#

I have an applio error saying that no api found and I am using version 3.2.3

knotty moth
manic olive
polar grail
#

yo, ive been tryna setup this ai shit for years on a shitty laptop that couldnt run all the downloads. Im back with an actual PC, someone want to help me set it up an explain how it works? im a dumbass and would prolly need help in a voicecall

latent kettle
#

@simple ore sorry to bother you but can you explain why this is happning like my g avg loss in decreasing but the G total loss is increasing.

simple ore
#

what batch size?

latent kettle
elder coral
#

why isn't mainline working

flint anvil
#

chat i am looking at the available pitch extraction and i asked gpt deepresearch to figure out what's better for realistic m2f voice conversion and it says crepe-full is better but it gives it more delay - and i notice the deiteris fork and i think wokada also have crepe-full without onnx so i assume its gpu bound
I have a really good gpu and i only have to run the voice changer, so no games and stuff, so is crepe-full better than rmvpe for pitch extraction if i can afford to run it?

urban dune
#

help, Why only works Beatrice, how to make it work and rvc

agile kelp
#

Guys from your experience , i tried many models , can anyone suggest me a model that cannot look like its an AI talking ? i tried many couldnt find the perfect one

urban dune
#

only one installed when launched, the other one doesn't download, I don't know why when the program starts

knotty moth
urban dune
#

I mean, everything worked for me before, after resetting Windows, when I start HTTP, only Beatrice is installed, without RVC, and nothing works

#

maybe it is possible to download RVC separately and everything that should be downloaded automatically

digital cairn
#

hey guys with experience, what is better, deiteris w-okada or vonovox? in terms of delay and quality of output

neat meadow
#

hi Could you let me know what the most current/latest version is right now? Please send me the link

brittle wing
#

Can anyone tell me how to add beats to an acoustic song using ai 😢

novel radish
#

Does anyone know of a model for cloning a very expressive voice with a lot of vocal voice! NOT LIKE ANUEL AA, BAD BUNNY. They don't have a voice! SOMETHING FOR MANELE FOR AN ARTIST NAMED FLORIN SALAM! HELP! I NEED SOMETHING BETTER THAN HIFI-GAN AND MORE REALISTIC WHEN CLONING SINGING VOICES

low shard
low shard
low shard
#

elaborate:

  • ur pc gpu
  • what u want to do
  • what u exactly mean
novel radish
#

I have a 1070 Nvidia, I want something better than Hifi-gan.. my cloned voices with this method are good but I want something better that feels more real when cloning these expressive voices, look for FLORIN SALAM AND YOU WILL SEE THAT HE HAS A LARGE VOICE! not like Anuel AA or Bad Bunny

crude flame
low shard
novel radish
#

I'm looking to train models! Better than Hifi-Gan! I train singing artist models.

#

I have applio rvc wit HIFI-GAN

neat meadow
low shard
low shard
neat meadow
novel radish
neat meadow
#

also i use at2020 and scarlet solo 4th

low shard
neat meadow
#

yes

low shard
low shard
#

we do general ai here

novel radish
#

Ok thanks

low shard
#

@neat meadow RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

what you'd need is wokada deiteris fork

low shard
neat meadow
#

what is fork version? What's the difference from the main version?

low shard
neat meadow
low shard
low shard
neat meadow
#

how to download fork version?

low shard
#

100% you should

share a screenshot of your current wokada so i can check the version, or just the folder name

neat meadow
#

appriciate

low shard
#

yes its the latest dw then

neat meadow
#

It corrects the accent of the model. Using it generates a lot of cpu usage

#

i like zero

#

idk formant Is it a new feature

low shard
#

you can decide how much index to do

higher value means more trained index is used, but can sound a bit more autotune

honest cedar
#

Can anyone help me with voicemeeter related issues?

#

I’m trying to bitcrush my rvc model

hallow thistle
#

High risk of damage? That ususally happens when your CPU overheats, it can happen when you overclock or your PC simply has a bad cooling system, regardless of any process running. It also depends on your PC CPU and such.

#

Promoting your Discord server or stuff is not allowed in this server. It's obvious.

hollow tartan
hollow tartan
hallow thistle
honest cedar
#

@crude flame Are you able to help my voicemeeter, codeman is asking if you can help me out

crude flame
honest cedar
#

It’s like working with blender all over again 🥲

honest cedar
#

still having issues

crude flame
#

Then what's your problem

honest cedar
#

and it says fader grain for all the sliders which doesnt match up with the picture

crude flame
honest cedar
#

no audio anywhere

crude flame
#

Did you fix your output audio in Windows

brittle wing
#

Applio no UI is not working again

honest cedar
#

and theres like 30 input output options

low shard
#

don't advertise please

low shard
crude flame
brittle wing
low shard
honest cedar
brittle wing
#

"Pkg_resources is deperecated as an API"

brittle wing
low shard
#

!give-media-perms 1h @brittle wing

brittle wing
#

Ah no prob it's training through

honest cedar
crude flame
#

So you have your windows output as your headphones and in voicemeeter you have your hardware out a1 as your headphones

crude flame
#

Did you restart your PC after installing voicemeeter

honest cedar
crude flame
#

On your virtual input try selecting a1 for both

honest cedar
#

there i put it as A1

#

still hear nothing

crude flame
#

Idk then, that always works for me

honest cedar
#

damn so much for bitcrushing

brittle wing
#

Why does it look like this

crude flame
honest cedar
crude flame
#

Wild

honest cedar
#

so random

#

now i got to see if it even works with rvc

#

@crude flame and what would i set input and output as?

crude flame
honest cedar
digital cairn
honest cedar
#

this isnt doing anything

#

no db

digital cairn
#

try restarting the whole client

honest cedar
#

okay

crude flame
honest cedar
crude flame
#

Did you select a voice

honest cedar
#

yes i chose my model, turned it on

#

and its just not going off

honest cedar
honest cedar
#

thats definitely a problem then

brittle wing
honest cedar
brittle wing
#

How do I decipher this?

honest cedar
#

someone else had told me you dont need to but dont take my word for it

latent kettle
#

It's your choice model will work with and without index

paper bloom
#

hey is there anyway to make owakada voicechanger more adapable to the english accent?

#

like sometimes it pronounces words diffrently

low shard
#

no

brittle wing
prime shell
#

how i can download applio?

prime shell
brittle wing
#

Cannot load file containing pickled data

#

-colab

patent trellisBOT
# brittle wing -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

brittle wing
#

Seems to be an issue with Numpy...yes

swift thunder
#

Does anyone know if Applio Colab is down? It stays here and doesn't advance: Starting backup loop... Files are up to date

swift thunder
brittle wing
#

What singer

swift thunder
brittle wing
swift thunder
brittle wing
#

Applio no UI is unusable

#

Am I doing resuming wrong?

swift thunder
brittle wing
swift thunder
#

No way, I have to wait to see if they can fix it.

brittle wing
crisp vault
#

is somebody in the hispanic one?

#

i cant find the link to it

brittle wing
#

Colabs are broken again...

brittle wing
#

'Pkg_resources is deprecated as an API'

simple ore
brittle wing
#

What did I do wrong?

simple ore
#

Which colab?

brittle wing
#

I tried the no workaround one too still the same outputs in code

simple ore
#

what workaround?

brittle wing
simple ore
#

that was a long time ago, fixed the install like 3 times after that

brittle wing
brittle wing
simple ore
#

not needed any more, just open the link as is

simple ore
#

the link above installed just fine, no errors or warnings

brittle wing
simple ore
#

screenshot

simple ore
brittle wing
brittle wing
#

Maybe I'm not resuming properly?

simple ore
#

that's wariong thrown by librosa

#

not an error

brittle wing
#

How...

#

Warning?But my model doesn't want to train it gets stuck on "data not found?"

simple ore
#

did you actually preprocess anything?

#

extracted features?

brittle wing
simple ore
#

make sure you actually restored the backup

brittle wing
#

Im cinfused

simple ore
#

Did you actualy load the backup, are all the files under logs/modelname?

brittle wing
#

I usually install load the backup & the last cell and hit training

#

Wait

simple ore
#

my spidey sense is tingling

brittle wing
#

???

simple ore
#

why you checking mute folder instead of your model's folder... which is not visible there

#

you said you used the URL i gave you, but did not load the backup?

brittle wing
#

Uh I'm still installing.

simple ore
#

but it is not in logs

brittle wing
#

😭

swift thunder
#

Don't forget about Applio UI too, please.

simple ore
#

load backup copies files from your google drive to logs

#

then you can resume training

brittle wing
#

I know the copies are there

simple ore
brittle wing
#

I'm doing it

#

RN

swift thunder
brittle wing
#

Now ?

brittle wing
#

It's happening

#

A good change

brittle wing
swift thunder
brittle wing
#

Last cell?

#

Still getting the same warning @simple ore

#

That's the issue!

simple ore
#

yes, it is warning

brittle wing
#

Look at the latest clip that's the error

brittle wing
simple ore
#

ugh

brittle wing
#

"No data left in file"

simple ore
#

try !uv pip install librosa==0.11.0 --upgrade

#

make a new cell

brittle wing
simple ore
#

run that, then your training

brittle wing
#

Still can't train

#

Honestly

simple ore
#

you're killing me bub

brittle wing
#

I'm sorry

simple ore
#

and i'm trying to watch a movie

brittle wing
#

Did everything over & over wasted 2 hours

simple ore
#

i'm gonna check locally

brittle wing
simple ore
#

it throws warnings, but works fine

#

librosa 0.11.0 install

#

works fine

brittle wing
#

Where are you looking?

#

I'm still confused

simple ore
#

now I'll check on colab

brittle wing
simple ore
#

after you ran 'install'

#

preprocess and extract featiures works fine with librosa 0.11.0

simple ore
#

as I see everything works fine

brittle wing
#

Yes I ran it but got errors previously

#

Just please tell me the execution order

simple ore
#

connec to drive, clone, install, +extra cells for librosa, load models, load backup, set training variables (must use the same model name and dr), then training

brittle wing
#

That's what I did...

#

After install I run the new cell for librosa ah oki

simple ore
#

that's only to hide those annoying warnings

brittle wing
#

After install comes the new cell?

simple ore
#

make a new cell, see above

brittle wing
#

Numpy array Multy array failed to import.

#

Ah wait backup complete,files are up to date

simple ore
#

!pip show numpy

#

it works fine for me

brittle wing
#

Training cell was stuck on "files are up to date"

simple ore
#

I have no idea.,. maybe your backup is fked

brittle wing
#

How do I fix it.

simple ore
#

I've tested restore from backup and resume training and it works for me.

brittle wing
undone abyss
#

im using w okada rn, and everytime i talk it gets super laggy and the ms spikes rlly high, anyone know a fix? or a way to reduce the lag

golden belfry
#

Hello! Im just curious as to how i could fix my issue. whenever i press start on a voice, it keeps saying “Frequent errors occur. Please check if the model of the framework being targeted is loaded.”

swift thunder
#

Hi, Applio UI won't load my datasets. Can you help me?

open raft
#

I’m new to voice-conversion and excited to explore RVC! I’ve read through the README and glanced at the code in model.py and inference.py, but I’m not sure where the “core” algorithm is implemented, and how all the pieces fit together.

What I’d love to know:

Which files or classes handle the feature extraction and model architecture?

Where is the training loop defined, and how do data preprocessing and postprocessing hook in?

Are there any papers, blog posts, or diagrams you recommend for a high-level overview?

Any in-code comments or tutorials aimed at beginners that I should read first?

I’m eager to learn and eventually contribute—thanks in advance for any guidance! 🙏

crude flame
open raft
quasi gyro
#

@simple ore for the klm v3 model, is it hifi gan?

wise token
#

why has every model become extremely high pitched even on 12 tune

simple ore
wise token
quasi gyro
#

the exp 3 pretrain for klm

#

its hifi right?

simple ore
#

which one

#

klm 6.1 v3 is hifigan spin7-12

quasi gyro
wise token
#

can someone help me

simple ore
quasi gyro
#

@simple ore is there a way to enable fp16 on apollo

#

so i can have faster training with my 4090

simple ore
#

4090 is fine crunching fp32

quasi gyro
simple ore
#

fp32 is not slow

#

i have 55 hour set doing 24min/epoch on 4070 with fp32

#

with fp16 it would probably be 20?

bronze hedge
#

yo guys, why does my changer take like 30 seconds to process, and even after it processes it cuts off and sounds horrible?

hallow thistle
#

Do you? Using an index model in W-Okada is not really recommended, as doing so will use more resources to process, potentially reducing the overall performance.

knotty moth
sterile hawk
#

so i set up my inputs and stuff with mic and cable input but cant hear anything and when i try to test in discord the yt video is playing in my mic test. anyone know whats wrong?

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
hallow thistle
#

Which W-Okada version are you using? Are you using VB-Cable or Virtual Audio Cable? And what is your PC GPU?

sterile hawk
hallow thistle
flint anvil
#

how do i edit the deiteris fork on my windows pc? i install it but it fails to install faiss-gpu because its linux only and i cant even build this on gh actions to get a working source (no idea how to reproduce)
im trying to change the source code

hallow thistle
hallow thistle
flint anvil
hallow thistle
flint anvil
#

but python wont install faiss-gpu on win

hallow thistle
flint anvil
#

but i cant work from source on win11 because faiss gpu doesnt exist for w11

hallow thistle
flint anvil
#

i want to edit the code for it

#

but its not possible to build a working version for gpu (i would explain why but itll just confuse us more)

hallow thistle
#

So what makes you think b2332 is outdated? Sorry, I don't do coding, but this version is the most stable out there. Any attempt to upgrade one of Python components can cause some other components to conflict each other, I've tried it with other Python-related programs.

flint anvil
#

it has nothing to do with the quality itself

#

but if you cant help me because this is a programming question can you point me to someone who can?

hallow thistle
quiet axle
#

so which one is better vonovox or deiteris fork?

hallow thistle
quiet axle
#

cool thx for the fast response

#

do any of you guys use voicemeeter or just use VAC lite and call it a day?

simple ore
#

voicemeeter is lil tricky to route

quiet axle
#

but is it better?

simple ore
#

not really

#

vac is dumb simple for what it does

hallow thistle
#

What does this mean? Although you can use index file in "regular RVC program", Applio for example, using index in W-Okada is still not really recommended anyway. As what I said.

quiet axle
#

oh okay, i was reading the realism section of the docs and they recommend using voicemeeter

hallow thistle
quiet axle
#

oh okayy thanks for the info

knotty moth
#

but ffs not the virtual cable

quiet axle
#

Whats lighthost?

knotty moth
#

the guide should have mentioned it

tawdry matrix
#

yo i need help with a voice changer

hallow thistle
patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
brittle wing
#

what f0 det do you use for tesla T4 gpu?

hallow thistle
#

Also, read the "How To Troubleshoot" above before you ask anything here.

brittle wing
#

My current f0 works (rvmpe_onnx) but i was wondering if theres a better one

#

the guide doesnt say anything about t4 gpu

knotty moth
hallow thistle
knotty moth
#

kaggle has dual gpu but it will use a single gpu anyway

brittle wing
#

I thought by "tesla" they meant like elon musks tesla and I was so confused

hallow thistle
#

NVIDIA Tesla GPU and Tesla the company are two different things, the NVIDIA Tesla is now called NVIDIA Data Center GPU for newer GPUs within series.

hallow thistle
#

Almost every RVC model has index file alongside with it, but some only provide just pth file. Not really that surprising. An index file stores the accent of that voice model.It can be created during voice model training in RVC program.

knotty moth
#

it's just whether it includes index file or not

#

it depends on the model maker

candid osprey
#

403 ERROR
The request could not be satisfied. for links lol.

#

I tried vpn it doesn't work.

tawdry shore
#

so i have the app on my phone can i still do the contest?

ancient fable
#

WHat are good AI voice apps

hallow thistle
ancient fable
#

What are good RVCs?

hallow thistle
#

I use Weights.com mostly for fast-accessing AI cover. But there's Applio, which is available as locally (PC) and online.

#

RVC refers to AI programs that can do voice convert and voice model training, but as what I said, there are many different programs of it, which one of them being Applio.

simple ore
brittle wing
# simple ore use a better vpn

I still have the same issue as yesterday.Would you mind if I dm you my drive folder with backups and you see where's the problem

#

NO DATA LEFT IN FILE...AGAIN

#

I CAN'T TRAIN NY MODEL

#

Should I start all over...

#

With another account

hallow thistle
#

Do I need to explain it again?

#

If you don't remember what I said earlier, let me say to you again. Using index in W-Okada is not recommended, as it will cause it to use more performance. While you can use index in regular RVC program, yes, but that's all.

analog obsidian
#

the voice changer app is running a rvc model in realtime
rvc does not stand for realtime voice changer, they're two separate things, rvc originally only works for local conversions and don't support realtime inside the webui

#

every rvc model is compatible with the .index files (yeah you can use any .index file with any model), although index files in realtime cause several issues and their usage in those conditions is not recommended, pick any .index file, set the index value to 0 and forget about its existence

analog obsidian
#

i think w-okada forces you to select a index file but im not sure, regardless, setting the index value to 0 will disable the index

golden walrus
#

Anyone know if KLM is good for real time? misc_smoke_cry

#

It sounds so good in these sample

#

Spin is a breakthrough for me

analog obsidian
#

for spin i'd recommend noobies pretrain instead, but the grads are a bit high, not sure why

#

spin only ^ doesn't work with cvec

#

remember spin is still experimental, for a more safer approach, use the original pretrain and contentvec

stone lynx
#

do u guys know where i can find how to create ai voice guide

low shard
stone lynx
sonic night
#

hi, i wanted to use rvc to put an audio file and then corvert it to another audio file with an AI voice, idk how

low shard
# stone lynx rx 6700 xt

You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):
    • Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
    • Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible : weights.com
easiest cloud: Ilaria rvc zero
easiest local: Applio

brittle wing
#

Ok

#

i made a environment folder inside coquis ai tts repo folder

#

and installed it

#

but how do i open it?

#

I don't see anything

#

Documentation on the github says to make a python script, but i wanted a GUI of the app

#

i don't wanna use the command console...

#

How do i open coquis ai tts?

sacred marten
#

hi

#

i have a question

#

where do you put these voice models in?

#

like what software

low shard
simple ore
latent cypress
#

does rtx 4060 8gb good enough for training image models locally? weights queue is taking way too long

simple ore
#

8gb in 2025 is a big mistake

#

you can probably train sd1.5 lora on it, not much else

median monolith
#

is it normal that the "Start" cell on Applio Kaggle takes a lot of time to even give me the links? its been like 10+ minutes and it still says "cell execution is queued".

median monolith
#

now +30 minutes

azure patio
#

which model is made to separate backing vocals from vocals?

brittle wing
brittle wing
azure patio
brittle wing
#

Well use that model on mvsep

azure patio
#

cause i tried it and it gave just vocals and instrumentals(which were empty since i have only vocals already)

brittle wing
#

Yes but use big beta 6x by unwa on the music source separation colab for Acapella first.

azure patio
#

i have acapella already

brittle wing
#

Then use this model on mvsep

#

Then use dereverb by anuew

#

After use uvr deecho

brittle wing
brittle wing
azure patio
#

thank you tt_happy

#

btw i did the acapella using the thing from kimberleyjsn, is that much different?

dire zinc
#

Guys how can I sort launch a project

#

On discord

median monolith
#

Im starting to get kinda insane, because idk wtf is wrong with my dataset for applio not being able to train it. the logs keep saying "Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?"

simple ore
golden walrus
#

Let me try Noobies' pretrain

simple ore
#

for some reason

#

i've tested noUI colab yesterday and it was fine

median monolith
#

If this helps, these are the options I put.

median monolith
simple ore
#

1 minute audio wont fit into any training buckets

#

who gave you this idea?

#

start again with a new model name and use simple slicing

#

make sure you get your ~30 files in extract features processed

median monolith
# simple ore

i mean, the dataset has little to no silence already, and so I thought that it was no necessary to cut it even more (let it as it is)

simple ore
#

you need to slice it, it has nothing to do with silences

median monolith
#

i thought it was basically going to remove parts of the audio and make it even shorter if i choose any other option, my bad

simple ore
#

automatic slicing does remove excessive silences, simple just shreds the file into digestible chunks

median monolith
simple ore
#

use default 3/0.3

median monolith
simple ore
#

if you have enough silence, actual 0 level silence, you can set mute files to 0

median monolith
opal depot
#

so for kaggle w-okada, how do I have persistence for files apply without having to do save version every time? do I run in edit mode or

median monolith
simple ore
#

what's the batch size?

median monolith
simple ore
#

60s/3 = 20 + 10% overlap

#

so 22 files is okay

#

what's in f0?

median monolith
simple ore
#

is it kaggle or colab?

median monolith
simple ore
#

22/(4x2) < 3

#

try batch 2

median monolith
simple ore
#

dont expect much of anything from just a minute of audio

median monolith
#

ik lol

signal chasm
#

Not sure why