dark ginkgo Jun 11, 2025, 7:19 PM

#

so flux dev would work on it then you are saying

dark ginkgo Jun 11, 2025, 7:40 PM

#

@simple ore I dont see these options on mine

#

#

also, whenever I try to change the sd vae to ae, it changes back

#

as in?

#

yes, but it also needs the sd_text_encoder in the options

#

it wont let me switch

#

see here #✨│ai-help message

#

also, looking it up, I have not found any info regarding Automatic1111 being compatible. Everyone says to use Forge or ComfyUI, but I can't find any tutorial online to set it up on a1111

#

I changed it to that already but also still no text encoder option

#

like I cant find this in the list of options

#

#✨│ai-help message

#

can you verify you are indeed using a1111 and not forge? a1111 has been abandoned from what I read so far with zero updates

#

as of 2024

#

furthermore, it said flux is incompatible with a1111 and there is no plan to update

red imp Jun 11, 2025, 10:01 PM

#

can anyone help me, this keeps popping up when I try to use the MMVCServerSIO in the folder, I am on a m1 silicon mac

📎 message.txt

red imp Jun 11, 2025, 10:28 PM

#

i switched to an updated os and it worked

#

but now for some reason when i try to select an output and monitor for the audio inside the voice changer, nothing shows up

#

idk how to use it

#

ohh ok thanks

#

it works

dark ginkgo Jun 11, 2025, 10:57 PM

#

@simple ore So I just downloaded this, used the default what it comes with. Already having issues

#

SD.next, never heard of it until today

#

I have yet to touch anything

#

like it just finished downloading

#

I'm doing my usual prompt test for flux, 1024/1024

#

photograph of a red apple on wooden table, red apple, wooden table, high quality, professional photograph, dark background,

thick aurora Jun 11, 2025, 11:05 PM

#

RVC v2 disconnected is working bro!

simple ore Jun 11, 2025, 11:11 PM

#

dark ginkgo @simple ore So I just downloaded this, used the default what it comes ...

anyway, if you have issues with sd.next, follow the link from their github page to discord

#

there's a good community there

light latch Jun 12, 2025, 1:04 AM

#

Hi, can someone help me? I know how to look for AI cover models, but I don't have the link to apply the voice to an audio file. I used Google Colab; has anything changed? Does anyone have a new link? Please answer.

elder coral Jun 12, 2025, 1:36 AM

#

is this a clean model

#

elder coral Jun 12, 2025, 1:37 AM

#

thick aurora RVC v2 disconnected is working bro!

finally

quasi gyro Jun 12, 2025, 2:50 AM

#

Did a new klm model drop or am I seeing things? Cause I saw that the thread was updated

simple ore Jun 12, 2025, 3:13 AM

#

quasi gyro Did a new klm model drop or am I seeing things? Cause I saw that the thread was ...

yes

quasi gyro Jun 12, 2025, 3:22 AM

#

simple ore yes

Where can I find it? There’s no links

simple ore Jun 12, 2025, 3:23 AM

#

https://huggingface.co/SeoulStreamingStation/KLM6_Experimental/tree/main 3 days ago

quasi gyro Jun 12, 2025, 3:24 AM

#

simple ore https://huggingface.co/SeoulStreamingStation/KLM6_Experimental/tree/main 3 da...

And it uses the spin embedder, right? Can I also just use Applio, or did he use codename's fork?

simple ore Jun 12, 2025, 3:32 AM

#

if you use a custom embedder option you can use any version

#

or applio exp branch

#

or the fork

modern surge Jun 12, 2025, 5:32 AM

#

-colab

patent trellisBOT Jun 12, 2025, 5:32 AM

#

modern surge -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

modern surge Jun 12, 2025, 5:35 AM

#

I need a model to separate the drums of a song

#

Just the drums

#

Does anyone know how to?

#

-uvr

patent trellisBOT Jun 12, 2025, 5:37 AM

#

modern surge -uvr

UVR: Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tools.

Website

https://ultimatevocalremover.com/

GitHub

https://github.com/Anjok07/ultimatevocalremovergui

Guide

https://docs.aihub.gg/rvc/resources/dataset-isolation/#vocal-isolation--cleaning

quasi gyro Jun 12, 2025, 5:51 AM

#

simple ore if you use a custom embedder option you can use any version

Yah but is it the spin embedd? Like what did he train it with is what I mean?

jovial kraken Jun 12, 2025, 5:55 AM

#

I mean the RVC version by w-okada

latent kettle Jun 12, 2025, 5:57 AM

#

jovial kraken I mean the RVC version by w-okada

RVC and w-okada are different things. W-okada is a real-time voice changer that uses RVC as its base

#

https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#kaggle

Deiteris' W Okada Fork

Last update: May 5, 2025

peak path Jun 12, 2025, 7:18 AM

#

I used the Colab version of AudioSep.
https://github.com/Audio-AGI/AudioSep
https://colab.research.google.com/github/Audio-AGI/AudioSep/blob/main/AudioSep_Colab.ipynb
But I've got an error.

Is there an active maintained version out there?

#

import torch
from pipeline import build_audiosep, separate_audio

# Run on CPU
device = torch.device('cpu')

# `models` is not defined in this snippet; it has to come from elsewhere in the notebook
model = build_audiosep(
    config_yaml='config/audiosep_base.yaml',
    checkpoint_path=str(models[0][1]),
    device=device)

audio_file = 'zand.wav'
text = 'water drops'
output_file = 'separated_audio.wav'

# AudioSep processes the audio at 32 kHz sampling rate
separate_audio(model, audio_file, text, output_file, device)

#

📎 message.txt

#

simple ore Jun 12, 2025, 7:43 AM

#

quasi gyro Yah but is it the spin embedd? Like what did he train it with is what I mean?

yes, spin7-12 layers

simple ore Jun 12, 2025, 7:45 AM

#

peak path

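depending on the version of torch, it is either a warning, or it simply did not load the weights at all. All torch.load calls need an explicit weights_only argument: weights_only=True is the safe setting, and a checkpoint that holds more than plain tensors will only load with weights_only=False, so use that only for files you trust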
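
For reference, the version dependence mentioned above can be sketched as a small check. This is only a sketch, assuming PyTorch 2.6 is the release where `torch.load` started defaulting to `weights_only=True`. The helper names `parse_major_minor` and `torch_load_default` are made up for illustration and are not part of PyTorch or AudioSep.

```python
from typing import Tuple


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Turn a version string such as '2.6.0+cu124' into (2, 6)."""
    release = version.split("+")[0]
    major, minor = release.split(".")[:2]
    return int(major), int(minor)


def torch_load_default(version: str) -> bool:
    """Return the value torch.load uses for weights_only when none is passed.

    Assumption: the default flipped from False to True in PyTorch 2.6.
    """
    return parse_major_minor(version) >= (2, 6)


# With an explicit argument the behaviour no longer depends on the version:
#   torch.load(path, map_location="cpu", weights_only=True)
# weights_only=False unpickles arbitrary objects, so it is only reasonable
# for checkpoint files from a source you trust.
for v in ("2.3.1", "2.5.1+cu121", "2.6.0"):
    print(v, "-> default weights_only =", torch_load_default(v))
```

Comparing `torch.__version__` this way before debugging can tell you whether a missing-weights problem is likely to be the changed default or something else.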

hazy eagle Jun 12, 2025, 8:43 AM

#

i use firefox how to fix this

#

everytime i choose my microphone on client that pops up

simple ore Jun 12, 2025, 9:01 AM

#

hazy eagle everytime i choose my microphone on client that pops up

if you're running it locally, use server mode

#

or change the mic settings in the sound control panel

#

hazy eagle Jun 12, 2025, 9:40 AM

#

simple ore if you're running it locally, use server mode

i don't want to use server because i think client is better

#

i tried this on many browsers but chrome gives best quality, i cant test it on firefox because of that error

knotty moth Jun 12, 2025, 10:13 AM

#

hazy eagle i don't want to use server because i think client is better

server mode allows using WASAPI devices which have less latency than MME devices that are used in client mode

#

besides that, the built-in noise suppression (Sup2) can only be used in client mode

quiet axle Jun 12, 2025, 12:02 PM

#

hi was wondering what crossfade setting u guys usin, or just using the default?

knotty moth Jun 12, 2025, 12:25 PM

#

quiet axle hi was wondering what crossfade setting u guys usin, or just using the default?

the default

quiet axle Jun 12, 2025, 12:26 PM

#

oh thanks, btw i was wondering. So i just reinstalled my windows because of certain problems. For some reason my vc doesn't sound like it used to

#

got some advice you reckon i could do?

#

i feel like the voice seems to abruptly end on a sentence more so than usual

elder coral Jun 12, 2025, 12:33 PM

#

how to make my index file have added_

#

because it doesn't have it

knotty moth Jun 12, 2025, 12:35 PM

#

quiet axle i feel like the voice seems to abruptly end on a sentence more so than usual

increase chunk settings according to gpu performance
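
The reasoning behind that advice, as a rough sketch: a chunk of audio has to be converted in less time than the chunk itself lasts, otherwise the output falls behind and gets cut off. The timings below are invented, and `headroom` is a hypothetical safety margin, not a setting in the voice changer.

```python
from typing import Dict, Optional


def smallest_safe_chunk(measured: Dict[float, float], headroom: float = 0.8) -> Optional[float]:
    """Return the smallest chunk length (ms) the GPU can keep up with.

    `measured` maps chunk length in ms to the time in ms the GPU needed
    to convert one chunk of that length. A chunk is treated as safe when
    the conversion fits inside `headroom` times the chunk length.
    """
    for chunk_ms in sorted(measured):
        if measured[chunk_ms] <= chunk_ms * headroom:
            return chunk_ms
    return None


# Invented timings for a slower GPU: small chunks take longer than they last.
timings = {100.0: 120.0, 200.0: 150.0, 400.0: 180.0}
print(smallest_safe_chunk(timings))
```

A bigger chunk adds delay, so the smallest chunk that still keeps up is usually the one to pick.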

quiet axle Jun 12, 2025, 12:36 PM

#

which protocol is better sio or rest? what's the difference anyway

knotty moth Jun 12, 2025, 12:41 PM

#

quiet axle which protocol is better sio or rest? what's the difference anyway

Protocol: rest (Use SIO if you want less delay but if you encounter any issues with SIO switch back to rest. Rest has slightly more delay than SIO)

quiet axle Jun 12, 2025, 12:41 PM

#

oh i see, thanks for information

simple ore Jun 12, 2025, 12:47 PM

#

elder coral how to make my index file have added_

you don't need to rename it.. usually a proper .index is included, regardless of what it is actually named

elder coral Jun 12, 2025, 12:47 PM

#

simple ore you dont need to rename it.. usually a proper .index is included, regardles of w...

because my model will get rejected

knotty moth Jun 12, 2025, 12:49 PM

#

elder coral because my model will get rejected

not really, we know Applio names it YourModel.index

elder coral Jun 12, 2025, 12:49 PM

#

yes

knotty moth Jun 12, 2025, 12:50 PM

#

and it works in any rvc applications

simple ore Jun 12, 2025, 12:50 PM

#

very old apps called index files like 'added_IVF1406_Flat_nprobe_1_modelname_v2.index' for no good reason

knotty moth Jun 12, 2025, 12:52 PM

#

the mainline and older versions name it like that, also spawns total_fea.npy and trained_*.index
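
The naming discussion above can be summed up in a small sketch that picks the index file by extension instead of by the `added_` prefix. It assumes that `trained_*.index` is an intermediate file and not the one to load. The function name `find_index_file` is made up for illustration.

```python
import tempfile
from pathlib import Path
from typing import Optional


def find_index_file(model_dir: Path) -> Optional[Path]:
    """Pick the .index file to load with a model, whatever it is named."""
    # Assumption: trained_*.index is an intermediate file, not the one to load.
    candidates = sorted(
        p for p in model_dir.glob("*.index") if not p.name.startswith("trained_")
    )
    # Prefer the old-style added_*.index when present, else any remaining .index.
    for path in candidates:
        if path.name.startswith("added_"):
            return path
    return candidates[0] if candidates else None


# Demo in a throwaway folder with an Applio-style name and a leftover file.
with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    for name in (
        "YourModel.pth",
        "YourModel.index",
        "trained_IVF1406_Flat_nprobe_1_modelname_v2.index",
    ):
        (folder / name).touch()
    picked = find_index_file(folder)
    picked_name = picked.name if picked else None
    print(picked_name)
```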

summer reef Jun 12, 2025, 1:10 PM

#

how to create models

#

pls help me

elder coral Jun 12, 2025, 1:36 PM

#

summer reef how to create models

Read this guide first before making models

#

https://docs.aihub.gg

Home

Last update: May 5, 2025

summer reef Jun 12, 2025, 1:44 PM

#

elder coral Read this guide first before making models

I can't understand this document. I want to find a current video and download it locally. When I tried it on the cloud before, there were problems like disconnect error.

elder coral Jun 12, 2025, 1:45 PM

#

summer reef I can't understand this document. I want to find a current video and download it...

-colab

patent trellisBOT Jun 12, 2025, 1:45 PM

#

elder coral -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

elder coral Jun 12, 2025, 1:45 PM

#

here it is

#

ask the helpers first to find videos on how to make models like that

atomic goblet Jun 12, 2025, 2:01 PM

#

I was thinking of getting a dedicated gpu for the voice changer, does anyone think a 5060 Ti would be good to handle that?

remote steeple Jun 12, 2025, 2:11 PM

#

how to make voice changer sound not robotic

simple ore Jun 12, 2025, 2:12 PM

#

atomic goblet I was thinking of getting a dedicated gpu for the voice changer, does anyone thi...

you can even get something cheaper if you wanna use a dedicated card

atomic goblet Jun 12, 2025, 2:14 PM

#

simple ore you can even get something cheaper if you wanna use a dedicated card

What GPU were you thinking?

simple ore Jun 12, 2025, 2:17 PM

#

atomic goblet What GPU were you thinking?

what's your main gpu?

atomic goblet Jun 12, 2025, 2:17 PM

#

3090 Ti

simple ore Jun 12, 2025, 2:18 PM

#

you can find like 3050 used for $100-150

#

just need to make sure you have a free x8 PCIe slot and 2 slots of space for the card itself

atomic goblet Jun 12, 2025, 2:23 PM

#

Hmm, that can work. I can try to find one then

forest jolt Jun 12, 2025, 2:26 PM

#

hi whats the latest fork or program for rvc gui
i tried to use rvc ai cover maker and it's not sounding as good as rvc gui

hazy eagle Jun 12, 2025, 3:29 PM

#

knotty moth server mode allows using WASAPI devices which have less latency than MME devices...

so it's better to move to server mode?

sleek sand Jun 12, 2025, 4:57 PM

#

I need to make a Russian girl's voice out of my male one. which model should I use and how should I configure it? I use AI-Voice Changer

undone abyss Jun 12, 2025, 5:08 PM

#

My command prompt closes automatically after opening start.bat anyone know a fix? I've downloaded PyTorch and Python

latent kettle Jun 12, 2025, 5:09 PM

#

undone abyss My command prompt closes automatically after opening start.bat anyone know a fix...

What are you using?

undone abyss Jun 12, 2025, 5:09 PM

#

Windows 11 I have an intel gpu

latent kettle Jun 12, 2025, 5:09 PM

#

Applio, w-okada?

undone abyss Jun 12, 2025, 5:09 PM

#

w okada

latent kettle Jun 12, 2025, 5:09 PM

#

Have you downloaded the correct version from the guide?

undone abyss Jun 12, 2025, 5:09 PM

#

I believe so, yes

#

i can try to check which one i downloaded

latent kettle Jun 12, 2025, 5:10 PM

#

-realtime

patent trellisBOT Jun 12, 2025, 5:10 PM

#

latent kettle -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

latent kettle Jun 12, 2025, 5:11 PM

#

undone abyss i can try to check which one i downloaded

https://github.com/deiteris/voice-changer/releases/download/b2332/voice-changer-windows-amd64-dml.zip

undone abyss Jun 12, 2025, 5:11 PM

#

thank you

undone abyss Jun 12, 2025, 5:11 PM

#

latent kettle https://github.com/deiteris/voice-changer/releases/download/b2332/voice-changer-...

do i delete the one i have rn

#

and install this?

latent kettle Jun 12, 2025, 5:12 PM

#

Probably if it is wrong version

undone abyss Jun 12, 2025, 5:12 PM

#

okay thank you

#

what do i do with the file you gave me

#

@latent kettle

#

do i extract

#

or like

#

what do i do

latent kettle Jun 12, 2025, 5:15 PM

#

undone abyss do i extract

Yes extract it

undone abyss Jun 12, 2025, 5:15 PM

#

OMG ITS DOWNLOADING THANK YOU SO MUCH

#

youre the best @latent kettle

#

appreciate it

latent kettle Jun 12, 2025, 5:16 PM

#

Yw glad to help you anime_giveheart

undone abyss Jun 12, 2025, 5:22 PM

#

@latent kettle one last question, how do i add a voice to the voice changer. I can't find a tutorial on YouTube to help me because the tutorials show w okada via app and im on a website

latent kettle Jun 12, 2025, 5:23 PM

#

undone abyss @latent kettle one last question, how do i add a voice to the voice chan...

See the edit button. Click on it and add model to any slot

#

#

@undone abyss

viral mason Jun 12, 2025, 5:27 PM

#

latent kettle

pth files go to model and index goes to index

undone abyss Jun 12, 2025, 5:29 PM

#

latent kettle

tyty

undone abyss Jun 12, 2025, 5:29 PM

#

viral mason pth files go to model and index goes to index

alr thank you

#

i appreciate both of you guys' help

viral mason Jun 12, 2025, 5:29 PM

#

no problem ^^

#

my dms are always open if anyone has questions about setting up the okada voice changer

undone abyss Jun 12, 2025, 5:41 PM

#

@viral mason I just dmed you and discord took away my permissions to dm, do you know why?

viral mason Jun 12, 2025, 5:48 PM

#

undone abyss @viral mason I just dmed you and discord took away my permissions to d...

idk did you block me?

#

I couldn't message you either

undone abyss Jun 12, 2025, 5:48 PM

#

i think someone went on my account and blocked ppl

#

i reset my cookie

#

idk what happened it was weird

#

i texted you

#

and then discord logged me out of everything the second i did

#

luckily i reset the cookie in time before stuff happened

#

but that was rlly weird

viral mason Jun 12, 2025, 5:49 PM

#

ok so I can't dm u for some reason but

#

I'll send the pictures here instead

undone abyss Jun 12, 2025, 5:49 PM

#

latent kettle https://github.com/deiteris/voice-changer/releases/download/b2332/voice-changer-...

was it bc i downloaded this?

undone abyss Jun 12, 2025, 5:49 PM

#

viral mason ok so I can't dm u for some reason but

okay ty

latent kettle Jun 12, 2025, 5:50 PM

#

undone abyss was it bc i downloaded this?

Voice changer for intel gpu

undone abyss Jun 12, 2025, 5:50 PM

#

idk what happened

#

that was weirddd

latent kettle Jun 12, 2025, 5:50 PM

#

I mean w-okada for intel

viral mason Jun 12, 2025, 5:50 PM

#

I can't send pictures

latent kettle Jun 12, 2025, 5:50 PM

#

The one you were using before was designed for Nvidia

viral mason Jun 12, 2025, 5:50 PM

#

I'm gonna hang myself

undone abyss Jun 12, 2025, 5:51 PM

#

viral mason I can't send pictures

thats so weird

undone abyss Jun 12, 2025, 5:51 PM

#

viral mason I'm gonna hang myself

donttt

undone abyss Jun 12, 2025, 5:51 PM

#

latent kettle The one you were using before was designed for Nvidia

ahhh

#

that makes sense

latent kettle Jun 12, 2025, 5:51 PM

#

Yep

viral mason Jun 12, 2025, 5:51 PM

#

I need image perms?

latent kettle Jun 12, 2025, 5:51 PM

#

Delete that

viral mason Jun 12, 2025, 5:51 PM

#

or something

latent kettle Jun 12, 2025, 5:51 PM

#

viral mason I need image perms?

Nop

viral mason Jun 12, 2025, 5:51 PM

#

wtf is wrong with discord rn then

undone abyss Jun 12, 2025, 5:51 PM

#

discord being weird

latent kettle Jun 12, 2025, 5:52 PM

#

viral mason wtf is wrong with discord rn then

Idk. I'm able to send images. We have Same roles

viral mason Jun 12, 2025, 5:52 PM

#

ugh

undone abyss Jun 12, 2025, 5:53 PM

#

w okada is delayed when i talk, anyone know how to fix?

#

and what settings to put?

viral mason Jun 12, 2025, 5:53 PM

#

undone abyss w okada is delayed when i talk, anyone know how to fix?

that's normal

undone abyss Jun 12, 2025, 5:53 PM

#

okayokay

#

how do i make it

#

sound better

viral mason Jun 12, 2025, 5:53 PM

#

latent kettle Jun 12, 2025, 5:54 PM

#

viral mason

Congratulations 🎊

viral mason Jun 12, 2025, 5:55 PM

#

viral mason

@inner pivot

undone abyss Jun 12, 2025, 5:56 PM

#

thank you

#

what would be good settings for intel gpu?

latent kettle Jun 12, 2025, 5:58 PM

#

patent trellis

Follow this guide

#

@undone abyss

undone abyss Jun 12, 2025, 5:59 PM

#

top one?

viral mason Jun 12, 2025, 5:59 PM

#

also this IF IT LETS ME SEND I SWEAR TO GOD

#

finally

#

jesus h christ

latent kettle Jun 12, 2025, 6:00 PM

#

undone abyss top one?

Yes. Just scroll down to "Settings Explained"

viral mason Jun 12, 2025, 6:01 PM

#

https://tenor.com/view/last-brain-cell-last-brain-cell-sntr37-gif-25436963


undone abyss Jun 12, 2025, 6:02 PM

#

i can't reopen MMVCServerSIO, it says failed to execute script 'client' due to unhandled exception

#

nvm i got it

#

ill look at setting explanation now @latent kettle

latent kettle Jun 12, 2025, 6:05 PM

#

undone abyss ill look at setting explanation now @latent kettle

Good luck

undone abyss Jun 12, 2025, 6:06 PM

#

thank you

viral mason Jun 12, 2025, 6:06 PM

#

undone abyss ill look at setting explanation now @latent kettle

good luck ^^

#

I'll try helping if I can but I'm about to burst into a ball of angry juice

undone abyss Jun 12, 2025, 6:08 PM

#

viral mason I'll try helping if I can but I'm about to burst into a ball of angry juice

yeah, i feel you. voice changer is lowkey complicated. and idk why you cant send images 😭

undone abyss Jun 12, 2025, 6:08 PM

#

viral mason good luck ^^

thank uu

viral mason Jun 12, 2025, 6:27 PM

#

kaggle doesn't work

#

it's fucking dead

full marsh Jun 12, 2025, 6:27 PM

#

same, kaggle showing firebase problem

viral mason Jun 12, 2025, 6:27 PM

#

misc_cry

#

at least emojis work

full marsh Jun 12, 2025, 6:28 PM

#

colab dead now kaggle ..nice

viral mason Jun 12, 2025, 6:28 PM

#

hold on

#

it loaded for me

#

kaggle

viral mason Jun 12, 2025, 6:29 PM

#

full marsh colab dead now kaggle ..nice

it might be back

#

it's not saving tho

full marsh Jun 12, 2025, 6:30 PM

#

viral mason it's not saving tho

yeah that's the firebase issue

viral mason Jun 12, 2025, 6:30 PM

#

:(

#

is there a way around or nah

full marsh Jun 12, 2025, 6:30 PM

#

it loads for me too but won't work

#

no unfortunately until the creators themselves fix the issue

viral mason Jun 12, 2025, 6:31 PM

#

do they fix it quickly usually orrrr

#

bc I kinda need to keep training the model I'm working on

full marsh Jun 12, 2025, 6:32 PM

#

viral mason do they fix it quickly usually orrrr

u need to try after few hours cause that issue is firebase...so kaggle might be doing it

viral mason Jun 12, 2025, 6:32 PM

#

full marsh u need to try after few hours cause that issue is firebase...so kaggle might be ...

😔

full marsh Jun 12, 2025, 6:33 PM

#

see if it's fixed after a few hours or wait for the notebook creators for a better solution

viral mason Jun 12, 2025, 6:33 PM

#

first discord breaks now Kaggle

#

sadge

full marsh Jun 12, 2025, 6:33 PM

#

u can always train locally if u have gd hardware

viral mason Jun 12, 2025, 6:33 PM

#

I don't like local training, it messed with my vr

full marsh Jun 12, 2025, 6:34 PM

#

i only use local for inference, training takes a toll on my pc

viral mason Jun 12, 2025, 6:35 PM

#

that's fair

#

kaggle was having issues like not too long ago was it not?

#

is someone attacking them or

viral mason Jun 12, 2025, 7:48 PM

#

what

full marsh Jun 12, 2025, 7:53 PM

#

viral mason what

gotta wait b4 getting GPU

#

u can get free immediate only CPU tho

#

i'm waiting too

viral mason Jun 12, 2025, 7:58 PM

#

full marsh gotta wait b4 getting GPU

misc_cry

#

bad update

#

this better not stick

full marsh Jun 12, 2025, 8:02 PM

#

viral mason misc_cry

going slow 😦

tough fiber Jun 12, 2025, 8:03 PM

#

it's stuck like this, is this colab broken? i used this 1 hour ago but now

full marsh Jun 12, 2025, 8:04 PM

#

tough fiber it's stuck like this, is this colab broken? i used this 1 hour ago but now

which colab is this

tough fiber Jun 12, 2025, 8:04 PM

#

#

now works

#

idk what happened i guess it's a colab thing

#

servers maybe

full marsh Jun 12, 2025, 8:15 PM

#

Finalllyyyy

winter dew Jun 12, 2025, 8:38 PM

#

is https://applio.org/ the thing to use to make models??

snow vine Jun 12, 2025, 9:05 PM

#

i need a lil help

#

how do you make ur voice not sound glitchy

crystal pine Jun 12, 2025, 10:06 PM

#

How do you make models??

astral jungle Jun 13, 2025, 3:09 AM

#

RVC V2 DISCONNECTED has been active 💔

Screenshot_2025-06-13-10-01-46-210_com.android.chrome.jpg

#

misc_lets_fucking_go

fair prism Jun 13, 2025, 3:23 AM

#

what AI stem do you guys use

#

is UVR still the most high quality

terse halo Jun 13, 2025, 4:06 AM

#

Does anyone know how to use the voicechanger?

#

do you have a version for ryzen?

viscid moss Jun 13, 2025, 4:24 AM

#

fair prism what AI stem do you guys use

it depends on what u want

viscid moss Jun 13, 2025, 4:24 AM

#

fair prism is UVR still the most high quality

yes

#

✅

viscid moss Jun 13, 2025, 4:24 AM

#

terse halo Does anyone know how to use the voicechanger?

-rt

patent trellisBOT Jun 13, 2025, 4:24 AM

#

viscid moss -rt

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

viscid moss Jun 13, 2025, 4:24 AM

#

patent trellis

@terse halo

#

First link

terse halo Jun 13, 2025, 4:28 AM

#

viscid moss @terse halo

can you Dm?

viscid moss Jun 13, 2025, 4:28 AM

#

terse halo can you Dm?

I'm about to sleep tbh. U just need to download the version according to your GPU
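
A rough sketch of that rule, using only the two file names posted in this chat: the `nvidia` build for Nvidia cards and the `dml` build that was given for an Intel GPU. Treating AMD cards the same as Intel is an assumption, and the keyword lists and the function name `pick_build` are made up for illustration.

```python
# Build file names as they appear in the links posted in this chat.
BUILDS = {
    "nvidia": "voice-changer-windows-nvidia-b2332.zip",
    "directml": "voice-changer-windows-amd64-dml.zip",
}

# Hypothetical keywords for guessing the GPU family from its name.
# Assumption: AMD cards use the same DirectML build as Intel.
KEYWORDS = {
    "nvidia": ("nvidia", "geforce", "rtx", "gtx"),
    "directml": ("intel", "amd", "radeon"),
}


def pick_build(gpu_name: str) -> str:
    """Return the build file name that matches a GPU name."""
    lowered = gpu_name.lower()
    for family, words in KEYWORDS.items():
        if any(word in lowered for word in words):
            return BUILDS[family]
    raise ValueError(f"no known build for GPU: {gpu_name!r}")


for gpu in ("RTX 3060", "Intel Arc A750"):
    print(gpu, "->", pick_build(gpu))
```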

fair prism Jun 13, 2025, 4:33 AM

#

how do you know which is the best pretrain

#

should I just use titan

viscid moss Jun 13, 2025, 12:05 PM

#

Do it after removing instruments

umbral breach Jun 13, 2025, 12:18 PM

#

Hey there, is the 9070XT capable of running inference or training on windows? Ive been trying to get it running on Applio with Zluda with no luck. Or should I just give up? lol

viscid moss Jun 13, 2025, 12:31 PM

#

Reverb, then backvocals and finally denoise

simple ore Jun 13, 2025, 12:34 PM

#

umbral breach Hey there, is the 9070XT capable of running inference or training on windows? Iv...

there is a fresh experimental build of native pytorch for rocm that supports 9070 on windows, zluda is not required

#

unfortunately by default the applio install comes with python 3.10 and TheRock requires python 3.11 or 3.12 (applio does not work with 3.12)

#

to use zluda you can see https://github.com/IAHispano/Applio/issues/1005

umbral breach Jun 13, 2025, 12:38 PM

#

simple ore there is a fresh experimental build of native pytorch for rocm that supports 907...

sweet

simple ore Jun 13, 2025, 12:39 PM

#

to use python 3.11 with Applio you basically need to nuke env folder and make a fresh venv using separately installed python3.11 and use pip install -r requirements.txt

#

without conda

#

or maybe change the install script to use python3.11 conda

#

that's another possibility

#

once applio is installed, install the experimental wheels
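
The steps above, as a minimal sketch. It only builds the commands and prints them, it does not run anything. The folder name `env`, the `python3.11` launcher name and the function name `venv_commands` are assumptions for illustration, and the wheel install is left out because the exact wheel file names are not given in this chat.

```python
import os
from pathlib import Path
from typing import List

# Python versions as (major, minor). The Applio set is a reading of the chat:
# the installer ships 3.10, 3.12 does not work, 3.11 is the suggested one.
THEROCK_WHEELS = {(3, 11), (3, 12)}
APPLIO_OK = {(3, 10), (3, 11)}
USABLE = THEROCK_WHEELS & APPLIO_OK  # the only version that satisfies both


def venv_commands(python311: str, applio_dir: Path, env_name: str = "env") -> List[List[str]]:
    """Build (but do not run) the commands for a fresh venv without conda."""
    env_dir = applio_dir / env_name
    on_windows = os.name == "nt"
    bin_dir = env_dir / ("Scripts" if on_windows else "bin")
    env_python = bin_dir / ("python.exe" if on_windows else "python")
    return [
        # 1. create the venv with the separately installed Python 3.11
        [python311, "-m", "venv", str(env_dir)],
        # 2. install Applio's requirements with the venv's own interpreter
        [str(env_python), "-m", "pip", "install", "-r", str(applio_dir / "requirements.txt")],
    ]


print(sorted(USABLE))
for cmd in venv_commands("python3.11", Path("Applio")):
    print(" ".join(cmd))
```

Each command list can be passed to `subprocess.run(cmd, check=True)` once the old env folder has been removed.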

umbral breach Jun 13, 2025, 12:40 PM

#

simple ore to use zluda you can see https://github.com/IAHispano/Applio/issues/1005

i've tried this but still no luck

simple ore Jun 13, 2025, 12:40 PM

#

https://github.com/scottt/rocm-TheRock/releases/tag/v6.5.0rc-pytorch-gfx110x

simple ore Jun 13, 2025, 12:40 PM

#

umbral breach i've tried this but still no luck

"no luck" does not give me anything to investigate

umbral breach Jun 13, 2025, 12:42 PM

#

simple ore "no luck" does not give me anything to investigate

ah sorry, I'll try the 3.11 method that u mentioned when i get some time, I'm still new to all this so thank you for the heads up

simple ore Jun 13, 2025, 12:47 PM

#

umbral breach ah sorry, I'll try the 3.11 method that u mentioned when i get some time, I'm st...

I have not tried the rock, you may need some beta of HIP SDK 6.5 that is not available yet.... never mind, should work with 6.2.4

#

but the method from the applio ticket should work

umbral breach Jun 13, 2025, 1:20 PM

#

oh okay, so it is possible its just me missing something...
I'll give it a go, thanks for the help

viscid moss Jun 13, 2025, 1:27 PM

#

#

https://docs.aihub.gg/rvc/resources/dataset-isolation/#the-best-models-for-uvr-are

Dataset & Isolation

Last update: May 5, 2025

terse halo Jun 13, 2025, 4:00 PM

#

I tried to download it yesterday but I found it very confusing and it gave an error, can anyone help me?

latent kettle Jun 13, 2025, 4:30 PM

#

terse halo I tried to download it yesterday but I found it very confusing and it gave an er...

Hii

#

Which GPU do you have

terse halo Jun 13, 2025, 4:31 PM

#

latent kettle Which GPU do you have

RTX 3060

latent kettle Jun 13, 2025, 4:31 PM

#

Okay

latent kettle Jun 13, 2025, 4:31 PM

#

terse halo RTX 3060

-realtime

patent trellisBOT Jun 13, 2025, 4:31 PM

#

latent kettle -realtime

💻 Local Realtime RVC

Guides for Programs that use RVC Models in Realtime for Calls/Games

• Wokada Deiteris Fork

Most suggested. GUIDE

• Original Wokada

ONLY the latest alpha comes close to the Deiteris Fork performance, older versions in youtube tuts are way worse. GUIDE

• **RVC GUI Mainline Realtime**

Unavailable, the guide is outdated and the program is worse compared to the ones above, and much less updated

latent kettle Jun 13, 2025, 4:31 PM

#

The first guide

#

Most suggested

terse halo Jun 13, 2025, 4:32 PM

#

What I'm confused about is that there is a MB and a 2GB version

latent kettle Jun 13, 2025, 4:32 PM

#

terse halo What I'm confused about is that there is a MB and a 2GB version

https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip


terse halo Jun 13, 2025, 4:33 PM

#

latent kettle https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nv...

Just download this or is there more?

latent kettle Jun 13, 2025, 4:33 PM

#

After that you need virtual cable.

#

Virtual cable to connect your games or discord with Voice Changer

terse halo Jun 13, 2025, 4:35 PM

#

terse halo Just download this or is there more?

After downloading this, where do I click?

latent kettle Jun 13, 2025, 4:35 PM

#

Do You want real-time voice changer?

terse halo Jun 13, 2025, 4:35 PM

#

latent kettle Do You want real-time voice changer?

Yes

latent kettle Jun 13, 2025, 4:36 PM

#

#

@terse halo

terse halo Jun 13, 2025, 4:37 PM

#

After I extract, where do I click?

latent kettle Jun 13, 2025, 4:38 PM

#

terse halo After I extract, where do I click?

After the download, you extract the zip file. You open the folders until you see an exe application called MMVCServerSIO and run that.

terse halo Jun 13, 2025, 4:38 PM

#

Ok

#

It's already downloading, and then I'll come back to ask, ok?

latent kettle Jun 13, 2025, 4:39 PM

#

terse halo It's already downloading, and then I'll come back to ask, ok?

Maybe Okay

terse halo Jun 13, 2025, 4:59 PM

#

latent kettle Maybe Okay

downloaded and I already extracted it and clicked on the exe

#

and now?

latent kettle Jun 13, 2025, 4:59 PM

#

Okay good

#

Download virtual cable

terse halo Jun 13, 2025, 4:59 PM

#

link?

terse halo Jun 13, 2025, 5:01 PM

#

latent kettle After the download, you extract the zip file. You open the folders until you see...

#

Is it normal to say that there is a virus?

latent kettle Jun 13, 2025, 5:03 PM

#

I think it's better to ignore it

terse halo Jun 13, 2025, 5:03 PM

#

ok, it's downloading

#

and now?

#

@latent kettle

latent kettle Jun 13, 2025, 5:06 PM

#

Okay start and give mic permissions

terse halo Jun 13, 2025, 5:07 PM

#

done

#

and now?

#

can i close this?

latent kettle Jun 13, 2025, 5:10 PM

#

Noo

#

Don't do that

#

It will not work then

#

@terse halo

terse halo Jun 13, 2025, 5:11 PM

#

I haven't closed it yet

#

I already added the voice, how do I configure it so that the voice is synchronized?

latent kettle Jun 13, 2025, 5:12 PM

#

terse halo I already added the voice, how do I configure it so that the voice is synchroniz...

Synchronized with what?

terse halo Jun 13, 2025, 5:13 PM

#

with my voice

#

terse halo Jun 13, 2025, 5:13 PM

#

latent kettle Virtual cable to connect your games or discord with Voice Changer

link?

latent kettle Jun 13, 2025, 5:14 PM

#

terse halo link?

https://software.muzychenko.net/freeware/vac470lite.zip

terse halo Jun 13, 2025, 5:14 PM

#

downloaded

latent kettle Jun 13, 2025, 5:15 PM

#

After installing the Virtual Cable, it changes your default audio system. Click Yes when it asks you to open the audio device settings (or press WIN+R, type "mmsys.cpl" if you closed it already), and change your Recording and Playback devices back to your usual devices. Same for communications device as well (right click -> set as default communication device)

terse halo Jun 13, 2025, 5:16 PM

#

like go back to original?

#

@latent kettle

latent kettle Jun 13, 2025, 5:20 PM

#

You must select virtual cable as output in voice changer and your mic as input in voice changer
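
A minimal sketch of the routing described above, assuming the voice changer shows plain device names. The device names and the `pick_devices` helper are hypothetical and only illustrate which device goes where: the real mic is the input, the virtual cable ("Line 1") is the output.

```python
def pick_devices(inputs, outputs, mic_hint, cable_hint="Line 1"):
    """Pick the voice changer input (real mic) and output (virtual cable).

    inputs/outputs are lists of device names as shown in the device dropdowns.
    mic_hint and cable_hint are substrings to look for (hypothetical names).
    """
    mic = next((d for d in inputs if mic_hint.lower() in d.lower()), None)
    cable = next((d for d in outputs if cable_hint.lower() in d.lower()), None)
    if mic is None:
        raise ValueError("microphone not found in the input list")
    if cable is None:
        raise ValueError("virtual cable not found in the output list")
    return mic, cable


# Example device lists (made up for illustration)
inputs = ["Microphone (USB Mic)", "Line 1 (Virtual Audio Cable)"]
outputs = ["Headphones (Realtek)", "Line 1 (Virtual Audio Cable)"]
print(pick_devices(inputs, outputs, mic_hint="USB Mic"))
```

The app that should receive the changed voice (Discord or a game) would then presumably use the virtual cable as its microphone, while Windows itself stays on the usual devices.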

terse halo Jun 13, 2025, 5:20 PM

#

latent kettle You must select virtual cable as output in voice changer and your mic as input i...

like this?

latent kettle Jun 13, 2025, 5:21 PM

#

Yep

terse halo Jun 13, 2025, 5:22 PM

#

terse halo like go back to original?

.

terse halo Jun 13, 2025, 5:23 PM

#

latent kettle Yep

Do I put the check on the microphone or on "line 1?"

latent kettle Jun 13, 2025, 5:23 PM

#

Check ?

terse halo Jun 13, 2025, 5:23 PM

#

this thing

#

@latent kettle

ocean holly Jun 13, 2025, 6:15 PM

#

I got a question about w-okada, or well, AI voice changers in general (hopefully this is the right channel). Is there a software, app, or a general way to let the voice changer pick up my voice while my IRL surroundings still come through like normal? For example, I use the voice in a vc and there's a knock on my door: people can hear the knock from my mic, but also the AI voice.

final path Jun 13, 2025, 7:50 PM

#

Hey guys I need someone to guide me to properly train a model, I'm a bit familiar with the process but I can still use some help

viscid moss Jun 13, 2025, 9:16 PM

#

leave it as default

olive bear Jun 13, 2025, 9:17 PM

#

tes

vapid gorge Jun 13, 2025, 9:18 PM

#

An image of God in space

winter dew Jun 13, 2025, 10:21 PM

#

https://applio.org/ is this the site for making models

viscid moss Jun 13, 2025, 10:26 PM

#

#

Those are outdated

viscid moss Jun 13, 2025, 11:27 PM

#

models works for 48, but I'm not completely sure

#

Check it on a spectrogram

scarlet fulcrum Jun 13, 2025, 11:33 PM

#

how to change or fix delay?

viscid moss Jun 13, 2025, 11:34 PM

#

try batch 8

viscid moss Jun 14, 2025, 12:15 AM

#

I've 0 clue about those options tbh

#

ye

#

Not exactly

#

https://docs.aihub.gg/rvc/resources/training/#batch-size

Training

Last update: May 5, 2025

analog obsidian Jun 14, 2025, 12:21 AM

#

yeah the description of the batch size is wrong
the number depends on how big the dataset is
since u got 1 hour i'd recommend 8 or 16, with 8 being safer here

knotty moth Jun 14, 2025, 12:22 AM

#

vram is just a constraint, obviously not affecting the results

analog obsidian Jun 14, 2025, 12:23 AM

#

20 minutes or less = batch 4
30 minutes and above = 8

knotty moth Jun 14, 2025, 12:23 AM

#

not only that, it also depends on the dataset diversity

#

if it's too diverse, you might be better off splitting it and training separate datasets

analog obsidian Jun 14, 2025, 12:25 AM

#

knotty moth if it's too diverse, you might better split and train separate datasets

?

#

why lol

analog obsidian Jun 14, 2025, 12:26 AM

#

knotty moth not only that, it also depends on the dataset diversity

not really, i got a very monotone 5 hour dataset, batch 16 was worse than batch 32

knotty moth Jun 14, 2025, 12:27 AM

#

it's still okay, unless you have multiple sources having different quality

analog obsidian Jun 14, 2025, 12:28 AM

#

thats fine

#

still wrong

knotty moth Jun 14, 2025, 12:29 AM

#

the safe bet is to use a single source, or try normalizing each source

analog obsidian Jun 14, 2025, 12:29 AM

#

the only really bad thing for datasets is inconsistent quality and whispering

#

8, 6 is too low for 1 hour

#

i train big datasets

#

so i kinda know what's best for them

analog obsidian Jun 14, 2025, 12:31 AM

#

knotty moth not only that, it also depends on the dataset diversity

#

^ repeated words, very monotone speech, no expressions

knotty moth Jun 14, 2025, 12:32 AM

#

analog obsidian still wrong

the thing is, I tried separating screaming parts for trying on metal vocals

analog obsidian Jun 14, 2025, 12:32 AM

#

just use 8

#

for anything above 2 hours you may wanna try batch 16

#

and for 5 hours 32 is good

#

the more data u add, the more realistic the output, just sayin
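
The batch size rule of thumb from the messages above can be written down as a small lookup. This is only a sketch of that advice, not an official Applio setting: the function name is made up, and the 20 to 30 minute gap and the exact 2 to 5 hour boundary are assumptions, since the chat does not pin them down.

```python
def suggest_batch_size(dataset_minutes: float) -> int:
    """Rule-of-thumb batch size based on dataset length (in minutes).

    From the chat: 20 min or less -> 4, 30 min and above -> 8,
    above 2 hours -> 16, around 5 hours -> 32.
    Assumption: 20 to 30 minutes is treated like the small case (4).
    """
    if dataset_minutes <= 0:
        raise ValueError("dataset length must be positive")
    if dataset_minutes >= 300:   # 5 hours or more
        return 32
    if dataset_minutes > 120:    # above 2 hours
        return 16
    if dataset_minutes >= 30:    # 30 minutes and above
        return 8
    return 4                     # small datasets


for minutes in (13, 60, 150, 300):
    print(minutes, "min ->", suggest_batch_size(minutes))
```

VRAM still caps what is possible; the table only suggests a starting point when the GPU can fit it.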

analog obsidian Jun 14, 2025, 12:33 AM

#

knotty moth the thing is, I tried separating screaming parts for trying on metal vocals

i already told u singing is not the way to test stuff

knotty moth Jun 14, 2025, 12:33 AM

#

analog obsidian not really, i got a very monotone 5 hour dataset, batch 16 was worse than batch...

so you tried bs 32 with checkpointing on a 16 GB gpu?

analog obsidian Jun 14, 2025, 12:34 AM

#

knotty moth so you tried bs 32 with checkpointing on a 16 GB gpu?

no checkpointing, bf16, 24gb vram gpu

#

yeah basically

#

another tip for best results: use spin, single scale loss

viscid moss Jun 14, 2025, 12:36 AM

#

https://tenor.com/view/burger-eating-frieren-frieren-beyond-journey's-end-sousou-no-frieren-gif-13425073513713719938

Tenor

knotty moth Jun 14, 2025, 12:36 AM

#

analog obsidian i already told u singing is not the way to test stuff

not everyone expects ideal results for that, if you can combine inference results to remove robotic sounds, it just works

knotty moth Jun 14, 2025, 12:37 AM

#

analog obsidian no checkpointing, bf16, 24gb vram gpu

I thought tf32/fp32 are preferable to it

#

but well bf16 as well as fp16 allow using AMP

analog obsidian Jun 14, 2025, 12:39 AM

#

so applio has this new branch named f0_spin, it introduced two game-changing things: a new embedder, and it brought back the original rvc way to calculate mel
spin handles breaths better than cvec (the default embedder)
back then the applio dev added a new way to calculate mel named multi-scale, which is great but was not intended to be used in rvc/hifigan, and it was found to add ringing to the models. because of that, single scale was brought back; using it should give you a model with very little ringing or no ringing at all

analog obsidian Jun 14, 2025, 12:40 AM

#

knotty moth I thought tf32/fp32 are preferrable to it

bf16 works just fine

#

https://github.com/IAHispano/Applio/tree/exp/f0_spin

GitHub

GitHub - IAHispano/Applio at exp/f0_spin

A simple, high-quality voice conversion tool focused on ease of use and performance. - GitHub - IAHispano/Applio at exp/f0_spin

#

you need this pretrain in order to use spin: https://huggingface.co/Aznamir/spin/blob/main/f0G32k_spin7-12.pth https://huggingface.co/Aznamir/spin/blob/main/f0D32k_spin7-12.pth (download g and d, and place them inside the "custom pretraineds" folder)

#

rvc > train > train.py
multiscale_mel_loss = False

disabling multiscale mel slightly reduces the vocal range of your model so remember that in case you wanna train a singing dataset
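
If editing the file by hand feels risky, the change above can be sketched as a tiny text edit. This assumes the flag sits in `rvc/train/train.py` as a plain `name = value` line, as the message suggests; the `set_flag` helper is hypothetical and is shown here on an in-memory string rather than the real file.

```python
import re


def set_flag(source: str, name: str, value: str) -> str:
    """Rewrite a top-level `name = ...` assignment in Python source text."""
    pattern = re.compile(rf"^([ \t]*{re.escape(name)}[ \t]*=[ \t]*).*$", re.MULTILINE)
    # Keep everything up to the "=" and swap only the value
    new_source, count = pattern.subn(lambda m: m.group(1) + value, source)
    if count == 0:
        raise KeyError(f"{name} not found")
    return new_source


# Stand-in for the contents of train.py (made up for illustration)
sample = "epochs = 10\nmultiscale_mel_loss = True\n"
print(set_flag(sample, "multiscale_mel_loss", "False"))
```

Keeping a backup copy of the original file before changing it makes it easy to switch back for singing datasets.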

#

rvc > models > pretraineds > custom
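
For anyone scripting the download: the two Hugging Face links above are "blob" page URLs, and the raw file is normally served from the matching "resolve" URL. A small sketch under that assumption, with `applio_root` standing in for wherever Applio was extracted (hypothetical name):

```python
from pathlib import Path
from urllib.parse import urlsplit


def to_direct_url(blob_url: str) -> str:
    """Turn a Hugging Face /blob/ page URL into a direct /resolve/ file URL."""
    return blob_url.replace("/blob/", "/resolve/", 1)


def target_path(applio_root: str, blob_url: str) -> Path:
    """Where the pretrain should end up: rvc > models > pretraineds > custom."""
    filename = Path(urlsplit(blob_url).path).name
    return Path(applio_root) / "rvc" / "models" / "pretraineds" / "custom" / filename


for url in (
    "https://huggingface.co/Aznamir/spin/blob/main/f0G32k_spin7-12.pth",
    "https://huggingface.co/Aznamir/spin/blob/main/f0D32k_spin7-12.pth",
):
    # The actual download could be done in a browser or with urllib.request.urlretrieve
    print(to_direct_url(url), "->", target_path("Applio", url))
```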

#

yt_nails i forgot it wasn't named custom pretraineds anymore lmao

#

if this is too complicated for u, you can just ignore it anyway

#

yeah actually it's somewhat easy, finetuning in rvc is not really a hard task

#

uh weird, redownload the zip

#

analog obsidian Jun 14, 2025, 12:51 AM

#

analog obsidian rvc > train > train.py multiscale_mel_loss = False disabling multiscale mel sli...

forgot to mention this
d_ste_per_g_step = 2
rvc's discriminator is pretty piss, this was added to make it less bad

#

should give a less robotic model and better breaths

#

website? you mean colab?

#

dont run it as admin

#

just double click it

knotty moth Jun 14, 2025, 12:54 AM

#

analog obsidian rvc > train > train.py multiscale_mel_loss = False disabling multiscale mel sli...

looking forward to seeing it come to the stable release
rn I'm still rather conservative about new stuff like spin, as my current setup "just works"
and I don't think you should recommend it to commoners, yet

analog obsidian Jun 14, 2025, 12:54 AM

#

if they dont like it they can use the old stuff

#

just because u dont like it doesnt mean it's bad

#

it's actually a great update

#

literally what rvc-boss intended to do back then

#

a new embedder

#

no lmao that aint a virus relax

#

training is so light on the gpu

#

u can literally play games while training

#

xD

#

u need python 3.10 or 3.11 tho
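
A quick way to check the Python requirement mentioned above before a manual install. The supported set comes straight from the message (3.10 or 3.11); the helper name is made up.

```python
import sys

# Versions named in the chat as working for a manual install
SUPPORTED = {(3, 10), (3, 11)}


def is_supported(version=None) -> bool:
    """Return True if the (major, minor) version is 3.10 or 3.11."""
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) in SUPPORTED


print("current interpreter ok:", is_supported())
```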

#

this more advanced approach does need a few gigs of space tho

knotty moth Jun 14, 2025, 12:57 AM

#

why be paranoid about it if you have a good cooling system

analog obsidian Jun 14, 2025, 12:57 AM

#

#

#

wait

#

its downloading the pretrains

#

dont close the cmd

knotty moth Jun 14, 2025, 12:59 AM

#

I assume you don't play with any overclocking yet, so it should be safe, even in the furmark stress test

analog obsidian Jun 14, 2025, 12:59 AM

#

did u open it as admin again?

knotty moth Jun 14, 2025, 1:00 AM

#

well, have you done manual install with the latest torch and cuda 12.8?

#

the current compiled release one only works on RTX 40-series/older

analog obsidian Jun 14, 2025, 1:00 AM

#

i think they already added support for 50xx in the branch

#

its fine bro

#

ignore the error

#

librosa being cringe for no reason

#

open the url in ur browser

analog obsidian Jun 14, 2025, 1:01 AM

#

analog obsidian open the url in ur browser

dis

#

well there u go
congrats u installed applio yay

#

🏆

#

lmaooo

#

enjoy super fast training speeds now

#

yea

#

just train locally, it's better

#

kaggle is piss bad

#

u want a tutorial on how to use applio locally

#

?

#

ok so u did the two steps, applio is installed
download the pretrain i gave u, place where i told u to place it
place ur datasets inside the assets > datasets folder (or you can do it somewhere else, it doesnt matter lol)

knotty moth Jun 14, 2025, 1:05 AM

#

at this point you might better sell your 5090 to any folks wanting it so bad and knowing what to do

analog obsidian Jun 14, 2025, 1:05 AM

#

misc_trolley

#

knotty moth Jun 14, 2025, 1:07 AM

#

that was way overpriced, it has dropped quite a bit recently

analog obsidian Jun 14, 2025, 1:07 AM

#

then manually place the location of the dataset like this

#

#

#

#

yeah

#

use auto slice if you haven't truncated the silence of your dataset

#

#

yea but like

#

your audio still has silence

#

rvc kinda hates that

#

so use auto slicer

#

yes

#

multiscale thing?

#

more natural results

#

emoji_40

knotty moth Jun 14, 2025, 1:11 AM

#

if you want step by step guidance, this convo should be continued in a new thread in https://discord.com/channels/1159260121998827560/1192011222023950368

analog obsidian Jun 14, 2025, 1:12 AM

#

so like simple words
applio by default added a thing that boosts your dataset voice range at the cost of ringing (a static sound while singing high notes)

#

if u dont want that

#

u can disable it

#

inside your model's log folder

#

#

everything should be there, index, g, d, ur epochs

#

graphs

#

and finally, save every 10 epochs if u wanna save some disk space

#

nah just make a thread here > #1192011222023950368

void holly Jun 14, 2025, 1:20 AM

#

anyone know how remilia bandxz makes his voice like that?

crude flame Jun 14, 2025, 2:31 AM

#

analog obsidian

Erm actually wavlm may be better than spin (ignoring the breaths)

analog obsidian Jun 14, 2025, 2:32 AM

#

crude flame Erm actually wavlm may be better than spin (ignoring the breaths)

breaths are crucial for realtime yt_nails

crude flame Jun 14, 2025, 2:33 AM

#

analog obsidian breaths are crucial for realtime yt_nails

Just don't breathe misc_trolley

analog obsidian Jun 14, 2025, 2:33 AM

#

troll

astral void Jun 14, 2025, 4:18 AM

#

@left sentinel

forest jolt Jun 14, 2025, 5:36 AM

#

how to de harmony a track with uvr5?

manic olive Jun 14, 2025, 5:42 AM

#

I have an applio error saying that no api found and I am using version 3.2.3

knotty moth Jun 14, 2025, 5:43 AM

#

manic olive I have an applio error saying that no api found and I am using version 3.2.3

try getting the latest version 3.2.9

manic olive Jun 14, 2025, 5:47 AM

#

knotty moth try getting the latest version 3.2.9

I installed the new version and it shows me the error file not found requirements.txt

polar grail Jun 14, 2025, 5:47 AM

#

yo, ive been tryna setup this ai shit for years on a shitty laptop that couldnt run all the downloads. Im back with an actual PC, someone want to help me set it up and explain how it works? im a dumbass and would prolly need help in a voicecall

latent kettle Jun 14, 2025, 8:55 AM

#

@simple ore sorry to bother you but can you explain why this is happening? like my G avg loss is decreasing but the G total loss is increasing.

simple ore Jun 14, 2025, 8:56 AM

#

latent kettle @simple ore sorry to bother you but can you explain why this is happni...

dont use search, just expand loss_avg_50
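
Why the averaged graph is the one to look at: the per-step total loss of a GAN jumps around a lot, so a smoothed curve shows the trend better. A minimal sketch, assuming `loss_avg_50` is a running average over the last 50 logged values (the name suggests it, but that is an assumption):

```python
from collections import deque


def rolling_mean(values, window=50):
    """Running average over the last `window` values (fewer at the start)."""
    buf = deque(maxlen=window)
    total = 0.0
    out = []
    for v in values:
        if len(buf) == window:
            total -= buf[0]  # value about to fall out of the window
        buf.append(v)
        total += v
        out.append(total / len(buf))
    return out


# Made-up noisy losses: individual values jump, the average moves slowly
noisy = [30.0, 34.0, 29.0, 35.0, 28.0, 33.0]
print(rolling_mean(noisy, window=3))
```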

#

what batch size?

latent kettle Jun 14, 2025, 9:05 AM

#

simple ore what batch size?

13 minutes of dataset and 4 batch size

elder coral Jun 14, 2025, 9:09 AM

#

why isn't mainline working

flint anvil Jun 14, 2025, 9:15 AM

#

chat i am looking at the available pitch extraction and i asked gpt deepresearch to figure out what's better for realistic m2f voice conversion and it says crepe-full is better but it gives it more delay - and i notice the deiteris fork and i think wokada also have crepe-full without onnx so i assume its gpu bound
I have a really good gpu and i only have to run the voice changer, so no games and stuff, so is crepe-full better than rmvpe for pitch extraction if i can afford to run it?

urban dune Jun 14, 2025, 10:03 AM

#

help, why does only Beatrice work? how do I make RVC work too

agile kelp Jun 14, 2025, 10:05 AM

#

Guys, from your experience: I tried many models, can anyone suggest a model that doesn't sound like an AI talking? I tried many and couldn't find the perfect one

urban dune Jun 14, 2025, 10:05 AM

#

only one was installed at launch, the other one doesn't download when the program starts, and I don't know why

knotty moth Jun 14, 2025, 10:19 AM

#

urban dune help, Why only works Beatrice, how to make it work and rvc

ye the og wokada is good for beatrice only

#

better get one of these:
https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/
https://docs.aihub.gg/rvc-voice-changer/local/vonovox/ (Nvidia only for now)

Deiteris' W Okada Fork

Last update: May 5, 2025

Vonovox

Last update: June 2, 2025

urban dune Jun 14, 2025, 10:29 AM

#

I mean, everything worked for me before, after resetting Windows, when I start HTTP, only Beatrice is installed, without RVC, and nothing works

#

maybe it is possible to download RVC separately and everything that should be downloaded automatically

digital cairn Jun 14, 2025, 11:46 AM

#

hey guys with experience, what is better, deiteris w-okada or vonovox? in terms of delay and quality of output

neat meadow Jun 14, 2025, 12:50 PM

#

hi Could you let me know what the most current/latest version is right now? Please send me the link

brittle wing Jun 14, 2025, 1:23 PM

#

Can anyone tell me how to add beats to an acoustic song using ai 😢

novel radish Jun 14, 2025, 1:31 PM

#

Does anyone know of a model for cloning a very expressive voice with a lot of vocal voice! NOT LIKE ANUEL AA, BAD BUNNY. They don't have a voice! SOMETHING FOR MANELE FOR AN ARTIST NAMED FLORIN SALAM! HELP! I NEED SOMETHING BETTER THAN HIFI-GAN AND MORE REALISTIC WHEN CLONING SINGING VOICES

low shard Jun 14, 2025, 2:03 PM

#

novel radish Does anyone know of a model for cloning a very expressive voice with a lot of vo...

elaborate:

ur pc gpu
what u want to do (pre-record or realtime)

also, remind yourself that it depends on how the model was trained, and you need to kinda voice act yourself

low shard Jun 14, 2025, 2:03 PM

#

neat meadow hi Could you let me know what the most current/latest version is right now? Plea...

of what? what do you want to do?
what's ur pc gpu?

low shard Jun 14, 2025, 2:04 PM

#

digital cairn hey guys with experience, what is better, deiteris w-okada or vonovox? in terms ...

kinda similar, heard @crude flame say it was slightly better for nvidia gpus in terms of delay

#

elaborate:

ur pc gpu
what u want to do
what u exactly mean

novel radish Jun 14, 2025, 2:05 PM

#

I have a 1070 Nvidia, I want something better than Hifi-gan.. my cloned voices with this method are good but I want something better that feels more real when cloning these expressive voices, look for FLORIN SALAM AND YOU WILL SEE THAT HE HAS A LARGE VOICE! not like Anuel AA or Bad Bunny

crude flame Jun 14, 2025, 2:06 PM

#

low shard kinda similar, heard @crude flame say it was slightly better for nvidia...

I love how my phone autocorrected vonovox to bonobo. Anyway yeah vonovox is better in delay and same quality

low shard Jun 14, 2025, 2:07 PM

#

novel radish I have a 1070 Nvidia, I want something better than Hifi-gan.. my cloned voices w...

a gtx 1070 isnt that great, are you looking to train models or use them in realtime?

want something better than Hifi-gan
RVC has limits, it can't 100% pass for human, for example it sucks at laughing

crude flame Jun 14, 2025, 2:10 PM

#

novel radish I have a 1070 Nvidia, I want something better than Hifi-gan.. my cloned voices w...

Vits1 problem not hifi

novel radish Jun 14, 2025, 2:14 PM

#

I'm looking to train models! Better than Hifi-Gan! I train singing artist models.

#

I have applio rvc with HIFI-GAN

neat meadow Jun 14, 2025, 2:17 PM

#

low shard of what? what do you want to do? what's ur pc gpu?

5600 and rtx3060

low shard Jun 14, 2025, 2:18 PM

#

novel radish I'm looking to train models! Better than Hifi-Gan! I train singing artist models...

RVC has limits, all you can do is try to train better and do voice acting

low shard Jun 14, 2025, 2:18 PM

#

neat meadow 5600 and rtx3060

download what? what do you want to do? also wdym with 5600?

neat meadow Jun 14, 2025, 2:18 PM

#

low shard download what? what do you want to do? also wdym with 5600?

ryzen 5600 cpu

novel radish Jun 14, 2025, 2:19 PM

#

low shard RVC has limits, all you can do is try to train better and do voice acting

And how do I train better? Or tell me exactly what to do to improve as much as possible.

neat meadow Jun 14, 2025, 2:20 PM

#

low shard download what? what do you want to do? also wdym with 5600?

voice changer

#

also i use at2020 and scarlet solo 4th

low shard Jun 14, 2025, 2:21 PM

#

neat meadow voice changer

realtime for calls right?

neat meadow Jun 14, 2025, 2:22 PM

#

yes

low shard Jun 14, 2025, 2:22 PM

#

novel radish And how do I train better? Or tell me exactly what to do to improve as much as p...

you can try checking the suggestions in the docs https://docs.aihub.gg/, but as i said, you will NEVER get something that is 100% like a human

low shard Jun 14, 2025, 2:22 PM

#

neat meadow yes

next time specify it, this isnt a voice changer server

#

we do general ai here

novel radish Jun 14, 2025, 2:23 PM

#

Ok thanks

low shard Jun 14, 2025, 2:23 PM

#

@neat meadow RVC = Retrieval-based-Voice-Conversion, the best Few Shots Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models. Technically, Mainline RVC does have a go-realtime.bat (aka RVC-GUI), but it's pretty messy and outdated so it's extremely not suggested for realtime.

Wokada = uses RVC for realtime inference. There's 2 main versions, Original made by Wok, and the most suggested one is Deiteris Fork (modified version)

#

what you'd need is wokada deiteris fork

#

read its guide https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

low shard Jun 14, 2025, 2:23 PM

#

novel radish Ok thanks

yw

neat meadow Jun 14, 2025, 2:26 PM

#

what is fork version? What's the difference from the main version?

neat meadow Jun 14, 2025, 2:26 PM

#

low shard @neat meadow RVC = Retrieval-based-Voice-Conversion, the best Few Shot...

thx

low shard Jun 14, 2025, 2:31 PM

#

neat meadow what is fork version? What's the difference from the main version?

better quality and performance

neat meadow Jun 14, 2025, 2:32 PM

#

low shard better quality and performance

fork version is better?

low shard Jun 14, 2025, 2:32 PM

#

wokada deiteris fork b2332 is the latest, there's also another program which might slightly be better but only for nvidia gpus https://docs.aihub.gg/rvc-voice-changer/local/vonovox/ and i haven't tested it personally

low shard Jun 14, 2025, 2:32 PM

#

neat meadow fork version is better?

yes

neat meadow Jun 14, 2025, 2:33 PM

#

how to download fork version?

low shard Jun 14, 2025, 2:35 PM

#

neat meadow how to download fork version?

read https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/

#

100% you should

share a screenshot of your current wokada so i can check the version, or just the folder name

neat meadow Jun 14, 2025, 2:36 PM

#

appreciate

low shard Jun 14, 2025, 2:37 PM

#

neat meadow appreciate

yw and lmk

#

yes its the latest dw then

neat meadow Jun 14, 2025, 2:44 PM

#

It corrects the accent of the model. Using it generates a lot of cpu usage

#

i like zero

#

idk formant Is it a new feature

low shard Jun 14, 2025, 2:48 PM

#

you can decide how much index to do

higher value means more trained index is used, but can sound a bit more autotune
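
What "how much index" means can be pictured as a simple mix. As far as I understand RVC inference, the features retrieved from the trained index are blended linearly with the features from the live voice, and the index value sets the mix. The sketch below assumes exactly that and uses plain lists instead of real feature tensors; the function name is made up.

```python
def blend_features(original, retrieved, index_rate):
    """Linear mix: index_rate=0 keeps the input features, 1 uses only the index."""
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    if len(original) != len(retrieved):
        raise ValueError("feature lists must have the same length")
    return [index_rate * r + (1.0 - index_rate) * o
            for o, r in zip(original, retrieved)]


live = [0.0, 2.0]      # features from the speaker's own voice (made up)
trained = [1.0, 4.0]   # nearest features found in the model's index (made up)
for rate in (0.0, 0.5, 1.0):
    print(rate, blend_features(live, trained, rate))
```

With the value at zero the index is effectively unused, which would also explain why a model still works without one.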

honest cedar Jun 14, 2025, 4:15 PM

#

Can anyone help me with voicemeeter related issues?

#

I’m trying to bitcrush my rvc model

hallow thistle Jun 14, 2025, 4:16 PM

#

High risk of damage? That usually happens when your CPU overheats, it can happen when you overclock or your PC simply has a bad cooling system, regardless of any process running. It also depends on your PC CPU and such.

#

Promoting your Discord server or stuff is not allowed in this server. It's obvious.

hollow tartan Jun 14, 2025, 4:17 PM

#

hallow thistle High risk of damage? That ususally happens when your CPU overheats, it can happe...

Mine is running on my cpu for a long time and theres been no problem.

hollow tartan Jun 14, 2025, 4:18 PM

#

hallow thistle Promoting your Discord server or stuff is not allowed in this server. It's obvio...

What do i do? I cant take the bot outside tho. How do i find people to test it?

hallow thistle Jun 14, 2025, 4:18 PM

#

Aruthink

honest cedar Jun 14, 2025, 4:20 PM

#

@crude flame Are you able to help my voicemeeter, codeman is asking if you can help me out

crude flame Jun 14, 2025, 4:29 PM

#

honest cedar @crude flame Are you able to help my voicemeeter, codeman is asking if ...

Do you have light host installed

honest cedar Jun 14, 2025, 4:30 PM

#

crude flame Do you have light host installed

Yes it’s mainly just voicemeeter

#

It’s like working with blender all over again 🥲

crude flame Jun 14, 2025, 4:31 PM

#

honest cedar Yes it’s mainly just voicemeeter

https://docs.aihub.gg/rvc-voice-changer/realism/#voicemeeter-setup

Realism

Last update: May 3, 2025

#

Follow that

honest cedar Jun 14, 2025, 4:33 PM

#

crude flame Follow that

i did

#

still having issues

crude flame Jun 14, 2025, 4:34 PM

#

Then what's your problem

honest cedar Jun 14, 2025, 4:35 PM

#

crude flame Then what's your problem

i cant hear anything when im running voicemeeter

#

and it says fader gain for all the sliders which doesnt match up with the picture

crude flame Jun 14, 2025, 4:38 PM

#

honest cedar i cant hear anything when im running voicemeeter

Like can't hear game audio can't hear the voice or can't hear anything at all

honest cedar Jun 14, 2025, 4:38 PM

#

crude flame Like can't hear game audio can't hear the voice or can't hear anything at all

everything

#

no audio anywhere

crude flame Jun 14, 2025, 4:39 PM

#

Did you fix your output audio in Windows

brittle wing Jun 14, 2025, 4:39 PM

#

Applio no UI is not working again

honest cedar Jun 14, 2025, 4:39 PM

#

crude flame Did you fix your output audio in Windows

im not sure how to setup my windows audio since the guide doesnt show me that

#

and theres like 30 input output options

low shard Jun 14, 2025, 4:40 PM

#

don't advertise please

low shard Jun 14, 2025, 4:40 PM

#

brittle wing Applio no UI is not working again

what's ur pc gpu? whats wrong?

crude flame Jun 14, 2025, 4:41 PM

#

honest cedar and theres like 30 input output options

Select your headphones for Windows output

brittle wing Jun 14, 2025, 4:41 PM

#

low shard what's ur pc gpu? whats wrong?

Android user

low shard Jun 14, 2025, 4:41 PM

#

brittle wing Android user

elaborate the issue

honest cedar Jun 14, 2025, 4:41 PM

#

crude flame Select your headphones for Windows output

okay im going to attempt it

brittle wing Jun 14, 2025, 4:41 PM

#

low shard elaborate the issue

Wait it's Colab is the code broken again.

#

"Pkg_resources is deperecated as an API"

brittle wing Jun 14, 2025, 4:43 PM

#

low shard elaborate the issue

See...
Vedi

low shard Jun 14, 2025, 4:43 PM

#

brittle wing "Pkg_resources is deperecated as an API"

send a screenshot

#

!give-media-perms 1h @brittle wing

brittle wing Jun 14, 2025, 4:44 PM

#

Ah no prob it's training through

honest cedar Jun 14, 2025, 4:44 PM

#

crude flame Select your headphones for Windows output

still cant hear anything

crude flame Jun 14, 2025, 4:48 PM

#

So you have your windows output as your headphones and in voicemeeter you have your hardware out a1 as your headphones

honest cedar Jun 14, 2025, 4:48 PM

#

crude flame So you have your windows output as your headphones and in voicemeeter you have y...

yes exactly

crude flame Jun 14, 2025, 4:49 PM

#

Did you restart your PC after installing voicemeeter

honest cedar Jun 14, 2025, 4:49 PM

#

crude flame Did you restart your PC after installing voicemeeter

i did yes

#

crude flame Jun 14, 2025, 4:50 PM

#

On your virtual input try selecting a1 for both

honest cedar Jun 14, 2025, 4:50 PM

#

#

there i put it as A1

#

still hear nothing

crude flame Jun 14, 2025, 4:54 PM

#

Idk then, that always works for me

honest cedar Jun 14, 2025, 4:55 PM

#

damn so much for bitcrushing

brittle wing Jun 14, 2025, 4:56 PM

#

#

Why does it look like this

crude flame Jun 14, 2025, 4:57 PM

#

crude flame Idk then, that always works for me

You could try setting voicemeeter input and aux as default comms device and default sound device

honest cedar Jun 14, 2025, 4:58 PM

#

crude flame You could try setting voicemeeter input and aux as default comms device and defa...

ill try it

honest cedar Jun 14, 2025, 5:01 PM

#

crude flame You could try setting voicemeeter input and aux as default comms device and defa...

that didnt work but i switched my stereo input 1 to A1 and it worked for some reason

#

crude flame Jun 14, 2025, 5:02 PM

#

Wild

honest cedar Jun 14, 2025, 5:02 PM

#

so random

#

now i got to see if it even works with rvc

#

@crude flame and what would i set input and output as?

crude flame Jun 14, 2025, 5:04 PM

#

honest cedar @crude flame and what would i set input and output as?

Input your mic
Output line 1

honest cedar Jun 14, 2025, 5:05 PM

#

crude flame Input your mic Output line 1

im not hearing my model

digital cairn Jun 14, 2025, 5:07 PM

#

honest cedar im not hearing my model

Put your headphones/speakers into monitor

honest cedar Jun 14, 2025, 5:08 PM

#

digital cairn Put your headphones/speakers into monitor

it is

#

this isnt doing anything

#

no db

digital cairn Jun 14, 2025, 5:08 PM

#

try restarting the whole client

honest cedar Jun 14, 2025, 5:08 PM

#

okay

crude flame Jun 14, 2025, 5:08 PM

#

honest cedar this isnt doing anything

Did you click start

honest cedar Jun 14, 2025, 5:09 PM

#

crude flame Did you click start

yeah i did click start

crude flame Jun 14, 2025, 5:09 PM

#

Did you select a voice

honest cedar Jun 14, 2025, 5:09 PM

#

yes i chose my model, turned it on

#

and its just not going off

honest cedar Jun 14, 2025, 5:31 PM

#

crude flame Did you select a voice

is it supposed to be picking up my desktop audio?

crude flame Jun 14, 2025, 5:31 PM

#

honest cedar is it supposed to be picking up my desktop audio?

No

honest cedar Jun 14, 2025, 5:31 PM

#

thats definitely a problem then

brittle wing Jun 14, 2025, 5:32 PM

#

honest cedar Jun 14, 2025, 5:32 PM

#

crude flame No

cause that bar is picking up like youtube

brittle wing Jun 14, 2025, 5:32 PM

#

How do I decipher this?

honest cedar Jun 14, 2025, 6:06 PM

#

someone else had told me you dont need to but dont take my word for it

latent kettle Jun 14, 2025, 6:08 PM

#

It's your choice, the model will work with or without the index

paper bloom Jun 14, 2025, 6:25 PM

#

hey is there any way to make the w-okada voice changer more adaptable to the english accent?

#

like sometimes it pronounces words differently

low shard Jun 14, 2025, 7:42 PM

#

brittle wing Ah no prob it's training through

so how did it go

#

no

brittle wing Jun 14, 2025, 9:04 PM

#

low shard so how did it go

Normal like everything's fine

prime shell Jun 14, 2025, 9:49 PM

#

how can i download applio?

simple ore Jun 14, 2025, 9:51 PM

#

prime shell how can i download applio?

https://huggingface.co/IAHispano/Applio/tree/main/Compiled/Windows

IAHispano/Applio at main

prime shell Jun 14, 2025, 9:51 PM

#

brittle wing Jun 14, 2025, 10:13 PM

#

simple ore https://huggingface.co/IAHispano/Applio/tree/main/Compiled/Windows

Another Applio no UI error/issue while training

#

Cannot load file containing pickled data

#

-colab

patent trellisBOT Jun 14, 2025, 10:15 PM

#

brittle wing -colab

📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**

Google Colab

• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

brittle wing Jun 14, 2025, 10:16 PM

#

simple ore https://huggingface.co/IAHispano/Applio/tree/main/Compiled/Windows

Do you have any workaround for this?

#

Seems to be an issue with Numpy...yes

swift thunder Jun 14, 2025, 10:30 PM

#

Does anyone know if Applio Colab is down? It stays here and doesn't advance: Starting backup loop... Files are up to date

brittle wing Jun 14, 2025, 10:32 PM

#

swift thunder Does anyone know if Applio Colab is down? It stays here and doesn't advance: `St...

Same I can't train

swift thunder Jun 14, 2025, 10:33 PM

#

brittle wing Same I can't train

must have fallen

brittle wing Jun 14, 2025, 10:33 PM

#

swift thunder must have fallen

(who are you training)

#

What singer

swift thunder Jun 14, 2025, 10:34 PM

#

brittle wing What singer

Korean

brittle wing Jun 14, 2025, 10:34 PM

#

swift thunder Korean

I know but which artist

swift thunder Jun 14, 2025, 10:35 PM

#

brittle wing I know but which artist

Sooin (MEOVV) was going to train but it's no good applio haha

brittle wing Jun 14, 2025, 10:35 PM

#

swift thunder Sooin (MEOVV) was going to train but it's no good applio haha

Ah okay so we're not training the same person lemme tag @simple ore for help

#

Applio no UI is unusable

#

Am I doing resuming wrong?

swift thunder Jun 14, 2025, 10:51 PM

#

brittle wing Applio no UI is unusable

I don't like it, it's very complicated haha

brittle wing Jun 14, 2025, 10:52 PM

#

swift thunder I don't like it, it's very complicated haha

It's easy 4 me

swift thunder Jun 14, 2025, 10:53 PM

#

brittle wing It's easy 4 me

Currently I like the most up-to-date, that's why I use Applio UI.

#

No way, I have to wait to see if they can fix it.

brittle wing Jun 14, 2025, 11:03 PM

#

swift thunder Currently I like the most up-to-date, that's why I use Applio UI.

Idk how to train there.

crisp vault Jun 14, 2025, 11:04 PM

#

is somebody in the hispanic one?

#

i cant find the link to it

brittle wing Jun 14, 2025, 11:12 PM

#

Colabs are broken again...

brittle wing Jun 14, 2025, 11:28 PM

#

'Pkg_resources is deprecated as an API'

simple ore Jun 14, 2025, 11:57 PM

#

brittle wing 'Pkg_resources is deprecated as an API'

so far it is a warning

brittle wing Jun 14, 2025, 11:58 PM

#

simple ore so far it is a warning

Can you help me+elaborate?

#

What did I do wrong?

simple ore Jun 14, 2025, 11:58 PM

#

Which colab?

brittle wing Jun 14, 2025, 11:59 PM

#

simple ore Which colab?

Applio no UI but with workaround

#

I tried the no workaround one too still the same outputs in code

simple ore Jun 15, 2025, 12:00 AM

#

what workaround?

brittle wing Jun 15, 2025, 12:01 AM

#

simple ore what workaround?

The code fixes you suggested in winter

simple ore Jun 15, 2025, 12:01 AM

#

that was a long time ago, fixed the install like 3 times after that

brittle wing Jun 15, 2025, 12:02 AM

#

https://github.com/IAHispano/Applio/issues/1025

brittle wing Jun 15, 2025, 12:03 AM

#

simple ore that was a long time ago, fixed the install like 3 times after that

So I shouldn't use the colab notebook I added this stuff on?

simple ore Jun 15, 2025, 12:03 AM

#

not needed any more, just open the link as is

#

https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio_NoUI.ipynb

Google Colab

brittle wing Jun 15, 2025, 12:03 AM

#

https://colab.research.google.com/drive/1olxO4hsZqO0SuoQMJkWS37jkitxwRfHB?authuser=4#scrollTo=j2nf3FiM14bx

simple ore Jun 15, 2025, 12:04 AM

#

the link above installed just fine, no errors or warnings

brittle wing Jun 15, 2025, 12:04 AM

#

simple ore https://colab.research.google.com/github/iahispano/applio/blob/master/assets/App...

I get the same errors here too

simple ore Jun 15, 2025, 12:04 AM

#

screenshot

brittle wing Jun 15, 2025, 12:04 AM

#

simple ore the link above installed just fine, no errors or warnings

Uh wait

simple ore Jun 15, 2025, 12:04 AM

#

brittle wing Jun 15, 2025, 12:06 AM

#

brittle wing Jun 15, 2025, 12:06 AM

#

simple ore

It's a training issue

#

Maybe I'm not resuming properly?

simple ore Jun 15, 2025, 12:07 AM

#

that's a warning thrown by librosa

#

not an error

brittle wing Jun 15, 2025, 12:07 AM

#

How...

#

Warning? But my model doesn't want to train, it gets stuck on "data not found"?

simple ore Jun 15, 2025, 12:09 AM

#

did you actually preprocess anything?

#

extracted features?

brittle wing Jun 15, 2025, 12:09 AM

#

simple ore extracted features?

In what sense? Should I do that before loading the backup after switching accounts?

simple ore Jun 15, 2025, 12:10 AM

#

make sure you actually restored the backup

#

brittle wing Jun 15, 2025, 12:10 AM

#

I'm confused

simple ore Jun 15, 2025, 12:11 AM

#

Did you actually load the backup, are all the files under logs/modelname?

brittle wing Jun 15, 2025, 12:11 AM

#

I usually install, load the backup, run the last cell, and hit training

#

Wait

simple ore Jun 15, 2025, 12:11 AM

#

my spidey sense is tingling

brittle wing Jun 15, 2025, 12:12 AM

#

#

???

simple ore Jun 15, 2025, 12:12 AM

#

why you checking mute folder instead of your model's folder... which is not visible there

#

you said you used the URL i gave you, but did not load the backup?

brittle wing Jun 15, 2025, 12:13 AM

#

simple ore you said you used the URL i gave you, but did not load the backup?

Model folder is on drive

#

Uh I'm still installing.

simple ore Jun 15, 2025, 12:14 AM

#

but it is not in logs

brittle wing Jun 15, 2025, 12:14 AM

#

😭

swift thunder Jun 15, 2025, 12:15 AM

#

Don't forget about Applio UI too, please.

simple ore Jun 15, 2025, 12:15 AM

#

load backup copies files from your google drive to logs

#

then you can resume training
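A quick way to confirm that the backup really landed in `logs` before resuming is to list the model folder from a scratch cell. This is only a minimal sketch: the helper name `list_model_files` is made up for illustration, and the `logs/<model name>` layout is assumed from the messages above rather than taken from the notebook itself.

```python
from pathlib import Path


def list_model_files(logs_root, model_name):
    """Return sorted relative paths of all files under logs_root/model_name.

    An empty list means nothing was restored there (or the model name
    does not match), so resuming would have nothing to load.
    """
    model_dir = Path(logs_root) / model_name
    if not model_dir.is_dir():
        return []
    return sorted(
        p.relative_to(model_dir).as_posix()
        for p in model_dir.rglob("*")
        if p.is_file()
    )


if __name__ == "__main__":
    # "logs" and "my_model" are placeholders; substitute your own values.
    files = list_model_files("logs", "my_model")
    print(f"{len(files)} file(s) found")
    for name in files[:20]:
        print(" ", name)
```

If the list comes back empty, the load backup step did not copy anything for that model name, which would match the "not in logs" symptom.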

brittle wing Jun 15, 2025, 12:15 AM

#

simple ore load backup copies files from your google drive to logs

How I do that???

#

I know the copies are there

simple ore Jun 15, 2025, 12:15 AM

#

brittle wing Jun 15, 2025, 12:15 AM

#

I'm doing it

#

RN

swift thunder Jun 15, 2025, 12:16 AM

#

simple ore

is this the applio ui or no ui?

brittle wing Jun 15, 2025, 12:16 AM

#

Now ?

brittle wing Jun 15, 2025, 12:16 AM

#

simple ore

That's what I'm doin' rn.

#

#

It's happening

#

A good change

brittle wing Jun 15, 2025, 12:18 AM

#

swift thunder is this the applio ui or no ui?

No UI one

swift thunder Jun 15, 2025, 12:18 AM

#

brittle wing No UI one

(cries in UI) 😭

brittle wing Jun 15, 2025, 12:18 AM

#

simple ore

Done now ...?

#

Last cell?

#

Still getting the same warning @simple ore

#

That's the issue!

simple ore Jun 15, 2025, 12:23 AM

#

yes, it is warning

brittle wing Jun 15, 2025, 12:23 AM

#

Look at the latest clip that's the error

brittle wing Jun 15, 2025, 12:24 AM

#

simple ore yes, it is warning

Look how do I fix that

simple ore Jun 15, 2025, 12:24 AM

#

ugh

brittle wing Jun 15, 2025, 12:24 AM

#

"No data left in file"

simple ore Jun 15, 2025, 12:24 AM

#

try `!uv pip install librosa==0.11.0 --upgrade`

#

make a new cell

brittle wing Jun 15, 2025, 12:24 AM

#

simple ore try `!uv pip install librosa==0.11.0 --upgrade`

Oki

simple ore Jun 15, 2025, 12:24 AM

#

run that, then your training

brittle wing Jun 15, 2025, 12:29 AM

#

simple ore run that, then your training

ImportError: numpy.core.multiarray failed to import

#

Still can't train

#

Honestly

simple ore Jun 15, 2025, 12:31 AM

#

you're killing me bub

brittle wing Jun 15, 2025, 12:34 AM

#

simple ore you're killing me bub

I can't train 😦

#

I'm sorry

simple ore Jun 15, 2025, 12:35 AM

#

and i'm trying to watch a movie

brittle wing Jun 15, 2025, 12:36 AM

#

simple ore and i'm trying to watch a movie

I understand but just help me fix the error quickly cause I loaded backup & ran the LAST cell

#

Did everything over & over wasted 2 hours

simple ore Jun 15, 2025, 12:39 AM

#

i'm gonna check locally

brittle wing Jun 15, 2025, 12:39 AM

#

simple ore i'm gonna check locally

Oki

simple ore Jun 15, 2025, 12:42 AM

#

it throws warnings, but works fine

#

#

librosa 0.11.0 install

#

works fine

brittle wing Jun 15, 2025, 12:47 AM

#

Where are you looking?

#

I'm still confused

simple ore Jun 15, 2025, 12:47 AM

#

now I'll check on colab

brittle wing Jun 15, 2025, 12:47 AM

#

simple ore now I'll check on colab

Pls do

simple ore Jun 15, 2025, 12:53 AM

#

#

after you ran 'install'

#

preprocess and extract features work fine with librosa 0.11.0

simple ore Jun 15, 2025, 12:57 AM

#

brittle wing Pls do

did you run 'set training variables' cell?

#

as I see everything works fine

brittle wing Jun 15, 2025, 1:00 AM

#

Yes I ran it but got errors previously

#

Just please tell me the execution order

simple ore Jun 15, 2025, 1:01 AM

#

connect to drive, clone, install, +extra cells for librosa, load models, load backup, set training variables (must use the same model name and dr), then training

brittle wing Jun 15, 2025, 1:01 AM

#

That's what I did...

#

After install I run the new cell for librosa ah oki

simple ore Jun 15, 2025, 1:02 AM

#

that's only to hide those annoying warnings

brittle wing Jun 15, 2025, 1:04 AM

#

After install comes the new cell?

simple ore Jun 15, 2025, 1:07 AM

#

make a new cell, see above

brittle wing Jun 15, 2025, 1:14 AM

#

simple ore try `!uv pip install librosa==0.11.0 --upgrade`

Done

#

Numpy array Multy array failed to import.

#

Ah wait backup complete,files are up to date

simple ore Jun 15, 2025, 1:16 AM

#

`!pip show numpy`
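An error like `numpy.core.multiarray failed to import` often points to a mismatch between the installed NumPy and the version another package was built against, so it helps to see both versions side by side. The same information can be read from Python in a new cell. This is a sketch using only the standard library; the helper name `installed_version` is hypothetical.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string, or None if not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Print the versions that matter for the import error discussed here.
    for name in ("numpy", "librosa"):
        print(name, installed_version(name) or "not installed")
```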

#

it works fine for me

brittle wing Jun 15, 2025, 1:17 AM

#

simple ore it works fine for me

New cell?

#

Training cell was stuck on "files are up to date"

simple ore Jun 15, 2025, 1:19 AM

#

I have no idea.,. maybe your backup is fked

brittle wing Jun 15, 2025, 1:20 AM

#

simple ore I have no idea.,. maybe your backup is fked

In what sense?
That could be

#

How do I fix it.

simple ore Jun 15, 2025, 1:27 AM

#

I've tested restore from backup and resume training and it works for me.

brittle wing Jun 15, 2025, 1:38 AM

#

simple ore I've tested restore from backup and resume training and it works for me.

Restore from backup...

undone abyss Jun 15, 2025, 2:26 AM

#

im using w okada rn, and everytime i talk it gets super laggy and the ms spikes rlly high, anyone know a fix? or a way to reduce the lag

golden belfry Jun 15, 2025, 2:30 AM

#

Hello! Im just curious as to how i could fix my issue. whenever i press start on a voice, it keeps saying “Frequent errors occur. Please check if the model of the framework being targeted is loaded.”

swift thunder Jun 15, 2025, 2:46 AM

#

Hi, Applio UI won't load my datasets. Can you help me?

open raft Jun 15, 2025, 3:09 AM

#

I’m new to voice-conversion and excited to explore RVC! I’ve read through the README and glanced at the code in model.py and inference.py, but I’m not sure where the “core” algorithm is implemented, and how all the pieces fit together.

What I’d love to know:

Which files or classes handle the feature extraction and model architecture?

Where is the training loop defined, and how do data preprocessing and postprocessing hook in?

Are there any papers, blog posts, or diagrams you recommend for a high-level overview?

Any in-code comments or tutorials aimed at beginners that I should read first?

I’m eager to learn and eventually contribute—thanks in advance for any guidance! 🙏

crude flame Jun 15, 2025, 3:28 AM

#

open raft I’m new to voice-conversion and excited to explore RVC! I’ve read through the RE...

have you tried looking in extract.py, train.py, and preprocess.py for feature extraction, preprocessing, and the training code

open raft Jun 15, 2025, 3:32 AM

#

crude flame have you tried looking in extract.py, train.py, and preprocess.py for feature ex...

I’ve reviewed the suggested modules, but my goal is to build an intuitive mental model before diving back into the code.

quasi gyro Jun 15, 2025, 3:53 AM

#

@simple ore for the klm v3 model, is it hifi gan?

wise token Jun 15, 2025, 4:06 AM

#

why has every model become extremely high pitched even on 12 tune

simple ore Jun 15, 2025, 4:09 AM

#

quasi gyro @simple ore for the klm v3 model, is it hifi gan?

i imagine klm v3 is old af

wise token Jun 15, 2025, 4:09 AM

#

simple ore i imagine klm v3 is old af

do yk why?

quasi gyro Jun 15, 2025, 4:09 AM

#

simple ore i imagine klm v3 is old af

I mean the pretrain @tight ether made

#

the exp 3 pretrain for klm

#

its hifi right?

simple ore Jun 15, 2025, 4:10 AM

#

which one

#

klm 6.1 v3 is hifigan spin7-12

quasi gyro Jun 15, 2025, 4:11 AM

#

simple ore klm 6.1 v3 is hifigan spin7-12

thank u

wise token Jun 15, 2025, 4:15 AM

#

can someone help me

simple ore Jun 15, 2025, 4:50 AM

#

wise token can someone help me

pitch +12 raises the input audio's pitch by 12 semitones, a full octave
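To see why +12 sounds so high, assume the pitch setting counts equal-tempered semitones, as is usual in RVC tools. Each semitone multiplies frequency by the twelfth root of two, so twelve of them double it. A small sketch (the function name `semitone_ratio` is made up for illustration):

```python
def semitone_ratio(semitones):
    """Frequency multiplier for a shift in equal-tempered semitones."""
    return 2.0 ** (semitones / 12.0)


if __name__ == "__main__":
    # -12 halves the frequency, 0 leaves it alone, +12 doubles it.
    for shift in (-12, 0, 12):
        print(shift, semitone_ratio(shift))
```

Under that assumption, a setting of 0 leaves the input pitch unchanged.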

quasi gyro Jun 15, 2025, 5:08 AM

#

@simple ore is there a way to enable fp16 on apollo

#

so i can have faster training with my 4090

simple ore Jun 15, 2025, 6:33 AM

#

quasi gyro so i can have faster training with my 4090

faster is not better

#

4090 is fine crunching fp32

quasi gyro Jun 15, 2025, 6:34 AM

#

simple ore faster is not better

Figured it out anyway, need faster for the project Im doing, doesn't have to be perfect

simple ore Jun 15, 2025, 6:35 AM

#

fp32 is not slow

#

i have 55 hour set doing 24min/epoch on 4070 with fp32

#

with fp16 it would probably be 20?

bronze hedge Jun 15, 2025, 7:49 AM

#

yo guys, why does my changer take like 30 seconds to process, and even after it processes it cuts off and sounds horrible?

hallow thistle Jun 15, 2025, 7:52 AM

#

Do you? Using an index model in W-Okada is not really recommended, as doing so will use more resources to process, potentially reducing the overall performance.

knotty moth Jun 15, 2025, 7:54 AM

#

quasi gyro Figured it out anyway, need faster for the project Im doing, doesn't have to be ...

fp16 has exploding gradients issue, to mitigate it switch to bf16 or fp32 (tf32)
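One reason fp16 is fragile is its narrow range: the largest finite half-precision value is 65504, while bf16 keeps the same 8-bit exponent as fp32. The sketch below only illustrates that range limit with the standard library's half-float packing. It says nothing about training dynamics in any particular trainer, and `fits_fp16` is a hypothetical helper name.

```python
import struct


def fits_fp16(value):
    """True if value can be stored as an IEEE 754 half-precision float."""
    try:
        struct.pack("<e", value)  # "e" is the half-precision format code
        return True
    except OverflowError:
        return False


if __name__ == "__main__":
    for v in (1.0, 65504.0, 1e5):
        print(v, "fits in fp16" if fits_fp16(v) else "overflows fp16")
```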

sterile hawk Jun 15, 2025, 8:08 AM

#

so i set up my inputs and stuff with mic and cable input but cant hear anything and when i try to test in discord the yt video is playing in my mic test. anyone know whats wrong?

hallow thistle Jun 15, 2025, 8:11 AM

#

sterile hawk so i set up my inputs and stuff with mic and cable input but cant hear anything ...

!howtoask

patent trellisBOT Jun 15, 2025, 8:11 AM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ :matsuripray:

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

hallow thistle Jun 15, 2025, 8:11 AM

#

Which W-Okada version are you using? Are you using VB-Cable or Virtual Audio Cable? And what is your PC GPU?

sterile hawk Jun 15, 2025, 8:18 AM

#

hallow thistle Which W-Okada version are you using? Are you using VB-Cable or Virtual Audio Cab...

it says v.1.5.3.18a, i am using VB-Cable and i have a NVIDIA GTX 1070

hallow thistle Jun 15, 2025, 8:19 AM

#

sterile hawk it says v.1.5.3.18a, i am using VB-Cable and i have a NVIDIA GTX 1070

Download and use this better W-Okada instead. Yours is old and outdated. And make sure to try Virtual Audio Cable lite instead of VB-Cable. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

sterile hawk Jun 15, 2025, 8:20 AM

#

hallow thistle Download and use this better W-Okada instead. Yours is old and outdated. And mak...

okay, thank you very much

flint anvil Jun 15, 2025, 8:21 AM

#

how do i edit the deiteris fork on my windows pc? i install it but it fails to install faiss-gpu because its linux only and i cant even build this on gh actions to get a working source (no idea how to reproduce)
im trying to change the source code

hallow thistle Jun 15, 2025, 8:21 AM

#

sterile hawk okay, thank you very much

Let me know if you have issue about delayed audio and low quality. anime_pray

hallow thistle Jun 15, 2025, 8:23 AM

#

flint anvil how do i edit the deiteris fork on my windows pc? i install it but it fails to i...

Which Deiteris fork W-Okada did you install? I don't think this fork W-Okada would do that.

flint anvil Jun 15, 2025, 8:24 AM

#

hallow thistle Which Deiteris fork W-Okada did you install? I don't think this fork W-Okada would...

the latest b2332 for nvidia cuda on win

hallow thistle Jun 15, 2025, 8:25 AM

#

flint anvil the latest b2332 for nvidia cuda on win

If you wanna modify fork W-Okada, there's a GitHub for that, if you know how to do so. https://github.com/deiteris/voice-changer

GitHub

GitHub - deiteris/voice-changer: リアルタイムボイスチェ...

リアルタイムボイスチェンジャー Realtime Voice Changer. Contribute to deiteris/voice-changer development by creating an account on GitHub.

flint anvil Jun 15, 2025, 8:25 AM

#

hallow thistle If you wanna modify fork W-Okada, there's a GitHub for that, if you know how to ...

i think you're not understanding - i downloaded this and tried to start it from source

#

but python wont install faiss-gpu on win

hallow thistle Jun 15, 2025, 8:26 AM

#

However, there's a compiled one there in this guide, this one doesn't need to be installed from start. https://docs.aihub.gg/rvc-voice-changer/local/deiteris-w-okada-fork/#download-nvidia-on-windows

Deiteris' W Okada Fork

Last update: May 5, 2025

flint anvil Jun 15, 2025, 8:26 AM

#

hallow thistle However, there's a compiled one there in this guide, this one doesn't need to be...

yeah thats not useful to me, i want to write my own better fork as its not being updated

#

but i cant work from source on win11 because faiss gpu doesnt exist for w11

hallow thistle Jun 15, 2025, 8:32 AM

#

flint anvil but i cant work from source on win11 because faiss gpu doesnt exist for w11

Do you mean you want fork W-Okada to work on Linux? Sorry, but I still can't figure out what you want to do with this.

flint anvil Jun 15, 2025, 8:32 AM

#

hallow thistle Do you mean you want fork W-Okada to work on Linux? Sorry, but I still can't fig...

i want to edit the deiteris fork on windows

#

i want to edit the code for it

#

but its not possible to build a working version for gpu (i would explain why but itll just confuse us more)

hallow thistle Jun 15, 2025, 8:36 AM

#

So what makes you think b2332 is outdated? Sorry, I don't do coding, but this version is the most stable out there. Any attempt to upgrade one of Python components can cause some other components to conflict each other, I've tried it with other Python-related programs.

flint anvil Jun 15, 2025, 8:37 AM

#

hallow thistle So what makes you think b2332 is outdated? Sorry, I don't do coding, but this ve...

i need to write postprocessing sfx, fix the wokada gui as its confusing in parts, remove outdated stuff like fcpe, and clean up the code

#

it has nothing to do with the quality itself

#

but if you cant help me because this is a programming question can you point me to someone who can?

hallow thistle Jun 15, 2025, 8:40 AM

#

flint anvil but if you cant help me because this is a programming question can you point me ...

Anyone with the "Engineer" role. But @wispy lodge is the author of the fork program.

quiet axle Jun 15, 2025, 9:32 AM

#

so which one is better vonovox or deiteris fork?

hallow thistle Jun 15, 2025, 9:37 AM

#

quiet axle so which one is better vonovox or deiteris fork?

Deiteris' fork W-Okada.

quiet axle Jun 15, 2025, 9:38 AM

#

cool thx for the fast response

#

do any of you guys use voicemeeter or just use VAC lite and call it a day?

simple ore Jun 15, 2025, 9:49 AM

#

quiet axle do any of you guys use voicemeeter or just use VAC lite and call it a day?

virtual audio cable

#

voicemeeter is lil tricky to route

quiet axle Jun 15, 2025, 9:49 AM

#

but is it better?

simple ore Jun 15, 2025, 9:49 AM

#

not really

#

vac is dumb simple for what it does

hallow thistle Jun 15, 2025, 9:51 AM

#

What does this mean? Although you can use index file in "regular RVC program", Applio for example, using index in W-Okada is still not really recommended anyway. As what I said.

quiet axle Jun 15, 2025, 9:52 AM

#

oh okay, i was reading the realism section of the docs and they recommend using voicemeeter

hallow thistle Jun 15, 2025, 9:53 AM

#

quiet axle do any of you guys use voicemeeter or just use VAC lite and call it a day?

Some might use Voicemeeter as a second virtual line to Virtual Audio Cable lite, but that's it. VAC lite is still recommended, I use this one.

quiet axle Jun 15, 2025, 9:54 AM

#

oh okayy thanks for the info

knotty moth Jun 15, 2025, 10:20 AM

#

quiet axle do any of you guys use voicemeeter or just use VAC lite and call it a day?

voicemeeter is only for audio routing, though there is lighthost as alternative

#

but ffs not the virtual cable

quiet axle Jun 15, 2025, 10:20 AM

#

Whats lighthost?

knotty moth Jun 15, 2025, 10:21 AM

#

the guide should have mentioned it

tawdry matrix Jun 15, 2025, 10:41 AM

#

yo i need help with a voice changer

hallow thistle Jun 15, 2025, 10:42 AM

#

tawdry matrix yo i need help with a voice changer

!howtoask

patent trellisBOT Jun 15, 2025, 10:42 AM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ :matsuripray:

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

brittle wing Jun 15, 2025, 12:48 PM

#

what f0 det do you use for tesla T4 gpu?

hallow thistle Jun 15, 2025, 12:49 PM

#

brittle wing what f0 det do you use for tesla T4 gpu?

NVIDIA T4 in which website? Is it Kaggle or Google Colab? And which notebook are you running?

#

Also, read the "How To Troubleshoot" above before you ask anything here.

brittle wing Jun 15, 2025, 12:51 PM

#

hallow thistle NVIDIA T4 in which website? Is it Kaggle or Google Colab? And which notebook are...

kaggle, im running https://www.kaggle.com/code/suneku/voice-changer-public

#

My current f0 works (rmvpe_onnx) but i was wondering if theres a better one

#

the guide doesnt say anything about t4 gpu

knotty moth Jun 15, 2025, 12:53 PM

#

brittle wing the guide doesnt say anything about t4 gpu

it is Nvidia gpu

hallow thistle Jun 15, 2025, 12:53 PM

#

brittle wing My current f0 works (rmvpe_onnx) but i was wondering if theres a better one

The F0 Det on W-Okada, if you're using NVIDIA GPU there, always select regular "rmvpe". The rmvpe is for NVIDIA GPU, while rmvpe_onnx is for non-NVIDIA GPU (AMD and Intel).

knotty moth Jun 15, 2025, 12:54 PM

#

brittle wing what f0 det do you use for tesla T4 gpu?

the performance is around RTX 3050 but with 16 GB vram

#

kaggle has dual gpu but it will use a single gpu anyway

brittle wing Jun 15, 2025, 12:55 PM

#

knotty moth it is Nvidia gpu

Oh I didn't know, thank you guys

#

I thought by "tesla" they meant like elon musks tesla and I was so confused

knotty moth Jun 15, 2025, 12:56 PM

#

brittle wing I thought by "tesla" they meant like elon musks tesla and I was so confused

it is Nvidia Tesla lineup

#

https://www.techpowerup.com/gpu-specs/tesla-t4.c3316

hallow thistle Jun 15, 2025, 12:58 PM

#

NVIDIA Tesla GPU and Tesla the company are two different things, the NVIDIA Tesla is now called NVIDIA Data Center GPU for newer GPUs within series.

hallow thistle Jun 15, 2025, 1:24 PM

#

Almost every RVC model has an index file alongside it, but some provide only the pth file. Not really that surprising. An index file stores the accent of that voice model. It can be created during voice model training in an RVC program.

knotty moth Jun 15, 2025, 1:26 PM

#

it's just whether it includes index file or not

#

it depends on the model maker

candid osprey Jun 15, 2025, 1:58 PM

#

403 ERROR
The request could not be satisfied. for links lol.

#

I tried vpn it doesn't work.

#

https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nvidia-b2332.zip

tawdry shore Jun 15, 2025, 2:42 PM

#

so i have the app on my phone can i still do the contest?

ancient fable Jun 15, 2025, 2:46 PM

#

WHat are good AI voice apps

hallow thistle Jun 15, 2025, 2:47 PM

#

ancient fable WHat are good AI voice apps

Is it pre-recorded audio converter or realtime voice changer? There are plenty of AI voice apps available. One of them being RVC.

ancient fable Jun 15, 2025, 2:48 PM

#

What are good RVCs?

hallow thistle Jun 15, 2025, 2:49 PM

#

ancient fable What are good RVCs?

karinthink

#

I use Weights.com mostly for fast-accessing AI cover. But there's Applio, which is available as locally (PC) and online.

#

RVC refers to AI programs that can do voice convert and voice model training, but as what I said, there are many different programs of it, which one of them being Applio.

simple ore Jun 15, 2025, 2:55 PM

#

candid osprey I tried vpn it doesn't work.

use a better vpn

brittle wing Jun 15, 2025, 3:44 PM

#

simple ore use a better vpn

I still have the same issue as yesterday. Would you mind if I DM you my drive folder with backups so you can see where the problem is?

#

NO DATA LEFT IN FILE...AGAIN

#

I CAN'T TRAIN MY MODEL

#

Should I start all over...

#

With another account

low shard Jun 15, 2025, 4:00 PM

#

candid osprey https://huggingface.co/Shadicti/deiteris-Fork/blob/main/voice-changer-windows-nv...

use your forum https://discord.com/channels/1159260121998827560/1383455966921752677

hallow thistle Jun 15, 2025, 4:20 PM

#

Do I need to explain it again?

#

cat_seriously

#

If you don't remember what I said earlier, let me say to you again. Using index in W-Okada is not recommended, as it will cause it to use more performance. While you can use index in regular RVC program, yes, but that's all.

analog obsidian Jun 15, 2025, 4:40 PM

#

the voice changer app is running a rvc model in realtime
rvc does not stand for realtime voice changer, they're two separate things; rvc originally only worked for local conversions and doesn't support realtime inside the webui

#

every rvc model is compatible with the .index files (yeah you can use any .index file with any model), although index files in realtime cause several issues and their usage in those conditions is not recommended, pick any .index file, set the index value to 0 and forget about its existence

analog obsidian Jun 15, 2025, 4:56 PM

#

i think w-okada forces you to select a index file but im not sure, regardless, setting the index value to 0 will disable the index

golden walrus Jun 15, 2025, 5:42 PM

#

Anyone know if KLM is good for real time? misc_smoke_cry

#

It sounds so good in these sample

#

pepe_cry

#

Spin is a breakthrough for me

analog obsidian Jun 15, 2025, 5:54 PM

#

golden walrus Anyone know if KLM is good for real time? misc_smoke_cry

og pretrain is better for speech

#

for spin i'd recommend noobies pretrain instead, but the grads are a bit high, not sure why

#

https://huggingface.co/Aznamir/spin/resolve/main/f0G32k_spin7-12_single.pth?download=true
https://huggingface.co/Aznamir/spin/resolve/main/f0D32k_spin7-12_single.pth?download=true

#

spin only ^ doesn't work with cvec

#

remember spin is still experimental, for a safer approach, use the original pretrain and contentvec

stone lynx Jun 15, 2025, 6:51 PM

#

do u guys know where i can find how to create ai voice guide

low shard Jun 15, 2025, 7:04 PM

#

stone lynx do u guys know where i can find how to create ai voice guide

what's ur pc gpu?

stone lynx Jun 15, 2025, 7:22 PM

#

low shard what's ur pc gpu?

rx 6700 xt

sonic night Jun 15, 2025, 7:26 PM

#

hi, i wanted to use rvc to put an audio file and then convert it to another audio file with an AI voice, idk how

low shard Jun 15, 2025, 7:54 PM

#

stone lynx rx 6700 xt

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio (AMD Windows) : A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline (AMD Linux/Windows) : The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible : weights.com
easiest cloud: Ilaria rvc zero
easiest local: Applio

low shard Jun 15, 2025, 7:54 PM

#

sonic night hi, i wanted to use rvc to put an audio file and then convert it to another audi...

what's your pc gpu?

brittle wing Jun 15, 2025, 8:15 PM

#

Ok

#

i made a environment folder inside coquis ai tts repo folder

#

and installed it

#

but how do i open it?

#

I don't see anything

#

Documentation on the github says to make a python script, but i wanted a GUI of the app

#

i don't wanna use the command console...

#

How do i open coquis ai tts?

sacred marten Jun 15, 2025, 8:27 PM

#

hi

#

i have a question

#

where do you put these voice models in?

#

like what software

low shard Jun 15, 2025, 8:38 PM

#

sacred marten where do you put these voice models in?

what's your pc gpu? what do you want to do?

simple ore Jun 15, 2025, 8:39 PM

#

brittle wing How do i open coquis ai tts?

there's a server mode you can use

#

latent cypress Jun 15, 2025, 8:43 PM

#

does rtx 4060 8gb good enough for training image models locally? weights queue is taking way too long

simple ore Jun 15, 2025, 8:54 PM

#

8gb in 2025 is a big mistake

#

you can probably train sd1.5 lora on it, not much else

median monolith Jun 15, 2025, 9:04 PM

#

is it normal that the "Start" cell on Applio Kaggle takes a lot of time to even give me the links? its been like 10+ minutes and it still says "cell execution is queued".

median monolith Jun 15, 2025, 9:22 PM

#

now +30 minutes

azure patio Jun 15, 2025, 9:41 PM

#

which model is made to separate backing vocals from vocals?

brittle wing Jun 15, 2025, 9:41 PM

#

simple ore 8gb in 2025 is a big mistake

Maybe my backup got messed up cause I tried to clone the main repository

brittle wing Jun 15, 2025, 9:42 PM

#

azure patio which model is made to separate backing vocals from vocals?

Mel and Becruily karaoke also DM me the song you want separated I can do it for you by using uvronline's backing vocals separator

azure patio Jun 15, 2025, 9:44 PM

#

brittle wing Mel and Becruily karaoke also DM me the song you want separated I can do it for ...

thank you, the thing is i have quite a lot because im getting the vocals to create a model

brittle wing Jun 15, 2025, 9:44 PM

#

azure patio thank you, the thing is i have quite a lot because im getting the vocals to crea...

Well you can use it yourself

azure patio Jun 15, 2025, 9:44 PM

#

brittle wing Mel and Becruily karaoke also DM me the song you want separated I can do it for ...

you mean this one?

brittle wing Jun 15, 2025, 9:44 PM

#

Well use that model on mvsep

azure patio Jun 15, 2025, 9:44 PM

#

cause i tried it and it gave just vocals and instrumentals(which were empty since i have only vocals already)

brittle wing Jun 15, 2025, 9:45 PM

#

Yes but use big beta 6x by unwa on the music source separation colab for Acapella first.

azure patio Jun 15, 2025, 9:45 PM

#

i have acapella already

brittle wing Jun 15, 2025, 9:45 PM

#

Then use this model on mvsep

#

Then use dereverb by anuew

#

After use uvr deecho

brittle wing Jun 15, 2025, 9:46 PM

#

azure patio i have acapella already

Use the model on the Acapella

brittle wing Jun 15, 2025, 9:46 PM

#

azure patio you mean this one?

Yes

azure patio Jun 15, 2025, 9:47 PM

#

thank you tt_happy

#

btw i did the acapella using the thing from kimberleyjsn, is that much different?

dire zinc Jun 15, 2025, 9:55 PM

#

Guys how can I sort launch a project

#

On discord

median monolith Jun 15, 2025, 10:09 PM

#

Im starting to get kinda insane, because idk wtf is wrong with my dataset for applio not being able to train it. the logs keep saying "Not enough data present in the training set. Perhaps you forgot to slice the audio files in preprocess?"

simple ore Jun 15, 2025, 10:16 PM

#

median monolith Im starting to get kinda insane, because idk wtf is wrong with my dataset for ap...

show the log of preprocess and extract features step

golden walrus Jun 15, 2025, 10:19 PM

#

analog obsidian remember spin is still experimental, for a more safer approach, use the original...

I mean, spin has better quality in real time in Vonovox. Also it can produce my language tone a bit better. cat_blush

#

Let me try Noobies' pretrain

median monolith Jun 15, 2025, 10:21 PM

#

simple ore show the log of preprocess and extract features step

analog obsidian Jun 15, 2025, 10:23 PM

#

golden walrus I mean, spin has better quality in real time in Vonovox. Also it can produce my ...

yeah spin is great cat_dance

simple ore Jun 15, 2025, 10:23 PM

#

median monolith

well, it failed to extract f0 and features

#

for some reason

#

i've tested noUI colab yesterday and it was fine

median monolith Jun 15, 2025, 10:27 PM

#

If this helps, these are the options I put.

median monolith Jun 15, 2025, 10:28 PM

#

simple ore well, it failed to extract f0 and features

the dataset/wav file is only 1:03 minutes long

simple ore Jun 15, 2025, 10:28 PM

#

1 minute of audio won't fit into any training buckets

#

who gave you this idea?

#

start again with a new model name and use simple slicing

#

make sure you get your ~30 files in extract features processed

median monolith Jun 15, 2025, 10:30 PM

#

simple ore

i mean, the dataset has little to no silence already, and so I thought that it was not necessary to cut it even more (leave it as it is)

simple ore Jun 15, 2025, 10:31 PM

#

you need to slice it, it has nothing to do with silences

median monolith Jun 15, 2025, 10:31 PM

#

i thought it was basically going to remove parts of the audio and make it even shorter if i choose any other option, my bad

simple ore Jun 15, 2025, 10:34 PM

#

automatic slicing does remove excessive silences, simple just shreds the file into digestible chunks

median monolith Jun 15, 2025, 10:50 PM

#

simple ore automatic slicing does remove excessive silences, simple just shreds the file in...

well, in that case, i wonder what the recommended values are that i should put for both "Chunk length (sec)" and "Overlap length (sec)" if i choose the simple option, for such a short dataset that at best has like a second of actual silence. maybe the default values are enough and will not "cut parts of the audio and make it lose content"?

simple ore Jun 15, 2025, 10:51 PM

#

use default 3/0.3

median monolith Jun 15, 2025, 10:52 PM

#

simple ore use default 3/0.3

alr, then the audio should be left intact (not lose/cut/remove content/information) 👍

simple ore Jun 15, 2025, 10:52 PM

#

if you have enough silence, actual 0 level silence, you can set mute files to 0

median monolith Jun 15, 2025, 10:55 PM

#

simple ore if you have enough silence, actual 0 level silence, you can set mute files to 0

i suppose the "silence" on the very start and very end should be that ?

opal depot Jun 15, 2025, 10:56 PM

#

so for kaggle w-okada, how do I have persistence for files apply without having to do save version every time? do I run in edit mode or

median monolith Jun 15, 2025, 11:26 PM

#

simple ore make sure you get your ~30 files in extract features processed

ok, so, i guess it did go better than last time, but the logs still say the not enough data thing, and instead of ~30 files, it gave me 22. same values, just changed the model name and the audio cutting to simple with default values.

simple ore Jun 15, 2025, 11:27 PM

#

what's the batch size?

median monolith Jun 15, 2025, 11:27 PM

#

simple ore what's the batch size?

4

simple ore Jun 15, 2025, 11:27 PM

#

60s/3 = 20 + 10% overlap

#

so 22 files is okay
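The estimate above (60s / 3s chunks, plus 10% for the 0.3s overlap) can be sketched roughly like this. It is only the back-of-envelope math from the message, not Applio's actual slicing code, and the function name `estimate_chunk_count` is made up for illustration:

```python
def estimate_chunk_count(duration_sec: float,
                         chunk_sec: float = 3.0,
                         overlap_sec: float = 0.3) -> int:
    """Back-of-envelope chunk estimate (hypothetical helper, not an Applio API).

    Mirrors the rough math in the chat: duration / chunk length,
    then increased by the overlap as a fraction of the chunk length.
    """
    base_chunks = duration_sec / chunk_sec      # 60 / 3 = 20
    overlap_fraction = overlap_sec / chunk_sec  # 0.3 / 3 = 10%
    return round(base_chunks * (1 + overlap_fraction))


# About a minute of audio with the default 3 / 0.3 settings
print(estimate_chunk_count(60))
```

With those defaults, a roughly one-minute file lands at about 22 chunks, which is in line with the 22 files reported; the real slicer may count edge chunks differently.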

#

what's in f0?

median monolith Jun 15, 2025, 11:29 PM

#

simple ore what's in f0?

simple ore Jun 15, 2025, 11:29 PM

#

is it kaggle or colab?

median monolith Jun 15, 2025, 11:29 PM

#

simple ore is it kaggle or colab?

kaggle

simple ore Jun 15, 2025, 11:29 PM

#

22/(4x2) < 3

#

try batch 2
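A small sketch of the check above, under two assumptions: the "x2" is read as the number of GPUs on Kaggle (the chat does not say so explicitly), and the minimum of 3 is simply taken from the "< 3" in the message. The names `batches_per_epoch` and `MIN_BATCHES` are made up for illustration and are not Applio settings:

```python
MIN_BATCHES = 3  # threshold taken from the "< 3" in the message (assumption)


def batches_per_epoch(num_files: int, batch_size: int, num_gpus: int = 2) -> float:
    """Files divided by the effective batch (batch size x assumed GPU count)."""
    return num_files / (batch_size * num_gpus)


# Compare the original batch size (4) with the suggested one (2) for 22 files
for batch_size in (4, 2):
    ratio = batches_per_epoch(22, batch_size)
    status = "ok" if ratio >= MIN_BATCHES else "too few"
    print(f"batch {batch_size}: {ratio} batches per epoch -> {status}")
```

For 22 files, batch 4 gives 22 / 8 = 2.75, which is under 3, while batch 2 gives 22 / 4 = 5.5, which clears it.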

median monolith Jun 15, 2025, 11:37 PM

#

simple ore try batch 2

alr, i guess it's finally going!

simple ore Jun 15, 2025, 11:40 PM

#

don't expect much of anything from just a minute of audio

median monolith Jun 15, 2025, 11:41 PM

#

ik lol

signal chasm Jun 16, 2025, 12:01 AM

#

Not sure why