low shard Mar 7, 2025, 1:56 PM

#

Is RefineGAN not stable enough yet?

simple ore Mar 7, 2025, 1:57 PM

#

well, it was great in small scale tests, but large tests not so much

low shard Mar 7, 2025, 1:59 PM

#

simple ore well, it was great in small scale tests, but large tests not so much

will applio keep it in the future or possibly remove it forever

simple ore Mar 7, 2025, 2:19 PM

#

i'm testing some changes

#

but that means yet another retrain

#

at this point it is no longer "refine" gan

jaunty iris Mar 7, 2025, 3:37 PM

#

so thats why i couldnt get it to work 😭

#

i was going insane earlier i reinstalled like 7 times

analog obsidian Mar 7, 2025, 3:43 PM

#

simple ore at this point it is no longer "refine" gan

sad to see refinegan go
but will you continue working on it?

old sigil Mar 7, 2025, 3:46 PM

#

hey all

simple ore Mar 7, 2025, 3:46 PM

#

jaunty iris so thats why i couldnt get it to work 😭

you can re-enable it by editing tabs/train/train.py

jaunty iris Mar 7, 2025, 3:46 PM

#

Oh ok

simple ore Mar 7, 2025, 3:47 PM

#

change this back to True

old sigil Mar 7, 2025, 3:47 PM

#

i am new in rvc and I want help to understand how to setup it on either local or collab?
can somebody guide me please

simple ore Mar 7, 2025, 3:47 PM

#

analog obsidian sad to see refinegan go but will you continue working on it?

I would not say it is gone gone

jaunty iris Mar 7, 2025, 3:47 PM

#

simple ore change this back to True

Thank u

simple ore Mar 7, 2025, 3:48 PM

#

but it no longer does what the original paper did

#

my current version is using interpolation and a parallel resblock, so it is more like hifigan that has solved the problem with horizontal lines at 4, 8 and 12KHz

old sigil Mar 7, 2025, 3:51 PM

#

simple ore but it no longer does what the original paper did

hey could you help me a little bit in giving a rough overview

#

?

simple ore Mar 7, 2025, 3:55 PM

#

???

old sigil Mar 7, 2025, 3:55 PM

#

I want to setup rvc and train a model with my voice

#

can you give me a rough idea on how to do this ?

simple ore Mar 7, 2025, 3:57 PM

#

why dont you look at https://docs.aihub.gg/essentials/how-to-make-voice-models/

How to Make Voice Models

In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.

old sigil Mar 7, 2025, 3:57 PM

#

thanks!

kindred eagle Mar 7, 2025, 4:15 PM

#

The original one made by rvc boss

low shard Mar 7, 2025, 4:33 PM

#

kindred eagle The original one made by rvc boss

that's commonly called mainline/original

#

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

I would suggest you applio for more updates

#

also, check the docs https://docs.aihub.gg for more info on vocal separation

fluid topaz Mar 7, 2025, 4:38 PM

#

I know rx but which one is vx? Kinda lost

low shard Mar 7, 2025, 4:38 PM

#

old sigil i am new in rvc and I want help to understand how to setup it on either local or...

what's ur pc gpu and what do u want to do

glacial pollen Mar 7, 2025, 4:38 PM

#

fluid topaz I know rx but which one is vx? Kinda lost

one sec

#

waves' Clarity vx - dereverb pro mono

#

( Reminder; it is for mono audio. So you take one channel and work on it (( as you should anyway )) )

fluid topaz Mar 7, 2025, 4:39 PM

#

glacial pollen waves' Clarity vx - dereverb pro mono

Oh cool thanks

glacial pollen Mar 7, 2025, 4:39 PM

#

✨

old sigil Mar 7, 2025, 4:45 PM

#

low shard what's ur pc gpu and what do u want to do

I have a mac m2 air
and I want to try voice changer real time

low shard Mar 7, 2025, 4:45 PM

#

old sigil I have a mac m2 air and I want to try voice changer real time

wrong channel then, rvc isn't for that

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

always elaborate your requests when asking, people can't know your pc nor what u want to do

#

@old sigil go to #🔍│help-w-okada

old sigil Mar 7, 2025, 4:46 PM

#

low shard RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on ...

How about this speech to speech conversions?
this has some latency?

low shard Mar 7, 2025, 4:47 PM

#

old sigil How about this speech to speech conversions? this has some latency?

rvc is just on pre-recorded audios, it's not meant for realtime, you need to use only wokada if you want realtime with rvc models

#

those are 2 different program names

#

rvc does not equal to "realtime voice changer"

old sigil Mar 7, 2025, 4:48 PM

#

low shard rvc is just on pre-recorded audios, it's not meant for realtime, you need to use...

so what I am getting is, for rvc
If i have pre-recorded audios and i could train a model out of it
can I use it for speech to speech conversion?

low shard Mar 7, 2025, 4:48 PM

#

there is a go realtime for the original/mainline rvc, but it's way worse optimized than original wokada, and way worse than wokada deiteris fork

low shard Mar 7, 2025, 4:49 PM

#

old sigil so what I am getting is, for rvc If i have pre-recorded audios and i could train...

RVC is a STS AI, you just said you wanted realtime voice changer for calls, not on pre-recorded audios

old sigil Mar 7, 2025, 4:49 PM

#

low shard RVC is a STS AI, you just said you wanted realtime voice changer for calls, not ...

ya but I was curious in knowing how rvc works

#

but Yeah!
thanks for your help

I'll have a look on wokada for my usecase

low shard Mar 7, 2025, 4:53 PM

#

old sigil ya but I was curious in knowing how rvc works

yeah it's speech to speech

brittle wing Mar 7, 2025, 5:32 PM

#

Yo guys,
I’m tweaking the Batch Size setting and not sure what to pick. 4 = better accuracy but slower, 8 = faster but "standard".
Does 4 actually improve sound quality, or is 8 just as good?
Would appreciate a simple explanation!

oak sandal Mar 7, 2025, 5:36 PM

#

i used to do up to 20 batch size

#

i think theres barely any difference between 4 and 8 so if you have the power to spare just do 8

oak sandal Mar 7, 2025, 5:37 PM

#

oak sandal i used to do up to 20 batch size

if we are still on RVC v2 training on mangio-crepe that is

low shard Mar 7, 2025, 5:37 PM

#

brittle wing Yo guys, I’m tweaking the Batch Size setting and not sure what to pick. 4 = bett...

check https://docs.aihub.gg/rvc/resources/training/#batch-size

it depends on your gpu and dataset length

low shard Mar 7, 2025, 5:37 PM

#

oak sandal i used to do up to 20 batch size

till 20 seems too much

low shard Mar 7, 2025, 5:38 PM

#

oak sandal if we are still on RVC v2 training on mangio-crepe that is

you don't use the mangio fork still, right?

#

because that's old since 2023

oak sandal Mar 7, 2025, 5:39 PM

#

low shard till 20 seems too much

do you think i would care with an SLI of gpu's that are worth like an used BMW?

#

once you get to cloud training you never go back

oak sandal Mar 7, 2025, 5:39 PM

#

low shard you don't use the mangio fork still, right?

Applio

#

also yea i'd rather use the old mangio fork even in 2025

#

Still crepe and still rmvpe, i see we dont have a new extraction method yet after all this time

#

the "embedder" is new to me tho

#

iirc it was by default on chinese hubert

analog obsidian Mar 7, 2025, 5:41 PM

#

brittle wing Yo guys, I’m tweaking the Batch Size setting and not sure what to pick. 4 = bett...

if your dataset is below 30 minutes, use batch size 4
if your dataset is above 30 minutes use batch size 8
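
as a rough sketch of that rule of thumb (the function name is made up, the exact-30-minute case isn't covered above so it falls to 8 here, and your GPU's VRAM still caps what you can actually run):

```python
def suggest_batch_size(dataset_minutes: float) -> int:
    """Rule of thumb from above: 4 below 30 minutes of audio, 8 otherwise."""
    # Assumption: a dataset of exactly 30 minutes is treated as "above".
    return 4 if dataset_minutes < 30 else 8


print(suggest_batch_size(12))  # short dataset
print(suggest_batch_size(45))  # longer dataset
```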

oak sandal Mar 7, 2025, 5:41 PM

#

now we got.. contentvec?

low shard Mar 7, 2025, 5:41 PM

#

oak sandal do you think i would care with an SLI of gpu's that are worth like an used BMW?

you don't need multiple gpus to train RVC models

oak sandal Mar 7, 2025, 5:41 PM

#

low shard you don't need multiple gpus to train RVC models

ofc you don't, you can manage aswell with a tiny lil 3060 12gb

analog obsidian Mar 7, 2025, 5:41 PM

#

oak sandal iirc it was by default on chinese hubert

no

oak sandal Mar 7, 2025, 5:41 PM

#

but when your dataset is 2 hours long

analog obsidian Mar 7, 2025, 5:42 PM

#

rvc always used contentvec

#

hubert = contentvec

#

same thing

low shard Mar 7, 2025, 5:42 PM

#

oak sandal once you get to cloud training you never go back

no, local training is better but it depends on your setup, cloud isn't as much stable and you can see that in #📰│dev-updates , especially google colab for training kinda sucks

oak sandal Mar 7, 2025, 5:42 PM

#

not google colab

#

actual cloud servers with RTX server gpu's

#

lmao

low shard Mar 7, 2025, 5:43 PM

#

oak sandal also yea i'd rather use the old mangio fork even in 2025

mangio rvc fork is pretty old and not maintained, I don't get why you would say that when Applio got more improvements

oak sandal Mar 7, 2025, 5:43 PM

#

i used colab only like 3 or 4 times when RVC wasnt a thing and SVC is all we had

analog obsidian Mar 7, 2025, 5:43 PM

#

oak sandal also yea i'd rather use the old mangio fork even in 2025

mangio fork uses extremely outdated code, everything is wrong there

low shard Mar 7, 2025, 5:43 PM

#

analog obsidian mangio fork uses extremely outdated code, everything is wrong there

^^^

analog obsidian Mar 7, 2025, 5:43 PM

#

even the logging is bugged

oak sandal Mar 7, 2025, 5:43 PM

#

i made great models even with the outdated code and everything being wrong + bugged logging in 2023

#

i seriously doubt there was much improvement since then

analog obsidian Mar 7, 2025, 5:44 PM

#

because it wasnt outdated in 2023

#

mangio stopped receiving updates around that time

low shard Mar 7, 2025, 5:44 PM

#

oak sandal Still crepe and still rmvpe, i see we dont have a new extraction method yet afte...

there's other enhancements in the code, it's not just that

analog obsidian Mar 7, 2025, 5:44 PM

#

it was on par with mainline back then

#

now its even behind mainline

#

and mainline is also extremely outdated

oak sandal Mar 7, 2025, 5:45 PM

#

i would love hearing audio difference between 2023 mangio-crepe and nowadays RMVPE on applio or whatever yall prefer now 👀

analog obsidian Mar 7, 2025, 5:45 PM

#

rmvpe its just a f0 estimation

#

is not a quality thing

oak sandal Mar 7, 2025, 5:45 PM

#

it had pitch issues

low shard Mar 7, 2025, 5:45 PM

#

oak sandal ofc you don't, you can manage aswell with a tiny lil 3060 12gb

you can even train on worse, it's just not suggested for not wasting time

analog obsidian Mar 7, 2025, 5:45 PM

#

rmvpe has been always the same

#

🥹

oak sandal Mar 7, 2025, 5:46 PM

#

not RMVPE itself

#

the implementation

#

it was a mess

oak sandal Mar 7, 2025, 5:46 PM

#

low shard you can even train on worse, it's just not suggested for not wasting time

yea and thats why i always used Runpod or other paid services

unkempt sapphire Mar 7, 2025, 5:50 PM

#

does sup1 even work

#

only sup2 is good

#

and it sounds so ass with sup2 on

low shard Mar 7, 2025, 5:50 PM

#

unkempt sapphire does sup1 even work

wrong channel, use #🔍│help-w-okada

unkempt sapphire Mar 7, 2025, 5:50 PM

#

oops

oak sandal Mar 7, 2025, 5:50 PM

#

bro i legit checked on Weights.gg, my old model trained on mangio's fork is still magnitudes better than the newer models

low shard Mar 7, 2025, 5:50 PM

#

unkempt sapphire oops

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

oak sandal Mar 7, 2025, 5:50 PM

#

😭

#

most of the work is separating the vocals, not using the right version of RVC

#

im p sure i wouldn't be able to replicate that level of clean vocals again, kim vocal 1 was goated back then

low shard Mar 7, 2025, 5:55 PM

#

oak sandal im p sure i wouldn't be able to replicate that level of clean vocals again, kim ...

kim vocal is pretty old, there's better models now

low shard Mar 7, 2025, 5:55 PM

#

oak sandal bro i legit checked on Weights.gg, my old model trained on mangio's fork is stil...

you can't know how good the person trained the model nor how

oak sandal Mar 7, 2025, 5:56 PM

#

low shard you can't know how good the person trained the model nor how

read what i wrote above

oak sandal Mar 7, 2025, 5:56 PM

#

oak sandal most of the work is separating the vocals, not using the right version of RVC

.

#

i used multiple models on FLACs files, then manually fixed issues in FL Studio + izotope rx

#

most people just download random mp3 128kbps acapellas from youtube and be done with it

#

everyone else that tried to make an "updated" version of my model failed miserably, the audio that gets generated is a mess, both in sound quality and voice similarity

crude flame Mar 7, 2025, 5:59 PM

#

oak sandal every other people that tried to make an "updated" version of my model failed mi...

what is your model?

oak sandal Mar 7, 2025, 5:59 PM

#

crude flame what is your model?

Sfera Ebbasta

#

this one

crude flame Mar 7, 2025, 6:00 PM

#

oh a singer

no wonder people mess it up

oak sandal Mar 7, 2025, 6:00 PM

#

then how come i didn't mess it up?

#

🤔

crude flame Mar 7, 2025, 6:00 PM

#

you prob used studio sessions

oak sandal Mar 7, 2025, 6:00 PM

#

no

crude flame Mar 7, 2025, 6:00 PM

#

you tried cleaning the dataset

oak sandal Mar 7, 2025, 6:01 PM

#

no studio session, all manual labour and AI

#

from the song themselves

#

i just downloaded the whole album in FLAC quality

#

then one by one, minute by minute, cleaned the dataset

analog obsidian Mar 7, 2025, 6:02 PM

#

crude flame you tried cleaning the dataset

its not

crude flame Mar 7, 2025, 6:02 PM

#

oh

#

wow

analog obsidian Mar 7, 2025, 6:02 PM

#

YUM

low shard Mar 7, 2025, 6:03 PM

#

oak sandal most of the work is separating the vocals, not using the right version of RVC

also using an updated version helps, applio got more enhancements like the benchmark flag #🔊│ai-development message which is turned on in the main branch

but yeah ofcourse it depends much on how people clean the dataset

oak sandal Mar 7, 2025, 6:05 PM

#

ok so, after 2 years the advancements were... better GUI and slight code optimization?

#

what happened to that one dude who was developing RVC "v3"

low shard Mar 7, 2025, 6:07 PM

#

oak sandal what happened to that one dude who was developing RVC "v3"

rvc boss left rvc to rot, he's working on gpt so vits and recently released the v3

low shard Mar 7, 2025, 6:07 PM

#

oak sandal ok so, after 2 years the advancements were... better GUI and slight code optimiz...

there are also newer things, like the refinegan vocoder on the main branch being experimental rn

#

idk why ur so attached to mangio fork

oak sandal Mar 7, 2025, 6:07 PM

#

i think last time they were trying vocos

#

iirc that was the name

oak sandal Mar 7, 2025, 6:08 PM

#

low shard idk why ur so attached to mangio fork

i'm not, we just didn't get any decent improvement since 2023

#

atleast for RVC

low shard Mar 7, 2025, 6:09 PM

#

oak sandal i'm not, we just didn't get any decent improvement since 2023

we did get some improvements with applio, it's just the original dev now works on other projects than RVC, so it's not an RVC v3 but more experimenting

#

rvc boss works basically mostly on https://github.com/RVC-Boss/GPT-SoVITS

GitHub

GitHub - RVC-Boss/GPT-SoVITS: 1 min voice data can also be used to ...

1 min voice data can also be used to train a good TTS model! (few shot voice cloning) - RVC-Boss/GPT-SoVITS

oak sandal Mar 7, 2025, 6:12 PM

#

which is probably best for speaking only

low shard Mar 7, 2025, 6:12 PM

#

ofcourse it's not an entirely new type of architecture, but i would rather use code that gets updates and improvements than one that doesn't get any at all

oak sandal Mar 7, 2025, 6:12 PM

#

i dont think we are gonna see any decent or sizable improvements in the next year

#

maybe 2027

glacial pollen Mar 7, 2025, 6:36 PM

#

oak sandal now we got.. contentvec?

It always was contentvec, not hubert

#

As for f0 extractors.. yeah, there's nothing really better as of now, afaik

#

As for vocos, it is not worth it.
Quite tricky to get properly working and tbf, potential phase reconstruction issues aren't worth it, similarly stft and I believe istft vocoders

simple ore Mar 7, 2025, 7:05 PM

#

most generators are only useful for mel to wav reconstruction, not encoded latents to wav

#

hifigan uses NN filters to predict the waveform from the encoded latents

#

vocos does not have enough capacity to predict

tough fiber Mar 7, 2025, 7:22 PM

#

-kaggle

karmic oliveBOT Mar 7, 2025, 7:22 PM

#

tough fiber -kaggle

Suggestions for @distant turtle

📘 Kaggle Notebooks

Applio Notebook, by Vidal Kaggle
Applio Notebook, by Shirou Kaggle
Music Source Separation, by Shirou Kaggle
UVR5 NO UI, by Eddy Kaggle
Original W-Okada's Voice Changer, Kaggle
Modified W-Okada's Voice Changer, Kaggle
🆕 UVR5 UI, by Eddy, ArisDev & Nick088 Kaggle
🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
📖 How to use RVC Mainline on Kaggle by Cauthess

Note: Kaggle limits GPU usage to 30 hours per week.

low shard Mar 7, 2025, 7:33 PM

#

oak sandal i dont think we are gonna see any decent or sizable improvements in the next yea...

I hope we get them asap

unique rock Mar 7, 2025, 10:05 PM

#

Can someone tell me why I get this in Applio Voice Blend?

model_blender(model_name, pth_path_1, pth_path_2, ratio)
ValueError: too many values to unpack (expected 2)

I tried to blend a 200 epoch and 48k sample rate model with a 270 epoch model the same sample rate

simple ore Mar 7, 2025, 10:34 PM

#

unique rock Can someone tell me why I get this in Applio Voice Blend? model_blender(model_...

if both models were made in Applio and both at the same sampling rate, then there should be no issues with merging

#

there may be an issue if the models came from different sources
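
for reference, that ValueError is plain Python: something handed back more values than the code tried to unpack. A tiny sketch with a made-up helper (this is NOT Applio's actual `model_blender` code, just the same kind of error):

```python
def describe_unpack(items):
    """Try to unpack items into exactly two names; return the error text, or None."""
    try:
        first, second = items  # only works when items holds exactly two values
    except ValueError as err:
        return str(err)
    return None


print(describe_unpack(("a", "b")))       # two values: unpacks fine, prints None
print(describe_unpack(("a", "b", "c")))  # three values: message starts with "too many values to unpack"
```

so one of the two .pth files probably has a layout the blender did not expect, which would fit a model that came from a different source.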

icy vessel Mar 7, 2025, 11:04 PM

#

what pretrain does the applio colab use by default ?

low shard Mar 7, 2025, 11:08 PM

#

icy vessel what pretrain does the applio colab use by default ?

The original/OG pretrain

icy vessel Mar 7, 2025, 11:09 PM

#

low shard The original/OG pretrain

ok thanks

deft flare Mar 7, 2025, 11:13 PM

#

Hi everyone I hope I'm texting the right chat.. I have had to cancel kits.ai due to being ridiculously expensive.. anyone knows good complete walkthrough tutorial how to make a model from scratch? kits would sort everything for me so I feel kinda lost

low shard Mar 7, 2025, 11:16 PM

#

deft flare Hi everyone I hope I'm texting the right chat.. I have had to cancel kits.ai due...

Kits.ai just uses RVC but in a simplified User Interface

#

First of all what's your PC GPU

deft flare Mar 7, 2025, 11:17 PM

#

macbook pro m1

low shard Mar 7, 2025, 11:17 PM

#

deft flare macbook pro m1

Oh, macs ain't really good for AI

deft flare Mar 7, 2025, 11:17 PM

#

oh?

#

i'll bite the bullet i guess

low shard Mar 7, 2025, 11:19 PM

#

deft flare oh?

Train (make) RVC Models on cloud:

Prepare the Dataset
Setup RVC:
Choose a cloud way to use RVC,

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.com: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio (ui)

deft flare Mar 7, 2025, 11:19 PM

#

so I want to train after this musician.. perhaps the whole discography not faring more than 30-40 minutes total

low shard Mar 7, 2025, 11:19 PM

#

If you want, you could try locally (runs on your Mac), https://docs.applio.org/applio/getting-started/installation , but no one has ever reported training a model successfully on Mac since it's kinda slow

Applio - Installation

Applio is easy to install. We recommend the precompiled version for new users as it's ready to use.

deft flare Mar 7, 2025, 11:19 PM

#

I actually downloaded this earlier this evening but it's all so overwhelming

#

i have uvr5 as well just don't know whether there's an ubiquitous setting to extract clearest lead vocals

#

if you could guide me step by step mate, I'd paypal you or something if that suits you

low shard Mar 7, 2025, 11:23 PM

#

deft flare i have uvr5 as well just don't know whether there's an ubiquitous setting to ext...

I'm not even sure if that supports Mac at all

deft flare Mar 7, 2025, 11:23 PM

#

uvr5 seems to work when I open it haha

low shard Mar 7, 2025, 11:24 PM

#

Hopefully it runs on MPS using the integrated m1 pro chip rather than the CPU

#

Else it's gonna be even slower
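
If you want to see what PyTorch itself can use on that machine, here's a minimal sketch (assuming you run it with the same Python environment the program uses; it only tells you what PyTorch sees, not whether UVR actually picks it):

```python
def pick_device() -> str:
    """Return "cuda", "mps", or "cpu" depending on what PyTorch reports."""
    try:
        import torch
    except ImportError:
        return "cpu"  # no PyTorch in this environment, so nothing to accelerate
    if torch.cuda.is_available():
        return "cuda"  # Nvidia GPU
    mps = getattr(torch.backends, "mps", None)  # missing on older PyTorch builds
    if mps is not None and mps.is_available():
        return "mps"  # Apple Silicon GPU through Metal
    return "cpu"


print(pick_device())
```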

deft flare Mar 7, 2025, 11:24 PM

#

right

#

(pretends to understand)

low shard Mar 7, 2025, 11:25 PM

#

@viscid moss hey sorry to disturb, do you remember if UVR supported MPS for Macs, or does your version of it support it?

deft flare Mar 7, 2025, 11:25 PM

#

low shard @viscid moss hey sorry to disturb, do you remember if UVR supported MPS...

ur so kind thank u ❤️

low shard Mar 7, 2025, 11:27 PM

#

deft flare ur so kind thank u ❤️

I was asking the other staffer if he remembers if that program supports it, or if his own version supports it since he made a separate own version of UVR by the way

deft flare Mar 7, 2025, 11:28 PM

#

yeah I gotchu, I was just moved how there's actually someone being understanding towards complete n00b

#

lost kind of kindness on the internet for most part

low shard Mar 7, 2025, 11:28 PM

#

Unfortunately unless you spend like 4k dollars, macs are pretty shit for AI

#

And even if you spend that much and get the most powerful Mac made for user consumption, the issue is not many AI programs support Mac at all

deft flare Mar 7, 2025, 11:29 PM

#

when I was experimenting with google collab stuff it seemed to do decent like?

low shard Mar 7, 2025, 11:29 PM

#

deft flare when I was experimenting with google collab stuff it seemed to do decent like?

Yeah that's because it doesn't run on your Mac, it runs on a cloud server (remote good PC) that use Nvidia GPUs

#

Nvidia GPUs are the best in terms of performance and support in AI field

deft flare Mar 7, 2025, 11:30 PM

#

Ahh

low shard Mar 7, 2025, 11:30 PM

#

low shard Yeah that's because it doesn't run on your Mac, it runs on a cloud server (remot...

I also sent you cloud links before

#

Did you install applio locally, or did you just use a Google colab? And this for UVR too?

deft flare Mar 7, 2025, 11:30 PM

#

both locally

low shard Mar 7, 2025, 11:32 PM

#

deft flare both locally

Applio should have Mac support, hopefully uvr does too

#

Tbh, I don't know how much I could suggest you to use them locally, At this point cloud would be faster than your Mac, but the only issue is limited GPU time

deft flare Mar 7, 2025, 11:34 PM

#

I can leave it overnight or w/e not to worry about that really ❤️ guidance is pivotal for me

low shard Mar 7, 2025, 11:36 PM

#

deft flare I can leave it overnight or w/e not to worry about that really ❤️ guidance is pi...

Be sure to watch out if it overheats or anything though

#

I would personally suggest you to use cloud for faster processing, but your choice

#

I don't know how much time it could take to be honest, not sure if it's going to be overnight or more or less, so I can't guarantee you much on that

viscid moss Mar 7, 2025, 11:44 PM

#

low shard @viscid moss hey sorry to disturb, do you remember if UVR supported MPS...

I think it has MPS support, cause they made releases for Mac. UVR5 UI doesn't have Mac support (there are no installation files + idk if it works correctly cause I don't have a Mac to test) but audio-separator (UVR5 UI core) has MPS support.

#

UVR5 UI probably works with MPS but I haven't tested it because I don't have a way to test it.

#

So I recommend just use UVR5 for mac

low shard Mar 7, 2025, 11:47 PM

#

viscid moss UVR5 UI probably works with MPS but I haven't tested it because I don't have a w...

https://github.com/Nick088Official/Minecraft_Skin_Generator/blob/main/Scripts%2Fminecraft-skins-sdxl.py#L137 maybe this can help

GitHub

Minecraft_Skin_Generator/Scripts/minecraft-skins-sdxl.py at main · ...

Generates Minecraft skins with a text prompt using the HuggingFace "monadical-labs/minecraft-skin-generator" model. - Nick088Official/Minecraft_Skin_Generator

low shard Mar 7, 2025, 11:47 PM

#

viscid moss I think it has MPS support, cause they made releases for Mac. UVR5 UI doesn't ha...

Alright thx for letting me know

viscid moss Mar 7, 2025, 11:50 PM

#

low shard https://github.com/Nick088Official/Minecraft_Skin_Generator/blob/main/Scripts%2F...

that's already built in, just need to test xd. I'll try in a while using a VM maybe, to at least make the installation work. atm I'm busy with irl work

low shard Mar 8, 2025, 12:10 AM

#

viscid moss that's already built in, just need to test xd. I'll try in a while using a VM ma...

Alright goodluck

rapid spade Mar 8, 2025, 2:37 AM

#

is there a guide on how to download the software needed for ai stuff

dire juniper Mar 8, 2025, 3:34 AM

#

anyone know why i cant upload a rvc model to my voice changer ?

karmic flax Mar 8, 2025, 4:41 AM

#

i get alot of errors when installing RVC with the TroubleChute one line command. and im too stupid to install it manually. i used RVC on my old pc (win10) before and now on win11 everything just wont work. it opens the website but it wont ever finish converting

hallow thistle Mar 8, 2025, 4:42 AM

#

karmic flax i get alot of errors when installing RVC with the TroubleChute one line command....

Which RVC program are you using?

karmic flax Mar 8, 2025, 4:42 AM

#

Retrieval-based-Voice-Conversion-WebUI

#

if that wasnt the correct answer im sorry lol im not rlly deep into software stuff..

hallow thistle Mar 8, 2025, 4:42 AM

#

It would be better to use Applio (an RVC fork) instead.

#

What is your PC GPU?

karmic flax Mar 8, 2025, 4:43 AM

#

RTX 5080

hallow thistle Mar 8, 2025, 4:45 AM

#

Damn. Unfortunately, there's no known stable version of Applio for this specific GPU. But I think you can use Applio with CPU instead.

brittle wing Mar 8, 2025, 4:45 AM

#

i want a model that sounds realistic please, i dont mind the size of the file

karmic flax Mar 8, 2025, 4:45 AM

#

hallow thistle Damn. Unfortunately, there's no known stable version of Applio for this specific...

hm i have the 9800x3D im sure it will be enough right?

brittle wing Mar 8, 2025, 4:45 AM

#

karmic flax hm i have the 9800x3D im sure it will be enough right?

smh bros flexing

karmic flax Mar 8, 2025, 4:45 AM

#

no im wondering about stuff cuz i dont have a clue lol

brittle wing Mar 8, 2025, 4:46 AM

#

that's like one of the most powerful if not the most powerful cpu rn

#

ik cuz i have one myself

karmic flax Mar 8, 2025, 4:46 AM

#

yeah for gaming but for AI stuff idk?

brittle wing Mar 8, 2025, 4:47 AM

#

youre thinking its like nvidia vs amd but its different from gpus

karmic flax Mar 8, 2025, 4:47 AM

#

i just heard for productivity stuff core count is more important but ig not

hallow thistle Mar 8, 2025, 4:47 AM

#

karmic flax hm i have the 9800x3D im sure it will be enough right?

I'm not sure about this one, but yeah just like RTX 50 series, there doesn't seem to be any version of Applio made for this specific AMD GPU.

brittle wing Mar 8, 2025, 4:47 AM

#

what do you want to do with it

hallow thistle Mar 8, 2025, 4:48 AM

#

brittle wing smh bros flexing

Are you saying you hate people flexing their stuffs or something?

brittle wing Mar 8, 2025, 4:48 AM

#

hallow thistle Are you saying you hate people flexing their stuffs or something?

no?

#

i flex my 7900xtx sometimes

#

flexing is ok as long as it does not include lying imo

hallow thistle Mar 8, 2025, 4:50 AM

#

Imagine thinking "my laptop GPU is Intel HD Graphics 3000" is a lie. misc_skull_distorted

brittle wing Mar 8, 2025, 4:52 AM

#

yea imagine

simple ore Mar 8, 2025, 4:52 AM

#

hallow thistle I'm not sure about this one, but yeah just like RTX 50 series, there doesn't see...

9800 is cpu

brittle wing Mar 8, 2025, 4:53 AM

#

imagine imagining stuff that isn't true and just assuming it over the conversation

simple ore Mar 8, 2025, 4:53 AM

#

karmic flax RTX 5080

at the moment there's no proper torch build for 5000 series

#

but you can install something for applio

karmic flax Mar 8, 2025, 4:53 AM

#

so the issue is my gpu? dang lol

hallow thistle Mar 8, 2025, 4:54 AM

#

simple ore 9800 is cpu

I've mistaken 9xxx for AMD RX GPU. misc_cry

simple ore Mar 8, 2025, 4:54 AM

#

you can download applio compiled version, then update torch to cu128

#

manually

karmic flax Mar 8, 2025, 4:54 AM

#

would the best bet be to use Applio? i have it installed but i dont rlly get it haha

simple ore Mar 8, 2025, 4:54 AM

#

actually not compiled, clone the repo

#

then run the installer

#

hopefully it does not error out

#

then update torch

karmic flax Mar 8, 2025, 4:55 AM

#

skull_sob "just do it" already confused lol

simple ore Mar 8, 2025, 4:55 AM

#

https://huggingface.co/w-e-w/torch-2.6.0-cu128.nv

#

env\python -m pip install that_wheel_file

#

https://huggingface.co/w-e-w/torch-2.6.0-cu128.nv/resolve/main/torch-2.6.0+cu128.nv-cp310-cp310-win_amd64.whl

hallow thistle Mar 8, 2025, 4:57 AM

#

Applio is the only way to get converted audio done very fast with proper GPU. With RTX 50 GPU, you'll have to do some code a bit.

karmic flax Mar 8, 2025, 4:57 AM

#

saddies. so there is no way for me lol

hallow thistle Mar 8, 2025, 4:58 AM

#

You can use any other RVC program, but all of them will only use your PC CPU because none of them have been compiled for RTX 50.

karmic flax Mar 8, 2025, 4:58 AM

#

i mean i wouldnt care what it uses as long if it just works

hallow thistle Mar 8, 2025, 4:59 AM

#

That's all good.

karmic flax Mar 8, 2025, 4:59 AM

#

i wont be training my own models and stuff. i just need stuff to be converted into voices via pretrained models

simple ore Mar 8, 2025, 5:00 AM

#

hallow thistle Applio is the only way to get converted audio done very fast with proper GPU. Wi...

no code, just replace torch version with update that supports 5000 series
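
once the wheel is in, a quick way to check whether the installed torch build covers the card. A minimal sketch, assuming the 5000 series reports compute capability 12.0 (so `sm_120`); the helper names are made up, and a build can sometimes still run through PTX even when the exact `sm_` entry is missing:

```python
def build_supports(arch_list, capability):
    """True if the build's arch list names the GPU's compute capability."""
    major, minor = capability
    return f"sm_{major}{minor}" in arch_list


def check_installed_torch():
    """Describe whether the torch in this environment was built for GPU 0."""
    try:
        import torch
    except ImportError:
        return "torch is not installed in this environment"
    if not torch.cuda.is_available():
        return f"torch {torch.__version__}: no usable CUDA device"
    cap = torch.cuda.get_device_capability(0)
    ok = build_supports(torch.cuda.get_arch_list(), cap)
    return f"torch {torch.__version__}: capability {cap} supported={ok}"


print(check_installed_torch())
```

save it as e.g. check_torch.py in the Applio folder and run it with ``env\python check_torch.py`` from command prompt.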

karmic flax Mar 8, 2025, 5:00 AM

#

but hooow

#

u linked me a file i cant even do anything with lol

#

cuz tf is a .whl file

simple ore Mar 8, 2025, 5:00 AM

#

clone the repository

#

run installer

karmic flax Mar 8, 2025, 5:00 AM

#

simple ore clone the repository

thats the first thing im failing on lol

simple ore Mar 8, 2025, 5:00 AM

#

once that is done run the command I provided from command prompt

#

download zip

karmic flax Mar 8, 2025, 5:01 AM

#

step one completed successfully lol

#

then ig run-install.bat

#

then running ur command in a terminal with admin perms

#

module "env" couldnt be loaded

karmic flax Mar 8, 2025, 5:22 AM

#

misc_cry

karmic flax Mar 8, 2025, 5:29 AM

#

simple ore ``env\python -m pip install that_wheel_file``

it just throws me an error when trying that

simple ore Mar 8, 2025, 5:33 AM

#

karmic flax it just throws me an error when trying that

karmic flax Mar 8, 2025, 5:33 AM

#

but how do i even get there

simple ore Mar 8, 2025, 5:33 AM

#

open command prompt

karmic flax Mar 8, 2025, 5:34 AM

#

yeah

simple ore Mar 8, 2025, 5:34 AM

#

you need to download the wheel into the same folder

karmic flax Mar 8, 2025, 5:35 AM

#

okay did that

#

dang i did it. im a hacker. lets hope it works lol

#

greatly appreciate the help tho 🫂

simple ore Mar 8, 2025, 5:36 AM

#

FCPE won't work until there's an updated torchaudio

#

but hopefully the rest would

karmic flax Mar 8, 2025, 5:38 AM

#

hm now it doesnt start up anymore lol

simple ore Mar 8, 2025, 5:47 AM

#

give it a bit

karmic flax Mar 8, 2025, 5:47 AM

#

it throws an error that it doesnt find smt

karmic flax Mar 8, 2025, 6:06 AM

#

ig "env/lib/site-packages/torchaudio/lib/libtorchaudio.pyd" wasnt found lol

#

ig i need to do the same with libtorchaudio but idk what version

carmine shuttle Mar 8, 2025, 6:49 AM

#

Guys, is it normal for a zipped voice model to weigh 259 MB?

karmic flax Mar 8, 2025, 7:17 AM

#

id still appreciate help with getting any voice conversion to work on my gpu. cirnoblush

brittle wing Mar 8, 2025, 7:21 AM

#

Anyone know the python version requirement if I want to run this version of UVR5 locally?

https://huggingface.co/spaces/TheStinger/UVR5_UI/tree/main

I cloned the repo and tried to install the requirements with the latest Python 3.13.2, but it failed:

ERROR: Could not find a version that satisfies the requirement torch<2.5,>=2.3 (from audio-separator) (from versions: 2.5.0, 2.5.1, 2.6.0)
ERROR: No matching distribution found for torch<2.5,>=2.3

hallow thistle Mar 8, 2025, 7:24 AM

#

brittle wing Anyone know the python version requirement if I want to run this version of UVR5...

Do not install any Python-based program using the very latest version of Python. Use a version like Python 3.10.x or 3.11.x.
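The pip error above can be read directly: on Python 3.13 pip only found torch 2.5.0, 2.5.1 and 2.6.0, while audio-separator asked for `torch<2.5,>=2.3`, so nothing matched. A small sketch of that comparison, plus a check for the 3.10/3.11 range suggested here (the helper names are illustrative, and the version parsing only handles plain `X.Y.Z` strings):

```python
import sys


def parse_version(text: str) -> tuple:
    """Turn a plain 'X.Y.Z' string into a tuple of ints for comparison."""
    return tuple(int(part) for part in text.split("."))


def satisfies(version: str, minimum: str, below: str) -> bool:
    """True if minimum <= version < below."""
    return parse_version(minimum) <= parse_version(version) < parse_version(below)


# Versions pip reported as installable on Python 3.13 (copied from the error).
available = ["2.5.0", "2.5.1", "2.6.0"]
# audio-separator asked for torch>=2.3,<2.5.
matching = [v for v in available if satisfies(v, "2.3", "2.5")]
print(matching)  # empty list: nothing pip can see satisfies the requirement


def python_is_recommended(version_info=sys.version_info) -> bool:
    """True for the 3.10.x / 3.11.x range suggested in the chat."""
    return (3, 10) <= tuple(version_info[:2]) <= (3, 11)


print(python_is_recommended())
```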

simple ore Mar 8, 2025, 8:52 AM

#

karmic flax ig "env/lib/site-packages/torchaudio/lib/libtorchaudio.pyd" wasnt found lol

can you show the full error stack?

simple ore Mar 8, 2025, 8:53 AM

#

brittle wing Anyone know the python version requirement if I want to run this version of UVR5...

you may need to downgrade python to 3.10

knotty moth Mar 8, 2025, 9:25 AM

#

hallow thistle I've mistaken 9xxx for AMD RX GPU. misc_cry

there's only 9070 & XT

knotty moth Mar 8, 2025, 9:39 AM

#

brittle wing Anyone know the python version requirement if I want to run this version of UVR5...

the alternatives are Anjok's UVR with the latest patch and ZFTurbo's MSST repository, which works on Python 3.10-3.11

GitHub: ZFTurbo/Music-Source-Separation-Training (repository for training models for music source separation)

glass igloo Mar 8, 2025, 10:00 AM

#

Hello. I've encountered an issue while creating an index file. I have approximately 7 hours of audio for voice training, and the training process went smoothly. However, when trying to create the index file, the process stops at around 1,300 files out of 48,000, and then an index file is generated, which is only 30 megabytes in size. When using this index file, the voice often converts with artifacts. In other models I've trained on 10-20 minutes of data, the index files weigh 120+ megabytes. What should I do in this situation?

simple ore Mar 8, 2025, 10:02 AM

#

glass igloo Hello. I've encountered an issue while creating an index file. I have approximat...

with 4000+ sliced segments the index creation attempts to narrow down the dataset, but it only goes so far before it detects that it gets no improvement

#

you can always run inference without an index and check if that comes okay

glass igloo Mar 8, 2025, 10:09 AM

#

simple ore you can always run inference without an index and check if that comes okay

Without the index is fine, but the voice is not as similar as with the index.

simple ore Mar 8, 2025, 10:18 AM

#

the voice comes from the model

#

index is just an accent

#

7 hours of audio is too much for a finetune and not enough for training from scratch

flint glade Mar 8, 2025, 10:32 AM

#

guys

#

can anyone help me

#

on the voice changer

glass igloo Mar 8, 2025, 10:32 AM

#

simple ore the voice comes from the model

What is the best length of the dataset to use? If I shorten the dataset, is it better to continue training the same model or start training again?

brittle wing Mar 8, 2025, 10:32 AM

#

hallow thistle Do not install any Python program related using the very most recent version of ...

Thanks I now use 3.10.16 and it works flawlessly

flint glade Mar 8, 2025, 10:32 AM

#

i downloaded it but when i try to open it, it doesn't open

brittle wing Mar 8, 2025, 10:32 AM

#

simple ore you may need to downgrade python to 3.10

Thanks it works

flint glade Mar 8, 2025, 10:32 AM

#

brittle wing Thanks it works

can you help me

brittle wing Mar 8, 2025, 10:33 AM

#

flint glade can you help me

Can you be more specific about why you cannot open it, is it because of the dependencies?

flint glade Mar 8, 2025, 10:33 AM

#

i have downloaded the voice changer and the guy on the video says open it

#

i open it but it doesnt open

low shard Mar 8, 2025, 10:35 AM

#

flint glade i have downloaded the voice changer and the guy on the video says open it

Don't ever follow YouTube tutorials for RVC and Wokada, those are old

#

First of all, you want a realtime voice changer for calls?

flint glade Mar 8, 2025, 10:36 AM

#

can you recommend me a good voice changer? i want to make a girl voice

flint glade Mar 8, 2025, 10:36 AM

#

low shard First of all, you want a realtime voice changer for calls?

yes and for games

low shard Mar 8, 2025, 10:36 AM

#

flint glade yes and for games

Wrong channel then

flint glade Mar 8, 2025, 10:36 AM

#

okay

#

for

#

discord

#

i will

low shard Mar 8, 2025, 10:36 AM

#

flint glade okay

RVC doesn't mean realtime voice changer

#

RVC = Retrieval-based-Voice-Conversion, the best speech-to-speech AI models (on v2); it runs inference (uses models) on pre-recorded audio (AI covers) and trains (makes) models

Wokada = uses RVC for realtime inference

#

Tell us your PC's GPU in #🔍│help-w-okada

flint glade Mar 8, 2025, 10:36 AM

#

ok

simple ore Mar 8, 2025, 10:37 AM

#

glass igloo What is the best length of the dataset to use? If I shorten the dataset, is it b...

30-60 min, content variety > quantity, 3 hours of mumbling is worse than 15 minutes of speaking and singing

#

new model, obviously

knotty moth Mar 8, 2025, 11:48 AM

#

glass igloo Hello. I've encountered an issue while creating an index file. I have approximat...

that's unless you force-select the faiss method (which could take longer and produce an index file of around 2 GB) instead of leaving the index training on auto

empty parrot Mar 8, 2025, 12:14 PM

#

simple ore index is just an accent

Yo, i didn’t know index is for accent. I always thought .pth always needs it to work. Is there a good documentation to learn some of this? Id like to understand general idea how it works but not the too deep explanation for like engineers.

hallow thistle Mar 8, 2025, 12:16 PM

#

A .pth file is always required for it to work, of course.

#

An index file stores the voice's accent and is used with a specific voice model. It is created during the voice training process.

brittle wing Mar 8, 2025, 2:05 PM

#

Yo guys,
I still don’t get how Batch Size really works.
Does 4 actually improve sound quality, or is it just a performance thing?

odd shale Mar 8, 2025, 2:17 PM

#

brittle wing Yo guys, I still don’t get how Batch Size really works. Does 4 actually improve ...

Yep, it will improve it depending on your dataset.

#

(I'm talking about quality)

#

It will mostly be useful if you have a short dataset below 4-5 mins.

#

If your dataset is around 10+ mins, go for 8 on batch

analog obsidian Mar 8, 2025, 2:19 PM

#

brittle wing Yo guys, I still don’t get how Batch Size really works. Does 4 actually improve ...

batch size = number of files trained at once
bs 4 trains 4 files at once
bs 8 trains 8 files at once
for simplicity:
less than 30 mins of audio = use batch 4
more than 30 mins of audio = use batch 8

#

a batch size that's too high on a small dataset might hurt the model's ability to generate new audio
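The rule of thumb above fits in a few lines. This is only a sketch of that simplification, not anything Applio does internally; the function name is made up, and treating exactly 30 minutes as "use 8" is an assumption since the rule only says "less than" and "more than":

```python
def suggest_batch_size(dataset_minutes: float, threshold_minutes: float = 30.0) -> int:
    """Return 4 for datasets shorter than the threshold, otherwise 8."""
    if dataset_minutes <= 0:
        raise ValueError("dataset length must be positive")
    # Assumption: a dataset of exactly `threshold_minutes` falls into the larger bucket.
    return 4 if dataset_minutes < threshold_minutes else 8


for minutes in (5, 20, 45):
    print(minutes, "min ->", "batch size", suggest_batch_size(minutes))
```

Pass a lower `threshold_minutes` (for example 10) to try the more aggressive suggestion given earlier in the thread; either way it is a starting point to test from, not a guarantee.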

odd shale Mar 8, 2025, 2:21 PM

#

analog obsidian batch size = number of files trained at once bs 4 trains 4 files at once bs 8 tr...

Wait, Lyery

#

Is it really a good idea to use batch 4 with any dataset under 30 minutes?

#

Hold on, I misread XD

analog obsidian Mar 8, 2025, 2:21 PM

#

misc_troll

#

if the dataset is shorter than 30 mins, use batch size 4
if the dataset is longer than 30 mins, use batch size 8

brittle wing Mar 8, 2025, 2:23 PM

#

Got it! So for less than 30 mins, Batch 4 is best, and for more than 30 mins, Batch 8 is better. Just to be sure, using Batch 4 on a longer dataset wouldn’t really improve quality, right?

analog obsidian Mar 8, 2025, 2:24 PM

#

brittle wing Got it! So for less than 30 mins, Batch 4 is best, and for more than 30 mins, Ba...

nope
it could actually make it worse, the model would get stuck in a bad place and never leave it

#

so the model would sound weird

brittle wing Mar 8, 2025, 2:25 PM

#

analog obsidian nope it could actually make it worse, the model would be stuck in a bad place an...

Ohh I see! So using Batch 4 on a long dataset could actually trap the model in a bad state? Thanks for the clarification!

odd shale Mar 8, 2025, 2:27 PM

#

brittle wing Ohh I see! So using Batch 4 on a long dataset could actually trap the model in a...

Yep, for that reason i mostly recommend using 8 on batch for any dataset beyond 10-20 mins.

analog obsidian Mar 8, 2025, 2:27 PM

#

brittle wing Ohh I see! So using Batch 4 on a long dataset could actually trap the model in a...

yesss
technically any batch size works but the results depend heavily on the dataset
for making things simple just stick to what i said before

#

there are times where bs 4 gives better results than 8 and vice versa

odd shale Mar 8, 2025, 2:28 PM

#

analog obsidian there are times where bs 4 gives better results than 8 and viceversa

Yup, it's a matter of testing

brittle wing Mar 8, 2025, 2:29 PM

#

Thanks a lot!

earnest stone Mar 8, 2025, 5:36 PM

#

how do you make rvc?

low shard Mar 8, 2025, 6:00 PM

#

earnest stone how do you make rvc?\

elaborate:

- ur pc gpu
- what do you mean? did you mean make an rvc model?

karmic flax Mar 8, 2025, 6:01 PM

#

low shard elaborate: - ur pc gpu - what do you mean? did you mean make an rvc model?

im sure he gone already. mericCat

low shard Mar 8, 2025, 6:02 PM

#

karmic flax im sure he gone already. mericCat

welp, youre right 😭

#

People need to understand that helpers might be busy sometimes and they can't reply in 2 mins

#

Do you need any help?

karmic flax Mar 8, 2025, 6:03 PM

#

yeah..

karmic flax Mar 8, 2025, 6:03 PM

#

low shard Do you need any help?

well. i got help before but im sure he was also busy. i couldnt solve my issue but im also veeery nooby when it comes to software so yeah

#

from what i understood, torch/torchaudio doesn't have working versions for the 50-series nvidia cards? and i'd need to manually update to a nightly version of both

#

buutt yeah

low shard Mar 8, 2025, 6:04 PM

#

karmic flax well. i got help before but im sure he was also busy. i couldnt solve my issue b...

elaborate:

- your pc gpu
- what guide/download link are you using
- the issue specifically
- what do you want to do

karmic flax Mar 8, 2025, 6:05 PM

#

low shard elaborate: - your pc gpu - what guide/download link are you using - the issue sp...

RTX5080
Got told Applio is best.
Not getting it to work in 50series cards
Just convert existing audio using models into other voices

low shard Mar 8, 2025, 6:09 PM

#

karmic flax RTX5080 Got told Applio is best. Not getting it to work in 50series cards Just c...

Thank you for replying

I checked that you meant this chat #✨│ai-help message. Unfortunately this is related to the RTX 50 series needing a new PyTorch version, meaning there's no precompiled version, and you need to do it via source
I don't have a 50 series, but I can try helping you out

karmic flax Mar 8, 2025, 6:09 PM

#

low shard Thank you for replying I checked that you meant this chat https://discord.com/c...

Would greatly appreciate that

#

if u have the patience to handle me haha

low shard Mar 8, 2025, 6:12 PM

#

karmic flax Would greatly appreciate that

you're running on windows 11, right?

karmic flax Mar 8, 2025, 6:12 PM

#

low shard you're running on windows 11, right?

Yes

low shard Mar 8, 2025, 6:14 PM

#

karmic flax Yes

try going back to where you had that libtorchaudio missing issue, open CMD, run env\python -m pip install https://download.pytorch.org/whl/nightly/cu128/torchaudio-2.6.0.dev20250308%2Bcu128-cp310-cp310-win_amd64.whl

#

then, try running applio

#

@karmic flax btw be aware of the missing ROPs and melted connectors on the 50 series, many 50 series gpus are having those issues

#

you should prob check out if it's happening to you too

karmic flax Mar 8, 2025, 6:16 PM

#

low shard @karmic flax btw be aware of the missing ROPs and melted connectors for...

yeah. I luckily dont have missing rops c: and my gpu is powerlimited to use 288watts. the 12v connector is rated for 600watts

low shard Mar 8, 2025, 6:17 PM

#

karmic flax yeah. I luckily dont have missing rops c: and my gpu is powerlimited to use 288w...

alright, hopefully nvidia does something soon about this

low shard Mar 8, 2025, 6:17 PM

#

low shard try going back to where you had that libtorchaudio missing issue, open CMD, run...

also let me know about this

karmic flax Mar 8, 2025, 6:17 PM

#

hmm i have errors but i cant dm u and i dont have perms to upload a pic

low shard Mar 8, 2025, 6:17 PM

#

karmic flax hmm i have errors but i cant dm u and i dont have perms to upload a pic

!give-media-perms 1h @karmic flax

karmic flax Mar 8, 2025, 6:18 PM

#

low shard Mar 8, 2025, 6:18 PM

#

can you try uploading the pic now?

#

weird, @karmic flax can you try to run the same command #✨│ai-help message, but add --force-reinstall after install and leave everything else as it was

karmic flax Mar 8, 2025, 6:20 PM

#

already confused cirnoblush

low shard Mar 8, 2025, 6:21 PM

#

karmic flax already confused cirnoblush

go to cmd again, as you did before, then run env\python -m pip install --force-reinstall torch-2.6.0+cu128.nv-cp310-cp310-win_amd64.whl

karmic flax Mar 8, 2025, 6:22 PM

#

so i needa get that file first

low shard Mar 8, 2025, 6:24 PM

#

karmic flax

oh I thought you downloaded #✨│ai-help message

well, you can run env\python -m pip install https://huggingface.co/w-e-w/torch-2.6.0-cu128.nv/resolve/main/torch-2.6.0+cu128.nv-cp310-cp310-win_amd64.whl

#

seems like you don't need to force-reinstall, because you didn't install it in the first place

karmic flax Mar 8, 2025, 6:24 PM

#

i was looking for it but i couldnt find the correct version

low shard Mar 8, 2025, 6:25 PM

#

could you try the command I just told you?

karmic flax Mar 8, 2025, 6:25 PM

#

yeah its downloading rn

#

its doing something at least lol

low shard Mar 8, 2025, 6:26 PM

#

karmic flax

nice, try running applio after it's done

karmic flax Mar 8, 2025, 6:26 PM

#

i mean red is always bad right. but ill try running it lol

#

low shard Mar 8, 2025, 6:29 PM

#

karmic flax i mean red is always bad right. but ill try running it lol

can you run #✨│ai-help message and tell me the output?

karmic flax Mar 8, 2025, 6:29 PM

#

low shard can you run https://discord.com/channels/1159260121998827560/1159290139609137264...

low shard Mar 8, 2025, 6:34 PM

#

karmic flax

shit

what if you try
env\python -m pip install https://download.pytorch.org/whl/nightly/cu128/torchaudio-2.6.0.dev20250306%2Bcu128-cp310-cp310-win_amd64.whl

#

this build is earlier than the one I sent you before

karmic flax Mar 8, 2025, 6:34 PM

#

low shard Mar 8, 2025, 6:36 PM

#

karmic flax

my last guess would be running: env\python -m pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu128

#

see if that works

karmic flax Mar 8, 2025, 6:40 PM

#

well dang. it worked

#

uchuClap thank uuu aloooot 🫂

low shard Mar 8, 2025, 6:41 PM

#

low shard my last guess would be running: `env\python -m pip install --pre torch torchvisi...

@simple ore you might find this helpful btw

low shard Mar 8, 2025, 6:41 PM

#

karmic flax uchuClap thank uuu aloooot 🫂

you're welcome

simple ore Mar 8, 2025, 8:30 PM

#

low shard @simple ore you might find this helpful btw

yeah, they've fixed the nightly build 3 days ago
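After swapping in a nightly build like this, it can help to confirm what actually got installed before launching Applio. A hedged sketch using standard `torch` calls; save it as a file (the name `check_torch.py` is just an example) and run it with the same interpreter Applio uses, for example `env\python check_torch.py`. The function name is illustrative:

```python
def describe_torch_cuda() -> dict:
    """Collect basic facts about the installed torch build, if any."""
    try:
        import torch
    except (ImportError, OSError):
        # OSError covers a torch install whose native libraries fail to load.
        return {"torch_installed": False}
    report = {
        "torch_installed": True,
        "torch_version": torch.__version__,
        "built_for_cuda": torch.version.cuda,  # None on CPU-only builds
        "cuda_available": torch.cuda.is_available(),
    }
    if report["cuda_available"]:
        report["device_name"] = torch.cuda.get_device_name(0)
        report["device_capability"] = torch.cuda.get_device_capability(0)
        # Architectures this torch build was compiled for.
        report["compiled_archs"] = torch.cuda.get_arch_list()
    return report


print(describe_torch_cuda())
```

If `cuda_available` is false, or the card's compute capability has no matching entry in `compiled_archs`, the build most likely does not support the GPU and inference will fall back to CPU or fail.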

violet badger Mar 8, 2025, 9:16 PM

#

Sorry I have not trained any models before. How might I do such a thing for rvc? (https://huggingface.co/spaces/TheStinger/Ilaria_RVC)

low shard Mar 8, 2025, 9:22 PM

#

violet badger Sorry I have not trained any models before. How might I do such a thing for rvc?...

IlariaRVC isn't for training models, it's only for inference

#

what's your pc gpu

violet badger Mar 8, 2025, 9:22 PM

#

3070

low shard Mar 8, 2025, 9:23 PM

#

violet badger 3070

yeah you wouldn't even need to use Ilaria RVC, it's a cloud (remote good pc) service, but your pc is good enough

#

As you got a good PC, you can use RVC locally, you can choose between:

- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC

violet badger Mar 8, 2025, 9:23 PM

#

low shard yeah you wouldn't even need to use Ilaria RVC, it's a cloud (remote good pc) ser...

Oh Alright, how might I train a voice though

#

oh

low shard Mar 8, 2025, 9:24 PM

#

violet badger oh

lmk

violet badger Mar 8, 2025, 9:24 PM

#

low shard lmk

WHich one would be easier in your opinion

low shard Mar 8, 2025, 9:24 PM

#

violet badger WHich one would be easier in your opinion

Applio

violet badger Mar 8, 2025, 9:25 PM

#

low shard Applio

Wasn't AI Hispano, the people who made Applio, hacked?

#

or is that fixed

low shard Mar 8, 2025, 9:25 PM

#

violet badger Wasnt Ai Hispano the people who made Applio hacked/

that happened a long time ago, in October 2024

#

it's all completely fixed and safe

violet badger Mar 8, 2025, 9:28 PM

#

low shard it's all completely fixed and safe

Alright, which one would I download? I cant find the download button shown and im not sure which one I should click

low shard Mar 8, 2025, 9:29 PM

#

violet badger Alright, which one would I download? I cant find the download button shown and i...

Compiled > Windows > The download button

violet badger Mar 8, 2025, 9:29 PM

#

low shard Compiled > Windows > The download button

Thank you

low shard Mar 8, 2025, 9:29 PM

#

yw

brittle wing Mar 8, 2025, 11:50 PM

#

Yo quick question about my RVC training. I’m at 27k steps now. Loss kept dropping but has been super slow since 10k-12k.

analog obsidian Mar 8, 2025, 11:56 PM

#

brittle wing Yo quick question about my RVC training. I’m at 27k steps now. Loss kept droppin...

listen to the epochs and compare them

#

if its overtrained the model is going to sound robotic

#

while in epochs that arent overtrained they're going to sound just fine

brittle wing Mar 8, 2025, 11:58 PM

#

analog obsidian hear the epochs and compare them

Thanks! I'll compare and pick the best one.

analog obsidian Mar 8, 2025, 11:58 PM

#

if you use spek its easier to spot overtraining, there'll be missing frequencies in the result

brittle wing Mar 8, 2025, 11:59 PM

#

analog obsidian if you use spek its easier to spot overtraining, there'll be missing frequencies...

Oh okay, thanks! I'll check with Spek to see if there are missing frequencies. Appreciate the tip

low shard Mar 9, 2025, 12:24 AM

#

#📰│dev-updates message

knotty moth Mar 9, 2025, 12:24 AM

#

analog obsidian if you use spek its easier to spot overtraining, there'll be missing frequencies...

missing frequencies? it is when the spectrogram cutoff is lower than the target sample rate

analog obsidian Mar 9, 2025, 12:25 AM

#

knotty moth missing frequencies? it is when the spectrogram cutoff is lower than the target ...

no, sorry, i explained it badly
i meant to say to check for missing harmonics lmao

#

overtrained models generate noise instead of harmonics at some point

#

🦈

knotty moth Mar 9, 2025, 12:33 AM

#

analog obsidian overtrained models generate noise instead of harmonics at some point

imo the detail will be crisper to match the dataset but it loses some pretrain ability (that could lead to some possible robotic sounds)

analog obsidian Mar 9, 2025, 12:35 AM

#

knotty moth imo the detail will be crisper to match the dataset but it loses some pretrain a...

yuh

small vortex Mar 9, 2025, 6:23 AM

#

Best anime latina egirl mommy wifey voice model?

#

hallow thistle Mar 9, 2025, 6:29 AM

#

small vortex Best anime latina egirl mommy wifey voice model?

#

Y'all be popping out and asking for E-girl voice model just to troll and cat the damn fish someone.

small vortex Mar 9, 2025, 6:31 AM

#

hallow thistle Y'all be popping out and asking for E-girl voice model just to troll and cat the...

I actually did this to see if you would still answer but damn

#

https://tenor.com/view/spongebob-get-a-job-soup-spongebob-squarepants-find-a-job-gif-19390154

#

hallow thistle Mar 9, 2025, 6:32 AM

#

https://cdn.discordapp.com/emojis/1016021305545461950.webp?size=48

small vortex Mar 9, 2025, 6:35 AM

#

Like everything you do is just answering people who asks for e-girl models

#

lol

#

just actually searched your messages lol

#

misc_baffled

tame mica Mar 9, 2025, 6:39 AM

#

mf predicted what he would say

#

lowkey ts funny i fw u doe

hallow thistle Mar 9, 2025, 6:41 AM

#

Like if asking anything would make you more money though.

#

Shit. You acting like if I do this everyday huh.

turbid root Mar 9, 2025, 6:46 AM

#

Hi! How do you know if the model is overtrained?

knotty moth Mar 9, 2025, 7:05 AM

#

hallow thistle Like if asking anything would make you more money though.

this kind of thing somehow irritates me

strong shadow Mar 9, 2025, 10:20 AM

#

Hey, does anyone know how I can combine multiple (5) .VOB files into 1 whole video? No one seems to. I've tried Clipchamp but there's a 1-second gap/pause that's a mess.

glacial pollen Mar 9, 2025, 10:21 AM

#

strong shadow Hey does, anyone know how i can get multiple (5) .VOB files and combine them int...

Not a channel for that

long gazelle Mar 9, 2025, 11:11 AM

#

@low shard

low shard Mar 9, 2025, 11:12 AM

#

long gazelle @low shard

Oh nvm this seems an error on HuggingFace side, you could try again tomorrow

long gazelle Mar 9, 2025, 11:12 AM

#

ah okay

#

also could you explain the license thing

brittle wing Mar 9, 2025, 11:20 AM

#

Anyone know the required python version to run https://huggingface.co/spaces/TheStinger/Ilaria_RVC/tree/main

I use python 3.10.16 and run into errors while installing requirements:

Building wheels for collected packages: omegaconf, samplerate, srt, antlr4-python3-runtime
  Building wheel for omegaconf (pyproject.toml) ... done
  Created wheel for omegaconf: filename=omegaconf-2.0.6-py3-none-any.whl size=36882 sha256=0b988ea25770e060c1ad0bde20dfbd7da84924e620f08626d4507bff6e337ece
  Stored in directory: /tmp/pip-ephem-wheel-cache-mgtyolse/wheels/ee/67/d9/a68a521e487bb78d6599d3a157f5bb01d0760c689a9c2ac78f
  Building wheel for samplerate (pyproject.toml) ... error
  error: subprocess-exited-with-error
  
  × Building wheel for samplerate (pyproject.toml) did not run successfully.
  │ exit code: 1
  ╰─> [4 lines of output]
      running bdist_wheel
      running build
      running build_ext
      error: [Errno 2] No such file or directory: 'cmake'
      [end of output]
  
  note: This error originates from a subprocess, and is likely not a problem with pip.
  ERROR: Failed building wheel for samplerate
  Building wheel for srt (setup.py) ... done
  Created wheel for srt: filename=srt-3.5.3-py3-none-any.whl size=22483 sha256=2ca8125c77c760943695358d90630f7dbf1a8fc14d0c479a94cc8bbaa9d08d93
  Stored in directory: /home/yui/.cache/pip/wheels/d7/31/a1/18e1e7e8bfdafd19e6803d7eb919b563dd11de380e4304e332
  Building wheel for antlr4-python3-runtime (setup.py) ... done
  Created wheel for antlr4-python3-runtime: filename=antlr4_python3_runtime-4.8-py3-none-any.whl size=141246 sha256=5175ca3614fd8bc7ba3b90590c4920a09065e395cd7a667f3d53289ec8ed5974
  Stored in directory: /home/yui/.cache/pip/wheels/a7/20/bd/e1477d664f22d99989fd28ee1a43d6633dddb5cb9e801350d5
Successfully built omegaconf srt antlr4-python3-runtime
Failed to build samplerate
ERROR: Failed to build installable wheels for some pyproject.toml based projects (samplerate)
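The key line in that log is `No such file or directory: 'cmake'`: pip tried to compile `samplerate` from source and the build tool was not installed. A minimal sketch for checking this before retrying, using only the standard library (the function name is made up for illustration):

```python
import shutil


def missing_build_tools(tools=("cmake",)) -> list:
    """Return the tools from `tools` that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


missing = missing_build_tools()
if missing:
    print("Install these before retrying pip:", ", ".join(missing))
else:
    print("cmake is on PATH; the samplerate build should get past this step")
```

Having cmake available only addresses the error shown here; a C/C++ compiler is also needed for a source build, so further errors are possible.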

#

Also if there's a better alternative please let me know

low quail Mar 9, 2025, 11:24 AM

#

so a question, haven't touched ai voices in a while...
how do you make a cover of a song again? colab doesn't work for me anymore, whenever I try to load a model it gives me not found

hallow thistle Mar 9, 2025, 11:40 AM

#

brittle wing Also if there's a better alternative please let me know

The better alternative to RVC is Applio.

brittle wing Mar 9, 2025, 12:01 PM

#

hallow thistle The better alternative to RVC is Applio.

Thanks anime_pray

low shard Mar 9, 2025, 12:36 PM

#

long gazelle also could you explain the license thing

It's the license for usage of your model

long gazelle Mar 9, 2025, 12:36 PM

#

ah okay

low shard Mar 9, 2025, 12:37 PM

#

low quail so a question, haven't touched ai voices in a while... how do you make a cover ...

Tell:

- your PC GPU
- the google colab link you're using
- the error

low shard Mar 9, 2025, 12:38 PM

#

brittle wing Anyone know the required python version to run https://huggingface.co/spaces/The...

Are you trying to run it locally? It's not meant to run locally

#

Also what's your PC GPU and what do you want to do

#

AMD moment

#

What's your PC GPU

tulip cloak Mar 9, 2025, 12:58 PM

#

@low shard hey is mainline back, the colab one

low shard Mar 9, 2025, 12:59 PM

#

Maybe @simple ore knows, he used to have AMD until he recently, finally, switched to Nvidia

low shard Mar 9, 2025, 12:59 PM

#

tulip cloak @low shard hey is mailine back, the colab one

Nope, Hina is busy and it won't be fixed for at least another week, use Applio or something else meanwhile, also you can just look at #📰│dev-updates

tulip cloak Mar 9, 2025, 12:59 PM

#

low shard Nope, Hina is busy and won't be fixed for at least another week, use Applio or s...

Yeah I saw it, was just making sure

low shard Mar 9, 2025, 1:00 PM

#

Well the batch size does depend also on the dataset length, but 8 shouldn't make your PC crash at all

#

It's fine

outer isle Mar 9, 2025, 1:02 PM

#

I’m going to create (Mostly) Blackiana model with Apollo, is there a voice file?

low shard Mar 9, 2025, 1:04 PM

#

Tensorboard to check how the model goes

low shard Mar 9, 2025, 1:04 PM

#

outer isle I’m going to create (Mostly) Blackiana model with Apollo, is there a voice file?

You need to find yourself the dataset

hallow thistle Mar 9, 2025, 1:04 PM

#

You use Tensorboard to track the process, to make sure the model won't be too overtrained or undertrained.

#

Oh, so that's what Tensorboard looked like if run locally. cat_stare

low shard Mar 9, 2025, 1:06 PM

#

https://docs.aihub.gg/rvc/resources/training/#usage-guide

Training

Last update: Dec 24, 2024

brittle wing Mar 9, 2025, 1:10 PM

#

low shard Also what's your PC GPU and what do you want to do

Just a Laptop RTX 4050, I want to do this since running it online has restrictions

low shard Mar 9, 2025, 1:21 PM

#

brittle wing Just a Laptop RTX 4050, I want to do this since running it online has restrictio...

Ehh, you can run inference but you'll be limited on training

brittle wing Mar 9, 2025, 1:31 PM

#

low shard Ehh you can inference but will be limited on the training

IK, I don't train models. But Hugging Face has limitations even if you are just running inference, doesn't it?

low shard Mar 9, 2025, 1:36 PM

#

brittle wing IK, I don't train models. But huggin face has limitations even you are just infe...

Yeah that's because you're using an expensive PC GPU, google colab and Kaggle exist too btw

brittle wing Mar 9, 2025, 1:37 PM

#

low shard Yeah that's because you're using an expensive PC GPU, google colab and Kaggle ex...

oh, so using the CPU won't trigger the limit, thanks

low shard Mar 9, 2025, 1:52 PM

#

brittle wing oh the using the CPU won't trigger the limit, thanks

CPU is over 10 times slower

#

Are you sure it's using your GPU

hallow thistle Mar 9, 2025, 1:55 PM

#

If your GPU percent goes high in Task Manager, it's definitely working.

analog obsidian Mar 9, 2025, 2:20 PM

#

gpu and batch size?

#

also dataset size matters

#

it's normal
AMD is just very slow

#

150 epochs if every slice is 3s

#

if the slices are different lengths
around 200 epochs

#

ok
yes, around 200 epochs

#

as long as the cmd is open, it's fine

#

500 epochs is too much for small models, even when using the automated slice

#

ai is random so u cant just predict when its going to sound fine

#

it's always good practice to compare the epochs
at some point the model will naturally overtrain, which makes the final epochs sound very robotic

#

in that case your final model would be any epoch before overtraining

simple ore Mar 9, 2025, 2:38 PM

#

read the note at the bottom of the AMD installation instructions

low shard Mar 9, 2025, 2:45 PM

#

AMD moment

simple ore Mar 9, 2025, 3:00 PM

#

45min set on 6700xt was taking ~4min/epoch

#

yeah, it was an overnight training to 200e
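Those two numbers explain the "overnight" part. A quick worked estimate, assuming the time per epoch stays roughly constant over the run (the function name is illustrative):

```python
def estimate_training_hours(minutes_per_epoch: float, epochs: int) -> float:
    """Total wall-clock hours if every epoch takes about the same time."""
    return minutes_per_epoch * epochs / 60.0


hours = estimate_training_hours(minutes_per_epoch=4, epochs=200)
print(f"{hours:.1f} h")  # 13.3 h
```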

low shard Mar 9, 2025, 3:34 PM

#

simple ore 45min set on 6700xt was taking ~4min/epoch

tbh I thought it was way faster, wasn't zluda optimized?

simple ore Mar 9, 2025, 3:35 PM

#

it is faster than colab

#

not as fast as kaggle

low shard Mar 9, 2025, 3:35 PM

#

simple ore not as fast as kaggle

yeah T4x2

#

yeah 30 hours of GPU weekly

#

way better than google colab that has random daily gpu with a max of 4 hours daily

simple ore Mar 9, 2025, 3:39 PM

#

use scalars tab

upbeat chasm Mar 9, 2025, 3:40 PM

#

Does anyone know why, when I import RVC models into the voice.ai app, every voice sounds almost the same?

low shard Mar 9, 2025, 3:41 PM

#

upbeat chasm Does enyone know why when I import rvc to voice.ai app every voice sounds almost...

voice.ai sucks, don't use it at all

#

@upbeat chasm you want realtime voice changer for calls? tell your pc gpu in #🔍│help-w-okada

#

I mean wokada ones are fine, they are open source

upbeat chasm Mar 9, 2025, 3:42 PM

#

Then what free service should I use

low shard Mar 9, 2025, 3:42 PM

#

upbeat chasm Then what free service should I use

you should use wokada, either locally or on cloud

#

as I said, tell your pc gpu in #🔍│help-w-okada

#

you can also use it on google colab and kaggle, yes

#

rvc realtime from mainline/original rvc is pretty old

upbeat chasm Mar 9, 2025, 3:43 PM

#

I want a voice changer just for myself to use, not to use on calls

simple ore Mar 9, 2025, 3:43 PM

#

expand losses, or avg_50 if you have that

low shard Mar 9, 2025, 3:44 PM

#

upbeat chasm I want a voice changer just for myself to use, not to use on calls

so, inference (use models) on pre-recorded audios?

#

still, tell your pc gpu

#

@upbeat chasm You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

simple ore Mar 9, 2025, 3:45 PM

#

does not look good at all

#

depending on the model

upbeat chasm Mar 9, 2025, 3:46 PM

#

low shard @upbeat chasm You can check your pc gpu via: ctrl+shift+esc (task manage...

I know

simple ore Mar 9, 2025, 3:47 PM

#

flat mel and g loss means the model is close to the dataset in the type of the data it has

#

generally both have to go down

low shard Mar 9, 2025, 3:48 PM

#

@upbeat chasm #🔍│help-w-okada message since you got a 3060 laptop, and don't need to use the models in realtime for games/calls

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

simple ore Mar 9, 2025, 3:50 PM

#

in at least the first 5k steps both mel loss and g loss have to go down

#

if they dont there's something wrong with the dataset or something else

#

too big of a batch size or something
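The rule of thumb above (mel loss and g loss should both fall within roughly the first 5k steps) can be sketched as a small check. This is only an illustration: the helper name, the window size, and the sample values are assumptions, and the (step, loss) pairs would have to be read off the scalars tab by hand.

```python
from statistics import mean


def is_decreasing(points, max_step=5000, window=5):
    """Return True if the loss trends down within the first `max_step` steps.

    `points` is a list of (step, loss) pairs. The check compares the mean of
    the first `window` samples with the mean of the last `window` samples.
    """
    early = [loss for step, loss in sorted(points) if step <= max_step]
    if len(early) < 2 * window:
        raise ValueError("not enough samples to judge a trend")
    return mean(early[-window:]) < mean(early[:window])


# Made-up values, one sample every 500 steps
falling = [(i * 500, 30 - i) for i in range(1, 11)]
flat = [(i * 500, 25.0) for i in range(1, 11)]
print(is_decreasing(falling), is_decreasing(flat))
```

If either loss comes back flat or rising, that matches the "something wrong with the dataset or the batch size" case described above.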

#

g total

#

depends on the dataset size/batch size

#

What's your batch size?

#

too much for 15min

#

use 4

blazing solar Mar 9, 2025, 4:04 PM

#

-colab

karmic oliveBOT Mar 9, 2025, 4:04 PM

#

blazing solar -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

simple ore Mar 9, 2025, 4:05 PM

#

and now your model exploded

#

because you're training with FP16

#

you can stop now, it is dead
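For context on why FP16 is fragile: half precision cannot hold finite values above 65504, so a large intermediate value overflows. The sketch below only illustrates that numeric limit using Python's `struct` half-precision format; it is not how Applio trains, and the log does not show what actually overflowed in this run.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def fits_in_fp16(value: float) -> bool:
    """Return True if `value` can be stored as a finite half-precision float."""
    if not math.isfinite(value):
        return False
    try:
        struct.pack("<e", value)  # "e" is the 16-bit float format
    except OverflowError:
        return False
    return True


print(fits_in_fp16(FP16_MAX), fits_in_fp16(70000.0))
```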

rough parrot Mar 9, 2025, 4:31 PM

#

! C:\Users\user\Downloads\vcclient_win_cuda_2.0.76-beta.zip: C:\Users\user\AppData\Local\Temp\Rar$EXa23844.2909.rartemp\dist\main\web_front\assets\
yall know the reason for this?

simple ore Mar 9, 2025, 4:33 PM

#

rough parrot ! C:\Users\user\Downloads\vcclient_win_cuda_2.0.76-beta.zip: C:\Users\user\AppDa...

unzip the file lol

rough parrot Mar 9, 2025, 4:33 PM

#

alr

dim nacelle Mar 9, 2025, 5:21 PM

#

Sorry, I want to know how to control the speed of the voice, and can I produce the voice and srt together? Thank you!

low shard Mar 9, 2025, 5:24 PM

#

rough parrot ! C:\Users\user\Downloads\vcclient_win_cuda_2.0.76-beta.zip: C:\Users\user\AppDa...

are you following a youtube tutorial?

#

the file name seems to be the latest version of Original Wokada (Wokada is used for using RVC models in realtime for calls/games)
But the wokada deiteris fork is way better than the original wokada in performance and quality

#

@rough parrot tell your pc gpu in #🔍│help-w-okada , this is also the wrong channel since RVC and Wokada aren't the same program

coral viper Mar 9, 2025, 5:33 PM

#

What is the step to use voice models in voice.ai app?

quasi condor Mar 9, 2025, 5:34 PM

#

coral viper What is the step to use voice models in voice.ai app?

literally don't use voice ai it sucks

coral viper Mar 9, 2025, 5:35 PM

#

quasi condor literally don't use voice ai it sucks

Why is that?

#

Then what should I use?

quasi condor Mar 9, 2025, 5:36 PM

#

coral viper Why is that?

if u wanna use realtime voice changer use w-okada

#

or if u wanna make a cover use applio

coral viper Mar 9, 2025, 5:36 PM

#

quasi condor or if u wanna make a cover use applio

Is this free?

#

Where can I download?

quasi condor Mar 9, 2025, 5:37 PM

#

coral viper Is this free?

first what's ur gpu?

coral viper Mar 9, 2025, 5:44 PM

#

quasi condor first what's ur gpu?

What's a GPU?

#

Where can I see my GPU?

quasi condor Mar 9, 2025, 5:45 PM

#

coral viper What's a GPU?

right click ur taskbar and click task manager

coral viper Mar 9, 2025, 5:46 PM

#

quasi condor right click ur taskbar and click task manager

And then?

quasi condor Mar 9, 2025, 5:47 PM

#

there should be gpu 0

#

click that and tell me what's ur gpu

coral viper Mar 9, 2025, 5:48 PM

#

quasi condor right click ur taskbar and click task manager

I don't quite understand this

#

Can you just send me the pic of the step?

quasi condor Mar 9, 2025, 5:48 PM

#

@low shard u can take it from here

coral viper Mar 9, 2025, 5:49 PM

#

I'm not smart

#

Now you speak in my language

#

RTX 3050, I guess

dim nacelle Mar 9, 2025, 5:52 PM

#

Could Applio 3.1.1 control the speed of the ai voice? And can I produce the voice and srt file together?

coral viper Mar 9, 2025, 5:52 PM

#

That's what you mean right?

#

Or the processor?

#

And then what? How to install Applio

#

Ok

low shard Mar 9, 2025, 5:53 PM

#

coral viper What is the step to use voice models in voice.ai app?

don't use voice.ai

low shard Mar 9, 2025, 5:55 PM

#

coral viper RTX 3050, I guess

are you looking for ai covers or realtime for calls

coral viper Mar 9, 2025, 5:55 PM

#

low shard are you looking for ai covers or realtime for calls

No, for song cover

#

But I'll try another fun I guess

low shard Mar 9, 2025, 5:56 PM

#

coral viper No, for song cover

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.com: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio UI Colab: max 4 hours daily, not granted, of GPU
- RVC-AI-Cover-Maker-UI Colab: Automatically separates the vocals and instrumentals, converts the voice and mixes it all back together

Easiest possible (automatically separates vocals & instrumentals) : weights.gg & rvc-ai-cover-maker-ui
easiest cloud: Ilaria rvc zero
easiest local: Applio

coral viper Mar 9, 2025, 5:56 PM

#

Download compiled version right?

#

Kinda sus

#

Seriously please

#

On it

#

Why is that?

#

ApplioV3.2.8-bugfix.zip right?

#

Oh

#

Can we request a model? Is there a payment?

#

After download Applio zip, what else?

#

And then?

#

Why is it take so long to extract?

analog fossil Mar 9, 2025, 6:35 PM

#

Where can I get rvc v2 voice models?

coral viper Mar 9, 2025, 6:36 PM

#

@primal barn I didn't find batfild

low shard Mar 9, 2025, 6:36 PM

#

analog fossil Where can I get rvc v2 voice models?

You can search rvc ai voice models at:

#1175430844685484042
In #🔍│find-models , Do /find with @earnest musk
https://weights.com/ (login required)
https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
https://voice-models.com/
https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

#1159289738314919936
#1191429836321849435
make it yourself with our docs guides https://docs.aihub.gg/essentials/how-to-make-voice-models/

earnest muskBOT Mar 9, 2025, 6:36 PM

#

low shard You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

:wave: @low shard, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

coral viper Mar 9, 2025, 6:37 PM

#

Oh

#

@primal barn The screen is black

#

Then what?

#

It's just dark with "Applio" on the left up corner

#

Yes I guess

#

Wait, is it require internet?

#

its like this

#

Oh yes, now it open in browser

Now what?

#

#

In browser?

#

Yes, and then what?

#

Why not in the program itself instead?

#

Yeah

#

So it's online then?

#

Oh cool

#

@primal barn Now how to put the model?

stray raven Mar 9, 2025, 6:49 PM

#

hi guys i see support came for the 50 series https://github.com/IllIlIlIllIl/voice-changer/releases/tag/b2335

i have a 5080 can i get help installing the voice changer please? for some reason i cant work it out, i had a diff pc with a weaker gpu and it worked good there

low shard Mar 9, 2025, 6:49 PM

#

stray raven hi guys i see support came for the 50 series https://github.com/IllIlIlIllIl/voi...

wrong channel

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI models (on v2); it does inference (use models) on pre-recorded audio (ai covers) and trains (makes) models

Wokada = uses RVC for realtime inference

#

use #🔍│help-w-okada

stray raven Mar 9, 2025, 6:49 PM

#

my bad

coral viper Mar 9, 2025, 6:49 PM

#

How to put the model?

#

@primal barn So I must move the model file into that folder?

#

Just the pth? Not the index too?

#

Put the index and pth alongside the reference and the mute?

#

folders

#

Aight aight. So after I put the model ...

Then I put the song or audio I want there

And then just convert?

#

How to do that?

#

Dang, it sounds goofy cat_blush

#

@primal barn It sounds goofy. And some part there's like a glitchy sound

#

Can I fix it to sound more smoothly?

lime wadi Mar 9, 2025, 7:13 PM

#

hi i tried to download RVC but when i start it it gives me an error saying that the 'thinker' module is missing does anyone know how to fix this?

coral viper Mar 9, 2025, 7:20 PM

#

Both

#

glacial pollen Mar 9, 2025, 7:41 PM

#

coral viper

This is quite likely due to harmonies / vocal layering / and echo / harsh reverb smearing the f0 traces too much

coral viper Mar 9, 2025, 7:41 PM

#

The song. Cuz I don't know how to separate

quasi condor Mar 9, 2025, 7:43 PM

#

coral viper The song. Cuz I don't know how to separate

jesus

#

u could've just separated it on uvr5 or mvsep

coral viper Mar 9, 2025, 7:44 PM

#

quasi condor u could've just separated it on uvr5 or mvsep

Are they paid?

quasi condor Mar 9, 2025, 7:44 PM

#

using melband

quasi condor Mar 9, 2025, 7:44 PM

#

coral viper Are they paid?

hell no

coral viper Mar 9, 2025, 7:44 PM

#

quasi condor using melband

Why melband?

quasi condor Mar 9, 2025, 7:44 PM

#

coral viper Why melband?

its good for extracting vocals

coral viper Mar 9, 2025, 7:45 PM

#

How?

#

I hate watching youtube tutorial

quasi condor Mar 9, 2025, 7:45 PM

#

coral viper I hate watching youtube tutorial

the tutorials on youtube are outdated

coral viper Mar 9, 2025, 7:46 PM

#

You said that I should pay for it

glacial pollen Mar 9, 2025, 7:46 PM

#

coral viper I hate watching youtube tutorial

You wanna get uvr as others said, and also get fv4 model ( gabox's melband roformer voc fv4 )

#

Imho, currently the best one

#

Also, don't use the link they gave you, it won't have support for the newest models

#

https://github.com/Anjok07/ultimatevocalremovergui/releases/download/v5.6/UVR_1_15_25_22_30_BETA_full.exe

#

first this one

glacial pollen Mar 9, 2025, 7:48 PM

#

glacial pollen first this one

and then you wanna patch the uvr with:
https://github.com/TRvlvr/model_repo/releases/download/uvr_update_patches/UVR_Patch_1_21_25_2_28_BETA_small_rofo.exe

#

oh

#

there

#

Once you've done all that, you wanna get a model I'll link

coral viper Mar 9, 2025, 7:48 PM

#

I don't understand how to patch cute_dogwave

glacial pollen Mar 9, 2025, 7:49 PM

#

those are installs my man

#

install one, install the 2nd one

#

No manual work involved in that part

#

Once you install both in the order I specified, head to:
https://huggingface.co/GaboxR67/MelBandRoformers/tree/main/melbandroformers/vocals

GaboxR67/MelBandRoformers at main

#

and download this:

#

It's the fv4 model for isolation I mentioned

coral viper Mar 9, 2025, 7:49 PM

#

glacial pollen install one, install the 2nd one

Using the 2 files from above?

glacial pollen Mar 9, 2025, 7:50 PM

#

coral viper Using 2 the files from above?

yes, first the first link, then the 2nd

#

first is the base ish, 2nd is the patch

coral viper Mar 9, 2025, 7:50 PM

#

glacial pollen Once you install both in the order I specified, head to: https://huggingface.co/...

Then what's this?

glacial pollen Mar 9, 2025, 7:50 PM

#

It is a model for separation

#

which you'll use in uvr

#

the fv4 model I mentioned

#

you'll have to download it

get my config file:

📎 voc_gabox.yaml

#

Now, focus.
Once you install the uvr, you'll have to find its folder and then put the model ( fv4 ) in here:

#

so, uvr's folder / models / mdx_net_models

#

( yes, you'll just cut n paste the model file in there )
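The manual step above (drop the fv4 model file into uvr's folder / models / mdx_net_models) could also be scripted. A minimal sketch, assuming the folder layout is exactly as written in the chat; the function name and the example paths in the comment are hypothetical, and it copies rather than cuts the file.

```python
from pathlib import Path
import shutil


def install_uvr_model(model_file: Path, uvr_dir: Path) -> Path:
    """Copy a separation model into <uvr_dir>/models/mdx_net_models."""
    target_dir = uvr_dir / "models" / "mdx_net_models"
    if not target_dir.is_dir():
        # Refuse to guess: the UVR install folder must already exist
        raise FileNotFoundError(f"expected model folder not found: {target_dir}")
    return Path(shutil.copy2(model_file, target_dir))


# Example call (both paths are placeholders for your own locations):
# install_uvr_model(Path("Downloads/model.ckpt"), Path("path/to/uvr"))
```

After copying, the model still has to be selected and configured inside UVR as described in the following messages.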

glacial pollen Mar 9, 2025, 7:53 PM

#

glacial pollen you'll have to download it + get my config file:

As for the config file ( the yaml file )
Once you open the uvr and select the fv4 model:

coral viper Mar 9, 2025, 7:53 PM

#

glacial pollen It is a model for separation

I don't understand. Aren't the models just pth?

glacial pollen Mar 9, 2025, 7:53 PM

#

.pth is just a format of models used in pytorch

#

pth = pytorch

#

.ckpt is another format meaning a checkpoint

#

don't worry about that part

coral viper Mar 9, 2025, 7:54 PM

#

Oh god, this is too much for my 12 y.o brain

glacial pollen Mar 9, 2025, 7:54 PM

#

well.. If you wanna hop into vc, I might help you if you screenshare

#

Other than that, you gotta follow the text instructions

coral viper Mar 9, 2025, 7:55 PM

#

That'll just make things worse

I'll just follow your tutorial from what you sent above

If I get stuck, I'll just ping you

glacial pollen Mar 9, 2025, 7:55 PM

#

sounds good

glacial pollen Mar 9, 2025, 7:55 PM

#

glacial pollen As for the config file ( the yaml file ) Once you open the uvr and select the fv...

anyway, back to this.
Once you select the fv4, you'll be prompted to configure it or select the config, something along that

#

#

like this

#

In the model type you'll have to select " mel band roformer " ( just not the v2 variant )

coral viper Mar 9, 2025, 7:56 PM

#

glacial pollen you'll have to download it + get my config file:

What should I do with yaml?

glacial pollen Mar 9, 2025, 7:56 PM

#

that part is for the first box

#

the " select model param "

#

there'll be an option to open / use the yaml config

#

and you'll use the one I sent you

#

Lastly, you'll be clicking ok or was it apply for all windows

#

And that's that for adding custom models

#

Now as for configuring the uvr for usage

#

#

( do it after properly setting up fv4 model )

#

you could also do this:

#

so, setting the wav type to 32 bit float

#

( in case you'll be using uvr for making datasets / samples for model training, else you can keep it as 16 bit )

#

Oh yea, in here you can use 11 or 16, I'd recommend 16 tho

#

Now... if you need an authentic guide for uvr, models n stuff..
https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit?tab=t.0
But I warn you, it's a messy spaghetti 👀

Google Docs

Instrumental, vocal & other stems separation & mix/master guide - U...

edit 04.03.25 deton24’s Instrumental and vocal & stems separation & mastering (UVR 5 GUI: VR/MDX-Net/MDX23C/Demucs 1-4, and BS/Mel-Roformer in beta MVSEP-MDX23-Colab/KaraFan/drumsep/LarsNet/SCNet x-minus.pro (uvronline.app)/mvsep.com/ GSEP/Dango.ai/Audioshake/Music.ai) General reading advice | D...

coral viper Mar 9, 2025, 8:05 PM

#

glacial pollen Now... if you need an authentic guide for uvr, models n stuff.. https://docs.goo...

What's this? Wdym spaghetti?

glacial pollen Mar 9, 2025, 8:05 PM

#

Open the website up and you'll understand what I mean : )

#

as for what it is, well, exactly what it says

#

Documentation, just docs on uvr and models

coral viper Mar 9, 2025, 8:05 PM

#

Aren't the steps you sent above before enough?

glacial pollen Mar 9, 2025, 8:06 PM

#

I mean ye, it's enough, but if you wanna know more about everything, that's the place

coral viper Mar 9, 2025, 8:07 PM

#

Oh ok

glacial pollen Mar 9, 2025, 8:07 PM

#

mh mh

coral viper Mar 9, 2025, 8:26 PM

#

glacial pollen

I'm here. Which one I should click?

glacial pollen Mar 9, 2025, 8:28 PM

#

I mean

#

I already wrote there which ones

#

Did you actually read my msgs

#

coral viper Mar 9, 2025, 8:29 PM

#

I did, but it's confusing, looks like it's not in order

glacial pollen Mar 9, 2025, 8:29 PM

#

👀 because someone cut into my msgs making it more confusing than it should be.

Model type: mel-band roformer

model param box: from yaml config / config file or something along that

coral viper Mar 9, 2025, 8:30 PM

#

In Select Model Param, which one I should click?

Or is that where I put yaml?

#

After I download yaml file, where I should put it?

In what folder?

glacial pollen Mar 9, 2025, 8:30 PM

#

send ss

coral viper Mar 9, 2025, 8:32 PM

#

glacial pollen send ss

glacial pollen Mar 9, 2025, 8:33 PM

#

the model param box, open it and show an ss of it

#

pro tip, shift+win+s lets you select the area to make a screenshot out of / crop

#

then ctrl+v in here

#

well, taking you long

#

Don't want to sound rude but, can't be staying here for a whole day 🙄 got stuff to do

coral viper Mar 9, 2025, 8:41 PM

#

coral viper Mar 9, 2025, 8:42 PM

#

glacial pollen pro tip, shift+win+s lets you select the area to make a screenshot out of / cro...

When I do that, it keeps going back to before I clicked Select Model Param

glacial pollen Mar 9, 2025, 8:42 PM

#

#

install new yaml

coral viper Mar 9, 2025, 8:42 PM

#

glacial pollen Don't want to sound rude but, can't be staying here for a whole day 🙄 got stuff...

Ok sorry. You don't have to reply it now

glacial pollen Mar 9, 2025, 8:42 PM

#

this is what you always use when first adding new model

#

so, any custom models

coral viper Mar 9, 2025, 8:42 PM

#

glacial pollen

Ok

glacial pollen Mar 9, 2025, 8:43 PM

#

then ye, set the type as mel-band roformer ( again, not v2 ) and confirm all

#

and that's all

coral viper Mar 9, 2025, 8:43 PM

#

glacial pollen install new yaml

What yaml I should install?

glacial pollen Mar 9, 2025, 8:43 PM

#

One I provided

coral viper Mar 9, 2025, 8:43 PM

#

Oh ok

glacial pollen Mar 9, 2025, 8:43 PM

#

glacial pollen you'll have to download it + get my config file:

this

coral viper Mar 9, 2025, 8:44 PM

#

glacial pollen this

and then?

glacial pollen Mar 9, 2025, 8:44 PM

#

Save config

#

then confirm

coral viper Mar 9, 2025, 8:45 PM

#

Confirm where?

#

There is no confirm button

glacial pollen Mar 9, 2025, 8:45 PM

#

Unless the window disappeared then that's fine

#

Either way, once you close all the windows ( if any ) you'll have the main ui

coral viper Mar 9, 2025, 8:45 PM

#

Save config and close?

glacial pollen Mar 9, 2025, 8:45 PM

#

ye

coral viper Mar 9, 2025, 8:46 PM

#

Then how about the model type?

glacial pollen Mar 9, 2025, 8:46 PM

#

......

#

coral viper Mar 9, 2025, 8:46 PM

#

glacial pollen Mar 9, 2025, 8:47 PM

#

Why so many people lately have such attention issues

#

misc_cry

coral viper Mar 9, 2025, 8:47 PM

#

Wait wait

#

It's 3 AM here

coral viper Mar 9, 2025, 8:48 PM

#

glacial pollen

Done. And then?

glacial pollen Mar 9, 2025, 8:48 PM

#

got the main ui?

coral viper Mar 9, 2025, 8:49 PM

#

glacial pollen got the main ui?

This?

glacial pollen Mar 9, 2025, 8:49 PM

#

yes

#

now

#

#

mdx-net, and the box below, fv4 model

coral viper Mar 9, 2025, 8:49 PM

#

On it

glacial pollen Mar 9, 2025, 8:49 PM

#

as for 1 and 2

#

you can drag n drop your music / song onto 1 area

#

and you can also drag some empty folder ( for instance, if you wanted it for outputs ) into 2 area

#

much quicker than doing it the other way

coral viper Mar 9, 2025, 8:50 PM

#

glacial pollen

Shall I must turn all of this in order?

glacial pollen Mar 9, 2025, 8:51 PM

#

well no

coral viper Mar 9, 2025, 8:51 PM

#

Good

#

@glacial pollen Oh, anyway. What version is it again?

5.6.1 right?

glacial pollen Mar 9, 2025, 8:53 PM

#

of the uvr?

#

it's 5.6 but beta variant

#

just name it 5.6_beta if you want

coral viper Mar 9, 2025, 8:53 PM

#

glacial pollen of the uvr?

Yes

#

I installed the patch you mentioned the same way as the full version one

Is it right?

glacial pollen Mar 9, 2025, 8:55 PM

#

coral viper The patch you said to install was installed by me with the same way like the ful...

why asking

#

something doesn't work?

#

both uvr and the patch are installed the same way as anything else that is an installer

coral viper Mar 9, 2025, 8:55 PM

#

glacial pollen why asking

To make sure that I didn't miss any steps

glacial pollen Mar 9, 2025, 8:56 PM

#

you'll know if you did all right once u run an isolation

#

If you get no errors, means you did all right

coral viper Mar 9, 2025, 8:56 PM

#

glacial pollen If you get no errors, means you did all right

It said no errors in green

glacial pollen Mar 9, 2025, 8:56 PM

#

ye, then it works

coral viper Mar 9, 2025, 8:57 PM

#

glacial pollen ye, then it works

But why it take so long?

Is it because I want to make it in flac?

glacial pollen Mar 9, 2025, 8:58 PM

#

coral viper But why it take so long? Is it because I want to make it in flac?

Uhhhh, your gpu?

#

Aside, I never said it'll be fast

#

it takes time, esp at overlap 11 or 16

coral viper Mar 9, 2025, 8:58 PM

#

glacial pollen Uhhhh, your gpu?

30

glacial pollen Mar 9, 2025, 8:58 PM

#

1-3 mins is pretty normal

#

for most songs

#

for instance, a 3~ ish min song for me takes 1.2 to 1.5 mins or something ( rtx 3060 )

#

so there's nothing wrong about it

coral viper Mar 9, 2025, 8:59 PM

#

Aight, I take it maybe because I want it in flac

#

@glacial pollen So, the result will be vocal and instrument?

knotty moth Mar 9, 2025, 8:59 PM

#

coral viper But why it take so long? Is it because I want to make it in flac?

overlap 8 or lower is faster but 16 has optimal quality consistency (let's consider it like ultra quality in game graphics settings)

glacial pollen Mar 9, 2025, 9:00 PM

#

No, vocal

#

fv4 is a vocal model, haven't tested it in instru mode so can't promise anything

#

and if you need a model that yeets the backing vocals, try this:

#

#

it is downloaded from this section

#

you download it, then hit refresh list and done. It'll appear in the model list

flint solar Mar 9, 2025, 9:02 PM

#

glacial pollen No, vocal

Does applio Kaggle use the latest code?

glacial pollen Mar 9, 2025, 9:03 PM

#

flint solar Does applio Kaggle use the latest code?

Not sure

#

I don't maintain kaggle or colabs so, can't help sadly

analog obsidian Mar 9, 2025, 9:04 PM

#

flint solar Does applio Kaggle use the latest code?

by default no but you can easily change it to use the main branch instead

knotty moth Mar 9, 2025, 9:05 PM

#

glacial pollen and if you need a model that yeets the backing vocals, try this:

there is also melroformer karaoke by aufr33, which can also be a decent alternative

glacial pollen Mar 9, 2025, 9:05 PM

#

knotty moth there is also melroformer karaoke by aufr33, which can also be a decent alter...

Oh, will have to check it out

coral viper Mar 9, 2025, 9:05 PM

#

glacial pollen and if you need a model that yeets the backing vocals, try this:

No I just want to separate vocal and instrument

And then put the vocal in Applio

And then put the ai covered vocal and the instrumental back together again

glacial pollen Mar 9, 2025, 9:05 PM

#

karaoke 2 is super nice and, tbf not sure how it compares to classic bve ( been ages

#

but kara 2 has an awful noise

#

so, maybe the one u pointed out is a lil better

flint solar Mar 9, 2025, 9:06 PM

#

analog obsidian by default no but you can easily change it to use the main branch instead

What abt the applio colab

glacial pollen Mar 9, 2025, 9:06 PM

#

coral viper No I just want to separate vocal and instrument And then put the vocal in Appl...

For instru there are probs better models

#

https://docs.google.com/document/d/17fjNvJzj8ZGSer7c7OFe_CNfUKbAxEh_OBv94ZdRG5c/edit?tab=t.0

Google Docs

Instrumental, vocal & other stems separation & mix/master guide - U...

edit 04.03.25 deton24’s Instrumental and vocal & stems separation & mastering (UVR 5 GUI: VR/MDX-Net/MDX23C/Demucs 1-4, and BS/Mel-Roformer in beta MVSEP-MDX23-Colab/KaraFan/drumsep/LarsNet/SCNet x-minus.pro (uvronline.app)/mvsep.com/ GSEP/Dango.ai/Audioshake/Music.ai) General reading advice | D...

#

check the docs

coral viper Mar 9, 2025, 9:06 PM

#

glacial pollen For instru there are probs better models

Wdym?

glacial pollen Mar 9, 2025, 9:06 PM

#

ctrl+f and type in instru or instrumental

glacial pollen Mar 9, 2025, 9:07 PM

#

coral viper Wdym?

Exactly what I said. For instrumentals there are probably better models

#

because fv4

#

is a voc type model

#

it was made to handle vocals, mainly ( or rather, is mostly good at vocals and I can't promise anything on instrumentals' quality. Haven't tried it for that )

coral viper Mar 9, 2025, 9:07 PM

#

glacial pollen Exactly what I said. For instrumentals there are probably better models

I thought the character model

glacial pollen Mar 9, 2025, 9:07 PM

#

coral viper I thought the character model

🤔

coral viper Mar 9, 2025, 9:09 PM

#

glacial pollen 🤔

Voice model I mean

knotty moth Mar 9, 2025, 9:14 PM

#

coral viper No I just want to separate vocal and instrument And then put the vocal in Appl...

even after you separate using a vocal model like gabox fv4, you should check if it contains some backing vocals/harmonies

glacial pollen Mar 9, 2025, 9:14 PM

#

coral viper Voice model I mean

prev-gen models were rather universal ish, mostly
Nowadays tho, more and more models are being made with a specific specialization in mind

#

Instrumentals, vocals, stems, sfx, backing vocals / harmonies

#

and some diverge into having specific properties, such as fullness of vocals, or noise, bleedless etc

#

In this case, fv4 is primarily a voice / vocal model

#

If you need more details and explanations / overviews, the doc I sent you has all of that.
All you need to do is search for a keyword ( ctrl + f ) and have a read

broken urchin Mar 9, 2025, 9:29 PM

#

can someone help me

#

inference doesnt work on Applio

#

i put in my audio and convert it and it just gives out a blank voice recording

#

glacial pollen Mar 9, 2025, 9:37 PM

#

broken urchin i put in my audio and convert it and it just gives out a blank voice recording

send ss of the console log

#

Also, which applio, newest one? or precompiled package

broken urchin Mar 9, 2025, 9:37 PM

#

newest applio

broken urchin Mar 9, 2025, 9:38 PM

#

glacial pollen send ss of the console log

glacial pollen Mar 9, 2025, 9:38 PM

#

firstly, try to change the mp3s name

#

no spaces

#

use _s as alternative
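The renaming tip above (no spaces, underscores instead) can be sketched like this. The helper name and the example path are illustrative only, and the snippet just computes the new name without touching any file.

```python
from pathlib import Path


def without_spaces(path: Path) -> Path:
    """Return the same path with spaces in the file name replaced by underscores."""
    return path.with_name(path.name.replace(" ", "_"))


# Example with a made-up file name
print(without_spaces(Path("assets/audios/my song.mp3")).name)
```

To actually rename a file, pass the result to `Path.rename`, for example `src.rename(without_spaces(src))`.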

knotty moth Mar 9, 2025, 9:39 PM

#

broken urchin

check if the file really exists in that directory, or if not sure, convert it to wav first

broken urchin Mar 9, 2025, 9:40 PM

#

yeah the file is in the folder

#

its there

knotty moth Mar 9, 2025, 9:42 PM

#

glacial pollen no spaces

tbh it should still work on files with spaces and somewhat kanji chars

glacial pollen Mar 9, 2025, 9:43 PM

#

well, supposedly ye
yet most times issues of a " file doesn't exist " sort happen, it's either the naming, corrupted file, path or lack of ffmpeg

simple ore Mar 9, 2025, 9:43 PM

#

likely it is not .mp3

#

it may be just .wav renamed to mp3
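One way to test the "wav renamed to mp3" theory is to look at the first bytes of the file instead of its extension. A rough sketch under stated assumptions: the function name is made up, it only recognizes WAV and the two most common MP3 starts, and the frame-sync test can also match other MPEG audio streams.

```python
def sniff_audio_format(header: bytes) -> str:
    """Guess the audio container from the first bytes of a file."""
    if header[:4] == b"RIFF" and header[8:12] == b"WAVE":
        return "wav"
    if header[:3] == b"ID3":
        return "mp3"  # MP3 that starts with an ID3v2 tag
    if len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0:
        return "mp3"  # bare MPEG audio frame sync (11 set bits)
    return "unknown"


# Reading the header of a real file would look like:
# with open("input.mp3", "rb") as f:
#     print(sniff_audio_format(f.read(12)))
```

If a file named .mp3 comes back as "wav", renaming it to .wav or re-exporting it is the simplest fix to try.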

glacial pollen Mar 9, 2025, 9:44 PM

#

broken urchin its there

Can you check on some other random audio first?

#

gotta rule out the file itself being the issue

broken urchin Mar 9, 2025, 9:45 PM

#

glacial pollen Can you check on some other random audio first?

i will try

coral viper Mar 9, 2025, 9:47 PM

#

glacial pollen and if you need a model that yeets the backing vocals, try this:

Where can I get UVR-MDX-NET Karaoke 2?

broken urchin Mar 9, 2025, 9:47 PM

#

glacial pollen Can you check on some other random audio first?

i tried on different audio

#

still the same thing

#

just gives out a blank recording

#

also same error in the console

#

i also tried on different format

glacial pollen Mar 9, 2025, 9:49 PM

#

coral viper Where can I get UVR-MDX-NET Karaoke 2?

I showed it on screenshot

#

it is downloaded from within the uvr

glacial pollen Mar 9, 2025, 9:49 PM

#

broken urchin i tried on different audio

Check if you have the ffmpeg files in ur applio folder

#

#

these
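The screenshot with the file names did not survive in this log, so the names below are an assumption (`ffmpeg.exe` and `ffprobe.exe` are a guess at "both of those"). With that caveat, the folder check could be sketched as:

```python
from pathlib import Path


def missing_tools(app_dir: Path, names=("ffmpeg.exe", "ffprobe.exe")) -> list[str]:
    """Return the tool file names that are NOT present directly in app_dir.

    The default names are an assumption, pass your own if they differ.
    """
    return [name for name in names if not (app_dir / name).is_file()]


# Example call (the folder path is a placeholder):
# print(missing_tools(Path("path/to/applio")))
```

An empty list means every file that was checked for is in place.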

broken urchin Mar 9, 2025, 9:50 PM

#

i have both of those

glacial pollen Mar 9, 2025, 9:50 PM

#

Also, again, which applio you running

broken urchin Mar 9, 2025, 9:50 PM

#

the one from the server

#

from ai hub website

coral viper Mar 9, 2025, 9:51 PM

#

glacial pollen I showed it on screenshot

No. In my UVR5 only UVR-MDX-NET Inst HQ 5

broken urchin Mar 9, 2025, 9:52 PM

#

https://docs.aihub.gg/rvc/local/applio/

glacial pollen Mar 9, 2025, 9:52 PM

#

broken urchin the one from the server

well

broken urchin Mar 9, 2025, 9:52 PM

#

this one

glacial pollen Mar 9, 2025, 9:52 PM

#

lemme rephrase. Did you download a zip

#

aka, unpack n run

broken urchin Mar 9, 2025, 9:52 PM

#

yeah it was a zip file

glacial pollen Mar 9, 2025, 9:52 PM

#

coral viper No. In my UVR5 only UVR-MDX-NET Inst HQ 5

It is there, you gotta scroll down

#

( doesn't show in my case cause I already have it downloaded, the karaoke 2 )

glacial pollen Mar 9, 2025, 9:53 PM

#

broken urchin yeah it was a zip file

well, that's all weird

#

Normally people don't encounter such issues

#

nowadays at least

broken urchin Mar 9, 2025, 9:53 PM

#

so what do i do lmao

glacial pollen Mar 9, 2025, 9:53 PM

#

well, I can propose checking my fork maybe
if you're up for it

#

but that'll result in some downloading 🤔

#

cause like, having ffmpeg, checking the path / naming, validating on other files
that's pretty much all there is to diagnosing the issue

broken urchin Mar 9, 2025, 9:54 PM

#

is that the only way?

glacial pollen Mar 9, 2025, 9:54 PM

#

Those audio related problems are pretty obscure

#

and unclear

glacial pollen Mar 9, 2025, 9:54 PM

#

broken urchin is that the only way?

Imo the easiest

#

as it has to work 100%, my fork, no other way

#

that'd indicate an issue with something else, other than applio itself at least

#

Well, it'd end up on downloading stuff anyways

cause you'd have to get normal applio from repo ( not the package )

#

but while we're at it, imma recommend my fork cause why not ¯_(ツ)_/¯

#

Else I'm out of ideas

broken urchin Mar 9, 2025, 9:56 PM

#

so i cant really fix this

coral viper Mar 9, 2025, 9:56 PM

#

glacial pollen It is there, you gotta scroll down

broken urchin Mar 9, 2025, 9:56 PM

#

its something in my pc

#

thats causing my issue

#

its not applio

coral viper Mar 9, 2025, 9:57 PM

#

I did scroll, but it reached limit and no more showing

glacial pollen Mar 9, 2025, 9:57 PM

#

coral viper

man man man

#

What is written there 🙂

glacial pollen Mar 9, 2025, 9:58 PM

#

coral viper

Because you show me the main ui, not the settings section

glacial pollen Mar 9, 2025, 9:58 PM

#

broken urchin its something in my pc

I have a hard time imagining what it'd be

#

Applio works within its own environment

#

it's independent from ur pc

#

Unless you're not using it that way 🤔 ( for whatever reason

#

you have env folder in applio?

broken urchin Mar 9, 2025, 9:59 PM

#

yes i have env

glacial pollen Mar 9, 2025, 9:59 PM

#

Try this

#

for infer ^

#

put it in assets/audios

broken urchin Mar 9, 2025, 10:00 PM

#

alright hold up

#

what language is that i dont understand anything

glacial pollen Mar 9, 2025, 10:00 PM

#

That is surprising 🤔

#

considering ur anime pfp

#

It's japanese, the language

#

Anyways, did it work?

#

or nah

broken urchin Mar 9, 2025, 10:01 PM

#

yeah it worked

glacial pollen Mar 9, 2025, 10:01 PM

#

Then that's the case

broken urchin Mar 9, 2025, 10:01 PM

#

glacial pollen Mar 9, 2025, 10:01 PM

#

It's either the path u screwed up

#

or the naming

broken urchin Mar 9, 2025, 10:01 PM

#

it gave out a recording actually

glacial pollen Mar 9, 2025, 10:01 PM

#

as I said before

#

oh

#

recording?

#

wdym

#

It didn't do the inference? Did it just output the input file?

broken urchin Mar 9, 2025, 10:01 PM

#

yeah it did the inference

glacial pollen Mar 9, 2025, 10:01 PM

#

well then, it works

#

so again

coral viper Mar 9, 2025, 10:01 PM

#

glacial pollen and if you need a model that yeets the backing vocals, try this:

So which one I should use to yeet the background vocals?

Karaoke 2 or Inst HQ 4?

glacial pollen Mar 9, 2025, 10:01 PM

#

glacial pollen It's either the path u screwed up

^

low shard Mar 9, 2025, 10:01 PM

#

glacial pollen It's japanese, the language

-# I mean not everyone who watches anime knows japanese though

glacial pollen Mar 9, 2025, 10:02 PM

#

Show me how you previously would input the path to your audio

#

In here

#

broken urchin Mar 9, 2025, 10:02 PM

#

i would just record a voice message of my voice in voice recorder and just put it in the audios folder and thats it

glacial pollen Mar 9, 2025, 10:03 PM

#

well

knotty moth Mar 9, 2025, 10:03 PM

#

broken urchin it gave out a recording actually

still that ffmpeg error in the recording file? like I said try converting to wav in audacity

glacial pollen Mar 9, 2025, 10:03 PM

#

then name it differently

#

try "test"

#

for the name

broken urchin Mar 9, 2025, 10:03 PM

#

ok

glacial pollen Mar 9, 2025, 10:03 PM

#

if that fails, then those mp3s are being screwed up in some way

#

or the precompiled applio has issues with mp3s? ( doubt it but nothing surprises me anymore )

broken urchin Mar 9, 2025, 10:04 PM

#

ill try to rename it to test then

coral viper Mar 9, 2025, 10:04 PM

#

coral viper So which one I should use to yeet the background vocals? Karaoke 2 or Inst HQ 4...

@glacial pollen sir

broken urchin Mar 9, 2025, 10:05 PM

#

yeah i just tried them out on my voice recordings again but doesnt work lol

glacial pollen Mar 9, 2025, 10:05 PM

#

coral viper @glacial pollen sir

try hq4

#

karaoke 2 is for backing vocals

#

fv4 is for vocals

#

hq4 is for instrumentals, or really any other model the docs recommend or mention

broken urchin Mar 9, 2025, 10:05 PM

#

also the files are different for some reason

glacial pollen Mar 9, 2025, 10:05 PM

#

bruh

#

I mean, it's just metadata so

#

you can always use ffmpeg to convert it, maybe could help

#

you got ffmpeg installed and added to path, on ur pc?

broken urchin Mar 9, 2025, 10:06 PM

#

nah

#

i dont even know what that is honestly hahaha

glacial pollen Mar 9, 2025, 10:06 PM

#

or idk, take the file to the same location you have the ffmpeg in

broken urchin Mar 9, 2025, 10:06 PM

#

i just have it in the applio folder

glacial pollen Mar 9, 2025, 10:06 PM

#

then open up the cmd in there

#

then enter and you'll get cmd

#

now, having both ffmpeg and ur mp3 there

#

type in the cmd:
ffmpeg.exe -i test.mp3 test.wav

#

the " -i " means input, that's your file right after it

#

and then, there's the output, in this case test but as a wav
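
For doing the same conversion from a script instead of typing it in cmd, a minimal Python sketch might look like the following. The helper names `build_convert_cmd` and `convert_to_wav` are made up for illustration, and it assumes ffmpeg is either on PATH or passed in explicitly (for example as `ffmpeg.exe` sitting in the same folder).

```python
import subprocess
from pathlib import Path


def build_convert_cmd(src, ffmpeg="ffmpeg"):
    """Build the argument list for: ffmpeg -i <src> <src with .wav suffix>."""
    dst = Path(src).with_suffix(".wav")
    # "-i" marks the input file; the last argument is the output file.
    return [ffmpeg, "-i", str(src), str(dst)]


def convert_to_wav(src, ffmpeg="ffmpeg"):
    """Run ffmpeg on src and return the output file name.

    Raises subprocess.CalledProcessError if ffmpeg exits with an error.
    """
    cmd = build_convert_cmd(src, ffmpeg)
    subprocess.run(cmd, check=True)
    return cmd[-1]
```

Calling `convert_to_wav("test.mp3", ffmpeg="ffmpeg.exe")` would run the same command as the one typed above.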