#✨│ai-help

1 messages · Page 221 of 1

unique rock
#

I always run my models through Mel Band Karaoke and then through that model and my models are perfectly clean.

knotty moth
#

it's pretty good but can't remove delay echoes

unique rock
latent cypress
knotty moth
unique rock
unique rock
knotty moth
latent cypress
unique rock
# knotty moth

oh, any modifications to his aggression or are you leaving it intact?

latent cypress
latent cypress
knotty moth
#

unlike the latter, the former doesn't damage breaths

unique rock
# latent cypress sometimes

The way I do it usually removes the back notes and adlibs well, although sometimes it eats up the high notes too. It's quite clean. I should try your method.

latent cypress
#

yeah I usually use anvuew's dereverb after extracting the vocals, if there are still background vocals left after dereverb I usually go to uvronline (xminus) to remove them using melband karaoke or uvr bve v2

#

but if the background vocals still not removed, then i didn't add the audio to my dataset

unique rock
#

Removing the back and leads from Billie Eilish's songs is very difficult, but that method helped me a lot and my voice is very clean. Now I'll try your method and see how it goes.

latent cypress
unique rock
latent cypress
unique rock
# latent cypress which song?

all of them, meaning the volume is generally low and it is necessary to turn it up, that is where quality comes into play.

latent cypress
#

have you open it in audacity?

unique rock
#

Yeah

latent cypress
#

just normalize the volume or use amplify

unique rock
red kayak
#

thats a breath

brittle wing
#

-colab

patent trellisBOT
# brittle wing -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

buoyant frost
#

Whenever I try to use Inference tool on Applio, and press Convert - it just says processing and the amount of seconds next to it - but it never completes it, just displays "Error" - I haven't gotten it to work a single time

#

How long does it usually take to convert a 2 min audio file?

gleaming kindle
#

pitch extraction is taking forever, is that normal?

#

thread:7, f0ing, Hop-Length:64: 100%|█████████████████████████████████████████████████| 22/22 [54:02<00:00, 147.36s/it]
thread:3, f0ing, Hop-Length:64: 100%|█████████████████████████████████████████████████| 23/23 [56:09<00:00, 146.52s/it]
thread:10, f0ing, Hop-Length:64: 100%|██████████████████████████████████████████████| 22/22 [1:46:29<00:00, 290.43s/it]
thread:1, f0ing, Hop-Length:64: 83%|██████████████████████████████████████▊ | 19/23 [1:46:12<35:03, 525.98s/it]
thread:3, f0ing, Hop-Length:64: 100%|██████████████████████████████████████████████████| 23/23 [56:03<00:00, 65.50s/it]
thread:4, f0ing, Hop-Length:64: 64%|███████████████████████████████▏ | 14/22 [55:19<36:17, 272.20s/it]
thread:4, f0ing, Hop-Length:64: 68%|████████████████████████████████ | 15/22 [1:08:59<50:59, 437.06s/it]
thread:5, f0ing, Hop-Length:64: 73%|████████████████████████████████▋ | 16/22 [1:43:21<1:21:19, 813.17s/it]
thread:6, f0ing, Hop-Length:64: 50%|██████████████████████▌ | 11/22 [1:46:07<2:48:07, 917.04s/it]
thread:9, f0ing, Hop-Length:64: 32%|███████████████▎ | 7/22 [37:58<1:40:01, 400.07s/it]
thread:8, f0ing, Hop-Length:64: 64%|████████████████████████████▋ | 14/22 [1:35:28<1:47:45, 808.22s/it]
thread:9, f0ing, Hop-Length:64: 36%|████████████████▎ | 8/22 [1:42:16<5:47:19, 1488.57s/it]
thread:10, f0ing, Hop-Length:64: 100%|██████████████████████████████████████████████| 22/22 [1:46:13<00:00, 124.93s/it]
thread:11, f0ing, Hop-Length:64: 68%|█████████████████████████████▉ | 15/22 [1:43:07<1:24:01, 720.25s/it]
thread:12, f0ing, Hop-Length:64: 68%|█████████████████████████████▉ | 15/22 [1:43:07<1:00:33, 519.13s/it]
thread:13, f0ing, Hop-Length:64: 59%|██████████████████████████ | 13/22 [1:46:00<2:09:05, 860.64s/it]

#

it has been 3 hours

knotty moth
gleaming kindle
#

I had it use my gpu,,

knotty moth
#

and it is the lowest recommended gpu for training

gleaming kindle
#

im using a 4060 laptop

#

i decided to restart it

knotty moth
woeful crow
#

hi i cant run applio colab and i get this message

ERROR: Failed to launch TensorBoard (exited with 1).
Contents of stderr:
Traceback (most recent call last):
File "/usr/local/bin/tensorboard", line 4, in <module>
from tensorboard.main import run_main
ModuleNotFoundError: No module named 'tensorboard'
Traceback (most recent call last):
File "/content/program_ml/app.py", line 1, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'

buoyant frost
gleaming kindle
knotty moth
grim bay
#

I would say yes

#

But not sure if it is

blazing solar
#

Can yall help me this?

#

Hello?

low shard
#

Applio colab is broken for now @blazing solar @woeful crow

#

This is a Google colab/uv issue

low shard
blazing solar
low shard
# blazing solar What kind?

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

#

Or do you have a PC? Did you try checking it's GPU

latent cypress
#

Is it fine to add the same song but in a different language to my dataset?

latent cypress
white bough
latent cypress
white bough
#

For colab, atm you can replace all the instances of "!uv pip" in the install cell by "!pip" and it should work, although it might take longer

brittle wing
#

Hi, may I ask what was the way to clone the main repository into applio no UI colab so I can see the newer average graphs ?

brittle wing
#

-guides

simple ore
brittle wing
unique rock
full tundra
#

What rvc I can use to create and train voice model?

odd shale
surreal latch
unique rock
brittle wing
#

what is the link for the browser rpc

#

like the one u can use in browser

knotty moth
#

what rpc?

brittle wing
#

its called like liana rpc or sum

#

i remember it was a website

brittle wing
hallow thistle
knotty moth
#

nah just being dyslexic

hallow thistle
#

Too crazy.

hallow thistle
patent trellisBOT
hallow thistle
#

This is the correct command for showing guide links to RVC and Applio.

olive hill
#

What is this error on Kaggle

hallow thistle
olive hill
olive hill
hallow thistle
#

Try delete the notebook you created for yourself and then create the new one from start.

olive hill
olive hill
#

@hallow thistle Please help

knotty moth
low shard
serene horizon
#

So what is the cause of static/crackle in silence gaps in resulting file from inference?

I convert the same piece of audio, and the model I created in Applio has it, while the model I created in RVC Mainline doesn’t have it.

knotty moth
serene horizon
serene horizon
knotty moth
#

so you might want to fall back to the old labeling method for use in mainline

serene horizon
knotty moth
serene horizon
#

Also, Applio has more “electronic” noise on actual voice than Mainline model. Especially with “SSS” sounds.

knotty moth
#

some training configurations are actually different tho

serene horizon
latent cypress
#

is applio main automatically set to fp32?

analog obsidian
#

in the latest compiled version fp16 is enabled by default

latent cypress
#

ah i see, i was confused because i didn't find any option to change precision

latent cypress
analog obsidian
#

everything is fp32 now

latent cypress
#

okay, thanks

latent cypress
#

is using checkpointing really that slow? it's been 5 minutes after i click start training, and the cmd didn't show anything

analog obsidian
#

maybe you've got out of vram even with checkpointing enabled

#

i know that 48k requires way more vram to train models

#

40k should also require a bit more vram than 32k

golden walrus
#

guys, 1 question, is it possible to train a model that can be smooth in my language?

latent cypress
analog obsidian
#

that saves vram

#

but it got removed in applio because it was causing problems when training from scratch

eager siren
#

Is there any recommended cloud collab for training?"

blazing solar
latent cypress
analog obsidian
#

however you can use the index to force the accent of the dataset in the results

golden walrus
#

pepe_cry somehow i train it with my language data but end up sound like a British

analog obsidian
#

yea you're not alone in that, idk why it happens

#

never had that issue personally

latent cypress
#

i guess i have to try kaggle sometimes

analog obsidian
#

yes training is faster because there's a new flag named "benchmarking" that speeds up the training

latent cypress
analog obsidian
#

but noobies told me that the speed boost was more noticeable while using fp16, and that fp32 negated the speed gains

latent cypress
analog obsidian
#

for me it's also very fast lol, specially when training 32k

latent cypress
#

for some reason i doubted that my applio is using fp32 lol

analog obsidian
#

you can have fun and experiment with it if you want

go to rvc/train/train.py

find

torch.backends.cudnn.benchmark = True

set it to false to disable the speed gain

#

i have not compared how slow the training is with that setting turned off xd

latent cypress
analog obsidian
#

there's no point to train 48k with a 16k cutoff

#

slower training + high vram usage for basically the same results as if you were training using 32k

latent cypress
analog obsidian
latent cypress
#

the audio source is actually lossless, but after removing the vocals and reverb, some echoes still remained. then i removed them using a de-echo model, which caused this cutoff.

analog obsidian
#

i would use 32k

#

the dataset now has a 17k cutoff

#

and also is damaged due to separation models being used

latent cypress
latent cypress
analog obsidian
#

the dataset doesn't reach 20k nor 18k

#

17k is pretty much 32k territory

latent cypress
#

is the sound detail still good at 32k? haven't tried 32k until now

analog obsidian
#

you'll notice a difference if the dataset is actually true 48k/40k

#

but if the dataset is 32k instead, then no, training that in 48k/40k doesn't give you better results

analog obsidian
latent cypress
#

i just thought 32k is less detailed than 40k all the time lol

analog obsidian
#

40k and 48k are more clear than 32k

#

but again, only if the audios are true 40k or 48k

#

17k from separation models is not true 40k nor 48k

latent cypress
#

ah okay okay

analog obsidian
#

if your dataset had a frequency cutoff of 24k you can downsample it to 32k and compare them
see if you can hear some difference lol

analog obsidian
#

if you still have them, try to convert one of the audios to 32k

#

and try to hear if there is a big difference

#

i did it rn and i cant hear the difference

latent cypress
analog obsidian
#

see? don't be scared of 32k

#

48k has more fidelity than 32k, buuut, only hardcore audiophiles can hear the difference

latent cypress
#

i guess i don't have to worry about mix lossless audio and the audio from youtube into my dataset

analog obsidian
#

the dataset needs consistent quality

#

dont mix stuff

latent cypress
analog obsidian
latent cypress
#

i tried to make a singing model, tbh, my dataset is kinda repetitive if i only use lossless audio, while the singer is hitting high notes on youtube lol

analog obsidian
#

well you can experiment and try training your current dataset, but it is a known fact that training from different sources can lead to bad results

latent cypress
# analog obsidian dont mix stuff

I always do this when training the model, but for some reason people say my model quality is really good, i even got ai master role on some serverskull_sob

analog obsidian
#

im just telling you what i know
if u want to continue training like that, do it lols

#

at the end what matters is if you like how your models sound

latent cypress
#

and now I feel like I don't deserve the ai master role on that servermisc_cry

analog obsidian
#

ai is random
i also make mistakes while training models

#

this thing can be really confusing sometimes lol

#

im still learning more about rvc

#

it's all trial and error here

latent cypress
#

yeah, you're right

grim bay
#

@viscid moss Why does only UVR-Deecho-Dereverb works? The rest models in Dereverb tab are just giving error

opal kelp
#

-colab

patent trellisBOT
# opal kelp -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

knotty moth
storm holly
#

not sure it works anymore

knotty moth
knotty moth
knotty moth
golden walrus
#

guys, is there any guide on how to use UVR UI ?

#

i've met some error but i have no idea where i went wrong

#

like this

#

or

knotty moth
golden walrus
#

i don't get it, why error, what file

knotty moth
golden walrus
#

is it environment path or

#

the name of the voice file ?

knotty moth
storm holly
#

wait i need some kind of ngrok token for applio colab?

golden walrus
#

or the uvr path lah

knotty moth
golden walrus
#

ahhhhhhhh

#

i thought uvr just need to throw in that file and done

#

because i don't see any output path

#

or input path in the ui

knotty moth
#

I mean like C:\yourfolder\your.wav

#

also you can try first pre-convert the mp3 file to wav using audacity

golden walrus
#

cat_blush roger that

knotty moth
latent cypress
hot dome
golden walrus
#

i think i have to download the model and import it in the file somewhere

limpid birch
#

-colab

patent trellisBOT
# limpid birch -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

latent cypress
#

do you think it's overtrained after 8.7k steps?

viscid moss
#

if still... can u show me this part of the error?

#

My DMs are also open for help

#

I think the problem is the installation path, it's directly in C:\ Try installing it in Downloads

unborn stirrup
#

Is anybody else having issues with Mainline? I keep getting error "ERR_NGROK_8012".

wanton valley
#

same

#

from the #📰│dev-updates it seems there are still issues with the damn colab so i guess we wait

tropic garden
simple ore
simple ore
analog obsidian
#

old g/total graph

knotty moth
viscid moss
#

@left flame add those lines at the beggining of the cell

solemn walrus
#

.

keen island
#

hello, does the rvc gui also exist with RMVPE

knotty moth
#

-gui

patent trellisBOT
hallow thistle
patent trellisBOT
knotty moth
hallow thistle
#

Don't expect RVC-GUI to get updated with rmvpe model; this particular RVC fork is too old. Use a better RVC program like Applio instead.

keen island
#

Oh okay thanks, I'm not up to date, which is the easiest?

brittle wing
#

Applio no UI doesn't wat to preprocess dataset

keen island
#

I actually found the rvc gui quite good, audio in, audio out in the same folder

knotty moth
#

why good if there's no rmvpe?

keen island
#

That's exactly why I asked if the gui is also available with rmvpe, well I just have to switch

latent cypress
knotty moth
hallow thistle
knotty moth
keen island
#

I have already loaded trained voice models into rmvpe. Which folder do they have to go into in apollio?

hallow thistle
#

rmvpe isn't RVC program. It's a pitch extraction model used in RVC.

brittle wing
#

Applio no UI doesn't want to preprocess dataset

Traceback (most recent call last):
File "/content/Applio/core.py", line 15, in <module>
from rvc.lib.tools.prerequisites_download import prequisites_download_pipeline
File "/content/Applio/rvc/lib/tools/prerequisites_download.py", line 3, in <module>
from tqdm import tqdm
ModuleNotFoundError: No module named 'tqdm'
@simple ore also in the "download models" sub cell

latent cypress
knotty moth
brittle wing
#

What do I use for training

knotty moth
#

host BackEnd?

#

are you doing this shit for rvc?

brittle wing
#

What do I use for training is there something that works?

knotty moth
#

does it use rvc or nah?

#

do you know what purpose of this channel so that you're asking it here?

#

anyway you could ask chatgpt or claude

weak cipher
#

I will delete the message

low shard
brittle wing
#

What happened

low shard
#

you can use alternatives

#

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

#

ofc don’t mind about applio since it doesn’t work right now

brittle wing
low shard
#

It happens often that cloud breakes, especially google colab

#

because notebooks devs need to adapt to every cloud site update and bug, which is tiring

#

Or the other option would be buying a good pc, but not everyone can afford that

brittle wing
#

Hmm I just want a working Colab

low shard
# brittle wing Mainline?

mainline is the og rvc project yes, it doesn’t have the same perks like easy ui and tts or some adjustments, but it should work

brittle wing
#

The Colab

#

Mainline

low shard
#

I’m also checking right now if mainline colab works

brittle wing
#

Okay

low shard
#

because the same creator of the mainline colab, hina, is busy and some others of their colabs are broken like wokada, so i’m not sure if mainline is broken too

brittle wing
#

I hope mainline works at least but I find it difficult to use

low shard
brittle wing
low shard
#

else, the only other working way to use applio is to buy a good pc

#

because applio kaggle is broken too

#

or you could try other RVCs like mainline or rvc ai cover maker ui or ilaria rvc

low shard
brittle wing
#

-colab

patent trellisBOT
# brittle wing -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

latent cypress
low shard
brittle wing
low shard
#

so yeah use other alternatives

brittle wing
#

Whaaat

#

I can't train?

#

Applio UI colab?

low shard
#

cloud is unstable asf

brittle wing
#

When will. It get fixed

#

I have a model to train

low shard
brittle wing
#

...

low shard
#

for applio i told you we need uv and google colab to fix it, it’s their problem that causes issues for applio

#

it’s not an applio issue, it’s related to another thing that applio uses for cloud

low shard
brittle wing
#

I understand but when will it get fixed

low shard
#

or paying a model master commission, or wait for someone o do your free model request

brittle wing
#

I know that

low shard
brittle wing
low shard
latent cypress
#

medleyvox really ruined the frequency, and I didn't realize I was training the model with this audiomisc_cry

low shard
#

@brittle wing your only options are:

  • use weights
  • use other cloud alternatives, like mainline or applio on lightning ai
  • buy a good pc
  • ask a model master or maker to make it
  • wait for fixes
  • pay for cloud
brittle wing
low shard
brittle wing
low shard
#

no other ways than the ones i said

#

unless someone makes a script fix

brittle wing
#

I see no inference, no training

knotty moth
brittle wing
latent cypress
low shard
brittle wing
brittle wing
#

E complicato

low shard
latent cypress
latent cypress
brittle wing
latent cypress
low shard
patent trellisBOT
low shard
#

the first 2 should work

brittle wing
#

Does Ilaria rvc on hugginface work?

low shard
brittle wing
#

For inference

brittle wing
#

Is it the same result as Applio?

#

How do I use the lighting notebook

latent cypress
brittle wing
#

Wait it's similar to colab

latent cypress
brittle wing
low shard
#

traiming = make models

#

iference = use them

brittle wing
low shard
# brittle wing Okay does it allow pretrains

it’s the same as applio, just different cloud site, on lightning ai the interface is a bit harder, needs a phone number verification which could take some days for them to verify u, like 2 or 3, then also gives u a max 22 hours of gpu monthly
But the program is the same

brittle wing
low shard
brittle wing
#

I'm waiting for them to verify me

low shard
#

gpus are expensive

low shard
brittle wing
#

22 hours a month aren't enough

low shard
brittle wing
#

When will the fix come...

low shard
brittle wing
#

I know

low shard
brittle wing
#

I'm just saying

low shard
#

just like you can for colab too

low shard
brittle wing
#

I don't wanna pay Money for such things

#

I need colab back

low shard
#

you can’t expect to get the easiest service 24/7 all for free

brittle wing
low shard
#

AI is expensive asf

low shard
#

have patience

latent cypress
#

I always thought buying a good pc (at least rtx 4060) is an easy thing in other countries. considering the salary from their job is at least bigger than mine which is only 2.6M IDR ($162.50) per month.

brittle wing
low shard
brittle wing
#

Is it s python script problem or a general one.

tropic garden
#

We buy Macs instead and get stuck. 😥

low shard
# brittle wing How do i fix it

Unless you know coding and are able to find a workaround, all you can do is wait for google colab and uv to fix it, it’s a problem of the site hosted machines and of the python dependencies

latent cypress
simple sundial
#

hey guys whats up

low shard
brittle wing
#

...

low shard
simple sundial
#

Is this the right help channel for the voice-changer?

brittle wing
low shard
brittle wing
#

Weights isn't sophisticated enough no pretrains

#

WTF

low shard
latent cypress
low shard
brittle wing
simple sundial
low shard
#

rvc is for pre-recorded audios and model training

simple sundial
#

does it not stand for that?

#

ahhh

low shard
#

please read it

simple sundial
latent cypress
low shard
simple sundial
#

thanks then!

latent cypress
low shard
brittle wing
latent cypress
latent cypress
brittle wing
#

@low shard idk maybe someone found a fix

low shard
brittle wing
brittle wing
low shard
# brittle wing

No that’s an old fix that already got added, the issue is related to something else now

#

all you need is wait or use the alternatives

#

i don’t know how to tell you but there’s no other way

low shard
#

you don’t have other options

low shard
#

colabs break weekly

#

cloud in general breaks very very easily

latent cypress
# brittle wing WHAAT

like the fix was made in March, I'm not sure that's the way to fix clouds for now.

low shard
#

it’s another type of issue

brittle wing
low shard
#

i understand you need colab, but there’s no other way

#

Please just follow the options i told you

brittle wing
low shard
#

keeping to ask won’t fix anything

low shard
brittle wing
low shard
#

it was for pydantic

#

look

brittle wing
#

Since when are the colabs malfunctioning

brittle wing
low shard
low shard
#

It doesn’t matter if you keep asking, the answer will be always the same, cloud gets broken weekly

#

hopefully things get fixed soon

latent cypress
#

then it's useless to tell you how to fix it

low shard
low shard
#

Python library

simple ore
#

uv is a new requirements installer

brittle wing
golden walrus
#

guys, is the overtraining detection working ?

low shard
#

You need to modify the code like in the issue discussion

#

I mean that manually for yourself

latent cypress
#

i have so many audio with varying frequencies like this on my dataset, should I delete the audio that has cutoffs, or keep it and resample everything to 32k, then train it using a 32k pretrain?

simple ore
#

and what

brittle wing
latent cypress
brittle wing
#

Or you wanna know everything.

latent cypress
brittle wing
latent cypress
#

so i need to know how to train properly

brittle wing
#

You already know

#

I think

latent cypress
#

not really

brittle wing
#

Cause sometimes it's different

latent cypress
brittle wing
latent cypress
brittle wing
#

I know you have trained on Applio colab that's what I'm asking

brittle wing
# knotty moth

Zip file, right?Not a folder in drive containing audio files

latent cypress
latent cypress
#

hahaha

brittle wing
#

Wait what do I do

latent cypress
#

too easy

brittle wing
#

You know?

#

Ah okay

latent cypress
#

i mean i recognize you

brittle wing
#

But should it be a zip file uploaded into Applio

latent cypress
#

never used applio colab before

#

but on local you don't have to zip it

brittle wing
#

Well where's THE guide.

latent cypress
latent cypress
brittle wing
#

Yes

#

THANKS 💪

knotty moth
latent cypress
brittle wing
#

@simple ore IT WORKED

#

Like a miracle

brittle wing
#

Yes but I didn't understand about the format zip file or folder with audios.

brittle wing
#

I understood

#

Is "Load a backup" for resuming?

#

Where's the pretrains tab.

#

Nvm found in advanced settings

rocky radish
#

Hi guys im kinda a noobie how do i get my macbook voice changer to work

#

How do i connect the ai voice thing on mac to discord

brittle wing
#

Hmm how do I import dataset in Applio colab

latent cypress
#

is it normal for the loss/g/total starting from the bottom and not from the top?

brittle wing
latent cypress
brittle wing
brittle wing
#

Still doesn't work

brittle wing
#

I'm just wasting GPU for nothing

echo drum
#

I have a question guys. I heard some voice changers which also go with your depths and highs of your voice without sounding so electronic all of a sudden. Is there a quality difference depending on the model and where can I get a high quality one?

simple ore
brittle wing
#

Like as a zip file or audio file folder or a merge of all audios???like I need help.

#

I just wasted 2-3 hours of GPU just waiting for an answer...

warm iris
#

is there a google colab RVC coverter?

#

-RVC

patent trellisBOT
simple ore
#

then you provide a path for noUI colab

#

if you're using UI colab, then you make a dataset and upload wav files

brittle wing
simple ore
#

it does

brittle wing
#

How...

#

I tried and it don't work

brittle wing
simple ore
#

add this at the start of the install cell, run install

brittle wing
simple ore
#

it is the same, you just need to run it before you run the install

brittle wing
lunar perch
#

hello all 🤗
im searching a open source or free voice cloning with emotions.
best would be, the emotion reference audio can be a different language.
can you please give me some tipps, im near blind from searching google 😦

i want to dub some old anime to my homelanguage, so i used applio to train some models.
then tts for dub the new audio in my homelanguage. actual im testing f5-tts for the emotional speaking.
but a more "automation" on my work would help me a lot, so i start searching for something
that can help me speed up the process and is not limited to english/chinese language only.
thank you!

brittle wing
#

-colab

patent trellisBOT
# brittle wing -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI**

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

warm iris
#

can I get an actual download link without signing up for weights.com?

golden walrus
#

question, is it possible to stop training mid way ? because i accidentally set 500 epochs pepe_cry

warm iris
#

The computer that sung Daisy Bell

#

nvm, got it

pastel oak
oak plank
analog obsidian
#

use loss avg50, its more accurate than old loss graphs

analog obsidian
#

after some time the model will converge to a value
it'll have a flat line pattern

latent cypress
analog obsidian
latent cypress
analog obsidian
golden walrus
analog obsidian
latent cypress
#

but thanks for explaining

#

at least i know what to do after this

latent cypress
analog obsidian
#

well if you dont save every epochs then it's mostly random and you have to hear the epochs until you find the one you like the most

golden walrus
#

i only have these

latent cypress
golden walrus
#

cat_pawbite same applio, it's 1 8 3 something

analog obsidian
#

thats the compiled version

#

still has the old loss graphs

remote karma
#

hello, i downloaded okada but after extracting and opening the exe it opens up cmd but nothing happens after any idea on what I should do?

golden walrus
#

pepe_cry do you mind show me how can i download precompiled version?

analog obsidian
golden walrus
#

or a way to update

#

oh, okay, i need to test it out right ? these are just for reference ?

latent cypress
analog obsidian
#

after extracting it, run this

#

let it install the env, don't close the cmd until the installation is done

#

after that, run this and you can train like always

latent cypress
analog obsidian
#

which is most of the time, rvc learning the mute file

latent cypress
#

rvc training always got me feeling nervousyt_nails

#

guess i’m just being extra paranoid

golden walrus
#

cat_pawbite me too

#

i'm too new for this field

latent cypress
# analog obsidian why?

idk bro, i’ve been training since everyone was still using svc, so I shouldn’t have any reason to be nervous, but somehow, there’s always that bit of worry whenever I do this

#

i might’ve been training for a long time, but it was never really backed up with the right knowledge on how to do it properly

jolly pivot
#

I am having trouble with the Applio cloud it won't start applio. It will install but it will not start after installing it says error. any fixes

latent cypress
jolly pivot
#

Thank you

low shard
twilit forge
#

if "const" in schema:
TypeError: argument of type 'bool' is not iterable
An error occurred launching Gradio: When localhost is not accessible, a shareable link must be created. Please set share=True or check your proxy settings to allow access to localhost. i keep getting this when running apllio locally

simple ore
#

if the requirements have gradio >5.0.0, then can go for 5.23.1

twilit forge
placid holly
#

ah they Merged Voice Changer chat here

placid holly
#

may i ask the link and other stuff directed to W-Okada again, because no pinned message yet for it

analog obsidian
#

you have very high values in your graph

#

thats weird uhh
small datasets usually converge to a value of 35 (or even less than this)

analog obsidian
#

you were training without a pretrain?

analog obsidian
latent cypress
analog obsidian
latent cypress
analog obsidian
#

why are you using overtraining detector 😭 😭 😭

latent cypress
edgy tangle
#

Wut

latent cypress
#

should i start it again from the beginning?yt_nails

analog obsidian
#

ive asked him like 5 times to remove overtraining detector

#

and he always tell me "just dont enable it"

#

😭

latent cypress
#

🤣

edgy tangle
#

"If it doesn't work, just dont use it"

analog obsidian
#

he knows it's bad
i know it's bad
we know it's bad

#

yet still there

#

anyways that thing stops the training when the graph fluctuates a bit
which is normal during training

analog obsidian
#

reaching true overtraining takes a fk ton of time guys

#

around 1.5k epochs

latent cypress
#

but if i maxed out the smoothing, the graph seems going down

#

should i always maxed out the smoothing?

analog obsidian
#

avg loss50 don't need too much smoothing

edgy tangle
latent cypress
edgy tangle
#

looks overtrained

#

I think

analog obsidian
#

it stopped going down at 20k

latent cypress
analog obsidian
#

now the model is doing small adjustments

twilit forge
latent cypress
twilit forge
golden walrus
#

cat_pawbite wait, if i have a data that 39 mins long, it will get overtrained around 200 right ?

analog obsidian
#

guuuys

#

i told ya

latent cypress
analog obsidian
#

true overtraining happens at 2k epochs

analog obsidian
golden walrus
#

fork, i will train it to 200 and see then

analog obsidian
#

what are you seeing in your graph, is your model being close to converge

golden walrus
#

cat_blush thank you

analog obsidian
#

meaning it's stopping having meaningful improvements

#

when it reaches the point of flattening you can keep training for a bit (1 hour or 30 mins) and then you can start hearing the epochs
just choose one that doesn't sound robotic

twilit forge
latent cypress
edgy tangle
#

@analog obsidian Im right? yt_nails

latent cypress
analog obsidian
edgy tangle
#

yay im right

twilit forge
#

how long it supposed to take when i preprocess my dataset on applio locally

edgy tangle
#

how long is the dataset?

twilit forge
#

6 mins 30

latent cypress
twilit forge
#

im still on gtx

latent cypress
twilit forge
#

1660

edgy tangle
#

It shouldn't take more than 5 minutes

latent cypress
#

is that enough to train on 1660?

edgy tangle
#

uhh

#

depends on the vram

twilit forge
#

im on 32gb ram

edgy tangle
#

not ram, vram

latent cypress
#

it's 6gb right?

twilit forge
#

yeahj

#

it might be 8 lemme see rq

crude flame
twilit forge
#

yeah 6gb

edgy tangle
#

6gb it's not recommended, but should work fine with checkpointing

#

and for feature extraction shouldn't take too much time

latent cypress
twilit forge
#

its been 15 mins

analog obsidian
#

usually the graph stays in a flat line and then goes up

edgy tangle
#

for 6.5 minutes dataset

twilit forge
edgy tangle
#

How many cpu cores did you set?

twilit forge
#

6

edgy tangle
#

set it to 1

analog obsidian
#

maybe you've got out of vram

twilit forge
edgy tangle
twilit forge
edgy tangle
#

And becomes too slow

twilit forge
#

do i need to restart the preprocess when setting it to 1

analog obsidian
twilit forge
#

or just let it keep going

latent cypress
analog obsidian
#

restart the whole thing and use only 1 cpu core

twilit forge
#

bet

#

and do i need to put vocals in zip file

analog obsidian
#

nop

edgy tangle
#

local?

analog obsidian
#

put them in a folder

twilit forge
#

yeah ok

latent cypress
edgy tangle
#

If it is just a single file, use dataset creator hehe

analog obsidian
#

you can also put the folder in assets > datasets

edgy tangle
#

yeah

twilit forge
#

i did

blazing solar
#

Guys how to use kaggle?? Im newbie

twilit forge
#

i already sectioned out the dataset on audacity

#

do i keep audio cutting and effects on

edgy tangle
#

uhh

analog obsidian
#

merge all audio files into a big one

#

and use this in audacity

#

after that use simple mode slicing

#

with these settings

twilit forge
analog obsidian
#

ah you're using the compiled version

twilit forge
#

yeah

analog obsidian
#

download main branch instead

twilit forge
#

the other version wasnt working

edgy tangle
twilit forge
edgy tangle
twilit forge
#

goat

analog obsidian
edgy tangle
#

Click code, then Download ZIP

analog obsidian
#

download the zip, place it in a non onedrive location

latent cypress
analog obsidian
#

after that, use run-install.bat

#

don't close the cmd until everything is installed

#

when it's done, you can then use run-applio.bat

analog obsidian
#

slicing does impact training a lot

latent cypress
twilit forge
#

okay ill let u know when its done

analog obsidian
#

mainline auto slicer tries to slice the dataset into 3s chunks too but most of the time fails

crude flame
twilit forge
#

are u saying on local traning its better to keep the file as 6 mins instead of slicing it?

analog obsidian
#

look

#

first have one audio file

#

if you have multiple, merge them into one

latent cypress
analog obsidian
edgy tangle
analog obsidian
#

then use simple slicing with the default settings

analog obsidian
#

it's very important that the silence is truncated with the settings i sent

edgy tangle
#

If you keep the silence, it will take more time at training, trying to learn about the silence (not good for gpu's health yt_nails )

latent cypress
#

is there a way to automatically merge this audio? it's a lot

analog obsidian
#

merge them in audacity

edgy tangle
latent cypress
edgy tangle
latent cypress
#

all i have is sliced audio

analog obsidian
#

just load everything into audacity

latent cypress
analog obsidian
#

select all tracks and use this

#

align end to end

#

export it as a single file

#

then open it, run truncate silence

#

and train that

latent cypress
#

pretty sure my pc would straight-up roast me if it could talk, all this non-stop trainingmisc_lets_fucking_go

twilit forge
#

what should i set the silent training files too?

edgy tangle
#

Leave it at default

balmy blaze
#

Pls help me with kaggle

twilit forge
#

1000 epochs for my file should be good?

edgy tangle
#

6.5 minutes dataset right?

twilit forge
#

yeah

edgy tangle
#

set it at 300-350

blazing solar
#

What should i do after this?

twilit forge
#

and do i need to turn checkpoint on

edgy tangle
#

yes

#

4 batch size

edgy tangle
twilit forge
#

cache dataset gpu?

#

off

blazing solar
latent cypress
#

tf i've just added to my dataset😭

edgy tangle
latent cypress
edgy tangle
#

just follow the guide

edgy tangle
twilit forge
edgy tangle
#

try to put epoch saving at <10

blazing solar
edgy tangle
#

like 5 or 1, so you can find the best epoch precisely

twilit forge
analog obsidian
#

you can only find the best epoch saving every 1

edgy tangle
analog obsidian
#

every 5 or more it stops being precise and you're just using random epochs

edgy tangle
burnt saffron
#

curious if anyone knows a fix or if its just an issue with the dataset itself?

analog obsidian
edgy tangle
burnt saffron
#

alright thanks

edgy tangle
#

but for 10-15 minutes dataset its not "recommended"

analog obsidian
#

a problem with increasing the batch size is the lack of generalization in the model

#

so keep in mind that

burnt saffron
#

is there a specific batch size youd suggest?

edgy tangle
#

4

analog obsidian
#

try 8

edgy tangle
#

but if you want dataset fidelity, then 8

analog obsidian
#

noobies recommend 4 for everything below 30 mins

edgy tangle
#

at the cost of worse generalization

blazing solar
edgy tangle
burnt saffron
analog obsidian
#

the voice will sound even closer to the dataset

blazing solar
edgy tangle
twilit forge
#

is there any other setting i should play around with when traning the dataset?

blazing solar
twilit forge
#

im saving each epoch as 1

analog obsidian
edgy tangle
crude flame
#

😭 i dont think adding 33 more minutes to the set helped my model

still need to mess with settings to see if i can get a better model

blazing solar
twilit forge
analog obsidian
edgy tangle
twilit forge
analog obsidian
#

but your model will have bad generalization

crude flame
analog obsidian
twilit forge
edgy tangle
crude flame
#

ill try 12

analog obsidian
#

when trying to inference things too different from the set

edgy tangle
twilit forge
#

oh i get u

analog obsidian
twilit forge
#

would batch size 6 maybe help work around the problem'

analog obsidian
#

lower batch sizes are able to do more stuff but usually they start to sound different

edgy tangle
#

and sound horrible at inference when you use a voice too different from your dataset

twilit forge
#

okay thank u

latent cypress
edgy tangle
analog obsidian
latent cypress
#

oh i see

edgy tangle
#

yeah

#

the model will noticeable die

analog obsidian
#

lower batch sizes generalize better but sometimes the voice tend to sound a bit different

blazing solar
edgy tangle
#

not just sounding robotic

edgy tangle
#

and set it there

peak shale
#

my bum itchy

twilit forge
analog obsidian
#

bigger batch sizes are more accurate to the dataset, but the model loses a lot of generalization

blazing solar
latent cypress
edgy tangle
peak shale
blazing solar
edgy tangle
crude flame
edgy tangle
blazing solar
crude flame
edgy tangle
peak shale
#

i trained my dog to sit down so

twilit forge
latent cypress
edgy tangle
analog obsidian
#

@latent cypress @edgy tangle @twilit forge bs16 = batch size 16

bs4 = batch size 4

edgy tangle
peak shale
edgy tangle
latent cypress
analog obsidian
edgy tangle
peak shale
#

yo how do i vocal remove vocals

latent cypress
#

ah

twilit forge
peak shale
#

isaak this is a real problem

twilit forge
latent cypress
peak shale
edgy tangle
#

with bs4 it generalizes better and try to fake the voice to reach that note

blazing solar
edgy tangle
twilit forge
edgy tangle
#

just copy it and paste it on kaggle

burnt saffron
edgy tangle
burnt saffron
#

NO

edgy tangle
#

what

blazing solar
peak shale
#

i wanted the nair video but it didn’t work

edgy tangle
#

bro

blazing solar
latent cypress
#

i thought it was nair videoskull_sob

peak shale
#

it could’ve been

#

stupid discord

edgy tangle
#

the picture i sent to you its the kaggle page

crude flame
edgy tangle
#

just copy authtoken and then paste it on kaggle

peak shale
#

guys who is this?

lunar perch
#

i try using f5-tts with german language.
download "marduk-ra/F5-TTS-German" from huggingface.
set the path to the files in the f5-tts webUI.
but the output audio sound like it is spoken reverse or maybe chinese.
any idea what i did wrong ? 🙏

blazing solar
burnt saffron
peak shale
#

hey

#

i know you

lunar perch
#

i see, this discord is more for kids 😂
my fault 🤦‍♂️

peak shale
#

boy

burnt saffron
#

no way this mf just left the server wtf

peak shale
#

we made him leave boys 😎🥶

blazing solar
edgy tangle
blazing solar
edgy tangle
#

read it

blazing solar
edgy tangle
#

You need to install first

#

or it will not work

blazing solar
#

Install what