Rate the model
To rate this model, either use /rate or click the buttons below.
1 messages · Page 1 of 1 (latest)
To rate this model, either use /rate or click the buttons below.
didnt collapse with that small batch size?
no
wow
36 mins of data, batch 4 and no collapse
tho idk if it generalized better than batch size 8
i trained two 1 hour models with batch size 4
yes
it sounds pretty good to have bg noise tho
the inference audios, not sure about the dataset but probably the dataset may have slight bg noise
tho lyery is like experimenting on "non-cleaned" audios i guess
I wonder if they make the model have less mess ups
using mel denoise aggressive may help a bit
tho it isnt that messed up, for me its good even tho it has bg noise, tongue clicks and mic noise (?)
both dataset and inference audios has background noise
allows the model to handle background noise better
i used spectral denoise
btw mel denoise aggressive removes more things besides white noise
aufr33 confirms it
it still may help a bit i guess?
i delete sound effects manually
i don't use any ai denoisers in my models
if you want to leave background noise in your dataset make sure is barely audible
Bro I need to do this so bad because my models are so sensitive to noise that it's hard to make samples for them. Also at what volume should you listen to your dataset to see if the noise is barely audible?
like this
So if the noise is under -70 db it's fine?
after normalization the volume is going to increase to this value
yes
this is after rvc normalization
don't go over -63db
Would it be a good idea to add artificial white noise or does it have to be real noise
Artificial meaning using the signal generator and adding white noise everywhere and real noise meaning the noise that's in the dataset before cleaning
wouldn't adding noise damage it
If it's quiet nah
has to be real noise
ai can still detect it tho :p
no, rvc can clone noise
Rvc expects noise
oh
Thanks to the most noisy pretrain
which is bad apparently since models used to get taken down ruthlessly for it
loud noise is bad for models
but yes i also agree models with slight background noise shouldn't be taken down
as they can inference noise better
(like this model)
also too much denoising damages the dataset
I have been conditioned to remove noise from datasets no matter how small because of that bot that takes your models down for the slightest flaw
how so?
also i didn't de-ess this dataset
I don't do that either
so both de-essing and denoising is not mandatory for models, thats just ai hub nonsense
damages the dynamics of the model, which is the most important part of the dataset
dynamics?
makes the audio sound a bit muffled
thats bad for rvc, makes gradients go crazy
and the end result is the model having volume issues
what kind?
the model randomly sounds very low then very high
I've experienced that with pitch sometimes but never volume
yes bc that problem only happens when your dataset is full of damaged dynamics
so as long you denoise good enough, is going to be fine
ah
if i were u i would train that model on 800 epochs or 1000 with batch size 12
tho sometimes harsh esses may be bad if u dont de ess
good for ya :p
ye
this dataset had harsh esses
wow
rn im in a bad mood and bad day :p
thought that esses sound bad, tho iirc bad esses may be caused by overfitting
bad sibilance happens when the dataset is too small or not diverse enough
oh, so it's not because of overfitting
it is overfitting, small datasets overfits extremely fast
unstable training also causes bad sibilance
I just use the denoise on MVSEP by aufr33
it's pretty good since it basically removed the horrible mic noise from the hl1 scientists
the aggressive one?
ye
I do that all the time, only because I lazy and just copy my last post since most use same epoch count
I forget to change stuff sometimes bc excitement to post it