#Horizontal g/total
1 messages · Page 1 of 1 (latest)
dataset size and batch size?
the graph looks bad
The spikes are RVC learning silence
the mute file shouldnt appear that often in the graph, its bad
Watch him set the max amount of mutes 
dataset 26 min and batch size 8
Is the silence truncated?
yes
with this?
Are you using the default amount of mutes?
I used strip silence in Studio One but basically the same configuration
a piece of the dataset
looks like the model is learning silence
maybe something went wrong during the preprocessing? how many slices you got?
Did you touch mutes at all?
no
I trained another voice last week in the same way and everything went well
Do you think there's a lack of silence?
Those spikes are due to silence
maybe there's too much silence
take the non-truncated version of the dataset
then truncate it using audacity, not studio one
using these settings
okay
did with audacity
with smoothing turned on it wasn't even falling before, now it is, but I still don't know if it's right
ignore the loss g/total graph, is not precise
focus only in the loss avg50 one
so it's good now?
hmm seems to be less noisy than before
also use smoothing 0.5
for more accurate reading of the avg50 graph
breaths are good
model needs them to learn how to reproduce breathing
I have a question, I'm using pretrained from the same language as this dataset. Could this influence anything? The vocal I trained last week was 15 minutes long
Maybe because this is almost 30 minutes long
bigger datasets will be always better than smaller datasets
graphs don't need to be extremely smooth either
something in between
Great then, thanks for your help