#Overtraining?
1 messages · Page 1 of 1 (latest)
g/total is just average iirc. check specially g/mel and g/kl (i may summon lyery about the g/kl thingy xd)
Yea not overtraining ig.
How to know if its overtrained
How can you know
Idk
tensorboard
I know but how can you tell by the graph
if g/mel or g/kl rise up, even if both rise up, stop training and export the weight that's similar to the lowest point before OT
or keep training to see if the issue continues
So g/total isn't the one that I should look at
Probably yeah, it's just average of model
That doednt even explain anything
The g/mel g/kl are going down but g/total is going up
g/total is an avg that is true but mel and kl are always gonna improve, there's no point of watching those
is going up forever?
like not wanting to go down again
Yeah i think so
can u post the graph here?
the one u believe is overtraining
yup seems to start overtraining
u can wait a bit more to see if it goes down again or stop training now
pick weight files around 46k
tho i advise to stop training now since even if it goes down, the improvement will not be worth the wait (barely/not noticeable at all)
Thats what i suspected too but people told me to check g/mel and g/kl
Its still training so im gonna wait a bit
those are always gonna go down, they only go up if something went is very wrong (super extremely rare cases)
I see
mel is the clarity of your model, kl is how well the model learns the voice
g/total is the avg of fm, mel and kl
it shows u the real improvement of the model
What's fm?
feature matching
mostly the timbre and how natural your model will sound
Thank you for explaining it so well! I really appreciate it!
Also, do I pick the model before the lowest point or do I pick the one that is the lowest point
ideally pick the lowest point, if you can't pick the lowest point itself use the most close to it before overtraining
mostly you dont need to train for more than 500 or even 1000 epochs once you starts seeing the overtraining point
The model is at 803 epochs now
Wish that there was a way to detect overtraining faster
automatic overtraining detectors are unreliable yea 
I saw a lot of graphs of people where the lines weren't going crazy up and down like mine
but after you train more models is going to be easier to spot overtraining
this is caused by batch size
Yeah this like my 5th model
Really? How does it affect the model?
too low causes crazy graphs like u said
too high causes very smooth graphs
again I said you could pick some weight files based on the optimal/overtraining point, also you can stop training anytime
for rvc training use batch 8, is a balance between the two
very safe and works
ow i see, anything below 8 gives you those crazy graphs
I use batch size 2
Didn't really know if using a lower batch size had a bad impact on the voice of the model
this is very unstable and not recommended for training, might degrade the model perfomance as well
it doesn't really impact the quality, is more of a stability setting
when the graphs are unstable, things can go wrong
like your model overfitting too fast
or robotic feeling in the voice
if your gpu vram is too low or weak, you can opt for some colab/kaggle notebook that uses T4 with 15 gb vram that surely enables to use batch size 8-16
in simple words, your model can't really improve too much
have you tried hina's mainline colab?
I haven't
https://colab.research.google.com/github/hinabl/RVC-Online/blob/main/Mainline_Colab_Full.ipynb
don't use custom pretrains as they have multiple issues the original doesn't have
and disable extra
no need to run the theme loader cell
you first run these two
after that you put your ngrok token here
when you do that, you can now run the start running cell and it should work fine
this colab uses google drive as their file storage
you can upload your dataset's folder into google drive
your dataset location would be something like this /content/drive/MyDrive/mydataset
exactly, is going to use google's gpu
which can do batch 8
what are you saying is wrong. u can't predict a graph, silly.
it was always a requirement to check g/mel and g/kl
iirc the guides even said that you needed to check these graphs
if you're training for mel yeah
if you're training for g/total nop
coz lowest g/total already has best mel
g/total is just average tho
we check g/mel and g/kl, you can't predict graphs without seeing them 🙂
Does that mean that i can also set it up on my phone
yea it can run in your phone
i have no idea how to behaves there tho, on pc if you close the tab, it ends* your current session
Also since it doesnt use my gpu it means that i can do other stuff too since it doesnt hinder the pcs performance
yep
Ayo? @radiant sentinel level 11 !!! 
is what is needed to generate the rvc links
the GUI and the tensorboard links
after u made ur account you go here https://dashboard.ngrok.com/get-started/your-authtoken
ngrok is the fastest way to put anything on the internet with a single command.
copy this
u paste* the authtoken in this
Do I need to run the cel everytime I want to train a model
And does continuing training a model work the same as local rvc
if u stop your session, yes, you need to repeat the process again
google saves your ngrok token so you don't need to paste it again, just need to run the 3 cells again and you'll be fine
is going to continue training until the session stops (google has free daily limits)
around 2~4 hours
the colab is still going to save your model progress, if your training stops because you hit your daily limit, you can continue training later
kaggle has 12 hours worth of daily limit and a 30 hour free weekly
is a bit complicated to use but we have a very good tutorial explaining how to use it
so in case you're not happy with google's daily limit
i'll leave the kaggle tutorial here
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
Explore and run machine learning code with Kaggle Notebooks | Using data from No attached data sources
This guide for Mainline Kaggle is an alternative option to the Mainline Colab notebook for training voice models
It is complete and should walk you through every step of the way since Kaggle has a difficult learning curve. However, it will be updated constantly to go over parts that need more cla...
the tutorial also teaches you how to do the ngrok part as well
Thank you sm!
Do I uncheck custop pretrains when running every cell
Just don’t run it
U can ignore it
It gives me an error when i start running the tab with the ngrok_token
Nevermind i got it
When i open the url it says page not available
can you post a screenshot of the error here?
So i cant use the file that on in my file explorer on my pc
Does it have to be on google drive?
yes, only on google drive
I'm confused. I used a file that's on my pc and it's training
Oh wait
Its says error
Lol

But how do i get the google drive path file
first drag and drop your dataset's folder into google drive
or you can create a new folder there and put your wavs inside, both work
your path is /content/drive/MyDrive/mydataset
with mydataset being the name of your folder of course
so if you named your folder "dataset" the path would be /content/drive/MyDrive/dataset
For me dataset is in a folder called training
/content/drive/MyDrive/training/nameofthedataset
if you put your dataset inside the datasets folder then it would be /content/drive/MyDrive/training/datasets/dataset
or if you didnt create a folder inside the datasets folder
just do /content/drive/MyDrive/training/datasets
what if you try training/datasets/nameoffile instead
It still says no such file directory
your dataset is a zip file, folder or only the audios?
zip files doesn't work for example
ohh ok so only do this /content/drive/MyDrive/training/datasets
try if that works
don't put the name of your audio at the end
copy and paste what i sent
It worked!

Batch size 8 right?
yep
I used to use a custom pre trained model but are the og models better and should i not change it
use the og models, they're better
don't change them
Wait i did everything and pressed train and it gives me an error
yea the colab is a bit buggy, go to the colab tab
see if its training there
should be saying epoch 1 or something like that
wat, you started training just now? if u started now it should start from 1
you can now open the tensorboard link that was generated when you ran the cell
scroll up in the start running cell until you find it
Yes I see it
nice, now you can let it train without problems
remember to not close the google colab tab
The rvc tab still says error but it is training right?
yeah its training don't worry, the gui is a bit broken but the training is perfectly fine
I refreshed the tensorboard but its just loading
hmm try reloading the tensorboard site once more (not google colab)
is a bit laggy compared to local tensorboard because is running on the internet and not locally
so if you're having some internet issues is going to be a bit slower
I don't get it already saved 42 epochs to my drive folder
Even tho colab says that it just saved epoch 2
And the epochs saved are folders
you have 42 .pth files?
folders not pth files
thats so weird, never happen to me
rvc saves two things
G and D, and .pth
it shouldnt create new folders
yes, pth files are saved in the weights folder
Their all folders
ooohhh you're double clicking them
yeah pth files are actually folders
but if you download them, they're going to download as .pth files
Oh yeah you're right
to download them, right click the .pth file and click download
I did
nice
those epochs are from an older unfinished training of yours
the epoch 13 is actually this training
But i never trained this dataset
probably is from other one
maybe you tried training on colab before?
so i deleted everyhting and also the models in the dataset folders and logs but they keep saving new epochs in the wieghts folder
yea its supposed to do that, is saving your epochs at the frequency u set
Oh no so it's not gonna stop until it reached 1000 epochs!?
yup lols, remember to delete the epochs you're not going to use to save space
2 hours
Does a higher batch size result into a faster training time
yes but too high limits what your model can do
so is less versatile
its a bit complicated but the guide covers pretty much everything u need to know about kaggle
Ayo? @whole anvil level 46 !!! 
I got kaggle to work Yay!
@whole anvil I've used different languages in the dataset is that okay?
As long is the same person’s voice, yep
yeah i was just asking because i saw that some pre trains are more suited for specific languages but then again I don't think i can change the pre trains on kaggle and google collab
You can train any language on any pretrain, don’t worry, is going to work fine
For example, the original pretrain is trained in english and you can still train different languages with it
Alright! Thanks!
@whole anvil i just have one question. How do I know which target sample rate to choose
No
rvc disconnected is not working for me
