# Reducing Mode Collapse


late marlin
#

Creating voice models is new to me, so I want to learn more about mode collapses.
How does a mode collapse affect the end result of a model? How many mode collapses should one expect? Some say a few are fine; others mention discarding their models after seeing one. How does batch size play a part in this, and how can one alter one's dataset to prevent this?

formal ore
# late marlin Creating voice models is new to me, so I want to learn more about mode collapses...

It seems like your model flatlined way too soon, so I believe it is more of a batch size issue than a dataset issue.

To check if it's a dataset issue, try training the model with a batch size of 4. If this doesn't solve it, then it is most likely a dataset issue.

If it is a dataset issue, it could be either low-quality audio (though I'm not too sure about this; I've only heard it from one source) or badly trimmed silences.
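For the silence part, here is a rough sketch of how one might flag clips for review. It assumes "badly trimmed" means long silence left at the start or end of a clip, which is only one possible reading. The function names, the 0.01 amplitude threshold, and the 0.5 second limit are all made up for illustration and do not come from any particular training tool; samples are assumed to be floats in [-1.0, 1.0].

```python
def edge_silence(samples, sample_rate, threshold=0.01):
    """Return (leading, trailing) silence in seconds.

    A sample counts as silent when its absolute amplitude is below
    `threshold` (an assumed value, tune it for your recordings).
    """
    n = len(samples)
    # Index of the first sample that is loud enough, or n if none is.
    first = next((i for i, s in enumerate(samples) if abs(s) >= threshold), n)
    if first == n:
        # The whole clip is silent: report all of it as leading silence.
        return n / sample_rate, 0.0
    # Number of silent samples counted back from the end of the clip.
    tail = next(i for i, s in enumerate(reversed(samples)) if abs(s) >= threshold)
    return first / sample_rate, tail / sample_rate


def flag_clip(samples, sample_rate, max_edge_seconds=0.5):
    """True if either edge has more silence than `max_edge_seconds` (assumed limit)."""
    lead, trail = edge_silence(samples, sample_rate)
    return lead > max_edge_seconds or trail > max_edge_seconds
```

Running `flag_clip` over every clip in a dataset would give a list of files worth listening to by hand. It only looks at the edges, so it will not catch long pauses in the middle of a clip.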

late marlin
#

Thanks for the help. Is it always better to train on low batch sizes? What does a badly trimmed silence look like?

formal ore
late marlin
#

Is that a bad decision?

#

Also, is it better to leave coughs and such in datasets? I usually leave them out.

lost onyx