# We've seen strong improvements from fine-tuning on just a few hundred short audio clips, each under 30 seconds. More data is better, of course.