Huh even though there is a decent amount noise in the dataset (and in the inference audio) the model didnt turn out to bad.
Info:
- Batch Size: 6
- Dataset Length: 30 Minutes
- Pretrain: Default
- Precision: FP32
- Sample Rate: 32K
Please credit me if used 🙂
📥 Download: https://huggingface.co/Razer112/Public_Models/resolve/main/Riko_Solari.zip?download=true