Model Info :
- Architecture : RVC Arch.
- Vocoder : NSF-HiFiGAN
- Sample Rate : 32kHz
- F0 Extraction : RMVPE
- Embedder : ContentVec
- Pretrain : KLM KPU II
- Precision : FP32
Training Details :
- Epochs : 180
- Dataset Length : 18m 50s
- Batch Size : 4
Note :
I combined studio and live vocals in this model because the studio vocals alone felt too short. If the output quality sounds bad, try lowering the feature ratio to 0.2.
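
For anyone wondering what the feature ratio does, here is a rough sketch. It assumes the ratio is a simple linear mix between the features taken from the input audio and the features retrieved from the model's index, which is how this setting is commonly described for RVC. The function name and the toy numbers are made up for illustration and are not part of any RVC API.

```python
def blend_features(original, retrieved, feature_ratio=0.2):
    """Mix two feature vectors (hypothetical helper, not an RVC function).

    feature_ratio = 0.0 keeps only the input features,
    feature_ratio = 1.0 keeps only the features retrieved from the index.
    """
    if not 0.0 <= feature_ratio <= 1.0:
        raise ValueError("feature_ratio must be between 0 and 1")
    if len(original) != len(retrieved):
        raise ValueError("feature vectors must have the same length")
    # Weighted average, element by element.
    return [
        feature_ratio * r + (1.0 - feature_ratio) * o
        for o, r in zip(original, retrieved)
    ]


# Toy values only: at 0.2, the result stays close to the input features.
input_feats = [0.0, 1.0, 2.0]
index_feats = [1.0, 1.0, 0.0]
mixed = blend_features(input_feats, index_feats, feature_ratio=0.2)
```

Under that assumption, a lower ratio leans less on the mixed studio and live training data and more on the audio you feed in, which may be why it helps when the output sounds rough.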