Sharing with my personal pretrained model with everyone, now in public beta ||English or Spanish?||
Dataset:
- Size: 1921 hrs of speech & vocals
- Languages:
- Arabic (~70 hrs)
- Chinese (Mandarin) (~70 hrs)
- English (~800 hrs)
- French (~42 hrs)
- German (~35 hrs)
- Hindi (~30 hrs)
- Indonesian (~53 hrs)
- Japanese (~140 hrs)
- Korean (~80 hrs)
- Portuguese (~40 hrs)
- Russian (~188 hrs)
- Spanish (~200 hrs)
- Tagalog (~30 hrs)
- Singing (All) (~190 hrs)
- Common (Unknown)
- Sampling Rate: 32kHz (done) / 40kHz (retraining)
Models:
Base Model: for fine tuning
- Data: 1921 hours (low-mid quality)
- Steps: 3,890,220
- Batch: 40
- Precision: FP32
- Sampling Rate: 32k
- *RMVPE **
Fine-Tuned Model: for regular models
- Data: 102 hours (high quality)
- Steps: 2,854,856
- Batch: 20
- Precision: FP32
- Sampling Rate: 32k
- *RMVPE **
Hardware:
- CPU: AMD EPYC 9754
- RAM: 256GB
- GPUs: 1x H100, 4x L40s, 1x RTX 4080, 1x RTX 4070 Ti
Links
https://huggingface.co/MUSTAR/Rigel-rvc-base-pretrained-model
Rigel Base model (32k) - https://huggingface.co/MUSTAR/Rigel-rvc-base-pretrained-model/tree/main/Rigel_32k_Base_and_FineTuned/Base-model_32k_fp32
Rigel Fine Tuned (32k) - https://huggingface.co/MUSTAR/Rigel-rvc-base-pretrained-model/tree/main/Rigel_32k_Base_and_FineTuned/FineTuned-model_32k_fp32
Nanashi ft on Rigel base #1254252587973083187 message
(little note, do not use 40k version till it retrained)
Credits
- 0x2E
- Aleks don Pedro
- Blaise
- Eugene Starky
- Leo_Frixi
- Litsa_the_dancer_UwU
- Mikhail
- Player1444
- Prosto Dead Artem
- RomanKrukovsky
- SCRFilmsE
- Shirou
- Сергей Electrik
- Warlock700
- 서울스트리밍스테이션
(if i forgot to mention someone Thank you and I'm going to remind you in advance that I'm sorry and i apologize for the inconvenience of me forgetting to put you in the credits tab)
(no tests for now, sorry currently doing them)


