My GPU does not work with Python 3.10 or above, so I am using Mainline RVC rather than Applio. I can only use the standard HiFi-GAN vocoder, not RefineGAN.
My training dataset covers the low, mid, and high pitch ranges in singing, but it contains relatively little high-pitch material.
I'm hoping to use a pretrained model that already covers the high pitch range, so that after training, inference on high-pitched vocals is covered and the result has no electric-sounding artifacts.
So far, I have found that KLM 4.2 by Seoul and SingerPretrain by Sztef both cover high pitch. However, both models are available in 32K only.
My inference target is singing that is roughly 1/3 English, 1/3 Chinese, and 1/3 Japanese, at the standard 40K sample rate in Mainline RVC. Which pretrained model would be the best choice for achieving this high-pitch coverage?