#Korean Female Voice Model Pro (Epoch 1000 / RVC2)

1 messages ยท Page 1 of 1 (latest)

regal forge
#

This model is newly developed, incorporating a novel recording method and having undergone dataset cleaning. It has been refined to handle expressions in Korean, which are challenging to implement in previous models. This model is designed for use in Korean speech-to-speech, voice changers, music covers, and more.

์œ„์˜ ๋ชจ๋ธ์€ ์ƒˆ๋กœ์šด ๋…น์Œ / ๋ฆฌ๋นŒํŠธ ๋ฐฉ์‹๊ณผ ๋ฐ์ดํ„ฐ์…‹ ํด๋ฆฐ ์ž‘์—…์„ ๊ฑฐ์นœ ๋ชจ๋ธ ์ž…๋‹ˆ๋‹ค. ํ•ด๋‹น ๋ชจ๋ธ์€ ๊ธฐ์กด ๋ชจ๋ธ๋“ค์— ๋น„ํ•ด ๊ตฌํ˜„์ด ์–ด๋ ค์šด ํ•œ๊ตญ์–ด์˜ ํ‘œํ˜„์ด ๊ฐ€๋Šฅํ•  ์ˆ˜ ์žˆ๋„๋ก ๊ต์ •๋œ ๋ชจ๋ธ๋กœ ํ•œ๊ตญ์–ด ์Šคํ”ผ์น˜ 2 ์Šคํ”ผ์น˜, ๋ณด์ด์Šค ์ฒด์ธ์ €, ์Œ์•… ์ปค๋ฒ„๋“ฑ์— ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋„๋ก ์ œ์ž‘ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Having fun! ๐Ÿ™‚

Information :

Model :
Model Name : Korean Female Voice Pro

Train info :
RVC 2
1k Epochs
Batch size per GPU : 40

Dataset - source Info :
22 mins of speech
12 mins of Sing

Dataset - source correction :
KLM V7 Pretrained

Model Link-
https://huggingface.co/SeoulStreamingStation/IU-Voice-MultiLanguage-V1/resolve/main/SSSFemaleVoicePro.zip?download=true

ripe ospreyBOT
#
Rate the model

To rate this model, either use /rate or click the buttons below.

ripe ospreyBOT
hollow hemlock
#

Where do I get KLM V7 pretrain

#

I'm thinking of making a GI Kirara Korean model

#

CV: Kang Eun-ae

serene canyon
blissful lion
#

simply put they took a bunch of high quality audio, cleaned it up and trained a pretrain with it

#

the reason why this sounds clean is because

  1. it's a language u dont understand
  2. the audio which was fed to RVC was high quality
  3. high batch size which usually makes learning more stable but ofc limits the model a bit more
#

so to answer ur questions, no this isnt some ultimate pretrain they made

serene canyon
#

oh okay thought it was a public avaible pretrain lol

blissful lion
#

maybe it is, who knows

reef elm
regal forge
#

I woke up to find that a lot of people had left comments. Thanks for all the kind explanations. As Lisa mentioned, KLM is based on materials published by the Korean Language Research Institute and is composed of pretrained data. It specifically aims to generate sentences that can produce various sounds connecting words, focusing on learning and implementing various irregular sounds. This is not an open model yet, so it can't be downloaded. Separately, as many experts here have mentioned, regardless of the pretrained model, the quality of audio or its quality ultimately depends much more on the cleanup work of the dataset to be trained. We're trying various methods for restoration and studying hard to create a model of the highest quality possible.

river vaporBOT
#

Ayo? @regal forge level 6 !!! lfg

regal forge
#

Thanks again for your interest! lusbertmoment