#hey

1 messages · Page 1 of 1 (latest)

crimson wharf
#

how many and how long each wav file of vocal should be long for making custom model

terse crane
#

Above 5 mins is enough, though you can try and get to 30-45mins dataset. You can just stack the voicelines in one file and it trains just fine

#

you may consult

#

-audio

sturdy runeBOT
# terse crane -audio
📚 Audio Guides
  • Perfecting Audio Isolation on Low-End Rigs: A Practical Guide, by Litsa The Dancer and Faze Masta Google Docs
  • Gathering and Isolating Audio, by SCRFilms Google Docs
  • How to make a good model All-In-One guide, by LUSBERT lusbertmoment Rentry
  • Creating Datasets for RVC using iZotope RX, by LUSBERT lusbertmoment Rentry
  • Vocal Mixing Tutorial, by Roomie YouTube
🛠️ Tools
crimson wharf
#

thx

crimson wharf
#

10-20 sec vids each for 10 min is ok?

#

or it need be same lenght each vid

muted lotusBOT
#

Ayo? @crimson wharf level 3 !!! lfg

crimson wharf
#

btw this is from 2 different song same person it sound diff is that ok?

terse crane
#

Hmm, I think it should be fine, though you should denoise more via the Karaoke models, noise gate, cut out the noisy parts with tools like Audacity