#DMR V1 | Mangio-Crepe / English / A Fine Tuned Pretrain For Mommy Voices ~


silver tusk
#

DMR Pretrain V1

DMR Download D: https://huggingface.co/Razer112/DMR_Pretrain/resolve/main/D_DMR-V1.pth?download=true
DMR Download G: https://huggingface.co/Razer112/DMR_Pretrain/resolve/main/G_DMR-V1.pth?download=true

The DMR pretrain aims to improve e-girl voices and deep male voices.

Dataset Info

  • Length: 11.3 Hours
  • Total Voices: 22
    • 16 Female voices
    • 6 Male voices

Model details

This was trained in Applio on an RTX 4060 Ti 16 GB with a batch size of 12, using Mangio-Crepe pitch extraction at a hop length of 32 and a sample rate of 32k.

Samples:

https://discord.com/channels/1159260121998827560/1255616722480926860
https://discord.com/channels/1159260121998827560/1254834021368598608
https://discord.com/channels/1159260121998827560/1256387296677072988

Special Thanks

I would like to give a BIG thanks to @vestal dawn for helping me finish the dataset for DMR. He was a massive help ❤️

How to use:

  1. Download the G and D .pth files from Hugging Face
  2. Place them in the pretrained_v2\custom pretrained folder
  3. Paste the name of the pretrain into the pretrains box
  4. You're ready to make cringe voices!
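
Steps 1 and 2 above can also be scripted. The following is only a minimal sketch: the two URLs are the ones from this post, but the function names and the destination folder argument are hypothetical, so point `dest_dir` at wherever your own install keeps custom pretrains.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve

# Download links copied from the post above.
PRETRAIN_URLS = [
    "https://huggingface.co/Razer112/DMR_Pretrain/resolve/main/D_DMR-V1.pth?download=true",
    "https://huggingface.co/Razer112/DMR_Pretrain/resolve/main/G_DMR-V1.pth?download=true",
]


def plan_downloads(dest_dir):
    """Pair each URL with the local path it should be saved to."""
    dest = Path(dest_dir)
    # Taking the name from the URL path drops the "?download=true" query string.
    return [(url, dest / Path(urlparse(url).path).name) for url in PRETRAIN_URLS]


def download_pretrain(dest_dir):
    """Fetch the D and G files into dest_dir, skipping files that already exist."""
    saved = []
    for url, target in plan_downloads(dest_dir):
        target.parent.mkdir(parents=True, exist_ok=True)
        if not target.exists():
            urlretrieve(url, str(target))
        saved.append(target)
    return saved


# Example call (the folder is an assumption, adjust it to your setup):
# download_pretrain("path/to/your/custom pretrained folder")
```

Nothing is downloaded until `download_pretrain` is called, so `plan_downloads` can be used first to check where the files would end up.
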
echo roost
#

First

unreal plank
#

Does this pretrain work well with whispering?

silver tusk
#

no

#

no pretrain can make whispering work

#

it's an RVC issue

oblique socket
#

rmvpe could improve but mangio crepe is great tho

stone garnet
#

I am afraid of this pretrain

solemn mirage
#

I tried using it but RVC v2 Disconnected gave me an Error

solemn mirage
# silver tusk What error?

I can't remember sadly, but I tried to use custom with both links linked to the G and D
And it wasn't able to use that, so I had to switch back to original to be able to train my voice earlier

silver tusk
#

Maybe you accidentally pasted the G link into D

solemn mirage
#

Hmm, I made sure that I didn't and looked 3-4 times before proceeding

silver tusk
#

Yeah I just tried it and it works

solemn mirage
#

I used Google Colab RVC v2 Disconnected

silver tusk
#

I just tried it on there

solemn mirage
#

Hmm idk now

#

I put links there, maybe that was wrong

#

But it's fine

oblique socket
#

Is it possible to do pretrains on cloud?

#

I always get errors on Kaggle and Colab for being AFK

oblique socket
#

How?

silver tusk
#

Just train them like any other model

#

I suggest Kaggle because it has more GPU hours than Colab

oblique socket
#

U can use around 6 hours i guess

#

or 3

silver tusk
#

On kaggle I get 30

silver tusk
#

Yeah

oblique socket
#

But i tried once and i got disconnected due to AFK

silver tusk
#

After how long?

oblique socket
#

Maybe i'll try one day

#

because I'm planning to do a Castilian Spanish pretrain

oblique socket
#

Yeah

#

I was training a Sonic model for 3 hours and i didn't get afked

#

But I think an average pretrain (50 epochs, 32k sample rate, 2 hours of data) should take more than 6 hours

oblique socket
#

50 epochs, batch size 16 and 32k sample rate support

oblique socket
#

If it undertrains i'll increase the epochs a bit or reduce batch size

#

Tho i wonder if people use YT to source datasets for a pretrain

oblique socket
#

Maybe if I can find hours of Castilian Spanish podcasts on YouTube I'll use them as the dataset source

silver tusk
#

That would be good

#

Just be sure it isn't super compressed

#

Also variety is needed so get like ~30 minutes of audio per person

#

And get a ton of people
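
For reference, the DMR dataset in the first post is 11.3 hours across 22 voices, which averages to roughly 31 minutes per voice. A rough way to check a new dataset against the ~30 minutes per person figure is sketched below. It assumes a hypothetical layout with one subfolder per speaker containing uncompressed WAV files; the function names and the folder layout are illustrative and not part of any RVC or Applio tooling.

```python
import wave
from pathlib import Path

TARGET_MINUTES = 30  # rough per-speaker figure suggested in the message above


def wav_seconds(path):
    """Length of one WAV file in seconds, read from its header."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def minutes_per_speaker(dataset_root):
    """Total minutes of audio in each speaker subfolder (assumed layout)."""
    totals = {}
    for speaker_dir in sorted(Path(dataset_root).iterdir()):
        if speaker_dir.is_dir():
            seconds = sum(wav_seconds(f) for f in speaker_dir.glob("*.wav"))
            totals[speaker_dir.name] = seconds / 60
    return totals


def short_speakers(totals, target=TARGET_MINUTES):
    """Names of speakers with less audio than the target number of minutes."""
    return sorted(name for name, minutes in totals.items() if minutes < target)
```

Calling `short_speakers(minutes_per_speaker("dataset"))` would list the speakers that still need more audio before training.
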

oblique socket
#

I'll use random voices, as long as they sound HQ

open stag
#

also, what is this pretrain fine-tuned on? The original pretrain?

open stag
#

ywlfg

foggy pumice
#

dominant mommy roleplay

silver tusk
#

I'm in shambles. I did some testing of my pretrain and found out that it adds noise to models but doesn't distort harmonics

normal falcon
#

and rmvpe didn't exist when he made it

silver tusk
#

did he use crepe?

silver tusk
normal falcon
# silver tusk did he use crepe?

i don't think so, looking through mainline updates it seems crepe got added very briefly as a pitch extraction method for a couple of weeks/months, then got replaced by rmvpe
v2 pretrains were already a thing tho
edit: the first part of this message is true, but crepe was never in a public release, the second part is false, rmvpe was already a thing when v2 pretrains were made

silver tusk
#

huh

#

time to use every pitch extraction to see if they make a difference

normal falcon
#

Oh wait
rmvpe was already a thing when v2 pretrains got released

#

since they deleted crepe off existence even before releasing v2
there's a high chance the og v2 pretrains use rmvpe xD

stone garnet
normal falcon
# stone garnet I wanna know the best one

mangio for high quality datasets that are properly cleaned, mangio hates any type of noise residual
rmvpe for everything else, can also work in high quality datasets

rmvpe sometimes adds a metallic sound to the model
mangio is less pitch accurate than rmvpe

marble prawn
#

Can you even imagine a pretrain for non voices

stone garnet
#

OV2 is the closest bc it works best from my experience with non voice models

oblique socket
#

Leo told me that hifigan is limited for custom pretrains
