#Alice Merton - No Roots, by Neuro.
1 messages · Page 1 of 1 (latest)
It was converted against the original voice, correct?
yeah
Lemme try something 
Want me to drop the original files?
Nah, I am good
(I've already sliced and separated the tracks)
Thanks
Oh, you posted this here, cool
Is this confirmed information or just what you assume, since you didn't make the model?
That's the guy I got the RVC from btw
And I got it from wispers, I made the dataset
It seems I have misunderstood CJMAXiK's question
I understood "original voice" as the voice of Alice Merton
I don't quite understand it either, is it referring to Evil's voice or the original singer?
Well, that information would be kinda improtant to know
Original voice is Merton's
I think that's what it's made from, I have another song that sounds better
That was also made by converting from the original voice
I can send that one if you want to hear what the model can do
Since the configuration Kai is using might be wrong somehow
Ye, I just found a setting that might change everything
once I'm done with the second song, I'll get back to this one
SVP?
I dunno, but it's from wispers as well. I deleted mine around 10 minutes ago before I found the issue with settings, gotta train it once again
Tuning for SynthV
Also, here's another difference from that option
(It took only 15 seconds before)
(It's been 11 minutes)
maybe I overkilled it
Holy shit
Tuning is not that good, but I don't feel like fixing it

And it is noisy because there is a heavy post-production chain involved
The model from my dataset doesn't seem to have noise, at least when used correctly
I'll try to top that, brb
Arms race lul
This is some of the model's high-end output
idk why the pitch is so broken for me lmao
Must be some setting you're getting wrong, the model wors really well with whatever wispers is using
goddamnit
What's an SVP though?
^
huh
I didn't use the original Merton's voice track, I generated a different one using SynthV
Alright, back to training 
damn
This is bullshit, I just trained Gagarin and brother's already on Mars

damn
Welp, I can't get "ABBA - Lay All Your Love On Me" working no matter the settings, so, womp womp ig...
Could you make it sometime, should you be free, please?
really wanted to hear that song from Evil lol
You can also ask wispers, they seem to know how to use the model correctly. You just need to split out the vocals and lyrics
I know how to do that 
Ok then, whatever works
The model from wispers using my dataset is probably the current best one anyway
Just the high ranges are a bit inaccurate
I can give you my dataset if you want, it helps a lot with reducing noise based on what wispers got
My GPU is busy rn, so take these instead
I spent like a moth making it
And half a spacebar
what GPU do you use btw?
lol
3070 Ti
nice
I have a 4070Ti
And all xx70 class
^
And two are 70Tis
Well, even if the model wispers made is not the most accurate for now, it's certainly the least noisy, at least as far as I can hear. I think wispers also said they're still gonna be adding more Evil singing data to the training dataset for the model to increase accuracy and range, with my dataset to stabilize it and reduce noise
Since my dataset is quite a large speech dataset, so apparently it's good for that
Well, wish me luck
good luck o7
Are you still using the same dataset as before?
Yes, I just need to bring my model back
My dataset apparently helps as an additional thing to stabilize things
So I can give you it if you want it
Not now
Ok then, let me know if you want it at some point
I'll continue working with wispers to make our model better
Currently it apparently only has like 20 songs on top of my 1180 or something files of speech
My set is 35 voice tracks from songs
Well, wispers says they'll be adding more to ours to improve the model, especially for higher ranges
I know (he's in my DMs
)
Makes sense
Well, we'll see who manages to make the best model, but ours seems to be really good when used correctly, so maybe adding in more song data will improve it above what yours can do while having my large speech dataset to stabilize it and reduce noise, or at least that's what wispers said it does
The noise from what I've heard from your model is not really good
Especially for what I plan to use the model for
Which is DIVA modding
only issue I have rn is "sha-ring" and "waa-sting", "emo-hion"
From your output or something else?
ye
and those
Well, who knows how much the model will improve with more songs fed in
This is promising
I tinkered with that a bit and made this:
Ain't that nice
https://docs.google.com/spreadsheets/d/1tvZSggOsZGAPjbMrWOAAaoJJFpJuQlwUEQCf5x1ssO8/edit?pli=1#gid=0
There's over 600 entries
given, there's like 2-5 of each voice
anyway, eepy time
Should we get our model listed there at some point?
I prefer to have my models exclusive to me.
I prefer to share everything possible, which is why I put the model on my public FTP server
The one wispers made from my dataset
So even if your is better, ours is more accessible
And I think ours could become as good or better than your model with enough data
I like this competitive nature 