#✨│ai-help
Or well, yeah, you could delete the 48k ones if you don't plan on using them, but there's no real need to
I am even more annoyed that I couldn't find the actual answer to the problem. All I found is some guy from the server absolutely insulting the shit out of the one who experienced that issue 😭
damn wtf
ayoo
Could it have to do smth with my dataset?
As I said, it could if you have many many files and they vary too much in file size
Hmm, I have abt 70 files with a max length of 10 seconds
So, I can try deleting some?
Put everything into one file
I am going to assume you watched a youtube tutorial that told you to split audios into 10 second files
dont do dat
just keep it as one file, see if it works
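A rough sketch of the "put everything into one file" advice, assuming the clips are plain PCM WAV files that all share the same channel count, bit depth and sample rate. The function name `concat_wavs` is made up for illustration; it only uses Python's standard `wave` module.

```python
import wave


def concat_wavs(input_paths, output_path):
    """Join several WAV clips (same channels / sample width / rate) into one file."""
    params = None
    chunks = []
    for path in input_paths:
        with wave.open(path, "rb") as src:
            fmt = (src.getnchannels(), src.getsampwidth(), src.getframerate())
            if params is None:
                params = fmt
            elif fmt != params:
                # Mixing formats would produce garbage audio, so stop early.
                raise ValueError(f"{path} has format {fmt}, expected {params}")
            chunks.append(src.readframes(src.getnframes()))
    if params is None:
        raise ValueError("no input files given")
    with wave.open(output_path, "wb") as dst:
        dst.setnchannels(params[0])
        dst.setsampwidth(params[1])
        dst.setframerate(params[2])
        for chunk in chunks:
            dst.writeframes(chunk)
    return output_path
```

Something like `concat_wavs(sorted(list_of_clip_paths), "dataset.wav")` would then give one long file to hand to training. Audacity can do the same thing by hand.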
no, i mean i just dont want to train this data anymore ...
ill train new data so
are these needed or no
Oh no no, the thing youre looking at is NOT your voice model
those are pretrains. You need those to train any voice model
If you want to delete your entire voice model (the 230 epochs), delete the folder ixtivhmale inside the "logs" folder
should the audio file be flac or wav? Just to be sure
oh okay
both are lossless so doesnt matter
perfect, thanks
and my data is mp3 files, so if i rename a file from .mp3 to .wav will it work?
hey is it ok if use OV2 on a 6:30 dataset

Funnily enough, ever since that update, I changed my style into fewer but longer messages, so I am doing the exact opposite
no that doesnt work
so how can i change
Convert
are online converter websites enough or
I dont know if theres any point in that because if its already mp3 theres not much to do right..?
should i convert with a program
Probably, if you dont want to download a program
anybody?
i have audacity already lol
but i dont know how to use
i think i did somehow -*-
well nice my data 507mb
Yes, wav files are very big
use FLAC instead
if its a problem for you
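Why renaming .mp3 to .wav does nothing: the format is decided by the bytes inside the file, not by the extension. A minimal sketch that peeks at the first bytes to guess what a file really is. The function names are made up for illustration, and the MP3 check is only a heuristic (a bare frame-sync match can also hit a few other MPEG audio streams).

```python
def sniff_audio_format(header: bytes) -> str:
    """Guess the audio container from the first bytes, ignoring the extension."""
    if header[:4] == b"RIFF" and header[8:12] == b"WAVE":
        return "wav"
    if header[:4] == b"fLaC":
        return "flac"
    if header[:3] == b"ID3":
        return "mp3"  # MP3 that starts with an ID3v2 tag
    if len(header) >= 2 and header[0] == 0xFF and (header[1] & 0xE0) == 0xE0:
        return "mp3"  # bare MPEG audio frame sync (heuristic)
    return "unknown"


def sniff_file(path: str) -> str:
    """Read the first 12 bytes of a file and guess its real format."""
    with open(path, "rb") as fh:
        return sniff_audio_format(fh.read(12))
```

If `sniff_file("song.wav")` comes back as `"mp3"`, the file was only renamed and still needs a real conversion (Audacity export or a converter).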
-paperspace
so shadicti
which should i pick in tensorboard,
g/total or g/mel or g/kl
i want to pick the lowest point, which one do i pick it from?
type g/t in the search bar and check
what about mel and kl
well i only check g/t
someone advised me to follow them alongside g/t
if its increasing its overtraining so
idk anything about the sample rate, i always choose 48k
kinda hard to understand
@red kayak
Dont know
@proper shale advised me about /mel and /kl
Good advice, I pinged Litsa because he can explain it
From my knowledge:
The easy, non-complicated answer: Train until you are sure you hit the lowest point in g/total, then take the voice model saved at the last step before that lowest point.
The more advanced answer: Even if g/total hits its lowest point, monitor the d/total, mel and kl graphs. If those are still sinking, you may keep training
the only 1 i want to know is
if /mel and /kl going down and g/total going up
how can i pick lowest point
As I said, even if g/total is going up, if mel and kl are going down you can take anything after it. You DONT HAVE TO pick the lowest point of g/total if those are going down
most recent/newest one is probably ok
Understandable
i get it now, u mean if mel and kl going down just take newest point in g/total
What do u need
That simply isn't true
what should i do?
What shad is saying isn't entirely true, however. Overtraining isn't ideal
If ur generator's loss starts to increase, that means the model is slowly getting worse and worse at generalizing from the data given
Even if mel and kl are improving (mel = clarity btw), that still doesn't let the model perform at its best cuz it's slowly starting to over-exaggerate. The generator's outputs keep getting worse and the discriminator keeps telling the fake samples apart
hi litsa
Hi :>
Good thing I disclaimered indirectly that my answer may not be the right one
It's fine
my EN is bad can u give me like short and clear answer of what should i do?
sry
we learn
U can in fact over train for a little bit to get that extra bit of clarity but still it isn't advisable to do so unless u actually have a good dataset with no noise
Just look at g/total and check Mel and kl from time to time
Ur main focus should be g total though
and pick lowest point?
Yes, set smoothing to 0.987 and turn "ignore outliers" on
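For anyone who wants the "lowest point of g/total" as a number instead of eyeballing the graph, here is a minimal sketch. It assumes the smoothing slider behaves like a bias-corrected exponential moving average (that is my understanding of TensorBoard's scalar smoothing, not something confirmed in this chat), and `smooth` / `lowest_point` are made-up names. The weight has to stay below 1.

```python
def smooth(values, weight=0.987):
    """Exponential moving average with start-up bias correction (0 <= weight < 1)."""
    smoothed = []
    last = 0.0
    for i, value in enumerate(values, start=1):
        last = last * weight + (1.0 - weight) * value
        # Divide out the bias from starting the average at zero.
        smoothed.append(last / (1.0 - weight ** i))
    return smoothed


def lowest_point(steps, values, weight=0.987):
    """Return (step, smoothed_value) where the smoothed curve is lowest."""
    if len(steps) != len(values) or not values:
        raise ValueError("steps and values must be non-empty and the same length")
    curve = smooth(values, weight)
    idx = min(range(len(curve)), key=curve.__getitem__)
    return steps[idx], curve[idx]
```

Feed it the g/total steps and values exported from TensorBoard; the step it returns is the one to compare against the saved checkpoints.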
literally ill punch my pc im so close
g/mel and g/kl is hard to understand but i want master rvc and make completion model
already did that
B4 u master RVC u should learn how to make datasets and how to choose data
still didnt get what should i do if mel and kl going down and total going up
in my case i will just ignore thats mel and kl going down and pick lowest point in g total
Which all takes experience
Yeah do that
my dataset is about 10min and as clear as a broadcast
and i cleaned it more in Audacity
Podcasts are never really that clean
so i cant get anything clearer, and some of it is over 10min
nothing is as clear as them imo

There's software u can use for audio repair
which 1 ?
UVR and iZotope RX 10
i want get best thing i can
tysm
Good luck
Both
u don't necessarily need to pick the lowest point, as i said b4
but you can pick the lowest mel/kl points
are mel and kl equal in choosing lowest point?
You only ever choose the lowest point of g/total
okay.. tysm
for local running is using Mainline more recommended than Ilaria RVC?
For training yes, because only Mainline does it out of those two
For inference Ilaria RVC is better
I'm looking to transform text to speech or songs to a voice of a model so not training I think
is that what inference is ?
cool, i'll go for that then. thanks
can i manually change the language that rvc is using?
but after cloning ilar rvc 3 in the readme it says to clone https://huggingface.co/spaces/mateuseap/magic-vocals/ but it doesnt exist
nvm i'm dumb
why is my command prompt saying this?
AttributeError: 'RVC' object has no attribute 'tgt_sr'
and how do i fix it?
is it RVC gui realtime?
yeah
Did you download the CODE or the binary release?
idk
whats the folder called
Ok send a screenshot of your RVC
alr
does Ilaria RVC not work on windows?
Change your OUTPUT device to SAMSUNG (MME)
Also, the default index file causes issues, even if youre not using it. So load up any other index, it doesnt matter if its the same as your voice model, just any custom one you downloaded
there is no samsung (mme)
There should be one called SAMSUNG (Nvidia high definition audio) (MME)
it should be somewhere rather on the top
Is that even your output? Your headphones?
im not using headphones
Your speakers, same thing
yes thats my output
oh
Ok, then change your Input to the one with Windows DirectSound
However, from experience, anything other than MME causes issues but maybe youre lucky
I am not familiar with OV2 models but if its RVC it will work just fine
Select it, and select its index also
thats what i did before
After you have the same type of driver for input and output, try voice changer again
and it says that error
OV2 is pretrain for rvc models
Ah ok gotcha
Ye
alr
can you just not use input: microphone MME and output: line 1 MME?
since youll be running this as a voice changer anyway
Thats fine, output should also be something with MME for example the Line 1
Isnt it just Line 1 Virtual Audio Cable but you changed the name to Voicemod?
If not then ig download the real VAC Line 1
run "setup64.exe"
it is installed like that
ah ok
alr
how do i set it to my microphone?
wait
is this how it supposed to be
thats the correct one, but select the (MME)
for output or input
output
there is none
OH
IM STUPID
i thought that was all of them
theres still a bunch
You didnt scroll up did you
i didnt lol
i just noticed it 🤣
Now it should work
alr
it works but you have to do some complicated things
wsl I imagine right?
i suggest you using ilaria rvc mainline directly but its currently on hold
nah
Hmmm
what do i do
1 sec
alr
yes
yes
then screenshot the entire error code
was asking if these 3 files are in the hubert folder or not
there is
in the command prompt or
ye
Wait did you extract the rvc1006nvidia folder out of the zip?
yeah?
Confused why the folder path is listed twice lol
what gpu do you have?
Quite honestly not sure if thats even going to be able to run it
Its a weak gpu
is it?
its very outdated
yeah
it only has 2gb vram aswell
oh
But now I cant really tell if it wont start because of that or if there are any files missing
THE 770
yeah...
babe you will not be able to even make it run even if you pray
use the colab
didnt that get deleted
nope
Hes trying to run the RVC realtime. Dont think theres a colab for that, aside from wokada
ohhhhhhhhhhh its realtime
what is this one
but we are in help rvc
Let's backtrack quickly
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/releases/tag/updated1006v2
Did you download your rvc from this?
its for the normal stuff
no
It still technically counts as RVC, as it is hosted by RVC
lame, there should be a voice changer help channel instead of okada
okada runs worse but its more up to date and has an easier gui
Same. I was told there was one, but people still asked in this channel instead of the clearly named VOICE CHANGER help one 💀
so thats why it got changed to okada
people are stupid
was that no towards me? where did you download it from then
uhh let me find the link rq
Elevenlabs plugin on Applio can't use custom voices? (the ones you get from voice library)
yea its the same
Well yeah thats linked on the link i sent
oh
i need the hxh narrator fr
Then I dont know why that issue is happening. It is caused by pre-trained models like hubert not being downloaded, but youre saying they are, so no clue.
idk
try redownload the prezip?
is there any other voice changers that i can run that use rvc?
theres wokada but your gpu is not strong enough
#1159289738314919936 is your best bet
Free or paid
oh
Is your internet good? theres a colab version
google speedtest and lmk your download upload
k
does it make sense to train more than 300 epochs? I decided to train 1000 but I don't think there will be a better model
We use tensorboard to determine overtraining or not
alr
try target 500 then observe the tensorboard
Check out these links, follow the steps
alr
Copy the entire error
JSONDecodeError Traceback (most recent call last)
<ipython-input-6-825849d8c545> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:
5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end
JSONDecodeError: Expecting value: line 1 column 1 (char 0)
more likely outdated colab
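For reference, "Expecting value: line 1 column 1 (char 0)" is what Python's `json` module says when the text it was given is empty or is not JSON at all, which fits a notebook pointing at a credentials file that is missing its contents. A minimal sketch that reproduces the message; `describe_json_problem` is a made-up helper, and this does not prove what went wrong in that particular colab.

```python
import json


def describe_json_problem(text: str) -> str:
    """Return 'ok' or a short reason why `text` is not valid JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError as err:
        if not text.strip():
            return "file is empty"
        return f"not JSON: {err.msg} at line {err.lineno} column {err.colno}"
    return "ok"
```

Running it on the contents of the config file the notebook tries to load shows quickly whether the file is empty, an HTML error page, or fine.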
what should i do now?
from where i get the latest?
-colab
Suggestions for @west summit
- Applio, by IA Hispano (Google Colab)
- AICoverGen-WebUI, modded by Hina (Google Colab)
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr (Google Colab)
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing (Google Colab)
- Ilaria RVC, by thestingerx (Google Colab)
- RVC Disconnected, by Kit Lemonfoot (Google Colab)
- easyGUI, by rejects (Google Colab)
- Applio, by IA Hispano (Hugging Face Spaces)
- Ilaria RVC, by thestingerx (Hugging Face Spaces)
- RVC-HFv2, by r3gm (Hugging Face Spaces)
- AICoverGen, by r3gm (Hugging Face Spaces)
- Advanced RVC Inference, by r3gm (Hugging Face Spaces)
- RVC v2 Huggingface version, by Clebersla (Hugging Face Spaces)
^ check these messages above. You can use Ilaria for inference (ai covers), or RVC Disconnected to train voice models
Anyone use Tensordock? Im trying to run AICoverGen or any other Colab on Tensordock through Jupyter Notebook and its not working. Every Colab notebook ive tried on there has run into issues and im tired of burning my money on there lol. Has anyone got it to work?
Probably just gonna call it a wash and switch to Paperspace next payday
Was also considering Colab Pro but Colab Pro is such a rip off compared to the other cloud gpu providers
Why is ai gen taking forever for a 10 second clip?
how do i use this
the installation step?
use what?
thanks man
i just installed so-vits-svc, but i get PermissionError: [Errno 13] Permission denied: error everytime i start, any help?
or should i ask this somewhere else
The processing after inputting my audio file
My advice would be to not use So-Vits, its really outdated and most models are made for RVC not So-Vits
It's taking 20-30 minutes for the final output
what are your hardware specs?
prob he meant ai covergen colab
i see, do u know where to get rvc?
yea google RVC1006Nvidia
then click the first Hugging Face link
or you can look at the guides for other links
but RVC1006Nvidia is the one that works best for me
just download, unzip and run go-web.bat
same GUI as so-vits or nah?
where do I put the CKPT files in the voice changer?
no clue if its the exact same gui, I haven't touched sovits since last summer. But the gui is super simple and easy. I'm an idiot and I can use it no problem
what so-vits gui do you use?
W-Okada?
On the w-okada gui, press the "edit" button and add the .pth file of your model.
You dont use ckpt files on the voice changer
.ckpt? it doesnt support gpt sovits models
alr thanks i will try it
i just need to run the go_web then, no prerequisites?
Yeah, it doesn't support GPT sovits models.
I do add just that but when I click upload it's just blank
Like the others said, gpt sovits dont work
so the voices without index files just don't work?
They do, if theyre RVC .pth files
yea thats all I had to do
if you run into any issues feel free to ping me
I can't change my voice model on applio.
no matter what I do (unload, refresh, restart) it only uses the same voice model that I used at first
Upload a screenshot there.
how do i get it to work then 🥲 i am big dumb
of what?
Of your W-Okada GUI.
even when you close down applio completly and reopen it?
Do you have an RVC model downloaded?
yeah
i tried butters from south park, the hand unit and kurzgesagt and just uploading the .pth file but they all just appear blank
not sure if i should be doing something else
my advice would be to just use regular rvc. these cool fancy versions with all the crazy features are nice but they always have issues for me
INPUT - Your Microphone
OUTPUT - Virtual Audio Cable this is mandatory so you can actually use the voice changer on other apps.
MONITOR - Headphones if you want to have live monitoring, this will throw most people off as there is a delay, but its good for the initial test.
Virtual cable download:
https://software.muzychenko.net/freeware/vac470lite.zip
"setup64.exe"
wait, this work with AMD?
no you have to use the other version hold on
google RVC1006AMD_Intel
everything else will be the same, just use that version as it was meant for amd
i see
thankfully havent downloaded anything yet i was still researchin
hey i need help, i downloaded python. but i still can't open the voice changer
yea give that one a go, what gpu do you have?
i got amd gpu
i meant what amd gpu lol
this is it, was not sure if we were allowed to send links so thanks for that
RX 580 8GB
if its github, hugging face etc links youre fine to send them
one of the classics! But yea that link shad sent is the same thing I was trying to have you get. Thatll work for ya
should be a trustworthy page ya know
Good to know, I appreciate the info.
RVC Guides (How to Make AI Cover)
Translation by country
same as before? with what file should i start it with?
should be the exact same. just unzip the zip and then run go-web.bat
go-web-dml.bat
i assume you know where to put the index and pth
for AMD, you always look for anything with DML at the end lol
listen to shad he knows his stuff
I sense the power of a big Helper inside you, so I shall let you do your thing and correct wherever needed prayge
alright thanks
it works now
still figuring it
aw thank you 🙂
awesome! glad its working. no errors or anything when you try to convert a file?
how do i do that 😭
i wanna make like ai song covers
ive done it before but not with rvc
oh sorry just saw this, so do you already have the model and the song you want to convert?
yea man
but like i have AMD so theres not much options
okay cool, so do you mind sending me a screenshot then
so that way I can see what options you do have
what kind of screenshot?
a screenshot of your screen. I want to see what options your webui has
its probably the same as the one I have but i would like to make sure before I give you the wrong advice
hmm ok holdon
sure
is like dis
great thanks
so thats the realtime conversion gui. Its used for changing your voice in realtime, not for making song covers
yeah ive noticed
i was searching for the song covers but it seems like theres none for amd cards (?)
not sure
hold on a sec ill see
aight
okay so try opening a command prompt in the rvc folder and then run the following command
runtime\python.exe infer-web.py --pycmd runtime\python.exe --dml
no clue if this will work but its worth a shot
ok imma try
section " for vocals"
i cant understand 1 thing
i ment this
it opens this
?
how can i fix it now i havent see any solution
@proper shale
can you share me best settings u got for clear vocals?
i cannot understand this
YES! thats it!
idk anything litreally
literally
It literally shows the models you should use
I am trying to use voices on the song "we didnt start the fire" but get bad results because on some parts of the song there is a high pitched and low pitched voice overlapping, which returns bad results. Is there any way to fix this, or is it not possible
MDX23C-InstVoc HQ if you want to separate a song instrumental - vocals
but it seems like i have trouble changing the voice tho im looking for some ways
(ive downloaded some new voices)
Kim Vocal for blabla this that
https://rentry.co/ExtractGuide
else look at this
did you put the pth file for the model in \RVC1006AMD_Intel1\assets\weights ?
ok i just refreshed it and its there
imma try do convertin
go for it! let me know how it goes. im rooting for you!
my mind about to blow up
bro on the top left
CHOOSE PROCESS METHOD
ENSEMBLE METHOD
change that to MDX or VR
depending on what you need
And then select ONE model, run it
Then when youre done, select that new file again, and apply the rest of the models one by one
what about settings ?
First, on the bottom left "choose vr model" select the model
Nice job, i didn't know you knew about UVR usage.
Settings you can most likely leave default
bs-roformer go brrrrrrr
what if i havent find MDX23C-InstVoc HQ
Its on CHOOSE PROCESS METHOD: MDX
You selected VR-Architecture
they are categorized
sry about that mate
If you don't find it there, you might need to download it
YES I LOVE ERRORS
Lol no I am using CAPS LOCK to make it a bit easier, maybe it makes it easier to understand what to look for. I am not trying to sound angry if you think it sounds like that 😅
Do you have a Nvidia GPU?
i do
Oh
Which GPU
I get that error because mine can't do GPU Conversion
it isnt working on it?
which gpu
i hope 3060 is the correct answer
IT WORKSS THANKSSS
but it still needs a lot of tweaking but its fine i can experiment 💯
choose the process method, dont enable the ensemble algorithm stuff unless u wanna ensemble stuff ig
ensemble is to use two voice sep stuff at once
so
yeah
anyways, id say, use mdx-net stuff
or try bsroformer
Thats an old screenshot
check the latest
@pastel oak explained to me untill this 1
select saved settings rq
Glad I could help man! Make sure to pay it forward, if you see someone having the same issue that you did then teach them what I taught you. And if you need any more help feel free to ping me
rq ?
real quick
same error if in saved settings
yes i named the folder
UVR ouput
id recommend renaming that so it has no spaces and then u try again
or just export it to ur c: drive instead.....
same error
same error
this is annoying
for sureee
btw, i keep getting this error
i will try changing the file itself
Weird I've never gotten that on local RVC. Does it happen every time you try to convert something?
first time no, but the second ones yeah
Is the command prompt still running?
Closing the command prompt also closes RVC so if you closed it that'd cause it
after the error, yes
gave something like this before the error
Try running that command I sent to open RVC again and see if it fixes it
so run the command again after error?
Oh also, you might want to copy that command I sent earlier to the notepad, and then save it as a bat file in the same folder. That way you can just run it from the bat instead of putting in the command each time
Yeah just run the command again after you close out the first command prompt. Then test it to see if it gives another error
still error
didnt close the console tho
or am i supposed to not touch anything
@proper shale@pastel oak
it worked after i changed the audio file itself
tysm
Nope you don't have to close the console. The consoles gotta stay open while using it. If it's still not working try restarting your PC. When I get errors that's how I fix it most of the time.
ight imma try later, thanks for the help before, i will ping you if i come up with any new problems 👍
Sure, glad I could help. You take care 😁
is this over training?
Youre at the start of the model, keep training
click the blue button on the right side
to expand the graph
i'm gonna run out of disk space didn't know it took that much 💀
it's already at 80gb and there is 25gb left on the disk
It seems a bit early to judge. let's train it a bit more.
You enabled ckpt saved at every frequency, turn it off for the next time
delete pth files that you dont need of older model versions at 1k steps or whatever
it should never be 80gb
yeah it's the pth files. 800mb each and it made 100 so far
delete the ones you dont need
everything under 2.8k you can throw out safely
everything above 3.2k until recently you can delete aswell
what's the difference between Dxxx.pth and Gxxx.pth?
You use the most recent one of both D and G to continue training the voice model if you stop now
for those you should also only have one of each
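A small sketch of the "only keep the most recent D and G" cleanup, written as a dry run so nothing gets deleted by accident. The `D_<number>.pth` / `G_<number>.pth` naming is an assumption based on the Dxxx/Gxxx names mentioned here, and `stale_checkpoints` is a made-up helper; check the pattern against the real files in the logs folder first.

```python
import re

# Assumed naming, e.g. "G_2400.pth" / "D_2400.pth"; adjust if your files differ.
CKPT_RE = re.compile(r"^([DG])_(\d+)\.pth$")


def stale_checkpoints(filenames):
    """Return D_/G_ checkpoint names that are older than the newest of each kind."""
    by_kind = {"D": [], "G": []}
    for name in filenames:
        match = CKPT_RE.match(name)
        if match:
            by_kind[match.group(1)].append((int(match.group(2)), name))
    stale = []
    for entries in by_kind.values():
        entries.sort()  # ascending by the number in the filename
        stale.extend(name for _, name in entries[:-1])  # keep only the newest
    return sorted(stale)
```

Call it with `os.listdir()` of the model's folder and look over the returned list before deleting anything. Files that don't match the pattern are never listed.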
yeah the d and g pth files are the ones i was referring to
I'll delete older ones
I dont know if applio is the same but on mainline its called something like "save only the latest ckpt file to save diskspace". That setting is turned off by default, it should be turned on
found it
you recommend turning both off?
nono the second one is important to be enabled
so you can select a certain X steps version of the model
since when you overtrain you wont know until you trained further, so you want the older pth files of that OT point
got it, thanks for the help boss
the file names has the epoch number not the step. how can I find which step it is?
since the graph shows steps
Would wait for someone who uses Applio to reply, I thought it always shows steps
seems like it's ~12 steps per epoch so gonna go with that
unless someone knows a better way
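The epoch/step conversion is just a multiplication once the steps-per-epoch figure is known. A tiny sketch using the ~12 steps per epoch estimated above (that number depends on dataset size and batch size, so read it off your own run; the function names are made up):

```python
def epoch_to_step(epoch: int, steps_per_epoch: int) -> int:
    """Step count reached at the end of the given epoch."""
    return epoch * steps_per_epoch


def step_to_epoch(step: int, steps_per_epoch: int) -> int:
    """First epoch whose final step is at or past `step` (ceiling division)."""
    return -(-step // steps_per_epoch)
```

So with 12 steps per epoch, a low point around step 2800 on the graph lands in epoch `step_to_epoch(2800, 12)`, and the file saved for epoch 250 corresponds to step `epoch_to_step(250, 12)`.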
Hey i have a question, what is the best pretrain_type for singing models?
which one of these
@helpers
I'd say original is pretty good in general
i heard something about OV2
OV2 and Rin take care of datasets shorter than 5 mins better than the originals
yeah my dataset is 6+ minutes
I'd say go for it with the original
alrighty thanks!
Np
also what is a good amount of epochs for a dataset of about 6 and a half minutes?
it isnt that good of quality aswell
@helpers
You could use 200-250
thanks!
hello link rvc google?
-colab
other model italian?
If I want to train a french speaking model, should I use the same model as suggested in the Training RVC v2 models Guide ?
Italian models? Your best bet is to go on weights.gg and search for models with the "italian" tag
Yeah, there's not a specific french pretrain right now
Ok I'll try some stuff, see if it still works
Im not sure if this is the right place to ask but Im trying to learn how to make ai music and I have gotten up to easyui mangio-rvc, but when I paste the link to download my model I just get an error every time
I tried it in the google colab, and in a localhost
Both give me an error, Ive tried models on drive and huggingface and they dont work
Is there no way to just upload it myself since I have the file already downloaded on my pc?
Greetings, any idea on how to remove distortion in RVC voice.ai app because when I use it it sounds really laggy and like a robot?
It doesnt seem to actually matter what I put in it just always errors, even on the default demo link
Could be that the easyGUI Mangio-RVC colab you use is outdated.
I'd suggest you use Ilaria's RVC
-colab
Thank you!
Choose one of the ilarias both should be the same(?)
I was using one from a video I found on youtube thats pretty old so yeah that makes sense its outdated
We do not offer support for voice.ai
Yeah all of them are outdated ngl unless its from members of this server
Theres also guides on how to use these all in this server
Check out the docs in the pinned message, go over to Cloud -> ilaria for proper guide
I just tested it out and Ilaria's works 🙂
How long should it take to process?
Its on 5 minutes now
Im converting a 2 minute vocal track
Nvm it finished
Thank you so much for the help Shad, I finished making my first song and it worked perfectly!
🤨
mode collapse ggs
is there anything i can do about it
how long is ur dataset
90min
rvc does not work well with long dataset, try to get the best 25 minutes from your 90 minutes audio
-colab
quality over quantity. no matter how long it is, you have probably used poor quality audio or improperly processed it (e.g. truncating it too short instead of splitting using audio labelling in Audacity)
its very high quality i promise, and i made sure each file was long enough
i guess i should probably cut it down
i just worry about getting all the right phonemes and pitches even though ik it doesn't necessarily need it
what would an ideal dataset consist of in terms of pitches, phonemes, vocal qualities, and the combinations of such?
just trying to have a variety on all dat
length should be below 30 minutes
then its all g
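A quick way to check how long a dataset really is before training, assuming the clips are PCM WAV files. It uses only the standard `wave` module; the helper names are made up, and the 30 minute default just mirrors the rule of thumb given above.

```python
import wave


def wav_seconds(path):
    """Duration of one WAV file in seconds."""
    with wave.open(path, "rb") as w:
        return w.getnframes() / w.getframerate()


def total_minutes(paths):
    """Combined duration of all given WAV files, in minutes."""
    return sum(wav_seconds(p) for p in paths) / 60.0


def over_limit(paths, limit_minutes=30.0):
    """True if the dataset is longer than the limit and should be trimmed."""
    return total_minutes(paths) > limit_minutes
```

This only measures length. Picking which 25 minutes are the "best" still has to be done by ear.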
i added the google drive link to ilaria and made it public but theres no models in the dropdown (i refreshed)
is it looking for a .zip?
if so which files does the archive need?
ill try pth and index and see if that works
yay that worked
just got done extracting features, then i trained my model, where do i go from here?
to use it and stuff
Load up the model in inference and convert acapellas
how do i do that
Are you using a webui?
Go to the first infer tab
Select your model (whatever epoch) and select the index
Might need to click refresh
it has a dropdown button but it doesnt do anything
You might need to click in the middle of the box
I think it was buggy?
And also refresh/restart if no models show up
doesnt work, it lets me type in it tho
Check if there are models in the logs folder where you installed applio
There should be more models pth files in the same logs folder
Something is wrong cause there is no trained models below that folder?
a mute folder with random folders with no pth
am i supposed to train or generate an index first
-rvc
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
How To Make an AI Cover With Ilaria RVC
Link: Rentry
Credits: 👽 Julia (ailen2091)
eh I don’t remember
yeah it still didnt work and theres none of that
You must not have actually trained the model
is this sample with encoding ok?
Yeah
ty
when training i choose 40k right ? since its 44.1khz
tysm
Yes, but the thing is that windows always lies as to what your ACTUAL sample rate is
It might say 44.1khz but your spectrogram actually only reaches 18khz (36k effective), in that case you need 32k..
with the program "spek" you can see it
in the training page there are only 2 options
I dont want to confuse you even more, you can try training at 40k like others do but if you want to be very very technical xd
Click on v1, then click on v2
Its bugged
i want to be
nice! tysm
i will see the spek program
from where can i download it ?
https://www.spek.cc/p/download
Windows Installer
When you put your audio file in there, send a screenshot and I can explain what you need to look for
40k right?
You always look at the kHz scale on the left side and multiply the highest frequency you see by 2
So your spectrogram tops out at around 15 kHz, times 2 = 30 kHz
it doesnt even reach 32 so you should definitely try 32k for now
And next time try to see if you can get your files from somewhere else
wdym?
i get it mostly from youtube
what youtube downloader did you use?
mostly
if u got a better one i would use it
Youtube is inconsistent - sometimes the audio is rlly good with 44khz but sometimes its literally 20khz
so idk if its the videos or the tuberipper.cc
I can show you what I use later when im on pc
okay
tysm
if its 44k, is it better to choose 48k?
Ehh i dont understand but ill explain what you should do
When you download a file from youtube, open that file in a spectrogram. Check the highest kHz it reaches. If its 24khz * 2 = 48khz then when you edit for cleaning vocals, use 48khz
If EVERY file you have is 16khz * 2 = 32khz, then you do 32khz instead
Is that understandable?
yessir i get it already
Theres UVR models for that, I dont know off the top of my head. Never used drum removals
Its also on mvsep probably
i meant 44k is between 40khz and 48khz
which is better?
Ahh
40
Because:
44khz audio doesnt contain all the frequencies that 48k expects, so the model will have noise, so you always round down
Same thing for 36, 37, 38khz = you choose 32
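The rule from the messages above (double the highest frequency visible in the spectrogram, then round down to the nearest training rate) can be written out as a small sketch. The function names are made up for illustration, and the list of rates assumes the 32k / 40k / 48k options mentioned in the chat.

```python
# Training sample rates mentioned in the chat (32k, 40k, 48k).
TRAINING_RATES_HZ = (32000, 40000, 48000)


def effective_sample_rate(cutoff_hz):
    """Highest frequency visible in the spectrogram, times 2."""
    return 2 * cutoff_hz


def pick_training_rate(cutoff_hz):
    """Pick the largest training rate that is not above the effective rate.

    If even the lowest option is above the effective rate, fall back to it
    (the "it doesnt even reach 32, so try 32" case).
    """
    effective = effective_sample_rate(cutoff_hz)
    candidates = [rate for rate in TRAINING_RATES_HZ if rate <= effective]
    return max(candidates) if candidates else min(TRAINING_RATES_HZ)


if __name__ == "__main__":
    for cutoff in (15000, 16000, 18000, 22050, 24000):
        print(cutoff, "Hz cutoff ->", pick_training_rate(cutoff), "Hz training rate")
```

So a file that tops out at 22.05 kHz (44.1k effective) lands on 40k, and one that tops out at 18 kHz (36k effective) lands on 32k, same as in the explanation.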
i get it
tysm
are stereo and 24bit fine with 32khz or is there something better?
It should work just fine
I dont know if theres any better
tysm u r great
Wav or flac what do u prefer?
both are good so it doesnt matter
If I have source raw footage of audio at high quality and sample rate id use wav. Everything else like from youtube, instagram etc, flac is enough. If you want you can choose wav but not needed
tysm
normalize below 0 dB before exporting to any integer PCM format; unlike 32-bit float, integer PCM clips anything above 0 dB
also better to leave the file's sample rate at 44.1k for compatibility (not to be confused with the sample rate used for training)
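To show why normalizing matters before an integer PCM export, here is a minimal sketch using plain Python lists of float samples. The helper names and the -1 dB target are assumptions picked for illustration; the chat only says "below 0 dB".

```python
def peak_normalize(samples, target_dbfs=-1.0):
    """Scale float samples so the loudest peak sits at target_dbfs (below 0 dBFS)."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # silence: nothing to scale
    target_linear = 10 ** (target_dbfs / 20)
    gain = target_linear / peak
    return [s * gain for s in samples]


def to_pcm16(samples):
    """Convert float samples to 16-bit integers, hard-clipping outside [-1, 1]."""
    out = []
    for s in samples:
        clipped = max(-1.0, min(1.0, s))
        out.append(int(round(clipped * 32767)))
    return out


if __name__ == "__main__":
    float_audio = [0.5, -1.8, 1.2]  # peaks above 1.0 are fine in 32-bit float
    print("without normalizing:", to_pcm16(float_audio))
    print("with normalizing:   ", to_pcm16(peak_normalize(float_audio)))
```

Without normalizing, the two loud samples are flattened to the 16-bit limits, which is the clipping the message warns about. After normalizing, every sample stays inside the range and the waveform shape is preserved.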
what's this?
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Huggingface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
what are you using
can someone help with installing applio?
i clicked install on colab
i clicked run, now idk whats happening
dml
?
Yo, what’s the best settings for covers? I know it’s a personal take, but I still want to have some recommendations from others.
There isn't a best setting, when you use like AICoverGen or Ilaria RVC you can stick with the default settings or play with them if you're not happy about how the result sounds
About AI cover gen what models do they use?
You mean for vocal/instrumental separation and stuff like that?
Yea
Voc_FT for vocal/inst separation, MDX Karaoke 2 for lead/background vocals separation, Foxjoy Reverb HQ for de-reverb
Do you think Foxjoy's reverb is better than the standard uvr dereverb one?
it's less aggressive
and doesn't have the 17khz limit
is there a way to have applio add the steps number in the trained model?
currently it's only saving the epoch number for me, for example test_520e.pth

When using vocal split mode, should I or should I not dereverb the backing vocals?
doesnt applio have a steps counter or anything similar
dereverb the backings? not really, unless u want to
k
Reverb HQ only works on stereo reverb, while DeReverb-DeEcho also removes some mono reverb
IIRC
AMD/intel dedicated GPU or iGPU?
As in, standard one only dereverbs up to 17khz right?
yeah
use applio, it has steps + epoch on the output
foxjoy dereverb removes stereo reverb only, yet the UVR one can remove mono reverb as well, though the former is preferable for stereo reverb
(note that if foxjoy one fails to remove, it is mono reverb)
Thankss
I am using applio tho as i said lol. i'm using last version and it's not saving the steps
tick "high end process" in VR arch settings to get fullband
no idea
😦
use 3.1.1
yeah, replace/overwrite the files
oo ty
Hi guys, is it possible to train a tts model and regulate the speed of it so that it reads very slowly without glitches in voice like it's in slowmo?
Edge TTS API actually has rate (speed) parameter but Applio doesn't use it, so might need to modify if you know how to code
ig if you take a tts result and slow it down you could maybe do that... wouldnt be realistic
or do what @proud elbow said
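For anyone who wants to try the rate parameter directly, here is a rough sketch using the `edge-tts` Python package outside of Applio. The `rate_string` and `speak` helpers, the 0.7 speed factor, and the example voice name are all assumptions for illustration; this is not how Applio calls it.

```python
import asyncio


def rate_string(speed_factor):
    """Convert a speed factor (1.0 = normal, 0.5 = half speed) to a signed percent string."""
    if speed_factor <= 0:
        raise ValueError("speed_factor must be positive")
    percent = round((speed_factor - 1.0) * 100)
    return f"{percent:+d}%"


async def speak(text, out_path, speed_factor=0.7, voice="en-US-AriaNeural"):
    """Synthesize text at a reduced rate and save it to out_path."""
    # Imported here so rate_string() works even without the package installed.
    import edge_tts  # third-party: pip install edge-tts

    communicate = edge_tts.Communicate(text, voice, rate=rate_string(speed_factor))
    await communicate.save(out_path)


# Example call (needs network access and the edge-tts package):
# asyncio.run(speak("This sentence is read slowly.", "slow.mp3"))
```

The slowed TTS file could then be converted with the RVC voice model as usual, which avoids stretching the audio afterwards.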
Do you know if there's a google colab where I can upload my trained voice model (trained with easyGUI and uploaded in huggingface) and use it to read with the possibility to adjust the speed of reading?
I'm currently using Soni Translate in colab with the RVC Custom voice, it works great but I can't slow down the reading in a natural way, it's all forcing the model
So my question is, how do I train a tts?
is there a easyGUI version to train TTS not voice models?
how does it work?
welp, you can't train tts models on RVC
because, audio conversion
ig you could use GPT-SoVITS
any colab that you know?
some colab links there
#1229627925658337360 message
found this inside GPT-SoVITS
there's also no webui one
dunno if this works but try it
yeah, let's see
what it is
anyway can you guys tell me how it works?
I'm new to all this, so it looks like first comes the TTS reading model and then the Voice, is that correct?
if yes then my question is how's a TTS model trained? Is it based on Audio? how it determines the pace/speed of reading?
you give it audio files, ask it to transcribe said files, then it does the training part
so it starts from audio and the speed of reading is automatically detected?
-colab
thanks, anything to train tts?
yeah uh... you give it a reference audio, and also the audio it's been trained on affects the pace too
is training audios easy?
https://docs.aihub.wtf/tts/gpt-sovits/ in case you wanna know better
i mean... yeah? just takes some getting used to
https://docs.aihub.wtf/ in case u wanna learn more
Last update: Mar 10, 2024
ty
This is way too complicated for me unfortunately.
Do you know any slow tts reader that's ready to use?
try this little girl voice
Is this easy gui?
Applio
thanks, let me try it
Is it possible in Applio uploading a custom TTS and a custom voice?
no custom voices here but prob some plugins for custom TTS
plugins?
idk much but there might be
I'm getting Error when hitting Convert on Applio
Do you have any ideas why?
the RVC voice model is required
Look at the Colab notebook box, it usually shows more info
Yeah that's true
But then I need to upload a model right?
I have a huggingface link with my model
if I go into Download
it gives me error
does colab work?
alternatively, put the pth & index file in logs\yourmodel, then click refresh in inference tab
ah something happened
even if Error appears
it downloaded the model
It is working now?
Yes it is working now
but I have the problem that the TTS is a female and my model is Male so when converting it mixes them somehow and makes a very cringe voice
any workaround?
-12 pitch or might be different value
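The -12 value is in semitones, and 12 semitones is one octave, so it halves the pitch. A tiny sketch of the arithmetic (the 220 Hz starting pitch is just an example number, not something from the chat):

```python
def semitone_ratio(semitones):
    """Frequency ratio for a pitch shift in semitones (12 semitones = one octave)."""
    return 2 ** (semitones / 12)


def shifted_pitch_hz(pitch_hz, semitones):
    """Pitch in Hz after shifting by the given number of semitones."""
    return pitch_hz * semitone_ratio(semitones)


if __name__ == "__main__":
    # Example: a voice around 220 Hz shifted down one octave.
    print(shifted_pitch_hz(220.0, -12))
```

If a full octave sounds too low for the target voice, values between -12 and 0 give proportionally smaller shifts.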
I'm trying let's see the effect
I'm aware it could sound very weird
why does it mix the voices?
shouldn't it use only my voice model?
and use the TTS solely as a style of reading?
@proud elbow I see you active in help chats loads
do you have any interest in joining staff?
I might doubt my activeness though I have chance to check the chats, but sure as needed
thats it
i dont know what this means
i tried reinstalling
and doing different versions
they all do this
btw for the people with message logger go to #🔍│help-w-okada i sent to wrong channel
Which GPU do you have, and which version of wokada did you download (onnxgpu-cuda or DirectML)
Answer in #🔍│help-w-okada
Rtx 3050 ti laptop and an i5
oh k
ngl i think you would be accepted if you apply for staff
limited vram, only using it alone
prob later
do you guys know why i keep getting this error in rvc gui?

rvc gui is outdated, try following instead
-rvc
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
How To Make an AI Cover With Ilaria RVC
Link: Rentry
Credits: 👽 Julia (ailen2091)
oh
i didnt think it was
is the Ilaria RVC link broken?
?
whenever i click on it it says this
-colab
Suggestions for @brittle wing
alr
I'm trying to use the new Japanese-speaking hubert. Do I just download the checkpoint .pt and replace the one in the applio folder, or do I have to add something else? For example, is this code mandatory, and where would it be placed?
Or is just placing the new "hubert_base.pt" in the folder where I have applio installed enough?
help me!!!
Replacing it should be enough
They both load using the same code
ok
how do i use this
its your gpu
And when does Hubert get used? Do you have to do something else (preload some configuration or something)? For example, when I start training, is the hubert applied when I perform the pitch extraction, when I load the dataset, or at the time of training? Because in the new version of applio, I don't know if it is using the new hubert file or the old one.
you probably dont have a driver installed
how do I install a driver
how do I find out
its cloud, not local. make sure the runtime uses T4 and that you have run the first installation cell
what do you mean
Hubert extracts the features of an audio. When training, the hubert is only used in the pitch and feature extraction step and not during the actual training process.
how do i put the runtime
check if the status in top right corner has T4, or Runtime -> change runtime type
can you send an image
how do i add the model in the hugging face spaces
not rn 💀
how do i add voices to the hugging face spaces
do i like drag it in or
-help
So I don't need to use the code separately or cite it in some way, or use it with the "transformers" and "soundfile" libraries? To train my models with the new Hubert, wouldn't this be applicable in that way?
The code uses fairseqs checkpoint utils function which is made for hubert
Excuse me, does anyone know how to resolve this error?
"JSONDecodeError Traceback (most recent call last)
<ipython-input-11-320ee20c9227> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:
5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end
JSONDecodeError: Expecting value: line 1 column 1 (char 0)"
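"Expecting value: line 1 column 1 (char 0)" is what Python's json module reports when the file is empty or its very first character is not JSON (for example an HTML error page saved in place of the key file). A small check like the one below can confirm that before the credentials call. `check_json_file` is a hypothetical helper, and what `config_path` points to in that notebook is not known here.

```python
import json
import os


def check_json_file(path):
    """Return (ok, message) describing whether `path` holds parseable JSON."""
    if not os.path.exists(path):
        return False, "file does not exist"
    if os.path.getsize(path) == 0:
        return False, "file is empty"
    try:
        with open(path, "r", encoding="utf-8") as handle:
            json.load(handle)
    except json.JSONDecodeError as err:
        return False, f"not valid JSON: {err.msg} (line {err.lineno}, column {err.colno})"
    return True, "valid JSON"


# Usage in the notebook, before creating the credentials:
# ok, message = check_json_file(config_path)
# print(message)
```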
Send the colab link
its an outdated one from yt lol
i solved this already in general chat
anyone have link for rvc1228?
Um hi, paperspace not working with wget https://huggingface.co/lollenape/LollenApeRVC/resolve/main/install.py ??
-rvc
We still can train RVC Mangio with Paperspace right?
yess i know
anyway now the things you did on paperspace are also on colab
okayy, im new, If I have any doubts I'll ask
In my opinion it's better to use applio anyway
Im at uni so no GPU, thinkpad carbon x1 jajj
colab does not need your computer's GPU
it's online
yeah but I ran out
the only flaw is that it lasts 4 hours a day
anyways I already paid
what do I upload? in data source, the zip?
cz the repository aint working
so I downloaded the og file 4gb to now upload it manually
wait while I tag people more expert than me ahhaha
thanks dude!
@red kayak sorry if I tag you but I don't understand much about paperspace
neither do I hahh so if u could give me a hand, Im obsessed with pulling a model thru just cz the ones other people made were horrible
@zinc anchor do you use paperspace?
Nope
Might experiment with it in the future
-paperspace
Suggestion for @tranquil cliff
💡 Click on the button below to open the guide.
If they need guides for RVC here
thankss!!!!!
It's only how to set up though
well at least its something hahha
I can help later. What do u need to do
i only have the perfect .flac , nothing more hahh
Just create a rvc2 model
for me to use it in a song
cz the ones created are shit
Okay then follow the guide
thanks a lot, seriously!
the one u sent right?
Hello.
You're the one who needs help with Paperspace?
You can read the guides tho.
But in case of any other issue let me know.
thanks everyone!
69 #Rename the folder to experiment_name and move it to /dataset/.
70 print("Dataset Type: Single Speaker")
---> 71 fi = os.path.join(temp_directory, experiment_name)
72 os.rename(directories[0], fi)
73 shutil.move(fi, final_directory)
NameError: name 'experiment_name' is not defined
any help?
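A NameError like this means the variable was never assigned in the current session. In a notebook that usually happens when the earlier cell that sets it was skipped or the runtime restarted, though that is a guess about this particular notebook. A small guard like the hypothetical `require_names` below gives a clearer message than the raw traceback.

```python
def require_names(namespace, *names):
    """Raise a readable error listing variables that have not been defined yet."""
    missing = [name for name in names if name not in namespace]
    if missing:
        raise RuntimeError(
            "Run the earlier setup cell first; undefined: " + ", ".join(missing)
        )


# In a notebook cell, before the code that uses the variables:
# require_names(globals(), "experiment_name", "temp_directory", "final_directory")
```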
