#✨│ai-help
maybe try an audacity re-export, as .wav
use ElevenLabs or something else that gives decent TTS results
using style tts2 or xtts v2 with voice cloning should improve results by a lot right?
Ooor maybe that, yeah
#1 RVC Realtime states I have to have an index file
#2 RVC Realtime gives me this error when starting audio conversion
```
File "threading.py", line 980, in _bootstrap_inner
File "threading.py", line 917, in run
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\gui_v1.py", line 653, in soundinput
with sd.Stream(
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 1800, in __init__
_StreamBase.__init__(self, kind='duplex', wrap_callback='array',
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 898, in __init__
_check(_lib.Pa_OpenStream(self._ptr, iparameters, oparameters,
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 2747, in _check
raise PortAudioError(errormsg, err)
sounddevice.PortAudioError: Error opening Stream: Illegal combination of I/O devices [PaErrorCode -9993]```
Any clues
tts is weird with rvc, since they're basically just inserting a tts voice into rvc to infer that
you need an index file in its little area, but you don't need to use it
Ah
could be some random index file
that's when you use different audio type stuff
there's MME, WASAPI, shit like that
use the same type on both
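A rough sketch of what "use the same type on both" means in practice: PortAudio groups devices by host API (MME, WASAPI and so on), and the -9993 error shows up when the input and output come from different groups. The helper names below are made up for illustration, and the device table is invented; the dict keys (`hostapi`, `max_input_channels`, `max_output_channels`, `name`) match what `sounddevice.query_devices()` and `sounddevice.query_hostapis()` return.

```python
from collections import defaultdict


def pairs_by_host_api(devices, hostapis):
    """Group usable input/output device indices by host API name.

    `devices` and `hostapis` are lists of dicts shaped like the results of
    sounddevice.query_devices() and sounddevice.query_hostapis().
    """
    grouped = defaultdict(lambda: {"inputs": [], "outputs": []})
    for index, dev in enumerate(devices):
        api_name = hostapis[dev["hostapi"]]["name"]
        if dev["max_input_channels"] > 0:
            grouped[api_name]["inputs"].append(index)
        if dev["max_output_channels"] > 0:
            grouped[api_name]["outputs"].append(index)
    # Keep only host APIs that offer both an input and an output.
    return {api: io for api, io in grouped.items() if io["inputs"] and io["outputs"]}


def same_host_api(devices, input_index, output_index):
    """True when both devices belong to the same host API."""
    return devices[input_index]["hostapi"] == devices[output_index]["hostapi"]


# Invented device table, for illustration only.
EXAMPLE_HOSTAPIS = [{"name": "MME"}, {"name": "Windows WASAPI"}]
EXAMPLE_DEVICES = [
    {"name": "Mic (MME)", "hostapi": 0, "max_input_channels": 2, "max_output_channels": 0},
    {"name": "Speakers (MME)", "hostapi": 0, "max_input_channels": 0, "max_output_channels": 2},
    {"name": "Speakers (WASAPI)", "hostapi": 1, "max_input_channels": 0, "max_output_channels": 2},
]

if __name__ == "__main__":
    print(pairs_by_host_api(EXAMPLE_DEVICES, EXAMPLE_HOSTAPIS))
```

On a real machine you would pass in `list(sounddevice.query_devices())` and `list(sounddevice.query_hostapis())` instead of the example tables, then pick both devices from one group in the RVC realtime GUI.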
Ohh
Gotcha
Time to test out the voice ey
Oh wow it sounds pretty damn good
Not bad!
Definitely nothing amazing but surely great for my first try
Er first try whilst I know what I'm somewhat doing
i'm using the 40k pretrained model (i use the same one for all my models)
Normal pretrains get the job done 95% of the time
okay the audio file works when I open the discord app
but not on the discord website
amazing
def not bad for a first try at all
First one is the dataset Second is the ai voice obviously
Perhaps I could have better settings but the voice itself is pretty dang sweet
oh gawd that background noise, no wonder uvr did not handle that very well
It was a different dataset
That background noise is because im playing it through like
OBS and I recorded it all weird
That's not directly from the set
oooh
well ok I just raised the audio on the original video and there is actually background noise .-.
but it seems to have handled it well
I'd try raising the extra inference time to max, and lowering the pitch to like, 12 through 10.
Rvc is pretty good at handling some white noise, but only like soft fan kinda white noise, otherwise it's going to infect your model
can someone help me with using the mangio rvc thing
Yeah the dataset could 100% be better after raising the volume I can definitely hear issues
ye, another thing with uvr, it's not 100% consistent at dealing with white noise. If you listen back to the audio file you'll probably notice moments where it doesn't really catch it all
it's particularly bad with that if the white noise is decently loud, or changes in volume
like background music
and it's also not very good at getting rid of noise if its happening while someone is speaking at the same time
it'll like blend together
Guess I just gotta go and find better datasets then
what's up?
https://docs.aihub.wtf/ I suggest reading the Mangio part inside RVC > Local then 
Last update: Mar 10, 2024
```
File "threading.py", line 980, in _bootstrap_inner
File "threading.py", line 917, in run
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\gui_v1.py", line 653, in soundinput
with sd.Stream(
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 1800, in __init__
_StreamBase.__init__(self, kind='duplex', wrap_callback='array',
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 898, in __init__
_check(_lib.Pa_OpenStream(self._ptr, iparameters, oparameters,
File "C:\Users\BumiMandias\Desktop\RVC1006Nvidia\runtime\lib\site-packages\sounddevice.py", line 2747, in _check
raise PortAudioError(errormsg, err)
sounddevice.PortAudioError: Error opening Stream: Invalid number of channels [PaErrorCode -9998]
```
uhhh anyone experience this before?
No matter what input & output I use this happens now
thank you
Huhhh what is this

Exactly..
Yw :)
I suggest a restart and using a different type of audio API or whatever it's called
looks like an error with your audio devices
Oh I switched to mme only and it works now
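For the "Invalid number of channels" error (-9998), one likely cause is asking a device for more channels than it has, for example picking an output-only device as the input. A minimal sketch of that check, assuming device dicts shaped like the ones from `sounddevice.query_devices()`; the function name and the example devices are hypothetical.

```python
def channel_problems(devices, input_index, output_index, channels):
    """Return a list of human-readable problems; an empty list means the request looks fine."""
    problems = []
    max_in = devices[input_index]["max_input_channels"]
    max_out = devices[output_index]["max_output_channels"]
    if channels > max_in:
        problems.append(
            f"input '{devices[input_index]['name']}' supports {max_in} input channel(s), "
            f"{channels} requested"
        )
    if channels > max_out:
        problems.append(
            f"output '{devices[output_index]['name']}' supports {max_out} output channel(s), "
            f"{channels} requested"
        )
    return problems


# Invented devices, for illustration only.
EXAMPLE_DEVICES = [
    {"name": "Mic", "max_input_channels": 1, "max_output_channels": 0},
    {"name": "Speakers", "max_input_channels": 0, "max_output_channels": 2},
]

if __name__ == "__main__":
    # Asking the mono mic for 2 channels is flagged; 1 channel passes.
    print(channel_problems(EXAMPLE_DEVICES, 0, 1, 2))
    print(channel_problems(EXAMPLE_DEVICES, 0, 1, 1))
```

This only checks channel counts. It does not explain why switching to MME fixed it here, which may come down to how each host API reports the same hardware.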
this is so upsetting that the voice sounds so damn good but there is background noise
Would this be considered done..?
many such cases
sadge
Check mel and kl
pure sadness
perhaps ill pay this person to give me a version of it without background... 🤣
This mel?
lmao. def not a sus request
No... Not at all..
Still going, but I'd recommend stopping training and exporting model
So you can return later
If you say it's not done yet, why stop now?
what do you say if they ask, "but why tho?" xD
its very difficult to understand
I don't mind waiting for all 500 epochs to wrap up.
Cause Google Colab limits sessions
And it could make you lose the model 
It does, but I have enough compute units.
UhhHhhhhHhh I have a medical condition that makes it so I can't hear anything but background noise if it exists?
Ah
👌
Alrighty then
(I bought em a long time ago)
slicccck
(foolish I know)
so slick haha
Actually you know what I could do, to possibly clean it up. Run it through steelseries's ai thingy
You can always ask good ol GPT if you're confused
wouldnt work in the longrun but maybe ill try it if i can't find any datasets without background noise
Or you can explain the issue and we can try to help ya
it's really good at separating noise
i use it for my mic so i thought why not rvc too
way more than uvr honestly xD
My plan for now is to wait for it to finish training to 500 and test out different iterations of the model where either g or d graphs dip and see which one sounds best.
welp ig im on the hunt to find a new dataset without background noise
Ayo? @crystal gull level 8 !!! 
Normally the latest one sounds the best
Yes, this too
graphs are cool and all, but your ears are your most powerful tool to decide which checkpoint is best
But, either way, if mel and kl are still going down, model is improving, so it's most likely the best version of your model
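The "still going down" check can be roughed out in code if you copy the mel or kl values out of TensorBoard into a list. This is only a sketch with made-up function names and an arbitrary window size, not anything RVC ships, and it does not replace listening to the checkpoints.

```python
def moving_average(values, window):
    """Simple moving average; returns an empty list if there are fewer than `window` values."""
    if window < 1:
        raise ValueError("window must be at least 1")
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def still_improving(values, window=5, min_drop=0.0):
    """Compare the latest smoothed loss with the one `window` points earlier.

    Returns True if it dropped by more than `min_drop`, False if it did not,
    and None when there are too few values to judge.
    """
    smoothed = moving_average(values, window)
    if len(smoothed) < window + 1:
        return None
    return smoothed[-window - 1] - smoothed[-1] > min_drop


if __name__ == "__main__":
    falling = [20 - 0.1 * i for i in range(30)]  # invented loss curve
    print(still_improving(falling))
```

Smoothing matters because the raw curves are noisy; a single higher point does not mean the model stopped improving.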
chat gpt?
yeah
hm oka
Don't tell me you've never heard of GPT as we speak in this AI hub server
||no offense to noobies but the vast majority of people that touch chatgpt are morons and don't know how to properly use it ||
I swear it's like people ask chatgpt to do one specific thing, like "create me a unity game" or "code me a minecraft mod that does this" instead of asking it questions along the way and helping it explain to you what to do
Precisely.
It's not necessarily wrong to ask it to make something but it just doesn't understand specifics rather than small portions, hate the people who try chatgpt to code and go "CHATGPT SUCKS" because they don't know how to properly use it
Chatgpt / Copilot is amazing with code if you know how ai works 👌
Is this not common sense?
I see people all the time asking it to make huge projects or code a whole thing. As if it understands that
It's like people don't understand that ai is based off things already created
It's like asking your mom to make you a sandwich only to get upset when your mom put ham instead of turkey
^^^
no ive heard of it its just i dont use it if im confused about something
Ayo? @brittle wing level 2 !!! 
and i still dont know what im doing wrong
Take advantage of that.
Well what step are you at.. What exactly are you trying to do at the moment
make an ai voice cover
No shit sherlock.. Respectfully
But what STEP
There is no button called "make an ai voice cover"
Have you considered looking up a tutorial on YouTube? Those were exactly where I got started.
ig step 3?
Of what doc? The ai hub docs?
yes
So you're at the inferencing step and setting up RVC
Are you trying to install it locally or on a colab?
wait i alreaedy downloaded it
You also have to consider, do you have a powerful enough gpu or cpu for what you're trying to do. If not, you'll have to get someone else to do it. Or use a colab (cloud option in the docs)
i have a good cpu and gpu to be able to make it
What gpu?
little quick test here, loud ass music was in the background of this, but steelseries catches most of it. Like you can still hear some of it, but you could easily cut those parts out. BIG BRAIN
I am indeed a genius
how do i check that in the system settings
woopsies
Shows in the top right of that page
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Huggingface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
lemme see
right click task bar, task manager, performance
slowww
damn.
😛
okay i checked it
Gotcha and what do you have?
AMD Ryzen 5 3600 6-Core Processor
.. Thats a cpu
thought itd be helpful
Ah
go to the gpu tab
NVIDIA GeForce GTX 1650 SUPER
is that good
But I don't think you're doing too well with a 1650 super and ai
Well that isn't THE problem that's just the first step in deciding whether you're using cloud or local, I believe the cloud gpus are faster than that? I don't know 100% as I've only ever used local
im using local i think
Perhaps @slim geyser knows?
Yes if you downloaded it onto your pc then that's called local
But the issue is I don't know if that gpu can create an ai cover in a reasonable amount of time
Versus a colab and if it would be more worth to run on a colab
i have that then
I would recommend trying the colab. Doesn't take too long either.
Mares do you know if it'd be more worth for him to run in a colab versus local on a 1650 gtx?
Idk colab specs
In my experience, it's not bad.
You should be able to train with that, just pretty slowly.
Cuz you gotta pay for colab if you don't wanna get cut off so, not ideal if you already have a card that can train
Ah right
wait you have to pay for it?
yeah?
No
It's "freemium"
no im not
show you can train locally if you don't want to get randomly cut off by colab, just keep following the steps
because using colab for rvc and stuff like that is against the spirit of colab in the first place
You can use the colab for a good while at no cost. Once your free processing units run out, just switch to another Google account or make another.
im not trying to train im trying to use the voice conversion
they don't wanna give free compute to a bunch of little timmys making skibidi toilet ai covers
yeah thats what i mean*
oh well shit oh
If you already are using a pre-made vooice
voice*
You can do it just fine on your gpu
I thought you didn't already have one*
i did
idk if they fixed it, but google had blacklisted rvc notebooks in the past, where they only worked for long enough if you paid for compute
ive been trying to know why i keep getting an error everytime i use the thing
Using the mainline?
Would be nice if you showed the error
it just says error
In my experience, it works just fine. Both the hina mod voice conversion and the v2 disconnected.
im probably looking like an idiot but
I want to know how to make my own RVC models (mainly so I can put it through my voice so I can basically do an overcomplicated miku situation), but all I really know is to use other models, I forgot where I got my RVC engine, and for some reason I cannot supply a screenshot, so oof...
Here's a brilliant tutorial to get you started: https://www.youtube.com/watch?v=tnfqIQ11Qek&t=717s
Ayo? @junior halo level 7 !!! 
Thanks
But first, you must prepare a dataset properly.
Where are you sourcing your chosen voice from?
huh, guess they managed to dodge it somehow
Sure did. Except you can't just run it overnight. It'll give you the boot for inactivity.
For me, I'm gonna start with my voice, so I can just record me saying a lot.
Ayo? @quasi ether level 1 !!! 
Frankly.. I have no idea, how did you start the program.
Looks like the api having issues maybe?
i opened the mangio rvc, then go-web.bat
Ayo? @brittle wing level 3 !!! 
Record yourself speaking and/or singing in a variety of pitches. Avoid straying away from your "normal" voice. Record at least 5 minutes of audio. Then, truncate silence using these settings:
Though I won't need to now, Here's a screenshot of my RVC engine
(assuming you are using Audacity)
Which I do! :D
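If you want to double check that the recordings add up to the 5 minutes mentioned above after truncating silence, a small script can total the WAV lengths. A minimal sketch using only the standard library; the function names and the folder layout are assumptions, and Python's `wave` module reads plain PCM WAV files only, so other formats would need converting first.

```python
import wave
from pathlib import Path


def wav_seconds(path):
    """Length of one PCM WAV file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def dataset_seconds(folder):
    """Total length of every .wav file directly inside `folder`."""
    return sum(wav_seconds(p) for p in sorted(Path(folder).glob("*.wav")))


def long_enough(folder, minimum_seconds=5 * 60):
    """True when the folder holds at least `minimum_seconds` of audio."""
    return dataset_seconds(folder) >= minimum_seconds


if __name__ == "__main__":
    folder = "dataset"  # hypothetical folder holding your exported WAVs
    if Path(folder).is_dir():
        total = dataset_seconds(folder)
        print(f"{total / 60:.1f} minutes, enough: {long_enough(folder)}")
```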
Are you doing this locally?
I think so? Though if things go wrong, I have the video
No idea never seen that happen before, sounds like you started it properly. Perhaps a pc restart?
What's your GPU? Just to double check.
Is there any command prompt with errors in it?
(Hold on as I try to find the GPU of my laptop)
task manager > performance > gpu
from when i was trying to like use the ai voice conversion
i gave you like a 3 second headstart
ahhhh
I have very improper typing skills, or lack thereof.
LOL
im a little too good
Though just to say, I have a second faster computer, though I have no access to it as of RN
can i let you borrow some skills pls
gpu aint good enough ey
yes plz
will my cpu work with it?
ok they will be delivered tomorrow
your cpu is fine but your gpu aint shiii
"RVC SHIT" what a real folder name
lol
i dont wanna pay for it
which one should i use?
From what I have read online, it looks like this gpu isn't too bad performance wise.
Not sure if it can train okay?
that.. isn't.. a gpu
oh
Oh, Oof, Hold on, Messed up on my end
pf
gpu tab
colab time
unless you have a tab called GPU 1 after it
I knew it would come down to Colab.
both yalls gotta use colab 😭 lol
Colab ain't too bad.
which one is that ?
i feel bad for those who dont have the gpu to local, though i waited so long for my upgrade. I was sitting on a 1060 6gb for FOREVER
yo dj what should show use for ai covers ey?
Colab suffices for now.
do you know?
psh if you ever need something done faster than colab my dms are open ;p
I don't understand your question.
I'll make sure to when I have access to my better computer, send the GPU here just to see.
"show" is trying to make ai covers, do you need a specific colab setup for that or could you use something like applio?
I think aihub suggested using Ilaria?
-colab
so i click the first one?
yes but which should show use 😭
Try out what Kah uses ig
cant sign in
What
can't sign in, can't use it.
it keeps redirecting me to sign in
okay i got it
thats it i think im gonna go find a random girl at my school and pay her to stand infront of my mic and say shit for 15 minutes straight
im throwing hands, i cant find any good data
where would i use a tts with custom models
I have four words for you:
raw
videogame
audio
files
Pick any cute sounding character. Pluck the audio from the game files directly or an online archive.
what video game has an e-girl talking for 15 minutes
im gonna need to play that whole game
valorant
Ayo? @livid hazel level 2 !!! 
best e-girl
warframe copypasta moment
do you need a good egirl voice model?
yes
OOH
lol
bruh
Overwatch??
good point
Yeah?? :D
delta can i dm you/
yes
🤔 perhaps kiriko
I would not use audio of exaggerated characters if you wanna use it for real time. You'll just end up sounding like a cartoon character
Unless you're into that sorta thing, I guess???
ok, unironically one of the best egirl data fonts is vtubers doing reaction content on youtube
because they're reacting to a video they got no background noise of their own
just clip the parts where they pause the video
boom
Genius.
yeah there's ton of them
not really surprising considering how brain dead easy it is to react, but yeah
good for finding nice n' normal feminine speaking data as well, since not all of them do the cringe high pitched vtuber thing
Just one last question, what's your recommended epoch count for around 4 minutes 40 seconds of training data?
Ayo? @quasi ether level 2 !!! 
Start with 250 epochs. Keep your eye on the graph.
Ok
Anyone have a simpler way of realtime audio to microphone besides voicemeeter banana?
It's so bloated and has so many extra features i dont need
I followed the tutorial...
it's not Picking up volume no matter what i do.
Ayo? @sharp glacier level 2 !!! 
I'mma restart.
You need to run the cell that downloads the pretrained model.
i don't understand.
oh
Nor do I have experience with live audio conversion.
Turn on Sup 2 and change f0 detection to rmvpe. It's probably voicemeeter's fault or your mic isn't enabled for speaking
I'm training RN!
whats fo detection i don't see that
Click start
Instead of harvest use RMVPE
i got past that part but
50 mucking epochs to go. yay
Though from the looks of it, the loss/g/total graph doesn't look like it will go any lower again.
The other two graphs are still going down.
Shouldn't be a big deal. I'll see how he sounds.
you dont need voicemeeter, VAC could work without issues, whose download link can be found in pinned guide in #🔍│help-w-okada
Ayo? @proud elbow level 40 !!! 
So I have finished training my model, but how through the method I did could I get a ZIP out of it so my RVC engine could import it?
Anybody know why I can't uninstall RVC? I'm the administrator, I shouldn't need my own permission to uninstall but it refuses to give me access to this folder specifically.
i don't see it
guy's i'm sorry i'm so stupid..
but i'm not good with pcs
NVM
you should find it in the rickroll guide: https://rentry.co/VoiceChangerGuide
(ctrl+F VAC if not sure)
simply dont use that old application, get newer and better one
-local
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.
the issue is the guides for w-okada
i never found the one for the other thing
so i'm confused i only need w-o and the vac right?
RVC is portable
It’s not a install, you just delete the folder
you can uninstall voicemeeter now
Yeah by uninstall I mean delete the folder and it will not allow me to do so.
never follow youtube tutorials since they're old
The only folder on my PC that does this btw.
Can you delete other folders
What shows up?
Don’t use RVC-GUI
It’s very outdated
okay banana is uninstalled
Use original RVC instead
Ayo? @sharp glacier level 3 !!! 
Says I need permission from my own god damn account to delete it and also refuses to let me modify or change permissions for the RVC folder at all.
Ayo? @charred hound level 1 !!! 
ok
i opened the file and see a bunch of stuff what do i click..
You don't have to click anything. I don't remember doing that
if you cant open the guide unless using VPN, here's the link for VAC
https://software.muzychenko.net/freeware/vac470lite.zip
So I guess I just have a 10gb folder of shit that I'm never using again just sitting on my desktop now.
Did you download vac lite or vac trial
You should download vac lite
i mean leave the folder alone
Try restarting then trying to delete
Currently getting rid of old GUI, and installing OG
Or delete its contents one by one
i got what was sent to me by MJ earlier today..
which is VAC
Run setup64
Windows more like windowlicker
does anyone have one that bypasses this shit
my pc's not bad
and it's fine
tired of people saying "your pc is too low grade"
:/
run the setup64 (the 64a is for ARM cpu)
open the voice changer and set output to the VAC (Line 1 iirc)
How do I manually export a version of the model at a specific step count during my training?
Which files do I even need? The index and pth, right?
but theres 3 line 1s
I see an option to export it at a specific step/epoch count, but it says to not do so once the training is complete, which it is.
Should I try it anyway? Will it mess anything up?
input as what?
oh
mk
alright now how do i link it to dc?
or games
what about output?
alright thanks btw
I dunno which exact pth and index files to save.
How would I put in an audio file and get an ai audio file out?
The last what?
If I have to find a particular pth file, which index file do I save with it?
added_index
Wrong file
You open go web
If it’s not there now it is
If you don’t trust downloading from me you can download from the repo
Ok, I would like to ask how to put in custom AI models in Web GUI?
Put the .pth in the weights folder
Which is in assets
Ok
And put the .index in logs
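The two copy steps above can be sketched as a small script. The folder names `assets/weights` and `logs` come straight from the messages above; the function name, the example paths, and creating the folders if they are missing are assumptions for illustration. Dragging the files over by hand does the same thing.

```python
import shutil
from pathlib import Path


def install_model(rvc_root, pth_file, index_file):
    """Copy a model's .pth into assets/weights and its .index into logs.

    Returns the two destination paths.
    """
    rvc_root = Path(rvc_root)
    weights_dir = rvc_root / "assets" / "weights"
    logs_dir = rvc_root / "logs"
    # These folders normally exist in an RVC install; creating them is harmless.
    weights_dir.mkdir(parents=True, exist_ok=True)
    logs_dir.mkdir(parents=True, exist_ok=True)
    pth_dest = Path(shutil.copy2(pth_file, weights_dir))
    index_dest = Path(shutil.copy2(index_file, logs_dir))
    return pth_dest, index_dest


if __name__ == "__main__":
    # Hypothetical paths; replace them with your own before running.
    root = Path("RVC")
    pth = Path("downloads/voice.pth")
    index = Path("downloads/added_voice.index")
    if pth.exists() and index.exists():
        print(install_model(root, pth, index))
```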
Thx
so i try to extract this but it doesn't work
it's in the server like all the voice files. i went to downloads, hit extract all, browsed to my desktop ai voice folder and extracted, but it only keeps transferring as a zip.
i think it's broken
whats the command for the doc
try without the blob https://huggingface.co/Blocktoast64/The-AI-Bakery/tree/main
-colab
Ayo? @open mountain level 1 !!! 
now i'm having yet another issue
i can't hear my youtube audio.
and idk how to fix it. i got discord and games working, but for whatever reason yt now won't play audio. the only fix i found was uninstalling things, but does anyone know the correct fix?
I need help trying to train a new model, I keep trying the tutorials but they wont work for me
you tried the guides here? https://docs.aihub.wtf/ and they don't work
Yes but I think I’ll poke around at it more tomorrow, I’ve been up trying to do it for the last 3 hours - I’m not sure why but when it gets to the GUI part it won’t go through
Ayo? @alpine basin level 1 !!! 
Idk I’m new to anything so I’m probably doing something wrong
I'll let someone else answer that. I'm not familiar with training on GUI
also you can post in #1192011222023950368 with a pic attached
just in case someone sees it
I’m open to learning any way of training but yeah I’ll try again - thanks dude!
yea... idk how to fix yt.
Ayo? @sharp glacier level 4 !!! 
how do you tell you're overtraining
g total loss went up basically
How do I make an AI cover?
where do you check that
But seriously you could check #1159513888199540817 and find out there
Hi,
is there a way to make the AI sound crisp with high-pitched voices?
you had set it 48k in preprocessing before, so you have to start over on it
is there no way to convert from 40k to 48k without restarting completely
you have to either continue with 48k or start over completely with 40k (or vice versa?)
what does transpose do again
pitch shifting in inference
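For a feel of the numbers: assuming the transpose value is in semitones (which is how the RVC interfaces describe it), each semitone multiplies the pitch by 2^(1/12), so +12 is one octave up and -12 is one octave down. A tiny sketch with made-up function names:

```python
def pitch_ratio(semitones):
    """Frequency multiplier for a shift of `semitones` in equal temperament."""
    return 2 ** (semitones / 12)


def shifted_hz(frequency_hz, semitones):
    """Frequency after transposing by `semitones`."""
    return frequency_hz * pitch_ratio(semitones)


if __name__ == "__main__":
    for n in (-12, 0, 12):
        print(n, pitch_ratio(n))
```

That is why a common starting point is +12 when putting a male vocal through a female model and -12 for the reverse, then adjusting by ear.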
How do you check whether you're overtraining
Ayo? @proper sapphire level 3 !!! 
Last update: Feb 10, 2024
thanks
Please use the most up to date version of RVC to avoid these small mistakes
hey uh, I'm using AICoverGen-WebUI, by SociallyIneptWeeb (modified by Hina), and I noticed there is always a little bit of the original persons voice in the background in the output audio. Is there a way to prevent this?
FileNotFoundError: [Errno 2] No such file or directory: '/content/drive/MyDrive/rvcDisconnected/filelist.txt
how can i fix that
On average how long is an epoch supposed to take
i used that colab before and it didnt finish now i guess ill use applio
rvc disconnected always gives errors, i'm bored
is it better using Server Audio or Client Audio?
Ayo? @rocky fjord level 2 !!! 
you must use client
Ayo? @wanton sail level 1 !!! 
mmm...
why?
'cause you dont have a server or vds, that happens on local network so client
Autosave enabled
Depends on the length of dataset
And batch size
i disabled it but nothing change
what does batch size exactly do
do i need to set it from 0?
actually it is
because when I use Client then forward to Discord, my friend's voice somehow also gets converted
but when I use server, only my voice gets converted
I have a batch size of 2 and dataset that's 12 minutes long
I wouldn't really recommend using that if you're making AI covers, since it can fail on the vocal extraction part. Use MVSep to extract vocals and prefer other Colabs instead, imo that's the best option
Mmmm that'll take a while per epoch
what does batch size do
Trains x number of samples at once
Batch size 2 trains 2 samples at once.
Those samples are 3 or 4 seconds long...
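Putting rough numbers on that, using the 12 minute dataset and batch size 2 mentioned above. The 3.5 second segment length is just the middle of "3 or 4 seconds" and the function name is made up; the real slicer may cut differently, so treat the result as a ballpark.

```python
import math


def steps_per_epoch(dataset_seconds, segment_seconds, batch_size):
    """Return (number of segments, training steps per epoch)."""
    segments = math.ceil(dataset_seconds / segment_seconds)
    steps = math.ceil(segments / batch_size)
    return segments, steps


if __name__ == "__main__":
    segments, steps = steps_per_epoch(12 * 60, 3.5, 2)
    print(f"{segments} segments, {steps} steps per epoch")
```

With these numbers that is 206 segments and 103 steps per epoch. A larger batch size means fewer, heavier steps per epoch, as long as it fits in GPU memory.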
oh
yeah do everything again tbh
and see if it works this time
is having a batch size of 1 bad for learning rate
Yeah since it'll change parameters every sample
Use 2 and beyond
Im looking at my tensor board but for some reason loss/g/total isn't updating but everything else is
Sometimes you do need to click the refresh button again

alr
i set it to auto update
but it just didnt work
I'm at 56 epochs
how much more do you think I'll need before it plateaus
Mmmm I suppose it'll keep going for a while
Check mel and kl
Mm
Yeah it's probably still going
Though I'm kinda worried it's getting high kl and mel values
Test out and see how it's going, though. Values don't mean much if it sounds good yk
what do kl and mel represent
i have a problem with "RVC Realtime"
every time it infers voice, the first word glitches or gets cut
-for example, if i count 1 to 5, the 1 gets cut/glitches and the rest are fine
-if i then stop speaking for several seconds and speak again, it happens again
-it only happens on the first word after silence (at the start of inference)
any guess what's happening, or a solution?
Windows 11
i5 11400
RTX 3070 Laptop
Ayo? @placid holly level 9 !!! 
Kl = how different the model is from the dataset.
Mel = how well it's reproducing clarity from the dataset
Lower is better
Increase extra inference time perhaps
/what's Thresh do
it worked !
omg
Mic sensitivity slider, basically
yoooo
So, lower are good?
if my surrounding a bit noisy
If you have bg noise, maybe increase
Higher = less noise being converted
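One way to picture the threshold is as a noise gate: measure how loud each block of mic audio is and skip the blocks that fall under the threshold. This is a generic sketch of that idea with made-up names, not RVC's actual implementation. Samples are assumed to be floats between -1 and 1, and levels are in dB relative to full scale.

```python
import math


def rms_dbfs(samples):
    """RMS level of a block in dBFS; silence or an empty block gives -inf."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0:
        return float("-inf")
    # 10 * log10(mean square) equals 20 * log10(RMS).
    return 10 * math.log10(mean_square)


def passes_gate(samples, threshold_db):
    """True when the block is loud enough to be converted."""
    return rms_dbfs(samples) >= threshold_db


if __name__ == "__main__":
    speech = [0.5] * 100     # invented loud block, about -6 dBFS
    fan_hum = [0.001] * 100  # invented quiet block, about -60 dBFS
    print(passes_gate(speech, -40), passes_gate(fan_hum, -40))
```

Raising the threshold (say from -70 to -40) blocks the quiet hum while speech still gets through, which matches "higher = less noise being converted". Set it too high and the start of quiet words gets cut.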
Can you share your settings here real quick?
Just wanna see what else you set up so maybe we can change something and make it work better
Ah I see, thanks
is there a way to stop training on this version
Perhaps it's doing that because of index?
I'd say mess around with the loudness factor and response threshold stuff, too
Haven't used RVC realtime yet, so I'm hoping it can be fixed like that
You can close command prompt, as long as it isn't saving the D/G files you should be fine
I see your save frequency is 10, so, every 10 epochs it'll save the D/G files
If you're on epoch 60 for example, I suggest waiting until epoch 61 to close it
So you can be sure everything is fine
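The save frequency logic above, sketched out. It assumes checkpoints are written on epochs that are exact multiples of the save frequency, which is how the advice above reads; the function names are made up.

```python
def is_save_epoch(epoch, save_every):
    """True when this epoch is one where the D/G files get written."""
    return epoch % save_every == 0


def last_checkpoint_epoch(current_epoch, save_every):
    """Most recent epoch at or before `current_epoch` that was saved."""
    return (current_epoch // save_every) * save_every


def epochs_lost_if_closed(current_epoch, save_every):
    """How many epochs of progress are thrown away by closing now."""
    return current_epoch - last_checkpoint_epoch(current_epoch, save_every)


if __name__ == "__main__":
    for epoch in (60, 61, 67):
        print(epoch, is_save_epoch(epoch, 10), epochs_lost_if_closed(epoch, 10))
```

So with a save frequency of 10, epoch 60 is the risky moment because the files may still be writing, epoch 61 is safe and loses almost nothing, and closing at epoch 67 throws away 7 epochs of work.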
really, I'm confused
Sometimes, the voices of other people in the voice channel also get converted by RVC
although the input is Microphone
it fixed it, but sometimes it's still there, just not as common
Okay thank you
ah sorry, it still there, it back again
Feels like mic bleed
Not really sure why this is happening tbh
You're welcome :)
yeah it's really bad rn
I'll probably leave it on while I sleep
Oh that's probably your dataset 
My data set seemed fine
No background music
no other people talking
This happens if Extra is above 1.01~ish i think. It has been brought up before but not fixed yet
So feel free to lower it to that, or around that number, and try around
It only sounds muffled in the beginning if the voice changer has been idle for ~5-10 seconds
Perhaps it sounded too muffled?
Anyway, keep training and see how it goes
i'll try your solution later
Maybe it's RVC screwing up, too
I don't doubt that
is having a little echo bad
Ayo? @proper sapphire level 4 !!! 
i can send you what i'm using
Mmmm yes
Oh yeah don't really post datasets in here 
Forgot to tell ya
oh
Taking a quick listen, there are some noisy samples inside
The first few seconds seem decent but have some slight bg noise... 
Did you use any type of isolation for the dataset?
no
Mmmm
I suggest doing so, maybe use MVSep Demucs DNR, that seems to remove noise and background noise quite well
https://mvsep.com - also create an account for lower queues
Yes
ok
How do you prevent voice cracks and make ai covers that sound more natural?
Use an audio with quality and a model with quality, then you can edit it using any DAW you like to sound more decent
Any RVC link that's working?
I don't know what DAW is
Ayo? @sturdy sluice level 1 !!! 
uhh
convert the audio maybe
i tried both wav and mp3
strange ... wav and mp3 works for me
Oh wait maybe it's too long
Try separating 5 mins of ur dataset then, it should work
iirc mvsep will cut audios more than 10 mins
Try to use the lead vocals only, voice cracks can happen with poorly isolated lead vocals or when you're using all vocals at once
Didn't do that for me though... 😭
if I use an altered dataset should I restart or continue from what Ihave
You'd have to train from the beginning
doesn't work
I followed the guide on the website about isolating voices. Is there a better way to isolate them?
You should be fine for the most part...
Though I think you should use better models nowadays
If you're on MVSEP - use BSRoformer (most recent model type) to get the best vocals, and use Ultimate Vocal Remover HQ on model type "UVR-BVE" to get the background vocals and lead vocals
ok it fixed it, thank you
mine better at 2.99, lower better but i lost accuracy
So BSRoformer is for getting the instrumentals and UVR-BVE is to get lead and non-lead vocals?
yeah BSRoformer gets all vocals and the instrumentals
Alright, thanks. I have never used MVSEP, so I'll give it a try
Alright I'm going to leave it training for 300 epochs and see how far that gets me
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Huggingface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Anyone got Evelyn?
-help
-local
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.
Ayo? @gentle merlin level 1 !!! 
-gui
Is there any way MMVC (AI voice changer) can work with AMD GPUs?
Or only cuda cards?
Thanks
Ayo? @errant prairie level 1 !!! 
Should I just dump it in a folder with ai voicechanger i guess? Right?
How do I fix this??
TypeError: Cannot read properties of null (reading 'modelSlots')
at i (http://127.0.0.1:18888/index.js:2:1305771)
at Object.updateServerSettings (http://127.0.0.1:18888/index.js:2:1306003)
kk
Mod?? Admin??
what's a good mel and kl
You're sure you followed all the steps correctly? Also you can't change the folder name once the model finishes training (or gives an error)
Yeah the issue was fixed I think I was trying to retrain under the same name without deleting the previous files
After a few restarts and name changes it fixed
Web Server Launch Exception, DLL load failed while importing beatrice_internal_api
how to fix?
And it trained correctly?
Yup
Hello, i have a fully trained rvc model i think (i mean i have the voice that i want) but i can't apply it to the song that i want, can someone help me
Can someone help me
can anyone help, i have an amd graphics card and it won't recognise it at all, it keeps using cpu
i take it this means im overtraining
what is this?
bro colab time is over and i couldnt find .pth files
it was 500 epoch, i guess rvc disconnected time was not enough for this epoch
Ayo? @wanton sail level 2 !!! 
-overtrain
All-In-One Guide on how to make a good model
This guide explains how the D and G files works and much more: https://rentry.org/RVC_making-models
Credits: LUSBERT 
Automated Overtraining Detection (AOD)
Will be available soon in #1159513888199540817
Credits: grvyscale
what colab are u using?
MVSEP keeps giving me an error saying "This is not an audio file"
I've tried both WAV and MP3
rvc disconnected
At the end of the colab runtime, in the case of RVC Disconnected, any models that were not manually saved will disappear.
Its normal that it uses almost 60% of my CPU when im using it?
Hi i have a model but i don't know where to load it to make an AI cover
Does anybody have an issue of MVSEP not letting you upload any audio
Where i can use the model to make the song
how can i fix this error in colab? (GPU Check)
Exception Traceback (most recent call last)
<ipython-input-23-74d578b798b3> in <cell line: 21>()
23
24 else:
---> 25 raise Exception("No GPU detected; training cannot continue. Please change your runtime type to a GPU.")
26 gpus = "-".join(i[0] for i in gpu_infos)
Exception: No GPU detected; training cannot continue. Please change your runtime type to a GPU.
p.s. i have tried changing runtime doesnt fix it
Ayo? @slate bramble level 1 !!! 
run out of gpu, use a different account or wait 12-24 hours
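A quick way to see whether the runtime actually got a GPU before running the training cell is sketched below. It just calls `nvidia-smi -L` (which lists NVIDIA GPUs) through the standard library; the helper name `list_nvidia_gpus` is made up, and this is not the check the Colab itself runs.

```python
import shutil
import subprocess

def list_nvidia_gpus():
    """Return the GPU lines printed by `nvidia-smi -L`, or [] if none are visible."""
    if shutil.which("nvidia-smi") is None:
        return []                      # no NVIDIA driver tools on this machine
    try:
        result = subprocess.run(["nvidia-smi", "-L"],
                                capture_output=True, text=True, timeout=10)
    except (OSError, subprocess.TimeoutExpired):
        return []
    if result.returncode != 0:
        return []                      # tool exists but no usable GPU
    return [line for line in result.stdout.splitlines()
            if line.startswith("GPU ")]

gpus = list_nvidia_gpus()
print(gpus if gpus else "No GPU visible; the runtime is CPU-only right now")
```

If the list is empty even after switching the runtime type, that matches the "ran out of GPU" case: waiting or using another account is the only fix.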
OK thank you!
depending on what colab wants
ive been working on this voice, do u have any recommendations on where i can find a code to help prevent timeout?
if not tysm anyways 🙏
google-
Sorry do you know where i can use already a model to make a AI Cover
Suggestions for @worldly oasis
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Huggingface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
do u have the model?? if not and u want to make a AI voice use that^
I have the model
what ai cover are u refering to
u can use text to speech or RVC n say the words/sing wtv
all i will say bc idk if i can even help you 🙏
ok i used chatgpt it told me a code lol but when i start the code no other cell will start if its active advice?
main issue i keep timing out thats why im having this gpu issues 😭
i done the vocals and i have the full model
Is it free
Before i used Google Colab but they ruined it
Ayo? @worldly oasis level 2 !!! 
Yea but it's not the same as before i already have the model just i need it to place it to some music but don't know how
is there any easier way? before i used google colab but now it's not the same
ok if i understand right u are trying to make a song right?
just to put the model in some music to make it with that voice
yes like Justin Bieber played by Selena Gomez
do u have the audios?
u have the voice lines already ??
Yes
so u just want to add music and voice audio together?
I need it to sound like her singing a song that's not hers
ooo wait ok we are talking ab model sorry i cant understand very well
i mean that song already is played by someone and i need it to change that sound with mine
you need help training it
What mvsep option is recommended for removing artifacts for datasets
Oh sorry guys i don't know how to describe it
let's say i have Sel's voice and want it to be on a JB song but don't know how
i already have done training vocal model
is it possible for someone to answer this? if not it's ok T_T
What mvsep option is recommended for removing artifacts for datasets
yesss but howww
Hi guys, I need help with my gpu, i can only chose CPU and nothing else
Intel® Core™ i7-4600U CPU @ 2.10GHz
Ayo? @violet narwhal level 1 !!! 
oh no 
You'd have to use Colab yeah
If I choose the gpu option, it is normal if it still uses cpu instead of the gpu? (it uses like 30-60% of my cpu when im using it)
You're welcome
If you're on AMD, that can occur
yea I use AMD
Have you exported your downloaded models to ONNX though?
Ayo? @muted whale level 1 !!! 

Definitely do that, since it'll help performance wise. It'll use CPU though, that's just how the AMD version is, sadly
https://rentry.co/VoiceChangerGuide - check the "uploading models to AMD" section
EOL - No further Updates
Github - Blanc-dot
Discord - Blanc_dot
Despite being end of life, most if not all information has not really changed, so should be very accurate until actual new stuff comes out.
Other Links
Antasma's Local Error Fixes
Antasma's Colab guide
Sushi's useful Links - You need...
And what is this supposed to do?
Basically, converting to onnx makes it actually use your GPU
It will make it work better? I mean, when I import a new voice, this one just gets really laggy and it doesn't sound natural
Yes, it'll make it not laggy, at least
Which one is better for medium quality English dataset? Titan or Original pretrain?
Just go for original
Ehhh, not so sure about that, but you can always configure the Extra, Index and Tune parts to make it better
what does the index do?
Applies the model's accent to your voice
(or tries to, yk)
@proper shale should i go for 32k or 40k training by looking at this spectrogram ?
32k
thank you
since it doesn't go a lot beyond 16kHz
If it really went beyond that, up to 24kHz with actual, usable data, yeah you'd go with 40/48K
can somebody help me, I keep trying to make a model but it never lets me get past this part and I've been following the guide to a T
it barely goes to 22kHz
and it feels like it's not really data, more like filler
dithered stuff
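The rule of thumb here is basically Nyquist: a sample rate can only hold frequencies up to half its value, so 32k covers up to 16 kHz, 40k up to 20 kHz and 48k up to 24 kHz. A tiny sketch of that choice, assuming you read the highest frequency with real (non-filler) content off the spectrogram yourself; `pick_sample_rate` and `SUPPORTED_RATES` are just names for this example:

```python
SUPPORTED_RATES = (32000, 40000, 48000)   # the usual RVC training rates

def pick_sample_rate(highest_useful_hz):
    """Return the lowest rate whose Nyquist limit (rate / 2) still covers the data."""
    for rate in SUPPORTED_RATES:
        if highest_useful_hz <= rate / 2:
            return rate
    return SUPPORTED_RATES[-1]            # nothing higher to pick

print(pick_sample_rate(16000))
print(pick_sample_rate(22000))
```

Filler or dithered content near the top of the spectrogram should not count towards `highest_useful_hz`.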
Have ya linked it with Google Drive yet? Hmm
yeah
That is... weird tbh
I suggest switching up Google accounts, putting that zip in the rvcDisconnected folder, and trying again

Hi, i wanted to ask for some help with the installation of python requirements which doesn't seem to work for me, it crashes with many logs and this error
ModuleNotFoundError: No module named 'distutils'
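One possible cause, going only off that error line: `distutils` was removed from the Python standard library in 3.12, so requirements written for older Python can fail to build there. A small check to run with the same interpreter (the function name `distutils_report` is made up for this sketch):

```python
import importlib.util
import sys

def distutils_report():
    """Describe whether `distutils` can be imported by this interpreter."""
    version = "%d.%d" % sys.version_info[:2]
    if importlib.util.find_spec("distutils") is not None:
        return f"Python {version}: distutils is importable"
    if sys.version_info >= (3, 12):
        return (f"Python {version}: distutils was removed from the "
                "standard library in 3.12")
    return f"Python {version}: distutils is missing from this environment"

print(distutils_report())
```

If it reports 3.12 or newer, installing the Python version the guide asks for, or installing `setuptools` (which ships a replacement `distutils`), may get the requirements to install.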
When training using Applio, how can I manually analyze graphs to know if I'm overtraining?
With the RVC can you put the google drive link into 'Inferencing voice' or is there another step?
Hi
i would like to know if this has something to do with some errors going on in the colab version
Last update: Feb 10, 2024
What colab are u using?
That's outdated
Did you download the correct W-Okada version that matches your GPU?
Which GPU you have?
Oh lol, I've been using Applio Colab and damn I'm lucky, it says the runtime disconnected but also that it's waiting to finish the current execution, and that actually just continues to train the model
Hi, im trying ilaria RVC but it doesn't upload any model
would you mind helping please :c
my model doesn't appear
This might be a stupid question, but my voice dataset is in 48khz. Do I use the 48k sample rate
Thanks for helping!
yes
thanks
Not necessarily, check the spectrogram
Normally I just use 32k, 48k can bring noise and doesn't bring any big benefits
Use the most recent one instead: https://colab.research.google.com/drive/1mHKTGH5e3SAyDSBss1KtiYRbDdQzwSMs
could this be a possible point of over train?
Check mel and kl. seems to still be going
yeah it's still going
no
Nope.
Hello.
When I try to download my model from the EasyGui colab (gradio), I get an error message :
FileNotFoundError: [Errno 2] No such file or directory: 'assets/weights/My-Voice'
What shall I do ?
i have everything downloaded but it doesnt pick up my mic, what should i do?
can anyone help, i have an amd graphics card and it won't recognise it at all, it keeps using cpu
alright, thanks
mine just says "cpu", i doubt that the rx 580 is supported but if it is, tell me what im doing wrong.
Refresh
First part explains it
Depends on cuda for GPUs so you can only really infer
Which one is better for training? Applio or RVC Disconnected ?
im using the ai voice changer but it only runs on cpu cuz i have amd and not nvidia
cuda is on nvidia
LoL
u got the same issue?
Ayo? @maiden remnant level 1 !!! 
At least with applio yes why it told me only nvidia graphics xd
MMVCServerSIO_win_onnxdirectML-cuda_v.1.5.3.15
someone said it might work on this version
though i cant try rn
luck bro
Ayo? @weak flame level 1 !!! 
Did you download the directml version
HF Spaces runs in the cloud so your GPU doesn’t matter
yes
now i downloaded an older version and i'll see if it works
how do i do this
VoiceChangerV2 Initialized (GPU_NUM(cuda):0, mps_enabled:False, onnx_device:CPU-DML)
see
How do I monitor my voice with rvc?
Ayo? @tranquil raven level 2 !!! 
Are there any video tutorials to rvc's gui
Ayo? @violet narwhal level 3 !!! 
:C
Ilaria is not working also :c
You finished your daily google colab gpu
either wait till tmr or use another google acc
thats the reason why its not working
how much should I wait
either around a day, or use an alt google acc
hey
Ayo? @maiden remnant level 2 !!! 
how can i use amd gpu with ai cuz the cuda shi wont allow me
its not really of a question related to rvc but i want to try to make music using it, i can do that but i cant seem to figure out how to put the background music on the voice over, everybody else's seem so perfect
how do i get this working
i got it running
i keep getting error though after moving some models into the rvc folder
Kl should be around 1 to 0, mel should be around the mid 10s
i see
my total/g is weirdly bumpy but my mel keeps going down
should i train it more
mel keeps going, it's still improving
alr
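For reading the graphs by hand, one rough way to turn "mel keeps going down" into a check is to compare the average of the most recent values with the average of the ones just before. This is only a sketch: `still_improving` and the window size are assumptions, and the values have to be copied from the training logs or TensorBoard yourself.

```python
def still_improving(losses, window=5, min_drop=0.0):
    """True if the mean of the last `window` values is lower than the
    mean of the `window` values before them by more than `min_drop`."""
    if len(losses) < 2 * window:
        raise ValueError("need at least 2 * window loss values")
    recent = sum(losses[-window:]) / window
    previous = sum(losses[-2 * window:-window]) / window
    return previous - recent > min_drop

mel = [20.1, 19.4, 19.0, 18.2, 17.9, 17.1, 16.8, 16.0, 15.7, 15.2]
print(still_improving(mel))
```

Averaging over a window is what keeps a bumpy curve like total/g from giving a false alarm on a single spike.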
So, how do I convert wav to pth?
you train a model, using the .wav as dataset
Last update: Mar 10, 2024
has everything you need to know about training and RVC in general
Which one is better for training? Applio or RVC Disconnected ? I'm using Google Colab
change the extension 
training the model as pth from wav files as dataset is not called "conversion" tho
Hi guys, just wanted to ask what is currently the best quality audio cloning open source repo/colab going around nowadays, since a lot of variants are coming up
yw :)
Test
What's the best extra inference time and sample length setting for rvc real time voice changer
For a RX 570
either applio or mainline, can't go wrong with em
https://docs.aihub.wtf/ for guides n stuff
Last update: Mar 10, 2024
test and see what works best, tbh
So I have to do trial and error
⠀
Settings for AMD GPUs 
Don't forget that your models need to be converted to ONNX!
F0 Det.: rmvpe_onnx (suggested for all series)
7xxx XT cards: 112-128 chunk | +16384 extra
6xxx XT cards: 128-192 chunk | +16384 extra
5xxx XT cards: 192-256 chunk | +8192 extra
RX 580: 192-256 chunk | +8192 extra
RX 570: 192-256 chunk | +8192 extra
RX 560: 256-384 chunk | +8192 extra
Advanced Settings
Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low
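The same list as a lookup table, in case someone wants to script it. The numbers are copied from the list above; the dictionary layout and the `suggested_settings` helper are made up for this sketch and are not part of w-okada.

```python
# (min, max) chunk range and extra value per card family, from the list above
AMD_SETTINGS = {
    "7xxx XT": {"chunk": (112, 128), "extra": 16384},
    "6xxx XT": {"chunk": (128, 192), "extra": 16384},
    "5xxx XT": {"chunk": (192, 256), "extra": 8192},
    "RX 580":  {"chunk": (192, 256), "extra": 8192},
    "RX 570":  {"chunk": (192, 256), "extra": 8192},
    "RX 560":  {"chunk": (256, 384), "extra": 8192},
}

def suggested_settings(card):
    """Return the suggested starting settings for a card family."""
    if card not in AMD_SETTINGS:
        raise KeyError(f"no suggestion listed for {card!r}")
    # rmvpe_onnx is suggested for all series
    return {**AMD_SETTINGS[card], "f0_detector": "rmvpe_onnx"}

print(suggested_settings("RX 570"))
```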
⠀
I already know this list exists
this is for Okada
yeah, tbh
"Sample length: The realtime voice changer works by sending small chunks of audio for quick conversion, then stitching them together.
Longer sample lengths feed in longer chunks, making the stitches less obvious and reducing GPU requirements but increasing output latency.
On a low end GPU, setting this too low will make the GPU unable to keep up and produces stutters.
On a high end GPU, setting this too low will cause warbling as an artifact of stitching many overly-short chunks together.
Equivalent to "CHUNK" in w-okada."
"Extra inference time: How much old audio to load into each chunk.
The extra context usually improves voice quality for the generated chunk but is more demanding for the GPU.
Equivalent to "EXTRA" in w-okada."
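A toy version of what those two settings describe: the stream is cut into chunks of new audio, and each chunk gets some already-heard audio in front of it as context. This is a simplified sketch with plain lists, not the actual RVC or w-okada code, and `iter_chunks` is a made-up name.

```python
def iter_chunks(samples, chunk, extra):
    """Yield (context, new_audio) pairs.

    chunk: how many new samples each conversion step handles (sample length)
    extra: how many previous samples are loaded in front as context
    """
    for start in range(0, len(samples), chunk):
        context_start = max(0, start - extra)
        yield samples[context_start:start], samples[start:start + chunk]

for context, new_audio in iter_chunks(list(range(10)), chunk=4, extra=2):
    print(context, new_audio)
```

A bigger `chunk` means fewer steps and fewer stitches but a longer wait before each piece comes out; a bigger `extra` means more audio to process per step, which is where the added GPU load comes from.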
Ayo? @brittle wing level 1 !!! 
Found this on some sort of youtube comment
yea this is right
K thanks✌️
You're welcome :)
If I set the extra inference time setting to the max, is that fine for my RX 570 8GB variant?
It seems to be using 35-45% of my GPU usage
Setting that to the max kinda screws things up
In task manager
Do you think 3.25 should be a sweet spot
It's giving this spiky graph on the 3D encoding
See if that works out
Do you think -50 response threshold is too sensitive
I know the fact that setting it to -40 literally makes the client pick any sound from your mic
Depends on your mic
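To see why it depends on the mic: a threshold like this is usually a level gate in dB, where 0 dB is full scale and quieter audio is more negative. The sketch below assumes that reading (audio below the threshold gets treated as silence); the function names are made up and samples are floats between -1.0 and 1.0.

```python
import math

def rms_dbfs(samples):
    """RMS level of the samples in dB relative to full scale (0 dB = 1.0)."""
    if not samples:
        return float("-inf")
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return 20 * math.log10(rms) if rms > 0 else float("-inf")

def passes_gate(samples, threshold_db=-50.0):
    """True if the audio is loud enough to get through the gate."""
    return rms_dbfs(samples) >= threshold_db

quiet_noise = [0.001] * 100            # around -60 dB
print(passes_gate(quiet_noise, -50.0))
print(passes_gate(quiet_noise, -70.0))
```

Measuring the level of your own room noise this way and setting the threshold a bit above it is one way to pick a value instead of guessing.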
Only when using harvest as your pitch extractor
What if I'm using rmvpe
Is it basically useless
Yup
Should I just set that setting to 0 or something lol
yeah, ig
Hello, I have the voice of a person that I would like to use on RVC, but it is only a voice message, how do I do the index and the other thing I don't understand?
so you want to clone that voice, right?
https://docs.aihub.wtf/ has everything you need to know about training n stuff 🙏
Last update: Mar 10, 2024
It's a friend who authorizes me to take this voice to troll someone however it's a 19 second voice message, how do I put it in the index and the other thing? so yes clone I think
Is a RX 6700 XT somewhat better than the RTX 3060 12GB variant
Yeah you have to download that voice message (or ask them to send a new one) and do the steps to train your model
If both are used in RVC
For AI? nah
Since NVIDIA stuff is just plain better for AI
Against the RTX 3060 for AI
i cant find it
there is nvidia or gpu install links but i cant find the downloand link of the app