#✨│ai-help
1 messages · Page 200 of 1
Bleh, oh well I'm desperate at this point.
Training locally will be a good option according to me.
Okay then I'll trust your expertise and we'll go for locally.
So there is no other option.
So now you have to download RVC
If we are going to do this locally I'll have to set up the other PC, so I have to register a Kaggle account and Install RVC, which version?
If you want to do locally, no kaggle account is needed. But if you don't want to use any other PC, then use kaggle instead
For local Version, you have to download RVC, for kaggle you will get a notebook which you have to execute (no need to download anything in kaggle)
But you said local = better, so what version do I need to install on my other PC?
I came across a few.
Some I coulnd't figure out how to install.
That's how I landed on the one that opens Gradio.
There are many forks.
- Mainline (original RVC)
- Applio (modified RVC with advance features)
Are these two difficult to install?
To give you insight into the walnut that houses my brain, this was already too difficult for me: https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
This is outdated BTW
No additional scripts to run and stuff just unzip and click to use?
Yes.
Oh had no idea 😅
Waana continue with Applio ?
Uhm is that the better one of the two or something?
What ?
I'll install whatever you think is best if its applio I will install that
Applio is good, user friendly. But both use same algorithms. So no need to worry about quality
For my personal choice, I use applio
Anything else setup related that I have to do?
No.
Cuz if it's just insttalling applio I'll focus on that and if its done and setup I will come back.
Okay I will set everything up, hopefully you're still around later 😬
May I tag/ping you?
Thanks.
I will be back later then, the other PC is sitting on the floor so I first have to physically set everything up.
can someone help me idk how to open rvc
Hey, son! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
please
Which rvc are you using
Mainline or applio?
If appio. Runapplio.bat
can i use applio as realt time
No. Applio is different thing. For realtime w-okada is best
thanks
Do u need help installing Wokada
hey i was installing the thingy here but like it's been so long and it still didnt do done any ways i can fix this?
it just stops
around here
and it doesnt load the voice changer itself
For MMVC and W-Okada, go to #🔍│help-w-okada. This #✨│ai-help channel here is for RVC.
oh mybad
Hello everyone
I want to use applio from my pc to my mobile
On local host
I don't want a public link.
@hallow thistle what are you typing? It's too long 😪
You can tweak some command line for Gradio to let it run on public link. However, one of the only ways to let Gradio to share link over LAN instead of internet is to plug LAN adapter from your phone to your PC, but I find this method to be impractical.
Bhot my pc and my mobile are connected to same network
I think this way is a bit better than connecting your phone to a LAN adapter on the same network. But you might wanna do some coding a bit.
In app.py ?
More likely.
Where should I add it
Please tell me @hallow thistle
@simple ore how do I access applio from my LAN devices
edit run-applio.py and add --share ?
Then ?
I want that I can access it from my phone but without internet. My phone and PC are both connected to same network
Okay I think it's installed, I should be ready to go now.
then you just run it and the gradio interface should be available from the network?
Does Applio need to be connected to Discord?
You can change launch_gradio in app.py to the following, so it will allow network connections without creating gradio link:
def launch_gradio(port):
Applio.launch(
favicon_path="assets/ICON.ico",
server_name="0.0.0.0",
share="--share" in sys.argv,
inbrowser="--open" in sys.argv,
server_port=port,
)
I need to type that into CMD?
That was in response to other message
ye RVC and wokada are different programs
Don't want to use on network. Lemme explain. I'm sitting in Same room same house and we both are connected to same wifi (without internet) and I want to access my local host from my phone (as we do in apache server for web servers) without internet. How do I access Applio GUI in my phone. I mean I'll be the client but the Applio will be hosted by my pc in my own LAN network
Lemme try
"from the network" = from the local network, if that's not clear
I want to access localhost:6969 on my mobile.
Okay. now how do I access it on my mobile without using internet.
My pc will act as server and I will be a client
You need to check IP address of your PC and use it on mobile, like http://192.168.0.2:6969
Ohh that's what I want
A huge thank you 😆 you are the only who understand what I want to say
A huge thank you again
A question can I turn off my cellular data. Because my phone is connected with hotshot with my PC. If I do that, will it work?
If you mean whether you will still be able to access Applio on your server, then yeah. It won't affect this in any way
Can someone help maybe?
Hey, ◂Maartje▸! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Here's also the screenshot for Tensor.
I got the advice to use Applio, downloaded it try to launch it but I get these two errors. I just want to create a voice model.
It works. Thank you. Can I normal use it for training and other things?
I have changed the code as you said
Will it impact on applio ? Do I have to change that code again and again?
it's not an error, that tensorboard will show data only after u actually start training
No, this needs to be changed only once. If you update applio, then you'll need to apply this code again. Everything else should work as usual
did u actually start training after getting the dataset and putting the right settings
I don't know how I can't open Applio.
could u elaborate more
like what's in the cmd, and what's ur pc gpu
and which tut are u following
This is what I get if I try to launch it.
I'm not following one, tried to but in the first few minutes everything already looks completely different to what I have.
It was said that Applio is user friendly, but is there maybe something super duper user friendly maybe this still is just too complicated I don't know?
I have my voice samples clean and ready to go in WAV format, I just need a program where I can get to the training part. Was halfway there with colab but that one is broken now.
but is there maybe something super duper user friendly maybe this still is just too complicated I don't know?
Nope, AI is not meant to be as friendly as installing a Steam Game, it's computer science research still, also running it locally is harder on cloud
Ofc Applio tries to be as user friendly as possible
The literally EASIEST thing in the world would be maybe Weights.gg (cloud) or Replay (local version of weights.gg basically), but I have never tried Replay, but they both use RVC
Try opening http://127.0.0.1:6969 in your browser
mmm it seems to be running on your browser actually if u open 127.0.0.1:6969, but I have never seen the issue above before, and wasn't able to search for it neither in this server nor IAHispano
not sure if that matters
maybe @simple ore knows to what it's related
YES!
Now it's open, do I need this link every time?
yes
however idk why it says that issue above, not sure if it matters
what's ur pc gpu?
Quadro P4000
Is there a recommendation you can give to how much training I should do?
Like is there a average min/max.
Last time I did 500, would that have been enough?
is it the 8gb vram one
there isn't
you gotta follow the tensorboard
there isn't any average
It should be 8
Last update: Dec 24, 2024
should be good enough iirc
Well let's see how far I get, at least now I have buttons I can press.
Thank you so much, Dr Jr thank you as well with your help earlier : )
yw
@ionic geyser this is the right channel to ask how to make ai covers
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
I gave you only cloud ones since u showed me u got integrated graphics
what are the ones for good pcs
u told me u got integrated graphics
it's useless to do it locally
else show me ur task manager
maybe u got gpu 0 and gpu 1
locally is okada?
bc u said u got a gaming laptop
what? wokada is realtime voice changer, it uses RVC but for realtime
RVC does AI Covers and train models
Locally = runs on ur pc
cloud = runs on remote good pc
here look
show me the gpu 1
the gpu 0 is integrated which is not good, maybe gpu 1 is better
that's not that good
it's a old 3050, laptop gpus are weaker, and it got only 4gb of vram
.okay it still works good for me
so what do i do
i run most programs good
i need to make music with ai
AI takes more computing, your PC isn't that good
but i would suggest u cloud
ur pc is gonna be kinda slow
and how do i do it locally
is there a tut
whats the difference between cloud and local like
isnt it the same quality
no yt tuts
only written ones
I literally gave u the link, it's an hyperlink (blue text that are a link when u click them)
is it hard to install
all yt tuts are old
is it hard to install
@low shard
and whats the diff isnt cloud the same quality as local
same quality,
BUT:
Cloud = runs on REMOTE GOOD PC, meaning it willl be faster, but has limited gpu time
Local = runs on ur pc gpu, meaning it will be slower as ur pc gpu isn't that good
have Patience
i'm writing
😭
i can't type in a sec
u just gotta read, ofc this isn't downloading a Steam Game
so no diff
even if u have a good computer
read it all, there's differences man 😭
don't just read the first 2 words
the diff is itll run slow on my pc okay but itll run quick on a good pc just like a colab that id use
so theres no diff
it would be way better on a good pc, bc u aren't limited by a gpu time
and it would be faster
u can be disconnected randomly on colab
and u can make a song
But with a GPU its WAY faster
btw this is only for converting voice
like AI Covers
it doesn't make songs like Suno
if youre making a song you shouldnt expect to finish it fast
i wana make a song nick
it doesn't make the instrumentals
it can only convert the voice
i know ill get the beats
like make an ai cover
alr if u want to make ai cover it's fine
u want an ai cover right?

what
oh with your own voice?
Uh
no like my voice to drake
No. His voice to someone elses
thanks
Yea he said it already
u would prefer that, u can actually get randomly disconnected on google collab
bc it gives MAX 4 hours daily
of NOT GRANTED GPU
well i can make a song
in that time
and if anything finish it the next day
done it before
alr ur choice then
^
thanks
weights.gg is the easiest
yw
no the songs are private
It doesn't
unless u share them
Its private
ill use illaria theres advanced options
if theres one with tempo
like adjusting voice tempo
that would be amazing
bpm
with that, u also need to separate the vocals and instrumentals of the song manually https://docs.ai-hub.wtf/rvc/resources/dataset-making/#vocal-isolation--cleaning
Last update: Dec 24, 2024
ill be making the song mysel
ill extract the voice alone and convert it
i wish there was a tempo adjustment for the voice to match the bpm like old collabs had
yes, that guide shows u how
im getting gpu task aborted and i just started collab
what does that mean
show a screenshot
if you are using Ilaria RVC Zero, that's not colab....
that's hosted on HuggingFace Spaces, with a ZeroGPU
so what do i do bro
don't ping me everytime
i'm testing myself
no need to ping
oh thanks
the space was preparing, refresh the page
whats better qualith that or apolio or is it same quality
Applio gets more updates,
But the quality should be the same between all RVC forks (Mainline, Applio & Ilaria RVC Zero)
there hasn't been a new model architecture since almost 2 years
thanjs alot sir
yw
thanks
do u need anything else or have other issues or can i mark this convo as solved?
no thanks
so I can mark this as solved right?
wait i downloaded the model into apolio but i dont see it when i click the model
voice models is empty
apolio
it's actually Applio
Could you tell me step by step what you did?
i went to downalod
put the index and pth files
and i went back to where u use the auio to make a song
i cant find the model
did u click 'refresh models'?
send me the model download link to be sure it's working
also show me a screenshot of ur Applio
do i put the index
not the name
yes
click on 'Index file'
nothing shows up
btw all models on weights.gg are renamed to model.pth and model.index
the .pth is the actual voice, the index is the accent
u sure u uploaded it just like in https://docs.ai-hub.wtf/rvc/cloud/applio-colab/#inference-
Last update: June 15, 2024
ik, click the "manually"
did u put first pth, then index?
i put both same time
u should put the pth first
THEN put the index
maybe retry that
@ionic geyser btw the quality between Applio & Ilaria RVC Zero should be the same, dw
it's still just RVC
and also Weights.gg AI Covers are private
you can use that too if u want, there are advanced settings
the same ones or extra ones?
thanks alot btw bro!
What happened wuthRVC V3?
Over Trained? Or Keep on going?
Batch Size 4 + 30+ mins datasets + Pretrain KLM 48k X3 4.3
You replied the wrong person
@brittle wing I'm sorry to bother you
No problem
Seems over trained I guess
Damn that's a lot of overtraining...
RVC V3 ?
YEES
Do it exist?
It's in development
Oh wow, how do you know btw ?
@brittle wing
Thank you.
I’m going to redo it because I do notice my dataset has a mixture of 40k and 48k.
I’m going to have to resample everything to 40k, maybe it won’t over train too much
Increase your batch size to 8
If you have dataset of 30+ minutes
I’ll keep that in mind.
Should I resample everything?
I noticed my dataset is mixed with 40k+48k using KLM 3 48k pretrainers
Hey, how do we make models now that the google collab methods been busted?
Yes you can. Check your dataset sample rate in spek
Use kaggle
??
show name of the gpus
stick with gpu0
Why his GPU 0 is working even he have plugged in another GPU ?
i need context
so what can i use to train voice models with like programs?
Use your nvedia rtx 4060
Last update: Oct 21, 2024
rvc -> local -> mainline or applio
people prefer applio these days since it gets updates
Training or RTX 2060 is possible?
it doesn't exist
RVC v3 will never officially exist
there's just some EXPERIMENTS like codename fork
but it's not v3, the original creators don't care much about it anymore
not even officially
you meant unofficially?
actually, @simple ore and @glacial pollen are working on updates
yea just not sure how well/fast
they added 44.1k sample rate and other things iirc
i bet everything it will not change much
check #🔊│ai-development
no access
no acess gamers
Someone Is asking me. Should I say him to train locally..
tldr they changed everything, the gan, the discriminators etc
Last update: Dec 17, 2024
how big of a difference it makes? because its probably close to nothing
no shitting on others project its just that i dont see any real improvement
they are currently working on it, surely not 'nothing'
well good luck
ye goodluck to them
i don't work on that
I think it will surely take a lot to make an 'rvc v3', but hey better than nothing
Is RVC V3 is coming?
let's see how it goes
no
officially it's 99% never coming
the creators left rvc to rot
they work on gpt-sovits
noobies and codename are working on an experimental fork
pretrains are still being trained, but the early results they got is the fixing of mirroring frequencies
their project is still not finished tho
Gpt so vits. Text to speech thing ?
i hope true pretrains and not fine tunes
yes
true pretrains from scratch
Okay. Someone Is asking me 2060 is good for training with 6 gb vram ?
no
ehh the 6gb of vram aren't the best
I think doable as @quasi dagger trained (super slowly) even on a gtx 1650
but ye he should be able tho
Okay. So using kaggle will be a good option?
even if ofc a better gpu would be suggested, like the min requirement in the docs is rtx 20 ti
why not weights at this point
You are right. Weights.gg can be helpful too. I just forgot about that. Thanks ilaria
no prob
he could try locally but I'm not sure how good that will run
anyways ye
ofc weights.gg is even easier than kaggle
his choice
You are no longer admin ?
ye since a lot of time
since a year
Even though it has 6gb VRAM I only managed to train at batch size 4
ye surely doable
Hi
I want to make song from ronaldo voice how can i make?
What's ur PC GPU?
rtx 3070
pc
Well what if I increase batch size to 8 ? Maybe bigger batch size = less vram consumption?
8 gb
Use applio or mainline RVC
On Applio you can only set it to the maximum your GPU allows, so if you set to 8 it would go back to 6
good enough for inference
how to use it or anyguide?
Your Nvidia GPU is good enough
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
I would suggest locally if u gonna use it more than once, if it's a 1 time thing, u can just use weights.gg
@low shard suggested you
is ilaria rvc zero still untopped for inference?
It's the opposite. Larger batch size - more vram is used. With 6gb vram, you can fit batch size 4-6
thanks for you time
If i have any question in future i will ask
wdym with 'untopped'? like speed?
Okay. So if 8GB vram can I set batch size to 10 or 12
8
Yw : )
like being easy
Weights.gg uses multiple A100s iirc, the only difference between it and ilaria rvc zero in inference, is that it's easier and it has queue
nah weights.gg is easier for that
well yea tbh it has already the models
it automatically separates instrumentals and vocals
but isnt it paid?
it's noob-friendly
Okay. I never knew that
it has a paid plan, but 99% of the things are free
except u got some little queue
that's all
how can it be a sustainable business then
and on the paid plan it allows u to make videos
people don't like waiting + weights.gg is rich asf
Also skip queue for inference
seems shady on my end
u can talk with River or Bea if u really think that
i talked multiple times with river
Not sure about the rich part as I saw they were cutting the free training a while ago due to the constraints
but offering a free product with nothing back in returns means the user is the product
that was months ago, but it got added back
- lora training free too
- image creation free too
how can that be free
It has limitations for daily usage like other platforms like Leonardo ai for instance so it's not like 100% free 24/7
Queues
still
yea true
its a cost for them
I don't really think that's how Weights.gg is going, they pretty chill and have the money for it
else they wouldn't be giving away a free vr & nintendo switch #1317718319628943360
it's to get more popular i guess
Yeah, so more people can go with the weights premium subscription
mh
Nintendo switch 
it's not that hard to just shorten alot the limits
Actually @proven hill weights is now owner of AI HUB
yup
sadly
How will they choose a winner?
closed source won again 😔
but they don't bc they care about the users here, last time people complained about no free trainings so they added them back basically
Blaise stopped working on Applio as well
mh mh idk its still strange
i knew before everyone
Do Weights.gg own Ai Hub ?
Technically yes
blaise told me he wouldve stopped months ago
everyone simply found jobs and got tired of this
But how and why ?
I remind everyone this is a help channel
🥺 seriously
smh
Oops
Btw you also have by Weights in your username
trying to revive the server since it's dead
I just followed the trend. I thought people are doing this to be cool.
it's a meme of the name change
No, they were mocking the server name change
as many complained about it before, and now I don't see anyone talk about that much
I just do it bc it's funni tbh
idrc if it's named ai hub or ai hub by weights or wtv
Even I asked you "should I change my name and add by Weights " you said yes
ye we were joking around lol
also let's talk in #✦│chat
I'm going to change my name again. I was unaware about that
Thank you
more than mocking, just joking
Is that good enough?
go to Scalars tab
set smoothing to 0.95, do other charts (fm, mel) agree?
how big is the dataset and batch size?
batch size 15
What do you mean by “how big is the dataset”?
how many hours are in it
01:16
Minutes:Seconds
i mean batch 15 is a weird number, suitable for 10+ hour sets
not for 75 seconds of audio
How much is it supposed to be?
2 or 4
f*ck
there's way too much variety between steps on those charts
set it to 4
hi dose anyone know what software is normally used to make rvc models
And how often do I need to save the models?
10 is fine, you'll be able to pick something out before runs into a wrong direction
ok, thanks
Btw, what is batch size exactly?
75 seconds of audio = 25 samples x 3s + 2 mute files = 27 samples
27 samples / batch 4 = 7 batches (=steps) per epoch
RVC, what's ur pc gpu
the training step takes a batch (4 samples), tries to reproduce them, gets evaluated, re-adjusts parameters, repeats
Okay, I think I got it, thanks again
batch size dictates how big the adjustment is going to be
big variety of samples = bigger the variance between steps
that's why you had crazy 20-45 range for some metrics
at the end of the epoch the learning rate goes 1 notch down (=training speed decreases), samples get reshuffled between new batches and the training contuniues
so it is a balance between having enough adjustment per step vs constantly lowering learning rate
I hate it when that happens
Is there any way I can get this epoch 215?
Where am I supposed to look then?
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Now I have this
hard to choose, check one close to 1600 steps
Starting training...
Loaded pretrained (G) 'rvc/models/pretraineds/pretrained_v2/f0G48k.pth'
Loaded pretrained (D) 'rvc/models/pretraineds/pretrained_v2/f0D48k.pth'
Process Process-1:
Traceback (most recent call last):
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 314, in _bootstrap
self.run()
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\rvc\train\train.py", line 497, in run
reference,
UnboundLocalError: local variable 'reference' referenced before assignment
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
how to fix
I think I'll wait for the full training first (500 epochs) and then I'll start choosing
make sure you actually sliced the audio and not trying to train on 1hour long wav
i did
show log for extract features step
and/or filelist.txt content
what is in sliced audios / 16k
are there other folders?
and what is in f0 and v2_extracted
nothing
Starting pitch extraction with 3 cores on cuda:1 using rmvpe...
Process Process-1:
Traceback (most recent call last):
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 314, in _bootstrap
self.run()
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\rvc\train\extract\extract.py", line 118, in process_files
self.model_rmvpe = RMVPE0Predictor(
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\rvc\lib\predictors\RMVPE.py", line 460, in init
).to(device)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 1173, in to
return self._apply(convert)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 853, in _apply
self._buffers[key] = fn(buf)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 1159, in convert
return t.to(
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
Pitch extraction completed in 6.72 seconds.
Starting embedding extraction with 3 cores on cuda:1...
Process Process-2:
Traceback (most recent call last):
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 314, in _bootstrap
self.run()
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\multiprocessing\process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\rvc\train\extract\extract.py", line 179, in process_file_embedding
model = load_embedding(embedder_model, embedder_model_custom).to(dtype).to(device)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\transformers\modeling_utils.py", line 2905, in to
return super().to(*args, **kwargs)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 1173, in to
return self._apply(convert)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 779, in _apply
module._apply(fn)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 779, in _apply
module._apply(fn)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 779, in _apply
module._apply(fn)
[Previous line repeated 1 more time]
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 804, in _apply
param_applied = fn(param)
File "C:\Users\iibug\Downloads\ApplioV3.2.8-bugfix\env\lib\site-packages\torch\nn\modules\module.py", line 1159, in convert
return t.to(
RuntimeError: CUDA error: invalid device ordinal
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
i'll delete this after since so big
wrong device selected?
IT'S EXTRACTING
IT COMPLETED
what do you think would be good for batch size?
it's at 5 rn
it does not have much to do with VRAM.. other than "will batch x fit into 12GB"
so you got like 20 min audio there
use batch 4
do i gen index before or after training?
it runs automatically, but you can generate it after you start the training
what do you think?
click the square button under each chart to adjust the fit
right or left square?
\
they do look rather terrible, but not much can be done with such small set
you may have better luck using a non-default pretrain like ov2
“works well for small datasets and English language” this line alarms me, as my model is trained in Russian language
it is okay, just dont use index and you'l have almost no accent
By the way, what does the index affect?
mostly prononcuation.. english audio + voice model + russian index (1.0) = russian accent
Which version should I download?
depends on your audio
Ok, I downloaded it all, but I'll probably try it during the day since I'm deep in the night now
Good night and thank you for helping me
how can i update rvc if I have an old ancient version easily?
raise LocalProtocolError("Too little data for declared Content-Length")
h11._util.LocalProtocolError: Too little data for declared Content-Length```
anyone know what this error means?
I just dragged in my audio and its outputting this
another way to do is to put the input audio in your Applio\assets\audios
or if not sure, try download the latest applio 3.2.8 bugfix
Thats what i tried doing
Ill try getting the bugfix
https://docs.ai-hub.wtf/rvc/resources/vocal-isolation/
^ this page is down, anyone know what UVR models were recommended for clean vocal extraction?
rvc
Their server is up now.
how do i update my old rvc build from july 2023
i want rvmpe
do i have to wipe the whole folder and start over?
If you mean the original RVC GUI, this one has been long outdated. The most recent one released around October 2023.
oh, can i update it to that one?
I want to use rmvpe models instead of crepe
isnt rmvpe way better?
RMVPE is way better than any other model.
Unless you wanna make your own fork of RVC GUI by yourself, there you can download its source code from GitHub. Otherwise, you can try Applio the RVC GUI to do some audio conversion.
ok but wont the latest rvc gui work regardless?
how can i update mine to the newest
idc if its old
The most recent RVC GUI indeed works, but there won't be any official update after that.
rvc gui is too old and doesn't have rmvpe which gives better quality
I recommend the much newer one, i.e. Applio
For RVC GUI, you'd find some bugs going on.
im coming from a video
-gui
Useful link of what?
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
If you're on your smartphone, a cloud service is the best and only way to do RVC AI cover.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Google Colab is a cloud service that lets you run Python there. First six links are RVC, but I'd recommend for Applio.
Install Applio there on Colab? Yes.
okay lemme test
Oh I didn't realize that thank you. Also applio just scares me since it's closed source
Applio isn't closed source
It's open source
What's ur PC gpu
RVC, W-Okada, Stable Diffusion are all open source. Unless you've downloaded a proprietary program that does audio conversation similar or like RVC, which is indeed closed source.
can someone tell which sample rate is good for voice accuracy
The sample rate of your dataset
IOS >> apple iPhone?
hey how come the official uvr 5 new model announcement uses some weird github release not the official anjok07 one?
like eddiecrack somethng
how can i avoid overtraining? using weights.gg
My UVR5 UI use audio-separator a CLI-type python package based on the same code as UVR5
i see
so you just made a webui version
i decided to just use anjok
how can i seperate main and backing seperately with melband roformer
ye i think, although audio-separator has some extra things, likewise UVR5 has other extra things
yessir
does anyone know if you need to split inputs to train?
i put 2 10 min files in
uncut
Mel-Roformer-Karaoke-Aufr33-Viperx could be useful for that, or BVE models on VR Arch
train RVC model?
No needed
RVC do that automatically
i only see kim and viper melband roforomer on uvr
how do i find this Mel-Roformer-Karaoke-Aufr33-Viperx
That's one of the disadvantages of UVR5, you have to search and add the models manually, and change the configuration files if necessary
so what is better in your opinion kim or viper roformer
For vocals? I think Kim
ok thanks
yeah, i still found kim model in official download section of uvr 5 tho @viscid moss
i wonder what model mvsep uses?
lemme see
gimme a sec
I think is the same
yes, did kimberly really make this herself?
idk
thats crazy it sounds so clear lol
fr
one more question where can i find the melband roformer karaoke model?
it seems to be on mvsep but I would prefer to download it locally iykwim @viscid moss
How can I download new UVR5 UI ? Do it have any compiled version or I have to do everything manually on CMD
is there a way to train on mac yet?
or is there any service which offers me online training with individual settings & unlimuted amount of dataset?
lol where did you get this from? I am nervous to download random files?
is it from their huggingface?
Nope, they don't publish that model on HF idk why
Not yet, instalation is through cmd with an installer i made
I recommed cloning and install it on ur Downloads folder
When will the .exe will be available?
Okay. Which models are best for vocal isolation, de eco, de reverb and de noise?
Not now, my wifi is not functioning.
best extraction method for vocals: unwa's big mel roformer beta 4
de-reverb: anvuew mel dereverb v2
backing vocals: mel roformer karaoke
denoise: mel denoiser
Yeah nvm
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
And do you know how to remove stereo delay ?
That uses an automatic overtrain detector
Hmmm idk
I was using MDX23C for vocal isolation, Vr ach- deEco , MDX de reverb normal and UVR denoise is that bad?
Should I remove my old UVR and Go with New one ?
MDX23C ye, others are kinda fine
Def yes
What's wrong with MDX23C ?
Low SDR
Can you show me how new UVR look like
Roformers are better
Beta version is identical just code changes that u can't really see
SDR meaning ?
UVR5 UI is different cause is gradio UI
Okay. If I remove old one and follow the link you have sent. Will it install everything automatically?
Ohh I see. Just like applio and mainline
Yep
I need to install 3 things, Git (to update UVR5 UI), FFmpeg and UVR5 UI. All that is described on the link i send u
Okay. What about models?
Models will be downloaded automatically when u choose them
Ur welcome
@low shard how did the voice model i send work for u yesterday
What issue are u having now
Okay, I finished training and these are the results
wym? What should I do or look at?
Seeking Help with Climate Data Project!
I’m working on a project that uses synthetic data generation to address gaps in climate datasets and improve predictive modeling for climate change mitigation. Here's what I aim to achieve:
Generate synthetic climate datasets (Avg Temperature, CO2 Emissions, Renewable Energy %) using methods like Gaussian Mixture Models (GMMs), Variational Autoencoders (VAEs), and Copulas.
Validate synthetic data with metrics like Wasserstein distance and Kolmogorov-Smirnov tests to ensure consistency with real-world patterns.
Visualize trends, anomalies, and regional variations using ArcGIS/QGIS and traditional plots.
The goal is to create scalable, reliable datasets that aid in better climate analysis and decision-making. I’d love your inputs, advice, or suggestions on how to improve this framework or achieve these goals more effectively.
If you’ve worked on synthetic data, GIS tools, or climate modeling, I’d really appreciate your help! Let’s brainstorm
can someone help me i have dragged the spongebob voice ai zip and it says that i cant upload files like that
Hey, djgyrtazao! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
i cant put screenshot for spongebob
are you talking about https://voice.ai ?
Download now free real time voice changer for pc. Modify your voice, select one from the Voice Universe or clone any voice you want
it's trash, we don't support that
it abuses ur pc power for distributed training
or are u talking about smt else
voice.ai fa schifo, abusa del potere del tuo pc in background
disinstallalo fra, lo usavo anche io prima
Wokada, specialmente il detieris fork
parliamone in #🔍│help-w-okada
ai
si
ok
@low shard
yea u need to put the huggingface download link url in that
what model link are u having issues with
oh, ya seems like
generally, I need a new model for my song remake and I'm checking what the options are here
You can search rvc ai voice models at:
- #1175430844685484042
- In #🔍│find-models , Do /find with @earnest musk
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
:wave: @low shard, How can I help?
Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image
I thought you could paste a YouTube link there and it would create a model itself
what? No model training doesn't work like that
Ilaria RVC Zero is made only for Inference (use models) not training (make models)
on RVC you need to download, curately clean the dataset, then train it looking at the tensorboard
it's AI
the only way to do it that ez, is on https://weights.gg
no other way to do it that easy
I used a site outside of this discord and it worked normally but this site didn't have the features I needed
if only it was that easy 😔
RVC isn't a site, it's a program
You prolly used smt like kits.ai, which still uses RVC in the background on their cloud pc,
AI isn't installing a Steam game
I have a question when it comes to RVC. I want to change out my 5700XT for a 4080S Suprim X but I'm using a i7-11700F CPU. Will the CPU affect RVC when it comes to my voice latency or no?
Idk if I need to upgrade my CPU to reduce latency or will all that work be using the GPU primarily?
cpu has pretty much nothing to do with rvc
its all gpu
Okay,perfect
@cloud mantle the only way to do it that easy, is by using weights.gg
what's wrong with weights?
just use since it's the easiest free way, it uses the same program anyways
only when generating a song the AI doesn't get into the rhythm well and sings in a different key
Ciao, buonasera.
Sono nuovo in questo forum e in questo "mondo" AI...
Dove posso chattare (o aprire un ticket) con uno Staffer?
Possibilmente (se possibile) uno staffer italiano poichè non so altre lingue
LMFAO
@low shard
Ciao, grazie cara. Gli scrivo in privato o qua?
aspetta che risponda
cosa devi fare?
Volevo sapere come si creava una Singing Voice Cover, prendendo una canzone, lasciando la base così, e cambiando le parole... Vorrei farlo con la voce di Lazza, perchè ho visto che c'è un'anteprima ma non so come fare per farlo cantare ugualmente all'originale.
cambiando le parole giusto? è più difficile di un AI Cover normale, perchè dovresti:
- scaricare la canzone
- separare le vocali e la base
- cantare la versione con le parole modificate
- Usare RVC per cambiare la tua voce in quella di Lazza
- Mettere insieme tutto quanto
o intendi un AI Cover normale senza cambiare le parole?
what's ur pc gpu and what are you using
Ciao grazie della risposta. Intendo come ha fatto qua [#1287422682446565578 message] il 3 audio. Però con un altra canzone di lazza. Poi la base la ricollego.
NVIDIA RTX 3060
penso che lui abbia fatto cosi, oppure si può usare il TTS (Text To Speech) (da testo a voce) ma non è brava nel cantare
Intel i7
mm, what softeware did u download
Applio
you might wanna check if your drivers are updated
and that u are running on the latest applio version
what's ur ram, and vram?
esiste che tu sappia un tts singer che gli puoi dare degli input (es: canta come questa canzone), però con il testo che scrivo io?
The Latest
non credo che esiste, forse Suno o Udio ma non li uso
I found this #✨│ai-help message , maybe that works
yea it's enoughù
I did that, didn't work
so, u used that and are 100% sure ur drivers are updated?
non hai una piattaforma testata da te che mi puoi consigliare?
I'll try updating them but its wierd cause the issue only happens when the file isn't small
sinceramente no, o puoi provare con il metodo difficile con RVC, o puoi testare se Suno/Udio vadi bene
ma di solito non modifico il testo delle canzoni
facevo solo ai cover prima che le eliminassi
what's the vram (dedicated gpu memory) of ur 3060
and is it laptop
Its not a laptop, and I dont know the Vram sorry
ovvero? che cambia?
show me a screenshot when going into gpu 0 or gpu 1 in task manager
this occurs when starting training?
It occurs when I convert it to ai, im not trainging
training*
Ho provato già Suno e Udio l'ho visto ora ma Udio non fa caricare audio nella versione free, mentre Suno capisce nulla e rileva le canzoni di altri artisti perciò non va come input.
che semplicemente potevo cambiare la voce dandogli come input le vocali della canzone, ma nel tuo caso o le canti tu o usi un TTS (che cantera male di solito)
have you tried multiple files or only the same one?
@zinc crow il metodo migliore è cantare tu stesso, poi fare il cambio dalla tua voce a quella di lazza (se ho capito bene) e poi metterci sotto la base
Only that one
try other one
mmm, forse potresti chiedere anche a @civic meteor , ma al 99.9% lui avra fatto come ti ho spiegato io con RVC
cioè cantando
I tried a differen't one and it worked
the one Ive been trying to do is 30 minutes
and the one that worked was 19
Is my file too long?
which pitch extractor are you using? rmvpe or crepe? which format is the audio you want to inference?
is it wav?
Wav, rvmpe
is "split audio" option in the inference being used?
yes
alr i believe the audio file might be too long and you're probably getting out of vram
Really?
It happens at the very start
open task manager and check your main gpu vram usage (the 3060)
this one?
yea it means you're getting out of vram
doesn't look like a oom problem
might be related to how applio handles very long* audio files
only an applio dev can really help you in this
@simple ore he has been trying to inference an 30 minute long audio file in applio but he gets cudnn non contiguous input
he tried another audio file and it worked just fine
what do you think its going on here? he's not getting out of vram either
split audio option is checked
does split audio actually split it into multiple chunks? or does it not find an appropriate silence gaps to separate?
he gets cudnn error as soon he tries to inference that specific audio
the other audio he tried was 19 minutes instead of 30
can you try to inference your audio file again but this time show us what do you get in the cmd window?
the one that gives u the cudnn error
@cloud finch
normally with 12GB you can infer ~15min audio
oh so it was indeed a vram error
i suspected that first
i need help with rvc
whats a good free rvc?
i just installed it again after long time, i started go-realtime-gui and it opened but when i click start it crashes, no not really it just gets stuck and dont work then
You shouldn't use the go realtime for realtime voice changing
what else then?
Wokada deiteris fork (more optimized) > Wokada > RVC go-realtime
Wokada deiteris fork, let's talk in #🔍│help-w-okada
ok G
RVC is open source and free
Tell me ur PC GPU and what are u looking for
RVC = Retrieval-based-Voice-Conversion, NOT Realtime Voice Changer
Anybody know exactly what the OG pretrain is? The devs say VCTK but I tried training my own with VCTK as a test, and while the output sounded decent, it came out more muffled/less high-end-y than the OG pretrain.
also the RVC devs say is the readme that they will be adding more data to the pretrain so I can't tell if they did or not.
it is 109 speaker set using vctk corpus 0.80 or 0.92 mic1
Ah I wonder if I included other mics
Hi, where can I train RVC voice models? I use Android. Can someone tell me where I can train voice models?
Unfortunately, if you intend to use the Android device itself, that's not possible. As for which websites ( such as weights.gg and such ) support android devices, I've no clue
If you want to train a voice model using Termux on your phone, no. But you can go for a cloud service like Google Colab or Weights instead.
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
well thanks
Try weights website I suppose
if it doesn't work in mobile mode, switch to desktop mode ( if still doesn't, rip. Maybe try google colab but again, I never needed to use it so don't know if it works
Ok, and what collaboration would be Google and Weight? I was not told that on that page the voice models do not give good results when they are trained there.
When did I said there was a collaboration between Google and Weights?
"Colab" and collaboration aren't the same word.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
^
First three links are RVC Colab notebooks that can do training. But I'd suggest to go for Applio as it has an easy GUI to use.
and which ones are for training models?
For Weights, I suggest it would be better to have a desktop PC so you can do drag and drop audio file into Weights easily.
Please read my message above.
The applio one doesn't work, it doesn't open
Can you send a screenshot here when it won't launch?
But I don't have a PC, I live in Latin America where everything is horrible. I only have Android, otherwise I wouldn't be asking how I train from Android.
Again, can you screenshot where it won't launch? Just so I can verify about issue that made Applio not working.
And which of those that are there are useful for training?
Was this ever solved?
even if you have a 5090 and 128 GB ram, it is strongly recommended to split into at most 6 mins each, or using applio's split audio feature
also before using split audio feature, you should denoise and noise gate the input audio first, as there couldn't be a voice sentence longer than 1 minute without silence gap
does anyone know like if theres any ai that can find a song because websites dont wok for me
im having trouble finding this song and its not known so idk if anyone can help so is there like s ong finder ai or anything
regular shazam dont work?
no
upload it to soundcloud, if it is copyrighted you'll know soon 🙂
its unreleased and rlly hard to find so idk if it will get copyrighted
Wierd thing is though is that the other 2 files I do were around the same length, 29 -32 minutes
and they worked fine
But it only happens with this specific file
its so wierd
Try to cut the file into 3 more-or-less even parts and infer each separately ( without slice audio option )
AI HUB Docs