#✨│ai-help
- Creating Datasets for RVC using iZotope RX11, by Cauthess
- Gathering and Isolating Audio, by SCRFilms ❄
- Instrumental and vocal & stems separation & mastering guide, by deton24
- Vocal Mixing Tutorial, by Roomie
- https://mvsep.com/
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
-overtrain
Moved to /faq command.
-rvc
Suggestions for @quiet crystal
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-realtime
-rt
That's crazy. You wanna run both Applio (the RVC) and the W-Okada fork at the same time?
where is chat
#✦│chat and #🧬│ai-chat are chat channels.
What sample rate should I use?
32k
I have been out of the loop for a while, whats the best way to use speech to speech ai now?
My voice sounds so bad in audio recording applications and audio files that I'm embarrassed to listen to it and turn it off immediately. Even when I sound normal, my voice sounds incredibly deep on the recording. Is there some kind of application that can take my own voice, detect it automatically, and make it sound better and more realistic, without distortion?
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Extra Model Fusion Troubleshooting “No GPU is currently available for you after 60 seconds” “Where can I see my ZeroGPU ...
Ah thank you
no probs
anybody here got a kacy hill voice model?
voice changer?
check #1175430844685484042 or weights.gg
it is not in there unfortunately
gonna give up
Changing your own voice, whatever you call it
then request it in #1159289738314919936 or commission a model master in #1191429836321849435
in real time?
yes
read this guide: https://rentry.co/ForkVoiceChangerGuide
Guide style is the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update December 12: NEW UPDATE VERSION b2332
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuid...
Is there a shorter and faster training? as video. I don't speak English anyway
no
F***K
This doesn't seem to do realtime voice changing, what's the best way to do that?
is it this?
yes
tytyty
And people say don't use it, they say it is very difficult to differentiate. I want to make another one with my own voice, nothing comes out other than sounding like a robot.
for example this
Bruh, the guide tells me that a 3090 can do 20-40 ms chunk plus 2.7 but my 4090 needs to be above 100 ms chunk for it to not lag
Any place i can sort voice models by ranking or popularity?
guys I'm using colab for real time voice changer
how can I bind my output voice to discord or any other software that uses my mic
Hey don't say that please
i can do 80ms on my 6700 xt
so somethings wrong lmao
is there an easy way to upsample and downsample my audio stream on windows to pin it to the sample rate that different models use?
why would you do that
its useless
just check the sample rate with spek
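For WAV files, the sample rate can also be read without a GUI tool. The following is a minimal sketch using Python's standard `wave` module. The helper names `wav_sample_rate` and `write_silent_wav` are made up for illustration, and the approach only covers uncompressed PCM WAV files, not MP3 or FLAC.

```python
import wave


def wav_sample_rate(path):
    """Return the sample rate (in Hz) stored in a PCM WAV file's header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def write_silent_wav(path, sample_rate, seconds=1):
    """Write a mono 16-bit silent WAV, only so there is a file to inspect."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)            # mono
        wav_file.setsampwidth(2)            # 2 bytes per sample = 16-bit
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * sample_rate * seconds)
```

Calling `wav_sample_rate` on a dataset file shows whether it is, say, 32000, 40000 or 48000 Hz before picking a training sample rate.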
Can u use okada live rvc on discord voice messages?
so the input sample rate doesn't matter? let's say i'm trying to realtime with a voice that is trained on 32k, my microphone is 96k. there will be no issues or degradation?
anyone know when 3.2.9 or 3.3 comes out for applio
i failed at custom installing the damn package update
just get a compiled build, sheesh
what? ppl yesterday were saying compiled build was outdated even though i installed 12/25
they said g/loss/total graphs were not correct
Does RVC disconnected stem vocal from input file?
My data is already clear single person voice so if it does any step to separate vocal from background sounds or from another person voice I want to skip that step
nope
I managed to train my model, but it gets some robotic voice at the end of the sentence, tried many epochs (250-500). Do i need to train it more? How to set it correctly?
Should I train it more times? 23mins of dataset, totally clean with silence reduction in audacity
did you check the tensorboard?
-docs
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
first link
Will it be okay if I include laughs & speech in my dataset of the same person aside from vocals?
Thank you, so lets summarize:
1.download applio and pretrain model
2.train it correctly
yes but not a lot or it might break it
REALLY
pretrain model is included in applio
How many minutes of laugh and speech?
use the og pretrain
id say 3 laughs top
Nah I'll use KLM x4
why
How much speech
min 10 minutes
Cause I want to
make sense but the normal pretrain is better
No
wdym no
Pretrains exist to ease model training and also to improve the model results
do you think i dont know what im talking about 😭
Nah I just have personal preferences and that's all
make sense
Nothing to do w you
Wdym og pretrain is better?
In what sense? I think pretrains are a modern variant to ease model training and all
less noise
yea but they also add noise
Mhm okay but KLM doesn't add noise like titan
still adds it
its a fine tune
And...? That's not a problem at all since I have a bandlab preset that has noise gate in it
Also a little noise doesn't matter that much

it’s not like every single pretrain is bad
no, but the og is still better
It all depends on ur dataset language and length
og is trained only on english
ofc im assuming hes training in english
imagine u want an anime image, would u use an anime or realistic image model 
this doesnt make any sense 😭
if u want to talk in italian, you would use an LLM that has been trained on it rather than one that didn’t
ye but he was training in english presumably
presumably
what language is your dataset?
Korean + few English
current repository has unreleased stuff, including better logging, but it is still a work in progress
What is the best version for mmvc
And where I can download it
I’m guessing you’re talking about Wokada, which is a realtime voice changer, different than rvc program
Use #🔍│help-w-okada
MMVC is a name for W-Okada. This #✨│ai-help here is about RVC.
Sorry then
dw
Is the colab fully updated to repo
Some files that have been uploaded to Ilaria RVC from Hugging Face are all saved somewhere in %temp%, I found out that today.
why would u use that locally
Curious. 
Nope
People have preferences

Also I'm the one making the model so I decide for myself
no one is arguing here
Okay does KLM x4 handle speech and laughs
I see
x2 is better at handling high pitches
Finally, it indeed went well. 
insane
It took an hour to finish because of how slow my laptop CPU is. It would be faster with a better PC and an NVIDIA GPU. 
it amazing it runs locally
40 mins
completely normal
Meanwhile the 4 hours dataset gives 30 MB .index
…how
that's with minibatch k-means
with the normal faiss it would be ~2 GB
The average .index file for an RVC voice model weighs around a thousand megabytes, so this is fine.
applio has the option whether to use faiss or k-means
Damn
Imagine that Ilaria RVC from Hugging Face being developed to run locally. 
How does the speed compare
Thinking of changing the code to force it to use faiss but maybe Colab usage will run out first lol
u can train the index locally just using an average laptop cpu
why….
Just curious. 
lol
it only happens when you have 200k chunks
after ~4000 starts shuffling and showing a text output as it attempts to reshuffle pieces that are somewhat unique
with 200k chunks the extracted features get grouped into clusters and most of them get discarded
that again brings the index to a somewhat manageable size
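The size reduction described above comes from keeping only cluster centers instead of every extracted feature vector. Here is a rough, pure-Python sketch of mini-batch k-means to illustrate the idea. It is not the code Applio or RVC actually runs, and the function names, batch size and iteration count are assumptions chosen for the example.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_center(point, centers):
    """Index of the center closest to the given point."""
    return min(range(len(centers)),
               key=lambda i: squared_distance(point, centers[i]))


def minibatch_kmeans(points, k, batch_size=32, iterations=100, seed=0):
    """Reduce many feature vectors to k centers using small random batches."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    counts = [0] * k
    for _ in range(iterations):
        batch = rng.sample(points, min(batch_size, len(points)))
        # Assign the whole batch first, then move the centers.
        assignments = [nearest_center(p, centers) for p in batch]
        for point, idx in zip(batch, assignments):
            counts[idx] += 1
            rate = 1.0 / counts[idx]  # per-center learning rate
            centers[idx] = [(1 - rate) * c + rate * x
                            for c, x in zip(centers[idx], point)]
    return centers
```

With, for example, 200,000 input vectors and a few thousand centers, only the centers would need to go into the index, which is why the resulting file is so much smaller.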
Bro i have delay and my voice is very distorted can one fix this
For W-Okada, go to #🔍│help-w-okada.
Could anyone help me creating ai cover? i have the model and isolated vocals ready ! 
Hey, 𝐌𝐞𝐥𝐭𝐫𝐲𝐥𝐥𝐢𝐬! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
What's ur pc gpu
RTX3050 laptop
lary over
Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible
You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU
Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio
thank you so much
yw lmk
hey i need a app to run my Models and i simply cant find any :c
Is Applio broken? I have a linear graph on a 20-minute audio training (with Google Colab). Does anyone have a solution? Here are my parameters: Sampling rate: 40k; Pitch extraction: Crepe; Batch size: 15; and 500 epochs in total. I kept the batch size at 15 because it generally gives me better results. Hope you can help me. (The graph is loss/g/total)
click the third blue box
Ohh ok thanks a lot !
Is RVC v2 Disconnected down?
mismatch between sample rates of the model and the pretrain
513 - 32k model
sorry for tagging after so long, but where i can download these models? when i unfold vr or mdx these models are not there
u need to add that models manually
and u also need UVR5 beta for that
this one i got covered
having the same issue
Do you know where I can download them?
from HF
here's the doc
edit 30.12.24 deton24’s Instrumental and vocal & stems separation & mastering guide (UVR 5 GUI: VR/MDX-Net/MDX23C/Demucs 1-4, and BS/Mel-Roformer in beta MVSEP-MDX23-Colab/KaraFan/drumsep/LarsNet/SCNet x-minus.pro (uvronline.app)/mvsep.com/ GSEP/Dango.ai/Audioshake/Music.ai) General reading adv...
ur welcome
is it ok if my dataset is 1hour and 32 mins? XD
as long as the dataset has clear audio, why not, but know that it will take a long time to train
1min to train 1epoch, and i set 800epochs sooo yep, pretty long
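The arithmetic behind "pretty long" can be written out. This is a small sketch with hypothetical helper names. The 240-minute figure is the roughly 4 hours of daily Colab GPU time mentioned by the bot earlier in the channel, and treating it as a hard per-session limit is an assumption.

```python
import math


def total_training_minutes(epochs, minutes_per_epoch):
    """Total wall-clock minutes if every epoch takes the same time."""
    return epochs * minutes_per_epoch


def format_duration(minutes):
    """Render a minute count as 'H h MM min'."""
    hours, rest = divmod(int(round(minutes)), 60)
    return f"{hours} h {rest:02d} min"


def sessions_needed(total_minutes, minutes_per_session):
    """How many sessions of a fixed length the run would span."""
    return math.ceil(total_minutes / minutes_per_session)
```

For 800 epochs at 1 minute each, `format_duration(total_training_minutes(800, 1))` gives `13 h 20 min`, which would span 4 sessions of 240 minutes.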
time will tell
Help
  File "/usr/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.10/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/content/Mangio-RVC-Fork/train_nsf_sim_cache_sid_load_pretrain.py", line 254, in run
    train_and_evaluate(
  File "/content/Mangio-RVC-Fork/train_nsf_sim_cache_sid_load_pretrain.py", line 438, in train_and_evaluate
    "slice/mel_org": utils.plot_spectrogram_to_numpy(y_mel[0].data.cpu().numpy()),
  File "/content/Mangio-RVC-Fork/train/utils.py", line 231, in plot_spectrogram_to_numpy
    data = np.fromstring(fig.canvas.tostring_rgb(), dtype=np.uint8, sep="")
AttributeError: 'FigureCanvasAgg' object has no attribute 'tostring_rgb'
mangio is hella outdated just get something more recent
Last update: Apr 01, 2024
What?, I used it earlier today and it was working perfectly!
I did the procedure I always do on all my models!
Could it be that I created a new account?
RVC v2 Disconnected?
I have the same error on mine
Yes
This just happened now @narrow nova
I really hate Applio
It always disconnects by itself and I end up losing everything
isnt mangio rvc fork local?
yes colab
Yes
Ok cant help wait for someone else
I create my models on my cell phone
But rvc disconnected & mangio rvc are really not recommended these days
i cant stop you from using it so goodluck
I know that
Unfortunately Applio is very unstable and always disconnects
@pastel oak Is there any version of Applio without UI?
you dont
not that i know of
check dms i sent u a pic
noUI Applio Colab
If you are facing RVC Disconnected errors try this fix
#📑│making-models message
or pip install matplotlib==3.7.0
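Pinning works because older matplotlib releases still ship `FigureCanvasAgg.tostring_rgb`, which the old training script calls. Below is a small sketch for checking whether a given matplotlib version is expected to have that method. It assumes the method is gone from the 3.10 series onward, which fits the timing of the errors here. The helper names are hypothetical.

```python
def parse_version(text):
    """Turn '3.10.0' into (3, 10, 0), ignoring suffixes such as 'rc1'."""
    parts = []
    for piece in text.split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits or 0))
    return tuple(parts)


def has_tostring_rgb(matplotlib_version):
    """True if this matplotlib version should still provide tostring_rgb.

    Assumption: the method is no longer available from 3.10 onward.
    """
    return parse_version(matplotlib_version) < (3, 10)
```

In a notebook cell, passing `matplotlib.__version__` to `has_tostring_rgb` tells you whether the pin is still needed for an unpatched script.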
I don't really understand the process, I'm very confused
I train on a cell phone and it's a bit confusing. Can you explain the process to me in more detail?
u need to add that line like this:
only that one?
yep
oh, thanks a lot broh
Man when using weights the models are eh
thanks,it works
Yeah it's 2.4 GB now
Is it used in the training step? i.e. will it affect the .pth file?
hi, does someone know a uvr model that removes the sound at the end of my audio pls? i already tested a lot of models with this audio, but someone told me to ask here where i can find a custom model
can somebody help me i wanna train my voice in applio
Could anyone help me? training does not start
guys what is "epochs 300"?
Epoch: One full pass over the dataset during training, so a setting like "epochs 300" means 300 such passes. It's not possible to say precisely how many epochs you need for your dataset; you need to monitor the TensorBoard graph to know if your model is overtraining.
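To make the relation between epochs, batch size and training steps concrete, here is a short sketch. The segment count of 400 is an invented example value, and the helpers assume the last partial batch is still processed, which a given trainer may or may not do.

```python
import math


def steps_per_epoch(num_segments, batch_size):
    """Batches needed to see every dataset segment once."""
    return math.ceil(num_segments / batch_size)


def total_steps(num_segments, batch_size, epochs):
    """Batches processed over the whole training run."""
    return steps_per_epoch(num_segments, batch_size) * epochs
```

With 400 segments and a batch size of 15, one epoch is 27 steps, so 300 epochs come to 8100 steps.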
The same thing is happening to me too

maybe I hope not but did they ban him?
Why r u using mangio fork 😭
suddenly someone had the matplotlib-related issue, then it got fixed and more ppl are using it
you two pls read the pinned message about matplotlib fix here, or you can try my tweaked version here: #1159290752195633273 message
but anyway I'd recommend the latest Applio 3.2.8 colab/kaggle with more features and improvements
-gui
I feel like this should be said in #📰│dev-updates
People never check pins
ye
can u do it? till Kit Lemonfoot fix it
Sure
Have you already told Kit about that?
I pinged him in #💬│staff-chat
Oh, nice
Maybe it's better I say to either use the modified link in #1159290752195633273 message or adjust the fix themselves
Bc I feel like telling them to upload the file will just make it harder for the average user
real
use this pic if u want
I got this
hello, where can i find a custom model to remove this noise at the end of my audio with uvr5 pls ?
ty
hmmm lemme try rq
Should I also specify that people can ofc use alternatives?
I genuinely thought fewer people would be using RVC Disconnected since there's Kaggle
ye ye
Yeah
I genuinely thought people preferred UIs more tbh

I put it as is like this: !pip install matplotlib==3.7.0
The UIs are great! The problem is that colab disconnects and gradio stops working😕
Btw we've been encrypting the code for a while now, which solves that issue
Did you have that issue recently or heard it from others?
Because that used to happen before, but shouldn't anymore
I use my cell phone so that must be why
Yeah, it happened this week when I was creating a cover.
maybe
cause on phone u need to switch the browser tabs on UI ones
I usually leave the tabs like this
What was the exact warning?
disallowed code execution? Inactivity?
Reconnection
When the colab reconnected I had to reload my gradio
yooo that's was rq
yo can someone help me rq?
Hey, Jre! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
That happens for inactivity, so it’s because of the mobile tab, not because of the UI
if it was for the UI, it wouldn’t automatically reconnect
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
be specific
u face a warning message right? If so press cancel. Warning says "U need to restart or something"
And continue
everything by my rvc seems to be correct like the setting but it just simpy isnt working
which
what error
You have to be specific
I see, I wish I could find a better mobile browser.
what pc gpu?
like the voice changer isnt working, i can hear myself but the voice changer isnt applying
wrong channel, rvc isn’t a realtime voice changer
share the tutorial link you followed in #🔍│help-w-okada
oh, i hit restart haha, thanks
||R - Realtime
V - Voice
C - Changer
||
hell no
xd
Hi, I have a question: is the Hugging Face Pro plan not useful for using Ilaria_RVC?
Now it comes out like this, without needing to put: !pip install matplotlib==3.7.0
the creator updated it
well, the huggingface pro plan will give you more usage time for zerogpu iirc
I see, because I restarted Google and everything came out.
I don't understand why I'm always out of credits even with the Pro version, I can only generate once every 15 minutes even with a Pro subscription?
U should contact HF staff for that
I saw ur ticket and it's kinda weird how ur Zero GPU time is full
Okay, its not normal so ?
and still can't use it
The Zero GPU usage updates every GPU request that u do, so ya it's weird
Okay
Unless the Hugging Face system is just glitching out.
You can somehow run Ilaria RVC (from Hugging Face) locally, but it's kinda complicated to get it to work this way, as you'll need to download the entire repository using Hugging Face Hub on Python.
https://cdn.discordapp.com/attachments/1159289354439626772/1326182820430745682/image.png
I realized 2 FFmpeg .exe files were missing from this repository, so I added them using ones from Ilaria RVC from GitHub. The first time launching on my laptop, it just picked up FFmpeg from C:\FFmpeg rather than from inside the Ilaria RVC folder itself.
With an RTX 2060, it's complicated to run it locally?
does anyone have the links from cloud RVC?
I don't think the author made it to run on NVIDIA GeForce GPUs. In the terminal, it says something like "T GPU", which I think means it's supposed to run on an NVIDIA Tesla GPU like the A100 or H100.
it works fine on my 4060 ti
Ilaria RVC from Hugging Face or GitHub?
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 🆕 UVR5 UI, by Eddy, ArisDev & Nick088 Kaggle
- 🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
- 📖 How to use RVC Mainline on Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
oh
The Ilaria RVC version I'm talking about is the one that's used in its Hugging Face space, and I downloaded it in a dirty way.
on colab, how many generations can you do? Is it like HF?
per account if u re out of GPU just switch acc
All right so its better than HF ?
in terms of inference time ye

google colab gpu is slower
if you meant about having more time to use it, or uploading longer audio files, then yes
but zerogpu huggingface time is faster
unless its a cpu one
ye i mean this
It free on colab ? and do you have a guide ?
yes, and guides are in https://docs.ai-hub.wtf
Last update: Oct 21, 2024
wait you got a 2060 gpu
why don’t you do it locally on your pc instead?
huggingface space and google colab are meant only for people who got a bad pc, they are cloud computing services
Dont know how to install it xD
locally, it will be harder to setup and won’t be as fast as huggingface for example, but you will have unlimited time
still in that docs i sent you, there is a guide for local too
i would suggest you applio
I will check, thank you bro
yw
Does anyone know what this means and how to fix it?
2025-01-09 [E:onnxruntime:Default, cuda_call.cc:116 onnxruntime::CudaCall] CUDA failure 35: CUDA driver version is insufficient for CUDA runtime version ; GPU=1040 ; hostname=DESKTOP; file=D:\a_work\1\s\onnxruntime\core\providers\cuda\cuda_execution_provider_info.cc ; line=125 ; expr=cudaGetDeviceCount(&num_devices);
Yes you can, you just need a pre-compiled fork and VxKex to run it
Does anyone know which TTS is best?
depends on the language
are you sure your drivers are updated
yes, and i have amd
you have an AMD GPU?
Are you on windows? Which GPU exactly
what does "AttributeError: 'NoneType' object has no attribute 'tobytes'" mean
and ""HTTP/1.1 500 Internal Server Error""
i have rx 6700 xt GPU
i tried another version (std instead of cuda) and it works now
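The error above is what a CUDA build reports on a machine without a usable NVIDIA driver, such as one with an AMD card. Below is a hedged sketch of choosing an ONNX Runtime execution provider from a list of available ones. The preference order and the `pick_provider` helper are assumptions for illustration, not how the voice changer itself selects a backend.

```python
# Provider names as used by ONNX Runtime; the order is an assumed preference.
PREFERRED_ORDER = [
    "CUDAExecutionProvider",   # NVIDIA GPUs
    "DmlExecutionProvider",    # DirectML, e.g. AMD GPUs on Windows
    "CPUExecutionProvider",    # always-available fallback
]


def pick_provider(available):
    """Return the first preferred provider that appears in `available`."""
    for name in PREFERRED_ORDER:
        if name in available:
            return name
    raise ValueError("no known execution provider available")
```

In real use the list would come from `onnxruntime.get_available_providers()`. That list reflects what the installed build was compiled with, so on an AMD card it is the switch to a non-CUDA build, as done in the chat, that changes the outcome.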
are u talking about RVC Disconnected? check #📰│dev-updates
are you talking about Wokada (aka MMVCServerSIO or client)
this is the wrong channel, RVC is not realtime voice changer, wokada is
use #🔍│help-w-okada
What's the difference between training model services and personal AI Voice model? ^^
..nothing?
i wonder whats going through your mind to write this question out of context out of the blue
like what do you need what do you want
You sure~
Someone is offering both in the #1191429836321849435
fixed lol
but ye i haven't used this for some time, forgot what i needed to install
Third one down I believe
i initially installed the other one yk
You need the client. It comes with everything you need
They changed it a bit, but it's still technically the same thing, I'm assuming
ye i installed it
also
where do i post a photo
can u tell me if my advanced settings are alr lol
Probably just a labelling thing everyone calls it differently
youre talking in the wrong channel
oh sorry
Hello, is there anyone who can help me with the error: FileNotFoundError: [Errno 2] No such file or directory: '/content/Mangio-RVC-Fork/logs/
pinned msg
I didnt even read the error I just read mangio rvc fork and assumed its the same error, though you might have to do it either way
it cant find the folder, so the folder doesnt exist. check again
The folder is created and the index is created, what isn't created is the pth.
is there a way to use realtime rvc gui on roblox?
like so people on voice chat could hear?
Realtime rvc GUI isn't suggested
Old
Wokada deiteris fork is better
is that realtime
where can i download pls?
What's ur PC gpu
my error is : FileNotFoundError Traceback (most recent call last)
<ipython-input-18-5d26394eab49> in <cell line: 86>()
86 set([name.split(".")[0] for name in os.listdir(gt_wavs_dir)])
87 & set([name.split(".")[0] for name in os.listdir(feature_dir)])
---> 88 & set([name.split(".")[0] for name in os.listdir(f0_dir)])
89 & set([name.split(".")[0] for name in os.listdir(f0nsf_dir)])
90 )
FileNotFoundError: [Errno 2] No such file or directory: '/content/Mangio-RVC-Fork/logs
nvidia geforce rtx 3050 oem
tysm
What Google colab u using
I don't know.
Send the link of what u using
What is the latest version of rvc?
hey if i have raw studio session audio files but you can hear the backing instrumental through headphone leak what mvsep model would you use to clear that
also what settings in rx 11 to clear noise pre train
English and hindi.
Hi, is it not possible to download rvc anymore?
I keep getting this error, but everytime I restart the runtime it happens again, and it always disconnects before I get to do anything. I'm using Hina_Mod_AICoverGen_colab.
Which rvc
anyone have sunoAI ?
So does it matter to the .pth file whether I used faiss or kmeans for the .index file?
no
how can i make my own voice a model that i can use
what’s ur pc gpu
its bad
what is it
What will happen if I include a few seconds of breathing in my dataset?
Nothing, it'll do fine. In fact, breathing samples in a dataset is a good choice
( provided they belong to the voice / whoever's the model's voice provider. If it's from someone else, can't promise anything )
the old guide that suggests removing breaths is kinda misleading
^
How many seconds of speech and laughs aside from vocals and rap?
well, the deal is always balance but if we speak of misc such as breathing, there's no rule
same goes for laughs and speech tbh
Cause I want to include so many things also does KLM x4 support that?
Key is balance so the model does not bias towards one and not the other
Not sure about KLM's behavior exactly as I do not maintain it ( aka, ain't sure what it does support and what not. I haven't tested it )
How many minutes or seconds
Uhm
It depends man, in world's ideal scenario, you want to have equal balance for each subtype of content
we can't predict how things come out really
Wdym
well, laughing, breathing, moaning, spitting whatever really, is a subtype
I can only give you an example of how I'd manage it
5 mins of diverse talking, 5 mins rapping, 5-7 mins of singing, 1 min of breathing ( good quality )
Mhm I actually use the minutes randomly like
was just an example
by no means was it in " sequential manner "
not that it'd matter anyway, all gets chopped inside and mixed / shuffled
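The balance idea can be checked with a few lines of arithmetic. This is a sketch only. The 0.5 threshold for flagging a dominant content type is an arbitrary assumption, not an RVC rule, and the example uses 6 minutes as the midpoint of the "5-7 mins of singing" figure.

```python
def content_shares(minutes_by_type):
    """Fraction of the total dataset duration taken by each content type."""
    total = sum(minutes_by_type.values())
    return {name: minutes / total for name, minutes in minutes_by_type.items()}


def dominant_types(minutes_by_type, threshold=0.5):
    """Content types whose share exceeds the (assumed) bias threshold."""
    shares = content_shares(minutes_by_type)
    return [name for name, share in shares.items() if share > threshold]


# Example mix from the message above, in minutes.
example_mix = {"talking": 5, "rapping": 5, "singing": 6, "breathing": 1}
```

For `example_mix` no type is flagged, while a set such as 1 minute of laughing against 15 minutes of speech flags speech as dominant.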
In my case some rap verses 4 vocal songs
Mhm yes but the person has so many voice variations.
Will the training support that
Deeper, higher
I don't mean pitch
wdym deeper / higher if it's not pitch
5 minutes laughing, 10 speech?
well no, that's a bad idea
Well how do i explain
Laughing should be treated as misc, as an extra
not the primary purpose of the model
A person can go from higher and louder tones
well ye, that is a pitch
And lower deeper
" higher, lower, tone "
that's a pitch
Hmm yes will the training support that stuff as well
It supports anything you throw in
as long it's all balanced and enough of it ( not too little, not too much ) it'll do fine
In ideal situation, yea
All comes down to how well you train it and how good is the set
How much speech And laugh
Cause I'm thinking of making ICP covers the laugh is required
No idea man, as I said, neither we nor anybody else in here can give you any estimates
😮
machine learning like this is not deterministic where you can estimate how much of X is needed
better if you just trained a few times and see how it goes in various set proportions
Yes but I wanna include the person laughing
^
This applies to anything
And talking
seriously man, people used to make damn chainsaw or water drop models
You say not too little, not too much so 5 mins of laughing?
And how many of talking
no..
too little can be 1 min of laughing vs 10-15 mins of speech
you get the deal
if you overdo one type of content, the model will bias towards it
Will that be enough
Yea so just try some proportions, do a few training runs and see how it goes
Making a good model needs some testing, naturally
5 mins of laughing 10 of speech?
I suppose you can try ^
if that fails, try 2-3 of laughing and 8-10 of speech
and if that failed too, 5 of laughing, 6-7 of speech
Because when I convert into RVC the laughs sound so unnatural like aehehdjdo
I don't wanna waste time tbh
Then you won't get your " perfect results "
Uh
My best model was a matter of 2~ months worth of experiments
you can't toss all in, do a 1 run of training and expect miracles.. that's just the truth
You got so much time on your hands
What's the problem of giving 40 mins to 1.5h of training
Uh really
It's not my first time training either
Can I send you the dataset one day?
for?
as in, y
Cause like, I don't really do datasets inspection
( Nor would it help the case in any way. )
So you can check it
All you gotta know is that I haven't made and don't maintain custom pretrains ( which supposedly do or do not support laughing or whatever )
the core concept of rvc is to handle speech or singing, that's the og premise. Whatever customs do, is out of my reach.
Now, if you intend to perfect out or nail experimental stuff or things that are known to rather fail most of the time, you gotta put some time and effort in
And tell me what's unnecessary or not
Well, I don't really think it's necessary
You said it's not your first time training, right? so you should know the deal
and yeah, I can't tell you what to include and what not to; without experiments you can't really estimate anything or guess how the model would perform
If you don't trust your skills or are short on time, just commission me or anyone else, see what'll come out of it
else give it some time and work, nail it and be happy, ye
no pain no gain as they say lol
^
Sorry man, that's not something I am up to
It's okay
Hi, on Applio, can we not record vocal longer than 15 seconds? Every time I try to make a vocal longer than 15 seconds, this error pops up
why wont you actually use a software intended for recording?
at least use Audacity
@simple ore I want best tts for English and hindi language. I want to use it locally. I don't want to use any website or any software like that. I need a fully open source project. Do you know any tool ?
for english the usual recommendation - F5-TTS, Fish Speech, GPT-SoVITS (needs training)
for hindi it is tricky, perhaps the fairseq vits from facebook
or you need to train the models above
its not opening i got python and shit
Do you know how to train custom models and how to install all these stuff
not for fairseq
How can I learn. Is there any youtube tutorial or any server?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Does RefineGan work on rvc disconnected?
no
trot
what
What does that new coach do?
you mean colab?
RefineGAN, what does it do?
alternative vocoder for rvc
is there a Brazilian Portuguese tutorial for downloading the recorder?
hey guys!
im very confused on how make an ai model, i got the vocals and all those things
but idk if i have to download a program that uses gpu or alot of cpu
to train it
honestly i was actually wondering where you even put in the models
is it elevenlabs?
sorry that i'm not helping i'm just hoping someone knows
sorry?
the best plan for elevenlabs
idk about elevenlabs
oh mb
RVC
does that work with tts?
yeah but the tts sounds kinda robotic
yeah i'm mostly looking for tts
oh
custom tts, no i dont think so
many custom tts i've seen sound like throat cancer
like 15.ai
rip
yeah, who knows what the guy behind 15.ai is doing
its been offline for like a year now
honestly i'm just trying to find a good tts
there arent many in the field if im gonna be honest
the most you can do
is use elevenlabs to generate generic speech
and then use RVC like ilaria rvc
to turn it into a character you wish it to be
There are different Text To Speech (TTS) AIs:
GPT-SoVITS: RVC isn't as good as GPT-SoVITS for TTS, but GPT-SoVITS (few-shot TTS, meaning it needs just a little training per model) can't use RVC models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check out GPT-SoVITS instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/
Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's an easy, mostly premium way to get good quality TTS
FishSpeech: FishSpeech is a zero-shot (no explicit training needed) TTS. If you've got a good PC you can use it locally, else use their site
With RVC Models:
RVC is natively Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS: they use Microsoft Edge TTS to generate the TTS audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use.
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
- Ilaria RVC Zero (running on an A100 GPU, the fastest free RVC on cloud) and the guide
- Use Applio UI Colab (with the Google Colab free T4 GPU and its daily limit)
- if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
thanks for helping
this might help
lmao
I mean, you're right that 11labs is the best tts
except that it's paid
nah i've got the money
it makes the voice sound a bit different
and you cant even control which accent it develops
Then you could just use 11labs, you can train models directly in that iirc,
Or try as he said to use it as an input for RVC
Honestly I haven't used it since I like more Open Source tools
alright
so i'm guessing you can't tell me which plan is best either?
...sorry that came off rude
nah it didn't dw, I was actually tryna check their plans, but maybe it's better you ask someone who actually used it
I don't want to give you bad info ofc
hello EA Esports
rvc disconnected has been fixed ye #📰│dev-updates
alr cool
im trying to make a heavy 2007 ai model
but idk what to do with steps and epochs
epochs im guessing i can figure out using tensorboard, but ive never used it
and as for steps, idk what they are
epochs = the unit for counting training cycles: how many times the model has gone through your whole dataset
there isn't a right amount
i thought there was
im trying to make my model not undertrained but not overtrained either
step = a single update to the model's parameters (one batch)
nope, there isn't, https://docs.ai-hub.wtf/rvc/resources/training/
Last update: Dec 24, 2024
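To make the epoch/step distinction above concrete, here is a minimal arithmetic sketch. It assumes one step processes one batch and that a final partial batch still counts as a step; the segment count and batch size are made-up numbers, not values read from any RVC fork.

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """Parameter updates needed to go through every dataset segment once."""
    return math.ceil(num_segments / batch_size)


def total_steps(num_segments: int, batch_size: int, epochs: int) -> int:
    """Total parameter updates after training for the given number of epochs."""
    return steps_per_epoch(num_segments, batch_size) * epochs


# Hypothetical numbers: 500 preprocessed audio segments, batch size 8.
print(steps_per_epoch(500, 8))   # 63
print(total_steps(500, 8, 150))  # 9450
```

This is why the same epoch count lands on a different step number on the tensorboard x-axis when the dataset size or batch size changes.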
btw don't share datasets here
ok
yes
Should I load dataset in rvc Disconnected when resuming?
question, is there a guide out there that suggests a general total training time/epochs for a dataset length? like "if you have a 20 minute dataset, train for x minutes/epochs"
No, you only need the folders with the data from the first training run, loaded with the "2333333" import-model-for-resume cell
To resume, skip all the cells in the "preprocessing" section and go straight to the training one.
whats a website that has real time voice changers?
weights.gg isn't real time right?
no, it is a random process and nobody can tell you a good estimate
Is there any RVC Text to speech program out there that doesn't require a ton of github usage and python stuff?
I can't always afford to talk into it.
Applio can do this.
And also Ilaria RVC on Hugging Face can do this.
Thank you for being so quick, gah damn.
W-Okada is a realtime voice changer program. For Weights, I've already told you in #🧬│ai-chat.
Voice.ai also has a realtime voice changer service, but it's a real scam and we never advise using it.
Which voice model should I choose to insert the .json file?
What is the point of that?
If it's some kind of RVC fork program that uses .json file for a voice model to display its information, I don't know. But Applio doesn't seem to do that.
I wish I could upload a photo
If it's W-Okada, the only .json file found inside a voice model folder is params.json.

just start training with given batch size and see how long the first few epochs take
Is it normal for it to take forever to launch from the bat?
If it takes too long to launch, even with faster CPU, it's not normal.
It loaded eventually, cmds are weird sometimes.
Can you screenshot that?
There's nothing to screenshot, it loaded.
Really? That's fine.
Where do you put weights/logs for this btw
ah, aight
o/
i was on my way to train a model using google colab, but two combined datasets have 12 and 15 minutes. however, when i'm training using "rvc v2 disconnected", it keeps on writing
TF-TRT Warning: Could not find TensorRT
what is the recommended setting for training a good model?
(can't send the dataset in question because copyrekt moment. total of these voicelines when separated are 500 files)
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- FaceFusion UI, by Nick088 Google Colab
- FaceFusion NO UI, by Nick088 Google Colab
- EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
how to configure work for a video card?
RVC or W-Okada?
1 min
Sorry, but I don't accept direct messages from random users if it's nothing private. To talk about W-Okada, go to #🔍│help-w-okada.
ok ok
i am getting this error while making cover
is there anyone who can guide me to fix this error
2025-01-11 10:58:57.4020271 [E:onnxruntime:, sequential_executor.cc:514 onnxruntime::ExecuteKernel] Non-zero status code returned while running Pad node. Name:'/rmvpe/mel_extractor/Pad' Status Message: CUDA error cudaErrorNoKernelImageForDevice:no kernel image is available for execution on the device
have solution?
How can i use these models for text to speech locally?
Applio can do RVC TTS locally.
-rvc
Suggestions for @quasi ginkgo
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
great thank you!
@tight ether I keep getting errors when trying to train with KLM x4
errors?
does the error occur during training?
I can't train
It seems like it might not recognize multiple speakers... I haven't tested it on disconnected, so I'm not sure.
Where do i place the ckpt and pth files in applio
Yesterday worked but I couldn't resume
huh
What
I should use Applio or what
Ckpt and pth files? RVC voice model only use pth and index.
yeah, try it on applio or codename's Fork.
Uh seriously... RVC Disconnected is outdated then?
I'm not sure about RVC Disconnected, but "RVC GUI" fork is indeed outdated.
not really, but we can't know all the updates the developers made, so it’s safer to use a fork that has already been tested.
okay thank you!
x4 was trained with the latest version of applio so.. yeah
Oh okay
@simple ore sorry for ping bud, but can you help?
Hey, SeoulStreamingStation! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
Okay then!How do I resume w applio???
-guides
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
How do I give the dataset path?
What's the right dataset path and how do I resume
are you going to train it locally?
On Applio Colab
colab?
Yes
https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb?authuser=2#scrollTo=iwDUiyus9F3l there's a training tab but how do I set dataset path & how do I resume???
I've never used colab before. 
Uhhh
you should train with either og pretrain or KLM using applio
How do I resume on Applio and how do I set the correct dataset path?
resuming from a different application might break the model, better to start the model over within applio
How do I resume with applio I asked
I still haven't started training
Also I asked how do I set the correct dataset path
the guide doesn't explain it?
No
Which guide is that???
Can you link the guide
Also how do I set the right dataset path
ditto
i have read the guide but rvc v2 disconnected does not work, and not because it exceeded the time limit
(well, dataset path was even right but seems like it doesn't work.)
i was lost about how many good epochs to train for that large dataset
12 and 15 minute dataset; then it's 28 minutes in total
it is a skill problem
aka not running feature extraction
@brittle wing make sure you're not skipping steps
Hello how do I import the right dataset path on Applio colab also how do I resume training and ... Where do I input the pretrain links on Applio Colab
I ran everything necessary
Yesterday It worked idk what happened now
🫠
tensorboard looks broken af
can somebody help me out? Am trying to troll my friend that am a girl in an alt account , but he doesnt believe me until i say "hi im evelin and this is my voice" , can somebody help me out with an AI girl voice saying that 💀
if you trained with applio, you just start training again with more epochs
if you trained with something else, you are breaking new horizons then
but dont worry, it is simple
Mhm I'm used to RVC Disconnected but I only can inference w applio
prepare dataset, extract features, then put d/g weights from rvc disconnected into the model's folder
I'm asking how do I train w applio, where do I put the pretrain links? I checked in the training tab...
applio colab or what?
ui/no ui?
pretrain is a starting set of weights, if you are resuming training, it is no longer needed - resume starts with D_xxxx.pth/G_xxxx.pth
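As a rough illustration of what "resume starts with D_xxxx.pth/G_xxxx.pth" means in practice, the sketch below looks through a model folder and picks the highest-numbered discriminator and generator checkpoints. The `D_<number>.pth` / `G_<number>.pth` naming follows the messages above; the function name and folder layout are hypothetical and not taken from Applio or RVC Disconnected.

```python
import re
from pathlib import Path

# Matches names such as D_400.pth or G_2333333.pth (naming assumed from the chat).
CKPT_PATTERN = re.compile(r"^([DG])_(\d+)\.pth$")


def latest_checkpoints(model_dir):
    """Return {'D': Path, 'G': Path} for the highest-numbered checkpoints found."""
    best = {}
    for path in Path(model_dir).iterdir():
        match = CKPT_PATTERN.match(path.name)
        if match is None:
            continue  # ignore .index files, configs, logs, etc.
        kind, number = match.group(1), int(match.group(2))
        if kind not in best or number > best[kind][0]:
            best[kind] = (number, path)
    return {kind: path for kind, (number, path) in best.items()}
```

Calling `latest_checkpoints("logs/my_model")` (a made-up path) returns the pair a resume would be expected to pick up; if either key is missing, there is nothing to resume from and training would start from the pretrain instead.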
Is this already overtrained I can't believe it
I mean
12 minutes data w KLM 4.3 x4 Pretrain
The other graphs decreased idk
???
What's this
all files are 44000khz but is 40k good enough?
Uh yes
nice, thanks
i think i'll spend the whole night finding my way to train a model with that large set
😭
but tensorboard says it's all blank
censored the name because it's for my oc/private purposes
44000Khz? That's a huge number. I've only heard of 40000Hz and 48000Hz being used to train a voice model.
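If it's unclear what sample rate the dataset files really have, one way to check is to read the WAV headers. This is a minimal sketch using Python's standard `wave` module; it assumes the dataset is a flat folder of PCM `.wav` files, and the function name is made up for illustration.

```python
import wave
from collections import Counter
from pathlib import Path


def sample_rates(dataset_dir):
    """Count how many .wav files in the folder use each sample rate (in Hz)."""
    counts = Counter()
    for path in sorted(Path(dataset_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as wav_file:
            counts[wav_file.getframerate()] += 1
    return counts
```

A result like `Counter({44100: 500})` would mean every file is 44.1 kHz, so a 40k model is trained on audio resampled down from that.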
it's fine
refresh
(i don't have time then i have 1 hr and 40 minutes)
i'll make a help forum post for this tomorrow
praying for my model tho
i have 500 files of dataset
then i separated it into 250
tried, but it stays 0%
How many minutes is your dataset?
Have you cleaned it properly?
Hv I overtrained
file 1: 12 minutes
file 2: 15 minutes
pinging me is fine so i need attention (busy btw)
not all of them, because some voice lines are muffled, some are from a call (no bass even if i use an equalizer), and some have bleeps (yep, it's from a visual novel, if you ask), which i thoroughly curated
no
Welp.... rippy rippy.
Hi I was just asking if my model is finished
i might clean those with a better start so i need to clear it
With a bit of luck the model may come out decent anyway if these noises and sfx aren't too present in the dataset
how many epochs for 12 to 15 minutes?
i asked for a friend, then he told me like 450-500 epochs
Nope.
That's wrong.
see the tensorboard
Sir, I'm sorry to disturb but
Can you tell me
The final epoch amount will depend on the tensorboard.
Exactly.
i did 128 instead of 64 in crepe hop length
you should also test several interesting model checkpoints
Did you use rmvpe or mangio crepe for pitch extraction?
usually rmvpe and i don't touch others
Mhm yes I'm going to rn but what can you tell me by looking at the graph
mangio crepe uses the hop length, not rmvpe
oh
in my testing crepe hop length 128 is very similar to rmvpe, almost exactly the same
tho if you want to use crepe use a hop length of 64 since thats when its better than rmvpe in terms of accuracy
they said their tensorboard wasnt working iinm :p
but it has 0 effect if you used rmvpe
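For a feel of what the crepe hop length numbers mean, the sketch below converts a hop length in samples into the time between pitch estimates. The 16 kHz default is an assumption about the rate the pitch extractor sees, not something stated in this chat, so treat the milliseconds as illustrative.

```python
def hop_to_ms(hop_length: int, sample_rate: int = 16000) -> float:
    """Time between consecutive pitch estimates, in milliseconds."""
    return 1000.0 * hop_length / sample_rate


# Smaller hop length = more pitch estimates per second (and slower extraction).
for hop in (64, 128):
    print(f"hop {hop}: {hop_to_ms(hop):.1f} ms")
# hop 64: 4.0 ms
# hop 128: 8.0 ms
```

Under that assumption, halving the hop length doubles the number of pitch frames crepe has to compute for the same audio.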
I can tell due to the graph that your dataset wasnt too clean.
What
Uh but is the model overtrained or what
ah, appreciate it
lyery's reply:
will take it lightly, so i'm taking notes for that
Nah I used a pretrain
this
It's quite clean as can be.
Welp, i use Applio on Kaggle, so tensorboard isn't my problem.
The model is overtraining if the g/total graph goes up
Also there's almost no noise
keep in mind if you use crepe for training, your dataset has to be very clean because crepe is not as robust as rmvpe and handles noise horribly
Still needs training?
i expected some high-quality output when i use my voice (i have my good mic, so dw) so here i am, training this
edit: the expected output has some accurate breathing since it has various evidences in a dataset (ye ifkyk if you're into visual novels 👀)
Wdym
And have you labeled the dataset properly?
i mean if they cant see the tensorboard how are they gonna see if it's ovt xd
Wdym by labeled
I'm not sure, but you can test various checkpoints (pth's) from your model too.
Not sure
What-
Which batch size? Did you clean the dataset?
I mean, applied audio labeling on Audacity.
if you dont want to clean the dataset use rmvpe for training
8 since i'm new to rvc disconnected
edit: total dataset duration when combining 2 files according to audacity is 28 mins 30 secs
thats fine
Huh what I only used Mel-Roformer denoise also I don't have a computer
Hm, it's fine then.
8 is the perfect balance between generalization (model capability of generating audio in this case) and stability
Haha, 8 is my favorite batch size.
tho one thing worth mentioning is that if you really want the model to sound more like the dataset you can slightly increase it, but this comes with the problem of the model having weird pronunciation
bc it loses generalization
Is my model overtrained or not...
i need to finalize the pitch extraction tho
if i'm new to training models, without cleaning it (like only the dataset.zip placed aside experiment folder), can i use rmvpe?
Yup rmvpe batch size 8
Safest options
rvcDisconnected
ocname_bundled.zip
rmvpe, batch size 8
folder structure for tomorrow's training
what's the recommended crepe hop length (scrolling back, is it 64?)
Nope. ._.
for rmvpe it doesn’t do anything
only affects crepe
i thought it's mangio crepe
I see
I can't see the screenshot properly but it doesn't seem like it's overtraining.
mangio crepe is the same as crepe
again use mangio crepe if you want to use it
Then why is the line going like...
If u want to use mangio set a hop length of 64
It's going down after 4.5 steps!
Look
Maybe after 150 epochs it will overtrain
150 epochs in my experience: 20% sloppy
450 in my opinion: 95% accurate since it's completely becoming human because of pronunciation
Not always.
might train tomorrow, thank u all for the advice 
Not always, because all will depend on the dataset, the voice you wanna train, and training settings.
So it will be always random.
How about now?
Started going up after 5k!!!
170 epochs the end?
it has to rise forever
i usually wait 1 hour to confirm it stopped improving
overtraining is pretty obvious when you see it in a graph
Really are you sure
yuh, it just goes up forever
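The "it just goes up forever" rule of thumb can be sketched as a check on a smoothed loss curve. This assumes the g/total values have been copied out of tensorboard by hand into a list; the window and patience values are arbitrary, and the function names are hypothetical. It is only a rough aid, and listening to checkpoints is still the real test.

```python
def moving_average(values, window):
    """Simple moving average, used to ignore step-to-step noise."""
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]


def is_rising(values, window=3, patience=3):
    """True if the smoothed curve increased for the last `patience` points in a row."""
    smoothed = moving_average(values, window)
    if len(smoothed) < patience + 1:
        return False  # not enough data to judge a trend
    tail = smoothed[-(patience + 1):]
    return all(later > earlier for earlier, later in zip(tail, tail[1:]))


# Made-up loss values: falling, then climbing steadily.
falling = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
falling_then_rising = [10, 8, 6, 5, 4, 4, 5, 6, 7, 8, 9, 10]
print(is_rising(falling))              # False
print(is_rising(falling_then_rising))  # True
```

A short bump would not trigger this with a large enough patience, which matches the advice to wait a while before deciding the curve has stopped improving.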
Wdym
I tested it at 150 and it already sounds like the person
graph gonna end up looking like this
It already looks like this...
After 170-180
I downloaded the 170e weight
if ur happy with the result then stop the training
yo guys, im new to this, whats the issue with this where it keeps glitching?
Mhm it sounds like the target person not that I'm happy w the results
#🔍│help-w-okada for realtime/w-okada help
im using rvc
are you talking about realtime voice changer? if so, then that's not RVC, but wokada
be sure to clean the inference audio, remove reverb and any other sound effects
don't inference audio with harmonies
is it possible to remove the inference and reverb with UVR?
i want to get minecraft steve to sing Die With A Smile
There's a lot of harmonies (i think thats what it is) in that song
Yup use mel karaoke to remove harmonies
alr tysm
whats the best sample rate out of 32k, 40k and 48k for the most accurate results with the voice changer, using RVC v2 and a dataset of about 35 mins? (my ai knowledge is from about 1 year ago or more) i use the Mangio-RVC-Fork, so i dont know if that puts me at a disadvantage.
and should i turn pitch guidance off or on if im using a talking dataset
How do I filter my samples for inference?
How can I easily find the voice dialogues used by a game character? In tones and emotions
Overtraining?
It's going down so no
Huh
Thank you for finally accepting me
It's a short dataset
Graph going down = not overtraining
Wow that's super short
And is it overtrained.
It starts to concern me when the line goes like this:
Avg graphs?
RVC Disconnected
AI HUB Docs
