#✨│ai-help
⠀
Local Forks 🖥️
⠀
Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.
Applio
Simplified, suggested for all, by the Applio team.
RVC Studio
Simplified, suggested for all, by SayanoAI.
Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.
AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.
Replay
From the creators of weights.gg, excellent product for everyone.
⠀
gradio is unlikely to be the cause of your problems
oh
then what caused this error to pop up...?
what error?
.
wait ima run it again and show it to u
when i run the last cell i get this message
none of that are errors
Ensure that GPU support is enabled in your Kaggle notebook and that a GPU accelerator is attached to your notebook session. You can verify this by checking the "Accelerator" setting in the "Settings" panel of your Kaggle notebook.
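If you'd rather check from a notebook cell than from the Settings panel, something like this might do. It's only a sketch: it assumes an NVIDIA accelerator and that the `nvidia-smi` tool is on the PATH, neither of which is confirmed above, and `gpu_visible` is just a made-up helper name.

```python
import shutil
import subprocess


def gpu_visible() -> bool:
    """Return True if nvidia-smi is present and lists at least one GPU."""
    if shutil.which("nvidia-smi") is None:
        return False  # driver tools not on PATH, so no usable NVIDIA GPU
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"],  # -L prints one line per GPU the driver sees
            capture_output=True,
            text=True,
            timeout=30,
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0 and "GPU" in result.stdout


if __name__ == "__main__":
    if gpu_visible():
        print("GPU attached")
    else:
        print("No GPU found: check the Accelerator setting")
```

If it reports no GPU, the final cell is probably running on CPU, which could explain a cell that takes hours.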
it took ages to finish running the cell on the message i got
[waited 3 hours and it still didnt finish running the cell yet]
It's taking much longer than usual to successfully run the final cell
how do i make my own model
just open all those links
visit site
that's why you should have read the guide carefully
https://rentry.co/RVC-Mainline-Kaggle
This guide for Mainline Kaggle is an alternative option to the Mainline Colab notebook for training voice models
It is complete and should walk you through every step of the way since Kaggle has a difficult learning curve. However, it will be updated constantly to go over parts that need more cla...
yeah
Is RMVPE+ better than RMVPE?
no, it just has a f0 min/max limit
@knotty moth how do i make datasets
can you train a model with an amd gpu
This may be a dumb question, but i notice on this list that half of the models (ones that are 500mb and 1gb) do not work in my RVC program, is there another program for these and are they higher quality? https://huggingface.co/QuickWick/Music-AI-Voices/tree/main
which one exactly?
hey im trying to finetune text to text AI and now i want to fine tune text to speech or speech to speech AI. does anybody know how to do it? because every time i tried it failed, and there are no good tutorials on the internet
I got a GeForce RTX 3060 in my pc
in that folder there are D/G pretrains
Here is one for example https://huggingface.co/QuickWick/Music-AI-Voices/tree/main/Kali Uchis 30k
Any of them that dont say (RVC) in the title, these are usually much larger zip files than the RVC ones
Anybody know if the voice changer works for macbook? I want to know that its possible before I waste my time.
I am aware it is available on Mac, but unsure if it would work on a macbook.
those dont seem RVC models
config files are different
probably some other singing models
yeah would you happen to know what they are for and would they be higher quality than the RVC ones?
gpt sovits probably
over 1 year ago lol
yeah, probably that
Most of the RVC models on that link are also 1yr ago lol so i didnt know, what is the best place to find the highest possible quality models?
A model is the result of training on a dataset. You can learn more about it in the Applio Docs
Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance
Anybody know if the voice changer works for macbook? I want to know that its possible before I waste my time! apologies if thats a dumb question.
technically macbook with MPS can run pytorch... in reality not so good
hey im trying to finetune text to text AI and now i want to fine tune text to speech or speech to speech AI. does anybody know how to do it? because every time i tried it failed, and there are no good tutorials on the internet
And I hope somebody could help me
I got a GeForce RTX 3060 in my pc
I figured it'd be pretty awful, thanks.
you can try downloading onnx version
that's why #1175430844685484042 and weights.gg are better place to find
you can test weights.gg with their samples
⠀
HuggingFace Spaces 🤗
⠀
Ilaria RVC
EasyGUI port with some improvements, by Ilaria.
RVC-HFv2
Applio port, by r3gm.
AICoverGen
AICoverGen port, by r3gm.
Advanced RVC Inference
Extended version of the GUI with advanced settings, by r3gm.
⠀
guys how do i install it
i cant figure it out
like i legit do NOT know what to download bruh 
Ayo? @unreal hill level 1 !!! 
Ayo? @midnight relic level 1 !!! 
Is that bad
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
-guides
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
possible to do what?
btw for helpers watching this, i already redirected him to the guide in general
i was doing sum training but it was taking like a minute per 1 epoch whats up w that
@low shard nicky wicky
whats ur pc gpu
im doing colab
@hoary nimbus im pinging u in the right channel, whats ur pc gpu
ig should be fine
Time kinda depends on gpu, dataset, batch size
I am using macbook
you can’t train locally (on ur pc) on a macbook
its not powerful + not supported for training, only inference (use models)
As you don't have a good PC, it's better to use cloud (a remote good PC) for training an RVC Voice Model:
- Google Colabs (4 hours of daily GPU for free, not many hours, but easy to use):
  - Applio (UI)
  - Mainline (UI)
  - RVCDISCONNECTED (no UI)
- Kaggles (a bit harder to use and needs a phone number, but gives 30 hours weekly of better GPUs):
  - Mainline (UI)
  - Applio by Vidal (UI)
  - Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (kinda hard, needs login, no issue with web UIs or anything, but only 15 free credits monthly):
hi i need help making an ai voice
Thankyou ! i’ll try this
What's ur PC GPU
one sec let me check
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
AMD Radeon RX 7600
U could try with applio zluda
Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance
ok thanks
make sure you dont skip any steps 🙂
you're welcome
nah skip all the steps 🔥 
How do I use the voice cover thing for a song link on my phone
so, you want to make an ai cover?
-colab
guys how can i fix some machine sound
Which program do you use for auto segmentation?
@urban flint i pinged u in the right channel, whats ur pc gpu?
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "my rvc is not working".
- Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Do you mean dataset segmentation?
You can use Audacity's audio labeling function.
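If you export the label track to a text file, the cuts can also be scripted. A rough sketch, assuming the usual Audacity label export layout (one `start<TAB>end<TAB>name` line per label, times in seconds) and an uncompressed WAV source; `parse_labels` and `cut_wav` are hypothetical helper names, not part of Audacity or RVC.

```python
import wave
from pathlib import Path


def parse_labels(label_text):
    """Parse exported labels into (start_sec, end_sec, name) tuples."""
    segments = []
    for line in label_text.splitlines():
        parts = line.split("\t")
        # Skip blank lines and the backslash-prefixed frequency-range lines.
        if len(parts) < 2 or line.startswith("\\"):
            continue
        start, end = float(parts[0]), float(parts[1])
        name = parts[2].strip() if len(parts) > 2 else ""
        if not name:
            name = f"segment{len(segments) + 1}"  # fallback for unnamed labels
        if end > start:  # point labels (start == end) carry no audio
            segments.append((start, end, name))
    return segments


def cut_wav(wav_path, segments, out_dir):
    """Write one WAV file per labelled segment and return the new paths."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(wav_path), "rb") as src:
        params = src.getparams()
        rate = src.getframerate()
        for start, end, name in segments:
            # Clamp the seek position so a label past the end does not raise.
            src.setpos(min(int(start * rate), src.getnframes()))
            frames = src.readframes(int((end - start) * rate))
            dest = out / f"{name}.wav"
            with wave.open(str(dest), "wb") as dst:
                dst.setparams(params)  # same channels, width and rate
                dst.writeframes(frames)  # frame count is fixed up on close
            written.append(dest)
    return written
```

Audacity can also export the labelled regions itself, so this is only worth it if you want the step automated.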
oh thanks!
What's better for ai covers? Mangio, Applio, or like a realtime rvc voice changer thing.
I tried Mangio and Applio but covers on yt sound a lot less ai-like and the voice overall sounds better.
illaria falls?
what?
im triying to use it but dont work
all rvc versions have the same quality
what matters is the model quality and how much u cleaned the vocals
"Illaria falls" doesn't exist wdym
illatia doesnt work
Whats the best stuff to use to clean vocals?
Reverb remover helps a whole lot probably
Last update: Feb 29, 2024
Is there something i could do in audacity to clean it up more? Maybe make it louder or smth
Oh shii
I didnt know that was a thing
Do you also perhaps know what the best osaka (ayumu kasuga) model is?
I got a 900+ epoch one from huggingface but maybe theres a better one somewhere idk
no, there are over 20k rvc models
Damn
more epochs doesn't mean more quality
the only way to see is try
yes https://docs.ai-hub.wtf/rvc/resources/epochs-tensorboard/ people who train models use the tensorboard
Last update: Feb 10, 2024
Yeh i read that it can have problems with new data bc its too dependent on the stuff it trained on
Interesting stuff
Hey, idk why but my Applio web interface is not creating an index. I checked and it's not there, but the web interface says it has been successfully created, and then I get this error. I'm pretty new so idk if it's okay or bad:
Starting preprocess with 7 processes...
100%|████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:07<00:00, 3.82s/it]
Preprocess completed in 7.64 seconds on 00:03:45 seconds of audio.
Traceback (most recent call last):
File "D:\ApplioV3.2.6\rvc\train\train.py", line 84, in <module>
with open(config_save_path, "r") as f:
FileNotFoundError: [Errno 2] No such file or directory: 'D:\ApplioV3.2.6\logs\my-project\config.json'
An error occurred extracting the index: [WinError 3] Das System kann den angegebenen Pfad nicht finden: 'D:\ApplioV3.2.6\logs\my-project\v2_extracted'
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
An error occurred extracting the index: [WinError 3] Das System kann den angegebenen Pfad nicht finden: 'D:\ApplioV3.2.6\logs\my-project\v2_extracted'
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
An error occurred extracting the index: [WinError 3] Das System kann den angegebenen Pfad nicht finden: 'D:\ApplioV3.2.6\logs\my-project\v2_extracted'
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
An error occurred extracting the index: [WinError 3] Das System kann den angegebenen Pfad nicht finden: 'D:\ApplioV3.2.6\logs\my-project\v2_extracted'
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
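The traceback above is about missing paths rather than the GPU, so a quick existence check before training might save a run. A minimal sketch: the two expected names are copied from the error output above, and `missing_training_files` is a hypothetical helper, not part of Applio.

```python
from pathlib import Path


def missing_training_files(project_dir):
    """Return the expected entries that are absent from a model's logs folder."""
    project = Path(project_dir)
    # Names taken from the error messages: train.py opens config.json and
    # the index step looks for the v2_extracted folder.
    expected = ["config.json", "v2_extracted"]
    return [name for name in expected if not (project / name).exists()]


if __name__ == "__main__":
    # Example path from the log above; change it to your own project folder.
    missing = missing_training_files(r"D:\ApplioV3.2.6\logs\my-project")
    print("missing:", missing if missing else "nothing")
```

If either entry is missing, an earlier step for this model name probably did not finish, so it may be worth rerunning those steps before training again.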
-colab
Can anyone tell me which of these are better
Jammable or musicfy lol for ai covers
Any rvc colab.
Because you don't have to pay for anything.
Jammable is just the same as RVC but paywalled.
You can add them to your output made on RVC using any DAW of your preference.
do anybody got a good link for ai covers i dont like these ones
That's all we have
In Colab I have 3 hours of training available and I have a 15-minute dataset. Can I manage to train the model to 100 epochs?
You can easily train 100 epochs in three hours
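The arithmetic behind that is simple enough to sketch. The seconds-per-epoch number is something you have to measure on your own run (it depends on GPU, dataset and batch size), and `epochs_in_budget` is only an illustrative helper.

```python
def epochs_in_budget(gpu_hours, seconds_per_epoch):
    """Return how many whole epochs fit into the available GPU time."""
    if seconds_per_epoch <= 0:
        raise ValueError("seconds_per_epoch must be positive")
    return int(gpu_hours * 3600 // seconds_per_epoch)


# At roughly a minute per epoch, 3 hours leaves room for 180 epochs.
print(epochs_in_budget(3, 60))
```

So 100 epochs fit in 3 hours as long as one epoch takes under about 108 seconds.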
thanks for your help.
Yw : )
If you want to train for a long time use kaggle
Mobile.
Bit harder than colab but have 30 hours of GPU runtime. Without disconnecting
@late charm
I use a mobile phone, I tried to follow some guidelines and I was more lost than the boy's mother.
It is possible to use kaggle on mobile
Anyways...
Yes, but I get confused and in the end everything goes wrong.
Then use colab
You're here?
Yeah
Hey
Use RVC d
And
Do you want to send the files, dataset and stuff so I can help you and the model gets trained quicker
Do you have a GPU
It is confidential
Which GPU do you have
Tesla T4
The person doesn't want help it's okay
For your local system
But I want 😆
To train a deepfacelab model
Why did you offer to train a model for him
I only know about voice models
Well because it's a voice model
And It's a model I want
I'll tell you how to do. It is used to make professional deepfakes
Sorry I'm not really interested
So you both want a same voice model.
Not a problem
Nope it's a voice model I need
And I wanna help the person train it faster
No, in Colabs. I don't own a computer
Oh.. then you can't help me. It is banned by Google colab. I thought you have it in your local pc
Yeah
Okay tq
Np
-colab
Use #🤖│bots to execute the commands
-overtraining
You can detect if a model is overtraining if the TensorBoard graph starts to rise and never comes back down. An overtrained model will sound robotic, muffled, and won't be able to articulate words well.
Check these resources to learn more about this topic
- Epochs & TensorBoard from AI HUB Docs
- TensorBoard from 🍏 Applio Docs
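As a rough illustration of "rises and never comes back down": if you copy the per-epoch loss values out of TensorBoard, the lowest point of a smoothed curve is a candidate epoch to keep. This is only a sketch with a plain moving average; which scalar to watch and how much smoothing to use are covered in the linked docs, and `best_epoch` is a hypothetical helper.

```python
def best_epoch(losses, window=3):
    """Return the 1-based epoch whose moving-average loss is lowest."""
    if not losses:
        raise ValueError("losses must not be empty")
    smoothed = []
    for i in range(len(losses)):
        start = max(0, i - window + 1)  # average over the last `window` epochs
        chunk = losses[start:i + 1]
        smoothed.append(sum(chunk) / len(chunk))
    return smoothed.index(min(smoothed)) + 1


# Made-up values: the loss falls until epoch 4, then keeps rising.
example = [5.0, 4.0, 3.0, 2.0, 3.0, 4.0, 5.0]
print(best_epoch(example, window=1))
```

Checkpoints saved well after that point are the ones more likely to sound robotic or muffled, but listening tests are still the real check.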
-help
-audio
- Creating Datasets for RVC using iZotope RX11, by Cauthess
- Gathering and Isolating Audio, by SCRFilms ❄
- Instrumental and vocal & stems separation & mastering guide, by deton24
- Vocal Mixing Tutorial, by Roomie
- https://mvsep.com/
When I open the mic in the Counter Strike 2 game, why does the sound delay and lag? is there any way to solve this problem? I use a GTX 1650 SUPER
im tryna make a model and i did background music removal, but the music has singing in it so it's picking up the singing voice. is there a way i can automatically remove that singing, since it's a lower volume compared to the voice im trying to make a model of
bc as u can see those spikes right there is the voice im actually trying to get and its louder than the music in the back
why is my voice so laggy
Can anyone help me create an ai cover?
Hey, ssf! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
@native crystal He doesn't answer me. I need someone to help me create an ai cover in his place
Uhmm where do I post like the wips of a cover or whatever?
how to make rvc own voice model
What's ur PC GPU
Like post ur ai cover? Post it in yt then share the link in #1159290752195633273
rtx 6090 heheheh, only joking bro. just tell me how to do it, like a complete video link or whatever to make my own voice model. i will arrange any gpu
... You seriously need to tell me ur gpu
and no, there is no single yt video for it, all on yt are OUTDATED
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
And training takes alot of computing, it cant work on any gpus
bro right now i have an rx480 4gb, but my cousins have a gaming shop so i can grab any gpu
that gpu sucks
and has low vram
you can't train locally (on ur pc) with that
u have to use cloud (remote good pc)
As you don't have a good PC, it's better to use cloud (a remote good PC) for training an RVC Voice Model:
- Google Colabs (4 hours of daily GPU for free, not many hours, but easy to use):
  - Applio (UI)
  - Mainline (UI)
  - RVCDISCONNECTED (no UI)
- Kaggles (a bit harder to use and needs a phone number, but gives 30 hours weekly of better GPUs):
  - Mainline (UI)
  - Applio by Vidal (UI)
  - Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (kinda hard, needs login, no issue with web UIs or anything, but only 15 free credits monthly):
Kaggle suggested
how do i do it? i am a complete beginner, i don't even know which website to use or what hugging face is used for when training a model

i am a complete beginner, can anyone help me through it completely, just 1 time
Hey, CALLMEYOURX2! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
huggingface is the biggest ai platform, but you cant train on that, i gave you 3 cloud computing services
its better you click on Kaggle Mainline
its a guide to do it on kaggle, which gives alot of gpu time
can you msg me in personal if you do its help me alot plz brothr
i rather prefer we chat there, its easier for both of us and other helpers can also help too in case
first of all its better you read the guide
i clicked on kaggle mainline ui
it directed me to a website, what do i do next
bro it's way too hard, can anybody make a free model of my own voice
@low shard how much you charged for my project
charged?
I didn't charge you anything
If you meant 'how much do you take for a paid commission of a model', i don't do those things
you can either #1159289738314919936 or #1191429836321849435 if u want someone to do it for u
you need to read the guide, it tells u what to do
oh ok thanks
⠀
Google Colabs 
⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.
AICoverGen-NoWebUI
Useful for making covers, doesn't include a UI, by Ardha, fixed by Eddy, Hina and Gdr.
RVC Disconnected
To train new voice models, by Kit Lemonfoot.
EasyGUI
The OG interface, by Rejekts.
⠀
⠀
Download for Nvidia GPUs 
Version 18a cuda
Download for AMD GPUs 
Version 18a directml
Download for Intel GPUs 
Version 18a directml
Download for Mac 
Version 17b Mac
⠀
Hey does anybody know how I can create my own llm to text like chat gpt with cuda?
Is there a website that's good with robotic voices?
-colab
-help
⠀
Settings for Nvidia GPUs 
F0 Det.: rmvpe (suggested for all series)
RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra
Advanced Settings
Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low
⠀
@low shard mines the same as yours but uhd
that wasn't mine btw, random from google
oh
both ones are like that?
NVIDIA GeForce RTX 3050 6GB GPU
bc that's integrated graphics which is bad
kinda better, what are you looking for?
sorry im on a work laptop
i want like to change my voice in the studio to a specific artist
training (making models), inference (using models) on pre-recorded audios, inference on realtime for calls
so, inference on pre-recorded audios right? like u record ur voice in the studio then change it into the ai one
yea
Your pc is good enough to use RVC locally (runs on ur pc), you can choose between those for doing it locally if u really want to:
- Applio: A fork of RVC with some extra features like Applio TTS, same quality tho
- Mainline: The original RVC
but honestly it's not that good, so it will be slow. I'd suggest cloud (remote good pc) like Ilaria RVC Zero, which will also be easier than installing it locally. The only bad thing is it's not unlimited like local, so you will have a ZeroGPU quota for your account
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
thank you
you're welcome
that's good enough and still faster than cpu mode
btw you can even port the repo of ilaria RVC space locally (not to be confused with the old "ilaria mainline")
3050 is colab speed
Ilaria RVC Zero is optimized for ZeroGPU, Ilaria Mainline isn't the old version, its the only local ilaria rvc version
kaggle will be faster
that huggingface space runs on ZeroGPU
which is an A100, alot faster than google colab and kaggle
I thought Ilaria Mainline may have been abandoned, and btw I'm not sure but I noticed that somehow ilaria's rmvpe+ is less likely to produce voice cracks than the og mainline's rmvpe
how long this usually take
Ilaria Mainline may have been abandoned
Ilaria RVC Zero hasn't gotten any updates in 4 months either lol
but I noticed that somehow ilaria's rmvpe+ is less likely to produce voice cracks than the og mainline's rmvpe
That's super weird, are u comparing also the mainline rmvpe or mainline rmvpe+? iirc mainline has rmvpe+ too
That depends on your own upload speed
depends all on your internet, shouldn't take long
oh nvm it worked i thought it was frozen
the og one only has vanilla rmvpe, and iirc rmvpe+ was introduced in applio 2.x
yea ur right
Where do I drop my rvc models in applio local?
logs folder, both the index & pth file
Thx
hello, i have a massive delay between my real life input and eventual output into discord via vbcable, does anyone know how i could fix that?
Whats your GPU, whats your settings, which version of wokada do you have (top right somewhere it says it)
I’m just trying to figure out how people make covers but are able to block out the background vocals
so I did some research and I am trying to make my models sound much crispier. but this made me confused.
so I used my voice for test. I put these settings and exported it.
regularly I leave sample rate as it is. It always sounds normal, but this time I changed it to the same sample rate
the low pitch voice sounds good, but when it gets higher, it gets distorted
any ideas why?
does Ilaria support realtime?
And no, Ilaria Mainline/RVC doesn't support realtime
so this is
how do i use #🔍│find-models
it is quite the opposite
you pick the sample rate of the model, the result is that anything you infer using this model will be in the same sample rate
whichever sample rate of the data set, it will be resampled to the sample rate of the model for learning and to 16khz for f0 extraction
resampling does affect quality, so keep that in mind
the distortion comes from the model never learning how to reproduce high pitched voices
Yep, that's basic RVC logic
funny thing, if you use a mix for male and female voices in the same dataset, the model sometimes switches to a different voice to sing, etc
That's true too.
Anyone having issues where a voice app gets stuck or seems like it isnt loading?
hey friends has anyone tried hooking up their voice model to some kinda shitty noise filter
I found a program called voicemeeter but it's kicking my ass rn
and cantabile another program
right I'm tryna use it with cantabile which lets you use plugins and stuff
it feels like it could work but idk I can't figure it out
sorry for the dumb question but i downloaded a voice model and put the .pth file in the assets>weights folder and it works great. I was wondering if I should do anything with the index file? Thanks very much for any help
if using rvc it goes into logs, I think?
Strange.. I did it before without changing sample rate and it went fine
I use rvc disconnected and applio for conversion
Cuz this was first time this happened to me
I tried 32k sample rate, and obviously it got worse-
I'm confused
my applio is in my E drive but when i use it my C drive loses space, can i make it so my E drive loses space instead?
run WizTree, see what actually consumes your space
alright
1) pip cache, 2) huggingface cache, 3) gradio temp folder
also
that's likely culprits
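To see how big those three actually are without installing anything, a small script could add up the folder sizes. The locations below are the usual defaults as far as I know and are assumptions here (your setup may differ), and both function names are made up for the example.

```python
import os
import tempfile
from pathlib import Path


def dir_size_bytes(path):
    """Sum the sizes of all files under path (0 if the folder is missing)."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            try:
                total += os.path.getsize(os.path.join(root, name))
            except OSError:
                pass  # file vanished or is unreadable, skip it
    return total


def candidate_cache_dirs():
    """Assumed default cache locations on Windows; adjust for your machine."""
    home = Path.home()
    local_app_data = Path(os.environ.get("LOCALAPPDATA", home / "AppData" / "Local"))
    return {
        "pip cache": local_app_data / "pip" / "Cache",
        "huggingface cache": home / ".cache" / "huggingface",
        "gradio temp": Path(tempfile.gettempdir()) / "gradio",
    }


if __name__ == "__main__":
    for label, folder in candidate_cache_dirs().items():
        size_gb = dir_size_bytes(folder) / 1024 ** 3
        print(f"{label}: {folder} -> {size_gb:.2f} GB")
```

If one of them turns out to be the culprit, these tools can usually be pointed at another drive through the `PIP_CACHE_DIR`, `HF_HOME` and `GRADIO_TEMP_DIR` environment variables, though it's worth double-checking each tool's docs for your version.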
Im tryna train a model, does the dataset have to be online, or can it be a local directory?
alr ill def check those out
dataset creator is only needed when you have no access to a local drive
oh
When i did this i got an error
no space
wdym?
well, actually using a space is fine... so perhaps no files in such folder or no supported files?
anyway, it would really help to check the terminal window to see what's happening
yeah, that will do
anyone who downloaded them as html files?
no need for that
read the message i linked u
there are new temp docs to use
-colab
-rt
its asking for an audio file and u are giving it a pth file...
yeah any audio file u want to inference with
the pth file should go into ur assets/weights/
and your index should go into logs/(create a folder inside the logs folder)/
yeah thats where the pth goes
yeah but dont name the folder in the logs folder, "logs" , it may confuse you. instead name it something like your model's name e.g MOZE
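The same placement can be scripted if you move models around a lot. A minimal sketch, assuming the `assets/weights` and `logs/<model name>` layout described above; `install_model` and the example file names are hypothetical.

```python
import shutil
from pathlib import Path


def install_model(rvc_root, pth_file, index_file, model_name):
    """Copy a .pth into assets/weights and its index into logs/<model_name>."""
    root = Path(rvc_root)
    weights_dir = root / "assets" / "weights"
    index_dir = root / "logs" / model_name  # named after the model, not "logs"
    weights_dir.mkdir(parents=True, exist_ok=True)
    index_dir.mkdir(parents=True, exist_ok=True)
    # copy2 keeps timestamps and returns the path of the new file
    pth_dest = Path(shutil.copy2(pth_file, weights_dir))
    index_dest = Path(shutil.copy2(index_file, index_dir))
    return pth_dest, index_dest


# Example call (paths are placeholders):
# install_model("path/to/rvc", "MOZE.pth", "added_MOZE.index", "MOZE")
```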
yeah thats how u infer
-colab
Hi, I have been dealing with this problem for days and still couldn't fix it. I restarted my computer, changed the folder location, I really did everything I could but none of those things helped me. Can someone please tell me how I can fix this error? 
I'm using Applio by the way
did you select the model at the top?
yes
show it
Well... I was testing it again and now it works. But the download failed
Can someone help me with this please?
can someone help me setup rvc for mac if its possible to? im not sure how to do it
what does this error mean? (its in the UVR5 NO UI thing)
I think it does not like the file name
idk how to change it
u can only inference for mac, not train
if u really want to do it locally u can try https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md
or else use cloud (remote good pc)
Hey, I need help. I don't understand why I train a model to 40k and if it reaches 16k I have to train it again to 32k. How does this benefit the model?
" model to 40k" what?
Yeah.
It does not reach the frequency of 20k but 16k.
if you have an audio signal that reaches 20KHz, then you need 40k sample rate model to train
if you have an audio signal that reaches only 16KHz, then you only need 32k sample rate model to train
there's no 'again'
here I have mp3 that is 48k, but the content is below 16k
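The rule in the messages above is the Nyquist criterion: the sample rate has to be at least twice the highest frequency you want to keep. A minimal sketch, assuming the training rates on offer are 32k, 40k and 48k (the function name is made up for illustration):

```python
# Sample rates assumed to be available for training (assumption, check your fork).
RVC_SAMPLE_RATES = (32000, 40000, 48000)

def pick_model_sample_rate(highest_freq_hz):
    """Return the smallest rate that satisfies Nyquist: rate >= 2 * f_max."""
    needed = 2 * highest_freq_hz
    for rate in RVC_SAMPLE_RATES:
        if rate >= needed:
            return rate
    # Nothing is high enough, so fall back to the highest available rate.
    return RVC_SAMPLE_RATES[-1]

print(pick_model_sample_rate(16000))  # 32000
print(pick_model_sample_rate(20000))  # 40000
```

So a 48k mp3 whose content stops below 16 kHz gains nothing from a 40k or 48k model; 32k already covers it.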
I've executed everything and it still fails. Why is this happening to me? 
Hey guys. What would you say is the best rvc real time voice changer currently? I'm using deiteris' Fork for W-Okada rn
Asking cause it's been a couple months lmao
that's the one, maybe some minor changes since then
Good to know. Thanks :)
Did you get a no-feature-todo in the feature extraction?
Bc it seems that there is a problem with your dataset
Are u sure it's in .wav
How do I retrain starting from the same epoch I left off at
you just start the training again for the same model, keep in mind that RVC has a bug that overwrites the latest weights/.pth, so you may want to keep a copy of that
you dont need to run preprocess/extract features step
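Since the latest weights can get overwritten when you resume, it may help to copy them somewhere first. A rough sketch using only the standard library; the folder names in the usage comment are hypothetical, so point it at wherever your fork actually stores the model's .pth files:

```python
import shutil
from pathlib import Path

def backup_checkpoints(model_dir, backup_dir):
    """Copy every .pth file in model_dir into backup_dir and return the copied names."""
    src = Path(model_dir)
    dst = Path(backup_dir)
    dst.mkdir(parents=True, exist_ok=True)
    copied = []
    for ckpt in sorted(src.glob("*.pth")):
        shutil.copy2(ckpt, dst / ckpt.name)  # copy2 also keeps timestamps
        copied.append(ckpt.name)
    return copied

# Hypothetical paths, adjust to your own setup before resuming training:
# backup_checkpoints("logs/MOZE", "backups/MOZE")
```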
Thanks
How to
hello, i am having issues viewing the grad_norm_d graph from the tensorboard. the data points just show NaN and nothing appears. Any known way to fix this?
Did you click on the refresh button?
Yes, nothing changes
can someone help me?
Just ask instead of asking for help
btw helped in #🧬│ai-chat message
All is NaN? your model got f'ked
oh nyo
I've seen it a few times
depends on whether you saved all weights or just latest
with just latest it is dead
you may recover a model you saved every epoch, if you did
fun
i didnt
i guess i trained in a FP32 env THEN switching to a FP16 env killed it
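If you did save a checkpoint every few epochs, the one worth going back to is the last one written before the values turned NaN. A small sketch of that lookup; the `history` numbers are invented for illustration and you would fill them in from your own logs:

```python
import math

def last_finite_epoch(values_by_epoch):
    """Given {epoch: logged value}, return the last epoch before the first NaN/inf, or None."""
    last_good = None
    for epoch in sorted(values_by_epoch):
        if not math.isfinite(values_by_epoch[epoch]):
            break
        last_good = epoch
    return last_good

# Hypothetical logged values, keyed by epoch.
history = {10: 4.2, 20: 3.9, 30: float("nan"), 40: float("nan")}
print(last_finite_epoch(history))  # 20
```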
idk why but when i use Gura the ping goes op to above 10Kms ping, but when i switch its down to 1 or 3ms
you're trying to use iGPU
oh
change HIP_VISIBLE_DEVICES in the bat file
guy
i'm gonna make my first 3 models of the slaughter me funkin opponents
but i have a feeling this will be hard
i gonna make the fnf slaughter me street muppet models
stop using mangio that screws GTX gpus
hello
My name is Arkeshan, shortly Ark, I'm trying to know about what is tensor thingy
in Mangio RVC, I'm training a data set 7.30 mins, is 5000 epochs enough?
it’s basically a tool to check how the training is going, u could see more about it in https://docs.ai-hub.wtf/rvc/resources/epochs-tensorboard/
Last update: Feb 10, 2024
5k epochs are way too much, id go with 500/1k, there isn’t a right amount of epochs but that’s enough time to see how it’s going with the tensorboard
what’s ur pc gpu btw?
rtx 4060
its laptop btw
Man what the
How much vram (dedicated gpu memory)?
uhm 8
also laptop gpus are weaker than desktop ones
I already trained 5000 epochs for a 1.30 minutes data set twice
RTX 4060 laptop and desktop are same die
That's what i said
die??
pretty much 5-10% deviation from desktop gpu die
yes the hardware gpu die?
oh thought u were telling me to die
Sorry man I'm kinda new so could've overtrained a tiny teeny bit
yea usually with the tensorboard people find the right epoch under 1k epochs
I'm trying to master the art of seperating vocals from songs, swapping it with ai , and make all the songs sang by my fav singer
btw more epochs don’t mean more quality
one thing i noticed the voice quality doesn't change
but the regular electronic distortions
and some parts where volumes fluctuates becomes less
You've gone four digits 5 times 🙏
training too much could overtrain which could make worse quality or have no impact so no reason in training more
i saw a video where a guy recommending to train 20000 so i thought i start with 5000 (Guy had 1650 super btw)
never follow yt tuts
they are like a year old
Can I send 2 seperate Comparisons one with 250 Epoch then with 820 epoch
and wtf who would train with a gtx 😭
made fresh today itself
Idk man he just trained around 5 epoch for demonstration purposes in vid
can’t listen to them rn but sure
(i’m at school)
ipad 
(even if i kinda hate apple ngl)
Man still your school allows, 1st world privilages I guess..
even better
Well usually no one finds improvement (in rvc context) after that
so trying all weird stuff out..
Man but the stuff I found was clear especially stuff i noticed is when the data training time improving the pronounciation becomes better
and sometimes the random voice cuts is lesser and the best of it all higher pitch songs are clear
when u train an rvc model, you train using a pretrained model, basically a “model” trained by the creators (even if there are community ones) with like multiple voices in it, it’s done to help people train their models
without that, it would be from scratch, meaning you’d have to train with tons of hours of data and way more time
I have a data set of same song one with 5000 epoch 1.30 minute training set, then now 7.30 minute 820 epoch, the 820 one has better clarity, very less distortions than 5000 with 1.30 minute set
I don't even understand fraction of how this happens😭
Btw training took me to 800 epoch with 7.30 minute set only took about 6 hours so not a big time
fortunately there are pretrains, without them ud really have to train alot more
training pretrains takes alot of time and data
(and power
)
sooo is all getting sorted out for me?
by creators of mangio rvc?
im just saying that you usually wouldn’t need to train with over 1k, or need to train with more than 1 hour of data (if u ever do) as u might have no improvements
btw, mangio rvc is a fork (modified version) of rvc, it's been outdated for a year
Man then which gui you recommend i'm not good with CLI at all
i mean, its still usuable but usually people rather go with more updated things like mainline (the original rvc) or applio (an updated rvc fork)
they all have a UI
where can I download it man... I don't know much about installation either...
i mean, quality wouldn’t change, rvc had no improvements in a year unfortunately and has been left to rot by the original creators themselves
Its more suggested to use them in case mangio does weird errors
ofc u can use mangio too, i was just telling u tho
do I need to put 23 hours of training again??
the main developer rvc boss kinda left it ye, he works now on GPT-SoVITS which is a tts program (while rvc is a sts program)
23 hours of training?
is that how much it took u for doing the 5k epochs?
dang 23 hours are alot
no 9000 epochs
5k epochs on 7 minutes is 4800 too much
- 6 hours of this 820 epoch 7.5 minute set
no
you have ALOT of patience
5000 + 4000 of 1.30 minute different sets
trying to squeeze a bathtub of lemon juice from one lemon
5000 SPB, 4000 my own voice
aight then I'll fix at 1000, because I seem to have a lot of time these days up until february
well so which one you recommend me to download
if u want a simple ui, u could try out applio
I hope my laptop is not gonna die out before I go to uni due to all training.. one time when I put crepe extraction it just started to BSOD🥲

is it better than mangio
don’t train alot
is crepe extraction better than rmvpe? for training>
but crepe seems to be ram hungry and makes my laptop BSOD almost all time
my lap is legion pro 5i, 4060,13700HX
well it still gets updates and the ui could be easier than it, personally yea
it’s better to use rmvpe, it’s less sensitive to noises
oh yes ram is 16 gb
damn u lucky
okk... So man what are the benefits i can see by switching
i got integrated graphicsh with an i3 of 10 or 11 gen
nah it was a gift for my hard work
Btw u could see more about the f0 methods aka pitch extraction in https://docs.ai-hub.wtf/rvc/resources/inference-settings/#also-known-as-f0-they-re-the-algorithms-for-converting-the-vocals
Last update: Feb 25, 2024
copied the wrong thing oops, edited it
technically crepe has slightly better quality but it’s slower and more prone to noises
which is why most people use rmvpe which is still good quality, less prone to noises and faster
i’d suggest to use rmvpe
please ask in #🔍│help-w-okada
i’m not a wokada helper, but if ur following yt tuts, don’t
ty very much
-rt
This interaction has expired, use the command
/guides if you wish to see it again.
its better u follow the 1st written guide
its the wokada fork, which has better performance too
but yt tuts are mostly outdated
yes mine already has noises
thanks for the help
uhm man I was wondering should i jump to applio
yw
so what kind of benefits i'm looking at, less tearing artifacts
like electronic stuff between voice, will those reduce
do I have to train all over from scratch in applio>
technically mangio is less suggested as its old, id suggest applio
If u meant as “are there pretrains in applio”, yes there are pretrains in every rvc
no no, I mean training the specific voice
the models are usable in both mangio and applio
in my case SPB
didn’t u train that model already?
I want to train his voice to swap to songs
yes in mangio
yea, u can use that model in applio too
but will that work with applio i d k how to transfer this file to that
the models can be used in all rvc versions
ohh... thankgod, can I continue to train from where i left(820 epoch) in applio?
They are like interchangeables
I mean if the set is same will there be anydifference in outcome at all
like less artifact output, more clear voice
i don’t really do local, but technically u should be able to
well nope with the same set
Man how old r u
if ur having noise issues, it’s better u retrain with rmvpe
Oh my gosh again...
wait can you tell if a model has noise issues by listening?
started about ai like july 2023
Dayum... I d k what I was even doing when I was 14, probably playing some games in my android tablet T_T
well yea
oh yea i play games too, used to mod and hack mvs in 2022
(hack in modifying UE4 files to get free cosmetic, i dont ruin other experiences)
Man is a prodigy
this is my current training set...
did some basic mods, unfortunately idk about 3d models
man that's still very very advanced at that stage, maybe I'm surprised since I'm from 3rd world T_T
Can I ask doubt about what the sliders do in RVC
where are u from btw? I’m from italy
Sri Lanka
about the sliders?
yes what these even do
btw why shouldn't I place applio in D drive? My C is really cramped atm
you can place it anywhere
just dont place it into a folder with onedrive/spaces/funny local characters
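The folder advice above can be turned into a quick check before installing. A minimal sketch; the function name and the exact rules are illustrative, they just mirror the three things mentioned (OneDrive, spaces, non-English characters):

```python
def install_path_problems(path):
    """Return a list of reasons why `path` may cause trouble for an install."""
    problems = []
    if " " in path:
        problems.append("contains spaces")
    if "onedrive" in path.lower():
        problems.append("inside a OneDrive folder")
    if not path.isascii():
        problems.append("contains non-ASCII characters")
    return problems

print(install_path_problems(r"D:\Applio"))  # []
print(install_path_problems(r"C:\Users\Me\OneDrive\My Apps\Applio"))
```

An empty list means none of the three known trouble spots were found; it does not guarantee the install will work.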
https://docs.ai-hub.wtf/rvc/resources/inference-settings/ these might answer you more easily than doing a wall of text lol
Last update: Feb 25, 2024
thanks
oh thank you man
this makes no damn sense... unless Mangio does something stupid
the model takes audio at 16k and produces the sample rate it was trained with
any idea what should I download
v3.2.6
idk what those even mean so i'm just sliding them here and there
Average mangio moment 
I just sometime put all sliders to max and click convert
btw why tf did huggingface mark the pickle imports as sus
it marks all non-safetensors as sus
Man 4.21 gb T_T
I'm gonna take my time train 1000 epochs in Mangio then in Applio, then compare them with various songs T_T gonna take a week..
once again, 1000 epochs on <10 min file is crazy
but you can test the results every 10 epochs if you want
i mean are there any downsides other than more time ??
it will be overtrained af
and....???
robotvoices
Won't the electronic slider thingy won't do anything to reduce that effect?
no
if you want quality, get more audio
Oh, then I'll go and check out 100 200 300 400 500 600 700 then 1000 = 3800 epochs of same file but different, Like SPB_100, SPB_200, SPB_300 and what about checking them individually to find the point where voice turns robotic?
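Training each of those as a separate model adds up fast. If the trainer is set to save a checkpoint every N epochs (assumed here to be a setting you can choose), a single run gives you all of them. A quick comparison:

```python
def checkpoint_epochs(total_epochs, save_every):
    """Epochs at which a single run would write a checkpoint."""
    return list(range(save_every, total_epochs + 1, save_every))

# Separate models trained from the start, as proposed above.
separate_runs = [100, 200, 300, 400, 500, 600, 700, 1000]
print(sum(separate_runs))                  # 3800 epochs in total

# One 1000-epoch run that saves every 100 epochs.
print(len(checkpoint_epochs(1000, 100)))   # 10 checkpoints to compare
```

Listening to the saved checkpoints in order is then enough to find where the voice starts turning robotic.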
I searched yt for a long time to find this 7.30 minutes clip, the singer is old and he passed away, only some pure clips of his singing is there in yt
I'm not comfortable about feeding voice ripped from UVR to train as it has many artifacts...
What uvr model did u use? Bc most people here use uvr
yea the with bs roformer iirc
idk names man T_T, But the ripped is still bad for the songs
I can hear those small echoes apart from pure voice some times violin music too
echo can be removed using a de-echo or de-reverb model (used on the extracted vocals)
but that makes the song hard to voice swap, like voice cuts
and fluctuating volume of singer
mm weird
how do i get the model to run from the .json file?
im tryna get the juice wrld one to work
the only files u need to run are .pth and added.index for rvc
no json
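If a downloaded model folder has a pile of files in it, something like this can pick out the two that matter. A sketch with the standard library only; the folder name in the usage comment is hypothetical:

```python
from pathlib import Path

def find_model_files(folder):
    """Return (.pth file names, .index file names) found anywhere under `folder`."""
    root = Path(folder)
    weights = sorted(p.name for p in root.rglob("*.pth"))
    indexes = sorted(p.name for p in root.rglob("*.index"))
    return weights, indexes

# Hypothetical folder with an unzipped model download:
# weights, indexes = find_model_files("downloads/my_model")
```

Anything else in the folder, including .json files, can be ignored for inference.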
oh, sorry
what do i use to run the .pth?
are you following a yt tut btw?
no, i couldn't find a good one. i js got the model off the google sheet
its good bc those are old, and the google sheet is old too
don’t use those
You can search rvc ai voice models at:
- #1175430844685484042
- Send " @gusty kestrel search (name of the model)", without the ()
- Do /find with @earnest musk
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/
What’s ur pc gpu and what are u using btw?
its not the best, good only for inference (using models)
what are u using btw
like applio, mainline
bc u can run the program either locally (on ur pc) or using cloud (remote good pc) computing services like google colab
what are u using rn to make ai covers ?
ohhhhh
nothing, this is my first time trying. i mean fl studio if thats what u mean, mvsep to seperate the stems of the song im using
ohh lol
the program to use those speech to speech models is named rvc
one sec lemme tell u
This is for locally
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
ilaria rvc zero is on cloud, its a zerogpu (A100, way faster than ur gpu) huggingface space
so ilaria rvc zero will be faster than ur gpu and easier, but u got limits so can’t use it 24/7, u got a quota per account, while local will be kinda harder to install and slower but unlimited
so its kinda ur choice
your welcome
what difference does the index file have on the final inferred audio
the index file contains the accent of the voice
i see, thanks
yw
@low shard Can the uvr website release more drum models?
the x-minus website
i dunno about drum models
hey man I tested 100 epoch vs 1000 epoch
for 7.30 minute audio..
also can I ask about UVR vocal + instrument seperation here?
Your best bet for isolating vocals is BS Roformer.
If you ask.
Also, more epochs doesn't mean more quality.
I currently tried BS Roformer beta 4
also the epoch, the accent was heavy with 1000 epoch than with 100 and the major difference I noticed was the loudness
the spikes shown in audacity for the same song everything same settings except for the models one with 100 and other with 1000, the 1000 shown more loudness and bold pronounciations
with accent much closer to the model singer's voice
while 100 epoch was pretty low on loudness and some words slightly mispronounced and the accent wasn't much convincing as 1000 epoch
also are the ensemble with 1296+1297+23C INSTVocHQ better than MELBAND_reformer_big_beta_4?
Probably on 100 epochs it was overtrained.
You can just use BS Roformer (11.31 SDR version) on mvsep.
I use the local version
You can try mvsep too.
then is the loudness and accent getting much closer to the AI singer a bad thing? because lower I go on epoch it gets worse T_T
Ah i mispelled.
I mean, undertrained.
MVSEP has these queue too
then can I go above 1000? like 10000?
when can we actually see the limit?
Tho there are some models that aren't available on UVR.
You got a 7 min dataset, that would be overkill.
I'm not sure if you overtrained the model when reaching 1000 epochs.
Have you checked tensorboard?
Still no idea how to check it
is that some software to be downloaded?
so what's best on MVSEP for songs? I just need clean vocals
BS Roformer (11.31 SDR version)
okk
aight so the tensor board bat is doing it's cmd thing now
Well it's not opening the localhost address for me
where i take a link ? pls help
Of what, theres many things you can do
wait its around 3.96k... but this graph is weirder than the one in tutorial T_T
does anyone here have okada working with a very low delay / sample size? dm me
how do i extract background vocals with uvr5? The "main vocal" extraction thing on the best models list doesnt do anything. In fact the vocal file is distorted and really low quality while the actual okay sounding bit still including background vocal file is named instrumentals
You can try using Mel Karaoke or UVR BVE V2 on either MVSEP or UVR Online
Try using that one.
alr
are those exclusive to the websites or smth?
Yep, these are exclusive to these websites.
im doing everything locally so i dont have to deal with those queue's and stuff and now you just get cock blocked by a paywall again smh
what a pain
⠀
⠀
so i was doing a resume of a training and this appeared, does anyone know how to fix it?
You are trying to remove background noise from a normal song? Can you even hear "noise" when the beat is playing
If its vocals, BS Roformer removes background noise and instrumentals. You can use it on mvsep or uvr
Make an account which is for free to skip like 95% of the queues and get to yours asap
Hi. How do I upload my model to Huggingface? I'm using RVC Disconnected and I don't know how to use it. Sorry for the dumb question, but I'm new to AI Covers. How does this work?
Hello im new in this what can i use to test a model that i train in local ? i was trying to use \Mangio-RVC-v23.7.0 but im having an error when i open de ui
I notice my RVC models not being able to pronounce certain adlibs like grah and brrt is it impossible for rvc to do these sounds or do I need more adlibs in my dataset should I remove the adlibs from my dataset if rvc cant do them?
whats the rvc that u can use in ur browser link?
mangio is kinda outdated, what's ur pc gpu?
you can just upload it urself https://docs.ai-hub.wtf/essentials/voice-models/#uploading-to-hugging-face
Last update: Apr 01, 2024
every interface is in the browser lol
i was using an amd r5 430 but i think its no compatible cuz too old
i searched online and it seems to have 2gb vram.. Idek how even you are on your browser rn
☠️
You can't do it locally (on ur pc)
u need to use cloud (remote good pc) computing services
what are u lookinn for
for now im trying to do an AI cover
use ilaria rvc zero, zerogpu (A100) huggingface space
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
fastest free cloud way
any tips to make the vocal sound less like autotune guys? Im trying to make a song cover by using Ilaria RVC (yes i have already separated the vocal and the instrumental)
Index influence -> How much accent is applied: 0.9
Respiration median filtering: 3
Envelope ratio: 0.25
Consonant breath protection: 0.5
I've had my settings set so the ai voice plays back to myself so I can hear if it's working but now the voice all of the sudden plays like 4 seconds after I talk. It used to be instant
turn pitch down in the original audio
Nevermind i fixed it
ty, it kinda reduces it a bit
sometimes it may help to remove reverb/echo, but mainly the robot voice comes from several vocalists joining in
ah i see
if I train a model is it okay if I just have one wav file or do I cut it into 5 second clips
preprocess does the cutting
if youre using applio it also does some basic filtering/normalization
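To give an idea of what "cutting" means, here is a simplified sketch that splits one PCM .wav into fixed-length chunks with Python's built-in `wave` module. This is not the actual preprocess code, which is more involved; the function name, file naming and the 5-second default are just for illustration:

```python
import wave
from pathlib import Path

def split_wav(src_path, out_dir, chunk_seconds=5.0):
    """Split a PCM .wav file into consecutive chunks of at most chunk_seconds each."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    with wave.open(str(src_path), "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(src.getframerate() * chunk_seconds)
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break
            chunk_path = out / f"chunk_{index:04d}.wav"
            with wave.open(str(chunk_path), "wb") as dst:
                dst.setparams(params)    # same channels, sample width and rate
                dst.writeframes(frames)  # frame count in the header is fixed on close
            written.append(chunk_path)
            index += 1
    return written
```

Because preprocess already does its own slicing, a single clean .wav is fine as input; there is no need to pre-cut it like this by hand.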
why when I press start
I don't even sound like the model I'm using
if not, the model doesn't even works
I tried using other models
still not sounding like the model I'm using
what's the best pre trains for an english female voice
extra
I'm using applio atm but there's a storm if my pc goes out while it's still not finished do i restart from the beginning
hi
I got this what do
When you finish your Free Daily GPU on Google Colab, you can:
- Use an alt google account
- Use kaggle which gives more gpu time but its harder
- Wait until tomorrow
- Pay for colab pro
just use kaggle
what’s ur pc gpu btw?
4070
should I just do it locally
how do I save my progress tho it timed out at epoch 150
is there anyway to save this
yea ofc
why do it cloud when u got a good enough pc to do it locally
most of my tries are pretty bad was wondering if it would be better on a different gpu
i think u can’t if the gpu time finished, unless u got the G & D files in the google drive, in that case u could use those as “pretrain” to continue the training
ah I guess I have to start over
that depends on ur dataset and if ur using the tensorboard
what does using a tensorboard does it tell you when to stop training or what epoch is the best
that's what I remembered at least maybe I'm wrong
it tells u which epoch to use
like helps u to not under/over train
the reason why you should use your 4070 locally rather than that
do you reccomend I cut my audio into 5 second clips then if I train them locally
or does not matter
I'll just give it another go tomorrow I suppose
any advice how to make a good data set
use this method rather than cutting inbetween several sentences which compromises quality with popping artifacts
https://rentry.co/RVC-dataset-RX11#noise-gating-and-audio-labeling
My models sound great for short durations but feel incredibly monotonous after 1 minute because it doesn’t fully clone over expression and emotion of source sound. Anyone know if this is a limitation of RVC or is this just that I need more expressive training data?
I am trying to train my own AI model, but the tool Im using got too old to do its work. Is there tool I can use to make my own AI voice model?
I'm guessing you want a speech to speech ai model, so RVC
What's your PC GPU?
as I was making, I got
WARNING: Ignoring version 2.0.6 of omegaconf since it has invalid metadata:
over and over, and I cant find way to figure this out
#1159290752195633273 or else ur msg gets deleted
Oh yeah nvm, that's not good enough to train (make models) locally (on ur PC) you can only inference (use models)
Does it work after the warning?
As you dont got a good PC, its better you use cloud (remote good pc) for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
It should be just a warning and u should be all good, but I'd suggest u use kaggle mainline instead
Im using google colabs for AI, and does it mean that there will be no problem making the voice model?
Do i just have to run anyway?
my bad, seems like this code doesnt work anymore.
I see github sites exposed already inside.
Is anyone else having problems with the Applio Google Colab (no UI) and Pytorch? I can't seem to make it work...
It would be better u use kaggle
Google colab is easier but gives only 4 hours daily of gpu
Around 4 hours, it's not even guaranteed and u could get only an hour
While Kaggle is a bit harder but gives u 30 hours weekly guaranteed
@nocturne mural seems like people getting errors in the applio no UI colab btw
Already 2nd report
it likely installs torch 2.5
should be fixed with a new clone
Vidal did a PR last night
.

im just looking to understand some basic terminology. is this something i could use to make an AI that will mimic a character's voice based on voice clips?
if so, how do i get started? and if not, what would that be called generally?
It can convert any voice to your favorite character's voice
You have to use RVC for it. If you have a decent GPU use it locally, if not use cloud services such as kaggle and colab
hmmm i see, thank you... i assume its compatible with text to speech in some way?
You can use APPLIO fork for TTS
alright then! so to get started i would need to install an RVC program and find or create the voice model i want to use to go with it...?
im very curious about the actual process of creating a voice model and what that involves
im also curious what an epoch is
Epoch is a training cycle
If you read a book 1 time, that's 1 epoch.. the more times you read it, the more information you get from it. Similarly, if the AI algorithm studies your dataset 1 time, it's 1 epoch.
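In numbers: one epoch is one full pass over the dataset, so the amount of work scales with both dataset size and epoch count. A tiny sketch with made-up values (200 clips, batch size 8, 300 epochs are illustrative only):

```python
import math

def training_steps(num_samples, batch_size, epochs):
    """One epoch = one full pass over the dataset; return total training steps."""
    steps_per_epoch = math.ceil(num_samples / batch_size)
    return steps_per_epoch * epochs

print(training_steps(num_samples=200, batch_size=8, epochs=300))  # 7500
```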
-rvc
i see. im familiar with the principles of ai training but i guess not all the terminology
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
- Italian Guides by Ilaria
All you need @gilded mortar
So here is a guide for you to do all this stuff. Read it.
thank you very much!
i need help with rvc , i keep getting an error can someone help me?
Yw ; )
Show me the error
do i copy paste what came up in cmd
?
guys when i start the programm it has all the time whitescreen
Yes. Or level up yourself to send pictures
what shall i do
Traceback (most recent call last):
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict
    output = await app.blocks.process_api(
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\blocks.py", line 1007, in process_api
    data = self.postprocess_data(fn_index, result["prediction"], state)
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\blocks.py", line 953, in postprocess_data
    prediction_value = block.postprocess(prediction_value)
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\components.py", line 2076, in postprocess
    processing_utils.audio_to_file(sample_rate, data, file.name)
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\processing_utils.py", line 206, in audio_to_file
    data = convert_to_16_bit_wav(data)
  File "C:\Users\Daanish\Desktop\RVC1006Nvidia\runtime\lib\site-packages\gradio\processing_utils.py", line 219, in convert_to_16_bit_wav
    if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'
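Reading the traceback bottom-up: the UI tried to write the converted audio to a file, but the audio data it was handed was `None`, so the conversion itself most likely failed earlier and the real cause should be further up in the console. A minimal sketch of the situation, assuming the result is a `(sample_rate, data)` pair as the `audio_to_file(sample_rate, data, ...)` call suggests; the helper name is hypothetical:

```python
def check_inference_result(result):
    """Validate a (sample_rate, audio_data) pair before handing it to the UI."""
    sample_rate, data = result
    if data is None:
        # This is the case that ends in "'NoneType' object has no attribute 'dtype'".
        raise ValueError("inference returned no audio; check the model, input file and console log")
    return sample_rate, data

# The same AttributeError, reproduced in isolation:
try:
    None.dtype
except AttributeError as err:
    print(err)  # 'NoneType' object has no attribute 'dtype'
```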
Which program ?
w-okada
w-okada
Which version are you using? And which GPU do you have
windows 17b