#✨│ai-help
1 messages · Page 160 of 1
In modded version, it will use fp16 only when possible. It's disabled for GPUs that are known to be not capable/not performant with it
Hi pls give me the Google Collab link pls
And force fp32 option is mostly in case if somehow gpu didn't get into filter or there are actual problems
Ow i really thought it would make a quality difference xD thank goodness i asked this
Hello! I have a question
Chill and behave.
You have a functioning brain and a search bar
Sorry.
Ayo? @stone gust level 1 !!! 
time to learn how to use discord's ui
Some people need to learn respect and when to wait if an active case's being solved
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
I believe u can increase this by paying tho not sure
Which is the link there are so many
The one i sent u is the correct one
Okay
How do you figure out the sample rate of a .pth file so you know what other .pth files you can merge it with?
Do an infer on a clean 48khz audio, load up the inferenced audio ( model's output ) in any program that let's you see a spectrogram
and the rest should be simple. Look at the frequency response range
Or do an infer and send me the output here
Got it, thanks!
Np
In case you can't find anything good, here's a clean 44.1 ( might not be ideal but can still work )
SynthV's AI output, if anyone asked.
( Yes, it is a good performance tester for female models - which I use quite often )
click edit on wokada and look for your model
the number after f0 is your sample rate
They might not use w-okada tho
it should say either 32000, 40000 or 48000
They wanna merge the pth afteral ( ckpt fusion ) that's in rvc
Yeah I use RVC, but I can pick up w-okada if it lets me read the sample rate real quick
Ayo? @frigid stone level 1 !!! 
ow man i got confused again lmao i thought this was the wokada channel
Thanks, bot
yea just inference and u can see the sample rate aswell
^ easier approach than dl'ing gigs
tho, reality is
if you can hear it not muddy / muffly
it is 48
if it's somewhat fine but lacks that " harsh ish " fidelity and clarity, especially in sibilants, it's 40khz
if it's muddy or foggy, you get the idea, it's 32khz
But then, some 40khz sets or so are used in 48khz models hence yea, better send the output here and I can help with that
or just use spek (this is a inference result)
Doesn't let you see the upsampling's effects tho
there are some subtle granual nuances here n there, better off If I check it in rx
how good is to merge two different voices anyways? we know merging epochs is fine but what about for example
1 voice g/total 2 voice g/total
not from the same model ofc
I'd say it depends
how compatible are the voices ( timbre wise )
and what kind of ranges they learned
if one sucks at high, other doesn't
crappy abomination can occur 👀
I also wouldn't advise mixing low pitch voices with high ( say, mommy voice with high pitched anime girl voice )
Butttt, ye, experiment if you want

ikr
So I checked the sample rate of the voices given by default with RVC, kikiv1 and guanguanV1, both say they're 32k but they won't merge in Applio, I get a TypeError "cannot unpack non-iterable KeyError object"
Because applio cant run mainline models iirc
Or at least time i tried it didnt worked xD
Can't even merge them because of that? Is there a better tool for merging? I basically just chose Applio because it was the first program that had merging
Mainline
Mainline RVC can merge?
Yep
Not sure about " applio can't run mainline models " as all models are the same, regardless of the fork
Wasnt the wokada models ones because theyre onnx?
I'd be more opting towards 32k isn't supported for merging
There's no ui option for that nor within the code
What tab in the web-ui do I click into for that?
I skimmed over the web-ui for RVC a little while ago but never found one which is why I had Applio
I wonder why last time i tried loading one of my mainline models and the results sounded like sans speaking sample
e e e
cause applio itself was most likely broken
will be called model utils
There @frigid stone
Mainline can do 32k merging?
nope
ignore the left side tho
F
Buttttt, from what I know
you can just select 40k
for 32 merges
YET I cannot promise any stability
might work fine on the outside, might be a mess inside
and some inference might come out badly, or may not
Im gonna try applio again with mainline models tho, if it doesnt work again then… yea
I have created life, thank you very much for the assistance
Yay
Time to dump Applio because it's not working for the one thing I downloaded it for
Mainline is the goat
👀
I didn't read hard enough to find the merge, but now that I have, it definitely is
RVC Mainline, if youre using the prepackage RVC1006 will not work fine with merges btw in case youre using that
if you want a good enhanced mainline, go for my fork
I'm using the one literally called "Latest" because the one in the releases doesn't work with ASIO
Please tell me thats fixed in your fork
guys ı need to train voices but that ı used for training is on google colab and it has limits, ı need free and unlimited one can anyone help with that ?
Ye thats mine thatll work
The merging part
https://github.com/codename0og/codename-rvc-fork
It does work
Oof nice
Merging is a part of my training routine, always
32k should work fine too, just select 40k or 48k, test stuff around
I asked that because he said this
but tbf, I do not absolutely see any reason to recommend mainline over my fork
it should be either that or applio
The Latest one I use works fine with merges as far as I can tell
Ayo? @frigid stone level 2 !!! 
Thanks, bot
as Mine's literally mainline but enhanced ( including model, index and audio picking
I tested it with the 1 merge that worked from Applio
( +, recently finished moving to fumiama's base - just not updated on github yet
Im waiting for the update 
Tbf, it's nothing of a game changer if you want me to be honest
Ayo? @glacial pollen level 19 !!! 
it's more so now the fork will be compatible with potential big changes fumiama could introduce
So applio models works in mainline? Ah thanks, so it seems my applio installation just bugged last time i tried something similar lol
( more or less. Hash is a nono and won't incorporate it
i was away :p
All good
Yeah, I made a merge by brute forcing whatever models actually merged, and then it worked in my RVC just fine
just one thing I can say for sure what breaks most if not all merged models is
feature extraction type
i.e. Mangio x rmvpe models, might not always sound fine or work fine
Time to go through all my models and figure out what khz they are
and, that's 200% confirmed, different precision mode trained models
are not liking each others

for instance, fusing bfloat16 with fp32 resulted in a tragedy
but I wouldn't worry. Majority of models you can find are fp16 anyways
fp32 are super rare
Welp hes safe because everyone trains in fp16
I'd even go as far as saying at least 90% or 95% of models are fp16
guys ı need to train voices but that ı used for training is on google colab and it has limits, ı need free and unlimited one can anyone help with that ?
can anyone help me? my voice changer seems to not be working. ive correctly got everything (python and etc.) and got all the inputs outputs correctly but when i press play on a diff app (yes i have virtual audio cable) it just does not change the voice
Ayo? @rapid knot level 1 !!! 
Screenshot your voice changer app
Is your passthru always green when you start voice changer? does it ever become red
its always red
done
where can i do that?
oh mb
it still does not work like i dont hear a thing on the discord test one and stuff ill go check again if i have put the wrong ones
Did you select cable as your input on discord
Yup and it doesnt recognize the voice
Uninstall vb cable beacuse its bad and download the better one
EOL - No further Updates
Github - Blanc-dot
Discord - Blanc_dot
Despite being end of life, most if not all information has not really changed, so should be very accurate until actual new stuff comes out.
Other Links
Antasma's Local Error Fixes
Antasma's Colab guide
Sushi's useful Links - You need...
VAC
alr
red means it picks the unprocessed voice
i see
i got it all correct but its still the same, not picking up my voice and not replaying it
zero gpu has a limit
the volume also doesnt change even when i start it
So the A100 is for premium user only?
Ayo? @whole rain level 1 !!! 
yes?
you mean rvc?
i belive it has a drake pic
this is way too old sorry
check https://aihub.wtf/docs for help
what should i use then for voice clone? with samples
check the docs
Have the audio file of your song ready, & let's extract the vocals from it with an audio isolation software.
do you want to clone someones voice or use a cloned voice?
clone someones voice
either follow this:https://docs.aihub.wtf/essentials/how-to-make-voice-models/
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
or pay people to make it (e.g. me and others)
there are more affordable gpu rental alternatives
truly peak of ai hub. racism and slurs everywhere
which one?
already reported to admins.
nvm
they removed timeouts for helpers
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
my voice doesnt get morph to the model, it just sound the same
i didnt download python or pytorch, is this what may be causing the issue?
Ayo? @cunning juniper level 1 !!! 
i think we have the same issue with this one
Is passthru on?
It should be green
Send a screenshot of your settings
its red
Make it green
how do you do that
Change it to a RVC model
Click it
came someone plssss help me
Don’t ask to ask, just ask
wheres the rvc model
Anyone that isn’t the Beatrice one
its been so weird every time i try to use it sounds like im talking to a spirit box
What’s your GPU?
Adjust tune
i talk to it and it says nothing back
been using the RVC - GUI and have been using models on this server but it seems like most of them sound distorted but when I hear the samples it seems fine
what could I be doing wrong 🤔
Did you click start?
RVC-GUI is outdated
can someone hop on a call with me?
Ayo? @distant thicket level 1 !!! 
yeah the ui is also super laggy
What should I use now then ?
Do you have the audio cable set up?
i do but how do i use it?
-local I personally use original/mianline
Suggestions for @orchid glade
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
Credits to Faze Masta and Antasma for compiling these links.
Set output on Okada to the cable and input on discord to the cable
i have not even tierd it on discord its not even working on its own
Send a screenshot of your settings
Also what’s your GPU
1660
rip
Should work decent
oh.
-realtime
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
so i need the other one
Read the first one
Should have the NVidia download
Ayo? @distant thicket level 2 !!! 
why does my ai model not have an index!
????
I tried to make a salad fingers ai cover and it dont have an index!
i mean i have python and pytorch but no idea why it doesnt work
Ayo? @rapid knot level 2 !!! 
still not working until now?
nope, im trying to reinstall rn
what do i set moniter to?
can someone help me?!
hey everybody, just getting into this AI voice stuff. Can somebody point me to a guide of how to use a voicemodel that i get from this server?
I have heard that there may be different methods for different models, here is the one I am attempting to use: #1175498017697181858 message
I just need the elevenlabs adam voice, but they removed it from the site. If this voice model I linked here is bad for whatever reason, feel free to point me to any others you happen to be aware of
You dont need an index
im talking and nothing is happending
hello how do you uninstall it?
really?o
delete all the files?
just delete it?
Voice model goes into assets -> weights folder
Index goes into logs folder
i mean its what i did, still reinstaling tho ill keep u up with the results and if doesnt work lets both try to get help
there isnt an index in the model folder i download
Not the end of the world
Index is accent of model , not needed
sure thanks
For making ai covers ("Inference") check this out
#✨│ai-help message
Thanks i did it
You don't need to install pytorch and even python
Show screenshot with settings and command line that opens when you run start_http
could u check dm's? thanks
Ayo? @cunning juniper level 2 !!! 
Set chunk to 192 and extra to 8192
ideally id be looking for TTS with that voice model, unless there is something I dont understand
or is the point to use one of these TTS models on the site and then convert to the desired voice via inference?
how do i turn the sensitive down
its picking up background noise
Enable sup2 for noise suppresion
this seems to be the answer. Result is not necessarily great, but thats probably down to the model and some fine tuning. Thanks for the help 👍
Ayo? @undone vault level 1 !!! 
where can i download rvc V2?
sorry for the late response
Which model did you load?
just a predownloaded one (if thats what your asking)
prob because i redownloaded it ill tell u if it works after or not
Now its this and still doesnt seem to be working
yup
any resources for making the model be a little more excited/yelling? or would that need to be a modification to the model itself?
uh where exactly is the cuda toolkit?
and just now i uploaded a voice model but it seems to give this when i try to switch to it
The problem might be that you already have cuda toolkit installed, but it's incompatible
i see, so can i just not use new models?
You can try editing start_http.bat and add the following line in the beginning
set CUDA_PATH=
where's start_http.bat? I cant seem to find it on the MMVC something folder
Ayo? @rapid knot level 3 !!! 
How did you start the voice changer?
by start_http ?
Well, that's the file, yeah
Right-click it > Edit
alr did that ty
yup it was on notepad
well how do i get the voicechanger to work now?
OH NVM IT DOES
Help pls
C:\Program Files (x86)\MMVCServerSIO>MMVCServerSIO.exe -p 18888 --https false --content_vec_500 pretrain/checkpoint_best_legacy_500.pt --content_vec_500_onnx pretrain/content_vec_500.onnx --content_vec_500_onnx_on true --hubert_base pretrain/hubert_base.pt --hubert_base_jp pretrain/rinna_hubert_base_jp.pt --hubert_soft pretrain/hubert/hubert-soft-0d54a1f4.pt --nsf_hifigan pretrain/nsf_hifigan/model --crepe_onnx_full pretrain/crepe_onnx_full.onnx --crepe_onnx_tiny pretrain/crepe_onnx_tiny.onnx --rmvpe pretrain/rmvpe.pt --model_dir model_dir --samples samples.json
Traceback (most recent call last):
File "MMVCServerSIO.py", line 33, in <module>
File "mods\log_control.py", line 83, in initialize
File "logging_init_.py", line 1169, in init
File "logging_init_.py", line 1201, in _open
PermissionError: [Errno 13] Permission denied: 'C:\Program Files (x86)\MMVCServerSIO\vcclient.log'
[10304] Failed to execute script 'MMVCServerSIO' due to unhandled exception!
hello, i have a question and a little help about the rvc how will i run it again after closing it?
is it the same step when i opened it on the file i downloaded?
what are you using
why this error?
all colab versions give me problems, errors, why?, locally the problems are different, but colab shouldn't cause problems, I've always used it
Problem seems to be voice model
What did you put in for voice model
What path or link
Thats an odd obvious question or am I misunderstanding
You open the file the way you started it the first time yes
You answered my question, thank you
I cloned the voice as usual and put it on huggingface
Lots of things that could be the case
- Disable antivirus program
- If you did indeed have an antivirus program running, then delete the MMVCServerSIO folder again (you DONT have to delete the zip file with the bunch of numbers behind it)
And then extract/unzip it again with the antivirus disabled - What gpu do you have
Whats the URL path you put in
can you download minecraft parkour anywhere for free?
What does that have to do with AI
This seems strange to me since I always do this, thanks for the reply
remove the ?download=true from the url when you insert it
ok I'll try, I don't use colab much, I used local more, but it gives me a strange error
on mangio rvc, locally it gives me this error, I use an AMD
Id recommend you get mainline instead
there you run go-web-dml.bat
I unfortunately dont recognize the error on your mangio local either
ok I'll try, thanks
for creating AI models, would you include recordings from different microphones in the dataset or not? would it make rvc perform better or worse?
since people sound different with different microphones
with perform i mean would it sound more or less realistic
For realtime purpose, you should definitely only have ONE microphone used
For RVC, at least for me, it has not made much of a difference assuming the person doesnt suddenly sound completely different. Some difference is okay
thanks!
with RVC i mean inferencing/ai covers
yeah i know
Posting it here in case someone uses 1.0.0 of my rvc rtvc as I've pushed the update:
https://github.com/codename0og/rvc-realtime-voice-changer/releases/tag/v1.1.0
And here's instructions on what's what etc for new users.:
https://github.com/codename0og/rvc-realtime-voice-changer
Guys which one is more important, dataset or epochs? If I wanna make my model better, should i add more audio data or increase epoch number?
Both
Epochs and so, training duration
batch size
dataset
Pretty much everything is equally as important, there's no "priority"
I'm using replay ai and i don't think it has those options like batch size
I didn't see em anyway

Then that's on you for using such things
What should I use then? I don't have a gpu so that's a bit of an issue
Well, there are other alternatives
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
-docs
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
Have a read
There's some zero spaces or such, maybe ilaria?
Not sure 100% as I don't track these
Alright I appreciate it
but if you search through this discord
you'll find lots of alternatives / solutions
And by the way do they have the ability to pause the training?
I wanna train models for a long time but I don't wanna have my laptop running for a week straight💀
and appropriate files have to be kept
hold on.. Imma check if I made a guide for it in the past
Alright
oh, seems like not.. welp.
uhhhh
tl;dr
- you can stop the jupyter cell or whatever you use that keeps the rvc running as long you don't see " saving optimizer state at xxxx "
n such
or any info displaying savings of generator ( G ) or discriminator ( D )
in a console or such
other than that, you need:
- your model's folder
( ur model's folder from log folder )
it has to contain everything
- most recent .pth for your model from training
There
Man I really gatta step up colab knowledge I didn't understand a single word you said😭
Whenever you train, after each trained epoch there's saving taking place. Easiest way you can spot it is by noticing " saving optimizer state at epoch xxxx " or something along the line
What's the difference between "E3" and "Ov2"? What specifically do they mean? Sorry if asked a lot, couldn't find anything about it
you can't pause during that moment because the generator or discriminator ( which have the optimizer states saved at the time ) will break
Next thing would be, each trained or actively trained model has it's folder
completely unrelated to the convo;
does installing the requeriments for the voicechanger messes up things in your fork? im a bit paranoid fsfs
models' folders are located in " log " folder
Ohh so it automatically saves itself after every epoch?
If that's the case that's nice
each epoch = a produced model
Right
as long you have saving frequency set to 1
now, continuing
Whenever you wanna resume the training, your model's folder has to be present in " log " folder
- the most recent epoch / .pth model / weight mode ( most recent one you had when you stopped the training )
those are located in: rvc folder / assets / weights folder / here
And visually it'd be:
Ah got it so I can continue the training from the last saved epoch
I'm barely keepin up with this
Right
this is my model's folder located in logs folder
as you can see from the address bar
such have to be present
( if you like, switch colabs and so on or whatever )
and aside that, most recent .pth model
imagine this is my model's 192th epoch
so MyModel_e192_Sxxxx.pth
more or less
Wait this is a bit unrelated but I saw somewhere tht colab stopped giving access to rvc for free? I thought there was like a time limit for the training
you'd have to have that .pth in:
rvc folder / assets / weights / here
and model's folder in:
rvc folder / logs / here
then when you resume the training, you only input the model's name again ( as in, " experiment name " or however the given rvc is localized ) and hit train model
all the mid-steps such as feature extraction etc. are only done once
For webUIs yes
But there are bypasses
You gotta ask others really as I stopped using colab very very long time ago
Some of the colabs don’t use webuis or have a bypass
Nope
it's cross-compatible
in fact
you should be fine with just my requirements for rvc
lemme confirm it tho
hey guys i have questions, im very new to ai modeling how do you do this?
im a complete noob
What are you trying to do?
Make a cover or model
Finally someone on my level of understanding
A cover
Oh no wait sry didn't mean to insult u
K, do you want to use a cloud/online method or using a local instance?
What is the difference between them?
Yup, you'll be fine
omg it also fixed the weird flexasio bug i had
flexasio decided to stop working in wokada and mainline's realtime, but works perfectly fine in your fork, thank uuu 😭
sorry im completley lost
voice changer's list is actually a part of rvc's list
The local one will be a program installed onto your computer and it depends on your GPU
yay
U can either do it online on a website or a program on ur pc
So ye, as long you have all that was needed for prev version, you're fine
no need to install extras
Online/cloud are via websites
Ohhh, whichever one is easier
But they usually have some restriction of some sort
maybe website?
-spaces Ilaria RVC
Suggestions for @mild dock
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
yup i had everything installed before, thank goodness
Try using Ilaria RVC it’s easy to use and fast
To put your model in go to the upload/download section and put in the models pth and index
Or the models download link
Then you upload the audio (clean vocals no music) you want to convert, select your model and convert!
Awesome, have fun man ~
Okay cool i have a model downloaded right now, do i need to convert it to anything? or can i just drag it into the upload sections right away
Drag into the upload section
Make sure you unzipped it since in there are the .pth and .index files
Ayo? @mild dock level 1 !!! 
do they usually take a while to upload?
Depends but shouldn’t take long
Might depend on your internet
ohh okay yeah because i just dragged both in like 2 min ago and the pth one just got uplaoded
waiting for the index to competley uplaod both
okay i just uploaded the model
whats left to do?
Go to infernence
Click refresh and click your model
Then upload your clean, without music, audio and click convert
oh cool is that it?
Yeah it should take less then a minute
If you want to adjust accent in can under index ratio you can in advanced settings
ohh okay cool! thank you guys i apreciate it
guess im stuck ... for some unknown reason i can not train anymore with applio - it worked before (4 hrs ago).
applio moment
i ended up reinstalling the whole thing - it just refuses to do the last step - the training
Why not use mainline
cuz i heard it was no longer maintained.
i did so in the past - it was also compared to applio a struggle to setup.
Unzipping and opening a bat file?
welp can try again - maybe in time things changed.
actually i already got it installed
🤦♀️
Applio has ZERO real extra features over mainline
Or at least to my testing
go-web.bat letsgooo
log?
aside, yeah as tim said
there's no real reasons to use applio over mainline aside some utils maybe or gimmicks
one could argue about code optimizations or whatever but it's quite marginal
well lets see if this thing is magically doing it for me 🙂
you'd actually gain more using my fork as it's mainline with my enhancements:
if I was to be honest with you
oh it cannot use a pre-train ?
I did move from mainline to fumiama's base ( updated it ) but it's not pushed yet
however, what is available, still works so
Nah, it can use pretrain
you just gotta replace the G / D paths
...
ok where is that located (i feel another facepalm moment lol ... is it this easy XD )
... my brain is fluid / cooked atm - ok tnx
np
🤦♀️ 🤦♀️
ps, in case you intended to use my fork, clone the repo - ignore the release
I recommend kaggle unless you have a GPU
I wrote up a quick one here but just copied over a few info from the mainline colab guide https://rentry.co/RVC-Mainline-Kaggle
It works with Cpu right
Ayo? @plain fulcrum level 4 !!! 
12 hours per day which I should have switched to long ago
. I got too comfortable with rvc disconnected
Oh so it runs without the need to have a gpu right
Cus I only got integrated graphics
yup, you just need to verify your phone number and turn on the internet on kaggle
Yeah imma check that one out tmr I appreciate it
📚 All-In-One English documentation
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
You can replace the files
abcdef
Ayo? @oblique gust level 1 !!! 
you
i need an AI voice conversion tool where i can drop my audio file and then it will convert it
not live audio conversion
please help, i've been fucking around with this for the past couple of HOURS and i cant figure this out https://i.imgur.com/MqnZPPo.png
i dont know what to do
like i genuinely have 0 clue what to do and how to fix this and it's so infuriating
can anyone genuinely help i swear to GOD himself i'm about to strike down my entire setup and destroy everything in it
this might be the most annoyed i've ever been by a single program on earth
and i've had issues like this before and the fixes were so goddamn easy and for some reason i COMPLETELY forgot all of my python and rvc knowledge
the brainwiping martians wiped my fucking memory
i need an AI voice conversion tool where i can drop my audio file and then it will convert it
not live audio conversion
why is it laggy on v auido cable?
no mine is messed up
oh
Question guys
would it make the audio better if i re enter the exported file again into the output?
or is that going to distort it
,i need an AI voice conversion tool where i can drop my audio file and then it will convert it
not live audio conversion
does anyone have one
Hello here ! Sorry to ask. Is there a GUI that works ? Ilaria doesn't anymore for me, and Easy GUI is down for a while...
RVC
someone help me how to upload models on 'voice-models' and how to ask permission on moderators?
#1159514067187277865 /submit
It's an alternative for 'voice-models' to upload models there or anyone there can help to upload models on 'voice-models'?
#1159514067187277865 /submit
You have to submit one of your models and get approved to post in there
Oh okay thanks
Incase if you're in weights's server too and if you used weights can you help me to upload models on 'model-sharing' channel from weights's server too?
I’m not in weights server
Okay thank you
But unfortunately I can't remember the epochs of my models because I trained it from weights not from Google Colabs since weights doesn't show the epochs for trained models there so probably I can't upload/share my models there unless I type the epochs of the models ✌️
Nah abandon the attempt now
I'll never create a very good ai voice model for fnf
Fuck my life
someone please answer this, holy shit. i need an AI voice conversion tool where i can drop my audio file and then it will convert it
not live audio conversion
Table Of Contents Introduction Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Settings (Inference) Ilaria TTS Introduction Ilaria RVC Zero, is an RVC (Retrieval-based Voice Conversion) Fork made by Ilaria & mikus, running only on Hugging Face Spaces, it’s called this way...
It's the best one 
Applio can be good too
or RVC Mainline but both are very good, samething about Ilaria RVC
Guys, I'm looking for a AI that can make a cover of Look At Me my XXXTentacion with the voice and musical style of George Straight. What discord channel or server is capable of doing this?
when exporting an audio from a flp (fl studio project), are these the best options if i want the highest quality and fidelity possible of the audio?
the audio/sample im exporting has a samplerate of 44100Hz and has a bit depth of 16
i know there is problems when exporting an audio/sample with certain specifications if they have a certain samplerate and bitdepth, i just dont know what are the best ones for this specific audio/sample
ideally 32-bit float for mixing/mastering materials, but as a final product to publish, flac 16-bit
Is there a way to make another model for another artist that’s not on here if that made sense
That’s called training a model
-docs
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
Yeah it’s just complicated
Then learn so it’s not
We have guides that show you step by step
Though if you want to be lazy we have a #1159289738314919936 channel but no one really takes requests
To much
Or alternatively you can check if the models exist on weights gg
Then don’t do it
😭
Guys I had a question, is there like a ratio for the length of the dataset and epoch number?
Like yesterday I trained something as a test with 1 min of data and 50 epochs, so if today I wanna increase the data to 10 mins, how much should I increase my epochs by
Sorry i misunderstood ignore my previous messages.
I would train at about 150-250 epochs for 10 minutes data if you use the default Pretrain
MY RVC on google collab is not working
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
How long does 1 epoch take for u guys on average?
For me it took around 1 hour and a half for 50 epochs
Yeah imma try 150 epochs today when I get my 10 min dataset
So how do people handle harmony? the AI has trouble picking a pitch and jumping around in background vocal tracks
any fix or remediation?
Do the epochs speed up during the training?
nope
its mostly around the same time as the first epoch
помогите установить
Ayo? @sand glade level 1 !!! 
Bro fr made a guide
Also putting like 500 epochs and using tensor board is good
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
Hello so, question. my RES keeps increasing but only for RVC models. I'm using Voice Changer and I just got into it so I'm messing with the models that are already in the program. I've tried the Beatrice model and it works fine but for RVC models the RES keeps rapidly increasing.
how do I make it stop constantly increasing
I've tried lowering the chunks and extras and I'm currently at 4096 for extra which is the lowest and 4 chunks
You should increase chunks, not lower them
oh thanks!
Hey if i don't use pitch guidance when training a rap model, what decides pitch? It sounds good, but it's interesting because it does seem to be changing pitch (its not a single note), but it's not following the input audio
Try adjusting the pitch
Yeah but it will sound less gay if you adjust the pitch
Be straight then
😭
I see. so i can export this 44100hz samplerate and 16 bitdepth sample with the settings that i showed and it will export perfectly as i hear it in the flp? (im afraid to export it with wrong configurations and mess with the frequency (make it higher pitched) or make it not the highest quality and fidelity possible for it), do i need to lower a certain option (like the bit float one) to export it with no artifacts or problems at all?
hi
what is the version rvc ilaria is using?
huggingface
the version number
or the gradio version number
Ayo? @scarlet wedge level 4 !!! 
hey im getting this error in google colab
[Errno 2] No such file or directory: '/content/voice-changer/server'
/content
ModuleNotFoundError Traceback (most recent call last)
<ipython-input-2-cc642795cb8e> in <cell line: 24>()
22 get_ipython().run_line_magic('cd', '/content/voice-changer/server')
23
---> 24 from pyngrok import conf, ngrok
25 MyConfig = conf.PyngrokConfig()
26 MyConfig.auth_token = Token
ModuleNotFoundError: No module named 'pyngrok'
NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.
To view examples of installing some common dependencies, click the
"Open Examples" button below.
which colab are u using?
Is there a new google colab link? If so, could anyone send me it? Thanks
for realtime voice changer or rvc?
Rvc
nope, every colab is updated now
when a colab gets updated/moved to other link will be announced on this channel: #📰│dev-updates
-colab
Suggestions for @unreal lotus
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
guys how can ı run .sh files ? ı tried bash but it didn't worked
Is there a tutorial for making a model from start too finish?
yep, check https://docs.aihub.wtf/essentials/how-to-make-voice-models/ for rvc models
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
applio colab isn't making a public url
use the 'share_tunnel' option and re run the start applio cell
okay i couldn't figure out what to do with that at first and thought it also wasn't working but now i know how to use it
Alr seems like it works fine, if u get other errors u can ask here
Where did you get this file from?
That is a model file
yes
You need to use a program like RVC or w-Okada to use it
any tut?
-realtime
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
First one
/guides realtime
Should have settings, downloads, tips, and basically everything else
Ignore the rickroll
YouTube tutorials often are outdated and spread misinformation
Like I’ve seen YouTubers say you need to download python/pytorch, which you don’t need to since it’s bundled with it
um
2nd link is for real-time RVC not w-okada
Which you can use instead but w-okada has better ui and is easier to use
i just want a realistic girl voice in realtime for fivem 
Use this one
It’s really just downloading, unzipping, running a file and selecting the model and settings
Which you already have the model
okay
idk where to click on the guide xd
some github link some nvdia cuda
idk xd
Here’s the direct link
Ayo? @fierce hinge level 2 !!! 
Set input to your microphone
Set output to the audio cable (see guide)
Set monitor to your speakers/headphones
Select your GPU as the GPU
audio cable?
In order to use the voice changer in other apps you need to get a audio cable
vac lite is the best free one that most people use
No it’s a program
Run the setup64
that one would also work
kk
People have reported it being kinda buggy tho, so if your experiencing issues with audio in other apps with it use the one I linked
yes
then when i do that
how can i heard myself
in real time?
to see the result
Select a model then click start
i have to change some setting to get the voice realistic?
To add a model click edit, then click upload, then upload the .pth and .index that were in the models zip file
Tune
Also make sure to use RMVPE
whats that?
A setting under f0 det:
@fierce hinge also make sure to use the chunk and extra the guide says to for your GPU
what i have to change?
so i put rmvpe
wtf
my model is weird
not even looking like the preview
eww
OHGHH
its sound cool
how to stop heard myself xd
Set monitor to none
I personally keep it so I know the changer isn’t acting up
Does it sound weird on the voice changer itself?
a bit
But less?
yes
Might be the audio cable
112
it crashh
Reopen it make sure there’s nothing super intensive running
Ayo? @fierce hinge level 3 !!! 
K
Try using this audio cable instead
See if that sounds better on discord
im using this one
Increase chunk and it should be good
increase?
Might want to turn off index if there’s weird artifacting
how much?
Just by one or two levels
Unmute your mic so I might be able to hear what’s wrong
Might be the tune
Sounds good to me tho
u think so ?.??
Say soemthing again
might be the discord mic test
Yeah
Should
kk
okay
w w w
ill test on fivem later
but something is weird
liike
its look like a accent
Accent is controlled by the index
owww
Just a little note you will need to limit frame rate or game settings
So you can give more room for the voice changer to convert and stuff
kk
and when i start to talk it make like
pp
pp
like idk
maybe only on the client
and sometime everything work
then a weird voice come xd
idk wtd
and what is passthru
Disables the voice changer and uses your real voice
kk
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
Hey guys, ı have problem with applio goggle collab..
I can't downland the file
Or the audio doesn't play when ı press the button
Did you convert it
Yep
Many times
Im not using the collab right now, but ı got that problem multiple times
applio moment
And ı don't know why, when it start to downland, it took so long even it's just 5mb
will my pc work well with the ai thing.
i have a radeon 6900xt 32 gb ram 12th gen i7-2700k
I made these from mobile
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- EasyGUI, by rejects Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
So, whenever I save an audio I generate with Applio, it downloads but the whole page resets. I'll have to replicate the error and look at the cli window but that's basically what's happening right now
it never did this before for some reason
Ahem...no one is ready to hear this but believe me : voice conversion and deepfakes is nothing new...it existed a few years ago, it's just now that's accessible to the public for personal use
someone has a google colab for real time voice changer
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
thx
WHAT THE FU-
First Kits, then Weights!?!?
Just use RVC
they get way too comfortable with sticking on one platform and don't want to learn anything new
. Can't force them since that anger issue guy also ignored the pings
For some reason that bro doesn’t want to use RVC when it’s basically the exact same thing with different ui
People don’t want to experiment since they want their ai e girl as fast as possible
I can fix you nah, you're cooked
its not letting me launch it anymore, when i click it nothing happens
Is everything unzipped?
yes
nevermind i found a differnet file with the exact name and that ones working
do yk how to upload a voice effect to the thing?
Ayo? @void stratus level 2 !!! 
try this if it closes instantly #🔍│help-w-okada message
hey guys
so, i cant actually upload files to gradio
it just wont show up at the dropdown menu
i'm using Ilaria-Mainline
can someone help me?
i don't really know what to do
Click refresh
“Edit”
i've did it a lot
still won't show up
Wait are you using Ilaria mainline locally?
Wdym not really
Are you using Ilaria RVC hf space or did you download it and run a file
Or are you using a colab
because i dont know how to use it locally
Im asking what platform your using it on
colab
Are you trying to make a cover or model
cover
I would recommend to use the current HuggingFace space for Ilaria RVC instead
It’s way faster and has easier ui
-spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
thank you sm
i actually didnt know wich one to use
thanks
ive been having a hard time with colab lately
damn it IS better
actually easier to use
so i have the app and stuff, but when i speak in the app or in a external app like discord i cant hear myself speaking, it doesnt detect a audio
-realtime you need to get a audio cable
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
i already got one
after i open up start_http it leads me to BOTH the website and the app, but before it only lead me to the website and it was working fine
after leading me to both the website and the app if i close the app the website automatically closes
okay so its working in thea pp i can hear myself
Is your output on w-Okada set to the cable?
nope ij ust made it so its default
it was cable before for the output
but it wasnt working
Ilaria RVC Zero is recommended now for online inference
Output should be the cable and monitor should be your headphones
-spaces @faint bloom
Suggestions for @faint bloom
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
aha thank you that works
easygui has been discontinued, Ilaria RVC is the best bet for inference
https://docs.google.com/document/d/1YbXcLFPaGjhOdG5NFkK3QrucCEpHZBwFUxkeMO8aB18/edit
Table Of Contents Introduction Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Settings (Inference) Ilaria TTS Introduction Ilaria RVC Zero, is an RVC (Retrieval-based Voice Conversion) Fork made by Ilaria & mikus, running only on Hugging Face Spaces, it’s called this way...
How to use tts?
Guys how do you deal with sfx in the dataset? Im making a dataset for plankton but anytime he speaks there's some random cartoon sfx that stick even after I isolate the vocals, is there any way to remove em?
stuff like this
i could hear these sfx in the vocals when i was testing the model
Spectral denoise with rx izotope maybe
else try mvsep.com and use the bandit one i forgot where it was exactly
would try mvsep first
Thx so much I'm ganna try it out
Worst case cut out all the words where the noise happens
Oh well i guess you can also try BS-Roformer and use 2024.04 on mvsep
usually does the trick, i forgot about it for a sec
help with tts pls it generating tts but not convert to ai model voice
Ayo? @mint mulch level 1 !!! 
use it on ilaria rvc
hey guys im very confused, what is titan? is it the new rmvpe? and how do i use it?
its a pretrain
What does that mean?
its a thing for training models, dont bother
Oh okay thankyou
Ayo? @ember shore level 1 !!! 
One more question tho
Does it matter what I use titan on? I mean is it better or mango crepe or rmvpe, that sort of thing
the easy Gui website is not working?
its not a rmvpe or mangio crepe thing (its called pitch extraction method what youre saying), completely different thing
can you send the link?
thats super old
Table Of Contents Introduction Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Settings (Inference) Ilaria TTS Introduction Ilaria RVC Zero, is an RVC (Retrieval-based Voice Conversion) Fork made by Ilaria & mikus, running only on Hugging Face Spaces, it’s called this way...

Is there a tutorial to learn how to use this?
Ayo? @lunar ice level 1 !!! 
hun the link i sent you is the tutorial
oh sorry
no prob bbg
Hi! I'm trying to make a voice model for Cynthia Erivo because I need her voice in a couple phrases and a bit of a song for a small performance. I am at the phase of isolating her vocals in a bunch of songs but I'm very confused
AI HUB Docs