#✨│ai-help
1 messages · Page 177 of 1
first link
-rt if your on AMD use the w-Okada fork
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
No
Greetings, does anyone know how to make covers? I forgot how to make them?
And if so, can someone send me the link?
whats your pc GPU?
my voice changer is rly crackily when i talk
What do you mean PC? Now you have to make covers with a graphics card?
are you on mobile?
Yes
and yes there always have been a local way
you can just do it the cloud way
just use ilaria rvc zero which is the fastest for free on cloud
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
Thanks
How can I make my own model?
whats your pc gpu?
3060 I think I don't remember
You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
Just to be sure
hi does anybody know how to get the Kai Cenat voice in Okada??
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
I'm on phone but I checked a pic of my specs and yeah it's 3060
alr, rtx 3060 its good, u can train locally on that
check https://docs.aihub.wtf/ -> RVC -> Local -> Download either mainline (original rvc) or Applio (rvc fork with some extra features, same quality) on your pc
Last update: Mar 10, 2024
Yep
Just tell me all the steps or a link to a guide and I'll note it down for tomorrow
Isn't this helpful?
ye its the general steps, the docs have everything u need to know
yea if u followed what i said ud find the applio and mainline guide link
Oh mb
Ayo? @obtuse forge level 1 !!! 
my voice changer is rly crackily when i talk anyone know how to fix?
Im new can someone send real time voice changer link i have 1660 super and i5-11400 if can use local please send local version
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
1st is the wokada optimized fork
@low shard can you quickly explain what rvc and w okada is and what's better for real time voice changing stuff
I wanna make my own model and I did like an hour of research and
I found out ab mangio
That should work right
Ty
@violet swift dont spam
Cpu or gpu version?
Ayo? @violet swift level 1 !!! 
rvc (retrieval-based voice conversion) is mostly used for inference (use models on a prerecorded audio like ai covers), and training (making) models, i mean the original rvc (mainline) has a realtime version, but wokada is better for realtime stuff, especially the newest fork
mangio is an old fork of rvc, its not udpated since a year
Oh okay so
Do I use rvc to make my models then wokada to play them?
yes basically
Oh okay
Well it looks so much simpler to make a model with mangio
Mainline seems so complicated
if you are watching a yt video, its mostly outdated
i'd suggest u to use Mainline or Applio but alrigiht
Yeah it's a few months outdated but I couldn't find a more up to date video
Is applio simpler?
I saw that it's kind of the same as mangio
there is no up to date yt tuts, all youtube tuts are outdated
which is why all the guides here are written
the ui could be
Yeah okay
´does anyone has girl model?
Ayo? @brittle wing level 1 !!! 
again I will give up to train models on Google Colabs right now until I had a laptop and because most of my favorite groups/bands/artists are popular/famous or from big companies because it's hard to be fan of underrated musicians/celebrities (especially underrated K-pop groups/artists ones because most of K-pop stans are only supported popular groups since K-pop nowadays only care about popularity) because only few people do models for underrated celebrities so I rarely do or train models for underrated musicians/celebrities and I will just train their models on Weights and Astra Labs for awhile, but even underrated western/American artists like Maggie Lindemann is easy to do models I mean because most of western artists are solo artists so it's easier to do datasets for them compare to Kpop groups because we should cut each K-pop group members' parts from their songs just to make datasets and models for them haha
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
does anyone know how to fix this, it appeared when i tried to generate the index at applio
I have a question
I have a very clean dataset of 35 minutes.
I would like to know if there is any improvement by using the titan medium.
or a pre-trained model in general
we came to the consensus that the original pretrain is better. Titan adds a bit of ringing noise at the end of each sentence. Still usable though for realtime
I have heard that it also distorts treble and was recommended to use klm.
for really clean datasets. Shouldn't be an issue with yours then
klm has gradient issues which in simple words means your training might become unstable
in my case I thoroughly cleaned my dataset xd
with uvr and rx 11 modules and T-De esser and Renegate
32k
also my audios are in .flac
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Does anyone know what voice changer clients support rvc 2
And what is a good rvc for a 3060 ti
Although not a helper or main IT staff, I'd assume you'd use the w-Okada Voice Changer. It should be good enough.
I'd check out #🔍│help-w-okada if you want to work with the Realtime Voice Changer.
-realtime
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Wait the 2nd of the list one is getting free training too? I thought it's only Streaks and I've been on Weights's server for a months but I still don't know there's another free training option besides Streaks I mean that use referal code or get 5 subscribers thingy
Whats the best settings for rvc again?
i cant hear the voice in the voice changer i can hear mine but not the one i selected
!help
Unlock the world of LunaBotPrime, absolutely free! Dive into the realm of premium music quality, enjoy it to the fullest. LunaBotPrime - where the music never stops, and the magic continues!
You can invite LunaBotPrime with this link
LunaBot 🌙 is the perfect music bot! Feature rich with high quality music! And Custom Playlist
You can start listening music by just joinning a voice channel and typing: /play [song name or link] (Remove brackets).
We support only Spotify, soundcloud, bandcamp and more!
To view more help on a specific command or category, run
/help <command> or /help <category>
Important Links:
Support
Premium
Invite
Command Categories:
🎶: Music
💰: Premium
⚙️: Utility
📕: Admin
Select A Page From Dropdown Menu Below
oops
No Information found for command rvc
There's no command called
rvc
-rvc
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
what is a good site or anything else for getting acapella,s ?
and is UVR still the best de reverb tool ?
for when you get vocals with alot of reverb and echo
@timid olive from where i can download rvc
@crude bolt from where i can download rvc
@brittle wing from where i can download rvc
@steel forge from where i can download rvc
@hearty idol from where i can download rvc
@long forge from where i can download rvc
do not spam
bro please tell
Ayo? @bronze zenith level 1 !!! 
-rvc
Suggestions for @bronze zenith
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
okay then?
Is RVC Disconnected broken right now?
I keep getting "pickle.UnpicklingError: invalid load key, '<'," error when trying the training step
I was trying to use a custom pretrained model there
I have a question, is it mandatory to provide index file path while running inference? I'm asking because for some of the models I couldn't find index files but have .pth files. (I'm using Applio).
Ayo? @wise roost level 2 !!! 
Is separating the sample voice lines into individual files a bad thing
(I hope not)
sounds like asking for a ban hammer

Applio isnt able to generate an index file
Ayo? @obtuse forge level 2 !!! 
@flint geyser
applio colab
its actually the best practice
i ran all the extras too
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
you were training?
yeah
its recommended to first create the index file
before creating the pth
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
i cant create the index file for some reason
so uh
Idk what i should make a model of
It's written when u click the 'i' of referral codes
Okay thanks
Yw
But it's free or need to pay?
Thanks
can some one help me with fork wokada
how do I manually add models to applio? idk where to put the index or model file
I'm too used to using normal rvc
Last update: Apr 01, 2024
How can I separate a track from double audio in Audacity?
What do you need help with
We should get 5 rewards to train models?
what? every 5 people that sign up using ur referall code get u a free premium job
Oh okay thanks
How to start the web
you mean this if we get people who joined using our invite code?
yea, u should be able to get 1 premium job for free if 5 people sign up using your code, u should be able to see the status clicking on your account
Thanks
But if we can train 5 models or more than just 1 model when we get 1 premium job unlike on Streaks that we can only train 1 model each/per 5 Streaks?
you get only 1 premium job free
Oh okay it's the same with Streaks that we can only train 1 model right?
yes
Okay thanks
the exe if your using the fork
how to run rvc locally
whats ur pc gpu?
hey , can we run rvc on node?
no?
i mean you can try but you wont success
12c | 256GB | 1.8TB | 1Gbps
what
oh
do you mean like as a comfyui node?
i thinhk he mean nodejs
aio
skidbi
i mean something like i can run it with console commands,
Ayo? @ocean pier level 1 !!! 
yea that wouldnt be possible
sed :((
so you mean CLI/NO UI version? maybe https://github.com/blaisewf/rvc-cli
yeah something like that
but node doesn't have gpu
12 cores cpu only
are u talking about nodejs?
nodeaio
@low shardwhat is voice changer rvc name
w-okada
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
real
okay if we can't train models , can we do voice covers?
guys what is metadata.json for
i downloaded the model from weights.gg
do u need to extract the zip file to use the voice model
yes
mines sounds like shit
i dont know much about that sorry
huh
i cant send any pictures
its not used for anything dw, its just info about the model
1.5.3.18a
Ayo? @tight halo level 1 !!! 
like, it has some delay or it doesn't sound as it should?
what are all those
when it just doesnt match your voice and sounds different
like a male voice on a female voice
also what does "Epochs" means
in models
it says like 200 Epochs
and stuff
so like what does 1000 Epochs mean
lot of detail put in the voice model?
how do you set up the voice models?
its an unit of measurement of the training cycles of the ai model, there isn't a right amount
I wanna try it out
yes i totally have those monitors
ohh, i dunno much about wokada but u should change the pitch, like if its lower, the voice sounds deeper, if its higher, the voice sounds higher yk
I downloaded the post malone thing and I wanna try it
do u have another good girl voice?
not the one u sent
its just how many cycles it has been trained on the dataset, it doesnt mean its good quality, it varies alot
@low shardWHAT ARE THOSE SETTINGS FOR
for realtime voice changing or ai covers and whats ur pc gpu
not really i dont even remember i sent u one
i think u confused me with someone else
Like ai covers
Ayo? @flat bison level 1 !!! 
I make music and I wanna make my voice sound different
Tbh idk the gpu stuff
is this a good girl voice @low shard
there should be an explaination of every setting https://rentry.co/ForkVoiceChangerGuide#audio-setup
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
can someone do it for me?
You can check your PC gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
I can send the song to you guys
if u got a good pc gpu you could do it locally which would be the best, else u could just use ilaria rvc zero
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
read the name
it says voice changer client demo
idk where to start haha im not smart
what do i download?
how do i start?
Hover over labels with dotted underline and read
so basically, locally means u use rvc on your pc, cloud means u use it remotely on a good pc
emojikage is a furry in disguise
the whole program for this, retrevial-based voice conversion
check your pc gpu first
No, I'm an anime girl
is nvidia rtx 4060 good for rvc
Ayo? @fleet cedar level 8 !!! 
With an army of neco arcs
lie, i refuse to believe that
I have nvidia geforce rtx 2060 or 70 i think
are you sure?
I think so yeah
can someone do the post malone ai for this one song I'm about to send?
so you got 2 ways:
- Locally (no limits, but u have to download the program and could be a bit hard)
- check https://docs.aihub.wtf/ -> RVC -> Local -> Download either mainline (original rvc) or Applio (rvc fork with some extra features, same quality) on your pc
- Cloud (easy and fast, but limited by the zerogpu quota if you are gonna use it very very much)
- Ilaria RVC Zero (Guide with Link)
Last update: Mar 10, 2024
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Retrieval-based Voice Co...
@flat bison
can someone do the post malone ai on this song?
no
i was mostly talking to Nick
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
?
i already read that
the perf is green but tbe ping is high asf
can someone help
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
@low shardit dont work on discord
u have a vac?
Headset = normal voice,
line 1 = voice changer
I have lots I want to say but you just cant say 'it don't work ' 
i can
hey guys first time trying out the rvc tools, and okada and im having touble getting the voices to sound right, is simply just moving all the sliders a bunch all there is or is there some weird setting or software im missing
-rvc
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-rvc
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI for Google Colab, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
I'm not tech savy, can someone help me with my one track?
what are the minimum especifications for running rvc locally?
I'm trying to run a rvc voice of i show speed and kai cenat, but it doesnt work
Ayo? @dry kayak level 1 !!! 
install failed
Delete pretrain and model_dir folders then launch start_http again
Alright, I'll try it, thanks
i can only hear myself with the voice chnager it doesnt go through my mic and other people cant hear it
original wokada v1 is bugged with cpu, lower Extra to 8k or 16k
and f0 det: rmvpe / onnx
actually, I am using this setting. it's okay I guess
How I can make a model?
whats ur pc gpu?
NVIDIA GeForce RTX 4070
check https://docs.aihub.wtf/ -> RVC -> Local -> Download either mainline (original rvc) or Applio (rvc fork with some extra features, same quality) on your pc
Last update: Mar 10, 2024
thx
yw
@vagrant nebula, I have found 1 results that match your search!
@vagrant nebula, I have found 1 results that match your search!
@vagrant nebula, I have found 1 results that match your search!
do that to #1159514067187277865 too 🙏
people cant read channel names fr
@vagrant nebula use #🔍│find-models
is this the channel to get help about the weights bot?
i do /create and it says "One files uploading failed"
i think #1212837348463607890 should be the one ig
is there some type of post to make sure i get the best quality
i have a 4070 rn and it kinda sucks
Is free premium job will be reset if we don't do things everyday like on Streaks?
if u dont do the 5 day streak u wont get a free premium job
Oh okay thanks
But I mean that premium job from 5 subscribers via invite code will be reset as well if we don't have 5 subscribers for few days?
Ayo? @subtle cedar level 15 !!! 
i dont think it resets as long as u have 5 people who signed up, may be better to ask in weights.gg server but afaik it shouldnt
Okay yeah
your skill issue doesn't fix it
Guys i have a question and i'm quite new to AI stuff. I downloaded some voice models for W-OKADA (rvc models) and i tried some. For example a female voice, it sounds cool but also unrealistic sometimes when i heighten my voice or try to sing or something it sounds completely off. how to get a PERFECT female voice or a perfect male deep voice for example. what can i do? do i need to train or something?
help
Ayo? @mortal plaza level 1 !!! 
First time attempting to load an rvc/tts model locally through hugging face, I keep getting a missing config.json error when using AutoModel
I see a lot of models actually don't have these files, but comments indicate that people can successfully use them such as: https://discord.com/channels/1159260121998827560/1279874091754721300
So I am not exactly sure what I am missing
i want a realstic model anyone know good one ?
why my voice dont change in discord ?
I am trying to install RVC on docker with an RVC-Studio and same with chat , but it just wont Work. Is there any other open source alternatives ?
Sorry but there's no such thing as a perfect female model for W-Okada. (Mostly due to hifigan/W-Okada limitations)
Probably the model you used wasn't trained on singing and for that reason it sounded off when you tried to sing.
Check the #1175430844685484042 channel and try with the ones there.
Virtual Audio Cable suggestion?
What software has no limitations?
No one if we refer to voice-changer type softwares.
There's no W-Okada alternative with no limitations.
how do these g uys do it so realistic then. In a game there was some guy who sounded exactly like a female streamer
completely perfect
how did they do it
you could literally tell no difference at all
They sounded like 100% accurate
if u are on windows id suggest this
because it depends by the model quality
i tried like 10 different models, none of them were close to being somewhat realistic. you could always tell it's a Voicechanger
Probably he had a pretty powerful PC and a girl model trained on more than 30 mins or audio or more.
that one cut offs
i've got a good PC also, 4080Super GPU
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
But most GOOD female models are paid
ohh nice ;D
Because public models made by random people won't always work with voice changer properly
what wokada are u using? original or fork and whats ur pc gpu?
It's simple matter of testing.
What are some good ones? Can you send me some links for some very very good ones? It's okay if they cost some $
also my gpus 4050
thx for the link 🙂
before i reset my pc it was exactly like a woman
now it cuts off randomly
when i talk
I don't have any, sorry.
What are some good ones? Can you send me some links for some very very good ones? It's okay if they cost some $
sorry Leo i wanted to answer this guy 😄
wrong quot
quote
No worries.
oh thats not really suggested, i'd personally suggest using the wokada fork
wokada fork link?
i dunno tbh, its better u check out the model master shop
Also, you can just personally DM a model master from the https://discord.com/channels/1159260121998827560/1191429836321849435 channel for a paid model or post a paid request on the #1159289738314919936 channel
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
1st link
alright ty
yw, u may wanna use #🔍│help-w-okada for any issues as this is channel is about rvc lol
are local model trainers hard to set up?
Do you mean local RVC?
If you mean using RVC locally, nope, it's not that hard. All you need is a computer with a somewhat powerful GPU
is it faster than collab training
The speed will depend on your GPU and your training settings if you use local.
colab uses T4, you can find out performance comparison against other Nvidia GPUs
Ayo? @knotty moth level 36 !!! 
can anyone help
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
how to upload the model file i download to the realtime?
Are you using W-Okada or RVC Realtime?
If you're using W-Okada, on the GUI there's a button called "edit" you can click, there is a slot where you can add your model's pth.
Hi, is there a programmatic way to run models in quickwicks repo? no colab, no cloud hosting, i am looking for a way to run with either ( go/pytho/cpp ) or directly from a CLI app
#1159513888199540817 reads like unfiltered garbage mess, havent got any luck in there except finding garbage like deepfake (?????????) and other things
thanks for any help, smooch
@odd shale could u upload the file for me via megauploads or mediafire github takes 2 hours just for 5gigs
there is a cli version https://github.com/blaisewf/rvc-cli
unfiltered garbage mess? whats wrong with deepfake or ai things like that ??
thanks, i didnt meant to sound aggressive
sorry for that
❤️
yw, its fine
@jade marsh audacity > change your audio to mono and it'll show your true frequency in spectrogram view
What program is recommended for RVC local? I specifically want support for RVMPE and Titan. RVC GUI only has Crepe, crepe mini, dio, pm, and harvest
could u upload the file for me via megauploads or mediafire github takes 2 hours just for 5gigs @low shard
what file are u talking about ??
My audio seems to reach 21.5k on linear
mainline rvc (original, unedited rvc)
https://docs.aihub.wtf/rvc/local/mainline/
https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/releases/tag/2.2.231006
Last update: Mar 8, 2024
完整包 Complete package
For Nvidia GPU users:
https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006Nvidia.7z
For AMD/Intel GPU users:
https://huggingface.co/lj1995/VoiceConversionWeb...
strange. i would 32k anyways since thats what it wanted 
ill just chalk it up to RVC being stupid and needy
when that happens see where your audio is more detailed
so for example if most of your audio data is below 21k then is a 32k model
but if theres a lot of data in the 21k zone then it can work as 40k
simple words:
if the 21k audio is mostly blue/dark, then there's not much there
yeah check the dm I sent you. It's an advanced view of a spectrogram
rvc1006 nivida
what??
@low shard
could u upload this
to mediafire or something like that
2hours for 5gigs it outrages
alright, sweet. hopefully RVC is fine with my new current settings for now.
There may be some blue noise all over your audio if you dont check in Izotope first. Just saying that's what razer showed you
I know, but I was already training as soon as you sent that lol
worse case scenereo i gotta train again
i think that really depends by ur internet speed
no bro the servers of github are just far away
@acoustic scarab
How does the 'save-frequency' work in RVC v2 Disconnected?
is it like a checkpoint so if it fails i can return to training at like 100 epochs instead of starting over?
if yes, how do i access the frequencies that i've saved
help im using applio
i was reffering to a private payment
I'm using the local version of RVC V2 (1006NVIDIA) and there's a bit where the voice model gets stuck on in a few songs that I'm trying to convert, specifically this part where it's supposed to say "I've built a little empire" but it keeps either getting too high pitcher or too low pitched
how would I go about fixing this?
if a model doesn’t have an index file, does changing the search feature ratio make any difference?
Is anyone else having issues using the real time voice models ? the client wont connect to the servers for me
Seems like some background vocals might be interfering with the main vocals - did you already try filtering those out?
If it's that wait web server stuff, just delete stored_setting.json and restart it.
Explain your issue
https://drive.google.com/file/d/12hwM7-nV1glcTBTfECyhSwS74CRcXThQ/view?usp=drivesdk
I have bad news I did it well I can’t do it I’m watching the video it’s a lie to all the people on discord said no
it doesn't work
I need making rvc model voice humans
how to use models located at hugging face? i downloaded freddie merkury 300k model, copy to weights dir of rvc, and if i load it "Inferencing voice" tab of web ui, i have an error
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\gradio\routes.py", line 321, in run_predict
output = await app.blocks.process_api(
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\gradio\blocks.py", line 1006, in process_api
result = await self.call_function(fn_index, inputs, iterator, request)
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\gradio\blocks.py", line 847, in call_function
prediction = await anyio.to_thread.run_sync(
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\anyio\to_thread.py", line 31, in run_sync
return await get_asynclib().run_sync_in_worker_thread(
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\anyio_backends_asyncio.py", line 937, in run_sync_in_worker_thread
return await future
File "C:\Projects\RVC-WebUI\runtime\lib\site-packages\anyio_backends_asyncio.py", line 867, in run
result = context.run(func, *args)
File "C:\Projects\RVC-WebUI\infer-web.py", line 439, in get_vc
tgt_sr = cpt["config"][-1]
KeyError: 'config'
Traceback (most recent call last):
File "C:\Projects\RVC-WebUI\infer-web.py", line 203, in vc_single
audio_opt = vc.pipeline(
NameError: name 'vc' is not defined
what to do with config.json that comes with the model?
...I think that's a SoVITS model
Use another model
An RVC model
Ayo? @ionic pewter level 1 !!! 
ok. i know what to do with pth file, with index file. but what is npy file?
im confused w
Picht raccometed +9 or +10 +11 +12 +13 for male
Dataset: 01:53
Rate 40k
Bucht size 8
Pretrein Original
Epochs 250
what are these supposed to mean
guys hows the rx6000 series work with rvc? Currently im using wokada and im curious which one works better in latency and conversion quality
btw it only works on linux with amd card or window is ok?
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
the wokada fork (1st link) is optimized for amd gpus and works good for windows too
yea im using it rn and im just wondering whether its possible to use rvc on window with amd card
Ayo? @marsh sonnet level 1 !!! 
guys how to prevent outside noise
hello lads Rvc v2 disconnected aint working for me, i try to load the dataset and it finds all the wav files but at the end of it
it always just throws up a error and i cant figure out why
heres the error code if anybody knows how to fix it:
FileNotFoundError Traceback (most recent call last)
/usr/lib/python3.10/shutil.py in move(src, dst, copy_function)
815 try:
--> 816 os.rename(src, real_dst)
817 except OSError:
FileNotFoundError: [Errno 2] No such file or directory: '/content/temp_dataset/monkey/craftyz' -> '/content/dataset/craftyz'
During handling of the above exception, another exception occurred:
FileNotFoundError Traceback (most recent call last)
3 frames
/usr/lib/python3.10/shutil.py in copyfile(src, dst, follow_symlinks)
252 os.symlink(os.readlink(src), dst)
253 else:
--> 254 with open(src, 'rb') as fsrc:
255 try:
256 with open(dst, 'wb') as fdst:
FileNotFoundError: [Errno 2] No such file or directory: '/content/temp_dataset/monkey/craftyz'
the dataset is zipped and in the google drive rvcdisconnected folder right
yeah
could u tell everything inside the .zip?
monkey1.wav
monkey2.wav
Ayo? @tulip radish level 1 !!! 
3 and 4 aswell
did preprocess actually work, and the feature part doesnt show nothing but "no-feature-todo"?
seems good, this happens when you load the dataset zip right?
it tells me no feature todo at the end yeah
yeah
wait if it didnt load the dataset how did it preprocess
lemme send screenshots of everything i see real quick
if you think the zip content structure is correct, re-export the wav files in audacity, or another audio editor with the include metadata option disabled
C:\audio\monkey.zip\monkey\craftyz this is how it looks
move the files to the folder "monkey", dont include any subfolders within it
oh my im silly
thank you so much sir
hey guys im using kaggle to train my voice model and im keep getting thsis error in the screen shot so could someone help me how to fix this?
hi erm i made a acount but i cant log in bc it says csfr token mismatch can someone help me
Ayo? @brittle wing level 1 !!! 
the musics that i convert in separate youtube tracks are glitched
the voice is good but the instrumental sizzle
can i fix that or its common for the site?
how do i use voice models
Since it looks like the maximum frequency is 15-16k I should train at 32k, right?
And should I cut any frequency above 16k before training, or is it fine if I leave it as it is?
rvc is going to resample it to 32k, so every frequency above 16k is going to be lost automatically so don't worry
Okie, nice, thank you so much :)
rvc resampling is very bad and is not advised to allow rvc to do the process, instead use either rx10/11 or audacity to resample your audio
Oh I see, since I only have audacity how do I do it?
then you export
(rvc uses mono audio, if it detects the samples are in stereo, is going to convert them to mono)
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Okie, thank you so much :)
Ayo? @tranquil blaze level 3 !!! 
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Which rvc would yall reccommend to have a dominant mommy voice
is there aw ay so that my VC changer through discord and coms doesnt come through as robitic, is there a way to change that?
Ayo? @lusty garnet level 1 !!! 
I have 2 gpus
Do you mean a rvc model?
Well, go and check the #1175430844685484042 channel.
what does extra do? im assuming chunk just processes it so higher the better? but the longer it takes? to process it
I think you can go and read the realtime docs.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
I'm not sure which version of W-Okada are you using tho.
V1.5.3.17B i last downloaded this in 2023 of december , i tried to update it but i cant even download it just tells me link is broken on github and cant be downloaded
I would suggest you to change to the Deiteris' W-Okada fork.
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
It got improvements over the OG version.
oh nice
i gottta save my model real quick, since it doesnt exist in this discord anymorre for some reason
Yeah mb, i will
Ayo? @broken lynx level 1 !!! 
The original post of the model probably got deleted.
T_T, wonder why it was decent enough, tried like 40 models before this one worked the best with my voice
It's matter of testing models.
Some models won't work properly with your voice but other will
i have a very Raspy voice, because of health reasons, so i was tired of being harassed online for not sounding Feminine enough,
so i used Ai voice change okada which helped ALOT
I see.
every time i try to download anything
how do i get it realistic as possible
Nope, not possible.
Unless you probably use a model trained on crystal clear audio.
im talking about settings
like
which voice changer is the best, what settings to use for it to sound the best (idc about delay)
best voice extractor guys ??
to extract the acapella from a piece of music and make a model of it
I use UVR 5, except that we hear the voices in the background of the acapella, I don't know how to find the right setting
Try with the Deiteris' w okada fork.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
There you got a guide for it too.
also another question
is there a way to optimize it for discord or other apps
cause
it sounds really good using playback but people say it sounds like a voice changer in apps
if u know ofc
is anyone else able to download this? ive tried 3 different browsers at this point
can anybody help me how to get the voices
i have download it but i cant play
with what app i can do that ?
Dm
so im looking through the applio documentation. where did the different TTS Options come from? I wanna find a specific accent
Ayo? @still stone level 2 !!! 
like are these microsoft ttses or
wdym
Ayo? @lusty garnet level 2 !!! 
this
how do i make w okada better i have a 4070 and it sounds like shit
like what settings make the quality higher
Friends of the helper Help, I haven’t worked with rvc for a long time How to make a data set better, also how to Remove noise (I mean how to make the Voice clearer so that it doesn’t sound like he’s in the bathroom speaking)
@mortal plaza -> #🔍│help-w-okada
oh, i use a 1080TI for it
Every time i try to download it it wont it just gets corrupted
finally was able to download it this and install it
is it supposed to open up in the browser this time?
refer to the pinned guide in #🔍│help-w-okada and try another better model if necessary, dont blame on ur gpu tho
yes forked w-okada uses browser (less ram usage vs desktop apps, at least on my PC)
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
seperating voices from each others is a bit harder.
the approach is to split instrumental and voice.
then use karaoke to split voice from background echo voice.
then de-echo / de-reverb on the background singers to get them sound dry.
This is not a 100% guarenteed method, especially like in cases you got duet voices and both singers are heavily on the foreground.
tweak / apply diffrent multiple steps depending your audio-source
sometimes you get also the background singers out it with just the de-reverb or de-echo.
it can depend on the browser, especially when you installed like bazilion extensions on your browser 😂
but yes lets assume we are on a clean vanilla browser 🙂
the rvc fork was for me a good incentive to get rid of extensions that i didnt actually need 😉
try in Edge or another browser if there's some issues in Firefox, though having uBlock origin is fine
my rvc looks weird
it doesnt show a picture or any of the free models
does anyone know why
i got all the settings
just no models
the forked okada dont contain any
may depend on the accent compatibility with the voice to infer
which one do i download
not sure if you particularly want the demo models (which imo are bad). So are these models what you really want?
if not, you can download models from here : #1175430844685484042
the zip downloads in #1175430844685484042 or weights.gg usually don't include the image, only .pth and .index files
you can edit the model in rvc and attach any picture you like
it may also depend on pretrain you're using, also have you tried different index rate values in inference?
the accent is stored particularly in the index. Do you generate an index?
there are many pretrains around depending on your target language you training for
i know theres a guide out there explaining which pretrain is trained for what use , but i dont know the link
Table Of Contents Table Of Contents Introduction Types of Pretrains Where can i find pretrains? Index of the most famous public pretrains: Where to find and share other Pretrains How to use them locally: Non Applio/Other RVCs Users : Applio Users: How to use them online (Google Colab/Kaggle): RV...
also take a look in #1235952130855010365
then do the try-approach
print the table out on a paper
and throw a dart on the paper
and pick that
or - choose something that looks good
there is no wrong or right here. Some sound better , some worste - its all subjective
beauty is in the ear of the beholder
❤️
pretrains have their pros and cons regarding quality and some others
they have 1 thing in common , they reduce trainingtime.
the whole point of pre-training
more like that training voice models will never be good without a pretrain as the foundation
not totally agree with it - but yea you easily will need to understand that training a model without a pretain will cost you like a week instead of a few hrs.
to get a descent (not perse good) model.
plus you need to do a big time of work to clean out the audio source, and have a lengthy big variety of spoken words of this person. ... All n all , do train with a pretrain rofl.
what I mean is the comparison of quality & how it generalizes, no matter what the optimal epoch is
for example, ~30m of assumingly clean voice dataset trained with og pretrain vs no pretrain
i was nitpicking on your statement that you cant make good models without pretrain. So yes you can do that for an unreasonable time with unreasonable effort compared if you actually train a model with pretrain. It's just not worth doing so. I would always recommend to use a pretrain.
Unless you really want to make your own pretrain.
using a pretrain will save you alot time and effort for making a good voicemodel.
there are veteran voicemodel makers out there totally unhappy with current pretrains and make their own pretrain that subjectively yields better results when creating their models with specific needs.
them who fallback to the og pretrain
If I'm training on kaggle and I'm using t4 x2, do I have to double the batch size, like, if I wanna use 8 do I have to put 16?
its batch size per GPU
Oh so it's still 8 right?
I stumbled upon the pretrains guide and on there it said to double the batch size if the gpus are two, but maybe that only applies to pretrains, idk
the actual batch size is doubled, so again I said the option is per GPU
Ohhh I see, so since I wanna use 8 I have to put 4 because it's per GPU, so 4 * 2, right?
the guide already says that
Yep, I just thought it was the opposite thing, thank you :P
say the same after u trained once
Mic picks up headphones, headsets and especially laptop speakers not recommended
U can try: lower volume, enable sup2, move s.threshold to the right
Hi everyone, I'm doing feature extraction for a model, Its been going for several hours, is that normal?
When I did my first model training some time ago it wasn't that slow
what are you using? mangio crepe in rvc disconnected?
Hi. I need stable and good rvc golab link\
what's the best model in mvsep for isolating vocals
hello
Bonjour
Translation: Bonjour
How do I use RVC with XTTS? I get this error when trying.
Ayo? @dark ginkgo level 2 !!! 
whenever im playing vr chat on vr i wannna use the voice changer but it doesnt work like the sound does not go to vrchat
what are you using exactly? like link of the fork
Any body give me link of rvc 3 one click install one
rvc 3 doesn't even exist
what's your pc gpu?
Rtx 4060
what? you just have to download a prezip and extract it
oh u deleted the message
I used the one click installer
its better you don't use those, youtube tutorials aren't maintained
I can tell you to use RVC or Applio that has TTS too
the problem shown in the screenshot is that it cant find the output file because it cant make it for some dependencies issues
whats your pc GPU btw?
RTX 3060
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- Ilaria RVC, by thestingerx Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
About one with XTTS that is maintained, there isn't afaik
but applio uses Microsoft Edge TTS API to make tts and then uses that audio as an input to use it with rvc model
💀
Which is best for song covers and also manually controllable emotion in voice?
XTTS doesnt seem to have emotion control from looks of it.
Also this feature looks cool
Neither RVC doesn't.
So which tool still works then?
because take away I'm getting is, XTTS-webui by
daswer123 is outdated and no longer supported
It does work if I dont use rvc so yeah, rvc setup seem to be the issue
vc colab?
Do you mean Ilaria Mainline?
for tts? neither rvc (as its Speech to Speech natively) neither XTTS
gpt so vits is the best for emotions, but it works only for chinese, japanese and english
what ?? show a screenshot
so it's 2 serperate software?
One for normal talking and other for music and songs?
what antivirus is that?
AVG antivirus
google colab has no trojan, its owned by google
idk then cause i tested with avast antivirus and its saying the same thing
wdym? like one for ai covers, another for tts emotions?
yeah I was using an old rvc version, now I'm using RVC mainline with rmvpe gpu but it takes 1 minute per epoch, idk why its that slow
yeah, for one, I want to use it to make for ai cover or turnin humming into other instruments. I seen some ai tts do that.
It's for like instruments and stuff
And the other, being able to do voice dubs.
ig its a false positive, i have literally 3 antivirus (malware bytes, bitwarden and avira) and theres nothing wrong in it (im running it right now), its literally owned by google
also, its running on a remote google pc
its not even running on yours
One of the project I'm doing is converting bad voice acting on english dub with original japanese. that's testing of concept anyways
are you running the one that comes up with -colab?
or is there a newer version of it somewhere?
ig it gets false positive bc it connects to google's pc, ngl never heard of anyone else saying colab is a trojan, i use google colab much as i dont have a good pc
you are talking about https://colab.research.google.com/drive/1mHKTGH5e3SAyDSBss1KtiYRbDdQzwSMs#scrollTo=pz3lRmDHl_JG ?
or are you following a random youtube video?
for ai covers, rvc is used, while for text to speech gpt so vits is the best
i'm downloading applio right now, does this cover both?
Ayo? @dark ginkgo level 3 !!! 
or do I need to download something else?
also, how about the instument side of thigs?
applio uses edge tts, which isnt the best for emotions so no
i dunno about u but for me its all fine
which one you recommend for emotion then?
and again, which ones does song cover and music?
gpt so vits, its 2 different programs, the most updated guide should be https://docs.aihub.wtf/tts/gpt-sovits/
RVC for ai covers
from what i seen of your screenshot it seems an audio upscaler, it uses resemble enhance https://github.com/resemble-ai/resemble-enhance
💀
we are NOT buying that 😭
fr, banned that scammer
ima just use applio for now but thanks for the help
wait, u get that warning ONLY when visiting the ilaria rvc mainline colab ?
like, after u enter the site instantly u get that warning or is it for when you enter any google colab in general ?
just that one when i enter and when i try to install the rot13
Ayo? @vale osprey level 4 !!! 
i went on applio and it was fine
so is this like a remaster of sort? If so, might be intresting to take some old classics and fix it to sound like it's recorded with modern instruments.
wtf that's very weird
yeah ikr
btw u can check even the code yourself
there is nothing malicious in it
its open source
its the same program that was used in the outdated ui you downloaded, it basically 'upscales' the audio quality of speech audios
by upscaling an audio, you mean like add detail to lost audio content? Like fake audiophile? Adding missing infromation to mp3 and convert them into flac sort of things?
If it's just that, it's not all that useful...
yeah im honeslty too lazy to go through it all so ima just turn off the antivirus next time lol
i mean that it enhances the quality of the audio, and resemble-enhance specifically denoises it before enhancing
like the example from their github is
i just sent it as a file as the link was too long
if it's just for speech, why is it under instrument tab
but yeah, if it's just for speech, it's only really useful to use it to enhance something you struggled hearing due to distortion then
but if thats the case, subtitle is enough
i think it was just meant as like 'other tools', because in the official resemble enhance github repository its said its just for speech
different question, you said GPT was best for emotion in speech right? Aka making character talk. I dont see an option on controlling the emotion in the screenshots. Is it prompt based?
i mean,its good for low quality audios, another audio upscaler that can work also for music is AudioSR, which of i have made a fork that fixs a bug in the webui version
also what is ApplioV good for if it's not good for emotion or songs?
Like it defeat both purpose of ai voice tts doesn't it?
i dont use gpt so vits locally myself as i dont have a good pc, but it depends by the reference audio as seen in https://docs.aihub.wtf/tts/gpt-sovits/#inference-
Applio is not made for TTS especially, Applio is just a fork of RVC, which is Speech To Speech natively, RVC is not for TTS, what applio does is make a tts audio with edge api, and uses it as an input with the rvc voice model, it has good quality but no emotions. About songs, rvc is used for ai covers, or are you talking about making songs like suno?
When is where I should enter a link to make the cover?
Ayo? @peak pebble level 1 !!! 
So Applio is for song covers then?
the public url
okii!
Applio is for AI Covers and multilingual TTS
so yea mostly for song covers
ok, thanks, so I now know what model to use for what.
yw
alr, yw
How do I find the voice I want to use? (already downloaded it)
i forgot how to do it
click 'Refresh' and you should find it in the 'Voice' drop down menu
Btw, i see you are using Ilaria RVC Mainline right? if you want i can suggest a faster ilaria rvc version
where to I place the rvc folders for ApplioV?
RVC folders?
please!
there is this guide if you want https://docs.aihub.wtf/rvc/local/applio/#inference-
Last update: Apr 01, 2024
there is nothing when i refresh
where in the rvc folder? there is ton of sub folder
if you want the faster one, there is Ilaria RVC Zero
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
whats the model download link?
dont worry you can upload it directly in the webui as seen in the guide
Uhhh, Mephiles the Dark
Tell me what you want to do.
the link?
He said the Huggingface link to the model.
you gave it a name without spaces right
your welcome
well
Ayo? @peak pebble level 2 !!! 
could u try to stop running the cell, run it again and accept the google drive thing (if u want to backup) or else uncheck the drive backup checkbox and run the cell?
on my way
Hey, how do you change GPT-SoVITS to English? It comes out as Chinese by default.
ready
could u retry to download again the model and re test if it works?
did u download it from https://docs.aihub.wtf/tts/gpt-sovits/ ? the prezip should be english
yeah this link
u downloaded the latest which is this one?
It tells me to select a file from the audio folder, but I don't know what it means
ah thought it was all english, unfortunately the helper who made that guide isnt here anymore, but the guide i sent should help you to use it all
could you show a screenshot ?
also, I dont see an option to make text to speech anywhere
btw, I already did everything again
dw you shouldn't need to do that, but i just tested myself and it seems like that colab isn't working as i tested with different models and audios unfortunately
@proven hill Ilaria RVC Mainline colab seems broken, no matter what model, audio, or if u used the ilaria tts, it seems to give always:
ow :(
colab is not mantained by me
who maintains it then?
its better you use ilaria rvc zero which i sfaster
theres the credit section
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
Haven't I been doing something wrong then?
This colab was created by
Angetyde
Poopmaster
l3af
Using ilaria mainline by thestingerx
but the program is made by you though
i guess i can either ask angetyde or leaf
yea but it always worked so i assume its a colab error
i will tell them about the error and if there is no fix ig i have to say that its broken in #📰│dev-updates if none is able to use it
@flint geyser seems like the Ilaria RVC Mainline Colab is broken, could you check it out and tell me if its fixable?
let me know because if its not fixable i will just tell users to use ilaria rvc zero instead
The last time I used this program was more than a year ago and it also gave me an error.
I think what you used was the old ilaria rvc mangio rather than mainline
its better to use the zero version tho
.
ooh!! i see, can i use the zero instead?
its in the inference part,
it was an old version of suno, its emotional but not that good quality
Yes
thanks man!
alr!
Nick talking about me? Now he hates me, even though I was hoping this moment would come
if you want to clarify you can dm me, since I can't, after you blocked me
So GPT-SoVITS is still best for emotion then?
Ayo? @dark ginkgo level 4 !!! 
yea good quality and emotions
i just asked u and leaf to check whats wrong in the colab
where?
yes, but I'm taking advantage of the situation to clarify
1c-inference
answer this
Nick088
no need, asked only to check whats wrong, because if its not fixable i have to say it in #📰│dev-updates
That's a little demanding
Yes, I'll check now, but for me I need to clarify
?
Yo?
Nick è morto
Nice, Applio was the first one that sorta works. But yeah, no emotion I guess. Unless maybe...
so just for confirmation, applio doesn't support emotion correct? Like gasping for suprise, happy noise, angry, etc.
Idk
@low shard
yes
so for tts, I'm kinda back to square 1 since I cant figure out how to get the other one to work
i dont know if there is any other helper that knows about gpt so vits sorry, there used to be @dawn nebula but he left, hopefully an helper can help you there
how do I use this now they recently updated it to a new dumb llayout and now no audio plays even when i set it up correctly
I wish people would stop updateing layouts i get use to
or how do I go back to the previous version.
that i knew how to use
well now it's being even stupider
nvm it's solved... sorry
I dunno if this is the right place, but every time I try to use the RVC it just spits out 'error' after convert
Ayo? @drifting sand level 1 !!! 
which rvc
be sure u arent watching a youtube video
im going of f memory
do you get an error similar to this in the colab ? #✨│ai-help message
(btw u can check the actual error code in the google colab output, in the gradio ui it will just say error)
im only really familiar with the front end, where would I find an error like tha- oh
uhh
would that be under the 'run' cell?
from what i know that colab is broken very recently, just asking if its the same error to be sure that its broken
yea, like after the public url thing
2024-09-04 22:30:34 | WARNING | infer.modules.vc.modules | Traceback (most recent call last):
File "/content/Ilaria-RVC-Mainline/infer/lib/audio.py", line 63, in load_audio
audio2(f, out, "f32le", sr)
File "/content/Ilaria-RVC-Mainline/infer/lib/audio.py", line 41, in audio2
ostream = out.add_stream(format, channels=1)
File "av/container/output.pyx", line 132, in av.container.output.OutputContainer.add_stream
File "av/stream.pyx", line 111, in av.stream.Stream.__setattr__
AttributeError: attribute 'channels' of 'av.audio.codeccontext.AudioCodecContext' objects is not writable
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "/content/Ilaria-RVC-Mainline/infer/modules/vc/modules.py", line 188, in vc_single
audio = load_audio(input_audio_path.name, 16000)
File "/content/Ilaria-RVC-Mainline/infer/lib/audio.py", line 67, in load_audio
audio = file[1] / 32768.0
TypeError: unsupported operand type(s) for /: 'str' and 'float'
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/gradio/routes.py", line 437, in run_predict
output = await app.get_blocks().process_api(
File "/usr/local/lib/python3.10/dist-packages/gradio/blocks.py", line 1349, in process_api
data = self.postprocess_data(fn_index, result["prediction"], state)
File "/usr/local/lib/python3.10/dist-packages/gradio/blocks.py", line 1283, in postprocess_data
prediction_value = block.postprocess(prediction_value)
File "/usr/local/lib/python3.10/dist-packages/gradio/components.py", line 2586, in postprocess
file_path = self.audio_to_temp_file(
File "/usr/local/lib/python3.10/dist-packages/gradio/components.py", line 360, in audio_to_temp_file
temp_dir = Path(dir) / self.hash_bytes(data.tobytes())
AttributeError: 'NoneType' object has no attribute 'tobytes'```
I got this-?
yea its broken
use ilaria rvc zero instead
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
thank you, I will give it a try 
stop using either of those two AVs
how about the latest one? it couldnt be so slow as mangio crepe, even with low end gpu like 1660 super
Ah very sorry about the late reply, had to take a break due to personal reasons, I use the built in vocal and instrumental divider of RVC, how would I go about filtering out the background vocals?
UVR BVE or mel roformer karaoke
I managed to get 30s per epoch using RVC Mainline rmvpe gpu (running on a 3070), but I feel like it shouldnt be that slow
Ayo? @strong flicker level 1 !!! 
When I trained a model some months ago on an older version of RVC i was getting 10s per epoch or something
I was talking about feature extraction stage
What does the BVE stand for? I assume you're talking about Ultimate Vocal Remover?
so you suggest to change rmvpe to mangio-crepe in the feature extraction?
stick on rmvpe/gpu if that's what you intend, only that it shouldnt be as slow as mangio crepe
Is rmvpe better quality than mangio-crepe?
when I trained months ago I didnt have rmvpe so I never tried it
rmvpe is generally good and pitch accurate, especially for voice changer use, but it's also fine for making covers
aight, ill stick to it then, thanks
thats where I am atm
get BVE in VR arch category
4B_SN
right?
Sorry if I'm asking too many questions I just don't want to misunderstand you
OH
OH HOLY SHIT
NO
I AM SO STUPID
THANK YOU ALYA AND MJ
What gpu batch size is recommended to use?
8 which fits on your 3070
Thanks!
Whats ur PC GPU
-rvc
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
My apologies
anyone have the public url?
Hi everyone
I wanted to create a model, so how much batch size do I get?
Please help me how to make the batch size better
what happened to the RVC docu for training locally?
if anyone has that please send it to me
Ilaria RVC on maintenance?
Last update: Mar 8, 2024
the ilaria rvc mainline google colab is broken #📰│dev-updates message
But you can use Ilaria RVC Zero which is better
the public url is different for everyone using the google colab
what are you looking for?
AI HUB Docs


