#✨│ai-help
1 messages · Page 179 of 1
Are there any means to create voice models for free without logging in?
have been for a whole hour
Ayo? @ionic crag level 1 !!! 
Hey guys I need help with RVC I downloaded it and extracted the files to the correct locations on it but it keeps saying error when I try to convert
I’m using rvc WebUI
Do you wanna train models or just use models?
If it's the second case, you can just use Applio Colab.
Use models
Appilo colab?
You don't have to install anything locally.
Last update: June 15, 2024
All you have to do is run certain cells and that's it.
its still a ai voice changer right?
Ayo? @nova geode level 3 !!! 
Do you mean realtime?
Do you want to use models on realtime?
Because W-Okada does that.
Not RVC.
so whats using models in real time?
Talking while your voice sounds like the character you're using a model of.
yeah thats it
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Read Deiteris' guide.
ah ok thank you
i can add voices to this right
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
is there a way to combine audio to audio and tts to audio? Some song covers are hard to do as they have overlapping vocals and would like to manually edit the text myself to correct somethings the ai is mishearing
Ayo? @tight stag level 2 !!! 
my understanding is, rvc use voice to text then text to speech again correct more or less
so basically manually fix the text after voice to text
best way to currently train a model with raw vocals for music? had a collab but it doesnt work anymore
Use Kaggle Mainline or Applio colab.
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
I think it's still improving.
Nice, thank you :)
Altho you could test the pth's that match these two peaks.
No
Alright...?
The first one is at epoch 171, I can just try 170 right?
And speaking of this, when I have to test pth's I have to pick the lowest smoothed value, and not the lowest actual value, right?
Well, you can test both.
I can't wait to see how much this movie sucks KEKW
Ayo? @tranquil blaze level 5 !!! 
Okie, thank you :)
Mojang will totally make loss
But like, the lowest smoothed value should "guarantee" that the model didn't overtrain, no?
Oh yeah definitely, making that a live action movie was the biggest mistake ever to start with
Nep, if you wanna share memes you can do it on the verified channel, this channel is for helping.
What about fnaf tho
No...
That goofy ahhh springlock
Maybe, but the results of your model will also depend also on your dataset's quality.
And how you did clean it.
Yeah, that was good, but it fit the style of the game imho, minecraft is too cartoon-y for it, they should've made something like story mode but as a film or something like that
I miss story mode
Well, I'm making a model of a friend of mine's voice, and I had to use his voice notes that were in opus format, so I didn't have much quality to start with KEKW
Well, it's fine then.
I actually never played it, but watched a couple of episodes of a playthrough when I was little, I'll catch up on it though because it's good :)
@tranquil blaze
I actually found out about it when I searched "minecraft" on steam (because I saw that my cousin bought every game from there, so I thought it was on there too lol) and that came up hahaha :P
Jokes on you they shut down the server
However, both seasons of the game became unplayable due to the closure of Telltale Games in late 2018, causing the game to be ultimately discontinued on June 25, 2019.
Yeah, I know, but you can still find it if you know what I mean ||(arrrgh 🏴☠️)||
rolls up his sleeves
The hardware optimization will not exist
Nice, thank you :)
I think you both can keep talking on the verified chat channel.
This channel is meant for helping about RVC.
Bro transformed into a femboy
Yep, sorry, we'll go there
AYO WTF
Guys, Im trying to use a ai model I found from one of the channels here but when I paste the link provided into the box it comes with an error, am I doing it wrong?
No
Whenever I record something the first 2-3 seconds have random distorted robotic sounds but After that it works and sounds good. Even when I dont speak directly it makes distortion sounds before it works
applio was just working fine and now all of a sudden, all of my audio output comes out slowed down/pitched down. but i haven't messed with the actual pitch. i've changed models and songs, same result.
idk if this makes sense? but please lmk if anyone has had the same issue and has a fix
what UVR process method is the most consistent?
one of my voice models took like 5 minutes to download into the google colab gradio, and when i try to infer with it i get this error
Traceback (most recent call last):
File "/usr/local/lib/python3.10/dist-packages/gradio/queueing.py", line 532, in process_events
response = await route_utils.call_process_api(
File "/usr/local/lib/python3.10/dist-packages/gradio/route_utils.py", line 276, in call_process_api
output = await app.get_blocks().process_api(
File "/usr/local/lib/python3.10/dist-packages/gradio/blocks.py", line 1923, in process_api
result = await self.call_function(
File "/usr/local/lib/python3.10/dist-packages/gradio/blocks.py", line 1509, in call_function
prediction = await anyio.to_thread.run_sync(
File "/usr/local/lib/python3.10/dist-packages/anyio/to_thread.py", line 56, in run_sync
return await get_async_backend().run_sync_in_worker_thread(
File "/usr/local/lib/python3.10/dist-packages/anyio/_backends/_asyncio.py", line 2177, in run_sync_in_worker_thread
return await future
File "/usr/local/lib/python3.10/dist-packages/anyio/_backends/_asyncio.py", line 859, in run
result = context.run(func, *args)
File "/usr/local/lib/python3.10/dist-packages/gradio/utils.py", line 832, in wrapper
response = f(*args, **kwargs)
File "/content/program_ml/program_ml/core.py", line 119, in run_infer_script
infer_pipeline.convert_audio(
File "/content/program_ml/program_ml/rvc/infer/infer.py", line 317, in convert_audio
self.get_vc(model_path, sid)
File "/content/program_ml/program_ml/rvc/infer/infer.py", line 907, in get_vc
self.setup_network()
File "/content/program_ml/program_ml/rvc/infer/infer.py", line 943, in setup_network
self.tgt_sr = self.cpt["config"][-1]
KeyError: 'config'
I trained for 500 epochs, should I use that checkpoint or something from earlier than that point?
Or even this point here, before the first time the curve went up
does anyone know how to find social media by photo?
pick around 8.3k first and test if it sounds good
Okie, thank you :)
what do I do from here? @royal jacinth
how can i denoise voices without losing quality?
what does it mean when the error says: "index -1 is out of bounds for axis 0 with size 0"???
I tried a backup of Applio 3.0.8 I’ve had, and it worked normally, but I’d still like to use the updated version.
Help, I don't know what this error means haha
Is it something with the model? I've never encountered this issue before with other models
why does it say this when I'm trying to download applio? T^T
sorry forgot to say but the latest Applio has feature to apply reverb, not sure if the colab has that too
hello!
I've been trying to use the google collab for a while but it seems to be broken , I wanted to know if there is an alternative to use the application .
be more specific, which colab link?
outdated, don't use that
do you wanna inference (use models) or train (make models)?
use models
Ayo? @stone urchin level 1 !!! 
but if you have a guide for making models , may it can be useful
use ilaria rvc zero that is a zerogpu huggingface space (A100 gpu faster than colab's T4 gpu, so its faster than doing it on colab)
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
@low shard thank you! and Sul-Sul!
i dunno what's sul-sul but your welcome
either use Ilaria RVC or try this new one if you want
https://colab.research.google.com/github/ShiromiyaG/RVC-AI-Cover-Maker-UI/blob/main/assets/RVCAICoverMakerUI.ipynb
Ayo? @knotty moth level 38 !!! 
it's supposed to be a greeting in simlish , the language of the sims , I said it because of the rabbit you have in your php xd
Ohhh yea lol
Ilaria give me this error xd
if it's due to GPU usage limit, login to your account and try again
how long was the file u uploaded btw?
tbh that seems like the No GPU Avaible for 60s or either a too long file,
unless you already used it a some times
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Retrieval-based Voice Co...
it works now
which it's the limit ?
im using blinding lights so its 2:55 long
I think it's probably the latest version's problem then... 
alright
can't say exactly but zerogpu inference limit is 300 seconds (in terms of time the A100 gpu takes to process your audio, not your audio lenght), but i think like 30 mins
@proper shale is it true that he composed the soundtrack for sonic 3 ?
thanks , it's good to know
your welcome
Actually, no.
The thing is, they were apparently planning to hire Michael to do some songs, but it failed because of the 1993 accusations and all that. The person who made the supposed "MJ" tracks was actually Brad Buxer, which was one of Michael's producers/studio engineers and was a keyboardist for his tours. Brad is responsible for some of MJ's sound after 1991, and you can kind of hear MJ's sound because it's Brad who made them.
reminds me of several cases where X songs sound similar to other songs by different singers but they are really the same producers.
now i got error on the vocal separator
i wanna to clean some audio but every model give error
use this UVR instead https://huggingface.co/spaces/TheStinger/UVR5_UI
dont use the UVR inside ilaria rvc zero, its doesn't use the gpu
any tips on how to make a voice model sound less robotic and breathy?
guys can you help i have 4060 laptop and i core i5 can you help please
whats ur pc gpu?
7300
Ayo? @tame widget level 1 !!! 
anyone knows how to solve this? its the UVR_UI
I hope you're doing well. Sorry to bother you, but I’m reaching out because I need one of you to help me create a voice model pls
ive got a voice model downloaded, but im not sure how im supposed to import it and be able to change my voice as the guides i looked at did not specify it
Do you wanna use a model on realtime?
Use the #1159289738314919936 channel and make a post requesting your desired model.
Which colab are you using?
yeah, im on realtime voice changer client for rvc
Are you using RVC Realtime or W-Okada?
I would highly suggest you to use Deiteris' w-okada fork.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
okada i think but it says rvc so im really confused
i downloaded it from okada
RVC stands for Retrieval Voice Conversion
W-Okada is for using models on realtime.
Maybe you should take a look at the deiteris' fork guide including that i suggest you to use that fork.
i think its working but the volume meter isnt going up so it only comes works sometimes
Ayo? @tender sand level 1 !!! 
Maybe you should ask @viscid moss
Eddy ven a ayudarlo.
Apologies 🙏 UVR5 UI is under maintenance. I'm adding new compatibility features
Use Hugging Face or UVR5 NO UI instead
oh its fineee, i dont want to bother cause i've been bugging you for a couple days now so im sorry 😭
Ayo? @obtuse galleon level 2 !!! 
dw
Hello I have an rtx-4070 and I want to make some models. However I have no idea where to start. I'm Python literate and I have WSL and docker so if needed I can run on those for GPU stuff.
quest #1 how do i train one?
You can start by reading the docs buddy.
-rvc
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
ty
man I really should have asked here first lol. I was trying to code my own with torch in python haha
after training it i could not get it to load in the voice changer haha
wow this is so easy. 500 epoch should do it. Yo Leo dont know if you were being frank or jerkish ether way thank you. I'm off to the races !!! 🙂
Ayo? @simple mortar level 1 !!! 
Is there a way to view the output? it says processing but no terminal.
You're welcome buddy.
View the output of the model?
You can do that after fully finishing the training of a model and testing it with an audio
no i just mean the terminal. like in another model i made for game audio detection it would show the training details like accuracy, val_accuracy, loss, val_loss.
do you know if over fitting is an issue when making voice models?
nevermind i found it in the docs
TensorBoard
-Use the model that was closest to before the OT point.
For more information, check the guide for your RVC in this website.
how do i get the model before OT are they stored some place?
Why is it doing this
It was working fine like 5 minutes ago
I’ve refreshed it and everything
Did you get like an “No gpu is available for you for 60s”
is it over fitting?
Ayo? @simple mortar level 2 !!! 
I think so.
can i not stop it early?
okay found them it saves ever 5 lol and i found the output folder
RVC:: slotInfo.modelFile G_9345.pth
[Voice Changer] post_load_model ex: 'config'
Traceback (most recent call last):
File "restapi\MMVC_Rest_Fileuploader.py", line 92, in post_load_model
File "voice_changer\VoiceChangerManager.py", line 170, in loadModel
File "voice_changer\RVC\RVCModelSlotGenerator.py", line 37, in loadModel
File "voice_changer\RVC\RVCModelSlotGenerator.py", line 43, in _setInfoByPytorch
KeyError: 'config'```
Can someone help me transcribe a song?Most of the lyrics are clear to me, it's a rare SoundCloud song by an YouTuber rapper, I can send the file and the transcribed lyrics.I'm planning on making a lyric video, so I need someone's help.I need a native English speaker, thanks.
Also to correct some of the parts that are wrong etc.Also don't tell me to pay an AI or stuff, I have already tried to trasncribe with ai, helped to some extent but human ear is something else
How can i upload a model to weighs
how do I use the pretrained model with the D and G thing
what is the best realtime voice changer ?
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.
HELP
applio rvc google colab btw
Huh yes I use the colab but NVM audioalter works good with reverb, but I need a softer reverb
Also how much index rate is okay
surely most better reverb plugins are only available for PC/mac
How much index/feature ratio is good on ai vocals?
Mhm I can dm you some of my ai covers (not that I won't redo them) and tell me if they're good or not
it depends on whether you want the model's accent or the input voice's accent, but higher index rate may be more likely to cause more artifacts depending on the model quality
I use UVR denoise afterwards...SO...but does increasing the accent make it more realistic
you can try aufr33 melband denoise that may be less muddy
@knotty moth could you help me with the G and D file of the pretrained model? I uploaded them on rcv web ui and did some training with it, but idk where to get the weight
It takes a thin layer of the vocals and decreases their volume after adding reverb so no
why not try them both and compare by using the spectrogram as well? I have yet to try it later
I have tried aufr33's denoise it decreases the vocals volume
Also Uvr architecture is always TOP IMO.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Ayo? @brittle wing level 4 !!! 
even modern laptops can actually run the fork voice changer locally in cpu mode: https://rentry.co/forkvoicechangerguide
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
Is it normal for UVR to convert my 48kHz mono wav into a 44.1kHz stereo wav? I'd like to keep the original sample rate. My source is definitely 48, verified with a spectrogram
Ayo? @slate zodiac level 1 !!! 
-guide
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
Ayo? @frail kestrel level 1 !!! 
as I know, there's only Bandit v2 in mvsep.com that outputs 48k wav files
however, you can try the slowdown trick by interpolate sample rate of the input file to 44.1k or lower, and then revert the output sample rate back
although that may possibly cause different result and may not work on deep male voices
Wouldn't advice it really, interpolation of 44.1 to 48 is worse than having plain 44.1 tbf, that's just simply due to the fact no interpolation or math in approaches like that is error-prone free
Comparing 44.1 and 48 has virtually no audible difference unless you're sensitive to high-ends and high freq brightness differences ( but again, if one doesn't work on audio daily and has sufficient gear, it's something an user on avg won't notice
yesterday it was working today i'm just getting 1 dot???
i see every 200 steps.
Maki! Congrats for getting the helper role.
Yep, that's how tensorboard works.
izotope RX spectral denoise could be better if there's relatively static background noise/ambience to capture
Can i do something if the model cuts some words ? maybe a setting problem ?
hi anyone help me what is the difference between the two? 64bit float and pcm 32?
If when inferencing an audio some words get cut, i think it's the audio, not the model.
This tends to happen if you use audio samples with overlapping vocals/stuff
I assumed there would be no real difference, so it's ok. So now that I have 44.1 wavs, does that mean I'm limited to training at 40k or below? I'm guessing that there's no reason to train at 48 now?
32-bit float is more preferrable and supported by most audio applications
depend on the spectrogram cutoff, if not 40k, could even probably be 32k
nope, you can train 48 as well. If it doesn't go well on your end after many trials and tweaks, go for 40k but for majority of cases, you should do just fine with 48s
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
cannot use rvc, but can use beatrice . Why?
Are you wanting to use W-Okada for using models on realtime?
Blud left
XDDDDD
i need help.. aicovergen in huggingface gives error and doesn’t work anymore :’I what other similar generators can i use?
-hf
Suggestions for @brittle wing
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
i recommend using ilaria rvc zero
Ilaria RVC Zero space on Huggingface or Applio Colab.
I reccommend not using rvc at all
- How to use RVC Mainline Colab by Cauthess
- Full AI Voice Model Training Guide (local) by Christopher Villanueva
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
XDDDDDDD
be specific, which space link?
does anyone know site where you can put .index and .pth files i didn't do that in a long time and i forgot
You can use the Applio colab.
Either Applio Colab or Ilaria RVC Zero which is faster
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...
does anyone know where i can get the audio sample that says "my model was made with whatever the name of the model is"
afaik no, probably done with TTS like I used to
forks like Applio & Ilaria RVC have built in tts
Question, the VoiceChangerType, How do i know which is better? Or should i stick to RVC?
hey gang. Is there a CLI utility to blend two voice models?
I'm using the Mangio RVC Fork on a P4000 machine on Paperspace. Using the CLI tool I've been able to train my own models and run inference, but I don't see an option to do the "ckpt Processing" that you can do thru the Web UI
Use Deiteris' W-Okada fork.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Hello, does this only support NVIDIA gpu? I have a an AMD Raedon RX 7900 XTX and it is very choppy below 100 chunks. It isn't choppy at all at around 192 chunks but very delayed.
How much crepe hop length for rvmpe?
you can't change rmvpe's hop length
the setting only applies if you use mangio, if not, it doesn't do anything
is named crepe_hop length for a reason
Does it affect rvmpe
Really
yeah, that setting is only for mangio-crepe
rmvpe hop length is hardcoded, so it cannot be changed
some forks like applio calls mangio-crepe just "crepe"
regardless, if you're forced to place a number, choose 64
Hi! New to using RVC. I've been using Bark and models in a npz file to generate ai voices.
I tried using TTS Generation WebUI locally, but I keep running into issues. So, now I'm using Bark in a Jupyter Notebook. I could generate my own models, but they take forever. The file type of my models are npz.
Is there a way to convert RVCs to npz files, or use Bark with RVCs?
Nope, they're completely different architectures.
You could probably use generated audios from Bark and convert the voice with RVC, though
Haiii MJ!!!
hi Tortuguita
Hi Miguel Jackson.
For mangio crepe 64 or 128?
64
It used to be 128 before
for training or inference (audio to audio conversion)?
I used to set it for both?
for inference is fine, for training not because the pitch extraction is more innacurate
for mangio training a hop of 64 or 32 is fine
anything below those numbers also works but there is barely to 0 audible difference
I understand
when i have a rvc upload and the base ones dont work how do i ifx
What do yourecommend using for that?
is rx 6700xt 12 gb good for rvc?
Should be fine for realtime, but since it's AMD you won't be able to train voice models.
As in, forks/versions to use?
For RVC, I'd just recommend using Mainline. https://docs.aihub.wtf/rvc/local/mainline
Last update: Mar 8, 2024
I have a 6600AMD why is the voice changer using 90 to 100 percent of my cpu?
Ayo? @grave river level 2 !!! 
Thank you! Looking into it now.
Ayo? @slow creek level 1 !!! 
You're probably using the wrong version of Okada for your GPU.
Here's the guide for the fork that has better performance for AMD cards: https://rentry.co/forkvoicechangerguide
Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update August 30th, 2024: New b2309 version
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuide...
You're welcome :)
Let me know if something goes wrong
btw which one good, rx 6700xt or rtx 3050
The 3050 is garbage and the 6700 is an AMD card so none of them are ideal for RVC
Aim for a 3060, 12GB, should be enough for realtime, training models, all that stuff
i forgor where to get the ai lmao
128 is default value for inference
Okkk
you can get a 3060 for decent price and it's much better deal
that good for livestreaming and use rvc?
Ayo? @elder halo level 1 !!! 
its 12 GB vram should make it good enough for most use cases, but if you want a bit better performance and AV1 streaming support, you can consider 4060 despite its 8 GB vram
i got it, thank you sis!!
-hf
- UVR5 UI, by Eddy and Ilaria Huggingface Spaces
- Ilaria RVC Zero, by thestingerx Huggingface Spaces
- RVC⚡ZERO, by r3gm Huggingface Spaces
- Applio, by IA Hispano Huggingface Spaces
you have probably used different batch size than before
oh am i not able to change it?
better start over if you wish
What pretrain would be the best for raw studio vocals around 10 minutes that cutoff at 48khz
does it matter if i change the number of CPU processes?
not related at all since it only affects preprocess and feature stage, but it should be 2 regardless of your cpu to avoid possible crash upon that
it has reportedly crashed on feature extraction with crepe at any hop length and all cpu threads used, regardless if it's intel or AMD cpu
but that should be fine if you have done so
I'm not sure why there's no AI model out of 90's Snufkin. I tried asking @lunar needle about him, but he still won't listen to me at all.
models makers arent forced to do you models
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
Hi guys 😊. Quick question
If i create a voicemodel in lets say the scale of F#. Will it pitch well in other keys or is it preferred to export it in multiple scales to fit the input audio better? I made a lot of VM’s in the same key (since it is easier) but got in doubt if it is a good idea 😊
how do i use the d and g files toegther?
for pretrain
that's the pitch shift/transpose option in the RVC inference that can vary depending on the voice model against the type of input vocals, and you can create a voice model from normal talking dataset, etc. not have to be singing dataset with your said key.
how do i use the d and g files for the pretrains?
the pretrain's D & G files or your trained model's?
the pretrain d and g files
Ayo? @quiet surge level 1 !!! 
read the guide here
https://docs.google.com/document/d/1j9J8A8Oop9bMOHmCs3jDXzPujuD6TQ0Q396rJ0MyuIc
Table Of Contents Table Of Contents Introduction Types of Pretrains Where can i find pretrains? Index of the most famous public pretrains: Where to find and share other Pretrains How to use them locally: Non Applio/Other RVCs Users : Applio Users: How to use them online (Google Colab/Kaggle): RV...
i was just gonna send that lol
i read it but im still very confused on what to do
what rvc are u using
that's the voice changer, not the one for training
oh mb guys
pretrains are only for training
If u need to train, ud need to check exactly this part https://docs.google.com/document/d/1j9J8A8Oop9bMOHmCs3jDXzPujuD6TQ0Q396rJ0MyuIc/edit#heading=h.66opz29d4d5e
Table Of Contents Table Of Contents Introduction Types of Pretrains Where can i find pretrains? Index of the most famous public pretrains: Where to find and share other Pretrains How to use them locally: Non Applio/Other RVCs Users : Applio Users: How to use them online (Google Colab/Kaggle): RV...
Thanks for replying!
This is mainly singing vocals. Maybe rap if it is really deep or high. Should I focus on creating more keys for the vm’s?
Would I be able to hear audible difference on a trained model in F compared to one in A in case my input audio is closer to that scale? Or would this only matter if we are talking maybe octave differences?
Ayo? @untold kernel level 1 !!! 
mb if I need to explain that even talking datasets with more variation of intonations and pitch range could cover enough pitch range for singing, and again the inferred voice will copy every input's pitch, offseted by the pitch shift I said.
Awesome! Thanks for clearing ❤️❤️
hey guys imma using uvr 5 but I can't select MDX23C (it doesn't show it) but it shows (uvr-mdx-net inst hq 3). What can I do?
find it in the MDX category if not yet
thanks I saw it
@knotty moth where is "process method" option on this program?
Ayo? @fair ravine level 2 !!! 
resolved
Traceback (most recent call last):
File "C:\Users\kkkkkkk\Documents\RVC1006Nvidia\gui_v1.py", line 874, in <module>
gui = GUI()
File "C:\Users\kkkkkkk\Documents\RVC1006Nvidia\gui_v1.py", line 114, in init
self.launcher()
File "C:\Users\kkkkkkk\Documents\RVC1006Nvidia\gui_v1.py", line 400, in launcher
self.event_handler()
File "C:\Users\kkkkkkk\Documents\RVC1006Nvidia\gui_v1.py", line 431, in event_handler
self.start_vc()
File "C:\Users\kkkkkkk\Documents\RVC1006Nvidia\gui_v1.py", line 558, in start_vc
self.gui_config.samplerate = self.rvc.tgt_sr
AttributeError: 'RVC' object has no attribute 'tgt_sr'
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
hey ik it's probably been asked before, but I'm training a 48K Mangio Crepe model, is there any pretrain that can help the process?
just use original
but tbh, this doesn't seem like a good idea in general, unless you have a suuuuper clean dataset the advantages will basically be minimal, and the disadvantages will be many
32k and rmvpe is just fine for most datasets, and may not have the same problems as a 48k crepe model
@analog obsidian stop typing an essay
lol i wasnt typing anything
i left my phone on with a single letter here lmao
i was about to say that mangio is less accurate than rmvpe xD
yeah right liar
woof
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 🆕 UVR5 UI, by Eddy & ArisDev Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
-local
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
Credits to Faze Masta and Antasma for compiling these links.
-audio
- Creating Datasets for RVC using iZotope RX11, by Cauthess
- Gathering and Isolating Audio, by SCRFilms :snowflake:
- Vocal Mixing Tutorial, by Roomie
Audio Separation/Isolation
guys. i tried to use applio to train but this pop up
and this is the cmd look like
😦 is there any guide to fix it ?
Make sure the datasets lacks pops in those cut segments
Pops sounds are like crackling noise
Like when ur lagging in a game
Usually loud
You can fix the crackling sound in rx10 using mouth de clicker
Its fine to have cut sentences in the dataset, rvc already does that when finds long sentences
whats bad is the crackling/pops u get when you cut the sentence manually
Why does voice models have crdf virus in them
If you don’t fix the pops/crackling then rvc is going to occasionally add the crackling noise to the model
Ayo? @brittle wing level 8 !!! 
Yea if the crackling only happened once it most probably ignored it
Have you tried uvr denoise and mel denoiser?
?
Also rmvpe handles crackling/clicks very good so most of the time the models sound fine even if had a bit of crackling in the dataset (as long the noise doesn’t appear often in the dataset)
They dont have virus
it should be easy for you to identify the crackling, it is a loud sound that can be heard without any problem, in your case, the crackling should sound at the moment where you cut the audio.
this is how crackling looks in a spectogram
rx allows u to fix the crackling using the de-crackle module or mouth de clicker
you can use spek
if you have crackle it should look like this
the cut audio also may have clicking noise that is hard to spot in the spectogram, but you can also hear it
no prob, in case you don't how to spot clicking noise, it sounds like this
fyi crackles like shown in above examples are usually caused by clipping, so you can try de-clip and de-crackle (either only the latter or both) in izotope RX
you can fix that in izotope rx with the module mouth de clicker or de-click
so first picture is the clicking, notice is a large yellow line, after mouth de clicking is applied the yellow line dissappears and you cannot longer hear the click
i got those by cutting the sentence in half, he said he did that so i recreated it and got that pop
probably his pops/clipping looks like that as well
basically the pop u get after cutting any audio
but yea de-crackle or de-clip fixes those
bro is writting an essay
to cut some parts, I'd usually do replace with generated silence instead in RX, and it actually does kind of crossfade ig so it won't cause popping
lol i do the same!
you were typing that for quite a time 
I can't be as fast as chatgpt 💀
i have minimum budget, can i run voice changer and live streaming in this spec?
not bad for being a gaming cafe rig, but I don't recommend prebuilts if you have upgrade plan and want per part warranty
Already requested and tried searching him, but still, there wasn't.
also do you ignore my 3060 or 4060 recommendation before?
i still askin to my mom 😭😭 because RTX consumes a lot of watts of electricity
is the electricity bill so high? 
at that point, you can instead opt for an RTX 4060 laptop, or an AMD/intel igpu laptop (raptor lake/newer) may also be fine running on cpu mode, plus decent streaming & vtubing capability, but not for gaming
either pay someone, wait still, or make it urself
if its no where in all the sites i sent then u have only that options
Slightly dumb question: is there any way i can get a voice model of the opposite gender to match my original octave? (for like singing and whatnot, cuz i kinda wanna do my own singing for funsies lol)
maybe i'm not describing it quite correctly
Ayo? @neon island level 1 !!! 
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
i just updated applio and it stopped working and throws this erorr ```
pydantic.errors.PydanticSchemaGenerationError: Unable to generate pydantic-core schema for <class 'starlette.requests.Request'>. Set arbitrary_types_allowed=True in the model_config to ignore this error or implement __get_pydantic_core_schema__ on your type to fully support it.
If you got this error by calling handler(<some type>) within __get_pydantic_core_schema__ then you likely need to call handler.generate_schema(<some type>) since we do not call __get_pydantic_core_schema__ on <some type> otherwise to avoid infinite recursion.
For further information visit https://errors.pydantic.dev/2.9/u/schema-for-unknown-type
hi
Personally i don't suggest using other pretrains.
Just use OG one instead.
Sadly i don't think there's a way to fix that.
You can also use RVC Mainline on kaggle.
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 🆕 UVR5 UI, by Eddy & ArisDev Kaggle
- 🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
How do I make AI Voice Model?
You can start by reading the docs.
What reading the document?
Depending if you wanna make a model of your own voice or a fictional character/singer/streamer
-rvc
Suggestions for @slim owl
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
The one that says "docs aihub"
can RVC AI Cover Maker work on paperspace?
I'm not sure.
I don't know
tranquilo
what happened to Ilaria RVC ?
ilaria rvc zero works fine
if u are talking about google colab its ded
which i mean the below one
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
please help
what happened to the google colab rvc
it keeps saying modulenotfounderror
How do you get a hugging face download link?
Best inference settings?
-guides
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
does the new applio version give you guys an error or
does anyone know how to fix this i pressed process data and it said this
2024-09-14 23:55:38 | INFO | main | "runtime\python.exe" infer/modules/train/preprocess.py ""C:\Users\raypl\Downloads\Perch Perkins\PerchPerkinsDATASET.flac"" 40000 16 "C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perch-perkins" False 3.0
['infer/modules/train/preprocess.py', 'C:\Users\raypl\Downloads\Perch', 'Perkins\PerchPerkinsDATASET.flac', '40000', '16', 'C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perch-perkins', 'False', '3.0']
Traceback (most recent call last):
File "C:\Users\raypl\Downloads\RVC1006Nvidia\infer\modules\train\preprocess.py", line 11, in <module>
sr = int(sys.argv[2])
ValueError: invalid literal for int() with base 10: 'Perkins\PerchPerkinsDATASET.flac'
2024-09-14 23:55:39 | INFO | main |
Ayo? @vast fractal level 7 !!! 
Perch Perkins -> PerchPerkins
it still says 2024-09-15 00:34:29 | INFO | main | "runtime\python.exe" infer/modules/train/preprocess.py ""C:\Users\raypl\Downloads\Perch Perkins\PerchPerkinsDATASET.flac"" 40000 16 "C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins" False 3.0
['infer/modules/train/preprocess.py', 'C:\Users\raypl\Downloads\Perch', 'Perkins\PerchPerkinsDATASET.flac', '40000', '16', 'C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins', 'False', '3.0']
Traceback (most recent call last):
File "C:\Users\raypl\Downloads\RVC1006Nvidia\infer\modules\train\preprocess.py", line 11, in <module>
sr = int(sys.argv[2])
ValueError: invalid literal for int() with base 10: 'Perkins\PerchPerkinsDATASET.flac'
2024-09-15 00:34:30 | INFO | main |
stop spamming people
\Downloads\Perch Perkins -> \Downloads\PerchPerkins
thanks, i have not trained a model in a while, and forgot that the application is very sensitive to spaces
now when i tried to feature extraction i got 2024-09-15 00:41:12 | INFO | main | "runtime\python.exe" infer/modules/train/extract_feature_print.py cuda:0 1 0 0 "C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins" v2
['infer/modules/train/extract_feature_print.py', 'cuda:0', '1', '0', '0', 'C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins', 'v2']
C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins
load model(s) from assets/hubert/hubert_base.pt
move model to cuda
no-feature-todo
2024-09-15 00:41:16 | INFO | main | ['infer/modules/train/extract/extract_f0_print.py', 'C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins', '16', 'rmvpe']
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
no-f0-todo
['infer/modules/train/extract_feature_print.py', 'cuda:0', '1', '0', '0', 'C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins', 'v2']
C:\Users\raypl\Downloads\RVC1006Nvidia/logs/perchperkins
load model(s) from assets/hubert/hubert_base.pt
move model to cuda
no-feature-todo
C:\Users\raypl\Downloads\PerchPerkins\PerchPerkinsDATASET.flac
-> C:\Users\raypl\Downloads\PerchPerkins
#📑│making-models message
thanks I figured it out
@glad zealot we are doomed
because pydantic v2 installed instead of v1, applio no longer works
can someone tell me how to install v1 instead
How comes i use it?
@knotty moth hey vro
@tame mica hello cutie

-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
what was the fix?
How do I make an rvc voice model
whats ur pc gpu?
which link
right click the download icon next to the .zip and copy adress link
Hello everyone, I have a problem. I followed the tutorial and launched the 'go-web' folder, but after that, it opens a console and then nothing happens. It asks me to press a key to continue, but nothing happens except that the console disappears. Could you please help me?
Could be anything , so check these things:
- make sure the rvc folder is not inside OneDrive
- no special characters in the folder path leading up to rvc folder, this includes things like ç ş ı and so on
- no spacebar in the folder path leading up to rvc folder
- antivirus maybe deleted files
I don’t remember much about pcs but RTX 3070 idk if that’s the right thing
I know everything other than cpu and gpu 😭 I built my pc like 3 years ago
If u wanna be sure, You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
it tells me how to turn on running with the system python no MoPress any key to continue on a black screen, what should I do?
i keep getting value nan on every epoch
i got a 1650 with up to date drivers
is there any solve for this?
get more vram gpu
don't train on a gtx
its better u do it on cloud
n how can i do that
As you dont got a good PC, its better you use cloud for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)
I'd personally suggest kaggle
8+ GB RTX gpu recommended, and you shouldn't use mangio that may cause that issue
thank you
yw
HELP
Ayo? @tired cloak level 1 !!! 
@native axleif you send dataset i can train for you
what should i use as a replacement for mangio
Ayo? @native axle level 1 !!! 
thank you for the offer but i lowkey wanna bust my brain until i figure it out
so i can train more models in the future
i can train on an rtx
yes
im just temporarily on a 1650
then switch
But would use only when u got ur rtx
until then u need to use cloud for training
thank you
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
where did u find that?
its an old deleted google colab
what are u looking for?
Umm.. That notebook seems to be deleted.
What you wanted to do?
Train models or use models?
geforce rtx 3070
dedicated 8gb
memory 24gb
shared memory 16gb
Ayo? @spice niche level 1 !!! 
Yea you can do it locally
and can i make voice models with this and is there any tutorials?
yes, the written tutorials are the links i sent
okay thank you what do i put for Enter the GPU index(es) separated by '-', e.g., 0-1-2 to use GPU 0, 1, and 2: do you know?
u should put the number of the tab in task manager that is rtx 3070, like GPU 0 could be the rtx 3070
I used it for more than a year (best so far, personally) and it disappeared few weeks ago. I was asking if the link changed or the author removed it. Would be awesome if I found it exactly, or very similar one. Thanks in advance
yeah this is the part im stuck on
That one is old and deleted, do you want to train (make) or inference (use models) ?
i think 0 should be the right one
so do i put just "0" or "GPU 0"
Yes, a colab link please?
im not really the best for local as i do it on cloud but u should just leave it to 0 yea
i need to know the answer first
what do u want to do
train, infer yk create models and and use them to produce AI voices
so u mean both of those things?
As you dont got a good PC, its better you use cloud for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)
I gave you all options, but for Training i would personally suggest any Kaggles, while for inference i would suggest Ilaria RVC Zero
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...
Ofc u can inference also on other things like on some google colabs like applio, but ilaria rvc zero has better gpu for free (only for inference)
Appreciate it
Best settings for inferencing?
Rmvpe on pitch selection.
I'm trying to download this model from Jammable and upload it to Weights, but I can't find a way to do that. https://www.jammable.com/custom-dulcinea-v2
don't use jammable, its paid rvc
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
i think they just scrape models so u should be able to find it in those sites too
dont bother on it if you only have the 3070
why do you want the former?
32k
Question do you guys know any group that has a song requester
Or cam you do it on this one?
@woeful shore
no it's diff-svc

It works correctly thank you
Is there a YouTube tutorial for Mainline (UI)? I can't seem to figure it out all by myself
nope, not even the 1+ year old videos
nope
all youtube vids are outdated
therews the written guide in the link i sent u tho
Well, I sent a close technerd friend close of mine the link, he'll upload a YouTube tutorial soon
hey folks I dont really want this message to get buried but are there any good tts voice cloners other than applio playground? for free at least.
either using rvc or a audio file upload
if you want some zero shot voice cloning: https://github.com/fishaudio/fish-speech
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: A easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
oh thats good
will try them out
Ayo? @storm hatch level 2 !!! 
yea fish speech is goood for zero shot (no actual training, just using short clip, so good quality for being fast), while gpt so vits is good for emotions and quality which is few shows (requires few training, so takes more time and resources)
what model were you using? I still hear the instrument residuals from the beginning
how about try either of these in mvsep.com?
I'm using MDX23C-InstVoc HQ in UVR
Please don't post copyrighted acapellas on the server.
Oh sorry
I will delete it
So, is there any ways to fix that part?
Use BS Roformer.
I did and well it still have that vocal for some reason
Any different models to separate the vocals and the song?
Nope.
Also, you must remove the backing vocals, reverb and echo.
Using which models may I ask?
UVR-BVE, Mel-Band Karaoke, Reverb Removal (MDX23C), and UVR-DeEcho
Using it in that order right?
First use either Mel Karaoke or UVR BVE, then use Reverb Removal and then DeEcho
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
k
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
tips on recording own vocals for training?
and later processing like noise reduction, what models generally work best
How do i fix my the crazy echo and the ai sounding super weird like no matter the model
I am ok w hopping on vc and sharing screen if it makes it faster
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
what is RVC tts?
but that's not RVC
I'd recommend an easy way to make a zero-shot TTS: https://fish.audio/
Powerful, fast, and customizable text-to-speech solution. Ultra-low latency, rapid voice cloning, and flat-rate pricing for AI voice infrastructure.
-applio
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
Credits to Faze Masta and Antasma for compiling these links.
-help
how to add models tho
I uploaded audio and skipped samples. no audio on prompt
nvm i reloaded site
hello!
Ayo? @stone urchin level 2 !!! 
someone know a good echo remover ? i try some but they destroy the vocals
what am i supposed to do if i don't have nvidia gpu

if u got amd theres a way, else use cloud
idk which u are asking for tho
don't have it 
what do you mean by cloud though?
it runs the program on remote on a good pc
what are u looking for?
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: A easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
to create a model and an ai cover in like, any way possible
because my attempts failed
Ayo? @wanton jacinth level 1 !!! 
As you dont got a good PC, its better you use cloud for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)
i personally suggest kaggles for training (making rvc model)
while for making an ai cover, aka inference, id suggest either using applio colab or even better ilaria rvc zero which is fasterr
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
thanks i'll give them a try 
yw
I'm a complete beginner for this. I was wanting to make an AI Kanye song from a reference track. I have a Mac and can't seem to have any luck finding a good install guide
Any help woould be massively appreciated
what rvc choose to make an ai covers
need a virtual cable, check https://rentry.co/VoiceChangerGuide
Github - Blanc-dot
Discord User - https://discord.com/users/824922747423031359
Special thanks to the following people : lusbert, poopmaster, felt, fazemasta, antasma, shadictl, x_hina, sushi
thanks are for anything added to guide, taken from any talks, settings added when previously collecting st...
check https://docs.aihub.wtf/
has basically everything you need to know
and btw, on Mac, you won't be able to train but you can use cloud stuff
Last update: Mar 10, 2024
honestly the best way to go is to not use local on Mac imo
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...
nice! if u have any more issues lmk
I was on here but couldn't find anything on how to install a program to do it locally. Is training doing this locally?
I just wanted something I could run a vocal through and see if I can change it to how I want it to sound without running into limits
The RVC > Local tab list the local ones.
I think you're gonna convert vocals so that should be fine on Mac, but it might involve some extra installation steps and all that.
Which is why the cloud is probably better
I tried installing RVC and just was running into errors. I'll look more at cloud options
This might be a a really dumb question, but I’m just looking to change my voice for a goofy script read for like 12 people - does my voice have to be singing for the AI to change it?
Nope
Nope not for mac
theres no local mac guide
on mac u can only inference (use models) locally, not train (make models) btw
if u really want it locally, u can read https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md
or if u wanna do it cloud
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
ilaria rvc zero is the fastest for cloud inference
what is this pre trained?
uka???
its for
voices with an english/british accent
@cold raptor made it ithink
oh wait no
Never heard of that one
hello im new to rvc and i want to be able to convert both main vocals and adlibs/background vocals of a rnb/hip hop song, what is the best way to do it? is there software that can do it? or is there a good separator that can separate all vocals etc..
I have setup all of this and using the index file and pth. everything shows that its working but no audio is coming through my VB-Audio Cable
I cant send images but it all shows its working on Mangio-RVC Realtime-GUI
i dunno how to really fix that but i just wanna point out that mangio is an old fork, and for better performance u should use the wokada fork btw
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Will give it a shot ty
yw
What on earth does this error mean: Unable to allocate 39.9 GiB for an array with shape (1920, 2786138) and data type float64
All i was trying to do was preprocess a dataset. I am assuming that error means that the dataset is to large but idk ive never ran into this before.
this is my dataset before processing and UVR:
Sample Rate: 22050
Duration: 10.28 minutes
Number of Samples: 13603329
Bits per Sample: 48000
Channels: Mono (1)
and this is after:
Sample Rate: 22050
Duration: 10.19 minutes
Number of Samples: 13486493
Bits per Sample: 44100
Channels: Mono (1)
I trained on 48k with the latter and the model ended up with tons of artifacts, any suggestions?
Ayo? @forest pewter level 1 !!! 
Train at 40k
Ight ty, but why is the sample rate so low? I recorded on audacity
my mic is the MPM4000U so it's not bad at all
48k on RVC tends to lead to artifacts.
And prolly because your dataset wasn't really 48k
Check your frequency cutoff in melspec and see if it reaches above 24k
Rvc downsamples to 16k when processing and then upsamples to your model sr.
Por favor no postees fragmentos de audio copyrighteado.
how do i make a voice model for the AI with my own voice?
pero era bueno o necesitaba mas limpieza
You can start by reading the docs.
Also, you can record yourself talking for 20-25 mins
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
where do i find them?
There
Está bien, creo.
okay thank you
...
You're welcome.
Yep, that's what you must do if you wanna make an AI model of yourself
ya po
Ilaria RVC: CLICK HERE 🤗
Guide on how to use it: CLICK HERE 📝
Don't forget to thank Ilaria if you find it useful! 💖
Suggestion for @limpid knot
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
Facefusion omg whaaaat
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Ayo? @hollow sable level 1 !!! 
wher do i find the models?
n when i download models why do they not have crep files or anything i can put into rvc
Ayo? @latent geode level 1 !!! 
Uhmm how to fix this error
"'NoneType' object has no attribute 'setdefault'"??
!howtoask
How To Troubleshoot 
- Don't simply mention your issue, like "
my rvc is not working". - Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
- The more context, the better.
- Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
- It's okay if you're frustrated, but don't take it into this server.
- Don't DM without prior consent.
- Don't ask for every little instruction. Put your own effort & test things by yourself.
- Don't ask to ask.
- Check if your answer is a Google search away/on our guides website.
its a 0 shot deepfake tool
You can search rvc ai voice model at:
- #1175430844685484042
- #🔍│find-models
- https://weights.gg/ (login required)
- https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
- https://applio.org/models
- https://voice-models.com/
- https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)
if there isnt one, you can:
- #1159289738314919936
- #1191429836321849435
- make it yourself with our docs guides https://docs.aihub.wtf/essentials/how-to-make-voice-models/
Crep files? To use them u need only .pth and added.index
be sure u arent using a yt tut
most ones are old
How do I create an RVC model on my smartphone?
How do I create an RVC model on my smartphone?
Last update: Mar 8, 2024
you cant do it locally on ur phone as its not as powerful
U have to do it on cloud (remote good pc)
As you dont got a good PC, its better you use cloud for training an RVC Voice Model:
- Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
- Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI)
- Applio by Shirou (UI, no guide as of right now)
- Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Mainline (UI, No guide as of right now)
- Applio (UI, No guide as of right now)
Personally id suggst kaggle but the interface is kinda hard on mobile but u can do it
wait i have a question when i put in my model of vice it echos how do i change that?
Trying to use so-vits-fork for voice conversion, however when i try to input a file for config path (below model path) it is asking for a JSON file, however all the models provide an index file as opposed to a json file. Is there a step im missing?
lower the volume of the monitor or mute the monitor or set monitor device to none.
ofcourse you can also stop the voicechanger , but i think this is not really what you are trying 😉
i dont think theres anyone that still uses that, rvc is better than sovits svc
makes sense, thanks
if u want i could help you into using rvc
i will check it out when i get a chance but if i cant figure it out i will ping you, thank you!
Ayo? @tulip root level 1 !!! 
I can already link you to the best versions as u may run into outdated forks on yt, you got a good pc right?
yeah if you have links to a good download that would be extremely helpful, got a good pc
for inference (using model), even a gtx 1650 would be enough, while for training (making model), its recommended atleast n rtx 2060
thank you!
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
Yes you can
Nope!
You can also use weights.gg app
Are you unable to send images here ?
No I am unable to
Get level 5 then
Cant I just dm you my problem?
Yes you can
dms
Ayo? @celest owl level 1 !!! 
i too have a problem sadly, using the Ilaria RVC
I'm just trying out to make an AI cover of a song with a space marine voice and after clicking on convert it gives me the usual waiting on gpu and gpu available basically
then it runs for about 10 seconds and then it just says Error, no code no nothing just error
i have yet to find a guide to locally do ai covers as both guides are for online / cloud and neither have worked so far
-gui
stop using that old garbage from such 1+ year old video
ok what do I use now then?
-applio
Suggestions for @celest owl
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
Credits to Faze Masta and Antasma for compiling these links.
oooh this seems promising- which of these would ya'll recommend for song ai covers?
replay if u just wanna make covers is more than enough + easy to understand
can i insert ai models from #1175430844685484042 in there too? just can't see that in the little preview there
Ayo? @warm swan level 1 !!! 
ye
neat thanks
Hello, where can I find the app for make my voice change ?
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
I suggest you to read the Deiteris' one
thanks @odd shale
Ayo? @tawny harbor level 1 !!! 
can you share me the link to download the app, i dont find it please
The link to download it is on the guide.
is RVC TTS ?
Nope.
But what do you want to download, RVC or W-Okada?
RVC is for voice conversion with audios, not realtime.
W-Okada is for realtime
for a homework to add fun
?
I want to download voice ai (rvc) for a homework
Alright.
just check the docs.
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
thanks
I don't understand in uvr where i put the voice for the result
quick question with Pitch detect method do you guys think is best?
(feel free to ping on reply)
rmvpe or crepe if you have a clean dataset
quick extra throw in, Stem method? what is recommendet-
hello i just installed applio and watched a tutorial. im having issues with the voice models not showing in the drop down menu for voice model. from what i understand there should be at least some already included with applio. what am i missing? I appreciate the help.
Hi my volume and res stay at 0 no matter what, what can i do?
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 🆕 UVR5 UI, by Eddy & ArisDev Kaggle
- 🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
I would recommend you this one.
This guide for Mainline Kaggle is an alternative option to the Mainline Colab notebook for training voice models
It is complete and should walk you through every step of the way since Kaggle has a difficult learning curve. However, it will be updated constantly to go over parts that need more cla...
⠀
Local Forks 🖥️
⠀
Mainline RVC
Original project, suggested for advanced users,
by the RVC-Project team.
Applio
Simplified, suggested for all, by the Applio team.
RVC Studio
Simplified, suggested for all, by SayanoAI.
Mangio-RVC
Simplified, may not be supported anymore, by Mangio621.
AICoverGen
Simple yet great way to make covers, by SociallyIneptWeeb.
Replay
From the greators of weights.gg, excellent product for everyone.
⠀
hiiii ai hub, so this might not be the best channel to ask this, but i've seen a lot of models based on the drums of songs, that i assume u can run another drum track through and then it sounds the drums of the trained song. has anybody done this with guitar tones? there's a guitarist i like whose tone i REALLY like and idk i'm just theorizing what would happen if i trained a model of a bunch of his guitar playing
can someone send a tts rvc
hey, i reached e300 with training then stoped to save it on drive then i started again and it started from epoch 1? how can i solve this, what happens now?
did you save D and G files?
yes
yes
Uhmm i use the rvc aicover maker and it say it been limit in google collab and if i want to continue i have to pay and this program say can upgrade by gladio deploy i dont know how to do gladio deploy can some one show me the tutorial?? Sorry for bad eng
don't forget to load the preprocessed and the training progress to resume
should I do something about no graphs showing on the TensorBoard part?
⠀
Google Colabs 
⠀
AICoverGen-WebUI
Useful for making quick covers, by Hina.
AICoverGen-NoWebUI
Useful for making covers, doesn't inclued a UI, by Ardha, by Eddy, Hina and Gdr.
RVC Disconnected
To train new voice models, by Kit Lemonfoot.
EasyGUI
The OG interface, by Rejects.
⠀
is ilaria mainline on colab busted? hasnt been working for awhile and i ran out of gpu time on huggingface
Yep ilaria on colab doesn't work #📰│dev-updates message
Either wait for zerogou time or use the applio colab
Gotcha thanks!
Don't worry about deploying gradito u don't have to, but do you get an no GPU warning?
Yw
Yeah
Ayo? @brazen spear level 1 !!! 
Google colab has a 4 hours free daily GPU limit
I get the gpu warning
Either use an alt google acc
Or use -kaggle that gives 30 hours free weekly but it's harder
-kaggle
- Applio Notebook, by Vidal Kaggle
- Applio Notebook, by Shirou Kaggle
- Music Source Separation, by Shirou Kaggle
- UVR5 NO UI, by Eddy Kaggle
- RVC Mainline, by Hina Kaggle
- Original W-Okada's Voice Changer, Kaggle
- Modified W-Okada's Voice Changer, Kaggle
- 🆕 UVR5 UI, by Eddy & ArisDev Kaggle
- 🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
- 📖 How to use RVC Mainline Kaggle by Cauthess
Note: Kaggle limits GPU usage to 30 hours per week.
Ok but daily is 1 perday right and it refill??
Yes, 4 hours today, then it refills other 4 hours tmr
I'd personally suggest -kaggle if u want more time as they give 30 hours weekly for free of better GPUs, but it's harder and requires phone number so he choice
Thx my man
Yw
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: A easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
Hello, are there any guides to separate 2 voices? I have a song here it's a duet and i want to separate the female and male vocal. Thank you
does anyone know how to remove echos and reverb in the vocals, driving me crazy lol
I want to add more voicelines to a existing rvc model, but don't know how. Also I'm using this rvc model maker.
- recommended to train it as a whole new model using the original pretrain or the one in #1235952130855010365
- do the same way as above but try using the existing model's G and D files as a "pretrain" (experimental, im not sure if it would be better or worse)
So do I make a model with the voicelines I newly made, and then train the newly made model with the older model?
Is there a ranking of gpus with their average it/s
the new voicelines with the same quality or even worse than before?
Same quality.
from different source? that would prob be less consistent, or how about train the whole "new voicelines" and compare it to the old?
Balls
who can send me a good model of a female voice? Please
Ayo? @summer dirge level 1 !!! 
Seek help
hello is there a way to instead of having to add an audio file for the pth file to make it say what the voice says is there a section on mangio-rvc-fork where i can just use the .pth's voice to say something by itself without it like combining with mine sort of like eleven labs where it just uses the voice and you type in what it says.
so basically just like can i use the pth file by itself without having to add my voice and where can i do it pls hlep
Ok
How much index is good 0.7 or 0.8?
Ayo? @brittle wing level 5 !!! 
what are u using?
phone and colab research from source youtube
i think what u mean is text to speech, also don't use mangio rvc is outdated
There are different Text To Speech (TTS) AIs:
GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.aihub.wtf/tts/gpt-sovits/
Freemium 11labs: A easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS
FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site
With RVC Models:
RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)
If you wanna do tts locally with RVC Voice Models (if you got a good pc):
If you don't got a good pc you can do tts with RVC Voice Models on cloud:
-
Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
-
Use Applio UI Colab (with google colab T4 free daily limit gpu)
-
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc
And how to fix that ?
dont follow yt tuts
they are 99% outdated
what are u tryna do?
Where is the new one
inference (use models) or train (make models)?
Im trying to my fav youtuber voice
also what vid are u looking at cus im pretty sure its old
But keep pyngrok not found
are you trying to train or use an already made model?
Ayo? @lone iron level 1 !!! 
Already model from google
are you looking for realtime voice changer for calls?
or using the model for pre recorded audios, like ai covers
Yea for game tho
yes yes
u cant use it in realtime for calls on phone as there is no virtual audio cable
you can use it for pre recorded audios tho
Ik i can use 2 phone tho
But how to fix pyngrok not found
Doesn't matter, u need a pc to do realtime
Aight bet
As you can't locally do it on ur phone, i can suggest you Ilaria RVC Zero which is a cloud (running it on a remote good pc) option to do ai covers
Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Troubleshooting “No gpu is available for you for 60s” Introduction (with website link) Ilaria RVC Zero, is an RVC (Re...
Ayo? @low shard level 104 !!! 
i dunno what u are using but i gave u the best option to make ai covers online
Did i can send link here or nah?
I use this link from youtube tutorial but they're working and im not working why i keep pyngrok not found?
yea u can send links and screenshots here
could u send directly the link
ay sure
yea don't use that, its not true that u can use realtime on android as u need always a pc for a virtual audio cable
also it looks like its a copy of ur colab @glad zealot
meow?
could u send the video where u found this
seems like someone copied ur colab and said its for android on yt lol
wait
XD
AI HUB Docs

