#✨│ai-help
1 messages · Page 147 of 1
Hi guys, I'm trying to integrate an RVC to a python program that I'm making but when running it shows me an error below. Im at loss with this 😭 . (Not quiet sure if this is appropriate. please do let me know and I will delete it right away)
OSError: tesune0316/MikoRVC does not appear to have a file named config.json. Checkout 'https://huggingface.co/tesune0316/MikoRVC/tree/main' for available files.
couldn't extract pitch and accent. it's probably because name of model has spaces
oh ok
huh....
it shouldn't even ask for a config.json file

oh...
I'm using transformers for this one not sure if its the correct way
AutoModelForSeq2SeqLM
AutoTokenizer
I'm going to learn Japanese and make an AI cover. Do you have any g/d.pth models you recommend?
G/D are for training purposes only, so uh... we can't really recommend anything
you mean this
wait wait wait you can't even train
It sounds a bit weird.
The ultimate purpose is AI cover, but I'm trainnimg through Japanese character model voice right now, so I was going to ask if there is a model you recommend here
NOOOOOOOOOOOOOOOO
if you're training, check the weights folder. can't recommend any because i don't really know how your training is going
yeah says you don't have a compatible gpu
you could do it via the cloud tho
Last update: Mar 8, 2024
ok
whats the issue i didnt understand
is there a guide anywhere to make my own models
is it as simple as just adding a bunch of audio files of the voice u want talking
Last update: Mar 10, 2024
has everything related to model making n stuff
yw :)
like 10+ mins is recommended to get a good model
oh i thought id need longer lol
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Ayo? @gaunt citrus level 2 !!! 
Can anyone tey make a model of the female voice from this guy on youtube @Rabo9D
whats the difference between doing it on the cloud vs locally
locally will use your gpu n stuff, while via the cloud will just do it online. cloud services have limitations though, while locally it's just limited to ur own hardware and stuff
im guessing locally is better then right
yes
if possible, do it locally
any help with this?
yep
okay lemme try it again
imma nuke everything n restart from there bec i kinda noticed how very very senstivite it is in the colab
THERE WE GO
Ayo? @halcyon cliff level 2 !!! 
ty sm mj <3
daang
also which graph do i need to pay attaention to the most again? was it norm_g?
check g/mel, g/kl and g/total
grads are kinda important, but it's not really that essential compared to the others
lmk if you have more questions
okay!!! tysm
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Does rvc use a lot of gpu memory clock?
Feel like doing a mod on my gtx 1660 supper vram cooling solution to boost clock speeds with copper thermal pads sitting on my shelf
Can somebody help me install the rvc mainline repo for amd gpus, I'm running the script from the downloads from the guide but it isn't working
showing these messages
Did you install Python and Pytorch correctly
Is there a vid I can watch to make sure I install it correctly?
I don't think I installed pytorch
You can skip the CUDA part since you're installing the AMD version
Which part is the cuda part
Should I select ROCm?
or cpu
Still have this error after installing pi torch
you're not in linux
the Nvidia version with runtime runs fine on cpu mode
i might need some help i was making an ai model and the train feature index button wasnt working and there was no index file for the model i created so i thought i could mabey just publish it without one, turns out you cant and i already closed the window is there any way to get the index file for the model i created? it still works for some reason and im really confused
Wait but I'm tryna do it for a amd gpu
Because I heard the rvc repo works better for amd gpus than wokada
yo dose colab still work
HELP- I trained a model with ov2 with 200 epochs and approximately 40 minutes of dataset had a very, very bad result, literally the voice was completely robotic and it was barely possible to clearly identify a voice. the dataset was very clean and good. Does anyone know why? I also trained without pretrain and it gave the same result, completely robotic. I have a 3060
dose colab work anymore
i keep seeing ai videos on ticktock i need too know how to do em
so did you download rvc to use it in real time voice changer?
yes
ov2 is not good for long and mediocre dataset, or you can try TITAN
I was trying to follow this https://rentry.org/VoiceChangerGuide
for realtime, instead of original one try the RVC realtime repo in the guide in #🔍│help-w-okada
out of my knowledge, @pastel oak might help you
I'm having trouble installing
Ayo? @tranquil raven level 3 !!! 
I tried too without any pretrained model but i have the same result, a model that was just some strange noise.
with default pretrain or none at all? the latter will be always bad
None, and probly would give me same result with default pretrain
Ayo? @worn trellis level 1 !!! 
go to https://rentry.co/RVCRealtimeGuide and download the RVCAMD_IntelLatest
nah, will be worst
How do i fix that?
I bet you havent tried not to use pretrain at all, aka from scratch? 
Ye, but it gave me similar results, like a strange noisy voice
some random noise in silences?
No, like even the voice part when someone is talking gets like super duper robotic and noisy, making me barely able to understand when someone is talking and when no one is talking.
Okay I have it
I'm getting no audio output
im using rvc while uploading my clips in training, and i'm getting "NotADirectoryError: [WinError 267] The directory name is invalid:'C:\Users\Admin\Downloads\clips\glebclipsv1.wav' while uploading my clips, i tried changing the audio files to flac and getting rid of spaces but it didn't work at all
like when makeing the back voacls in synthv then cover them to rvc was the steps?
Like is it 1
Get orignal back
2 put them in synthv
3 try best to copy back vocals form orignal
4 export it then covert to rvc?
Been told that my voice sounds quite electric, what to do about that?
pls anyone help me, what's the best way to make RMVPE model?
I tried everything, but it's always a failed model...
Hey, does anyone know how to fix my RVC from crashing? Everytime I try to start conversion, it stops responding. Any way to fix it? Also, if you know how, please ping me. I'm going to bed shortly, so I need a ping to see this in the morning tmrw.
Ayo? @white bone level 2 !!! 
idk sir sorry.
how do i get a character golden cheese cookie from cookie run kingdom voice i need it for a song im doing
make sure both the input and output use the same driver (aka both selected input and output have mme at the end)
Ayo? @lapis carbon level 1 !!! 
use mel-scale spectrogram as guide to copy/adjust the notes
Hey, I’ve been trying to download the weeknd voice model but none of them work
I get an error everytime
JSONDecodeError
posting this again because no one helped me, im using rvc while uploading my clips in training, and i'm getting "NotADirectoryError: [WinError 267] The directory name is invalid:'C:\Users\Admin\Downloads\clips\glebclipsv1.wav' while uploading my clips, i tried changing the audio files to flac and getting rid of spaces but it didn't work at all
it seems to be looking for a directory
agreed
Ayo? @neat matrix level 1 !!! 
try just C:\Users\Admin\Downloads\clips dont put the audio file to the path
alright
that worked, thanks a lot
guess when i put it in the path name it considered the clips as another folder
Do you know how to fix the JSONDecodeError on the download model step
Ayo? @trail snow level 1 !!! 
ModuleNotFoundError Traceback (most recent call last)
<ipython-input-1-d1577709b10a> in <cell line: 4>()
2 #@markdown Link the URL path to the model (Mega, Drive, etc.) and start the code
3
----> 4 from mega import Mega
5 import os
6 import shutil
ModuleNotFoundError: No module named 'mega'
I keep getting this
are you running locally?
or colab
Colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
What do I do? This is my first time doing all this
what notebook are you using? the link
hey guys, I'm new here, just a short question, does anyone know if I can just find chatgpt's voice model somewhere? I was trying to make it sing a song if that make sense. thank you.
how do i make an ai cover i have an aduio sample but i dont where or how to use it i do have a song i really want to use but im generally confused on what im doing this is my frist time doing this and i really really want to do this
How did this work out for you? I've got an AMD GPU and I'm trying to figure this all out
haven't seen it other than the Siri-like GPT4o demonstration
thanks, I guess I'll just try to train it on my own then.
you could instead integrate/pipeline the chatgpt, TTS and RVC process
hi, i just joined. could anyone be so kind as to explain how i could make chris mclean from total drama sing without me
Seems to work pretty well so far
I'm testing with a 6900xt tho so have no idea with lower end cards
as a complete newcomer how would I acquire a chris mclean voice model
Ok nice thanks
Ayo? @feral crescent level 1 !!! 
same issue on local install.
Index search FAILED or disabledSpent time: fea = 0.048s, index = 0.001s, f0 = 0.024s, model = 0.043ssola_offset = 322Infer time: 0.13Index search FAILED or disabledSpent time: fea = 0.050s, index = 0.000s, f0 = 0.027s, model = 0.044ssola_offset = 322Infer time: 0.13
and on and on
how do i use amd gpu fr
my nvidia has less vram and my amd has more
and stronger
so is there a way to use my amd instead of nvidia
for training or voice changer? 8 gb vram should be enough for that
my nvidia gpu got 6
also is there a way to use amd instead of nvidia
since the newest drivers added ai support
it is still feasible as bare minimum
yeah but i dont really wanna use ddu and then download the drivers again just to use it
so is there a way to just use amd instead nvidia?
why not dual gpu for offloading other workloads? (need a pcie riser depending on ur mobo)
i dont have a spare pcie x1 and pcie x16
it needs ROCm on linux, yet directml has shitty optimization and support
damn
Ayo? @brittle wing level 6 !!! 
so i have to wait another driver for it?
btw just a question does adding more vram to my nvidia gpu makes it perform better?
like upgrading my 1660 ti vram from 6gb to 12gb
since there was a empty vram slot
if u have enough budget for 3060 why not
idk if this is the right place to ask or not, but in the client, why does enabling the index to any level use so much CPU performance?
is this the channel where i ask for help for making models? i am trying to make a voice model with RVC Disconnected v2 but when i click training i just get this error: "FileNotFoundError: [Errno 2] No such file or directory: '/content/drive/MyDrive/rvcDisconnected/AM/%s/AM/filelist.txt'"
dont use spaces or strange unicode characters in the model name
hey, anyone would kindly show me how to upload my RVC model to huggingface by any chance?
Ayo? @grave plover level 1 !!! 
Is there a max file size for audio uploads for conversions? I'm using Ilaria-RVC and a longer clip about 100MB looks like it's frozen
-local
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.
can i pause training of an rvc-model
or stop and lateron continue ?
if it is possible; what do i have to do inside rvc-web-ui?
i hoped my pc be done by now - but it fell asleep during my sleep (powersaving mode) ^^;
also it spit out alot of D and G numberred .pth-files ... is that normal ?
its my first time i attempting to create a model
what do you need help with?
make sure to follow the rvc guide
https://rentry.co/RVCRealtimeGuide
When I use a huggingface space or google colab (that I duplicated on my profile) is the audio I upload to be changed saved on their servers? Or is it deleted after I convert it
ask
you use the g and d paths to continue training later
you should have normal pth files aswell which should be in the folder assets -> weights
you dont link the filelist.txt you only link the /AM/ folder. so remove the filelist.txt
im not quite sure why you are using a txt file to create a voice model anyway?
wokada is not the greatest with optimization
That's not wokada problem. Index just works on CPU and is quite demanding
pushes a 5950x 16-core to 100%, lol
Ayo? @pulsar anchor level 5 !!! 
https://app.kits.ai/convert/private/bf-v2-1 Is anyone have a model from this ai voice?
Example: https://youtu.be/gc95h153m64
@pastel oak Sorry for the ping but can I know which version of Python and Pytorch are you using for the rvc voicr changer
Just a curious question
Why do I use and not heard and vol 0
Ayo? @broken dawn level 1 !!! 
When I try to use it (press "start audio conversion") the software just crashes
here are what both interfaces look like:
Ayo? @upper trail level 2 !!! 
I dont know off the top of my head but think 3.10.8 or 3.10.6
your input and output are different types, i just suggest you download the latest version
https://huggingface.co/Shadicti/RVCLatest/blob/main/RVCNvidiaLatest.zip
and the guide
https://rentry.co/RVCRealtimeGuide
if you dont want to download the latest version, simply make sure output is (MME) at the end too
Ok its fixed now but I have realised it makes a very very minor impact on my voice compared to examples that I have hear in yt videos
I'm not sure if there is something I'm not doing or if my expectations were just too high
do -60 response threshold
what gpu do you have
and wdym "minor impact on my voice"
no impact
1660 super
like if im using a female one it will just make me sound like a squeaker
basically the only thing that changes is the pitch
then lower pitch until you find the sweetspot
If I lower the pitch it just sounds more robotic
use a streamer voice, or mommy stuff etc
that work well for you
https://huggingface.co/Blyuv/Mommy/resolve/main/MOMMY.zip?download=true
i think this was decent
alr tysm
tried that one
Aye same gpu
These are my current rvc settings for my gtx 1660 super
why is your input line 1 lol
thats odd
i generally recommend enabling output noise red and the vocoder thing on the right of it (at least i do, idk if you had any other experiences)
what does extra interence time do
and extra you can keep at 5.00 but i have recently switched down to around ~2.00 because the cutting off in the beginning is a bit annoying
quality of voice model
alr
you can comfortably use 5.00, if you notice first syllables get cut off more often, reduce it
ill set mine to 5 and see if that helps
thats not gonna fix much of your issue i think its just model related
but yah
i just downloaded rvc how to open ?
it can still work ig
yall how do i open rvc
this one sounds better but i think a lot of people would still take it as a 12 year old kid
but thats just with me talking normally without any voice imitation
up to you to find a voice you like or create your own
i cant put url and download on rvc how to fix
what
https://www.kaggle.com/code/hinabl/public-w-okada-voice-changer
need help it didnt work
always it says
WARNING: Error parsing requirements for aiohttp: [Errno 2] No such file or directory: '/opt/conda/lib/python3.10/site-packages/aiohttp-3.9.1.dist-info/METADATA'
for first cell
ask in #🔍│help-w-okada ig
hey is it normal for RVC GUI to take up CPU power rather than GPU? AMD btw
Can anyone help me understand when someone said this to me? use mel-scale spectrogram as guide to copy/adjust the notes
@pastel oak what can I do to make it use GPU more? RX6600M
Ayo? @brittle wing level 1 !!! 
I need help with that thing
like, sure it works great but I wanna do stuff with it on yk?
are you running go realtime dml
upload the zip into your google drive like it says and copy the URL of the folder
The name of the zip file?
im not sure about the behavior of RX-M gpus im not sure how to help
I set it to 20+5.00
no
you put in whatever your folder name is
so like /content/drive/rvcdisconnected or whatever it ks
idk anything about rvcdisconnected
it works fine, it just takes up CPU power
when I wanna use GPU
ryzen 7 5800H btw
50% just having it on
youll need to ask someone who uses rx-m gpu
maybe @wispy lodge is more knowledgable
do you know what command to run for RVC realtime to detect rx6600m
It should work with dml version, but it still may use some cpu
For me it uses around 30% with ryzen 7 5800h
yeah but having it on takes up 40-50%
I did that before the error
then idk sorry
Well, there isn't much you can do, that's how rvc realtime works. I used sample length 0.25 and 5.00 extra

I resolved this issue in my forked wokada version, but not in rvc
you need to start the right version of the 2.
and also installed the right version
to be more concrete
yeah that's what I did
on installing you have to include the requirements-dlm for amd
and start also the the dml variants
I Downloaded the AMDversion and started go-realtime-gui-dml
as far i know - cuz i get it from github there is 1 'master' version , on installing it you had to install like pip install requirements-dml.txt
there is no 'right' version to download the way i did it
github?
I got it from the rentry
Ayo? @brittle wing level 2 !!! 
ic ic - well most help here is provided you go the way we also provide the software , you can get it from diffrent forks or alternative sites we have no clue how all those variants are working
let alone fix the problem then
get this version please ^^; so there are more ppl in this channel to help you out if you got problems with 'this' particular version
that works
I'm afraid this won't help just because that's the issue with the code and dml version. Feel free to try though
yeh well if it doesnt work on this software at least we have ruled out the code by then.
sometimes totally approaching the problem from a diffrent direction gives you more insights ;3
Voices lag asf
Japanese text @_@
try out emojikages wokada fork instead then
lemme look for link rq
https://github.com/deiteris/voice-changer/releases/tag/b1877
windows dml one
Why does my storage tank whenever i use Applio
I converted once and it went down an entire gig
which one of these is the best to use
Ayo? @brittle wing level 2 !!! 
W-Okada sucks for me
cuz AMD
as i said.. emojikages fork is different and runs better
he even said he fixed the thing you mentioned in his modified version
the models in that just sound bad, even on ONXX
There's even a video in readme that demonstrates perf 😄
wheres readme
is there a way to seperate ad libs from vocals
hello everyone 🙂
I have a voice model, but not in the archive, but in the pth and index files
. what do I need to do with them to add to the rvc gui?
help
ah i failed to read this context , sorry @wispy lodge
is this the place where you can make brainrot remix's
im not
Ayo? @olive mural level 1 !!! 
hey faultrio do you know if you can make brainrot remixs in rvc applio?
we don't really give support to voice.ai... maybe if you have a decent GPU you could use W-Okada
EOL - No further Updates
Github - Blanc-dot
Discord - Blanc_dot
Despite being end of life, most if not all information has not really changed, so should be very accurate until actual new stuff comes out.
Other Links
Antasma's Local Error Fixes
Antasma's Colab guide
Sushi's useful Links - You need...
@brittle wing hi
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
I just unplugged my microphone that's why lol
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
i use 1.86 on my rtx 3060ti but idk how it affects other gpus
its a good value for me, i think others with my gpu use 1.01 but quality was too meh when i tried it
Oh yeah, I know someone who's tested this, and we think the silencing thing on max extra is model specific. Like, some models have the problem, and some don't, so the bug is prob brought on by some different things being done in the training phase, if I had to guess.
how do i use the batch inference thing in applio huggingface
what si better out of harvest and crepe
Does anyone know?
did #1175430844685484042 got nuked or smth? can't find any rvc v2 models, did the mods merge it with the normal rvc tag?
why im getting this error?
Does anyone know how to get the output to work? I can't hear the converted voice for the RVC. it doesn't crash on me now, but idk how to get the output voice.
@wispy lodge does this work on AMD Ryzen 5
can you tell me what I should use with a RTX 2050
I need help, it works fine on desktop. However I can run steamvr normally but the moment I run rvc it kills all my audio, I checked all outputs and inputs does anyone know what I can do to fix this?
You can try these settings
But try setting the sample length to 0.36-0.42
Because a RTX 2050 mobile is basically a GTX 1660 SUPER with similiar CUDA architecture but with DLSS and Ray tracing technology
But the rtx 2050 is pretty more superior when it comes to AI
In gaming gtx 1660 super performs 38% more better than it
@glad zealot Twist his balls off
balls twisted
EVERYONE HAVING ERRORS ON THE ILARIA RVC MANGIO (THE ONE IN THE DOCS) CHECK #📰│dev-updates message
Ok so Im using my voice changer and its so delayed how do I fix that
Ayo? @alpine shore level 1 !!! 
better hardware or settings tweaks for a little bit less
What gpu
oh yeah if ur amd convert to onnx
Doesn't work with rvc
You still need to use .pth
I've tried it before
i use onnx
i think its the version that supports it
oh lol u got no worries about it then lol
rvmpe_onnx what im on and its ass but it works kinda for amd
Ya
The AI voice translation was so ass on my RX 580
It was so delayed and stuttery
Ayo? @brittle wing level 5 !!! 
im trying to get colab to work rn cause i used to use it but now it just freezes and disconnects mid setup
do i need colab pro to get more resources or what
A gtx 1650 is probably like the bare minimum
for rvc
If you're on the Nvidia side
I have a 1660 super
i have 5700xt
Rx 6600 has newer architecture
So it's probably more better in AI
Gaming-wise 5700xt is more faster no doubt
Nvm
Rx 5000 series should be the bare minimum on AMD side
rvc voices have massive delay but beatrice works really well
True
Tbh they should've made beatrice the standard
Rvmpe is still pretty good
wut
the beatrice voices sound pretty damn bad
and, y'know, they're not the same thing as rvmpe
rvmpe is a pitch extraction method for RVC to use, beatrice is just another thing entirely. It does not use the same .pth files you use with rvc.
Interesting
Like, the beatrice voices are just another voice changer thing the w-okada dev added, because it didn't already have enough bloat, apparently.
hina colab realtime launches but always runtime disconnect as it finishes
Ayo? @urban hill level 2 !!! 
use rvc gui instead of colab lol
don't need onnx files, don't have to deal with the cpu bug
w-okada is not the rvc gui I'm talking about
Or use wokada fork that has cpu bug completely eliminated lol
ah
keep on forgetting about your fork, ngl
but i mean using anything other than colab would impact performance heavily
you need to get one of those youtubers who've made a, "how to sound like an annoying anime girl," videos with regular okada to feature your fork lol
only thing i can run with rvc locally is roblox or somethin
depends on the kind of games you play
can fix a lot of problems by maxing out your fps at 60 and lowering graphic settings
Tru lol
Though it's actively changing, so it most likely would be outdated the next day
this would be something that i could resort to but im just seeing if anyone knows how to help me with my colab issues since its supposedly still working
i would be willing to pay for colab pro if i know that it would solve my issues
Is there any issue with Colab? I think currently you just enter your ngrok token, hit Run all cells and go to your ngrok URL. Simple as that
is it ok settings for rtx 4060? chunk:256, extra:4096, f0 Det:harvest
my runtime disconnects like the image i screenshotted above
and i can get to the browser gui but no output because it disconnected
Hmm, let me check with my version. Maybe something has changed in colab runtime
its probably because of compute points but im not sure
or compute units whatever they are
idk ill just buy some and see
Nah, it seems like some error with notebook
On the right pane, you can see that your instance will last for 2 hours
can anyone answer pls?
And if it had Connect T4 button, that means it runs instance with T4 GPU
Кто нибудъ есть русский хелпер?
chunk 192, extra 16384, f0 det rmvpe
I love u
it freezes a lot during the server start cell, and it disconnects before i even touch any buttons on the gui
Ayo? @nocturne moon level 1 !!! 
whats the settings I need to set for better sound(gain in and out)?
Ayo? @analog ridge level 1 !!! 
without glitch
depends on your mic, and how loud you're speaking, but I think you can just mess with it while it's on to see what works best for you.
well, out is just making the ai voice louder
you wanna mess with input
ty
I'd say you probably don't want to mess with out gain and only adjust the in gain. And increasing the value above 1 (which is 100%) might distort the audio
?
Pls
You need to set Monitor to your headset to hear yourself
do I need to use it for better audio?
Dont ping random people.
Send a screenshot of your voice changer
Sup2 is noise suppression, to avoid unnecessary sounds in your background to get picked up
No point in using sup1, and echo you only need if your voice echoes
thank you so much!
Note that Sup2 may also filter out quiet syllables (really getting annoyed by this sometimes), so if you don't have problems with noise - just don't use any of them
ok, but I need)
Then yeah, Sup2 is usually what you want to use
If that doesn't work well, then routing audio through 3rd-party solutions like steelseries sonar or rtx voice is also an option
its working right, ty
Your rx 5700 xt should be already ample for rvc
mmmm
If you really wanna get the best results with rvc while gaming
ah
I would just really recommend setting your game's graphics to the lowest settings
And capping your framerate at 60 fps
yes
That way most of your gpu power is being focussed on the voice changer
And not the game or it will cause some stutters
HOW DO I CONTIUNE TRAINING
which folder is it
ok so i wrote in just here
wtf its at epoch 1 now
might be because spaces in its name
also new mj model yay
the experiment name has spaces in it
do I just make sure that the sample rate is the same and the experiment name is the same as before then I hit train model to contiune the training?
oh what do i do then
crap i hate my mobile intern
this too
make sure that your name, sample rate, batch size is all the same
and, definitely don't use spaces in the model name
it said all keys matched perfectly
btw I dont think it matter if I have spaces or not the thing works
i know about codenames and shid I know why u should have it as a standard
but if I can do spaces imma do that
that's the thing though, it breaks training most of the time
whats the diffrence between one click training and train model
one click training is broken
ok lol
what are you using?
and, does it do it everywhere or on one app specifically
why does the epoch restart tho
probably because of the spaces in the name
It said 25 in the log, keys matched when I clicked train model to contiune and now its 0
also it just saved the 5 and 10 epoch d/g
ok should I just delete the model and then make a new one
yeah
yup
clearly done something wrong here
Ayo? @daring heath level 4 !!! 
so with this fixed model when I go ahead and retrain it, its gonna contiune at the right epoch not from 0 right
mmm how did preprocessing and feature extraction go
I deleted the folder and I redid it
I see what you did there
wym
i got no idea what ure talking bout
im new
i forgot to the extraction
ah
that's batch size you have set, and on short dataset
the dataset is 16 min
im not sure if even 4090 or A100 is that fast
ah good then
so now its working
its going alot faster than last time cuz then it was like 1 minute
2024-05-31 15:56:23,566Mj Thriller EraINFO====> Epoch: 1 [2024-05-31 15:56:23] | (0:01:15.876951)
2024-05-31 15:57:07,253Mj Thriller EraINFO====> Epoch: 2 [2024-05-31 15:57:07] | (0:00:43.672445)
2024-05-31 15:58:27,593Mj Thriller EraINFO====> Epoch: 3 [2024-05-31 15:58:27] | (0:01:20.334218)
2024-05-31 15:59:33,094Mj Thriller EraINFO====> Epoch: 4 [2024-05-31 15:59:33] | (0:01:05.494266)
2024-05-31 16:00:30,874Mj Thriller EraINFO====> Epoch: 5 [2024-05-31 16:00:30] | (0:00:57.774335)
2024-05-31 16:01:25,446Mj Thriller EraINFO====> Epoch: 6 [2024-05-31 16:01:25] | (0:00:54.566728)
2024-05-31 16:02:29,393Mj Thriller EraINFO====> Epoch: 7 [2024-05-31 16:02:29] | (0:01:03.940795)
2024-05-31 16:03:35,701Mj Thriller EraINFO====> Epoch: 8 [2024-05-31 16:03:35] | (0:01:06.275938)
2024-05-31 16:04:20,485Mj Thriller EraINFO====> Epoch: 9 [2024-05-31 16:04:20] | (0:00:44.778808)```
i must have done something wrong with the gpu setting sthen
does anyone know how to set the voice so it doesnt hear like its out of breath?
out of breath??
lol
train it more i guess
i have no idea
How do I get a tensorboard displaying the training
like a graph
how do I export the model? Do I extract the latest pth file but how do I get the index shitty file
is ilaria rvc mainline not working too?
Ayo? @honest laurel level 1 !!! 
Does anyone know how to get the output to work? I can't hear the converted voice for the RVC. it doesn't crash on me now, but idk how to get the output voice.
pth is in weights, index in logs
hm? what are you using to convert?
I'm using the RVC GUI Client
send a screenshot 
alright, gimme a moment to boot it up
this one.
I was on cable to try and see if Discord can pick it up
hence why it says the output device is "Cable"
But even with headphones set, I still can't hear my voice.
Or well, the converted voice.
probably the spaces in the folder name
screws up everything yk
It still doesn't work even with the preset models
got it.
I managed to get an output, but it won't pick up my voice, and instead it gives it a choppy feeling. Any way to fix that?
probably decrease your response threshold
oh rly
Ah, gotcha
and maybe check input noise reduction if you got bg noise
It's still choppy, do I make the Response threshold -60, or make it higher?
wait... is it doing that only on discord or on both programs?
alright... shit 
do you want me to send a Screen recording of what it does?
yeah that would help tbh
Ayo? @white bone level 3 !!! 
wait increase that sample rate
no wait
sample length
my brain is dying
how much?
to the middle id say
Me everyday
now it won't pick anything up
I'm using the one related to my GPU, which is Intel.
A can't help with that but it sounds like a feedback loop
is it integrated or is it one of the Intel Arcs

gg
Maybe it's because I wasn't letting it boot up fast enough
Yep, it was because I wasn't giving it time
Thanks for the help, tho!! I highly appreciate it!
Ah, you're welcome! Glad you could fix it
Thought it was the iGPU screwing you over
I wonder if I can run that
how can I run RVC on m3 max?
Installing Applio is a simple process, you can download and install Applio in different ways.
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
u can ask more in #🔍│help-ai-art but...
-rvc
📚 All-In-One English documentation
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
Ayo? @foggy folio level 9 !!! 
alr what is the best version of rvc just for covers rn
like the one that gives the highest quality outputs rn
Hi does anyone know why step 2 of the RVC model isn't working for me? It says JSONDecodeError
Which colab are you using?
RVC v2 with crepe
Actually the quality of the output will depend on the quality of the model you use.
Not the version of RVC.
bruh no wonder lol
I found the link through a youtube video, do you have the updated?
tip: don't use video tutorials on youtube
These tend to be pretty outdated
Use this colab.
guide for it: https://docs.aihub.wtf/rvc/cloud/applio-colab/
Last update: Apr 01, 2024
lesson learned. I recently got into this so im super new
Ayo? @indigo pilot level 1 !!! 
thanks
thanks
yw :)
🙂
guys, which female model voice you think is the best and most natural for man?
You must test yourself with the public models on the #1175430844685484042 channel
ok nice so I just make a folder out of the two and then I can use it
man why is isnt this written anywhere tf
do everyone hear train their models more than the first pass
like i dont understand it completely u train ur model up to ur top epochs and after that its just fine tune training if u train it more?
@wispy lodge
Huuuh??? what do I do now I put it at 450
and now its going 451?
isnt it done training?
Well, this should be at least working
But the delay will be 1.87 seconds
is everything correct? now what do i do?
You can try pressing start and see how it works at least. It should output converted voice to your speakers
it should probably work just fine as it is
unless you didn't enable the "Save small finished model to weights" option
You can stop after a certain epoch count (make sure your D/G stuff has already saved)
Then just test and see
OMG ITS LIKE DELAYING BUT IM NOT SURE IF ITS THE AI ONE OR ME
wtf
@wispy lodge like im speaking into the mic but sometimes the voice doesnt even repeat just bg noise and its like really delaying, any more advice?
and sometimes the voice comes AFTER i press stop
It’s ballin by Roddy Rich
Well, do you actually use web camera as a microphone? Do you have a microphone on your headset?
no i use my webcamera as my microhpone and my headphones are seperate
Then you might need to enable noise suppression at least. Try enabling sup2
And to reduce the delay, you can try reducing Chunk a bit. You can try reducing it to value that corresponds to 1000ms
If it sounds ok, then you can try reducing it more
reducing or increasing?
Reducing
i put 80 chunk
is the voice suppose to be heard after i press stop?
80 is too small. Use the one that corresponds to 1000 ms, I don't remember exact value
This can happen if delay is too high or your CPU/GPU does not keep up
Is it a live voice changer
It's realtime, so yeah.
Turn up the processing time idk what it’s called batch size
it’s gonna be more delayed but better quality
yea
80 is 10240 ms
oh wait
i read wrong
im blind mb
@wispy lodge i put 384
which one is that
i gave a ss
No idea
did it work for u
Ayo? @brittle wing level 3 !!! 
ok its working but it cuts of mid sentence, like when i say hello nice to meet u, she goes 'HELLO....NICE TO....MEET...YOU'
i dont use live rvc
so my michael jackson model works hahahahahaha
damn must be nice
its my first model aswell
congratulations
btw how to add models in the voice changer
Hmm, what's your CPU?
i use rvc gui so i have no idea
Ayo? @daring heath level 5 !!! 
prolly move the zip into the models folder
unzip the zip folder and put it in the models
make sure the assets of the folder is the 1st directory of the folder
not a folder inside of a folder having the assets
Err, Core 2 Duo is way too old to run voice changer at decent level... Does it cut mid-sentence with chunk 704?
isnt it ancient already in vista era? 
ik my pcs pretty old
stawppp
what do u recommdent a good cpu, imma build my pc
mj a gangster
dude 😂
keep me away from ai bruh
depend on your budget
im not sure but smth atleast better than my current one and smth that most people use
Vista came out in 2007 and this CPU came around the same time so nah, it was fresh
Depends on the budget. Any current budget CPU would be better than much older CPU models
is there some kind of ai voice changer that u can use in games and stuff
and im a bit surprised it can still run win 10
Win 10 has surprisingly good support for old hardware actually
i see i see
but what do u recommend tho
what about win 11?
I remember having a first gen core i5 laptop in 2011 and was sticking on win 7 and 4 gb ram
even though u can bypass TPM requirement, I doubt if ur mobo could support it without issues
sigh, fr
i really gotta change everything huh
i dont even have a graphics card
igpu or "onboard VGA" like that in 00's era?
for low budget tier, ig i5 12600 or ryzen 5600 is decent. and gpu is either 1660 super or 3060 depend on how much budget u can spare.
You can also look for used parts and build cheaper setup with older parts as well
i see, noted, thank u thank u
i was never a pc working kinda person so i never used it much but when i actually needed it and realized how much money it requires to build a pc, had a mini heart attack
Though if you're not very familiar with computer hardware or don't have a friend who could help to find and verify parts, that may be not the best option
idek bro
alrr thank you so much

@wispy lodge @proud elbow thanks for helping me out, take love bros
Because you usually can get old top GPUs for cheap (like rx 480 8gb for example) at least and they serve well even in modern games. That's how I've been advancing with my hardware until I got more money to buy better new hardware 😄
im not sure if 2 gb vram already suffocates in the modern requirements, but despite being 10 year old, even GT 710 has 2 gb vram too
gt 710 is way too weak even for the time when it was released and 2gb of vram doesn't really help it
Ayo? @cosmic yew level 1 !!! 
how do i fix this?
Collecting parselmouth
Downloading parselmouth-1.1.1.tar.gz (33 kB)
Preparing metadata (setup.py) ... done
Collecting googleads==3.8.0 (from parselmouth)
Downloading googleads-3.8.0.tar.gz (23 kB)
error: subprocess-exited-with-error
× python setup.py egg_info did not run successfully.
│ exit code: 1
╰─> See above for output.
note: This error originates from a subprocess, and is likely not a problem with pip.
Preparing metadata (setup.py) ... error
error: metadata-generation-failed
× Encountered error while generating package metadata.
╰─> See above for output.
note: This is an issue with the package mentioned above, not pip.
hint: See above for details.```
HOW DO I OPEN TENSOR BOARD
Huh`? cant i train it more?
does ilaria colab still work?
@proper shale cant i train the model more
it was saved, you just need to Increase total epochs to resume later
wait so more epochs is more training
not necessarily
every model is different after all
that's why TensorBoard exists
you need to dl it btw, have you done that already or?
this is a web ui inference?
Ayo? @spiral osprey level 10 !!! 
yeah
Nope
How do I fix thst
do I need to fix that now
Or after the 1K training
I wanna do 1000 epochs
Btw is RVC that advanced it can make it sound realistic or is it held back by the robotic artifacts
All depends on your dataset, tbh
It can get really close
Last update: Feb 10, 2024
Ok so I would take it a yes it’s advanced as shid
thanks
You're welcome :)
Btw what’s Overtraining isn’t it just good that it masters it
Or wait ye
Like it becomes more data obv
Well not really. It makes your model worse
bruh
But it's not something that comes this quick
can I just move the RVC folder onto another pc and continue training there or does it fugg up the device info shid
like it starts going where the Rtx 3060 ti this a mx 550
Yeah you probably can
I think imma update the dataset cuz it has a lot of high saturation
Just copy over the logs folder
Ah I see
Ok nice
I feel like it goes over a threshold of ai managing if yk what I mean
like it has control on the bass and mids but not the highs it’s a bit over and it exposes the ai robot sound
Btw I posted some demos here of it aleready
It sounds great tho 😹
.
I added the autotune reverb and the high saturation
Tremble saturation**
hey has anyone else run into the problem of the voice changer not working properly in call of duty? it works in any other game but for some reason just goes insanely laggy in cod... i have no idea how to fix this. it has worked in the past.
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Help, I can no longer access the applio's public link
Ayo? @opal kelp level 4 !!! 
how many minutes of recording should i have to make an ai model
Between 10, 15 or 20.
Preferencially, as clean as possible.
how should they be like all the voice clips in 1 audio track
or all of them seperate
in like short segments of speech
i havent checked docs yet sorry im just getting the voice files ready
You can split the audio in 5-7-10 second fragments.
all seperate mp3 files?
Don't use mp3.
wav?
Use wav or flac instead
how does it work is it basically just putting them in a software and letting it make it
Actually it's not that simple.

You gotta split your audio file in segments (you can use label sounds option in Audacity to do so then select "export multiple files") put the files in a folder and then put the folder of your dataset in the datasets folder of your RVC root folder.
would i edit the audio a little
like noise gate and stuff like that
so its clearer
Can I make a model out of 20 seconds of dataset?
Is that possible
I know it’s gonna be shit
I think you can make the dataset as clean as possible.
And play around with batch size.
how do u do labels in audacity
if you can of course.
Analyze > Label Sounds
You can use Truncate Silence on Audacity but you can do it yourself or use Label Sounds instead.
Use these settings for Label Sounds
After you clicked on apply, you'll see stuff below like this. (On the tag text you can put any name you want (a legible one preferencially) along a ##1
Then, just go to File > Export > Export Multiple
will that even work on these cuz they look like this
If you got various audios, just do it one by one.
Sadge
Ayo? @brittle wing level 3 !!! 

Label sounds will exclude the silence for you.
im just adding a limiter and stuff in fl studio cuz its easier to use for me
If you know what you're doing with FL, it's fine then.
it just basically needs to have minimal background noise right
If the background noise is barely audible, it's fine.
is it gonna be shitty cuz
its my friend im trying to make an ai voice of him to troll him
But if i was you i would make sure to delete all the noise.
but all of it is recorded thru a game chat
so sometimes it cuts and stuff like thar
Well, you're doing your best with what you got.
That's something at least.
Don't feel bad, pal
Then you'll get a window like this.
Create a folder where you'll export your audios before hand and select it.
And select flac or wav as format.
il just do it manually cuz alot of it is just him humming and background noise
so i need to listen thru it anyways
That's okay.
I'm just finishing my Label Sounds usage explanation.
so i just need a folder full of seperate audio files of him speaking right
Then select 8 as level and 16 bits, then hit "export"
yeah.
As clean as possible.
the audio file is 40 mins long and 400mb 
I'd recommend you to use flac ig.
i already exported it as wav
You exported the split audio files as .wav?
nah its all of them in 1
i put noise supressuion stuff on it
and exported it al las 1
to see how long it was in total
I would recommend you to use Label sounds bc RVC's built-in audio splitter isn't that good.
I'm not sure, but you can use Applio Colab too.
oh! thank you
Do you want the link to it and the guide?
I found the link and I think I'm working around it
thank you
actually maybe link me the guide please 
You're welcome.
Right away sir.
Last update: Apr 01, 2024
There you have.
.
Do you need something buddy?
can i use audio clips of them like singing and stuff
or does it just need to be normal speech
You can experiment and use singing audio too if you want.
I play on an RP project in the game Dayz, and I need the voice of an old grandfather, who can help? preferably in Russian
Use the #1159289738314919936 channel and make a free/paid post asking for your desired model.
Or you can read the docs too and learn how to make it yourself.
👍
In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.
There you have just in case.
is Ilaria broken?
I'm not sure.
i think it is
In that case, use the Applio Colab.
Last update: Apr 01, 2024
This is the guide you can follow to know how to use it.
And this is the link to the colab.
Tip: If gradio's public URL doesn't work, reset the colab and check "share tunnel" before clicking on Start Applio cell.
ok bet
What is batch size gonna do
I think I can make the dataset sound clean with a lot of manual work including IZotope rx 10
If you know how to use RX10, of course.
I think it's how much audio files are processed at a time.