#✨│ai-help
1 messages · Page 14 of 1
It doesn't work
RVC Guides (How to Make AI Cover)
Translation by country
idk abt minecraft 
not minecraft
Ayo? @grand orbit level 1 !!! 
i mean its lagging the vc in discord and everywhere
like the sound not coming correctly and ms is so high
@low shard Sorry, didn't notice I'd accidently deleted the guide while deleting some other documents, SMH.
dont worry, anyways i tried to improve it, there was a part that told the viewers to go view some guides to isolate vocals but they werent working because the guides were my italian ones, so i adjusted the font and putted all english guides for isolation of audios
angetyde should just delete his original post of ilaria rvc, btw dont worry abt this its already fixed and pubblished
That's great. I didn't actually think of that since I already had some prior experience, good catch.
what it this RVC
How can I make laughing sound normal and not glitchy?
Ayo? @worn echo level 2 !!! 
hello I am having out of memory error on my rvc cmd
any optimised code or solution to this i got 4gb gpu and 16gs ram
training on a 4GB gpu won't do unfortunately
hi guys....a new much more simplified colab will be released soon with added files and other files that download automatically
Does a mixed dataset with singing and speaking result into a better or worse voice model
-colab
Google Colabs
RVC
- AI-Cover-Gen-No-UI (inference only)
- RVC v2 Disconnected (training only)
🌐 Translated colabs
- PT-BR:
-colabs_br
Audio Separation/Isolation
-rvc
RVC Guides (How to Make AI Cover)
Translation by country

oh my
what colab is that?
CoverGen_No_UI.ipynb
i'll try using the huggingface one instead :/
Is in Spanish but is the same
works too but it's too slow
ooooo thanks! ill try this too
do i have to edit the dir_name or just leave it?
And for song_input, do I upload the audio to my google drive and copy & paste the path?;o
change the dir_name and.... works with a YT link
If you want to use a drive file, you must mount the cloud
create a code cell and paste this:
from google.colab import drive
drive.mount('/content/drive')
oo okie thanks!
Ayo? @glossy coyote level 4 !!! 
I have this error for some reason D:
BadZipFile: File is not a zip file
hey i need some help
when i made a private RVC space for me in hugging face.... the screen is just blank white , no interface or buttons are visible.... i followed the group guide but this found myself stuck in middle due to this
Ayo? @eternal finch level 1 !!! 
How do i find out how much to train my model, im using Mangio local RVC2 training, how many ephocs? and gpu blocks, ect, how do i find out that number
does anyone have a fork of AOD? the original github repository is down
Hello people, is this possible to convert an audio file directly and how ?
Is it normal that it takes only 1-2 seconds for one e-poch ? (in rvc v2 disconnected)
sorry, it's currently being reworked in #🔊│ai-development message. hopefully it'll be back soon
fuck
i would still like to use the old release version, if thats possible...
may i ask him directly? or if there is an archive available i would like to use it
ig you can ping him yeah. the repository is not available anymore and I'm personally not aware of another way to download it soo
@brittle wing hello, do you have the old release version of AOD?
Ayo? @quartz sky level 1 !!! 
why did you private the repository?
Why is my Res so high with RVC?
i understand, but if you still have the old release version id like to use it
Trying to use the RVC in discord, but it keeps lagging out, are there any better ways to use it?
It's either your GPU or the wrong settings.
I'm pretty sure my GPU should be fine for it, are there any known optimal settings?
What's your GPU btw
3080
Ayo? @agile oxide level 1 !!! 
i only have a 96 option
i cant type lmao forgive me for it
oh that's it
And then RVMPE as your f0
it's the fastest really
what do you mean by this? sorry
what do you mean bat version? the one in releases or source
f0 is basically just pitch detector
could be that ur running from onedrive? idk
Is there also a better like mic system to use? I'm just using one I found online.
oh
idk cause it says cannot import name I18nAuro from i18n
if you installed it in a virtual environment you should activate it first
i tried reinstalling i18n
what
try moving it out of onedrive
unfortunate, ill be looking forward to it then...
Any ideas why it's still super choppy or laggy in VC?
im sure people here have a copy of it somewhere.. right?
Have you tried like, 256 chunk size? Maybe that'll fix it
Sometimes low chunk size can bring choppiness
1 second
It will have a considerable amount of delay tho, but hey, it won't be as choppy
Try out anything from like, 128 to 320, and see what works best for you in performance and quality
I'm looking at a video rn, and he's using 40 chunk size and it sounds perfectly fine.
That's probably a 4090 then
Yeah, I think so.
Either way, maybe a bigger chunk size will solve the issue
How do I find an AI better than the search thing in search-models
just go to weights.gg
@proper shale Here are some settings I have rn
decrease that extra size
to what?
recommend lower extra size
to like 4096
Yeah, idk it's still cutting out.
theres certain chunk sizes that dont work well with certain extra values
Ayo? @quartz sky level 2 !!! 
idk why but its like that for me at least
Still bad
Ayo? @agile oxide level 2 !!! 
that extra is way too much
it works pretty well for me
Go for client audio tho
it picks up what youre saying better
ok what is going on
i use this
I don't know it just keeps cutting still.
i've tried
are you running a game at the same time ?
minecraft in the back but it's not super intensive
is the framerate unlocked ?
yeah it is
Lock and see
could be just lacking memory honestly
any help please
Still not working yeah, no idea.
do you have index on?
hi
guys help
is that normal with mangio?
If you have not pressed anything, yes
yeah it's
it's normal
there's no traceback stuff
so you're good to go?
I hope
how do i get the tensorboard
ahaha-
i did not no
he deleted the repo with no backups so theres nothing rn...
then- idk what is wrong with it
I think the all in one document has a little tutorial on it
the tutorial only tells to download the release so
thats something tensorboard will "tell" you
that's why i wanted it
but is this good? @proper shale
decrease that total epochs count because you will definitely not do that in 1 session
so like how much
otherwise it's... ok
300?
yeah that should be okay
i want it to be really good
(hopefully)
if ur dataset is good, then even at 50 epochs it'll already sound decent
idk, how much vram you have
16gb
oh
this is RAM
check uncheck check rest seems like unecessary automated stuff
^
and that mode collapse method won't save mode-collapse-prone datasets, will only delay it occuring
yo wtf
Ayo? @brittle wing level 5 !!! 
its so fast
mode collapse is caused by either:
A) Dataset is too small + Dataset/samples to Silence ratio is higher towards silence.
B) There's way too much silence ( or even contaminated silence which doesn't get discarded ) in the dataset
also, if you want best results, depending on the dataset, you should commission someone
what do i install applio or normal rvc
so fucking fast
batch_size 4 is usually never used unless in some special circumstances
why tho?
lemme explain you briefly what it is
it takes your dataset and divides it by 4
then you get groups
and that many groups ( of samples 0 are being used during training
-rvc
RVC Guides (How to Make AI Cover)
Translation by country
sometimes the groups are too small for that one specific dataset
to get a good " avg / weighted " results for one dataset / voice
oh i understand you
on avg, 16 is go-2
if no good results are present, only then you either go lower; 14, 12, 10, 8
or add more data ( 1 min, then another 1 min and so on
it's a gradual process
what's your dataset size?
i started training 2 mins ago
if it's below 1-2 mins
what's your dataset's length approx ?
30 mins
oh yeah, then that's def something off
ooooh no
how
( it contains .wav sliced samples used for training )
can i stream it to you??
RVC's folder
and there's assets I think
or logs, and in there, your models' folders
not sure how newest rvc or so handles it
its prolly training 1 epoch per step
gimme a sec
@brittle wing so
first
go into ur rvc's folder
now assets
hmmmm
weird structure, wait go back
ah cause that's applio
well, go root folder
it should be somewhere there
hmmm, I've no idea where applio stores models lmao
ye but those are pretrains
those are the " base " models
Got better idea, search for ur model's name in root folder
it's the main rvc's dir, well, applio's
in other words
a, there we go
check 0 gt folder
hmmm, samples are in there, so that's not the case then
@brittle wing I'd first check if it works
in applio
ah, so it's for training only?
a no, nevermind
ye, inference, you wanna test it there
Well, gonna move to unlimited's chat, I look like schizo here, hm 🤔
nevermind
@brittle wing check dm as well, provided a stress test audio
@brittle wing no that's not a rule
You should only train for as long tensorboard " tells you to "
because AI models have 2 states
overtrained and undertrained
nono, If you want it locally, I can give you something
nah, don't trust those misunderstood information
each model and dataset is individual
wait I'll just get a mic hooked up ig, brb
what do i do if it sounds good in audacity but when i listen to it in an mp3 it sounds low quality
Ayo? @heady fable level 1 !!! 
export to wav
thanks
how long does it normally take with 45 mins of voice lines?
anyone can help? i trained rvc locally, but i dont have g & d pth file? it does appear the paths when i first open the gui, then i clicked on extracting features it disappears
should i reinstall my rvc?
I don't know which channel to use, so I'll copy here what I posted to #🔍│help-w-okada .
I'm getting really high delay after a few seconds of live voice-changing, as in every couple of seconds the delay grows by a second. I'm also getting a "stretching" sound as the output cuts in and out. I have the AI set to low-ish settings and an Nvidia 1660 Super. I'm not sure if I have the GPU set up correctly or how I can do that.
Is my GPU too weak for okada or is there some other problem?
Apparently 16 series of cards have problems with W-Okada. I suggest opening an issue on GitHub
they should be in ur logs
the paths disappear, i cant enter the path, it will show errors
its chinese if you dont mind, i first opened the gui the paths appear, and then i extract voice features the paths disappear
and i enter manually it shows errors
so theres two blank spaces for these two path
I first trained locally and inference, the sound is terrible
Oh those are the pretrained paths
yea i dont have that
Huh
oh so i need to download the infer train ver?
Ayo? @woeful depot level 2 !!! 
i never train locally before
i modified the script but now it says
its literally missing files
tf am i supposed to do
literally had to copy my_utils from an old rvc version for it to actually stop giving an earlier error
so... what's the difference between the "f0 methods" (dio, pm, crepe, harvest, etc.)
is there any documentation on the diffrences between them?
ah ok
nobody uses em anymore
how do u use a model
uh, just a quick question
Do the perfect amount of epochs for a dataset depend on the quality of the audio, or the time of the audio?
If the time, is there some sort of chart for what the perfect amount of epochs per minute or second?
both
only the graph "tells" you how long you'll train for
oh, its because I usually use a dataset at about 3 minutes and 10 seconds at 600 epochs
the graph "tells" me is sort of a lie. I know I have to use trial and error, but it takes me like 2 hours to make a model and train it at 600 epochs, so no thanks.
...wha
look, trial and error is good for like
batch_size
the graph does give you important info you should see to make the decision to keep training or not
how am I supposed to know when it is overtrained? The graph either flucuates randomly, or I am too zoomed out to see and noticible things
because in the worst case scenario you can get a overtrained model, but most of the time you keep training it's still undertrained
-overtrain
Hey, @buoyant saddle!
👇 Here are some resources to help you identify if your model is overtraining
All-In-One Guide on how to make a good model
This guide explains how the D and G files works and much more: https://rentry.org/RVC_making-models
Credits: LUSBERT 
Automated Overtraining Detection (AOD)
Will be available soon in #1159513888199540817
Credits: grvyscale
this might help
but basically
if g goes up and doesn't come back down after a good while of training, then it's most likely overtraining.
thank you
Ayo? @zenith cairn level 2 !!! 
RVC Guides (How to Make AI Cover)
Translation by country
generally how long does ilaria rvc take?
depends on the song's duration
it's 4mins
since it uses cpu it could take like, 400 secs for a 4 min file
if it's fluctuating too much then it's either ur dataset or batch_size being too small
16
For any dataset?
you can't calculate it, each dataset and so the model, is unique in that matter
as a rule of thumb, start with 16 and only decrease if the model despite good graphs, sounds bad or not as good
( provided it's not because of your dataset ) or if simply, graphs are too chaotic
so, 16 -> 14/12 -> 10/8
but typically, it's either 16, 12, or 8 going below 8 is super rare and most of the time not required
tl;dr, more than 5-7 mins of hq audio? try 16 or 12 if 16 failed
less than 5 mins? 16 and if that fails, try 8 or 6
overal, you want your graphs not too zigzaggy but neither too flat
yes it’s studio recordings
legit studio or stems
legit studio
then rest assured, you should freely try 16
tho, if you can provide a short sample
I can evaluate it for you spectrally
wav I assume
I’ll send it in dms is that fine
naturally
alr
all good
so it's informative for every1
yh
alr, lemme see
say less
first thing would be, some voices have assymetrical waveforms, phasing issues
sometimes it's the mic
relative to the middle
getting it right helps with compression / normalization steps
also helps the loudness / dynamics a lot
another thing would be mic thumps
rvc does handle a bit of that ( incl dc offset issues ) up til around 75hz
some exceed that and could be audible as bassy pops in models ( not always tho but worth noting, esp for low voices )
Other than that, my congrats
you're actually the first person here that stayed true to their claims of the audio being HQ
check passed
suitable for 48khz training ( training will be a bit more sensitive butttt, extra 4.1khz handling is worth it )
Definitely use batch 16 and only attempt 14 or 12 if the model doesn't come out any good
Now, I'd be a lil hypocritical of myself to " estimate " stuff but, from my experience, model shouldn't exceed 300-400 epochs zone
it is super hq and 5 mins so, pay attention to tensorboard and don't trust ckpts past 300 or 400
Yeah same model shouldn't exceed 300-400
That's what I always do
it's more so that sometimes overtraining looks tricky on graphs
and ckpts that are 20-30% past the estimated threshold
It helps
sometimes look like they aren't overfit
overtraining is just, well, lemme do a simulation
Is audio engineer a job? It must be
quite a common ( if to, well, surrealistically assume there's no mode collapses ) scenario
zigzaggy due to either a bit lower than should be batch_size or just the nature of the set / or processing
but you can visualize the trending
so you'd want to pick the " somewhat stable " zone on the graph
and test those ckpts
key elements are:
- sibilants ( whether they glitch or not )
- volume / dynamics issues
- glitchy ( cutting / stuttering ) plosives
- lack of clarity or graininess, esp on breathing
but then, those graphs above are that " perfect scenario "
now, a bit more realistic with mode collapses
you wanna " ignore " collapses, thooo if you are suspicious, check them out too ( but don't get fooled )
and see the trending
again, these are example graphs. Each model has different scenario
this is a flat-lining graph
( too different from each other samples, bad processing, too much noise, not enough data compared to batch_size used or simply too much data )
an example of high / smooth batch_size scenario:
example of low / small batch_size scenario:
reason is, higher the batch_size, more samples are used per " update " of the model's parameters, having smoother and more full / more rich estimation
( values are for just demonstration, do not reflect how it's in reality )
higher batch_size = smoother
smaller batch_size = more random
decision depends on the situation. Sometimes one voice is so specific it requires less examples to show the model / AI / RVC sometimes needs more
Ok so I got myself a nice AMD GPU and someone here told me to use Applio so I got that set up and it doesn't give me any errors but it doesn't show the GPU index selection are in the "train" tab and when ever I train a model, it works just fine but when I try to use it with w-okada's RVC and it sounds nothing like the dataset I trained it on, and yes, w-okada's RVC is working for I tested a model from here and it worked fine.
nvm I read everything
Yes, they're perfect
Also, timestamp for me, for future
tensorboard 101
I’ve got a lot more like that
these are all drill songs so they tend to leak a lot
what does this mean 😭
#✨│ai-help message
@mighty parcel try to get it pinned or ask someone if you can please
this should def help majority of people
in terms of graphs, over and undertraining
yea thanks
so in order to make it good, you have to do a little editing like using compressor and EQ or smth for the dataset
compressing the peaks
becauseeeee
rvc won't care about peaks, it'll normalize all to -2 dB or so
risk of clipping
and lack of consistency in samples
yeahhhh that looks bad
best is to compress peaks + normalize it manually to -3 dB
and turn off rvc's normalization ( but with that I can't help, people use newer forks, mine's based on older variant )
until I update my stuff, that's that
so in this case:
- clean the audio
- compress
- normalize to -4 or -5 dB + denoise ( as noise is amplified )
- feed it rvc ( it'll hopefully and safely get it to -2 / -3
yea it's done manually
in preprocessing script
but lots of stuff changed and so did the file structure in newer stuff
once I am done with upgrading, will possibly share the file if one wanted
( in 3-4 days, waiting for saturn's compute hours to refresh )
hmmm...
gonna condense all info into a google doc overview lel
Yeah you should do that
discord screns tho, too lazy for full guide
one day tho, for sure
but it def won't be noob friendly
sadly
some hardcore audio stuff lmao
but honestly, better than nothing
Okay I have question
I can try to simplify it, just say or specify what exactly you gotta know
if something's unclear, remind me
these are the same training I did, just with different sample rate, it sounds bad in 40k
wait\
fuck i think that's the wornfg
that is because
your dataset is approx 16/18 khz
so anywhere from 32 to 36 maybe 38
uvr output based?
THis is the REAL 40k
Yeah
yeah so here's the thing
I basically use UVR
rvc's samplerate isn't for resampling really
or upsampling
it is just to match ur dataset
because it'll handle that specific or approx sample rate the best
by training, say, 48khz model on 35khz data
there's 13khz of frequencies gone
rvc won't create it
only smooth it out a lil bit, maybe tiny tiny bit of upsampling
but that's that
anything below 37/38khz is 32khz pretrains / model
stuff that are above 40, so, standard 44.1 are both 40 or 48 applicable
ok uhh imma import the question i asked in general cus am peepeepoopoocaca stoopid
Ayo? @old wagon level 1 !!! 
i'd go for 48 pretrains in that situation
as 48 will handle those .4.1khz that 40 can't and missing .3.9khz dw about it
how do i separate vocals if they are the same person
for example yoru ni kakeru
the background vocals are on different keys, but they are the same person
this is a 44/48 fusion model for instance ( 70% of data is 44.1 )
made on 48 pretrains
that's what we call harmonies
either manually in rx ( not recommended for unexperienced people )
or via UVR's VR arch models
bve ones ( iirc backing vocal excluded )
but prior to that, prepare mdx 23c's output
I used 6HP UVR
models are working on w-okada / voice changer and rvc
anyone know why
use official rvc
it supports directml fully
( as long you don't have 4gb card, it should go fine, at least with index )
if applio doesn't work for you, that is
i usually use online vocal removers cus my pc sux (cant handle uvr)
can someone send me a video link or something on how to use/apply the voice
Ayo? @devout haven level 1 !!! 
are you talking about https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Ayo? @dusk tendon level 1 !!! 
i just installed the files and idk what to do next
amd / intel variant, yes, official
can be glitchy ( or at least was on my end ) but worth trying
oh okay
I'll give it a try
I can screenshare for you the process if you wish
if adding voices ( models ) to rvc and using them ( doing inferences ) is all you need to know
ok
So does this work on windows or do I needa use a linux distro?
win
alright
(as I refered to but ima try and train it and see if it says "GPU not detected, falling back to CPU " again
ah
alr
@graceful obsidian What’s the minimum dataset you think should be used if you have studio stems
hq? reasonably I'd go for 5-6 mins
for optimal results, 10 mins
minimum perhaps 3-4
also should I need to remove silence?
5 days of raw studio acapellas 64bit/384khz are needed per model
trim it so it's at least 0.3-0.5s between each phrase
and make sure it's absl. mute, so enveloped preferably
when I mean like removing silence I mean like so the words are together correct?
there's a step for that
is that what ur on too
however, I'd always keep it manual for full control
yea it’s just a pain to do in fl studio
Ayo? @vapid gust level 9 !!! 
ye to remove the green part
so it's not grainy ( 2nd silence zone )
envelope actually
having it - infinity dB
uh-
heLP-
Temporary folder already found. Wiping...
Archive: /content/drive/MyDrive/rvcDisconnected/zipfile.zip
creating: /content/temp_dataset/zipfile/ariTestUnit/
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp1.wav
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp2.wav
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp3.wav
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp4.wav
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp5.wav
inflating: /content/temp_dataset/zipfile/ariTestUnit/samp6.wav
Sanitizing...
Dataset Type: Multispeaker
---------------------------------------------------------------------------
FileNotFoundError Traceback (most recent call last)
/usr/lib/python3.10/shutil.py in move(src, dst, copy_function)
815 try:
--> 816 os.rename(src, real_dst)
817 except OSError:
FileNotFoundError: [Errno 2] No such file or directory: '/content/temp_dataset/zipfile/ariTestUnit' -> '/content/dataset/ariTestUnit'
During handling of the above exception, another exception occurred:
FileNotFoundError Traceback (most recent call last)
3 frames
/usr/lib/python3.10/shutil.py in copyfile(src, dst, follow_symlinks)
252 os.symlink(os.readlink(src), dst)
253 else:
--> 254 with open(src, 'rb') as fsrc:
255 try:
256 with open(dst, 'wb') as fdst:
FileNotFoundError: [Errno 2] No such file or directory: '/content/temp_dataset/zipfile/ariTestUnit'```
Ayo? @river gate level 1 !!! 
but then, if you wanna go automated which I don't recommend as it's less accurate
kalo's guide will help
ye?
knowing that I got studio stems
what would be the best settings here
I use rvc disconnected bcuz my pcs shit
leave it as it is

in this case it's already all set, ye
crepe hop length?
keep 64
what happens if I lower it
it is a
well
rms detection window
smaller = more accurate but also sensitive
in robloz terms?
going below 64 is not recommended because
in other words, keep it 64 because higher is for worse audios and older methods
ah ok
and lower is for highly vibrato / wobbly voices ( and for older methods
that's why I made my port for saturn
colab's been crap since good 3-4 months
or so
is urs easier?
depends, but it's private due to prev abuses
that's the only way as of now
......................................................i aint gonna say anything
so, what happened?
@graceful obsidian
Preprocessing
Specifically the Load Dataset part
weird stuff going on
Ayo? @graceful obsidian level 14 !!! 
well, I'm clueless on that specific error
you should perhaps ask disconnected's maintainers
or try to restart it all + reupload the zip
okay i restarted it all, reuploaded the zip, andddddd same error
does anyone know if there's a site where i can use elevenlabs but enter in my api key to bypass that vpn detection thing
Ayo? @sullen marsh level 1 !!! 
Heloo
Is the Google Collab for making AI covers usable now?
Is there any quick tensorboard to view my train logs
I don't want to run the whole colab again
Don’t use colab for interference
Anything else should work
Colab should only be used to train
Use spaces
Idk
mode collapses
is splitting of the acapella necessary? what tool can i use
huh
its like 3.5 minutes
it has low low low spikes
i js checked
ah
those are mode collapses
oh
how can i load a model?
ignore them unless final model is bad regardless of which ckpt u test
actually I'll be heading to sleep
alright
but ye I know, sometimes it's possible to get em decent
just try to experiment and test em around
model .pth goes into rvc folder/assets/weights
index goes into rvc folder/logs/here or rvc folder/logs/Make_ur_model_folder/here
( up to u to pick the scheme for this one )
I go to sleep. Take care y'all n gluck
♥
I used rvc disconnected and it finished training,how do I save the model
how do i get it to work on a steam deck?
Ayo? @coarse sun level 1 !!! 
Can anyone send ROD zip, plz? Cuz github link doesn't work.
How exactly do you convert pre-existing pth files to ONNX?
your supposed to put the name of your zipfile
I have 5 audio recordings
umm, you're supposed to zip file your recordings together into 1
oooohh
So I zipped it into one
should I just put zip
like this?
"zip.zip" the same as your file name
so like this?
this?
yeah like that
do you know how epoch works?
I used to do the old one with gradio
I just used epochs around 250 and it all worked fine
only the training right?
yep
Ayo? @daring verge level 24 !!! 
okayy, thank you so much
btw for epochs, in pinned messages theres a tabel with the suggested epochs based on the dataset lenght, its not like a rule to follow but js the suggested ones,it could ofc depend tho
honestly, I don't know what epochs really do. I just put 250 because it is what I usually do and all works out fine
its like how much the ai will train with the dataset, but the amount should change based on the dataset tho,depends
I've ran both, what now?
Ayo? @tired rapids level 3 !!! 
I made a voice model, but I don't know how to make another one and it keeps giving me errors
you don't need to run clone respiratories if you already run the first time
since, its already downloaded to your google drive, you just need to run install dependencies
guys im having in issue with the voice like they can notice that im using a voice changer i dont know is it because of the models or there is a way to make sound more real
try using a different voice model
any recommended once or pupolar model , i have try more than 5 until now i had the same issue with it
I did that, but it won't work
for training
did you run preprocess?
yes
its missing some files, try running preprocess again
I re did everything again, but it still won't work
lemme see the load dataset section
ok now i see the problem
delete the folder and Mangio-RVC-Fork folder and run the clone respiratories cell again
what folder?
the rvcDisconnected?
in my drive?
Ayo? @ancient jewel level 1 !!! 
Hey guys can anyone give me link to Ai cover
what browser are you using btw
opera gx
noice thanks
which one?
.
what kind of help ya need?
i would suggest u to use hugging face instead, its slower but theres no risk and isnt limitless
https://discord.com/channels/1159260121998827560/1167844377046040769 this is better than the AICoverGen Colab
you cant get banned or disconnected with this
okay ill try it when im free
i would suggest the hugging face version than the colab man, just so people dont have to risk account or get disconnected, the aicovergen is not really that stable
but it has limits bc of the colab gpu
the hugging face one can be used all day without having to worry abt ban or anything
ye thats why theres a warning on the colab one
thats cover
oh
is it real-time
imma go eat...
https://rentry.co/VoiceChangerGuide#gpu-chart-for-known-working-chunkextra will give you settings that should work 99% of the time.
If you are on AMD or INTEL ARC make sure your voice models are in ONNX format, refer to https://rentry.co/W-Okada-FAQ#the-default-voices-are-fine-but-ones-i-upload-arent it should guide you through it.
same
ignore the 2nd part
ok thx
Hi I am facing an issue where my virtual audio cables have been interchanged , like the input is in the output section and the output is in the input section . I tried reinstalling the drivers, didn't work! Anyone got something?
that is correct tho
in audio inputs it you need to put the cable output
Got it thanks
yes but i personally havent tried it
how do I do it
@glad zealot
mew
i even dont understand this
lewl
😭
are you runing it rn?
i mean idk what to do
DAMN IT
i only need to copy the settings on the 1. link u sent or do something with colab?
without doing anything?
on this part you need to put your ngrok token
only that?
yup
ye
okay
what this do?
Ayo? @lime dome level 3 !!! 
it downloads a model from huggingface
to my voicechanger?
ya
if you have the model download on your computer you can also just ignore that and upload it later
should i change the settings on my voice changer program?
before or after?
fatal: destination path 'Mangio-RVC-Tweaks' already exists and is not an empty directory.
OSError Traceback (most recent call last)
<ipython-input-20-bfb8a4688ddf> in <cell line: 13>()
11 get_ipython().system('git clone -b pr-optimization --single-branch https://github.com/alexlnkp/Mangio-RVC-Tweaks.git')
12 #Rename to keep backwards compatibility with old variants of Disconnected
---> 13 os.rename("/content/Mangio-RVC-Tweaks", "/content/Mangio-RVC-Fork")
14 get_ipython().system('git clone https://github.com/maxrmorrison/torchcrepe.git')
15 get_ipython().system('mv torchcrepe/torchcrepe Mangio-RVC-Fork/')
OSError: [Errno 39] Directory not empty: '/content/Mangio-RVC-Tweaks' -> '/content/Mangio-RVC-Fork'
i got this error
after
i will run cells before opening voice changer or after?
mean that the mangio-rvc-fork folder already exists
you cant open the voice changer without running the cells
just run the cells till the last one
but i have it downloaded
last one is ngrok one?
ye
i dont have anytrhing like that
@glad zealot
on your pc?
yes
then why do you need the colab one?
it doesnt work
then just ignore your local
wdym
okay
thats literally it
fatal: destination path 'Mangio-RVC-Tweaks' already exists and is not an empty directory.
OSError Traceback (most recent call last)
<ipython-input-28-bfb8a4688ddf> in <cell line: 13>()
11 get_ipython().system('git clone -b pr-optimization --single-branch https://github.com/alexlnkp/Mangio-RVC-Tweaks.git')
12 #Rename to keep backwards compatibility with old variants of Disconnected
---> 13 os.rename("/content/Mangio-RVC-Tweaks", "/content/Mangio-RVC-Fork")
14 get_ipython().system('git clone https://github.com/maxrmorrison/torchcrepe.git')
15 get_ipython().system('mv torchcrepe/torchcrepe Mangio-RVC-Fork/')
OSError: [Errno 39] Directory not empty: '/content/Mangio-RVC-Tweaks' -> '/content/Mangio-RVC-Fork'
i literally dont have anything like this
in my drive
OSError: [Errno 39] Directory not empty: '/content/Mangio-RVC-Tweaks' -> '/content/Mangio-RVC-Fork'
i dont have anything like that in my drive
my drive is empty
i only got my datsaset in it
its on the colab storage
it says i cannot delete it because it is not empty
add a new code cell
how
!rm -rf /content/Mangio-RVC-Tweaks```
hover betweeen 2 cells
got it
thanx
FileNotFoundError Traceback (most recent call last)
<ipython-input-36-10dc9864c428> in <cell line: 5>()
4
5 if not os.path.isdir("csvdb/"):
----> 6 os.makedirs("csvdb")
7 frmnt, stp = open("csvdb/formanting.csv", "w", newline=""), open("csvdb/stop.csv", "w", newline="")
8 csv_writer = csv.writer(frmnt, delimiter=",")
/usr/lib/python3.10/os.py in makedirs(name, mode, exist_ok)
223 return
224 try:
--> 225 mkdir(name, mode)
226 except OSError:
227 # Cannot rely on checking for EEXIST, since the operating system
FileNotFoundError: [Errno 2] No such file or directory: 'csvdb'
SETUP CSVBD
okay what i will do now
just end it and start from the beggining
you need to move to specific locations if you continue it
@glad zealot
Updating and installing system packages...
Installing build-essential...
Installing python3-dev...
Installing ffmpeg...
Installing aria2...
Updating and installing pip packages...
CalledProcessError Traceback (most recent call last)
<ipython-input-38-46706f3aac76> in <cell line: 12>()
10
11 print("Updating and installing pip packages...")
---> 12 subprocess.check_call(['pip', 'install', '--upgrade'] + pip_packages)
13
14 print('Packages up to date.')
/usr/lib/python3.10/subprocess.py in check_call(*popenargs, **kwargs)
367 if cmd is None:
368 cmd = popenargs[0]
--> 369 raise CalledProcessError(retcode, cmd)
370 return 0
371
CalledProcessError: Command '['pip', 'install', '--upgrade', 'pip', 'setuptools', 'wheel', 'httpx==0.23.0', 'faiss-gpu', 'fairseq', 'ffmpeg', 'ffmpeg-python', 'praat-parselmouth', 'pyworld', 'numpy==1.23.5', 'numba==0.56.4', 'librosa==0.9.2', 'gdown', 'onnxruntime']' returned non-zero exit status 1.
it should give you a link in the last cell
Updating and installing system packages...
Installing build-essential...
Installing python3-dev...
Installing ffmpeg...
Installing aria2...
Updating and installing pip packages...
CalledProcessError Traceback (most recent call last)
<ipython-input-40-46706f3aac76> in <cell line: 12>()
10
11 print("Updating and installing pip packages...")
---> 12 subprocess.check_call(['pip', 'install', '--upgrade'] + pip_packages)
13
14 print('Packages up to date.')
/usr/lib/python3.10/subprocess.py in check_call(*popenargs, **kwargs)
367 if cmd is None:
368 cmd = popenargs[0]
--> 369 raise CalledProcessError(retcode, cmd)
370 return 0
371
CalledProcessError: Command '['pip', 'install', '--upgrade', 'pip', 'setuptools', 'wheel', 'httpx==0.23.0', 'faiss-gpu', 'fairseq', 'ffmpeg', 'ffmpeg-python', 'praat-parselmouth', 'pyworld', 'numpy==1.23.5', 'numba==0.56.4', 'librosa==0.9.2', 'gdown', 'onnxruntime']' returned non-zero exit status 1.
i didnt run the last cell yet
should i?
yes
okay
Updating and installing system packages...
Installing build-essential...
Installing python3-dev...
Installing ffmpeg...
Installing aria2...
Updating and installing pip packages...
CalledProcessError Traceback (most recent call last)
<ipython-input-40-46706f3aac76> in <cell line: 12>()
10
11 print("Updating and installing pip packages...")
---> 12 subprocess.check_call(['pip', 'install', '--upgrade'] + pip_packages)
13
14 print('Packages up to date.')
/usr/lib/python3.10/subprocess.py in check_call(*popenargs, **kwargs)
367 if cmd is None:
368 cmd = popenargs[0]
--> 369 raise CalledProcessError(retcode, cmd)
370 return 0
371
CalledProcessError: Command '['pip', 'install', '--upgrade', 'pip', 'setuptools', 'wheel', 'httpx==0.23.0', 'faiss-gpu', 'fairseq', 'ffmpeg', 'ffmpeg-python', 'praat-parselmouth', 'pyworld', 'numpy==1.23.5', 'numba==0.56.4', 'librosa==0.9.2', 'gdown', 'onnxruntime']' returned non-zero exit status 1.
dependies
the first
for now i guess? thats raven's settings
it doesnt change my voice..
i tried it
can you send your settings?
this is my setting
how to set low batch in applio
okay
i refreshed page by accident
i need to open it again
1 minute
ill try
Ayo? @lime dome level 4 !!! 
what did u do the f0 det?
rmpve
okay lemme try
thanks
it worked
it has a bit delay but
it works
but
its
uhh
sound comes and goes
ok i fixed it
yea, you should learn how to check these out
I can't really be going around and doing it for people, like I include all the info on such things everywhere on the server
aaaand, am currently busy with new saturn port
yeah ik
I can already tell it’s not overtraining
I trained this once last night but it closed for some reason so I lost all that progress
but I went to 500 epochs and the graph was still heading down with never a spike up
because you have it smoothed out
set smoothing to 0 or 0.2 and you'll see more
how muhc u want me to keep it at
than that
you gotta select that middle icon under the graph
as I said
it's for scaling
aside of the sort of flat-lining behaviour
normally you'd, most likely, search around here
soo
no, it means you can't effectively evaluate the ckpts based on that graph
maybe ask @ lusbert
im gonna train another 30+ minute
on how to sync the graph
studio stems
without it, you can forget about tensor
as for me, I am too busy to explain it atm, that is
well, as i said, it doesn't matter, rise or not
whatever you'll see is inaccurate
o