#✨│ai-help
1 messages · Page 198 of 1
And the average loss from my fork ( you check given epoch's performance )
and if it clicks for ya ( you can tell it does well ), you pick that one
my small brain and english already melted sadly
Hmmm.. well, can always go live-stream in a call and show you what's what.
You can just listen if you wish
But either way~ that's that
if i understand these by myself it will be better i guess
ill check tutorials and your messages
Got some material prepared for such cases
A lil outdated but should still do, for the basics' sake
damn
probably but i am totally new right now
there is so many graphics in tb
well that's the reality
or you can consider my rare as hell offer
just happens to have a good test subject rn ( training a model rn
maybe you can create a youtube video about this
instead of teaching ppl 1by1
yeah but sad reality is, majority of folks aren't even willing to read a bit of text
and I just feel like my effort would go down the drain lol
Anyways, like said before. That's all I can say or do atm. Rest is up to you buddy
Np and alrighty, best of luck ✨
yuh, take it slow in your own pace
( courage msg: Took me 1~ month to get handy with it, when I first started
so dw if you can't at first
i already created one model from my friends discord calls
is links allowed?
i covered a song with it maybe u can take a look
-# not me taking 3 months just to download the tb
A, if you want me to evaluate it / review, yea go ahead

The sound is a bit muffled because it's from a discord conversation
a
seems like it, but discord itself can degrade audio quite a bit at times
Either way, it's quite good actually for your first model ( is it?
yeah i excepted for scream sounds but i got actually song
its my first model but i already used dozens of ready models
Ahh, ye
https://www.youtube.com/watch?v=THxNdEsSGLA
Here's mine, first model I've made
Ever wanted to hear Kurisu singing a catchy ~totally not dark~ vocaloid song? there you go ¬
Overal, this AI voice model is a part of some bigger AI project I've been planning since a while, codenamed " Project Amadeus "
but it's still in an early concept stage so that's that.
I'm planning on covering quite a lot 👀 so, stay tuned.
� Cover deta...
but then, it was a prototype + my learning / test field model, effect of 2-3 months of work ( 40khz model, 10 mins )
shortly after, 48k training became available
what does do that 32k 40k thingies
This is my current best ( and 2nd ever made oh irony ) model:
https://www.youtube.com/watch?v=2CW2Nyhtio8
One of my fave tracks from Eve. Remember back in my worst days I used to spam it a lot. Oh yeah, I kinda love how I don't have to readjust Kurisu's pitch with Eve's stuff, they just click on " 0 ". Enjoy ~
Original song by Eve and all people associated with the project:
https://www.youtube.com/watch?v=nROvY9uiYYk
� Cover details �
Inferenced ...
Yea those are sample rates
32k = 32khz = 32000 hertz
there's 32, 40, 48 and soon hopefully 44.1khz ( Me, Noobies are working on new rvc pretrained models - those base models we all use to make our models ( the process's called finetuning btw )
i can feel the improve
Yea, for a 10 min model, you can hardly find any that good sounding
probs one of the best around on stock pretrains
what it depend
It depends on what the sample rate of the audio you work with is
we have 44.1khz ( cd / standard music high quality ) and we have 48khz
what is sample rate 😭
it's related to audio really
im clueless rn
Means the amount of ( audio samples ) samples present in audio per second
higher the sample rate, higher the fidelity
but if we speak of 44.1 and 48, 90% of people can't tell a difference unless:
- they're working with audio
- are sensitive
- make music etc
basically, best way for newbies without some more advanced software, would be to use spek
its frequency of speak or smth
my data was non stop talking cuz i cutted the blanks
as you can see the vertical axis shows frequency range
measured in khz
( but given the nature of spectrograms, a form of representing the audio as you can see ) you wanna x2 the value
so, the audio file itself is 48khz since 24khz * 2 is 48
this looks to advanced
then you can see where on avg it reaches
for me rn
Yea, as mentioned before, take it within your pace
I'm mentioning it only because you asked
not that you should learn it rn or at all
Should be matched with your audio's sr ( well, more or less. some deviance is allowed this or that way
then you can get sub-optimal performance or, plain bad performance
it's actually pretty simple tbf, you drop the audio and look for this range
in this case, it'd be around 24 or 20khz
so you can go 2 ways: 40k model or 48k model
no black magic here
yuh, it's free btw
Just one key thing to remember is the sample rate of spectrograms ( mel spectrograms to be exact )
( they can vary in colors and range, depending on what program you work with )
should be *2
and that's pretty much it
Here's the spek
https://www.spek.cc
the rainbow one
is basically the same thing, just different color scheme I use
as I prefer it that way ( I can hear more details with varying " heat " in colors )
you can see the khz / hz axis on the right
and volume, in decibels ( db )
But that's iZotope RX program, advanced one for audio engineering / recovery / editing and so on
tbf, even audacity can preview spectrograms ( Audacity is probs the most know user-friendly ish program for audio editing and stuff ~ free as well )
i need understand these first
In any case.. here's some terminologies you can check within your free time as long you feel like it:
- Sample rate in audio
- Mel spectrograms, what are they?
- Tensorboard and models' overtraining / undertraining ( overfitting / underfitting - same terms, can be used in-place )
yea, again, your own pace
I leave it all just in case if you wanted to recheck in future
well, I won't hide, my english has gotten pretty rusty ( spoken, that is
don't feel confident enough lol
Oh yea, good point. But I don't quite like taking money for it
I was in your position, right, I was frustrated with lack of easy to take materials around
My goal in here, as long I feel like helping, is to make it as simple as possible yet without too overly-abstracted methodologies
and most importantly, not gate-keeping or pay-walling my knowledge, provided an individual is willing to give me and my teachings some time, just as I dedicate my time for them
Yea, np man. Best of luck and take care
Hi, I’m getting this error when running TensorVENV.bat why ?
File "C:\Users\mathi\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorboard_vendor\html5lib_trie_base.py", line 3, in <module>
from collections import Mapping
ImportError: cannot import name 'Mapping' from 'collections' (C:\Users\mathi\AppData\Local\Programs\Python\Python310\lib\collections_init_.py)
Keeping the command prompt open...
Appuyez sur une touche pour continuer...
you should have just a regular tensorboard
not tensorboard_vendor, tbh
also since it is installed in global repo (under C:\users) you dont need a virtual environment activation (venv)
Hi guys, I have a question related to TensorBoard.
Does the lowest loss/g/total means the best quality?
how ?
pip uninstall tensorboard_vendor pip install tensorboard
lowest loss/g/mel
then you can just run it like tensorboard --logdir=X:\Applio-MRF\logs
I’m still getting this error.
File "C:\Users\mathi\AppData\Local\Programs\Python\Python310\lib\site-packages\tensorboard_vendor\html5lib_trie_base.py", line 3, in <module>
from collections import Mapping
ImportError: cannot import name 'Mapping' from 'collections' (C:\Users\mathi\AppData\Local\Programs\Python\Python310\lib\collections_init_.py)
C:\Users\mathi>
uninstall tensorboard_vendor
#✨│ai-help message
#✨│ai-help message
and foremostly:
#✨│ai-help message + pics associated with the msg
Trusting solely g total loss isn't the most ideal if you aim for " best quality "
In your TensorBoard,
Would it make sense to fuse your 6k FM with 14.5k model to get the best of all?
doesn't work like that sadly
sure, you'd reinforce traits good traits A but you'd also reinforce weak sides of each, traits B
You're kinda looking for a converged ish balance
Pretty much
can i use fp32 for model training on my rtx 4060 8gb? 👉 👈
Uhh, you can try
but quite likely you won't be able to go crazy with batch size
on my 12 gig 3060 I can squeeze typically 12-13 bs
( I'll see what I can do further in my fork with memory optimizations
should re-add in-place and gradient checkpointing maybe... perhaps would help a tiny bit in terms of memory
i always use it on 8
Well, then you should most likely be okay
is there a huge difference between fp16 and 32?
or you can start retraining at 6k for different result?
Is it an option, to find a balance, yes, but a balance favoring either FM or MEL, whatever you like more, instead of avarage?
Look man, I was having your mindset a year ago and I'll tell you right away
even if I used to say one can merge
it is no more a method I favor nor recommend
experience and tons of experiments have shown me it is def not the best way to go as well..
What would your advice then be?
to focus on all rather than pick the models based on individual metrics
if one metric dominates in performance or is heavily undeperforming, something's bad going on
typically, after certain point, stuff should be somewhat 'balanced'
fm is your most important metric, this one takes a while to truly kick in
mel is pretty much related to mel spectrograms 'fidelity', you can think of it that way
but what influences it the most is how clean and uniform your dataset is
For example:
- if you mix sources in your dataset, mel could be having issues
- if the fidelity is initially bad, that on it's own also has it's implications as we're using pretrains that learned on, sure, not the cleanest but nevertheless, full ( frequency range wise ) audio
what about mri hifigan?
not usable, needs pretrained models, same for refinegan ( not shown on the table
okay, thanks
Tho, if you really wanna merge ckpts, I'd typically keep the ratio 70:30
where the base ( 70% of it ) is the FM one
you just " clear it up " with mel one
and one consideration: try to ( if possible ) have them both similar in terms of kl loss
else funky results can happen at certain inferences
#✨│ai-help message
It's the only thing I found about difference.
fp16 without proper gradient clipping and nan parameter checks would ruin your model's training
sure it is fast, but is it fast to lose 10 hours of training while saving 30 seconds per epoch?
Thank you for your thoughts
thanks for explaining guys🙏
@simple ore does applio use fp32 by default?
^
no, you need to switch it in settings then restart the app
restart by killing it and starting again, not by opening a new browser tab
does your fork didn't have "sync graph" option?
it's automatic
been that way for a while in applio generally
ah okay
i see your fork use fp32 by default
Gotta push newsest rev1 zip
yes
if you need fp16, just update the precision in settings
yes, im using the repo
Alrighty
Does this message regarding what number in parameters of KL, MEL, FM you should aim for still good?
#✨│ai-help message
Those were what I believed at a time ( half a year ago or more ) were right
but in reality, it's all per-case, individual
generally, lower the better
there's no ceiling or limits
😂 These time remarks about your past-self are killing me
i got this error when trying to warmup training
Did you enable averaging
no
then enable it and set it to 100
tho, just in case
100% sure you did not enable that box?
@latent cypress
Alr, seems like I'll need to update the logic
in any case, for your specific situation where there's 600 steps per epoch, you can try 100
you'll get 6 loggings of average from a given epoch
for training from scratch there is a huge difference and for that reason fp32 is always needed in that specific case
for finetuning (when we train using a pretrain) there's barely any audible difference at all/none
as you're actually meant to use it
there's absolutely no penalty from that, in case you're worried ( and it does not affect the core training itself in any way
it worked now
Ah ye, gonna patch it today then
seems like instead of disabling the mechanism ( if it's unchecked ) it's still activating and since averaging involves division
default value 0 cannot be divided naturally
yea i also noticed that you cannot start the training if you leave that option unchecked
Yea, overlooked it I suppose or rather, something unexceptional in args passing happened.
Gonna look into it in a bit
now my model is took 104 steps per epochs, i thought the training will be stopped when the warming up done (i set it to 2 epochs)
wdym?
warmup epochs =/= training epochs
average steps =/= training steps
warmup should be 5-10% of your total chosen epochs for training
average steps ( n value ) should be anywhere from 10 to 35% of your total steps you get per epoch
pretty much
my bad, i can't understand english very well 😦
Where is that from?
my fork
@latent cypress in any case, bug's fixed, gonna wait for 20th epoch to finish ( doing some test rn ) then ensure the fix work and uhh, ye, send you a patch
guess I forgot of one switch when moving my fork's base from rvc to applio, my bad ~
it alr has 50
if I get what you're trying to do, you want to keep only 50 there
and not use the mute files
i wanted to add 50
yes
ye, then pretty much just keep in there:
[50],
no need to do that
just add 50
it allows the small slices to be stacked
other buckets will be empty
so i js delete the mute file?
or should i delete them from filelist.txt
do this
dont delete the mute folder from logs
delete them from the filelist
not sure if you answered that the last time so gonna ask in case:
Doesn't it get re-added to the filelist again upon starting the training?
that behavior was present in rvc itself
as for applio ain't sure, haven't tested yet
before removing the mute files i get collapses but after removing them from filelist.txt the collapses stops and the training goes well
this is how it looks without removing the mute files from filelist.txt
oh, yea I know, we already do know that
I meant it more in a way where I haven't tested 'removing' the mutes in applio scenario
in stock rvc it'd readd the entries to the filelist
ooo i see
the mutes do play a role actually
if you have them gone and use silence in the training, well, weird shit happens
it's pretty much not the " best best " handler ish for silence
quite literally mute reference
And generally, given it's been mutes all this time, it is no more collapses
👀 yeyy

btw, fixed the bug

no, just when feature extraction ends
a, alright then. That makes it even better
so this method is to avoid collapses?
Well, you can try but I feel like getting rid of mutes will have certain consequences when model encounters silent things / has own longer silences
you can actually be a good in-field tester 🤔 if you want, that is
we'll see...
okay
Yeah, whenever I have time, my next project is to test it with your Fork, 8 batch as you recommended for 30+ mins.
Since you gave me those numbers for Warm up phase and average running loss to train with.
actually, perhaps 15 to 25% of epoch's steps would be better
just divide your steps by 100 and * 15 or 25 and that's the n value to use
also, in any case, update's in a sec, doing last checks prior to pushing
Thank you
Are steps in TesorBoard supposed to be steps or epochs?
steps
the logging itself is done 'per epoch', or should I say, per N steps that = 1 epoch
How about this @brittle wing
Does this look confusing to an average user?
or not really guys?
Okay, for some reason it shows epochs as steps for me, not sure why.
wdym exactly
check this
im trying to train a model and i need some help. is this a good spot for help?
yes, what do u need help with
I keep getting this error when im trying to train. i click "process data" and it says
No such file or directory: '"C:\Users\BLANK\OneDrive\Desktop\RVC/preprocess.log'"
why do u got a forward slash there
I dont know
remove it
how
thats not the folder im inputting
the path is "C:\Users\BLANK\OneDrive\Desktop\RVC Train\Mangio-RVC-v23.7.0_INFER_TRAIN\Mangio-RVC-v23.7.0\MODELTEST[vocals]A.mp3"
Way better, so just divide your 1 or 2 N epoch by 100 then multiply 15% or 25%?
i enter that where it tells me to put my training folder
which this folder has only one file and its the voice
Nah, it's actually simple
example: your epoch is 35 steps, your .pth file displays it
you do 35 / 100 * % ( and my recommended %s are 15 or 25
can u screenshot
ss logs or the screen?
either way can i dm you? i dont want my name getting out (the "blank") or the voice im trying
okay
Does lower or higher percent mean anything?
Lower the better or higher the better of percentage?
not really, it quite literally means a percentage of your ( epoch's ) total steps
as for the size / number itself well
the concept is:
So, I think it's most common to balance these factors by reporting an averaged loss over N mini-batches, where N is large enough to smooth out the noise of individual batches but not so large that the model performance is not comparable between the first and last batches.
~ Some dude from whom I borrowed the idea
And for that, 15-25% range is typically okay
to not overburden the logging or cause any overly big mess
pretty much
If unsure which percent to pick, why not go for the medium of 20%?
ofc, you can experiment, if you want more fine logging, you can naturally do that
heck, you can even log every single step
but imagine the amount of points to analyze, pointless
that can be done too, it's just a general / broad recommendation I give
not enforced naturally
In fact, I encourage users to experiment to find what they like the best
afteral, it's just a diagnostic metric
and everyone might have own preferences or analytic capabilities 👀
Thanks for the update on Fork.
I appreciate it, thank you again!
it's my 295 epochs and it is exactly 295 step on the Horizontal Axis
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
@brave matrix
much appreciated
Np man
what batch size? what dataset size?
anything else I should know? ( what release you use of applio )
Maybe it's just my local colab thing. I didn't use applio.
Ill ping/dm if i need help. ill try to fend this on my own though. i dont wanna be a nuisance
Was using 16 batch size for 43 minutes of dataset.
eeee, local colab?
are you using a notebook on pc locally?
if so, why
well, google colab, but local I mean community like
a
well, something must be off with it then
what does it base on, applio? rvc?
if applio, which version ( same goes for rvc, the version
sure ping if u need help
rvc v2
well, weird to have it log every step regardless
the default behavior of rvc is to use 100 or was it 200 steps ( default )
and not 1 🤔
try some applio colabs maybe or something else, not sure what's currently recommended as I don't use colabs / cloud
but pretty sure there def are things to try out
Oh, are there any general rules between data size and what batch size to chose for it?
broad topic really
for now, as I'm a lil too busy to give full explanations
tho ye, it's advanced and rather than ' going by rules ' , there are principles to get familiar with, which in result allows you to understand what is what and why
if you don't wanna really bother with it, you should choose 6 or 8 ( 8 is a better / more universal choice ) and you should mostly be fine
unless dealing with some rather big ( and diverse ) sets or super small sets
Well, in files it does show that 295 epoch is actually 15045 step.
Yea that's what I mean, the behavior is not normal
Aside, it doesn't work in a way where user input or whatever is determining the actual real / file's steps
that's handled internally
What I meant by saying 100/200 is default, is for logging purposes
@analog obsidian @latent cypress
https://github.com/codename0og/codename-rvc-fork-3/releases/tag/v3.0.0-rev1
bugfix / update
many thanks
: ) 🔥
thanks 
Enjoy ~
Any thoughts on how to make FM fall?
@graceful patio You too should know, in case
uhhh, well.. It takes time
that's the main and most important metric
because generator learns from features ( FM )
Takes time, proper configuration and consistency in the dataset
Configuration and consistency?
well yea
batch_size for instance
consistency as in, the dataset's consistency
same source, quality, sample_rate, balance in terms of what is in there ( i.e.: styles )
( yes there are people who mix different sample rate files in 1 set )
never take a look on fm graph, is the graph normal?, all i know the graph should be fall
It should eventually start falling
if despite having arbitrary amount of epochs it doesn't
means something's wrong but unless that's the case, one should not worry and keep on training
The averaging includes all metrics from generator ( total g, fm, mel, kl ) and disc ( total disc loss ) so any form of training dynamics-balance problems can be easily spotted
How much is arbitrary?
got it thank you , just implemented it, training seems to be working rn btw (well the feature extraction , kinda slow , but it looks ppromising )
just got to see how it goes
The feature extraction speed isn't related to any of my changes really
in any case, it's on Applio's side
However I'll mention, my fork on default runs in FP32
So have that in mind
if you need to change the precision, it's in settings ( Recommend to restart the fork after switching it )
i literally only have one file from the feature extraction and its moving really slow maybe i should restart well do
got it
(in dataset i mean )
which shouldnt take long
to process
even in feature extraction
but let me check brb
Hey guys
I just downloaded the software and it sounds shit how do i tune it to sound good? Im currently on a 4060
tbh, i didn't expect the fp32 took that long to train with my 24 minutes datasets, it's like i train my previous model with 1 hours + datasets on fp16
FP32 is naturally slower and more intensive on memory
it is Full precision
Somebody help me please :(
not half precision like fp16, but that's too advanced and I ain't gonna go deep on this topic at this moment
is there a good youtube video explaining how to sort it out?
this is not a channel for realtime voice changer
oh my bad
Hey has anyone found the best pitch extraction for gravelly/deep voices? For me they always come out very mechanical sounding.
Hello, this model has overtrained after 15k, right? I'm not the best at detecting when a model is overtrained but I am trying to get better at it.
looks like a pretty small size of the dataset, right?
Correct, about 19 or so minutes
yeah, about 15k then
Thank you!
what gpu?
rx 6600
you either 1) did not do hip sdk install correctly
- running out of system memory
for 6600 you gotta use hip sdk 5.7 and proper libraries
Reminder on why it's worth considering the use average running loss + normal vs just normal:
you can see the disproportions in feedback
( training early on but that's juts a demontration ).
i did this and i get this error: Traceback (most recent call last):
File "C:\Users\biric\Downloads\RVC1006Nvidia\RVC1006Nvidia\infer\lib\train\process_ckpt.py", line 66, in extract_small_model
ckpt = torch.load(path, map_location="cpu")
File "C:\Users\biric\Downloads\RVC1006Nvidia\RVC1006Nvidia\runtime\lib\site-packages\torch\serialization.py", line 791, in load
with _open_file_like(f, 'rb') as opened_file:
File "C:\Users\biric\Downloads\RVC1006Nvidia\RVC1006Nvidia\runtime\lib\site-packages\torch\serialization.py", line 271, in _open_file_like
return _open_file(name_or_buffer, mode)
File "C:\Users\biric\Downloads\RVC1006Nvidia\RVC1006Nvidia\runtime\lib\site-packages\torch\serialization.py", line 252, in init
super().init(open(name, mode))
OSError: [Errno 22] Invalid argument: '"C:\Users\biric\Downloads\RVC1006Nvidia\RVC1006Nvidia\logs\aktaneren\sess\G_630.pth"'
can anyone help me about this?
Hey, Feldas! Please use the command !howtoask to increase your chance of getting help by structuring your question in a way others can understand better. Also make sure you're asking in the right help channel:
- General RVC help: #✨│ai-help
- W-Okada / Realtime RVC: #🔍│help-w-okada
- AI image related: #🔍│help-ai-art
i couldn't find the second .pth in weights folder because i didn't select the "To save a final small model in the 'weights' folder at each save point:" and i stopped the training middle of it. So i dont have the second pth Therefore i did the ckpt processing from the bottom line. now i have this error
you won't get a 2nd epoch from G file
1 G/D pair " holds " only 1 epoch
If you stopped mid-way during 2nd epoch's generation, it is gone
consistency over the quantity, for example in this case, the former part has lower cutoff and might be from another source, so you'd better yeet it out
that data does not exist as G ( and so, D file too ) did not get updated / overwritten
As for why you get the error in the first place, I'd need to see your settings / params you set. Send the screenshot of that section in the ui + provide training details
actually i had like 50 epoch
well, in that case you can't get it either
a
wait
you're doing it that way
well, show the screenshot of the ui
what do these box say
upper and lower one
Been a while I've seen that section in stock rvc
uh, tho man
do not use " "
provide the path as it is
without quotes
Also, pick the save name ( mode info box leave empty )
bro i tried everything
Do it and show the log
and now it solved
a
omg
well, then there's that
I'd suggest these options, the first one so that it will save and overwrite G_2333333.pth and D_2333333.pth to save disk space
i tried everything xdd
enjoy I guess
thank you
np
^
unless you do pretrains, which I doubt you do
If you've solved everything by yourself, then you don't really need me to do that.
save every small weight + only latest G/D weights
yeah this was my first time i wil do these settings ty
Good luck~
ty you guys too
First and foremost, do you have a gpu?
star
@restive mural In case you don't know to check it, let me know
Well no offense, but it'd be appreciated if you stayed as an active participant of the convo if you're already asking for help or a guide
This gif had me dying 
I'm in this picture and I dont like it.
does anyone know what configuration I should use on wokada? I have a ryzen 5600g, rx 6600xt 8gb, 32gb ram
Cray. 
Go #🔍│help-w-okada for W-Okada. This channel is for RVC.
nope
-gui
Super outdated
For Inference (use models) Mac, You can:
- Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides, probably won't be able to train, make models):
- Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Applio Colab: max 4 hours, not granted, of GPU
I got the path file and CKPT but no index, what do i do?
my ai model can't do high notes (specifically above G5) without cracking like crazy
how do i solve this issue? does this have anything to do with the pretrain model or something?
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
are you training it yourself or found one online?
you need only the added index and .pth
It could be related to your dataset not having high notes
A .ckpt file? No way, that file name extension is used with some Stable Diffusion model.
^
The ckpt file doesn't seem to be used anywhere in RVC and W-Okada, unless you're refering to pth which is the primarily file extension for RVC voice model.
ckpt is a model checkpoint file
safetensors is a safer version basically, which is usally the standard now
RVC uses Pytorch models format, even if it would be better to switch to safetensors tbh
-gui
-svc
Ckpt also used for certain LoRA models. 
more likely a GPT sovits model, yet many ppl still trying to use it for voice changer
singing pretrain like klm 4.3, pitching down the source a bit
Found it on this server's voice models channel
Send the link
you should see if it is tagged GPT sovits
how do i copy my link in weights 😓
Why do you have a gif of me? 
u mean ur profile link?
The problem I have is the prior setup with pytorch and all of that, why can't it just all be done already so it's easier
There is no problem. You run 1 .bat for installation, one .bat for running
Friends, I need help, a group of people and I are putting together a dataset to create a pretrain with Brazilian voices, but how do you train a pretrain for the RVC?
it's as simple
Oh yeah it is nevermind thank you
Yes. Byw thank you. I got my answer 😊
Getting runtime error in appio on kaggle
What to do now. Please help
This is my data transfer out limit
is anyone here i am also getting same error
Do you got any solution ? If yes please tell me
not yet , but trying to get in touch with any tech support who can help me out with this mess
@glad zealot what was the fix you did for fixing this in mainline?
it's happening to other notebooks too rather than just mainline
prob should tell vidal to use hina's fix
@carmine siren @latent kettle do yall get this while trying to train?
yes, i don't know what going wrong its auomatically intrupted the trainning process with 'run time error'
It's an issue with the code of the notebook, the mainline notebook fixed it recently
I completed 200 epochs amd then decided to train it more on 300 epochs.
but i'm guessing that vidal's applio's notebook is affected by the issue
Then on 260 epochs this error occurred
"It's an issue with the code of the notebook, the mainline notebook fixed it recently"
Guys, let's say I save every 5 epochs. Is there a way for me to retrieve let's say 3rd epoch?
Nah, simply train every single one
and if memory is your constraint, do like I did 1 year ago
supervise the training and actively delete crappy ckpts to save the memory / move all epochs periodically to some other drive
abstractions??
Good idea
You'd delete it once you're full, your storage obviously, it doesn't go permanent
💀
"if i download a file and then delete it, it technically isn't storage anymore, it was memory :D" follows that same logic
Memory and storage is used nowadays interchangeably, you're making it difficult by having an issue with simplifications
No the fuck it isn't
By definition naturally memory refers to temporary held data
You're actually insane
And you're supposedly an engineer
Also learn some respect towards people with higher rank in here
Please learn to be smarter
Because you're having an ass pain due to some terminology
Your rank does not correlate to respect
And regardless of respect, if you're wrong you're wrong.
You're a member on this server and you willingly accepted to follow the rules
It is in your duty to uphold, essentially what you agreed to or either leave
"If your memory is your constraint supervise the training and actively delete ckpts to save memory (WHICH ARE FILES NOT CURRENTLY EVEN LOADED IN MEMORY) / move all epochs periodically to some other drive" Great so you're taking about storage, and you mean storage.
Hmm show me the rule that says I must respect and agree with every thing someone with a higher rank than me says
Regardless of them being wrong
I said I used an abstraction / mental leap
And you're practically having a pain in the ass over terminology
You literally cannot
This ain't your essay or school work
Memory is not storage
In the same sense of permanent storage
Such as a drive
You literally said "If your MEMORY is your constraint" REFERENCING, RANDOM ACCESS MEMORY. "Delete ckpts to save memory" which ARENT LOADED IN MEMORY
Bro, you're literally making your position worse now. You're disregarding my statement on using an abstraction for the sake of simplifying the explanation
Your abstraction doesn't work when you referenced a pc component such as memory being a constraint..
You know you're wrong and you keep arguing because you can't admit you made a mistake? I'm out.
Nah I am not wrong. I literally said I used an abstraction / thought shortcut, if you can't accept that, this is only your own issue
"If you have a memory constraint, delete stuff off your ssd / hdd" 🤯 🤯 🤯
Alr, here's the thing. a fucking ssd, is a flash memory 🙂
you're the fucking dumbass in here 💀
end of the debate.
This is your last warning
people of your sort with 0 respect and toxicity are flushed instantly
no one is going to assume you mean flash memory when you say memory ❤️
Memory can be of any sorts my dude
You can clearly read the context
you lost your rocker because I called you wrong, and you went on a rant trying to prove you're right.
or are you a roblox brainrot
Nah, I have issues with people like you who can't read fucking context
You lost when you said "Memory and storage is used nowadays interchangeably"
Acting like they ate up all the knowledge despite 0 shit contribution nor understanding on what they even work with
If I walk up to someone and say im having issues with my memory, their first thought is about random access memory
im wondering, why would we be talking about ram?
Not flash memory
Honestly, if you wanna compete or whatever for the sake of your ego, go ahead and learn machine learning basics
because all of what you've been talking about is terminology so far
hold on let me be you for a second
💀
Since practically, right now? at given moment? your knowledge is of 0 worth
for me
Actually, there's no point producing myself on brainrot kids. Bro watched an anime 'bout Russian girl and a school genius and thinks he's the smartest, 1 step ahead
the fuck
Quit
glad we're bringing personal insults into this, shows you can't argue.
Getting to your level of cognitive abilities
and IM the one not showing respect
yea, I have no intents of respecting a person disrespecting me
LOL
having a differing view of something which turns out to be factual, is disrespecting you.
we in a dictatorship with this one
Nah, this is not factual, you're just an extremist when it comes to vocabulary or terminology
You can clearly see the tendency of impractical / mistaken interchangeable usage of words " storage " and " memory " especially when it comes to flash memory or phone's storage
I used abstractions of " temporary " in a sense the epochs get deleted shortly after, but you make it seem like the end of the world
starting a drama
You're in the wrong here
great, so lets say you did mean flash memory when you said memory. ❤️ you assumed he had an ssd
still haven't answered that question
Ofc, what is so wrong to assume having an SSD in 2025~
Don't make it seem like " gosh, you assume my gender " 💀
that's a fucking clown fuckettry
what is so wrong to assume when someone says memory they are talking about random access memory?
who the hells first thought is an ssd when someone says memory
Memory dosen't have to be ram, it can as well refer to brain's memory
it is about context
LMFAO THE CONTEXT IS HOME COMPUTERS
You should understand it well being a slavic language user
As far as I know, In russian there is no separate direct term for mass storage / pc storage and memory in a brain / biological sense
My primary language is English, thanks though.
Well then I shall let you know In Polish that's the way it is
so especially in my language, the terms are used interchangeably
yet, in this particular context, being the convo / my help for the dude, it was a thought shortcut
and you make a world-end's drama outta it
Welp sorry pal but you're speaking english. When you say "memory" the first thought is random access memory.
I shall let you know that the behavior you're presenting with being overly fixated on terms is pretty unhealthy
Once again, this is fucking discord full of newbies who rarely even realize they have gpu, if I decide to use the words or terminology I deem right, that's what I do
I shall let you know that you were wrong whilst writing the sentence, and then backpedaled the only way you could. To protect yourself, why? dunno.
and it is not up to you to correct me AND expect me to follow your rules as you plead
Holy shit wow you proved my point here
There is no point to be proven, you just like attention and being the smartest guy
If they can barely realize they own a gpu, then how the hell are they going to assume you mean SSDS when you say MEMORY
if you really wanna be smart? shut the fuck up, learn and prove your worth in here
And following that, aim for engineer
before that, your worth is of 0 to me as you've displayed no respect towards my person
🗿 I don't care about cosmetic discord roles
simple as that.
Nah, cause while you're being a drama queen here
It's okay, we know you can't admit you were wrong.
Go and search through this discord, I can quite confidently admit when I am wrong
we know you're a factual person already
See how simple it is to have you satisfied? 🙂
wish your parents did a lil better job
honestly.
has nothing to do with satisfaction
you flexed a discord role
and then told me i have to respect you because of a DISCORD ROLE
sounds like you need satisfaction
I ain't in a need of satisfaction, in fact, I'd be more grateful if shitheads like you weren't a part of the community
quite openly
just saying
^
too bad on ya my man, time to explore the internet or xda a bit better
just saying
😭 Yes they are used interchangeably, but not when you're talking about fuckin home pcs and people who are unknowledgeable
See, that's the problem with you, within majority of communities, the terms are used interchangeably, we are not talkin about the proper terminology
Literally no one here will say "yup you're talking about ssds" when you say MEMORY
ITS BASED OFF THE TOPIC OF THE CONVERSATION?
LOL
Context
but people of your cognitive abilities seem to not be able to comprehend it
too many neurons fired in the same time 💀
do you happen to be on the UV wave length?
lmao
yes im all seeing
honestly mandi, you remind me of that one ex that wasn't capable of getting any slightest bit of context or conversation happening at a given moment
any form of cognitive reasoning
you remind me of someone who can't have an argument or conversation without bringing up personal and non personal insults.
You disrespecting me was personal
Because if this is your entry attitude, first display
it is personal
I quite literally explained you what I deem as personal
you quite literally explained that the majority of the people in here barely realize they have a gpu, but for some reason want to assume that people are going to think of ssds and flash memory when you say memory. and not think of random access memory sticks
You disproved your own statement with something prior you said
like I said show me one personal insult ive said
Because up to this point, all you've really displayed is:
- You can't read context
- You're a type of drama-queen you'd observe when it comes to " phrasing, terminology and grammar ", close to grammar-nazi levels
- You're uncapable of connecting the topic of conversation with flash memory ( assumption from my side or not )
- You do not respect people and have no issues breaking the rules
some people just dont know when to stop 🙂
when have i talked bad about or insulted, disrespected. anything you like, or do, or have. outside of this conversation.
because i can call a few times you did 😂
go to dms please @crystal gull @glacial pollen
Nah litsa, this is already over the top, 5 or 6 fucking douchebags within 4 or 5 days tripping me upside down over some bs nonsensical crap.
I ain't gonna go around and tolerate the absence of moderation
Honestly, if you intend on muting or banning me, go ahead, I do what I do because I want to and I ain't obligated to volunteer if 0 respect is kept towards my person
not when people of that sort are around
sounds like a trend..
anyways, does anyone use anything other than applio? if so, what would you rate it. would you suggest it?
original rvc got outdated after applio devs started to do code changes
you can use it if you want but the original rvc is no longer receiving updates
been 1 year since they've updated it
i see who initiated the argument
@crystal gull you clearly started all this and poked at him, you shouldnt have done that. he wasnt even being hostile towards you anyway, whats your issue man?
Like I get it all, but when a, pardon, dumbass calls me out like that for absolutely no reason
I trip my shit
This is the 4th or 5th time this week as I said
I am constantly short on sleep due to the work on rvc and quite nervous lately
alright relax, no need for that anymore
i see whats going on
I am just being honest and open, that's all.
Doesn't give someone an excuse to carry on and provide multiple personal insults towards me based outside of the conversation now does it? My end was simply an argument over the terminology used and how he could have better explained it. ❤️
And perhaps codename should work on not getting so angry over an argument no?
Nevertheless, I am out of the convo.
Ain't taking more disrespect or provocations from kids with attention deficit.
Do as you please if you may. Need a break
And also Codename talks so often about rules and following them, but after told to stop. Continued the conversation in the same channel whilst I ignored :l
Just saying 
sure you bring up a good point however, you were still being so condescending and hostile towards him, perhaps maybe you should learn to mind your manners
ps. An adult and comprehensive person is able to tell the no-cross line and effectively estimate one's limits
because people have different thresholds
it's a basic skill to learn and have
you clearly pushed him, i dont think you can argue against that
And telling people that they HAVE to "respect" him purely because of his role? And that we essentially have to agree and cannot argue?
Him flexing his discord rank is a bit much lol
No, it only mentioned respect towards the higher ups as the rules mention it
it wasn't " respect exclusively towards me ".
you are missing the point here...
Never once have I insulted to this level
seems like you want to overlook the fact that you initiated the fight
this is a dm, if you push me forward, don't expect nice treatment
if your goal is to poke a frustrated person, that's what you get, being fully aware
And it'd be nice to not cut the context.
end of the debate.
oh well he called me "fucking stupid" at the top aswell
Idk what yall on but poor the people who come here for help and see the latest argument 😭
😭
you started the fight, does that make sense to you?
yea, as I told you Mandi, if you have no respect towards me, I am not obligated to show any towards you
💀
from help-rvc to help-arguing it took a single word man 😭
Don't expect kissing your cheek or pampering you if you're an ass towards me
not everyone agrees, and I get that, but that's how I work
seems like you want to overlook the fact that regardless of who initiated an argument, both parties willingly sent messages back and forth and participated in an argument...
end of story you literally told us to stop arguing, I did. Codename continued and made it a bigger deal in the public chat when it could've been over with.
U looking for help?
This new Ai Hub By Weights Drama Situation Is Insane
😭
alright relax just go now
Alr, you handle that.
mhm
Anyway like you told us, I'm done and I'm out. It's not my issue, I didn't wish to carry the conversation on here any longer after being told off to go to dms.
the drama, it's massive
You do always have good story to tell about your year-ago self, huh? 😄
habit, do not mind that
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
I think it was the canvas render? It needs to use argb instead of rgb in matproto lib or whatever that was called, then change the canvas size to dynamic instead of fixed for some reason
You should tell that to vidal too
The error on mainline only happens when training the model btw, doing everything else (process, feature extract, index) is fine
Guys is this overtrain? in g total it has been going kind of flat in g total for 2 hours, I used a 1 hour 30 minutes dataset
https://ibb.co/0YfxqJB (d total)
https://ibb.co/MDgYswZ ( g fm)
https://ibb.co/pL73YMB (g ki)
https://ibb.co/n8V21dF (g mel)
https://ibb.co/2hbh6vq (g total)
yea looks like it stopped improving
alr, it stopped around here right? https://ibb.co/MZn2wGX
yup, after that the model seems to be close to converge
not really overtrained
oh should I let it continue then?, I was aiming at 1000 epoch, or should I use it already?
you can wait until it fully converges (graph becoming extremely flat) or stop the training now and hear the epochs and see if you like them
tho the improvement is very minimal
not audible
Ok thank you so much 👌
no problem, in cases where my graph looks like yours i'd rather stop the training since is not really worth to wait it for fully converge
I don't know if you were trolling around or like you just didn't know how to talk to people. But like please understand that not everyone here would understand you as you do with others. On Discord, the message you sent, the message anyone sent, can be misinterpreted sometimes, you know, it's the internet right? If you come here to get help on AI, please talk like what a normal person would do. And sometimes, some people aren't really in their mood to talk, so like please give them some time to relax. Ok?
Nah, I'm way too late to respond all of this, man. If I had time to talk to him, things might have gotten worse because of the way I typed. I'm done doing dramama since the previous AI Hub. 
Sometimes, y'all gotta take a break from your screen and AI for a while to focus on your health. No offense, this is my recommendation. I've done this one time between after the beef with a kid and before the previous AI Hub server exploded. 
can i get help from learning how to get a ai voice changer
RVC the audio conversion or W-Okada the realtime voice conversion?
rvc
-rvc
- How to use RVC Mainline Colab by Cauthess
- AICoverGen Colab Guide by Eddy (Spanish Helper)
- Create a model with RVC disconnected (colab) by Angetyde
can we call so i can share screen
Would you like to run RVC on your PC locally or on a cloud service?

add meh
What GPU does your PC have? Just wanna make sure it's newer than GTX 10xx series to be able to run RVC.
This is the minimum GPU you can use for RVC and any AI program related.
Applio is a recently developed fork program of RVC GUI. Download this one.
I don't usually join in a DM conversion as to avoid spam, but anyways.
First of all, download the complied one of Applio. https://github.com/IAHispano/Applio/releases
wut do i do now
Is it finished downloading Applio?
yessir
Extract the zip to your directory like C:\Applio or D:\Applio.
Double click on run-applio.bat.
What? Did it show up any CMD? If not, then try run it with CMD and screenshot it.
For real
But you did the opposite regardless, this is just excuses
nvm it work
It looks like you didn't extract Applio zip into C:\ or D:\ but instead a Downloads folder, which can cause Applio to be unable to run. It's best to extract Applio to somewhere like this.
Now this part of putting the voice model, this can be tricky as since all pth and index files are stored in \Applio\logs unlike any other RVC GUI.
k
Well, I don't know which person started it first. But if they wrong, they wrong. 
Find a voice model at #1175430844685484042 or search up on Weights.gg.
Extract pth and index files from voice model zips into \Applio\logs, click refresh on the GUI part and the list for voice model will show up.
Clearly you're blind, also quit bringing up old things already dealt with kthx ❤️
Bro my eyesight is above 100%
What are you talking about
You're being disrespectful
Again
Stop speaking to me or ill ping the mod for continuing drama 👋
You're the one who enabled it.
Stop speaking to me or ill ping the mod for continuing drama 👋
i give up tbh to hard
And you keep threatening and calling me blind
Got it.
Yebye-
This was like 96% of finishing up Applio, and you gave up on it? Alright, might try again later next day, so good luck.
That bro's expecting anyone to give him attention by fighting him. 
Wouldn't be surprised if he's just a kiddo on Discord.
who me?
The other guy.
o
Me or...?
u
The other guy you talked to.
Idk if they're a guy or girl but nvm it's just arguments over nothing

Mindless meaningless stuff
I think
There are far more important things in life, childish stuff
An unserious argument an X/Twitter user would do.
I don't use Twitter
You mean 𝕏??
@red kayak
I still call it Twitter...
stop bringing this back
Nah bro, I'm not trying to aggravate you at all. I'm just here to give my basic knowledge of installing AI, not a doing a drama.
How can I make ai vocals sound more realistic
train a dataset of 30 minutes and above
Why would you think I brought up all of this? I was just just too late to the chat and knew nothing about it.
I know that, I mean after inferencing when mixing the instrumental with the ai vocals.
oh for covers, proper eq and compressors should do the trick
and for the inference audio it should be clean, no reverb and no harmonies
How do I regulate a preset
I kw that too
i would recommend watching a youtube tutorial on how to mix vocals
How to set it up correctly I mean
So like please understand that even if my typing can be hard to understand for certain people, I would never do such a drama at all. 
ohh no idea 
How much reverb, how much eq and stuff
that varies depending on the song

this is out of a rvc related topic but more of a fl studio/DAW topic
Is there a channel
for vocal mixing? not here, sorry 
If it's about training a model, go to #📑│making-models.
Uh it's okay.
I already know how to train & resume.
-colab
- Applio, by IA Hispano Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- RVC Mainline, by Hina Google Colab
- AICoverGen-WebUI, by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
- UVR5 NO UI, by Eddy Google Colab
- UVR5 UI, by Eddy Google Colab
- Modified W-Okada's Voice Changer, Google Colab
- 🆕 FaceFusion UI, by Nick088 Google Colab
- 🆕 FaceFusion NO UI, by Nick088 Google Colab
- 🆕 EasyGUI, by Rejekts Google Colab
While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.
So using mainline is good for now ?
help
- Search for it in AI HUB Docs or Applio Docs. You will probably find your answer there 📚
- Ask for help in #🔍│help-w-okada if it's related to real time voice changing but make sure to read #1297207135469305866 first
- Ask for help in #✨│ai-help for general help, but use the command
!howtoaskfirst to learn how to structure your question properly and increase your chances of getting a reply - Last but not least, ask for help in #🔍│help-ai-art if it's related to AI images.
I’ve been messing around with regular version w-okada for a bit, after a lot of researching and testing i got it to sound good, but not good to make me want to use it in friends discords. i’ve seen youtubers use it that had much better results, what am i doing wrong? or am i not getting high quality voice models?
is seed-vc real-time changer better quality and worth downloading? i can barely find any info on it and only one vid in english demonstrating it. or is deiteries fork worth my time trying out?
also i have 4080 so hardware not a issue
This is #✨│ai-help which is a channel for RVC. For W-Okada, go to #🔍│help-w-okada.
RVC generating channel? A channel in this server or a YouTube channel?
This server
There doesn't seem to be a RVC generating channel in this server. Unless you're referring to #🤖│bots, which is where you use bots to make AI cover.
There used to be one where
I can use a command
choose voice model and put a YouTube url
and it makes it
oh I forgot
With the Weights bot
I haven’t made one in a while

Yw
A channel for "RVC generating channel" could refer to a now-deleted channel in AI Hub.
Never heard of one named like that
Maybe astralabs
Yeah, I assume if it used to be there long before I joined this server. Making another channel for that would be such a waste for this server, as #🤖│bots is one and only channel you can use bot for anything. 
My loss/g/fm is increasing. Everything else loss/g/total, loss/g/mel, loss/g/kl etc are going down. What should I do
this is the case where averaging the loss window may provide a better outlook
the normal logging happens at the end of the epoch, so the FM numbers are prone to wild fluctuations depends on what the model pick as the very last random sample
AI HUB Docs
