#✨│ai-help
1 messages · Page 131 of 1
still seems training good
so i dont know the ins and out o thse things
Is that because its a not so extreme curve up?
yea its not that extreme, let it train more so you can see more if its training or overtraining
alright
btw one quick question
I js saw that it reached about 20k, multiplied so I couldve chosen 40k, but I chose 32k
is that bad?
and does it have a big effect on the models quality
i just realized when it asked to put a preview of the model i put a vocal of a regular song without the ai
well, it would have been better to choose 40k, as it should have a bit more of quality being an higher rate so that contains more info
ppl will think the model is so good 💀
Should I retrain at 40k if the model isn't good enough in the end?
you could try yea
I already thought that too
This is the exact point there
Do I js go back and get the closest one I can find?
this one?
where can i download this voice changer?
I used this one https://github.com/w-okada/voice-changer
dont know if there are any others
ty
yw
@low shard
try the s6000 as it seems more closer
-rt
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
oke, ty
Did that
It sounds a bit breathy tbh
you should also look at few other metrics
mel, fm, kl and total g
just send me the tensor file if you want
Imma tell you where to look
breh, how to download on github?..
and highlight few zones to pay attention to
g total was shown on there
I need the file, screens won't do
how do I do that?
your model's dir should have a file that's .0 in extension
or either way, is named tfevents
if it's in your model's dir, then ye
mhm it is
then that's the one
its too big for dms, ill send quickly here and del
js say it when u downloaded
@graceful obsidian
did u get it?
nope
its sending
alr, let's see
I did
in g/total
Forgot who, I think shad? No idea but someone asked us to if we share files, delete them as soon as the other gets them
I mean yea but that's just graphs
nobody will do anything useful with em lmao
hold on why did you tell me to use 0 smoothing
For your visual ease rn I use smoothing
now, orange you avoid
green tells you something's already going on
as there's a rise, downfall and rise + flatlining
so rise, down and then another rise and flatlining is probably overtraining?
but where did it start overtraining?
FM doesn't really improve over the training
FM?
esp past 2k
now gonna do side by side metrics to show you what to be warned of
???
what is epoches?
I already told you I didn't understand most of it
it has the literal explanations
of what they are
you can't get it any easier than that, explanation wise so, please have a read
there's just no other way, even if you might not like walls of text
@leaden yacht here
I just don't quite know the hell's with fm
maybe mangio has reversed fm calculation? up vs down? but that'd be kinda pointless
might be it's the set itself that fm doesn't really improve but degenerates
is the fm bad then or smth?
I was always against automating of things that require human evaluation
sooo can't really say much about it
if it's doing conclusions and comparing graphs, using logic, I trust my brain
not machine
but if it goes up, down, up and then stays flat its overtraining?
It's not universal, it's usually that way
but in reality, overtraining has a lot of shapes and forms
If that is the same thing Applio uses, it's pretty bad...
can be G flat then down
D up n stable
G up and D down, stable
can be idk.. thunder like shape for D and flatlining above avg loss
G would be stuck in flatline
lots of possibilities
I can type out ckpts I'd consider if I was you
I saw that in here too,
yea
js before 7k mark
that's right
so I used the s6750 model
now, you say 6750
But in the dataset the talking was too breathy I think
and here is an example of why log interval matters
you have no metrics for step 6750 logged
closest to that is the one on the ss
but that specific " epoch " doesn't exist
it can be anything inbetween " inside the neural network "
so you won't ever know what the s6750 is, loss wise
this is what I have btw
aha
not only that, you save every 10th epoch
that's even worse for accurate models
You see, epoch, say, 10th could be hella different
to 11th
like, entirely different
and smaller the batch_size, more extreme the difference potential
ohhh
because again, higher batch_size = more samples shown to ai at each epoch
so I should also turn the batch size back to 8 instead of 12?
that'd worsen it
or the opposite
ah
I'd stick to 12 if you already have it working
save every single epoch
sync the log
damn, every single one?
but then, mangio won't allow for the third
I mean yea, every single one
scenario where epoch 68 is the best one and you save every, idk, 19th
btw I am planning to do that with another dataset then, cuz this one is pretty bad tbh
what u gonna do?
too much breathy talking
Then I have to backtrack too much
which could change it a lot
ohhh
well, then differently
good would be 12 epoch, but you save every 8th
you get epoch 8 epoch 16 epoch 24
but there's no 12th
and there is no way to get it back / extract it from G/D files
hold up, first gonna recommend you the ckpt to try
But I don't wanna save THAT often, can I js use 5?
s5934
youre gonna recommend me I the ckpt to try?
that's sadly not gonna help
ye, that has the best fm, that is, if you had the actual epoch
which you don't
so saving every epoch is pretty smart then yeah
my bad, and the best mel would be
5221k is the best mel
in normal conditions, I'd fuse such models
2 of em. Mel and Fm
the best fm? I thought it was only on the g/total thingy as the lowest point
Wait is higher or lower better for the fm?
nope, it's aligned with fm in this case
fm with G
( doesn't always occur, where metrics align
total G is a sumup so, decent mel but best FM
which you then merge with best Mel
tbf. FM
total G = Fm, Mel, KL
Going by Kl is almost never a good idea as sadly, those are the most tricky
so you have Fm and Mel left
Mel is the pitch I read
thats what I read at least
ah alright
Mel is really mel spectrogram metric
such as these
this is a spectrogram in mel scaling
you can easily test the thing by doing an inference on best FM and best Mel separately
and notice the subtle and extreme changes in clarity
what point was best mel?
best fm was 5934 which I dont have tho
so should I use 6000 and 5250?
you can try but it won't be the same
those are just different totally unrelated epochs
rip
Ayo? @leaden yacht level 11 !!! 
yea?
this is a 40k sample rate right?
mostly but ur supposed to pick the highest right?
reason why it shows the frequencies divided in two is due to " Nyquist-Theorem "
if you're curious on that, do some research I think
well, by the avg response from there
you can safely assume 32 to 36 maybe 37khz
so I should always pick the one most seen and multiply that times 2?
so if its 36k I should use 32k right
mhm
in specs like that, you take the khz and double it
so, total 24 becomes 48
total 22.05 becomes 44.1
I knowwww
but the file / meta sample rate =/= the fresponse range
but do I pick the highest point? or the one most seen?
is there an app that can calculate that for me? Spek only shows it a bit
let's put it that way
press P
few times in spek to change colors
pick those most vibrant for your case
for instance, this one
file is 44khz but the response is kinda around 40
there
see the artifacts?
and color vanishing
where do u see the response?
mhm
This is like 20.1 or smth then?
lower than that
I have a ryzen 7 5700x, should it still be running slow?
because you saw the peaks reaching 20khz or so
yet that's just peaks / some parts
the " body " itself is mostly around 17-18khz
but the peaks are the max
your words
I know
but you js said the response is 0 to max
all in all, you'll still do 48khz training
or either resample the audio to 32khz
you asked about what is the response
and so I told you
response in literal way
and I talk about avg frequency response
ohhh
which in your file floats around 17-18khz
I'll tell you more but that in a bit, going shop
so the average shown of the whole thing is the avg freq response?
so is this 32k?
that is just info valuable for you
not quite ai
it tells you which pretrain you should rather go for
orrrr to which samplerate you should resample it
brb
alr
The avg for me is about I think 17kHz, times 2 so I think I need to use 32k sample rate
hey shad i dmed you some stuff
Why sometimes when inferring does the pitch of the model go up? I'll refresh and then it'll work fine
yuh
seems like it
@graceful obsidian good news and bad news..!
good news - rvc opened
bad news...
- audios still fucked
- crashed when i clicked "start audio conversion"
yeah okay my pc is just incapable of running any of these voice changers
only explanation
message still got across
:(
im kidding idc
im just bummed out that it doesnt work
what did it crash with though
just stopped responding
surely there's something in logs
also when opened, my headphones' audio bugs out big-time and any audio that i try to play thru them is super loud and distorted
whats a good live voice changer
well shit
which start am I supposed to use 😭
theres 3 bruh
also they all just open in python and the read me files are in Japanese
Ayo? @wind cove level 1 !!! 
nvm ill figure it out
JSONDecodeError Traceback (most recent call last)
<ipython-input-25-929c96c23701> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:
5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end
JSONDecodeError: Expecting value: line 1 column 1 (char 0)
it just shows this error in [ https://colab.research.google.com/drive/1Gj6UTf2gicndUW_tVheVhTXIIYpFTYc7?usp=sharing#scrollTo=OVQoLQJXS7WX ] second step no matter what i put in there
Outdated colab. Type -colab in this channel to get a list of the new ones
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Is there any way to convert a 40k model to 48k for merging purposes?
how to make an rcv
good news for now (bad news is yet to be tested)
i did something wrong with installation
and am in the process of trying again
PERHAPS it will work
im not gonna get my hopes up
Let's hope for the best
will check up on here later
got some gaming session with the boys rn
W
its not going well
i think i gotta wait until i can get a new pc
cause this rinky dink laptop is temporary
we got the voice changer working but somethings up with his sound devices ngl
i mean u could have a look at it code if you wanted to but ive never experienced this before
think bigshot will just get a new pc and do it on there
i think rvc disconnected works on mobile in general
https://docs.aihub.wtf/rvc/cloud/rvc-disconnected/
Last update: Mar 8, 2024
@graceful obsidian sum frequencies above 20khz, would you personally train 48 or 40. want some of your thought process
I would highly suggest 40k, not 48k.
Also, codename is rn playing videogames with his friends.
what are the best settings for the rvc voice changer ? it keeps sounding so fake
RVC Guides (How to Make AI Cover)
Translation by country
You should ask that in #🔍│help-w-okada , and I think you should set the wait time/delay (ms) to a higher number, it will take a little longer but will give you a better voice output
That usually makes the voice sound a lot better
40, ye, like Leo mentioned
if it's just " slightly " above 40, then it's not worth the bad convergence risk
if it was like 42~ maybe 43khz
or 41 with dominating " bright sibilant " zones reaching sub 44-46khz ( ye, such cases can happen ) then I'd go for 48
tl;dr:
-
tragic / some game-sourced or heavily compressed audio of which freq. response avg. at 28 to 35khz = train 32k models
( sometimes 48k works too. For instance, in visual-novel-sourced audio ) -
yt / aac / ac3 / opus / vorbis sourced audio - 40k models ( or in really terrible cases, go for 32k models )
-
44.1 files ( some uvr / isolation cases ) where freq. response range averages at 38-41khz = train 40k
-
44.1 files ( lossy codecs; opus or some vorbis ) where freq. response range averages at 41-43 ( or plain 44.1 ) khz = train 48k
-
48 files ( lossy codecs; opus or some vorbis ) where freq. response range averages at 41 to straight 48 = train 48k
-
lossless pure 48khz and 44.1khz files? = train 48k models
@peak tusk I got a list index error when trying to use the Titan pretrain on RVC disconnected. I'm assuming it's not compatible for 32k sample rate yet?
kk, but ov2 still hasn't been fixed either
with the same list index error
No worries, I can wait a bit longer if it's not an easy fix #✦│chat message
Is it possible to change the sample rate of a model?
so when im using the voice changer, everything works fine but untill someone is super loud it playsback and sounds so wierd, anyone help?
your mic picks up your headphones
if you use wokada move s.treshold further to the right, maybe enable sup2
my thresh is at 0.001
Hi, i tried searching around a bit before asking this since it could have been asked already, but i didnt find something that could help me.
Is there a collab or offline rvc gui that could let me select multiple audio files? i have a lot of audio files which i want a model to use, but with all the stuff i've already used its all just, select an audio file, convert, and so on
i tried the sup2 thing already, dosent work
then lower your headphones volume or move it further away from your mic
or add noise suppression to your mic so it doesnt pick up everything
not much else to do
im pretty sure you can dump a folder path and it does everything. not sure which one but it exists
mmm... do you remember if it was one listed in the -colab command?
hi what is the comand for search ai models
Applio has a batch convert, will see if it works
why the search application doesnt work?
Ayo? @brittle wing level 1 !!! 
it says The application did not respond
Whenever I run go-realtime-gui I get this error:
C:\Users\Toby\Desktop\RVC\RVC1006Nvidia>runtime\python.exe guiv1.py
Traceback (most recent call last):
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\gui_v1.py", line 59, in
import torch
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch_init.py", line 122, in
raise err
OSError: [WinError 1114] A dynamic link library (DLL) initialization routine failed. Error loading "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch\lib\cublas64_11.dll" or one of its dependencies.
I have CUDA 11.8 installed with Pytorch 2.3.0. I've added environmental variables for CUDA bin and libnvvp. I've also tried reinstalling my drivers, Python and Cuda.
it worked ty
Hi, how can i use the voice changer in google colab?
Real time or covers
real time i guess I just want to congratulate a friend on her birthday
.
-realtime
This interaction has expired, use the command
/guides realtimeif you wish to see it again.
ty so much
guys wich one ?
og u cant do pics
Select the pitch extraction algorithm ('pm': faster extraction but lower-quality speech; 'dio': improved speech but slower extraction; 'harvest': better quality but slower extraction):
wich one do i choose: pm, harvest, dio, crepe, mangio crepe, rmvpe
For 9 minutes of data and a batch size of 8, how many epochs would you guys recommend?
im lost
if anyone see's this how do i do step 2?
i run it with the correct link and it loads for a second
then some code appears for a second then goes away
then i get a red ! beside the button
@odd shale can you help?
Ayo? @brittle wing level 1 !!! 
Thats very old and outdated
99.99% of the time rmvpe will do the job the best
Just use rmvpe and if that doesnt give good results try mangio crepe and if that doesnt give good results either its probably an audio input or model issue
Your using RVC-GUI which is very outdated
its my first time trying to make a model
Ayo? @magic lance level 1 !!! 
Rvc gui doesnt have rmvpe tho
Download original RVC or a fork like applio and use RVMPE
i was following a yt guide
Yt guides are always outdated
You can’t make models using RVC-GUI
Just follow this guide https://docs.aihub.wtf/
Last update: Mar 10, 2024
You downloaded a outdated version then
Original RVC > Mangio RVC
is applio better then ?
Yeah mangio havent updated since last year
Yep
yw
where do i click on the github page to dopwnload, im lost T-T
nvm sorry i got it SORRY
the what ?
are there any other RVC besides mangio rvc that support mangio crepe?
Whenever I run go-realtime-gui I get this error:
C:\Users\Toby\Desktop\RVC\RVC1006Nvidia>runtime\python.exe guiv1.py
Traceback (most recent call last):
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\gui_v1.py", line 59, in
import torch
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch_init.py", line 122, in
raise err
OSError: [WinError 1114] A dynamic link library (DLL) initialization routine failed. Error loading "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch\lib\cublas64_11.dll" or one of its dependencies.
I have CUDA 11.8 installed with Pytorch 2.3.0. I've added environmental variables for CUDA bin and libnvvp. I've also tried reinstalling my drivers, Python and Cuda.
how can i find the right settings for my voice
its taking soo long to extract features
i can help if u dont know how to setup
Why there's no public URL in easy GUI? Only local
Ayo? @torpid wasp level 1 !!! 
I need help with gpt sovits but Idk what's the right channel to get help for it
oh wait nvm I got it working now
Whenever I run go-realtime-gui I get this error:
C:\Users\Toby\Desktop\RVC\RVC1006Nvidia>runtime\python.exe guiv1.py
Traceback (most recent call last):
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\gui_v1.py", line 59, in
import torch
File "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch_init.py", line 122, in
raise err
OSError: [WinError 1114] A dynamic link library (DLL) initialization routine failed. Error loading "C:\Users\Toby\Desktop\RVC\RVC1006Nvidia\runtime\lib\site-packages\torch\lib\cublas64_11.dll" or one of its dependencies.
I have CUDA 11.8 installed with Pytorch 2.3.0. I've added environmental variables for CUDA bin and libnvvp. I've also tried reinstalling my drivers, Python and Cuda.
anywho
You have to follow that statement up with a contribution to the conversation
i was asking someone to follow me over here
from genereal
general*
Oh okay
But if you can help I would love that
cuz idk whats wrong
Did you ask ChatGPT?
no
Try that
dude sends his problem twice then tells someone to use chat-gpt to solve an issue
lol
Ok and?
I have a Bachelors degree in Computer Science and you're a literal child
aw lame it didn’t embed
then figure this shit out yourself instead of asking other literal children😭
like you don’t gotta flex a degree on a 16 year old 💀
Whatever you say little man
nah you got it bro 🙏
Help!! I am attempting to use the Hina Mod Google Colab and it's not giving me a gradio URL. It's still loading. It only gave me the local URL.
What do I use now?
Ilaria is doing the same thing. :(
Is Gradio down???
Could you elaborate?
Tells person to ask ChatGPT to solve a problem for a program that it 1. Wouldn’t know about since GPT only knows January 22 and older, and 2. Would most likely get wrong since GPT likes to pull things out of its robotic-ass.
I’m no nerd but your degree sure is showing 😅
OK retard
Didn't ask for your opinion
Ayo? @brittle wing level 3 !!! 
no, they are june 2023
i dont get any public url either
came in to ask
and what did they say?
Ayo? @brittle wing level 9 !!! 
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
possibly it has to do with an internal problem of the gradio
where is this at?
what does that mean
is it like
just wait untill it works again?
I don't know, maybe less than 24 hours?
Not so much, just that it will take a while to start because I forgot to put the precompiled
I think I'll put it up later
you made it?
How to use the applio without ui for infer cuz is say model_name but my useing a huging face link to dowanld a modle
RVC Guides (How to Make AI Cover)
Translation by country
im trying to train a model but this is happening anyone know how to fix this ?
yes...
try installing dependencies or rvc again
if you don't know the name of the model when you download it, look in the colab files in the applio/logs/ folder.
alr ill try reinstalling it
It's reassuring to see I am not the only one with that issue regarding Gradio.
no public url fix?
did it worked?
Can i use easyGUI again now?
we dont use voicemod here
is this better than the rvc i downloaded few months ago
okada is based on rvc, it only has better ui
i think
is there more detailed training guide?
can someone help me? im having trouble with a link to a voice model, it saying there was an error with the zip
where do i download rvc again? the one with directML for amd gpu
send me the link
Last update: Mar 10, 2024
which repo u using
How to make AI Cover with google colab ? It still not working
i need help
i want to clean audio, like remove static noise of microphone from background
so how do i do it using UVR?
is it possible to do it with it?
@loud flint, I have found 2 results that match your search!
Sorry, I could not find models that match your search "junichi kanemaru"
Sorry, I could not find models that match your search "sonic (junichi kanemaru"
the long explanation:
I cant answer anything detailed on that. You can wait for codename and ask him questions yourself
Dont gorget to check the drive link
forget*
Kl is just a part of it
in reality, you almost never pick ckpts based on em
I'd prioritize FM
then add Mel to it
and if there's a situation you have few " best " fms or few " best " mels
pick ones that are relatively lowest kl wise
pretty much
alternatively, you can fuse Fm+Mel hybrids with best kls ( say in 75:25 proportions )
sometimes
total is a mix of all 3
but issue of total G ckpts is that they are an avg best
they don't prioritize accuracy nor fidelity
so I use total G as sorta indicators to where more or less I focus on graph
to find other good-metric ckpts
I can help with graph but, do you have it synced
log interval matching epochs' steps?
I meant like, did you manually input the steps into config file
during 2nd training run ( actual one
Here's why it matters:
#✨│ai-help message
that part of convo
oh
for that check the #✨│ai-help message
last section
oh, wrong one
thought they linked this one:
#✨│ai-help message
here's all you'd need
ye, last one's about syncing, pretty easy to do
Yup, all the times you train models
Just, yeah. You'd have to train again
current graphs of yours have different logging points
those needed were never logged and are lost permanently
Unfortunately, yea
what's the set size?
dataset
but info on batch_size used appreciated too
so 14-18min ~ zone
can you go as high as 16 for batches?
or not quite
using colab or something?
happened every time or just once
musta been a bug or some freezing
cause they grant T4s for acceleration
they can handle even 35-45 mins at batch 16 ( tested
but ye, if not 16 then at least 14 I'd try
but I'd rather go for 16 cause it's faster than 14, 12, 17, 20 etc
Ayo? @graceful obsidian level 24 !!! 
and in ur case, most likely better for ur set
what ver you using of rvc?
mainline, applio or
neat
nah, don't have to yeet all
just those files mentioned on the ss
epochs ( in weights ) and
in model's directory ( in logs ) G, D, tfevents file, eval folder, train log file and config file
but before that
check your model's steps
first epoch in weights folder
what's the "s" value?
you'd check it after doing a test run on 16 batches
that's fine, you'd have to do a test run anyways
hold up
Pretty much:
-
You yeet all that's related to your unsynced models ( what you have rn ) - you keep only the dataset used in first step in the ui ( preprocessing )
-
You start the training as you normally would;
- preprocessing
- feature extraction
- index training
- model training; save every single epoch ( saving freq at 1 ) batch 16 ( in your case - recommended ) as for checkboxes: yes, no, yes
you train until you get 1 epoch
then stop
as a step 7
- you note the steps value
.pth models in weights and whole model folder from logs
ye, that's all there is to the model
it's experiment folder ( in logs ) and it's ckpts so, .pth weights
will be in: rvc folder / assets / weights / yourmodel_e1_s10.pth for instance
( hence why you train til first epoch )
ye
if lost just follow this ^
that's the exact same procedure
Np man, after you done with training just @ me
gonna help with the tensorboard
Nope
🗿
250+ models / prototypes and rvc mechanics-studying be like
meaning, you can trust me ¯_(ツ)_/¯
cause all info is tested in field, not guesses
what colab can i use on phone ?
check our docs https://docs.aihub.wtf you can find them here
also check #📰│dev-updates message cus there could be some issues rn
Last update: Mar 10, 2024
Ok thx
Could not create share link. Please check your internet connection or our status page
do I have to wait for Share API to be online again?
Ayo? @edgy fjord level 1 !!! 
I've no idea honestly
Not quite handy with Ngrok or such tunnelings ( neither I use em
Unfortunately nope @brittle wing
Personally don't use colab or kaggle
Saturn. Have a fully working notebook and fork for that but ya see
they're whitelisting people / account registrations in a way
and the free hours per month are even worse now
but generally, you won't even make acc sadly, atm
Uh mind if i ask how do i move the file "frpc_linux_amd64_v0.2" using Android device?
EasyGUI won't give public URL or there's something i didn't know
You can't
Its a problem of gradio api, its down
Read #📰│dev-updates
How do i know if it's fixed? Announcement?
On #📰│dev-updates they will say when its fixed, btw in it they also putted a link to check gradio status to see if it's down or not
Thanks!
RVC Guides (How to Make AI Cover)
Translation by country
sup?
gonna be eating and then processing my dataset and wut, need something?
yee
just wondering if we can hop in vc rq to optimize some settings
JSONDecodeError Traceback (most recent call last)
<ipython-input-11-8114ca135b8c> in <cell line: 31>()
31 if os.path.exists(config_path):
32 # File exists, proceed with creation of creds and client
---> 33 creds = Credentials.from_service_account_file(config_path, scopes=scope)
34 client = gspread.authorize(creds)
35 else:
5 frames
/usr/lib/python3.10/json/decoder.py in raw_decode(self, s, idx)
353 obj, end = self.scan_once(s, idx)
354 except StopIteration as err:
--> 355 raise JSONDecodeError("Expecting value", s, err.value) from None
356 return obj, end
what is this? when I try to paste the link in google collab and click on start
this happen
outdated colab, type -colab in this channel to get the newer ones
thanks
Ayo? @opal lagoon level 1 !!! 
-colab
- Applio, by IA Hispano Google Colab
- AICoverGen-WebUI, modded by Hina Google Colab
- AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
- AICoverGen-NoWebUI [Spanish], credits to Eddy, Hina and Gdr for translating and fixing Google Colab
- Ilaria RVC, by thestingerx Google Colab
- RVC Disconnected, by Kit Lemonfoot Google Colab
- easyGUI, by rejects Google Colab
- 🆕 UVR5 NO UI for Google Colab, by Eddy Google Colab
- Applio, by IA Hispano Huggingface Spaces
- Ilaria RVC, by thestingerx Hugginface Spaces
- RVC-HFv2, by r3gm Huggingface Spaces
- AICoverGen, by r3gm Huggingface Spaces
- Advanced RVC Inference, by r3gm Huggingface Spaces
- RVC v2 Huggingface version, by Clebersla Huggingface Spaces
Hello why does my rvc use my cpu instead of my gpu even though I installed the amd version pls?
RVC or wokada
Has this model trained properly?
I assume you mean wokada cause it has inconsistent cpu usage
-
Make sure you selected your gpu. Open task manager - performances to check for the number
-
Make sure you exported your voice models to onnx. Thsi is a MUST for wokada amd users. You upload voice model, then theres a button to export to onnx, then you reupload that onnx as PTH voice model file
-
dont move s.threshold all the way to the left or right, that causes 100% cpu usage
definitely finished at around 18k for the best average
Yeah I thought so too
or wait maybe even 7k
I can safely delete every save above 20k right?
you should try both
I saved every epoch, as recommended by code which gives me the perfect spots
u mean the deepest point?
Yeah I was gonna try that one too
ye
although at that moment other things like mel were still pretty high
rvc
is it safe to delete every epoch above 20k?
send screenshot and tell me your gpu name
thats wokada not rvc
mb
dont do s.thrshold 0.001 move it bit to the left
anything else is fine
amd wokada just sucks, your cpu usage should be lile 30-60%. its normal
if i was you id download the RVC voice changer instead. You want links? it runs better, less delay, better performance
^i agree
ok thank you but is it normal that my gpu works less than my cpu?
for amd wokada, yes its normal
can I have the link?
youre nvidia right
no can I have it?
https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/RVC1006AMD_Intel.7z
Download this
Then I recommend you download the (old) dev version as theres a quick trick to reduce some more delay. To do this, heres a quick step by step:
https://huggingface.co/Shadicti/rvc-old-dev/tree/main
Open the zip, open the first folder called Retrieval-based-Voice-Conversion-WebUI-dev and drag all the files into your RVC folder. If it asks you to replace any files, say yes. If it does not ask you to replace any files, you probably dragged the folder Retrieval-based-Voice-Conversion-WebUI-dev in it, but you should drag the 8 folders and 26 data files directly to the RVC1006 folder.
Then you run go-realtime-gui-dml.bat to start the voice changer
On there, make sure to change the default index thats there, just upload any random custom index to prevent crashes
Select input and output of the same driver type (MME) at the end. Is important
Sample length is delay (you should start with ~0.20-0.25 and see if you can go up or down), ignore harvest algorithm, do max fade length and extra you could do full
Choose rmvpe
Is that not working good with the rvc voice changer or smth?
Dont download the first link that i just sent, ill grab the nvidia version
Wait you already have Mainline installed so you only have to do the dev patch i linked
and which one is the right one?
mainline?
TY 🙏
if anyone is able to help me rq. I'm not sure what I'm doing wrong but im getting a error when i click convert
when you got rvc1006nvidia and the dev, you run go-realtime-gui without the dml to start it
alr so no dml
remove the spaces from the audi file, make it one word only
maybe that fixes it already
do I still need to download the rvc old dev thing?
I recommend it yes
yea you paste the files into the RVC1006Nvidia folder
alr thx
I wrote a guide to reduce more delay, and it depends on that dev thing so yes
ok
i'll try it
will do that in a sec but first, could it be that they rounded it up? As I haven't seen one with 1879
just use that yes
alr thx
youll rarely get the EXACT step number
I mean I save every epoch
then its rounded up
im getting the same issue. What else could it be? if you dont mid me asking
I dont use colab so i rlly dont know :/ maybe restarting helps, else try other voice models to see if the issue persists
or any error codes displayed?
AttributeError: 'NoneType' object has no attribute 'tobytes'
NameError: name 'cpt' is not defined
these are the errors im getting inside colab
i guess i'll try again
as in?
then perhaps 14 or 12 for batch if 16 is a nono
as for how to modify files on kaggle ( json), no clue
you should ask people here who use it
closer to 16, the better in ur case
as for editing, just an idea but, you can ask gpt to generate you some cell code for rewriting files
if you want
yea
Not an issue if you know what you're doing and actively checking tensorboard
I've been working in that manner for over half a year now
having max 4-5 gb for a training, maybeeeee 6
to maximize space, get rid of no f0 v2 pretrains, also v1 pretrains ( all of em, not just no f0 )
So uh, what is going on here guys??
it's weird, every single voice model I did and it kept instantly gave me this error
.
yeah
I had this yesterday with someone else too I think
JSONDecodeError: Expecting value: line 1 column 1 (char 0)
hm no your download link is working though
maybe try another collab?
there's another?
many more
i only know of this one google colab
Ilaria and Applio is just name, yes? Everything should work fine?
aside few gimmicks here n there, or utilities like tts or such
it's all more or less the same
where do I submit my trained model?
oh btw @graceful obsidian , thanks for the tips
I saved at every epoch and it really helped
I finally got my model now
uh I think you wait until theres a link
and then open that link
could take a few mins idk
Greetings!
sup
I think i can't send any pictures
By the way, how can i get RVC version 2?
RVC v2 Is usually built into any forks/mainline
If u think urs doesn't or wanna check, you could go to the train tab, and if it shows RVC v2 ur good to go
like that
u dont need to do anything else in there if u js wanna check
outdated colab, use only the updated ones in our docs https://docs.aihub.wtf
Last update: Mar 10, 2024
Ohhhh
Oh well ummm, i am using google collab the whole time 🐸
How do i get that?
so still ilaria and applio like I sent
Ayo? @leaden yacht level 12 !!! 
which collab?
Ayo? @peak osprey level 1 !!! 
I would use one of these as they are supported
but yours already has rvc v2 so ur still good to go
I'll use yours
TY!
alright
?
Yeah I know dw
I take it the collabs worked?
I need to know how to set up Chester's vocal to like
not be messed up with this song's vocals
You're confusing it a little bit
I went to appolio, where can i find the v2 choice?
Yeah i know, I shouldve said it a lil differently
RVC is how the whole thing is called, V2 because newest pretrains are in their version 2
as for mainline, it means the main / original repository
forks are things based on the mainline, mirrored ones but modified
ah
where does one go and talk about stuff like 'would this song work with this model's vocal' and stuff?
Uhh I think here
Oh!
Didn't get anything
btw code where do I submit my model?
Cuz after following ur tips everything worked out pretty well
and I got the exact epoch I needed
did you remove the vocals?
problems is the high keys, most models I've tried seems to just
work out
like at the normal voice, and yes, I separate the backing vocals and main voca
so what you could do, is change the semitone to like -4 or smth like that
but at high keys, it starts to just malfunctions
what you don't get
that could change it a little bit
his voice will be deeper though, so if its not the wanted effect you can always change it around
was there ever a model that's actually made for metal screaming here?
Hmmm, perhaps on weights.gg
or voice models channel
The whole thing
My question is, i am in appolio, how can i find V2 choice?
no I meant like to get the voice maker role etc
in this server
O, model maker chat should have something pinned about that
alternatively, you could ask @red kayak
cause I don't quite remember the procedure
It should be in the training tab
ah alr
tho, lemme check on that
Hmmm, seems like the applio is now something else
I haven't used it personally but, might be they've incorporated so-vits into it? vits gpt thingy from rvc's devs? ( that's a tts )
or just named it that way " vits based " rather than " retrieval based " (( tho, rvc itself uses some of vits things too
no they just have tts
alternatively they might have added gpt sovits support
but i doubt it heh
ah ye, so that's just rvc + vits gpt and microsoft based tts stuff yeye I recall that one
incorporating tts pipelines into rvc ye
Ye I do indeed plan to, just waiting for finetune availability
for japanese and english
unless they've already added it
the support chines and english
As for this
generally " pitch issues " or " screaming / non screaming " stuff etc
it is related to how well the model is traind + what kind of voice data the training was exposed to
maybe japanes too
yeee, sadly
the finetuning is for eng and chinese just yet
tho training iirc is jp, eng, chinese and maybe some other
yeah
For example
If the model was trained on rather soft / chill singing or speech, the model might not be able to replicate screaming / screamo type of sounds etc
especially if index is used ( or is used with higher ratio values )
Another thing is, the tonal / pitch range
if the model was, say, made on speech samples and they were monotone ( lacked pitch variations + at different volumes ) they won't have that high of a range - ofc, they won't " tear ", they just might not sound as ' accurate '
If the model is tearing, it might mean few things:
- Dirty acapella ( for example; harmonies, backing vocals etc. )
- overcompressed dataset ( compromised dynamic range : pitch relationship )
- overtrained / undertrained model
- or simply poor quality dataset at specific phonemes which your acapella at given pitch or volume might need
Am well aware it might be confusing for some ^ so feel free to ask about anything in particular or any of those related things, Imma simplify it
Hmmm, not sure but that's cause I don't share my models so.
Maybe check someone else's models and see what they've added in there?
or check rvc's repo for any mentions of that
alr
openrail
thx
Last update: Apr 01, 2024
Scroll down and you'll find a guide about how to upload models to HF
That's the guide about how to upload your model to the voice models channel.
First you need to upload your model to your HF account.
I guess
its a narrator of a game, and he doesn't show up
hes js a voice
do I js put fanart or smth? 💀
Show promo art for the game
smth like this?
Yeah
alr
I’m using applio and it says 2000 seconds… is that even normal?
Under what?
Can i use these RVC and make ai covers on mobile?
after you do the sync ( know the steps ) you gotta purge all cause if any from the actual training is of similar name, unknown conflicts could arise
- generally, always best to remove em all
- the test training is just to know the steps
- after syncing, you just train from scratch ( same settings - cause different batch changes steps )
Sir
Ayo? @peak osprey level 2 !!! 
Sorry, I have a question, how do I put titan on rvcv2 disconeted? (because I don't understand how to start it)
I clicked on convert and it’s still going on, it’s been 3000 seconds it says.
Ayo? @unborn viper level 2 !!! 
How long is your audio file
And what’s your GPU
Well it’s bout 5 minutes, 4.8MB to be exact. I use a GTX1650
any python library for only inference with files from weights.gg? not a webui
Shouldn’t take that long for inference
Should I try again?
Yes try restarting
don’t close any terminal windows
What’s a terminal window
would my 4060 8gb vram, with 12 batch per gpu setting, be able to train with an hour long dataset?
idk if it applies to Applio but when you open it there’s a black window with text that opens with it
yeah you can
you could use a collab on a mobile browser
same as other devices js another screen
no vol help meh
To applio
Yeah
Original RVC
It’s also slightly faster too
local or cloud?
Can i get a link to thar?
I would try restarting applio first and trying again
Make sure you selected your model and audio
If not
Here’s the link
-local
- 🍏 Applio, by IA Hispano GitHub
- Mangio-RVC-Fork, by Mangio621 Huggingface
- RVC Studio, by SayanoAI Huggingface
- AICoverGen, by SociallyIneptWeeb GitHub
- Replay, by Replay Team Website
- Original RVC, by the RVC-Project team GitHub
- GPT-SoVITS, by RVC-Boss GitHub
You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.
wait @unborn viper u have a gtx 1650 right?
Yes
if the original rvc doesn't work, I recommend trying mangio
I used to have a gtx 1650 too
almost nothing but mangio worked
and if you wanna train you also have to change a bit of code, I debugged the code with someone and found something that was keeping gtx 1650 users from training and giving them errors
Well I’ll try applio again and if it dosent work then I’ll try that
Then why it says error here?
I wanna make TF2 heavy ):
smth wrong with the collab I think as it says connection errored out
you didn't close the collab tab did you?
nah
if its approximated you should js wait
How do I do this mangio thing
for me it approximated 5000 secs and still it was only 15 secs after all
you could just wait a bit
Or wait
Nvm I wanna try the original rvc
Then I’ll do mangio
So I think im supposed to download the Nvidia GPU version I think
https://github.com/Mangio621/Mangio-RVC-Fork
Go over to releases, then download either INSTALL_Mangio-RVC-v23.7.0_INFER.bat or INSTALL_Mangio-RVC-v23.7.0_INFER_TRAIN.bat
The infer_TRAIN.bat allows you to train your own model if you want to, so if you wanna train make sure to download that one
No, that's it
The .bat files will download a .rar, 7z, tar.gz (forgot which one), but either way you need to extract that and then you need to run go-web.bat
both the batch files?
I will restart the RVC again
or both mainline and mangio?
I’m trying both mangio and this original RVC one
alr
I’ll wait till it downloads
are you gonna train models too?
What happens when I train models?
Oh
mangio also is the one that worked best with gtx 1650
Have you tried orginal rvc
