#✨│ai-help
1 messages · Page 128 of 1
Yeah i plan to watch it
clicks are fixed in there, it's quite op
Wonder what it can do with Michael's snaps 🤔
it should be much more goated for synth datasets
if to ensemble it with mdx23c, It'd prob kill the stuff
but ye, ft bsroformer is currently world's best isolation
so for like any speech with music and sfx over it
i recommend watching new parasyte the grey, somehow netflix made it good
how is fm supposed to look like?
i need an answer to that lmao
in what way?
like on the graphs, how is it supposed to look like, is it supposed to be going down up?
i hope anjok adds the ft version in uvr soon
cuz like mine always went sky high
and that mysterious demudder thingy
Oh, yes, all metrics are meant to be going down
even D in ideal scenario should be fairly balanced, both going down, G and D
did someone say a new bsroformer, where can we use it
with slight G bias
fm always went up for me too
mvsep
lol
and x-minus (i think?)
yeah i really dunno why
if it goes up that'd be the balance is out of place
in such cases, try to verify the gradient normalization
check if G is skyrocketting
and D decreasing
nope ive had it happen when they were both fairly balanced too
oh the ver 2024.04 under bs roformer?
g and d being ideal per se
yep
if you have the log file I can take a look
would tell a bit more if I could see it ye
i train on kaggle, but i will train a model now so i will ping u here
Alrighty
ye tho, what's the purpose of this step ^ ?
Hi there, just saying that the new applio no ui is unusable. First I get an a100 GPU which I never get, then after installing it prompts me to restart the session and I lose the GPU. And after it gives errors of not being able to decode the audio file (ffmpeg error), good waste of a GPU.
curious
to export all the audio labels and have it split up already
and it usually helps rvc not mess up as much
try one to time to do side by side training
rvc split and your method split
curious on how rvc reacts to lack of contextual-overlap
okay i will train on both peices of audio
and i will let u know
O7
just let rvc split on its own right
no truncating and just
such*
uhhh
if possible ^
but I imagine it to be difficult if you're not doing it from the very start as you build the set
cause truncating doesn't care if it's inter-phoneme or inter-sample gap
i'll try to replicate that
tho, if you have quite a bit of data, effect of gaps isn't as devastating really
model's biasing towards silence mainly when presented with smaller sets + higher batches or like, when encountering difficult patterns to learn, then it cheats
rvc has the threshold set to -40 db and everything is well de noised and noise gated so splitting wont be an issue for it
that's not the noisegate that's the threshold for what to treat as "cuttable" silence
yes
that what im saying
but that thing doesn't quite work as fully intended, tested the set with wider gaps / normal and with small gaps
result is that the wider / normal gaped sets have more silence in there
at the beginning and ending
like, at times 0.3 to 0.6 sec
but if you pack it all tightly, you feed it more contextual voice data rather than silence
Take your time and remember, might have worked for the way I process my sets, might not for you
afteral, we all shaped our own methods right 👀
like I was focusing on handling smaller sets or those prone to collapses vs those 15/ 20+ mins
yush but i still like trying stuff out so yeah
i legit havent gotten a mode collapse in a hot minute
generator is always okay with it
yup, bigger sets have less to almost none of such issues ( maybe max 1 or 3 collapses per 2-4 training hours and even that, rarely
sadly that's not so easy for smaller sets
i did one with 6 mins didnt really get one
i needa see
i'll also train like 2 mins of audio
with labeling
and i'll also show u those
i use ngrok so ull have access to tboard
for instance, this mostly happens when doing standard sets with no gap control
oh, this one
so ^
1st one is actually when you pack it tightly
i see i see
and while those silence zones might not be an issue for generally bigger sets
for smaller? it's devastating
like so ^
but then when you scale it with right batch ( kinda negate effects of small set
hmmmm, you can try
but to replicate it, you gotta use batch at 16, 14 or 12
ohh thats what u want
i was going to use 4
yeah sure
ohhhh, then that's why
ill put that to the test
bigger batch trainings have more data to calculate more consistent kl
's why I don't go below 12 as minimum
also feature matching is more averaged too
well kl can still be consistent with 8
but we;ll see
i need to test all of those things out
it's still half less with 16 cause
gt 100 samples
100 / 16 = 6~ groups, each having 16 samples fed to the network at each gradients update
100 / 8 = 12/13~ groups but each having just 8 samples fed
so while there's less groups, each provides more averaged gradients / params update
hence why graphs tend to be less noisy and more uniform
yes smoothed out
ohhh
that too, but too much data lack of convergence, too little, same
is why I recommend doing range-finding trainings:
4, 8, 12, 16
then comparing 2 best sessions
and say, 12 was the most stable that was going down but could be better ( deeper in tendency )
while 8 was really good at going low but too noisy and would sometimes back-track itself
you'd then want to narrow it to 9-10-11 batch
kinda like that but then, it's tedious and not everyon can afford time for that
powers of 2 are usually best
speed wise for sure cause memory utilization
Hello friends, I need a model of a Russian teenager of 16-17 years old, a help desk
but while for one 10 min set 16 might work, some other 10 min might benefit from 7 or 13 batch due to more complex patterns and spacial characteristics etc
and needs more randomization ( lower batch ) to more efficiently capture the patterns + compare it with totally unrelated yet close enough ones
Hey bro, can you help me?
well, for models maybe try weights.gg website
you could find something
yet remember, if you need it for real-time voice changer, model being " russian " won't be enough, you also gotta do the accent as well, at least to some degree
if it's for covers / speech generation from input, any good russian model will do, provided you'll use high index ratio - if model allows you to go that high, that is
guys when i press save it says file not found what is the problem?
save? where?
oh you probs using some custom forks or something (hf applio? iliaria rvc?), in that case I can't help
im thinking its either W-Okada or voice.ai
voice ai
Ayo? @clever shadow level 1 !!! 
yeah don't use that
ohhh right, maybe w-okada
use w-okada instead
w-okada or rvc's native voice changer
free and open source
both of em are tons better than that website
yeah 
native has fcpe tho, so that's a plus
doesnt W-Okada have that tho? .18a must have added it 
from where i can download w okada
oh, then maybe, perhaps
read the thing, has download links there
can't tell as I switched from dml w-okada to native
kinda easier on artifacts and speed for me
and that is on pytorch not onnx
oh ye I remember why I switched, cause I use pytorch rather than onnx
kinda can't bite through accuracy issues with onnx so
where I didn't find it
amd
what does that mean
just scroll down for 1 or 2 seconds, there's a download link for Nvidia, intel/amd and mac
it's this one
also, please, read the guide, you definitely need it cuz you'll have to do onnx converting on models
yeah
ye that's the one
18a
if you'll have performance issues or stutters despite config testing:
amd 5700 xt
do i really need to set saving freq for this to work
👁️👄 👁️
cuz it isnt a com model
so im just
setting it as 5
so that i can let it bake
and not worry about drive overfilling
if you wanna replicate my environment so to have fair comparison
I'd set it to 1
should i press this
Ayo? @clever shadow level 2 !!! 
?
cause collapses can occur at any time
there's that " download " button
upper left corner
or that you showed too, ye
k tysm
but ye, was saying that, if performance will suck despite testing settings, try rvc's native voice changer too
Might be similar to my case where native won over w-okada
also note: w-okada needs you to convert the model to onnx, rvc's native doesn't
onnx models are a lil less accurate in few things but, you most likely won't notice ( just mentioning it in case
from where i can convert it
thats what the guide shows how to do
k
^
hm... native works out of the box like that for AMD stuff?
yes
damn, pretty cool
as my fork doesn't rely on that, gonna check what's needed and diffuse it within my repo
that's how native looks
main pro is that it doesn't require you to use onnx and has fcpe
w-okada's dml release iirc as I have 18a too, doesn't have it
and is full pytorch and for some magical reason lmao, cuts less
funny isn't it?
I just feel like native has few more sliders to play with
like overhead buffer ( unless wokada has it too and I forgot
Still taking an insane amount of time @graceful obsidian . Currently training the second epoch and the first one took over 7 minutes.
can you quickly go into model's folder
check the config file?
does it say true next to fp16?
@floral token
fp16_run false
aha there's the issue
yours runs in single precision
that's weird
@floral token a simple one but
did you:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117
this exact one?
or did you:
pip install torch torchvision torchaudio
even some older gpus should support nvidia amp and mixed precision hmmmm
where can i find an e girl arabicc sound
is that better or the voice changer cilent
I'd try both to be fair, see which you like better / which works better for you
but if you asked from technical point of view, I'd say native
( but then, I am an accuracy freak so
@floral token
try this
or maybe in cmd:
nvcc --version
can't tell for sure as I don't have nvidia really
should i download it
oh, nono, that's for charm
oh, my bad lol, tagged wrong person
anyway, if you have nvidia, I'd just probably use w-okada as it's more documented
if it's amd uhhh, maybe native?
just try stuff around
oh okk
wdymm
oki
I'll try a driver update. Not sure I have the patience to install another client after taking so long to get back to what I was familiar with
you can keep the client
I had no problems, trained several models a year ago that were great. Now seems completely different
ah to just replace my current config file
Ayo? @floral token level 3 !!! 
not that one, there's one which switches between fp16 and fp32, it also controls sample length limit based on vram
Oh wait a second
right that's just my model's config file
when did you download the mainline?
nope, that's different
set it up yesterday
models configs are taken actively from other place
there are templates in configs/v2 v1 and ones that are in use in configs/inuse/v2 v1
those from inuse/.... are copied to your model's dir
so if init of rvc forces change in "inuse"
they quite possibly switch the dir's one too, and even if not, the training pipeline is taking info from inuse
tl;dr, if not to hardcode fp16 use in you, you gotta fix the driver ( if it's the case, that is )
okay, my config file under configs folder shows 10/20/2023 modified date
a
well, just get my fork
def something bugged
you can re-use some files from mainline to not waste net
you'll only have to copy from yours to mine these:
and:
pip install -r sox.txt
that's sox resampler for better audio resampling
And if that won't help, means it's the driver / gpu
but hey, you'll get features mainline doesn't have so, it's a win situation lol
Alternatively, you can try this:
hardcoded fp16 for ur rtx
guys when i press start i can't hear my voice why
I change the output to my mic and still I can't hear my self
Ayo? @graceful obsidian level 20 !!! 
giving it a go now @graceful obsidian thanks so much
Sure thing man
this one's probs not what you need
wdym
remove
how
k
this one might be more obvious but can't see what I'm missing @graceful obsidian
sorry, which txt file? @graceful obsidian
how am i supposed to change the singer of a song? can yall help
I didn't find the mic of voice ai in discord why?
cause it won't work like that
you need virtual cable or such, but I can't help with that
others know more than me about re-routing audio that way
you won't need that one unless you have my fork
sox is utilized only in my fork, mainline uses resampy or such
gotcha. I'll try it this way
the thing you need is uhhh
testing fresh rvc from repo
cause I believe main issue for you are the outdated files, perhaps
tl;dr:
- test if fp16 works for you ( when you run rvc, console should say something about half precision or fp16 ) using fresh / newest rvc from repo ( not zip / prepackaged versions! )
- if that doesn't help ^ It must mean the gpu or driver is the fault here
Is it meant to take like over an hour to try and do japanese vocals?
i was mainly curious as to how it would work
right you said just to replace the hubert_inputs and the rmvpe_inputs only right? @graceful obsidian
hello
when i close the app how to open it again from start http?
i get jsondecode error on rvc v2 colab, im using an rvc model, idk what to do
and should i leave the cmd or its fine if i close it
nono look.
rvc repos don't contain models like rmvpe, hubert, pretrains and so on
you gotta get the repo's rvc ( it's not packaged, there's no pretrains and models )
from your older rvc, you gotta copy over what I showed on the ss
so:
I already had all of those files is my confusion
and pretrained_v2 and v1 have f0G48k, f0D48k, etc, each for their own
when you dl the repo
My folder name is RVC1006Nvidia. Sorry if that's not actually Mainline
you just get blank rvc
which is what you gotta get
because pre-packaged rvc, namely, rvc1006 and so on
aren't being updated
see ?
yes for sure
whereas the repo itself, indeed, gets updated
but repo doesn't contain heavier files
namely, models ( hubert, rmvpe etc ) and pretrains
which you get from older rvc full releases, so in your case, 1006
what should i put this to
whatever the table showed as working for your gpu
this app is using to much cpu
it should be using gpu tho, not cpu
my gpu is rx 5700 xt
cpu is 80%
meaning, you don't use gpu hm
I suppose, restart w-okada
i change cunks and extra to
but I can't hear my self now but cpu and gpu is good
around 20%
which point very much good
select the lowest point from here and u should be good
so lowest point
what it say "value" attribute
do you know
what is that mean
for example in the graph lowest point maybe 5k step but 40 value. Other hand, a little bit high point maybe 7k is 35 value
why
Can you enlighten me
hi, I am having trouble with the RVC GUI
downloading any model always fails and I cant figure out why
@red kayak
where do you download
from where
from ur weights folder
but my colab worktime is stopped
i know
how can i continue
to training
from where i left off
can anyone help me here
outdated colab
use only the updated ones in our docs https://docs.aihub.wtf
Last update: Mar 10, 2024
aaaah
Ayo? @stuck spoke level 1 !!! 
thanks m8!
yw
so i'm using huggingface and it normally doesn't take ANYWHERE near this long but its taking literal hours just to do something as little as a 4 minute song
Did you duplicate the space
yes
Did you just create it
no
I just made a new one
Ayo? @finite pond level 2 !!! 
Hello Tangoville
yea it's
supposedly he had plans but something didn't work out and he got depressed even more (?)
after a while, weird statuses happened and I kinda stopped liking his attitude
simply unfriended him a while ago but ye, he was and most likely still is alive
@zinc anchor
I think he was an actual engineer
he chillin i think dw
as in?
non machine learning related engineer you mean, right?
Yeah I think so
hooo, interesting actually
wait what, oh, you meant lusbert, I thought you misspelled akimaya
lmao
Lusbert did left ai hub and now iran blocked discord so rip
I have to talk to bro through google chat
that means, I didn't know lus was an engineer
He's not
well, then I am confused
who's akitai
if it ain't bert's another nickname or akiyama miss-spell
man, seems I missed out too much
hi guys! sorry to bother, I just got in, do you know where to find the tutorial on how to make ai covers for begginers? 😦
Full AI Voice Model Training Guide (Local)
Link: YouTube
Credits: Christopher Villanueva
Model training with Mainline RVC
Link: Rentry
credits: Raven (ravencutie21)
AICoverGen Colab Guide
Link: Google Docs
Credits: Eddy (Spanish Helper)
Create a model with RVC disconnected (colab)
Link: Google Docs
Credits: Angetyde
How To Make an AI Cover With Ilaria RVC
Link: Rentry
Credits: 👽 Julia (ailen2091)
oh, there ^
thank you so much!
Sure, and for more advanced things later later, in future
check this:
#✨│ai-help message
Links r old 
oh
We gotta fix that
fffffffff
so if links are outdated or whatever then
you just gotta first figure out your gpu
nvidia or amd?
Leme get on pc rq
orrr ye, you can handle that
I'd perhaps just help with mainline or my fork
not that well aware of applio and other forks' differences
Have the audio file of your song ready, & let's extract the vocals from it with an audio isolation software.
this should link you to the tutorial that's right for you
omg i have no idea 😭 i think this is too advanced for me LMAO my friend send me the link and told me: there, u r gonna find tutorials
Ayo? @pulsar breach level 1 !!! 
thank youu
dw, all will be clearer once you get a better handle of it
ur too kind tysm
suree, mods, helpers, lots of people to help if you'll have issues so dw
the google colab ai songs maker is not working what should i do
How is not working and which colab are you using
RVC Guides (How to Make AI Cover)
Translation by country
There's no best pretrain (still)
I would recommend you to just use the original one.
tangoville
hello jimboville
what do the others provide tho
Others allow you to train models with shorter datasets.
But results may vary. (sometimes you can get better results with ones, worse with others)
I would recommend you to use original one if your dataset is near 10 mins and it's decently clean.
It's matter of testing.
There's no best pretrain.
Use Applio No UI Colab.
Tho make sure to have an instrumental and a clean/dry acapella with no backing vocals or harmonies for inference.
abt 5 mins rn, but this stuff is fairly new so idk how to work them
ive been using original for a bit now still, just was wondering. thanks
You're welcome.
Does anyome know if it is possible to make a free ai voice model with no dataset?
Like if I wanted to make a voice for a person that does not exist
Nope.
You can make any kind of model.
But you obviously need a dataset.
The only way to make an original voice is by merging at least 3 similar voices in a dataset.
Can you merge pre existing models?
Yeah.
Nope, but i think you must put the paths of both .pth's of the models you wanna merge
Put them where
ckpt Processing tab
Hai Code.
mhm. hai hai ~
- redacted -
wow
I understand
fyi, am decent enough as long I don't have to talk with certain individuals
the fact that there is a picture
well, you're meant to read it
there's " ckpt fusion " section
Can't I use ckpt processing in Mangio?
in a tl;dr format, you just want to put 1 model in slot A, other in slot B
each box has to have a full path to the .pth
Ok
you can
That's universal for any fork
right
just make sure to have the voices quite similar in timbre otherwise it might sound pretty bad
notes:
- always have v2 checkbox checked ( nowadays probs nobody is doing v1 ) when fusing
- 40 or 48, you gotta match that one box too
My mind feels like a bomb that is about to explode trying to figure this out
Go ahead, ask
Imma simplify it
Depending on which service you use
I haven't been here for a while and don't quite know which are working, which are not
but, generally, when you have rvc or fork running, you wanna put the models ( .pth ones ) into rvc main folder > assets > weights > here
or if it's old-rvc based fork / notebook
then there's just weights folder in the rvc folder ( so, no assets )
Deadass I feel like I am a dummy surrounded by a ton of geniuses
hmmm, then let's start this way
what service ( cloud ) you use?
Google colab? hugging face? uhhh, what else
Colab
Ayo? @frank shuttle level 3 !!! 
uh give me a sec
will be here when you click the folder icon
in there you should either see the rvc folder or whatever they call it nowadays for cloak purposes
Do I need to run anything first?
In a short, yes
And does it work with Ilaria
Hmmm.. maybe?
I haven't used any of those " newer gen " notebooks
also, I am not sure if ilaria or cover-gen ones have ckpt fusion tab
they're ( iirc ) focused on just generation rather than utilities ( such as ckpt fusing )
Do you have pc?
I have an old laptop that does not have much storage left
can't afford around 50 to 120 ish mb?
that is why I want to hold off on doing it locally unless it is a lass resort
yeah
Hmmmm.. I could do the fusion locally for you if you had the models and just share it on gdrive
if that's fine with you ofc
I don't even know what models I am using yet
Then my suggestion is, find any models you'd want to fuse and just lemme know, until you wanna handle it all yourself
but then again, can't promise all forks / notebooks have the ckpt fusion
ok
but just in case
once you install ( so usually init step in notebooks ) the rvc
you'll have the rvc ( insert potential cloaked name ) folder
in there you should have: assets/weights/ <put your .pth models here>
then you'd go into ckpt fusion tab and insert your dominant ( model you think should be the lead ) model into the slot A and secondary one in the slot B
.
ex.
Model A: path/to/rvc_folder/assets/weights/your_model_A.pth
Model B: path/to/rvc_folder/assets/weights/your_model_B.pth
you'll also have the " weight for model A " slider or something like that, it operates from 0.01 to 1.00 ( 100 )
It roughly means, how much of Model A you want to leave in the A + B mixture ( final hybridized model )
I recommend doing 50/50 ( for slider ), so, setting it at 0.5
for version pick " v2 " ( a checkbox ) and for "40k" or "48k" select ones that correspond with your models
(( whenever you dl a model, they should be properly named or tagged with their samplerate, so, 40khz, 48khz etc ))
the thing about " pitch something ", again a checkbox, ( Sorry I ain't giving full names. Been a while I used something else than my fork and I have my custom namings ), leave it at " yes "
@frank shuttle Hope all's clear so far. Best of luck ~ ✨
take your time, have a read
or if you prefer other format, just wait for other folks giving you links to guides or stuff
but ye, yw, gluck
thx
since w okada doesnt work no matter what i do, what is rvcs native whatchmacallit
Essentially, it's native to rvc realtime voice changer
do you have rvc on your pc?
no
and do you have time rn to get stuff done n ready or not quite?
i got all day brother
Alright, in that case
take this
name it rvc or something, put it on ur fastest drive ( C? if ssd )
yeah its C
so like in my downloads or what
alr, now you'll have to download one thing
gimme a sec, gonna confirm which one is it
lay it on me
you'll put that base.pt in here:
so, rvc folder of yours / assets/ hubert ( the hubert_inputs will be there already, or should at least ~ if not, lemme know
oh, uhhh
take stuff from rvc ( one on the ss ) and move it to the main " RVC "
rest you can yeet
just the insides ~
so go into the "rvc" folder and move its contents into the "RVC" folder?
ye
all for the sake of paths clarity
those can be tricky at times, in some cases
oh wait
bruh
like that..?
ye
ye, that's the whole rvc ( aside some bigger files needed for training )
what do i do with this (if anything)
Ayo? @frosty wolf level 3 !!! 
put it in hubert_inputs folder?
thats the only thing in here with "hubert" in its name
will be here
oh wait im dumb lol
the dir/ assets / hubert
alr, now you gotta wait for my upload of " runtime "
alrighty
4 more mins or so
I get that manual installation is better, but wouldnt it be easier to just send the binary release, paste the dev code inside and call it a day
it'll be approx 700 mb
the issue is
their pre-packaged releases aren't aligned with the newest changes and fixes
there's tons of bullcrap done recently and poorly documented too
for instance, recent config rewriting changes and stuff
Hence why I told him to get the newest mainline + imma share tested in field runtime
cause I can't promise the package's ver is compatible with fcpe realtime and stuff
Just for life's ease sake ✨
is this voice changer anywhere close/better/worse than okada
they have the same principle
main difference ( and reason why I personally switched )
is because I have amd
amd needs onnx
so its roughly the same but different layout?
and I don't like onnx as compared to pytorch, they're worse in accuracy
Thats fine, just think the average user really does not care about allathat and is fine with rmvpe + the old dev version + some wasapi or asio routing
and native rocks in terms of speed, pytorch and that is on amd
yeee I am aware
it is more so I can debug it easier
its not as bloated as wokada. better performance, and especially better for amd gpus
if I know what they work with 1:1
hope this one isnt as buggy and weird as the other
esp if it's files I tested in field / keep updated myself
will be fine
I use the voice changer in pytorch "mode" ( nvidia optimised naturally ) on old 4 gig amd, rx 570 👀
so dw
pytorch was a pain in the butt to install
hype for this summer so i can get a job and purchase an actually decent pc ⁉️
hooray
unpack the "runtime" folder from the archive straight to the rvc folder
so, rvc dir/runtime/
👍
also, a question, you have python installed, right?
yes
lemme look
might be just my luck with people that have python issues but, 3.12 seems to be often times problematic
I'd say, for rvc purposes, 3.10.6 is the most stable and reliable
might have to get
but let's stay optimistic for 3.12, if that won't budge then we'll look further into it
if this doesnt work i'll grab that up
ye
would try to get 3.10 100%
well that'd be a hassle and i'll just wait to see if this doesnt work
holy cow this runtime thing is huge
its taking a lot to extract it
as much as I was able to get my fork to run without it
57k items?!
ye
scripts, libraries, configs
it also contains stuff one would use for training-ready rvc
hm sounds good to me
hmm.. might actually isolate the voice changer from rvc in few days
perhaps the needed libs too for smaller dl
i hope this vc is good lol im going through a lot for this
spent all day trying to fix okada
(to no avail)
everything is better than something that's broken afteral
it has few extras too
for instance, pre-render buffering
haha.... whats that
sure, adds delay but can enhance stability
you kinda, infer ( generate ) speech in " memory "
which naturally causes delay, but as you have some overhead, yea
nono
in simple words
rather than have it literally realtime ( as long you want to use that feature ofc )
you have it delay it by whatever you set, say, 2 seconds
those 2 seconds it first fully generates the speech
kinda yt buffering
where buffered part is what you hear
and what buffers is your speech realtime
can help for weaker hardware to keep stability
etc
Ayo? @frosty wolf level 4 !!! 
yea.. and with that, I can safely say 7zip's superior to winrar
in terms of compression algos
lol
yea.. all for those few or so libraries that are needed but can't be isolated just yet 🤡
where do i put the voice on the app
rvc locally on your pc, right?
as in, on your pc and not some notebooks, colabs et-
ah
then what app you talk about?
🤔
heads up people dont know that wokada is wokada so they ask in here thinking rvc stands for real voice changer
We do not offer support for voice.ai on this server and would recommend not to use it
^
voice.ai is trash
and rvc means retrieval voice conversion not really realtime voice conversion
oh ok sorry
i got to know this server by voice.ai
on a tiktok
lma
much better if you do it locally on your own pc if you have a chance / or possibility - hardware wise
ye, and trashy
yeah its pretty dog water
yep
only legit " rvc " like services that are paid I can recommend is really elevenlabs
they're just gods in terms of style-replication for speech generation / tts, but ye clears throat
STILL GOING
mada madaaaa
81% man this is taking far longer than i thought
I am trying to figure out what singers sound like Ariana Grande
you tell me
would probs be 1-3 mins on my end
perhaps i should switch off of winrar
no?
well, I just wouldn't
or if you wanna be risking the biscuit 👀
ITS DONEEE
oh boy
you gotta install the reclist
this one:
requirements-win-for-realtime_vc_gui
pro tip
cmd in the address bar
and viola. No need of cd or contextual menus or whatever
this?
do this ( then hit enter ofc )
and paste in the cmd / console
@jovial pollen you might wanna read up on this conversation since you wanted to try out the new rvc update. good luck catching up lol
when you're in rvc folder, you have the address bar above
you wanna click there and just type in cmd and only that, then enter
ah thanks
it'll open the cmd which is already looking in the rvc folder ye
got it
neat
now
paste > enter
ye
's fine. we all start somewhere
anddddd, back in the day I'd help so many people at once, daily to the point I decided to quit some server lol or was it going perma-offline-status?
don't remember
youre cool
you too, you actually take time in doing things
but some ( pardon whoever reads this ) really impatient tiktok-era kids have difficulties typing in 1 command properly
when i wanna do something im gonna do it thats all
good attitude ye
Ayo? @graceful obsidian level 21 !!! 
oh, right
this isnt looking good
hold on, lemme think
what could have possibly have gone wrong 😭
uhhhh, well ye, wanna hear bad or good news first?
alright lets hear the worst of it
you'll have to get c++ build tools to get one specific thingy for the rest to push through ( building wheel as shown in the log, which failed )
and that is a pain in the ass. Finding it, downloading ( quite heavy in size )
all that ^ to use cuda itself
Search it on the internet.
Good news is, you can use it in DML mode for now
change the reclist to this one: requirements-win-for-realtime_vc_gui-dml
how good/bad is that compared to if i got c++
pip install -r requirements-win-for-realtime_vc_gui-dml
well, tbh? probably similar or same performance
given your gpu
alright cool
but in any case, if you were to ultimately decide to try cuda as well
that'd have to be tomorrow
He should install C++ Build tools?
Bc i found it.
rename the text file to this?
alright
as there's few reclists in rvc folder
installing
this one is for dml - direct ml
difference between direct ml and cuda is that direct ml is for all devices that support directx 12
cuda is for nvidia only
wut
Fcpe is optimized for realtime right?
yes
Nice, ig he can use that.
Alright so, we can actually use cuda if pyworld is all it is about
edit the first reclist
one without -dml
okay
it's a text file
Ayo? @frosty wolf level 5 !!! 
so open that sucker up?
hey my name is purple my favorite
then save and ye
pip install -r requirements-win-for-realtime_vc_gui.txt
so find that line and remove it
okay
oh, nevermind
okay im installing after backspacing pyworld out of the requirements-win-for-realtime_vc_gui.txt
what??
let's see how it goes
ah yes, so build tools that is
so i gotta cop that
You'll must download Build Tools.
ye, guess can't be helped
alright...
Alright code, i gave him the Build Tools link.
Guide him thru the installation.
🙂
ok ✨
im guessing free download or this is the wrong page
i hate spanish class
stuff I learned about spanish within the last 5-7 years?
Descargar means Download hermano.
xxxxx descargar gratis.
kek
i assumed
XD
¯_(ツ)_/¯
Download free no virus 100% real no password
Are you sure you don't wanna apply for helper?
(Bc that's what you're doing)
I feel more as audio / model engineer or specialist than helper
aaaand I only help when I feel like it or am bored ( like rn
you sure are HELPING me out a lot
Then apply for Engineer role.
individual components, I believe
nah, doesn't feel special
what can I say, I like feeling unique ✨
what do i select
search for visual c++ 14
alright
or anything that resembles it from the name
don't remember. Last time I got that crap was like a year ago or so
hmmmmmm
try 2014
or 14.0
or visual redist
or visual basic
perhaps?
it's for sure version 14.0
and c++
so I guess ye, visual c v140
msvc v140
this?
ye
after you done, just re-add pyworld to the reclist (( standard one, not the dml one
( positioning doesn't matter, can be at the end
okay
im more determined to make this work than i am for school
probably not the best thing
🗿
real 💀
hi, I have problems with the voice change, it doesn't work, the voice doesn't sound, only my normal voice in the passthru
🐢 👍
without
pip install -r requirements-win-for-realtime_vc_gui.txt
ty
there
btw, the format is really simple for installing reclists ( which are, requirement lists )
pip - packages manager
install - cause you install
-r means " requirements / reclist - you get it"
then you provide the list.txt
there should be absolutely no reason to fail, unless i got the wrong c++ or smth