#✨│ai-help

1 messages · Page 218 of 1

analog obsidian
#

but thats fine

#

rvc gets confused sometimes

bleak nymph
#

so if it goes up for more than 15 - 30 mins then i stop it

analog obsidian
#

it should start to raise forever in the 200 epoch zone

#

yeah

bleak nymph
#

alr

analog obsidian
#

but if you aren't sure just tell me

bleak nymph
#

alright tyy

#

u two have been a great help

analog obsidian
dusty bramble
#

all the credit goes to him, atp i been just watching

analog obsidian
#

after you noticed the g/total rises forever you choose the lowest point BEFORE the rising

#

thats your best model

bleak nymph
#

i see

#

also can someone describe what an epoch is in simple terms, i wanna explain it to my friend but i don't understand it well enough to explain it

analog obsidian
bleak nymph
#

snapshots?

#

like minecraft snapshots

analog obsidian
#

yeah

#

rvc models are numbers and calculations

#

funny numbers and calculations

crude flame
analog obsidian
#

rvc saves these calculations in epochs

bleak nymph
#

lmaoo

analog obsidian
#

there are bad calculations and good ones

#

the bad calculations are when the graph raises

#

and the good ones are when it goes down

#

in one point of the training rvc gets very smart

analog obsidian
#

(the lowest point in the graph)

#

and that epoch stores the very good calculations and predictions

dawn copper
#

Hey all! So, I'm trying to use Applio for the first time. I followed all the instructions, buuut.. When I try to do some TTS using a voice model, I get an error. My GPU is an RX 9070, which I'm assuming is the problem tbh.. So far, I've had no real luck in using it for playing with AI things. Anyway, here's the error, if it matters.


rocBLAS error: Could not initialize Tensile host:
regex_error(error_backref): The expression contained an invalid back reference.```
crude flame
dawn copper
#

Annnd that explains it! I didn't know it was zluda as a whole that didn't support them. I'll be looking forward to it getting some support.

#

Thank you.

bleak nymph
#

😭

dawn copper
#

I wish!! But no. I probably should have said "AMD 9070".

bleak nymph
#

lmao nah i understood

#

i was considering getting amd but decided nvidia

#

glad i did because literally everything python requires nvidia graphics card 💀

#

@analog obsidian uh the graph hasn't changed

#

a bit

analog obsidian
bleak nymph
#

o

analog obsidian
#

click that

bleak nymph
#

WOAHHH

analog obsidian
#

XD

bleak nymph
#

things changed

#

it's a flat line now 😭

#

nah it's still beginning basically

#

i gotta wait

analog obsidian
#

or something more ez

#

just press f5

analog obsidian
bleak nymph
#

oh yea

#

f5 just refreshes tab

#

anyway is late now

simple ore
#

for 9070

bleak nymph
#

i think imma continue it tmrw

analog obsidian
#

for resuming the training put the model's name
don't extract nor preprocess again (important)

#

put the same batch size and save epoch frequency

bleak nymph
#

is at 37 rn

#

alr

analog obsidian
#

and don't use fresh training

#

if done correctly the graphs are gonna be fine

#

and they will continue logging as usual

bleak nymph
#

alr

analog obsidian
#

🦈

bleak nymph
#

also what time is it for u rn

#

if i were to guess, 8pm

#

the previous day

#

am i correct

analog obsidian
#

10:09pm here

#

🦈

bleak nymph
#

ahh

#

nice

#

1:09am for me

#

am tired

#

honk shoo or whatever

#

💤

dawn copper
simple ore
#

i dont have AMD card in my PC any more, so I cant test the latest zluda and the patch it requires

#

rvc\lib\zluda.py

dawn copper
#

I think I understand. I'm downloading Rocm 6.2 now. Once that's done, I'll start trying zluda patches. Though, I'm getting pulled away for now.

simple ore
#

there are some hijacks in that file, but they may not be requred with latest zluda

crude flame
#

I got a 6700xt

simple ore
#

zluda 3.9.1?

crude flame
#

Yeah

simple ore
#

I think Lee made some changes like enabling CUDNN and other stuff

#

so there's no need to disable them

#

In order to enable cuDNN acceleration on supported device, you should download and unpack HIP SDK extension upon your existing HIP SDK 6.2 installation.

HIP SDK extension: DOWNLOAD

thick summit
#

is there any google collab active?

#

been trying since yesterday

#

tensorboard doesnt work

simple ore
#

make a new code cell and pip install it

dawn copper
# simple ore i dont have AMD card in my PC any more, so I cant test the latest zluda and the ...

So, peeking in at the zluda patch... And forgive my ignorance ahead of time!
But in the zluda patch, it has a line that says
curl -s -L then it points to a github Zluda release. I'll update that link to https://github.com/lshqqytiger/ZLUDA/releases/download/rel.ae0540beb129ffd140226ce956b386619b38f84c/ZLUDA-nightly-windows-rocm6-amd64.zip
? Which that's the newest Zluda release.

And I'll leave the stuff like copy zluda\cublas.dll env\Lib\site-packages\torch\lib\cublas64_11.dll /y alone, I assume? (There are 3 lines similar to that one, all with different .dll files.)

simple ore
#

you can download the .zip manually

#

unzip it to zluda folder

#

and just use the part of .bat that copies the .dlls into site packages

dawn copper
#

Ah, ok. Thank you!

tardy cedar
#

does anyone use software like voicemeter [i think] to modify the mic settings before it gets sent to RVC? i use a usb mic and theres no noise cancelation or anything on it to change settings

simple ore
#

mic -> app -> voice changer

tardy cedar
#

yeah thats true, i tried it and dident really get better results than whats in rvc. do you use that personally?

simple ore
#

it is amazing

tardy cedar
simple ore
#

that was 5 years ago

tardy cedar
#

thats wild lmao, i used it for a while on normal streams but i thought it made the audio quality rough compared to just the noise cancelation in obs

knotty moth
#

by not selecting that option, leaving those files that means training will resume

bleak nymph
#

wait what

#

ohh

#

i understand

simple ore
#

fresh training = i want to clean up the training atttempt and start over

#

it does not delete sliced audios and other things, just D/G and events

bleak nymph
#

yea

simple ore
#

if you have D/G files in the folder the training will resume as long as the number of epoch is larger than saved

bleak nymph
#

can’t wait to continue it tmrw

tardy cedar
grim bay
#

So is there a way to separate a part of the voices in this reverb audio? I need to get that since it was in the lyrics

tame mica
#

no

grim bay
#

Does that change anything?

simple ore
grim bay
simple ore
grim bay
warped thistle
#

Is a 3060ti 8GB TI LHR good enough to have a realistic sounding voice? I've gotten some models alright but ALL models give me static during long sentences.

#

I'm using MMVC

hallow thistle
warped thistle
#

I thought RVC meant realtime voice change

#

my bad

hallow thistle
#

In this context, RVC doesn't refer as realtime voice changer. RVC is another program that used to do AI voice cover.

vapid mantle
#

If I change the batch size to 3 instead of 4, will the quality be better?

#

@hallow thistle

#

Do you have any information about this subject?

simple ore
vapid mantle
simple ore
#

depends on the set

#

small batch = smaller steps, may get stuck in a local minima

#

very large batch = big steps, may overshoot the target

#

it is all about experimenting

vapid mantle
#

ok

hallow thistle
viscid moss
light pelican
#

Hey which rvc i should use for training voice? I have RTX 3060 12gb

should i go for mainline or applio pls?

light pelican
hallow thistle
grim bay
viscid moss
grim bay
viscid moss
knotty moth
grim bay
viscid moss
grim bay
#

Btw, when I use the UVR-DeEcho-DeReverb model, this works okay but idk if this model is the best option

viscid moss
#

Use the karaoke model first and then the dereveb model

viscid moss
knotty moth
# grim bay Similar result when using the Sucial ones

sucial ones are worse, and most lead vocals & speech under normal mics are centered so it should be easy to separate

the remaining one might be room reverb which can be removed using dereverb mono or RX 11 dialogue isolate

knotty moth
viscid moss
#

I see

knotty moth
#

uvr de-echo can also work quite more aggressive

#

though note that it basically noise gates weak signals

grim bay
viscid moss
#

Damn

grim bay
viscid moss
#

As far I know, nope yt_nails

grim bay
simple ore
#

and then end up with junk

#

but give it a try

heavy arrow
#

it takes a min for my local installation to start and i dont got enough resources to run it with UVR at the same time so im fishing for opinions before i try

simple ore
#

garbage in, garbage out

#

using it as a training set? double garbage

#

it is a repeated nonsense

viscid moss
#

(-)*(-) = +

viscid moss
heavy arrow
# simple ore inferring that file?

not using it as training or anything, literally just putting annoying orange voices over it to mock it
i get that its bad. thats why im putting shitty ai over a shit song. MYOB and answer my question next time instead of scrolling a help channel to start an argument

heavy arrow
viscid moss
#

XD

heavy arrow
viscid moss
#

-hf

karmic oliveBOT
heavy arrow
#

sweeet, tysm ^^

viscid moss
#

Ur welcome

bleak nymph
#

hi guys

#

i've been training for a few hrs

#

my model is at 355 epoch

#

but the tensorboard line is still flat

#

idk what that means

#

it's still improving a lot? or no

#

doesn't seem tobe going up

bleak nymph
analog obsidian
bleak nymph
#

how do i do that

analog obsidian
#

im not on my house rn lol

bleak nymph
#

o

bleak nymph
#

im not good at it

#

OH ON THE SIDE

#

i'm dumb

#

sorry

bleak nymph
analog obsidian
#

and press the third square button thats below the graph

#

wait

#

hold on

#

you should be looking the scalars tab

#

go there

#

gray graphs = not good

analog obsidian
bleak nymph
#

this one right

#

it's still improving ig

#

395 epoch almost 400

#

it seems if it continues at this rate, even 500 epoch will not be enough (i think)

analog obsidian
latent cypress
analog obsidian
#

around there

#

i cant tell

#

need a more smooth graph

#

anything above 15k is rising and overtraining

latent cypress
#

oh i see

bleak nymph
bleak nymph
#

wait i thought lower was better

#

actually, isn't this the best one?

#

idk how this really works

analog obsidian
#

choose the lowest point before 20k

bleak nymph
#

i stopped the training for now

analog obsidian
#

also show me the graph bc im not sure when it started to rise

bleak nymph
#

should i open it back up

analog obsidian
#

started to die around 15-20k but i want to be sure

bleak nymph
#

which graph should i show u

analog obsidian
bleak nymph
analog obsidian
#

click third box

bleak nymph
#

3rd box?

#

ah

analog obsidian
#

you still havent disabled ignore outliers bro

bleak nymph
#

wait it's cut off 😭

analog obsidian
#

no???

bleak nymph
#

this one

#

right

analog obsidian
#

in the scalars tab bro

#

ignore the cringe gray graphs

bleak nymph
#

i aint the sharpest tool in the shed

analog obsidian
#

choose the lowest point in the 10k zone

#

you know which one

#

pretty obvious

bleak nymph
#

does that matter

#

u mean tthis point, right?

analog obsidian
bleak nymph
#

alr

#

how do i choose it

#

it's 10.6k

analog obsidian
#

find the epoch closer to that number

bleak nymph
#

this one, right?

analog obsidian
#

yup

bleak nymph
#

alr sick

latent cypress
analog obsidian
#

its always the next epoch after that value just in case
if its 12.4k you choose 12420 ish, never one prior like 13800

bleak nymph
#

we all start from somewhere tbf

faint elk
#

iis there a guide somewhere on how to get rvc the newest version running? im a bit confused

latent cypress
bleak nymph
#

lmao

#

the model sounds

#

interesting

simple ore
#

and gradients

analog obsidian
#

well its a noisy dataset

#

also like noobies said
every losses actually matters

#

not just that one exactly

simple ore
#

G loss may go down like that, but if g norm is in 50000+ range, it is likely messed up

#

then there are things like this

#

disc is getting better, so fm loss is slowly going up and norm g also going up because of that

#

but as long as it is not a sharp rise from 2 to 10 it is fine

analog obsidian
simple ore
#

I saw that with finetunes when I used a very different dataset comparing to the pretrain.. .like singing dataset vs speaking pretrain

analog obsidian
#

idk how seoul managed to keep fm going down

simple ore
#

training a voice on his own hifigan klm4.9 pretrain

#

I bet

analog obsidian
#

loool yea

simple ore
#

lemme check that

analog obsidian
simple ore
analog obsidian
simple ore
#

next test is default pretrain

analog obsidian
#

nice

simple ore
#

gonna take about an hour to finish this

#

just doing 100 epochs

analog obsidian
#

its fine, fm raises pretty quickly

bleak nymph
#

um

#

chat

#

my model trained on me speaking only works better on singing then it does on actual speaking

analog obsidian
#

xD

bleak nymph
#

😭

#

i tried putting in a file of speaking but it sounded nothing like me because it copied the range of the original speaker (way higher than me) so i set it to -12st and it became so raspy and robotic and cursed

#

it never usually does this

analog obsidian
#

well your dataset is noisy and uncleaned

bleak nymph
#

that's not the issue i dont think

analog obsidian
#

pitch too low

#

increase it

bleak nymph
#

really? usually an octave works fine but alr then

analog obsidian
#

🦈

bleak nymph
#

thing is i'm a bass he's a tenor

#

soooo

#

yeah

#

i tried -6 still weird and dont sound like me

analog obsidian
#

try 0

bleak nymph
#

i did try 0

#

it was way too high

analog obsidian
#

-3 then

bleak nymph
analog obsidian
#

the thing that happen at 0:05 is when the dataset lacks vocal fry

bleak nymph
#

vocal fry? i don't know techniques that much

analog obsidian
#

aka raspy voice

bleak nymph
#

u mean like raspiness right

#

ah

#

my voice is usually rough and raspy which is the interesting part

analog obsidian
#

idk how you sound sooo

#

and i dont know the original audio either

bleak nymph
#

oh right 😭

#

that's true

analog obsidian
#

🦈

bleak nymph
#

i just got off an hour long vc lol

analog obsidian
#

and the audio used for inference?

bleak nymph
#

it couldn't be more different

#

imma try smth that everybody knows

analog obsidian
bleak nymph
#

fitness gram pacer test lol

bleak nymph
#

in interviews it's a lot lot more

#

when singing, not raspy at all i don't think

analog obsidian
bleak nymph
#

woah

analog obsidian
#

you just need to have raspiness in the set to inference raspy voices

bleak nymph
#

yeah seems like it

analog obsidian
#

also voice similarity depends in various complex factors

bleak nymph
#

do accents play a big part in it?

#

i have a weird blend of both american and british

bleak nymph
analog obsidian
bleak nymph
#

so a consistent accent?

analog obsidian
#

consistent voice

bleak nymph
#

ah

#

like volume

analog obsidian
#

so not mimicking someone's else voice

bleak nymph
#

well i wasn't mimicking anyones voice in my dataset

#

thing is i'm a complete noobie at singing too so i copy other people's voices for some reason 😭 it's so weird and idk why i do that

#

i feel like if i added singing to the dataset it'd be worse

analog obsidian
#

you were laughing a lot in the dataset?

bleak nymph
#

i don't think so?

#

i was playing games tho so possibly

#

it has a bit of shouting

analog obsidian
#

but i think what affects the most is when the set is too monotone

bleak nymph
analog obsidian
#

that usually causes the voice to not sound exactly like the original one

analog obsidian
bleak nymph
#

that's interesting, i would assume that monotone voices are easier to make ai's out of than expressive voices, right? but no apparently not

analog obsidian
#

monotone voices are easier to train

bleak nymph
#

oh

analog obsidian
#

but the least accurate ones

bleak nymph
#

yeah that's what i meant

#

i would assume it'd be the more accurate ones but nah

#

the apple one is ridiculously accurate though it's crazy

#

too bad it's just text to speech

analog obsidian
#

try to infer voices closer to yours

bleak nymph
#

what do you mean by that

analog obsidian
#

don't try to inference voices too high or too deep

bleak nymph
#

yeah no it wasn't

#

it was a regular VC i was in and i recorded myself in audacity

#

i didn't sing or go higher than i usually go

#

or lower

analog obsidian
#

my model always sound like the original person regardless of what i try to inference because the dataset has every pitch he can do

bleak nymph
#

yeah but every pitch i can do is not a lot

#

at all

dusty bramble
analog obsidian
#

skill issue

dusty bramble
#

how the model went?

bleak nymph
#

i think that's why it's harder to train ai's on me

bleak nymph
analog obsidian
#

due to his dataset lacking pitches

bleak nymph
#

i haven't really tried inferencing monotone voices

#

i tried a song and it wasn't bad

analog obsidian
bleak nymph
#

that's the full thing 😭

dusty bramble
#

gotta get a dataset of u playing a fun game💀

bleak nymph
#

LMAO

bleak nymph
#

it was a dataset of me talking to my friend and playing lots of games

dusty bramble
#

do karaoke

analog obsidian
#

no need to go crazy in the set in order to have near perfect voice similarity
just have recordings of your whole voice range

bleak nymph
analog obsidian
#

my model was trained using regular conversation, nothing crazy

bleak nymph
#

me straining my voice isn't a part of my vocal range

#

i strain when i sing

#

i don't strain when i talk

#

so the dataset was me talking comfortably and i wasn't straining

analog obsidian
#

speech models arent good at inferencing singing i already told u that

bleak nymph
#

imma try different speaking audio

#

fitness gram pacer test

analog obsidian
#

also the original pretrain was trained using very monotone speech

bleak nymph
#

oh

analog obsidian
#

is not a pretrain made for singing

#

lacks high pitches

bleak nymph
#

hm

#

my highest note i've hit was in the 3rd octave

#

that's quite low

#

is it not

#

i think i just need to get better and more expressive

#

def more of a me fault than ai fault

#

something happened

analog obsidian
#

i remember that error but i cant remember what causes that

bleak nymph
#

rip 😭

bleak nymph
#

i cut it down and it work

analog obsidian
#

O

#

yeah

#

it was actually that

#

now i remember

#

thanks

#

🦈

bleak nymph
#

too high or too low?

#

or smth else

analog obsidian
#

maybe too high

dawn copper
# simple ore see here https://github.com/IAHispano/Applio/issues/999#issuecomment-2736368448

Ok, so. I did all this (properly, I think?)
I can now run Applio. The only error I got when starting it, was

INFO: Could not find files for the given pattern(s).```
Which is odd, but I assume not problematic.

I'm trying to run an AMD 9070 with Applio. Noobies was trying to help me get it set up yesterday, which I'm grateful for.

Buuut I do have a problem. Which is that, when I try to use inference, or TTS, pretty much nothing happens. TTS gave me a driver crash every time I try it. My GPU usage doesn't move on either. On inference, my CPU will use about 15-20%. The Applio cmd window says "Compilation is in progress. Please wait...", so I waited for 15 minutes, and nothing seemed to happen. I've tried different voice models and audio clips. Just for the sake of testing it, I cut an audio clip down to 2 seconds. That's the one I waited 15 minutes on, with no result.
simple ore
dawn copper
#

Oh! I did read that, but I thought that was with the first startup of Applio. So the first time I started it up using the run-applio-amd.bat, I just went afk for about an hour, and came back. My apologies!

simple ore
#

no, it only does it when you run inference and traning

#

inference has a small subset of computing tasks, but anyway.. should take ~20 min

#

index is a lookup table that translates voice features.. like english audio 'the' into german voice model's 'ze'

#

use index search > 0

dawn copper
#

Okay. I Got the first Zluda thing done. Now it's just taking a couple of seconds on inference and TTS. However, audio is completely blank. On TTS, there's just a Pop! sound at the start, and end. On inference, it's purely silent the entire time. I opened it in Audicity to double-check, and yeah, it's just a flat line.

Sorry for being a pain. I am trying to search if there's a solution to my problems, before I ask.

graceful latch
#

why cant i send pic here idk how to describe the problem

simple ore
#

close applio, edit app.py and comment out line

#
import rvc.lib.zluda
#

with #

#

then try again

dawn copper
#

The place where I input audio is working. But the output (Bottom right of Convert), has no audio being output.

I did as you said, so now app.py has

# import rvc.lib.zluda```

Now, as soon as I click Convert, the Applio command prompt gives:
```  File "C:\Applio\env\lib\site-packages\torch\functional.py", line 665, in stft
    return _VF.stft(input, n_fft, hop_length, win_length, window,  # type: ignore[attr-defined]
RuntimeError: cuFFT error: CUFFT_INTERNAL_ERROR```
#

Oops, that's only the last line. It starts with
An error occurred during audio conversion: cuFFT error: CUFFT_INTERNAL_ERROR
And then it shows a total of 8 files. I can paste it all in here, if you like. And again, thank you for trying to help me with this! I do appreciate it.

simple ore
#

okay, revert back

elfin geode
#

I want to ask, I want to dub my own animation using rvc for female voice. If I use rvc will I get into trouble like copyright. How do I know which rvc model doesn't have that problem?

simple ore
#

it could be the model crashing with fp16

#

the empty audio i mean

dawn copper
#

I'm trying it in fp32, using Edge now instead of Firefox... Juuust in case that matters!
Annd it just finished. Same thing. No audio.

#

And I did restart Applio entirely, after changing to fp32.

dawn copper
#

Is rvc/infer/infer.py The only part I need? Or should I be using everything in that page?

simple ore
#

just that for inference

dawn copper
#

I did try to use just the infer.py changes (By opening infer.py, and then copy/pasting everything from the new one on github. If there was a better way to do this, that just shows how ignorant in all this I am!)

Annd I get yet another error. It ends with ModuleNotFoundError: No module named 'soxr'
But starts with Traceback (most recent call last):, then lists 12 files. Last file being C:\Applio\rvc\infer\inferp.y

simple ore
#

env\python -m pip install soxr

#

from applio folder

dawn copper
#

It is one thing after another with me, I'm so sorry!
RuntimeError: expected scalar type Half but found Float

#

I tried on fp16 and fp32. Same thing either way.

simple ore
#

where?

#

only means you missed some .float()

#

hm

#

or perhaps need to also change lib/predictors

dawn copper
#
  File "C:\Applio\rvc\infer\infer.py", line 602, in convert_audio
    audio_opt = self.vc.pipeline(
-------
  File "C:\Applio\env\lib\site-packages\torch\nn\modules\conv.py", line 306, in _conv_forward
    return F.conv1d(input, weight, bias, self.stride,
RuntimeError: expected scalar type Half but found Float```

I cut out a *bunch* in between, so it'd fit here.
#

There were like 10 or so other files.

simple ore
#

okay, change rvc/infer/pipeline

#

and those two lib/predictors

#

or alternarively, just clone the whole repo

dawn copper
#

... Ok, forgive my noobishness 😛 (Yet again)
I'll open cmd in Applio's folder, and do
git clone https://github.com/IAHispano/Applio/commit/837b945c69d3c057e5de452ef5af7d252ca8293b
Correct? I don't wanna mess things up.

#

(With that github link, being the one you sent a bit ago)

#

I have to step away for 30 mins or so. Once again, thank you so much for taking your time to help me!

graceful latch
#

can someone help me make a stupif ai song cover i needed it for a video but i tried half an hour yesterday trying to make one with rvc and it always shows some error

simple ore
#

no, whoe applio

simple ore
graceful latch
#

wdymm?? i know nothing about this and i cant send image here😭😭 i have no clue

simple ore
#

no telepathy either

graceful latch
#

well uhm sorry for bothering then i actually have no clue how to say the error

simple ore
#

by providing a screenshot?

graceful latch
#

i dont have img perm

#

if i had i wouldve sent it the first message here💔💔

simple ore
graceful latch
#

okay okay i will thanks 😭😭

dawn copper
# simple ore https://github.com/IAHispano/Applio/archive/refs/heads/main.zip

Surprise, surprise!.. It's me again >.>
I backed up my Appolio folder first. Then downloaded the file you linked here, updated my current install with this new main.zip.
And I'm greeted with a new error, sadly.

  File "C:\Applio\env\lib\site-packages\gradio\queueing.py", line 624, in process_events
    response = await route_utils.call_process_api(
--- (lots more files here, ending with:)
ModuleNotFoundError: No module named 'rvc.lib.algorithm.generators.hifigan_mrf'; 'rvc.lib.algorithm.generators' is not a package```

I went ahead and tried to do `env\python -m pip install`, along with the two names of the things listed. Since this error seemed the same as the `soxr` one from before. But both time I got an error saying `Could not find a version that satisfies the requirement rvc.lib.algorithm.generators.hifigan_mrf (from versions: none)`
#

I ran the error through ChatGPT (for better or worse?), and it said to put a blank file called __init__.py in rvc/lib/algorithm/generators/ (assuming the hifigan_mrf.py file exists there, which it does)

That got rid of the error, but I'm back at step 1: No audio comes out from anything I do in Applio.

simple ore
#

of course that's not a solution

#

the file should be there

dawn copper
#

Yup! The file is there. But as soon as I click the Convert button in Applio, it gives me the error
ModuleNotFoundError: No module named 'rvc.lib.algorithm.generators.hifigan_mrf'; 'rvc.lib.algorithm.generators' is not a package

#

Well, it has the whole

  File "C:\Applio\env\lib\site-packages\gradio\queueing.py", line 624, in process_events
    response = await route_utils.call_process_api(
--- (lots more files here)```
Stuff above that.
simple ore
#

that's on you.. i've just installed that main.zip, and everything works

#

so perhaps you did not unzip things properly, did not move all files over

knotty moth
dawn copper
#

Just to double check, I redownloaded it. Extracted using 7zip. Opened the newly downloaded Applio-main folder, where I then see all the files (Such as run-applio.bat).
I select all files, then go to my main install folder of Applio (C:\Applio), and move all the files there. It asks to overwrite, I click yes. All the files move over. Then I get that error again.

knotty moth
#

also the main repo is the most recent compared to the precompiled one

dawn copper
simple ore
#

well, the only thing you need to do is run the install and then replace torch and patch zluda

dawn copper
#

That's what I tried the first time around, and now we're here! I have no luck with this, it seems. I'll try it all again, but probably tomorrow.
Thank you for all the help though. Sorry if I've been a pain! I've been trying, but am waaay out of my element with all this.

hallow thistle
knotty moth
brittle wing
#

Hi, just wanted to ask about how people uploading their models have those vocal only files to show them off? Does anyone know what website they use to produce those?

hallow thistle
brittle wing
#

Or programs*

lunar quartz
#

Guys i'm having Trouble making the VC client Work. The only think that works is Beatrice T^T Also can someone send me or atleast message me a Tutorial on Downloading the VC Client Properly? I'm Strugging so hard 😭 Also i have Python and Pytorch and Cuda Installed Just tell me if Python and Cuda and Pytorch ain't needed but Overall the question is I want help installing the VC Client Properly T^T. Also I'm using a PnP Speaker and Mic not a Cable Jack or something

#

I just want some help T^T

#

Also Please?

#

I'm gonna wait.

hallow thistle
knotty moth
faint elk
#

is there a tutorial how to record something?
LIke a song or something with the realtime voice changer
I tried setting up virtual audio cable but im confused about how to route it and where to

analog obsidian
#

place your model in the logs folder

#

then for conversion follow the instructions of the gui

fading sentinel
#

whats rvc?

#

os there a channel for music creation?

analog obsidian
#

does not stand for realtime voice changer

#

but retrieval based voice conversion

fading sentinel
#

i see

#

so this discord is things like that mostly? was hoping it was big on music

analog obsidian
fading sentinel
#

ok thank you

knotty moth
#

but for recording, it's recommended to use applio rvc on prerecorded audios

brittle wing
#

The phase Fixer cell

#

There are options like "transfer magnitude" "transfer phase"?

knotty moth
analog obsidian
#

phase fixer is for instrumentals

knotty moth
#

it's not related to the rvc I was referring

brittle wing
#

On what kind of files

knotty moth
brittle wing
#

It's for noise removal okay what is "transfer magnitude" for?

knotty moth
knotty moth
faint elk
#

where's can I read the guide about applio rvc? I didn't find anything in pinned
https://applio.org/

is this the correct site?

low shard
faint elk
#

though i use prerecorded audio

low shard
#

so, you need RVC or wokada?

#

also your gpu is good

#

btw, Applio is an RVC Fork (modified version)

faint elk
#

i see
UHm well
I'm not sure what i need?
I just have pre recorded audio clips which i need to turn into a funny voice..

#

I already have the models
they are rvc models or something

low shard
#

what's your OS?

faint elk
#

i use windows

low shard
#

this is our documentation

low shard
#

also, our docs ofc have applio info

hallow thistle
#

RVC and W-Okada are two different programs. In this context, RVC isn't an abbreviation of realtime voice changer.

simple willow
#

where do i find the pitch

pastel oak
simple willow
#

maybe?

#

there is this extended pitch slider

#

so I assume there is an option for this like volume

pastel oak
#

what program

simple willow
#

it's general settings in voice ai

pastel oak
#

We dont offer support for voice ai

#

paywalled garb

simple willow
#

oh XD

#

HAHA

pastel oak
#

We can suggest u open source options

#

whats ur gpu

simple willow
#

rtx 2060s

pastel oak
simple willow
#

thanks man big help

grim glen
#

how do i get to the google collab for appolio

brittle wing
low shard
hard shadow
#

I'm getting an error on Applio that says "No module named 'tqdm.auto'". Any help, please?

low shard
cedar canopy
#

which fork/guide am i supposed to install/follow to run local inference on apple silicon?

simple ore
cedar canopy
#

ah that sucks

simple ore
#

MPS torch is not there yet

cedar canopy
#

i was under the impression that torch added metal support

#

thank you!!

simple ore
#

it only works in a single thread last time we checked

cedar canopy
#

for how much they push the AI capabilities of their arm chips you really cannot run much on them besides LLMs

#

at least UVR is fast on it

simple ore
#

opm_num_threads = 1, otherwise it just hangs after 1st inference

#

apple has its own AI model, it is tricky to convert RVC to it

#

Core ML

brittle wing
#

Anyone have any idea why AttributeError: 'NoneType' object has no attribute 'pipeline'
Is showing up every time I try to synthesize on Applio?

low shard
low shard
simple ore
simple ore
#

cpu works fine

#

since I dont have mac I cant do anything about it

cedar canopy
low shard
#

I checked it fast and it seems enabling mps

simple ore
#

so 3.2.7 should work?

#

technically it can be enabled back

#

rvc/configs/config.py

#

but I dont recommend

simple willow
#

volume increase nowhere to be found

low shard
low shard
simple willow
#

oh

broken vortex
#

can someone tell me why the uploadbutton while uploading models doest work

#

@low shard

bleak nymph
#

i'm kinda curious to see what it'd sound like if i made an ai of me talking and singing for 3 hrs 😭

#

it's 3 hr long audio file of me

#

insane

#

probably would be better than my current one

knotty moth
#

unless the audio is split at most 30 mins each

#

depending on ram capacity

bleak nymph
#

ohhh interesting

#

i mean i have 32 gigs of ram 😭

#

quite a lot

tight linden
#

is applio good for mac if I want to use ai voice model to say a specific word?

knotty moth
tight linden
#

Ok.

hallow thistle
knotty moth
bleak turtle
#

where do you download the voice changer (not models)

bleak turtle
#

thanks

broken tulip
#

Not sure if this is the right place and I don't have the software... but what software do I need to mess around with songs so that I can convert one voice to another?

low shard
low shard
#

I'm guessing you want RVC

broken tulip
#

And is there a specfic website to grab this software?

low shard
broken tulip
#

Nvida, I think.

low shard
broken tulip
low shard
#

some nvidia gpus could be too old, some others are good, some others are faster

#

I can't know if your gpu is good enough by just the brand

broken tulip
#

GeForce RTX 3060 Ti

low shard
#

ohh 3060

#

nice

broken tulip
#

I made a typo!

low shard
#

As you got a good PC, you can use RVC locally, you can choose between:

  • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
  • Mainline: The original RVC
#

I would suggest you to use applio

broken tulip
#

Thank you.

low shard
broken tulip
low shard
broken tulip
#

Doesn't look like that on my end, but I'm gonna guess it's the ApplioV3.2.8-bugfix.zip

broken tulip
#

Thank you. I got it now. 🙂

broken tulip
low shard
broken tulip
#

Oh.

low shard
#

you can either try to lower the index (which contains the accent), or use another one

analog obsidian
#

looks like its a model problem

#

or using crepe with a high hop length can cause that too

broken tulip
#

Or I suck at this so far.

#

XD

knotty moth
broken tulip
#

fair...

dusty rune
tight crane
#

does anyone know how many epochs i should set for like 5mins of audio?

dusty rune
#

the guy im doing barely talks

livid cosmos
#

So I am using RVC v2 Disconnected, I also have done Feature Extraction and got this error? What did I done wrong?

/content/Mangio-RVC-Fork/logs/Kilixa_v1/3_feature768

Exception Traceback (most recent call last)
<ipython-input-38-86df4775937e> in <cell line: 0>()
29 listdir_res = list(os.listdir(feature_dir))
30 if len(listdir_res) == 0:
---> 31 raise Exception("No features exist for this model yet. Did you run Feature Extraction?")
32
33 try:

Exception: No features exist for this model yet. Did you run Feature Extraction?

jovial leaf
#

For RVC fandubbing projects

low shard
#

you shouldnt use it at all

#

it's based on mangio fork, an rvc fork abandoned since 2023

#

its slower than mainline and other rvc forks

#

and is more unstable

low shard
#

whats your pc gpu and what do you want to do

livid cosmos
hallow thistle
#

It's crazy to think that people using RVC v2 Disconnected in 2025, even if there are the Mainline and Applio available.

hallow thistle
#

Just give the locally program a try.

livid cosmos
#

Which is the best way to do it locally?

low shard
#

As you got a good PC, you can use RVC locally, you can choose between:

  • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
  • Mainline: The original RVC
#

use Applio

#

locally

hallow thistle
livid cosmos
hallow thistle
livid cosmos
#

But yeah, I will try with Applio then

low shard
tame mica
#

does these look fine

#

ive ignored any tensorboard talk since it makes my head burst so idfk atp

lilac gust
#

@austere nexus ma'am what the hell r u doing here

austere nexus
#

LMAO WHY CAN'T I

lilac gust
austere nexus
#

it's the most recent one you sent 😭

simple ore
tame mica
#

she sounds like you put the saturator on max in fl but whatever

#

gonna let it cook fo a bit garfapproves

compact saffron
#

yo how can I change the effect of a voice model? i.e. i wanna sound to 25% like juice, but still recognize my voice, dm me if you can help

nocturne mica
#

Can I find here some modules with russian accent? Because using an English one cause some issues

low shard
#

be sure to not use ty tuts

low shard
#

You can search rvc ai voice models at:

if there isnt one, you can:

earnest muskBOT
eager rivet
#

yo

#

what software should i use to convert vocals

#

i have a rtx 4060

low shard
# eager rivet i have a rtx 4060

As you got a good PC, you can use RVC locally, you can choose between:

  • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
  • Mainline: The original RVC
#

I gave you written guides, read them and lmk

eager rivet
zinc herald
#

who is good with making ai songs (like fake songs)

#

i need help

eager rivet
#

maybe help instead of being a dickhead

low shard
eager rivet
#

idk i downloaded it a long while ago

low shard
#

there is a written guide

eager rivet
#

had it for a while

low shard
#

be sure u got the latest version

eager rivet
#

for some reason whenever i do the run applio bat thing nothing opens in my browser

low shard
eager rivet
#

this the download right @low shard

candid meteor
#

can anyone help me? i download a model in Applio the model does not show up

#

nevermind

brittle wing
#

-colab

karmic oliveBOT
# brittle wing -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

eager rivet
#

@low shard

#

i opened the runapplio bat file

#

and nothing happens

opal quarry
#

Hola alguien tiene link de rvc?

low shard
#

Also elaborate ur PC GPU and what u want to do

low shard
eager rivet
#

also in my c drive

low shard
eager rivet
#

whenever i upload my index i get this error

low shard
#

Also share which index file download link

unique rock
#

Can someone help me with this?


FileExistsError Traceback (most recent call last)
<ipython-input-12-e49906b94ca3> in <cell line: 0>()
17 if os.path.exists(time_):
18 shutil.copy(time_, time__)
---> 19 shutil.copytree(source_path, destination_path)
20 print("Model backup loaded successfully.")

2 frames
/usr/lib/python3.11/os.py in makedirs(name, mode, exist_ok)

FileExistsError: [Errno 17] File exists: '/content/program_ml/logs/Ahyeon

eager rivet
low shard
low shard
low shard
#

Not just one single part

#

Also there isn't only 1 applio colab, there's the UI version and the no UI one

eternal oasis
#

Hello, its possible to make a release pre-compiled of Applio included the latest torchaudio for RTX 5000 users ?

low shard
# eternal oasis Hello, its possible to make a release pre-compiled of Applio included the latest...

How to (unofficially) use Applio for RTX 50 serie cards

Follow to download it as said it in https://docs.aihub.gg/rvc/local/applio/

After you extracted the precompiled, go to the path in Windows explorer, write "CMD" and press enter, then in CMD write env\python -m pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/cu128

If you get any already satisfied requirement issue, run env\python -m pip uninstall torch torchvision torchaudio then the command said above

Last update: Apr 01, 2024

unique rock
eternal oasis
#

Ok thank you Nick

low shard
low shard
unique rock
#

I installed the latest version of Applio in queue, then mounted the Drive, then connected the backup, loaded the model and got that error, and the files and name were correct (for retraining) and finally downloaded the custom pretrain to run Applio finally but I can't do it because of that error

knotty moth
unique rock
simple ore
hallow thistle
unique rock
hallow thistle
knotty moth
indigo wolf
#

where do i change voice for singer?

tame mica
grim bay
#

How can I fix this?

simple ore
low shard
viscid moss
#

Hmmmm

#

Weird

#

-colab

patent trellisBOT
# viscid moss -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

karmic oliveBOT
# viscid moss -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

viscid moss
low shard
#

Lemme guess , uv?

viscid moss
low shard
#

oh lol

hot lagoon
#

RVC-AI-Cover-Maker-UI

#

kaggle

#

only happen when trying to switch the Dereverb Model

#

Same with switching the Deeecho one

low shard
verbal oasis
#

Hi, does anyone have a link to EasyGUI google colab with the UI interface?

low shard
#

the creator is busy irl

#

what’s your pc gpu and what do you want to do?

verbal oasis
#

uh really? what can I use?
I've used that a while back to train 3 voices from audio and it was pretty easy to use

verbal oasis
low shard
low shard
#

however that depends on how good is your gpu

#

i mean if you really don’t wanna tell it’s fine, i’m just trying to help out

verbal oasis
#

yeah, better on cloud if there's any option

low shard
#

there’s colab only for inference, others only for training, others that are more automated for ai covers, others for both training and inference yk

verbal oasis
#

ah yeah, I want to train a voice model from an audio recording so that it gives me back a pth and a index file

#

in order to use the model in applio

#

I need only training

low shard
# verbal oasis ah yeah, I want to train a voice model from an audio recording so that it gives...

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

low shard
#

i also gave kaggles and lightning ai links

#

which are other cloud platforms

verbal oasis
#

can I train in applio too?

low shard
#

for example kaggle gives more time and better gpu than colab but its harder to setup

low shard
verbal oasis
#

kaggle is an alternative of colabs right?

low shard
#

Colab: random daily gpu of max 4 hours
Kaggle: 30 hours of free gpu weekly granted, with better gpus, but needs phone number and is harder

Both are owned by google

low shard
verbal oasis
#

thanks, I'll try for now to train on applio on colab

#

have you ever tried to do so?

knotty moth
#

I'm on the kaggle one

verbal oasis
#

is it complex to train from audio?

low shard
verbal oasis
verbal oasis
simple ore
viscid moss
finite pumice
#

guys

#

how can i use the huggingface models in rvc

#

can someone give like a tutorial please

#

@long forge

#

@acoustic scarab

#

on pinikio

viscid moss
# hot lagoon

I'm pretty sure that u are using an outdated version or u are using the outdated link from Shiromiya

#

-kaggle

karmic oliveBOT
# viscid moss -kaggle
📘 Kaggle Notebooks

Note: Kaggle limits GPU usage to 30 hours per week.

patent trellisBOT
# viscid moss -kaggle
📘 Kaggle Notebooks

Kaggle is a Cloud (Remote Good PC) Service that offers 30 hours of GPU weekly, but needs a phone number verification

• **Applio Notebook**

by Vidal Kaggle

• **Hina Mod Original Wokada**
• **Wokada Deiteris Fork**

by Hina & Deiteris
Kaggle

• **UVR5 UI**

by Eddy, ArisDev & Nick088 Kaggle

• **UVR5 NO UI**

by Eddy Kaggle

• **RVC AI Cover Maker UI**

by Shirou & ArisDev Kaggle

• **Music Source Separation**

by Shirou Kaggle

viscid moss
#

@low shard Kaggle links needs to be updated

low shard
#

oh right u took ownership of rvc ai cover maker

#

gonna fix it soon

brittle wing
#

What are some good websites to find rvc voice models?

low shard
# brittle wing What are some good websites to find rvc voice models?

You can search rvc ai voice models at:

if there isnt one, you can:

earnest muskBOT
near axle
#

Can anyone make a new RVC V2 api on replicate?

low shard
near axle
low shard
#

the only updated guides are written ones

#

by the way, what's your pc gpu and what do you want to do?

near axle
low shard
low shard
# near axle Yea

well, yeah you could run it locally but would be slow as hell, cloud (remote good pc) is way better in your case

#

just a reminder:
RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

I can give you cloud links, but I need to know what program you need to give you them

viscid moss
#

Hmmmm

#

I tested with normal deecho and bs roformer dereverb, lemme try that ones

#

cause that aggressive is too aggressive and bs roformer de-reverb it's better than MDX23C

hot lagoon
#

it only happens when I try to switch it to someing like bs roformer de-reverb or change the Deeecho Model to Aggressive

#

The normal settings works fine. It’s just that somehow switching it causes the error.

viscid moss
hot lagoon
#

yes change to BS Roformer Dereverb? dose it

#

and change Deeecho Model to Aggressive

viscid moss
#

ok, that's weird cause i change it to bs roformer de reverb and it's working

hot lagoon
#

🧐

viscid moss
#

lemme try aggressive

hot lagoon
#

weird

viscid moss
#

got an error but not the same error as u

#

mdx23c dereverb seems to not being working

#

ok found, if u choose MDX23C dereverb it gets bugged so if u cange to roformer again it will no work

#

Yep, MDX23C dereverb and Deecho Agg isn't working. The rest of the models are working

hot lagoon
#

Yeah I rest and chose just roformer for dereveb and its working

#

Just Deecho agg and the other Derever is not working

viscid moss
#

Ye, for now use it like that. I'll improve it when the CoverMaker rework is ready

spiral osprey
#

@low shard Hey Nick,could you look at the discord private dm please?

analog verge
#

?colab

#

-colabs

karmic oliveBOT
# analog verge -colabs
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

knotty moth
#

though it doesnt remove delay echoes

viscid moss
#

ye but haven't added that model yet

viscid moss
#

It will come with the rework of CoverMaker

vestal helm
#

what pretrain should i use for weeknd

#

dataset is

#

very clean

muted portal
#

hello beautifull people, if there is anyone out there that knows how to create ai voice models for rvc and has a bit of time to show me how, a dm or mention would be very greatly apreciated <3

pastel oak
atomic sparrow
#

if you training the same model in diferent days , you need to execute the "EXTRACT FEATURES " section in the 2nd or 3rd time? in google colab

rich moss
#

Hey everyone, I’m running into an issue with TensorBoard. Here’s the error I’m getting:

tensorboard_venv already exists, skipping creation and activation...
Launching TensorBoard...
'tensorboard' n’est pas reconnu en tant que commande interne
ou externe, un programme exécutable ou un fichier de commandes.
Keeping the command prompt open...
Appuyez sur une touche pour continuer...

low shard
#

I really hope your not using rvc disconnected or other old yt tuts

hallow thistle
hallow thistle
#

Because the latest update for this RVC program is from 2023. There's a better one available.

rich moss
hallow thistle
rich moss
#

ty

#

and btw how many minutes of audio data do you recommend I collect? Also, where can I find good quality datasets pls?

hallow thistle
#

30 - 60 minutes of combined dataset audio is enough, if you mean by to train a voice model. I'm not sure where the good source to find audio dataset is, it can be from some YouTube videos containing spoken audio.

rich moss
hallow thistle
#

While I can only give some basic ideas on how to train a voice model, some more advanced things go for some other helpers instead. If you have any problem training a voice model, you can go to #📑│making-models.

knotty moth
#

being shorter but stick on one of them is better

knotty moth
#

5 minutes of dataset from a single fine quality source is better than longer one using multiple sources or worse consistency

median island
#

I've tried a couple times to make a cover, but I keep getting a runtime error saying I don't have enough memory when I know I do. Is there something I'm doing wrong?

reef sonnet
#

[Voice Changer] MMVC_SocketIOApp initializing... done. how do i fix this

median island
simple ore
#

you may want to start with explaining what's your GPU, what error message you're getting

median island
#

I don't have a GPU, it reverts to CPU and the message is
RuntimeError: [enforce fail at C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\c10\core\impl\alloc_cpu.cpp:72] data. DefaultCPUAllocator: not enough memory: you tried to allocate 379509352 bytes.

knotty moth
median island
#

I am using CPU

median island
knotty moth