#✨│ai-help

1 messages · Page 158 of 1

odd shale
#

That Mel Roformer drums model is not available publicly.

#

I mean, it's files. So no.

#

Sorry.

obtuse glen
#

ok

#

is this bad?

#

its still at 0%

cyan cobalt
#

thanks man works great, do you have any substitute for uvr as well?

odd shale
odd shale
cyan cobalt
#

thanks

odd shale
knotty moth
#
  • BS roformer 1053 in UVR beta (drum+bass, other)
  • demucs4_ft (drum, bass, vocals, other)
  • some good models in x-minus (https://uvronline.app/) but it needs pro subscription
eternal yoke
#

what are the best vocal remove app

odd shale
eternal yoke
#

does it got limit like 2 times a day or...?

odd shale
eternal yoke
#

alr thanks a lot

rare gobletBOT
#

Ayo? @eternal yoke level 1 !!! lfg

obtuse glen
#

oh i think i found an exe

obtuse glen
#

i forgot to select gpu

#

nvm

#

great.

knotty moth
obtuse glen
#

it was for vocals and intrumentals

obtuse glen
#

me dumb

#

completely ignored it by accident

knotty moth
obtuse glen
#

it doesnt have any vocals

graceful patrol
#

can someone help me im new to this

#

when i download the file it how do i upload

proper shale
#

epochs really vary, but for batch size, I'd say go high, 12+

#

lower will kinda be unstable really

#

idk how much VRAM you have so

eternal yoke
#

i use mangio rvc is it still good

proper shale
#

if you're using T4, go for 16 batch

#

yesn't

it might be more unstable and might not do well... but eh, it's all testing... if you wanna test that it's all up to you

proper shale
#

go for 250 (or 500 if it's fast enough) and check for overtraining... if not, keep going

#

if the training is fast

#

cause it's a 1 hour dataset 😭 can take a while

#

let's hope it ain't the dataset

eternal yoke
#

does those thing affect the ai sound

knotty moth
#

also check mel & kl as well

#

g/mel & kl should be there

reef trellis
#

@pastel oak My gpu is a 11th Gen Intel(R) Core(TM) i3-1125G4 @ 2.00GHz

proper shale
#

you're welcome :)

proper shale
reef trellis
#

@proper shale Intel(R) UHD graphics

rare gobletBOT
#

Ayo? @reef trellis level 1 !!! lfg

fallen panther
#

hi guys, in model training the number decreased to 6.8 but now nothing changes, can u help?

proper shale
proper shale
reef trellis
#

Oh ok ,thanks '^^

proper shale
fallen panther
#

btw previously the final number decreased, but now it doesn’t

fallen panther
#

I don't see any errors there :< or do I need something else?

rare gobletBOT
#

Ayo? @fallen panther level 1 !!! lfg

fallen panther
proper shale
#

how did feature extraction go

#

and preprocessing as well

fallen panther
#

i see "ffmpeg error" in the output information of the 2a step, is this the problem? before that everything was fine..

proper shale
eternal yoke
#

why did the ai sound like it got throat problems any tips

surreal schooner
#

How long does it take RVC-GUI to unpack from zip? Is it a very long time process?

fallen panther
violet heron
pastel oak
eternal yoke
#

i use mangio

violet heron
brittle wing
#

anybody know where i can find a GOOD google colab or just online version of rvc?

#

i cant install rvc itself

wild vapor
#

-colab

azure marshBOT
# wild vapor -colab
☁️ Google Colabs
cosmic holly
#

anyone have crazy lag when using rvc with vb audio virtual cable?

#

it's super jarbled and hard to hear but sounds fine if I'm listening to it using my headpohnes

#

but if i put it through the cable input it sounds terrible

brittle wing
violet heron
quasi maple
#

whats the best

#

-realtime

azure marshBOT
pastel oak
#

but pls make sure you actually got a gpu that is not like 10+ years old

modern surge
#

-colab

azure marshBOT
# modern surge -colab
☁️ Google Colabs
eternal yoke
pastel oak
empty raptor
#

Can you calm my nerves about it

rain iris
#

hi, i would like an up-to-date video tutorial on how to use the ai voices

#

last time i did anything with them was 2 years ago

#

i forgot everything

#

last time i made an ai cover was when the google colab thing still existed

rare gobletBOT
#

Ayo? @rain iris level 1 !!! lfg

rain iris
#

stfu

tidal trail
rare gobletBOT
#

Ayo? @tidal trail level 10 !!! lfg

rain iris
#

now i will be able to make a heavy x medic fanfictrol

proper shale
violet heron
#

It’s like crepe, harvest, pm

odd shale
#

Instead read the docs.

rain iris
#

already did and i successfully made a voice cover

main lichen
#

soo how do u cough withiut the ai tweaking

violet heron
main lichen
#

LMAOO

cobalt pilot
misty elk
#

my download custom pretrain thing in the appolio colab keeps saying


HTTPError Traceback (most recent call last)
<ipython-input-6-962fec43dc94> in <cell line: 44>()
44 for url in pretrained_urls:
45 filename = os.path.join(output_directory, os.path.basename(url))
---> 46 urllib.request.urlretrieve(url, filename)

6 frames
/usr/lib/python3.10/urllib/request.py in http_error_default(self, req, fp, code, msg, hdrs)
641 class HTTPDefaultErrorHandler(BaseHandler):
642 def http_error_default(self, req, fp, code, msg, hdrs):
--> 643 raise HTTPError(req.full_url, code, msg, hdrs, fp)
644
645 class HTTPRedirectHandler(BaseHandler):

HTTPError: HTTP Error 401: Unauthorized

#

how do i fix it

tidal trail
#

change the custom pretrained cell code

proven hill
# cobalt pilot I don't know if it's intentional or not but Ilaria RVC isn't there anymore
#

porcodio <3

junior halo
#

How big of a dataset is too big? Right now, I am sifting through a 4 hour Twitch stream. (don't judge my source it is my best option rn)

placid holly
thin stump
junior halo
# thin stump you would be lucky to have 30 minutes because most of the lines get removed duri...

YouTube....
If you're curious, I am creating a voice model of Gabriel from Ultrakill. I am sourcing from the VA's Ultrakill Twitch VOD from YT where he speaks like the character almost the entire time. I would use the original voice lines from the game, but the echo is messing with the quality. Assuming all of the voice models of Gabriel uses the game audio files, it is precisely why they all sound like shit. It's the echo and reverb that are messing with the output. The VOD, however, only uses reverb. I know how to remove reverb. I just can't remove echo. I've decided to make a hopefully higher quality voice model of him using this exact source.

thin stump
#

We do have a tool call Dialogue Isolate for echo. I'm assuming you're using RX11 already? I'm doing a twitch vod/youtube myself and I can't bring myself to review the whole entire audio so it's split into parts. At least 1 hour and 45 minutes -> currently bring brought down to whatever length because of unremovable mic noises

#

oh yeah, that echo is easily removable if you use that. I looked at one of Gabriel's video

junior halo
#

I see. I'm just gonna go through the entire thing and knock out similar phrases.
And no, I am not familiar with RX11. I am using the deverb plugin for FLS.

thin stump
junior halo
#

Ah, neat. I'm curious how this works.

#

I assume it's not a DAW plugin?

thin stump
#

it just gives you plugins, which you can also use on FL

junior halo
#

By "paid," you mean it's got a free trial or something?

thin stump
#

free trial? joe_what

faint steeple
#

does anyone know why voice ai keeps crashing everytime i try to speak with my voice changer?

knotty moth
knotty moth
graceful patrol
#

when i download a girl voice model it doesnt sound good at all

violet heron
#

are you sure you aren’t using a troll model

#

Make sure your using a good tune

edgy fjord
#

I can retrain a voice?

pastel oak
edgy fjord
warm oriole
#

guys, while i`m downloading model at Google Colab it says JSONDecodeError no matter what model i use. What am i supposed to do to fix it?

rare gobletBOT
#

Ayo? @warm oriole level 1 !!! lfg

pastel oak
pastel oak
#

use -colab in this channel for working ones

edgy fjord
rare gobletBOT
#

Ayo? @edgy fjord level 2 !!! lfg

warm oriole
pastel oak
#

-colab

azure marshBOT
# pastel oak -colab
☁️ Google Colabs
mild quiver
#

any way to apply voice to a recording?

pastel oak
calm crown
#

Guys, does someone knows why some of voice models breathing sound sounds like a robot?

glacial pollen
quaint dock
#

hey guys, is Applio safe?

#

sorry for the stupid question

pastel oak
#

We use it

quaint dock
#

alright, thanks!

patent pasture
#

I trained model for 500 epochs and found out that 1 epoch = 7 steps. 3927 / 7 = 561. How?

glacial pollen
#

provided you use a manual sync approach

#

tho, that should not happen

#

never in my training career encountered such issues

patent pasture
glacial pollen
#

or is disconnected using " auto syncing " bullshit

proper shale
proper shale
#

but it's Mangio RVC and I've heard it ain't accurate when it comes to tensorboard logs

glacial pollen
glacial pollen
#

@red kayak Hmmm, I think Imma update my fork - specifically, move from mainline base to fumiama's base

#

noticed some changes here n there, esp for training pipeline itself

#

after I'm done, will need some testers. you up for it maybe? ( will probs add some different gradient clipping in case of fp16 hence why I'd need testers as I have only so many sets on my disposition

unkempt flame
#

Hello, I want to use V1's voice from ultrakill and I don't know how to use it, can somene explain me how to sue this AI concepts?

wispy lodge
knotty moth
glacial pollen
#

bfloat might be a lil more stable than fp16 cause better hparam compatibility but sacrifices a lil bit of performance ( model quality wise ) but it's too small to notice for casual users and there aren't any gains

fp16 has it's own fair share of problems; exploding gradients, NaNs at the start so, awful training initialization stage and is more prone to single-mode collapses

fp32 is obviously the best choice for quality models, sadly it's naturally the slowest ( unless.. well, you can use tf32 but let's be real, not a lot of people here can use it.

unkempt flame
#

use*

#

My bad

tame mica
#

uhh barely any1 uses v1 models anymore but it should still be useable if i'm not wrong

#

-docs

azure marshBOT
unkempt flame
#

And can you help me knowing where or how to use this models? pease, I just- dont know lmao

rare gobletBOT
#

Ayo? @unkempt flame level 1 !!! lfg

tame mica
#

there's tuts on how to use the model in the docs

red kayak
#

do

#

thats how ull get the latest stuff

#

rvc boss isnt interested in updated rvc really

#

:p

unkempt flame
red kayak
#

so let the community do its thing

pale sorrel
#

is anyone else having problems training with applio?

subtle phoenix
#

Is the RVC v2 architecture multilingual/language neutral?

analog obsidian
rare gobletBOT
#

Ayo? @subtle phoenix level 1 !!! lfg

glacial pollen
#

Just concerned a lil about my own integrations and 'older' ported mechanics but we'll see.

#

but guess people do be taking inspirations from me lol

glacial pollen
#

you see, a lot of stuff I had in was slightly rewritten mangio code

#

to have it compatible with new structures

#

fumiama further did split a lot of stuff even further and changed some integrations but ye, should be fine

#

just wonder why they disabled the vhq sox

#

hm

analog obsidian
#

Fumiama saving rvc development

glacial pollen
#

nonsense

red kayak
glacial pollen
#

sox resampler

red kayak
#

ahh

glacial pollen
#

set to very high quality

#

it's commented out for whatever reason

#

I personally do use it

#

other than that, envelopes

#

so most of stuff Imma keep in

#

but ye, can't wait to test everything out

red kayak
#

well if it causes less aliasing

#

then good

glacial pollen
#

sox vhq is the best algo you can get really

analog obsidian
#

is fp32 training possible in fumiama rvc?

glacial pollen
analog obsidian
#

yay

red kayak
#

and then also the config.json too

glacial pollen
#

ye, cause it's set once per machine

#

from then onwards, only " inuse " ones are used

#

so if you change it on your own, that's what it is kept

analog obsidian
#

oh

glacial pollen
#

I still am not sure what was causing optimizers with adaptive mechanics to not work properly

#

so hopefully it's somewhat fixed in fumiama's revision

#

Really want to utilizer Ranger ngl

#

or AdamW with a lil lookahead support n gradient clipping for fp16

red kayak
glacial pollen
#

and so, NaNs also

#

and that's actually a huge deal

red kayak
glacial pollen
#

cause sometimes you get NaN during epoch 0 / 1

red kayak
#

gradients also wont explode?

glacial pollen
#

so, during initialization

#

and that has it's consequences later on

red kayak
#

like from 100 it goes ti 1925

#

huge spike up

#

gradient boom

glacial pollen
#

that on it's own can happen in fp32 too

#

so that's just a matter of param updates

#

or re-sitting

red kayak
#

yeah but will the optimizers help

glacial pollen
#

but it won't cause overflows or underflows

#

so it's out of range

glacial pollen
#

like with everything

red kayak
#

drop me the repo

glacial pollen
#

Lookahead is meant to be looking ahead with small weights

#

compared to normal ones

#

and sorta guide the convergence

#

but then, yea

#

a lot of adaptive stuff I've tried before

#

won't cooperate

#

which is a shame, cause even simple stuff as warmup or lr warmup was out of options

red kayak
#

ig a new gan can help ><

glacial pollen
#

I think it's more bound to how the schedulers were written in there

#

huge mess of a code ngl

#

a lot of stuff being " hardcoded "

#

and taken directly from hparams ( from file

#

rather than dynamically accessed n stored

#

iirc, I saw some changes related to that so, having big hopes

glacial pollen
#

there's no eta

red kayak
glacial pollen
#

you meant the optimizers?

#

or what repo

red kayak
#

repo

glacial pollen
#

yea but, what repo

red kayak
glacial pollen
#

oh, that

red kayak
#

mhm

glacial pollen
red kayak
#

yuppie

glacial pollen
#

mh mh

glad zealot
#

@brittle wingimma make when im bored enough

red kayak
fading rivet
#

what is crepe hop

#

what is it doing

night pier
#

😎 👌🏿

proper shale
# fading rivet what is crepe hop

basically the length it will use to check for pitch variation. lower means it'll get more but it might crack, higher will get less but will be faster and less prone to cracking

if you're using rmvpe it doesn't change anything

glacial pollen
#

steps are directly related to the batch, dataset and so the data loader

glacial pollen
#

it does not

#

yes

red kayak
#

the more u know

glacial pollen
#

steps are directly related to batches and amount of data really

#

yup

#

but again, still can't get more " into it " cause the data loader works internally so

red kayak
#

i'll ask gpt for more info abt the parameters then

glacial pollen
#

Good idea

stark wadi
#

-overtraining

azure marshBOT
# stark wadi -overtraining
Overtraining

You can detect if a model is overtraining if the TensorBoard graph starts to rise and never comes back down. An overtrained model will sound robotic, muffled, and won't be able to articulate words well.

Check these resources to learn more about this topic

brittle wing
#

Does anyone know any tips to make the ai sound less glitchy and make it not voice break a lot?

maiden raft
#

-uvr

azure marshBOT
# maiden raft -uvr
Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tool.

maiden raft
#

what happened to the uvr guide

brittle wing
#

how to install rvc models? theres many files

proper shale
#

id just use weights.gg if you don't wanna handle more complex stuff, involves no installing shit and is simple af to use

brittle wing
proper shale
#

just unzip and use the .index and .pth files

#

ignore the others

brittle wing
rare gobletBOT
#

Ayo? @brittle wing level 1 !!! lfg

rare gobletBOT
#

Ayo? @proper shale level 109 !!! lfg

proper shale
#

dw

brittle wing
#

alright, thank you so much!

proper shale
#

index is optional after all

proper shale
brittle wing
proper shale
# brittle wing Sorry i got another question, how does widgets.gg work or how to use it? specifi...

weights gg is for covers n stuff. they host basically all our models and a bit more

to use okada anywhere else, you need a virtual audio cable

check https://rentry.co/VoiceChangerGuide virtual audio cable section

brittle wing
#

alright tysm

jagged coral
#

guys, what can i do, if some sounds are kind of loopping? i already increased chunk, still have this problem

#

im using radeon rx5700xt graphic card

shut fog
#

-audio

azure marshBOT
gray crow
#

Hey guys, anyone knows if RVC will run on 3.10.14 since im getting this message even tho i always used 3.10.12 and never had that before or just straight up ignored it lol.

2024-07-04 17:45:28,316:WARNING - WARNING[XFORMERS]: xFormers can't load C++/CUDA extensions. xFormers was built for:
    PyTorch 2.3.0+cu121 with CUDA 1201 (you have 2.0.1+cu118)
    Python  3.10.14 (you have 3.10.12)
  Please reinstall xformers (see https://github.com/facebookresearch/xformers#installing-xformers)
open mountain
#

-colab

azure marshBOT
# open mountain -colab
☁️ Google Colabs
pastel oak
#

should use 10.12

open mountain
#

-hf

azure marshBOT
gray crow
maiden raft
#

oh, sorry I just needed the uvr guide

brittle wing
#

So sorry

keen hound
#

does anyone know how to add downloaded vocies into rvc?

pastel oak
keen hound
pastel oak
#

put it there

keen hound
faint cloud
#

anyone know where to put your path and index files in applio?

#

nvm figured it out

ornate gulch
#

hi

rare gobletBOT
#

Ayo? @ornate gulch level 7 !!! lfg

edgy lion
#

what is the recommended amount of epochs to train for a audio sample that is only like a minute long?

crystal sky
#

where can i find some audios that i can just dump into applio to test them.

proper shale
proper shale
#

oh and Applio has their TTS as well .... if that works

empty raptor
#

Can i set mvsep to not to have long ass file names

ashen cipher
#

i have a good voice model, but when i try it out, it sounds like shit...

rare gobletBOT
#

Ayo? @ashen cipher level 1 !!! lfg

ashen cipher
#

what settings do i need?

proper shale
ashen cipher
#

making songs

proper shale
#

pitch? pronunciation?

ashen cipher
#

pronunciation

proper shale
#

mess with "Search feature ratio" in case the accent ain't similar to the person

if that ain't enough it's probably the audio you're using

ashen cipher
#

ok thanks

proper shale
#

you're welcome, lmk how that goes

ashen cipher
#

it sounds a bit better, but still not so good. It's probably because of my audio input file. I extract audio from songs and try to replace the vocals with ai vocals, but I think the vocals have a bad quality

violet heron
ashen cipher
#

i use flac

#

is this good?

proven hill
violet heron
#

YouTube will be lossy

brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
brittle wing
#

-audio

azure marshBOT
junior halo
#

It should be okay if I just feed the RVC trainingt Colab an hour long wav file, right? Or should I break it up?

agile apex
#

so im trying to download my ai covers but when I click download it redirects me to a cloudflare 404 page does anyone know how to fix this

agile apex
rare gobletBOT
#

Ayo? @agile apex level 1 !!! lfg

low shard
brittle wing
#

-uvr

azure marshBOT
# brittle wing -uvr
Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tool.

agile apex
small iron
#

Yo how do i make my own voice rvc

#

On mobile

low shard
low shard
small iron
#

Im completely lost here

low shard
small iron
#

Does it work on mobile

low shard
small iron
#

Or have a complete high quality

#

Of a conversation

low shard
small iron
#

Data*

low shard
#

Be sure that the audio is cleaned and only vocals

small iron
#

Which im mobile

#

Isnt there a mobile version of this

low shard
low shard
small iron
#

"Press CTRL + A to select the whole audio."

low shard
shy hemlock
#

I am training a model of my voice, and was just wondering what the best training settings I can use, mainly the amount of epochs and batch size... i got a lot of audio 7 files in approx 2hrs and 50mins in wav.

small iron
#

Ob mobile

#

On mobile

hasty canopy
#

Hello i was wondering if all the voice models were safe or only the popular ones

agile apex
small iron
#

Ho how do i make a zip file

#

In mobile

#

??

#

How do i clean my database with using phone apps

#

I dont have pc

odd shale
#

What do you mean with "safe"?

#

No model should contain viruses.

agile apex
#

@low shard sorry to ping you but I made a new cover and it let me download it pls don't ban me

low shard
# small iron How do i do this

You can make a folder on your phone and zip it, you can just go to whatever storage manager your phone has built in, and for example for mine i hold the folder and click the 3 dots to compress it to zip

rare gobletBOT
#

Ayo? @low shard level 94 !!! lfg

hasty canopy
#

I mean no risk

hasty canopy
odd shale
odd shale
small iron
hasty canopy
#

ok cause I wanted to try some models in my native language but nobody tested them it seems so idk if it was safe

small iron
#

To remove all disturbing noises and

#

Stuff

odd shale
low shard
small iron
hasty canopy
#

french 🥖

rare gobletBOT
#

Ayo? @hasty canopy level 1 !!! lfg

odd shale
low shard
hasty canopy
#

I mean as long as there's no virus it's good for me

small iron
odd shale
agile apex
hasty canopy
#

In that case it's good

#

lmao

#

thx

agile apex
#

but it lets me download new covers

odd shale
low shard
low shard
agile apex
small iron
#

Which requires pc

potent helm
#

how do I continue training the same model in the same colab I started training it in if I got the Error
Connection errored out error

knotty moth
small iron
knotty moth
sand beacon
#

i need help

#

can someone send goof yt

#

good

errant osprey
#

anyone have any tips or tricks for getting an AI voice to work well with natural inflections and pitch changes while talking (realtime), assuming thats even possible

opal depot
#

Would an rx 580 be fine for running rvc?

craggy sorrel
#

does anyone know how to deal with this error i paperspace?"root@nzdhhlx08h:/notebooks/Mangio-RVC-Fork# make run-ui
python infer-web.py --paperspace --pycmd python
Traceback (most recent call last):
File "/notebooks/Mangio-RVC-Fork/infer-web.py", line 26, in <module>
import faiss
ModuleNotFoundError: No module named 'faiss'
make: *** [Makefile:59: run-ui] Error 1
root@nzdhhlx08h:/notebooks/Mangio-RVC-Fork# "

shy yacht
#

Does anyone know where A RVC model of ajitani hifumi?

pastel oak
#

path could have issues if theres spaces or OneDrive or special chatacters

pastel oak
pastel oak
inland jasper
#

hey so, it might be a stupid question but i'm trying to make my first voice model for the first time and i trained my model and i can't find it, where does it go to?

hard belfry
#

Good day everyone! Can somebody please advise, there used to be a desktop app, within which you could convert audio using voice models you downloaded and even search for other voice models withing the software. Had a nice black interface, please advise if it's not up anymore of if you know which one was it, struggling with finding it online boohooh

pastel oak
glacial pollen
#

if that's the case, it's heavily outdated and nobody uses it / shouldn't be using

#

tho wait.. I read it again, hmmm.. it had a search for models built in.. hmm

#

Applio is the only thing I can think of

#

it has built in search

inland jasper
pastel oak
#

else if you saved every x epoch for models theyre also in assets - weights

inland jasper
#

thank you so much man really appreaciate it 🙏

rare gobletBOT
#

Ayo? @inland jasper level 1 !!! lfg

low shard
#

You can't convert the voice model, even if not natively, you can use tts with rvc models, some forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)

NOTE: RVC isn't as good as GPT So Vits for tts, but gpt so vits can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese

If you wanna do tts locally (if you got a good pc):

If you don't got a good pc you can do it online:

Last update: Apr 01, 2024

Last update: June 15, 2024

subtle phoenix
#

When finetuning or training a pretrain from scratch, the large dataset doesn't have to all of the same speaker right? I checked out the pretrains guide but still need some clarification on this point. Thanks!

hard belfry
analog stream
#

Hi, I just started using this today and have been messing around with various voice models with the intention of recording some stuff, everything was working as expected, however all of a sudden in my recording the voice isn't changing, I have tried to remove the application and such but am unable to fix it, does anyone know how to fix this?

patent pasture
#

I guess Ilаria RVC Zero isn't confidential

pastel oak
patent pasture
pastel oak
#

Not sure

subtle phoenix
#

Thanks

brittle wing
#

@viscid moss can you add bs roformer to Karafan Improved?

#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
analog stream
fresh thistle
#

Sometimes, the output comes out choppy and laggy and even with trying out different chunk sizes, I don't get any desired result
Until I restart and spend more time, and then it works? but once I restart the program and try again, it's still choppy so I'm not sure why

idle hedge
#

is there a way to download illaria rvc to run local on my machine on mac?

odd shale
idle hedge
odd shale
idle hedge
odd shale
azure marshBOT
knotty moth
idle hedge
# odd shale -rvc

sorry, dont want to be a bother, but i only see the windows and linux installation guides, nothing for macs

fresh thistle
rare gobletBOT
#

Ayo? @fresh thistle level 1 !!! lfg

odd shale
#

You can just use Applio Colab, all you need is a google account.

idle hedge
rare gobletBOT
#

Ayo? @idle hedge level 2 !!! lfg

tidal trail
#

aparte que la instalacion es un infierno

viscid moss
brittle wing
tawny tinsel
#

Hi, I have changed both of my desktop input and dscord input to Dubbling Virtual Devices but when I tested on discord sound and voice, it couldnot detect anything

brittle wing
#

Other people are asking for that too.

proper shale
brittle wing
#

@viscid moss

proper shale
#

The input output settings should be like this in W-Okada:

Input - Your mic
Output - CABLE Input (line 1 if you use VAC, or whatever your virtual cable output is)

And in Discord:

Input - CABLE Output (same thing as okada output as well)
Output: Your headphones

basically

tawny tinsel
#

Let me try

viscid moss
brittle wing
#

Karafan is ensemble

#

@viscid moss

viscid moss
#

The problem is that KaraFan has not been updated in 8 months, the results are quite good but it uses the UVR5 code from 8 months ago, which will require more effort to implement the new models (Roformers).

KaraFan is a good tool but it is not receiving updates, for now I can only tell you that I will review it but I cannot promise that I will fix everything completely.

I know that KaraFan makes an ensemble with UVR5 models, that's what I'm working on for UVR5 UI and UVR5 NO UI.

brittle wing
viscid moss
#

It's what I'm working on, along with other projects

brittle wing
#

That's useful

#

When will it be added soon

#

So you have to update the code first ah yes I get it

proper shale
#

give @viscid moss some time ik she can pull off cool shit

viscid moss
viscid moss
#

month

brittle wing
#

Okay

simple hedge
#

A question, what page do you use to make audios with AI???

#

with rvc

crystal sky
#

and it changes the audio

#

there is a text to speech button too

#

but that just uses a basic text to speech

simple hedge
#

I asked what program they use

#

Why does that appear to me?

odd shale
brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
brittle wing
#

anyways what does the allign inputs feature do in uvr v5?

low shard
#

-colab

azure marshBOT
# low shard -colab
☁️ Google Colabs
low shard
#

not training

brittle wing
#

Well then he should use these colabs provided

odd shale
worthy lantern
#

hmm which collab best for using public? 🤔

snow rose
#

How to make text to speech fit song?

azure marshBOT
#
🍏 Applio-RVC-Fork

Enhanced RVC Variant: Optimized Performance Through Modifications, Built upon Mangio-RVC-Fork.

#
Documentation

📚 All-In-One English documentation

https://docs.aihub.wtf/

Full AI Voice Model Training Guide (Local)

Link: YouTube
Credits: Christopher Villanueva

Model training with Mainline RVC

Link: Rentry
credits: Raven (ravencutie21)

AICoverGen Colab Guide

Link: Google Docs
Credits: Eddy (Spanish Helper)

Create a model with RVC disconnected (colab)

Link: Google Docs
Credits: Angetyde

nimble blaze
rare gobletBOT
#

Ayo? @nimble blaze level 3 !!! lfg

worthy lantern
#

have someone a good google collab to use public url?

spice thorn
#

how can i install hina rvc on my pc?

dawn tusk
#

oops, I need help, I've already downloaded the voice change app, but when I press it to start, the voice doesn't come out, can anyone help me?

west cosmos
#

I don't mean to comment over the person above (sorry) but I just wanted to ask if anybody has any experience doing batch conversions on Hugging Face using RVC V2? I do a lot of conversions but I cannot work out how the batch converter works - I can never find the files to download once they're done

rare gobletBOT
#

Ayo? @west cosmos level 1 !!! lfg

spice thorn
#

Try use ngrok

#

Not gradio

worthy lantern
spice thorn
#

Yes

#

I do

proper shale
west cosmos
#

Right, understood. Thank you!

brittle wing
#

rvc cli stopped working for some voice models for no reason

#

do models need replacing every now and then?

#

it'll generate but then it won't make a wav file

analog current
#

guys, do you know anything about applio app? is it better than other apps that use RVC?

glacial pollen
#

lol, and yet you use " ignore outliers "

#

that's not the approach

#

mark it off, then evaluate

#

set the smoothing to 0.70

#

or less

#

100 ( smoothing at 1 ) is for general trending recognition

rare gobletBOT
#

Ayo? @brittle wing level 4 !!! lfg

proper shale
#

looks like a collapse

#

you should be fine using the most recent ones though, probably

violet heron
slow dirge
#

hello when i use rvc a message appear and i cant use rvc someone can help me pls ?Error opening Stream: Illegal combination of I/O devices [PaErrorCode -9993]

naive shoal
#

yo tiny tim!!

#

That's precicely what happens

thin stump
naive shoal
#

To see what would happen if I trained a base model on tiny's baritone then did another model on his falsetto

#

I dont know how that works though at all

thin stump
naive shoal
#

And RVC seems to just kinda mash them together between those ranges

thin stump
naive shoal
#

On top of that, the baritone voice can't ever belt out anything higher up in his range because its trying to mash into the quiet falsetto

naive shoal
#

Tiny can hit the same notes in chestvoice as he can headvoice

#

But the AI tries to force everything past a certain limit into headvoice

#

Or some weird inbetween

thin stump
naive shoal
thin stump
#

v3 soon, no guarantees it will improve from v2 but you never know

naive shoal
#

heh it doesnt even look like itll do anything

#

Why the pretrains?

thin stump
thin stump
# naive shoal Why the pretrains?

well maybe I was onto something because the pretrains have to be trained on hifigan? So if we switched then it would be way better

naive shoal
#

Also whats hifigan

thin stump
naive shoal
thin stump
#

i spelled it wrong shocked

naive shoal
#

My only worry is that with things advancing my GPU will become too old to use

#

Its already going on 6 years old

#

It was topend at the time but now itsj ust old

#

It does the same things as the modern RTX cards but is just slower

#

But if they introduce anything that minimally requires more VRAM ill have issues

thin stump
#

oh, it will definitely increase. I don't know where I read that discussion from

naive shoal
#

(i got the RTX 2080super card)

naive shoal
#

"never hit your grandma with a shovel - it leaves a bad impression on her mind!" XD

thin stump
naive shoal
thin stump
naive shoal
#

He's the perfect mix of comedy and scary

#

And goofy

#

He was plain goofy he couldn't be scary

#

But his looks could be

#

This is him when he was young

thin stump
#

I wonder if that was heavily inspired for Joaquin Pheonix since he's a more grounded guy

naive shoal
#

He had his acts but if you listen to him talking he's a very real guy

#

Catholic and very oldschooly conservative XD

oblique bay
#

Hello, I'm having issues with discord where the virtual audio (or the rvc voice changer) sounds very glitchy. However when I listen to the actual audio of the virutual audio it sounds perfectly fine on playback on my system. I've turned off all noise cancelation and it still sounds glitchy, I suspect it being the transfer over to virtual microphone to discord playback.

violet heron
oblique bay
rare gobletBOT
#

Ayo? @oblique bay level 1 !!! lfg

oblique bay
violet heron
oblique bay
violet heron
#

VAC Lite is free

#

Make sure you download the lite version not trial

oblique bay
#

I see, I downloaded the VAC trial one

cloud jetty
#

I noticed a lot of the RVC models are singing models - do they still work for speaking?

cloud jetty
violet heron
#

Try both and see which one works better

cloud jetty
junior halo
#

How do I resume a training session?

#

(non local RVC)

#

I've started over countless times because I was inactive for too long.

#

I need someone to train this for me. Please.

brittle wing
#

can someone help me with the ai voice?

#

i need help

#

im trying to use the google collabs thing

#

when i click on a model

#

it says

#

Frequent errors occur. Please check if the model of the framework being targeted is loaded.

misty elk
#

--colab

#

shoot

brittle wing
#

can anyone hop into vc with me to help me with starting the training

#

everythings set up and i have my audio ready fully

knotty moth
fair onyx
#

Hi. Is there any way to use RVC with an amd gpu (rx 6600). I want to use inference locally, but I don't need to train my own models.

rare gobletBOT
#

Ayo? @fair onyx level 1 !!! lfg

azure marshBOT
violet heron
#

It will use your cpu

misty elk
#

-colab

azure marshBOT
# misty elk -colab
☁️ Google Colabs
fair prism
#

what do yalls use for tts?

dull dagger
oblique bay
agile apex
#

I made heavy from TF2 sing rainbow connection from the muppets movie and it kind of sounds like he's actually singing it

wispy pawn
#

how can i fix no interface i running right now

rare gobletBOT
#

Ayo? @wispy pawn level 1 !!! lfg

wispy pawn
#

each time i load a local url in google colab it say that

golden karma
#

-guide

azure marshBOT
golden karma
#

-spaces

azure marshBOT
golden karma
#

-rvc

azure marshBOT
golden karma
#

-audio

azure marshBOT
modest abyss
#

Any help with this? (I tried searching the server but couldn't find anything)
I made sure to reinstall Python just in case, delete and redownload the zip, moved the folder to my main drive instead of my 2ndary, nothing

rare gobletBOT
#

Ayo? @modest abyss level 2 !!! lfg

modest abyss
#

It's been awhile since I've done this (last time I tried making a model was when Google Colab was still the top option lol)

brittle wing
dawn dew
#

using RVC GUI for the first time. How long does it usually take to convert the voice after clicking convert button?

modest abyss
dawn dew
#

felt like it's been 15 to 20 minutes so far and it's still converting.

wispy lodge
#

May take a very long time on CPU

#

Especially with crepe f0

dawn dew
rare gobletBOT
#

Ayo? @dawn dew level 1 !!! lfg

wispy lodge
#

Not sure about rvc-gui, but at least rmvpe as f0 would speed up the process a bit

dawn dew
#

ok it just finished

#

it seemed like 40 - 45 minutes. The next one I do I'm going to time it.

exotic mural
#

i finsihed training i think, but my assets/weights folder has no added_(smth).index file

#

i only see the .pth

exotic mural
sick canopy
#

i have a question what does the segment and overlap do?

brittle wing
#

Is there anyway to make the AI voicr sync with your laughing without it sounding like wind

#

Or is it just not possible

proper shale
#

But I'm not sure if that's a limitation or just the lack of laughter in datasets

chrome orbit
#

when i launch the tensorboard in applio rvc it just keeps reloading, is there something else i need to install for it?

brittle wing
#

Where gpu's like the RTX 5090 already being released

proper shale
brittle wing
#

Nvidia is pretty much carrying AI

glacial pollen
#

Laughing compared to speech / singing is more airy, it contains more " noise " type of data compared to phonetics, so, formants and so on

#

base models rvc uses are done on vctk dataset which barely if even had laughing-like pattern samples in it

#

rvc is just not accustomed to handling such well

#

And so, it is not bound to tech or gpus, it is just how the base models were made

#

Laughing is definitely do-able but not in all cases. It's a lil more tricky than that

#

Oh yea, not to mention contentvec rvc uses for " recognition " of what is the content of input audio ( for inference ) - I believe - has some harsh time or issues detecting such " unusual " sounds like that

pastel oak
#

i cough, laugh and moan daddy bulldozer on rvc 🔥

glacial pollen
#

So really, it all comes down to many variables.
Contentvec, base models' dataset used and obviously, quality of models

glacial pollen
analog obsidian
glacial pollen
#

There is no such a thing as perfect X in rvc

#

or AI

analog obsidian
glacial pollen
#

there always be artifacts or irregularities

#

maybe if some mid-way step was to be added

#

such as diffusion denoising or whatever

#

then maybe it'd be very very close to what we'd call perfect

#

I just feel like people expect too much from tech that's been with us for barely few years

analog obsidian
#

ye ai is new tech

glacial pollen
#

not really AI, but voice cloning and reconstruction in a way we do it

#

normally, in fact, you'd want to literally train from scratch your own voice or a dataset

#

not using any " universal " pretrains

#

but having your own end-to-end finetuned

#

so that's also to be added to the mixture

#

People expect way too much from something that was meant to just work for broader audience

analog obsidian
#

O yea i remember someone said our models are actually all finetunes

glacial pollen
#

That's right

#

that's the reason we use pretrained bases

#

our models are just " snapshots "

#

of finetuned generator and discriminator networks - kinda, simplifying it.

#

which were previously trained from scratch on vctk dataset

#

What I think could work is doing a 2 run training but I haven't tried it myself yet.

#

but that'd require you to have a rather big dataset for your model

#

first training is the fine-tune training, you'd then take your G and D from model's folder, and use them ( provided you can adapt hparams properly ) and train again

#

on same voice / person but different set - and that'd be your main training but let's be real, not a lot of people here can do it

analog obsidian
glacial pollen
#

Not really

#

1st run you'd do like normal, but preferably 10-25 mins

#

2nd run would be 15-35

#

but I think 10-15 could work again

#

just each run has different set ( or different section of the set provided the consistency is kept )

#

but if I had to be honest, first run should be as good as you can get it to be

#

'll be your foundation for the main model

analog obsidian
glacial pollen
#

Definitely

crude flame
glacial pollen
#

Go for it

#

tho when I mention that 2nd run should be a different set ( in ideal conditions ) I mean it

#

as the first run ( first model, so, your base ), provided you legit trained it to the best of your possibilities, will be already accustomed to the voice

#

so the risk overtraining is drastically increased

crude flame
glacial pollen
#

content of it

distant turtle
#

-colab

azure marshBOT
# distant turtle -colab
☁️ Google Colabs
glacial pollen
#

the voice should match but the content should be, ideally, new

#

or at least, much more diverse than the base

analog obsidian
glacial pollen
#

Quite possibly

crude flame
glacial pollen
#

Neat

knotty moth
glacial pollen
#

yey. More testers the better.

chrome orbit
#

Im getting an error: 'FeatureInput' Object has no attribute 'device' when trying to extract features

#

whats the fix for this?

glacial pollen
#

and foremostly, what's your gpu

chrome orbit
glacial pollen
#

gpu's detected?
mentions running half precision / fp16 in the log?

chrome orbit
#

says running fp32

analog obsidian
chrome orbit
#

yea its applio rvc

analog obsidian
#

yeah crepe is bugged on applio

#

doesnt work atm

chrome orbit
#

ahh

analog obsidian
#

use codename's fork if u want to use mangio

chrome orbit
#

ight is mangio better than mainline

glacial pollen
#

Mangio is just crepe but allows you for adjustable hop

chrome orbit
#

ight cool

analog obsidian
glacial pollen
#

You'll find more about it in the fork's faq tab

#
  • some more technical info
#

if that's within your interest

analog obsidian
glacial pollen
#

btw, I doubt Imma fully move to " new " fumiama's base

glacial pollen
#

the hash / encryption and such, is a pain in the ass

analog obsidian
#

aww

glacial pollen
#

will be all from there

#

excluding hashing

#

to me it's a nonsense and bs

knotty moth
glacial pollen
#

it tries to read info from models despite most of em made on other rvc

#

and some of that, such as author or few others

#

aren't present

analog obsidian
glacial pollen
#

and if they're not, it spills errors

#

and I can't make it cooperate with older mangio's fork code structure, the hash

analog obsidian
glacial pollen
#

So most of the changes, including training part and few others, will be added

#

but Imma stick to the core of what I have

analog obsidian
#

ye codename has them

glacial pollen
analog obsidian
#

there u go xD

glacial pollen
#

Yea, made quite a lot of these back in the day

#

was tired of constantly repeating myself lol

#

yeah sure, yw

glacial pollen
#

aside of some qol changes, code restructure and few differences here n there, there's not that much of new stuff

glacial pollen
#

so with that being said, gonna stick to new fumiama's structure

#

but most of what I deem necessary, in the core, will be kept as it is

analog obsidian
#

its fine

glacial pollen
#

#

ofc, new features and all of that will be actively ported

glacial pollen
#

btw, there we go

#

example of why fp16 sometimes sucks

#

NaNs

analog obsidian
#

also i noticed that im not getting those NaNs at the start

#

after switching

#

totally worth waiting a bit more

glacial pollen
#

Yep, exactly what I meant

tame mica
glacial pollen
tame mica
#

ah gl

pallid patio
#

Good evening

#

I have a doubt, what a "Strong Version" of a model is? Like, its just better vocals or smth?

glacial pollen
#

there's no such a thing

pallid patio
glacial pollen
#

so like, strong vocal, soft vocal etc or maybe speech

#

cause as long you're referring to rvc models, there's no official or community accepted / agreed terms as " strong " version of model or such

#

so it must most likely refer to the model's speech / singing style it was trained on

brittle wing
#

Does anyone know where I can find that typical AI voice that’s used for Instagram philosophy quotes/ideologies?

snow hazel
#

How do I remove harmonies in UVR?

brittle wing
#

Or something like that, i serarch and try but got nothing

rare gobletBOT
#

Ayo? @worn river level 1 !!! lfg

snow hazel
#

How do I remove harmonies in UVR?

modest abyss
#

Could I get help with this?
Fixed the previous problem, but now this happens, and then crashes shortly after when trying to load Ilaria RVC Mainline

split parcel
#

I had a question, how can I make a voice model?

pastel oak
pallid patio
misty elk
#

-colab

azure marshBOT
# misty elk -colab
☁️ Google Colabs
modest abyss
shut abyss
#

I'm trying to use the voice changer but it says I don't have a gpu, is it because i have a radeon gpu?

#

I dont think it should matter too much since i have a pretty decent cpu

#

but my gpu is way better

proper shale
#

Get the AMD version

proper shale
#

You're welcome! Let me know if you face any issues :)

shut abyss
rare gobletBOT
#

Ayo? @shut abyss level 1 !!! lfg

proper shale
proper shale
#

You're welcome

modest abyss
proper shale
modest abyss
#

GPU is NVIDIA GeForce GTX 1060 6GB, current batch_size is 6, and my dataset is a little under 7 minutes long (was going to make it longer but wanted to use the shorter version for a test thinking it would go faster)

glacial pollen
proper shale
#

You could probably cache dataset

glacial pollen
#

if it doesn't say anything of that sort / says fp32 / full precision
or anything like that

#

Won't do

proper shale
#

Someone said it was because of the FP stuff 😭

#

GDR iirc

glacial pollen
#

That too but caching additionally shouldn't be done on less than 12 gig vram cards

glacial pollen
#

esp for above 7-8 mins sets

#

aside, the gains aren't even that big tbh

rare gobletBOT
#

Ayo? @glacial pollen level 17 !!! lfg

glacial pollen
#

oh yea, then that'd make sense

#

You're training in FP32 mode ( single precision / full ) better quality models but performance ( speed wise for training ) takes a hit

wispy lodge
#

1000 and 1600 series have horrible fp16 performance anyway. It shouldn't be used on them and I think rvc mainline disables fp16 for them

glacial pollen
#

From what I can read, gpus from that ti-

#

yea, was about to say

modest abyss
#

How do I change it? The last time I trained a model was about a year ago when it was as simple as importing the sound you wanted, pressing a button, and have a pretty accurate model in an hour

glacial pollen
#

If you do not see half precision or anything like that in the log ( which you do not ) then it fallbacks to fp32, which is the case for you

#

tl;dr.: Just bite through it ~ it's worth it.

shut abyss
#

is it better to have the sensitivity threshold higher or lower?

proper shale
modest abyss
proper shale
#

if it doesnt cut away your words

#

thats fine

shut abyss
#

I have it at max right now

glacial pollen
shut abyss
#

but sometimes it will pick up noises of me like sighing etc

glacial pollen
#

if it's not 10-15 or 20 minutes, you won't wait weeks lol

#

chille

proper shale
modest abyss
glacial pollen
#

fp32 for me on rtx 3060 ( 12 gig ) - 10 min set - 16 batch size takes 4-6 mins

#

(( but then, I am using cuda memory fallback so ))

#

But still, you're fine

modest abyss
#

Alright (the reason I say about a week is because I did the math and it would take at minimum 4 whole days to process the model at its current rate)

rare gobletBOT
#

Ayo? @modest abyss level 3 !!! lfg

glacial pollen
#

right?

modest abyss
#

I wasn't making that decision blindly, that's the settings I used back then when, like I said, it was as simple as just throwing an audio file at it and having a pretty damn good model in an hour

glacial pollen
#

Then it is blindly " calculated "

#

unless you're doing an exact same dataset, same parameters

shut abyss
#

is there any optimization for response time i can do while maintaining quality

glacial pollen
#

Each dataset is unique, each model is unique, each needs different hparams, each has own training curve and initialization
and each requires own tensorboard supervision

shut abyss
#

because its taking about 7000ms right now

modest abyss
#

It isn't the same exact dataset but it's the same length as most of my other datasets that only took an hour back then

glacial pollen
#

RVC or Machine learning is not linear

#

you can't train X1 with similar data length and expect X2 to follow the pattern

#

in fact, in rvc, 1 slight hiccup in initialization ( during model's initial stage of training ) or a taking away of few secs from the set can drastically change the outcome

#

But anyways, you do you.
All I can say, your time per epoch is fine for fp32 and your hardware

modest abyss
#

Maybe I'm just getting confused then because of those older methods which seemed quite linear (not on the inside I know, but on the outside it was simple to calculate how long it would take)