#✨│ai-help

1 messages · Page 38 of 1

sand iris
#

What

twin quiver
sand iris
stray lantern
#

i have duplicated space, i got the voice model i want, i put a mp3 as my track, but it still says error? (huggingspace)

sand iris
#

But you should have a .index file with your .pth model

#

you can keep it empty if there somehow isn't one

twin quiver
#

how do i listen

#

to my voice

stray lantern
#

it just says error

#

im stuck

#

i literally have everything

plush compass
#

How much training do you do on a 16-minute long dataset? (RVC2)

small wolf
#

how big is it?

#

Once I get my server working I have an idea I'd like to try... trying to train the base RVC model from scratch but adding my voice to the dataset

stray lantern
#

how do i know when its done?

#

even after that it says error

small wolf
#

¯_(ツ)_/¯

small wolf
rare gobletBOT
#

Ayo? @small wolf level 4 !!! lfg

small wolf
#

try using colab

small wolf
#

or maybe I'll just mess with my own multi-shot architecture ¯_(ツ)_/¯

tardy flint
#

this isnt for rvc but im trying to split the backround vocals from a song and it just wont work can someone please help me. pwetty pwease 🥺

mortal plinth
small wolf
#

How does RVC work, like how can it learn a target voice without any source voice as input?
surely at least the base model was trained with some kind of source voice input at some point?

#

like during training, what is it even using as a source voice to derrive loss from???

median surge
#

im a totally new to this whole rvc thing, but i can not get to the part where i can put my voice in, i just wanna make a voice model of my voice and make christmas covers, but everytime i download the rvc file from github, there is no go-web.bat file and if there is one(which there has been since i tried different ones) it says "server not found" and "press any key" and if i press a key it just ends the window, Please help i have windows computer, am i dumb or something like that? am i missing something?

proper shale
#

You should've downloaded it from Huggingface and not GitHub

#

Therefore you only got the code and not the usable application

modern oxide
proper shale
#

^

small wolf
modern oxide
small wolf
#

(Yes I've tried pitch shifting, all the advanced parameters, etc etc)

#

ugh, at this point I might just make my own model from scratch 🤣

modern oxide
#

hmm, can you send an example?

#

because i've made multiple voice models of my own voice, and they all sounded fine

small wolf
small wolf
#

it just sounds a lot worse for most models than it should

modern oxide
#

nah, all rvc's have the same bases

proper shale
#

yg

small wolf
#

yeahhhh :/

proper shale
#

yh

#

so the thing is, maybe the dataset wasn't that good?

modern oxide
small wolf
small wolf
modern oxide
#

oh, yeah that's problematic xd

small wolf
#

XD

modern oxide
#

to get a good inferencing you also need to provide an rvc with a good audio input

#

so it's better to buy a microphone and use that

median surge
rare gobletBOT
#

Ayo? @median surge level 1 !!! lfg

modern oxide
#
  • have a good volume settings, so that it won't pick up much noise background and will pick up your voice
small wolf
#

fair...
but what exactly makes sound like... "good"? (opposite of an audiophile here)

small wolf
modern oxide
#

and + you yourself should have a good voice with correct pronounciation of words, and have a good intonation control

small wolf
#

what's intonation control? (yes I tried googling it)

#

ngl I'm kinda leaning towards just mangling together two HuBERT models in an autoencoder config atm...

modern oxide
#

and it will take a lot of time

proper shale
small wolf
# modern oxide it won't help

I mean, it could, given matching audio samples from my mic and desired output it could probably overcome the noise

proper shale
#

Anything else just works

modern oxide
#

our russian community has already tried to implement a russian hubert base with pretrains for a month

#

still no good results

small wolf
#

:/

#

how big is their dataset atm?

#

also what are they training on?

modern oxide
#

it's not as easy as training a regular model

small wolf
#

obv lol

rare gobletBOT
#

Ayo? @small wolf level 5 !!! lfg

small wolf
#

finally

modern oxide
#

1300 hours of russian speech for hubert and around 300 for a pretrained model

#

but

#

yeah

#

you won't be able to make your own pretrain

#

or hubert

#

it's easier to just buy a microphone and practise your voice, because the program works completely fine

median surge
small wolf
#

ok... so maybe training HuBERT from scratch not good idea LOL

but surely there must be a way to finetune existing model with audio pairs?

modern oxide
small wolf
modern oxide
#

like any other machine learning stuff

small wolf
proper shale
#

download the second one if u wanna train

small wolf
#

except the RVC dataset consists only of the target voice, no input so to speek

proper shale
small wolf
#

lol

modern oxide
#

discriminator takes content from given data, shows it to generator, generator tries to fool the discriminator trying to generate stuff based on the dataset, it does that till discriminator believes it and then uppers it's expectations, and they do that till they reach some immovable point of learning

small wolf
#

makes me wonder if I could use
matching audio pairs

to calculate loss
[that's my endgame here it's just somehow training or finetuning using my voice as an input...]

modern oxide
#

part of the given dataset

small wolf
#

ok I see a problem here

#

if the dataset consists only of the target voice
then it's just learning to turn target voice into target voice

#

right?

#

then I assume it "just works" because of the base model, which was trained with the other source voices

proper shale
#

i think it uses part of the pretrain

#

would make a bit more sense that way

small wolf
#

either way I suppose my goal now would be to figure out how to change this... get it to calculate loss using some other source...

small wolf
modern oxide
#

uhh, no, it tries to generate something resembling that data, remember i told you that rvc is built on a hubert which is basically a mess of harmonies

small wolf
#

generate...
from nothing?

modern oxide
#

hubert is like a dough, the dataset is like a recipe to bake a pancake

small wolf
#

dataset is the pancake

small wolf
#

I feel like I've misunderstood something fundamental here...

proper shale
#

that'd make more sense

small wolf
#

ye if it's basically running inference with the audio samples from the VCTK datseet that would make more sense idek

#

traditionally GANs just input random data for training is this... also the case for RVC?

stray lantern
#

Hello, this TikToker makes his covers with RVC disconnect, what is rvc disconnect?

#

And anyone know how to use a polio

latent kettle
proper shale
stray lantern
#

Can you get models from there?

proper shale
#

You make your own models there.

proper shale
latent kettle
stray lantern
#

Cool!

latent kettle
#

To train your own models..You have to prepare a detaset!

stray lantern
#

Btw, I have made my first cover, and I feel like it could sound better, how do I make the voice stay as the same tone?

#

As in like the movie/show that it’s from?

#

I used replay btw

#

I might try applio

#

What do you guys use?

latent kettle
proper shale
#

yeah both replay and applio use rvc

proper shale
latent kettle
proper shale
#

just increase or decrease the pitch according to the voice you wanna replicate and the voice of the song

stray lantern
proper shale
#

yeah

#

there should be, at least

stray lantern
#

Alright

#

If it doesn’t have it I’ll try applio

stray lantern
proper shale
#

exactly

#

they all use the same base but are all different forks

#

of the same thing

latent kettle
stray lantern
proper shale
#

...yeah?

#

they're literally all the same under the hood

latent kettle
median surge
#

im so sorry if this is a dumb question but if i use rvc disconnected to make voice model what is a dataset?

stray lantern
stray lantern
latent kettle
proper shale
#

np

stray lantern
#

Just glad I’m not using one that lessens the quality

median surge
small wolf
#

idea:
synthetic dataset

latent kettle
willow raft
#

I'm trying to use model extraction from checkpoint processing but I'm getting an error code, does anyone know what the error is or how to fix?

  File "C:\Users\Public\Mangio-RVC-v23.7.0\train\process_ckpt.py", line 64, in extract_small_model
    ckpt = torch.load(path, map_location="cpu")
  File "C:\Users\Public\Mangio-RVC-v23.7.0\runtime\lib\site-packages\torch\serialization.py", line 791, in load
    with _open_file_like(f, 'rb') as opened_file:
  File "C:\Users\Public\Mangio-RVC-v23.7.0\runtime\lib\site-packages\torch\serialization.py", line 271, in _open_file_like
    return _open_file(name_or_buffer, mode)
  File "C:\Users\Public\Mangio-RVC-v23.7.0\runtime\lib\site-packages\torch\serialization.py", line 252, in __init__
    super().__init__(open(name, mode))
OSError: [Errno 22] Invalid argument: '"C:\\Users\\Public\\Mangio-RVC-v23.7.0\\logs\\Depeche-Mode\\D_2333333.pth"'```
tardy flint
proper shale
#

not the usable pths

willow raft
#

where would i find the right file? the website only says "Model extraction (enter the path of the large file model under the ‘logs’ folder). " and i don't know what file it's refering to

proper shale
#

do you not have a pth named "depechemode.pth"

#

or something similar

willow raft
#

i don't

proper shale
#

oh no should use the one in weights

willow raft
#

ohh

#

i found it, thank you

proper shale
#

np

#

any other question?

tardy flint
willow raft
proper shale
#

np! good luck ;) hope the model's great

willow raft
#

actually sorry to bother again, it's still giving me the same error, i tried bothing the topmost file and the bottom one

proper shale
#

i have never used this

willow raft
#

checkpoint processing tab because i didn't finish doing all of the epochs

proper shale
#

but uh

willow raft
#

i think i'm supposed to have a .ckpt file somewhere

proper shale
#

you already have all of the models in between the epoch you stopped on

#

so I don't see why it's useful

willow raft
#

how do i turn them into a usable model? i don't have any index files

proper shale
#

the pths in weights are already usable models

#

u can go back in the train tab to uh

#

train the index

latent kettle
# median surge yeah?

It should be in .wav format
It should be of minimum 10 minutes
It should be clear with music and background noises!
There should not be any pause(s)

willow raft
# proper shale train the index

You have saved my life, I understand how all of this stuff works now 🙏 thanks for your patience, the model needs a bit more training but now I know where to get all the files I need and where they go

latent kettle
# median surge ok but how no pauses?

Pause meaning. When someone speaks or sings .. he/she may Stop for a couple of seconds. Which will increase your dataset length without any need. your Main motive should be to cover maximum data in a short length.

latent kettle
latent kettle
severe rapids
#

Is it normal to get triangles?

finite galleon
#

Perhaps your dataset has too many silences?

severe rapids
#

Maybe, I'll cut more silences from it then

#

Should I restart training from 0 or just continue it?

rare gobletBOT
#

Ayo? @severe rapids level 2 !!! lfg

proud elbow
# severe rapids Is it normal to get triangles?

probably your training session gets interrupted and resumed? mode collapses are supposed to be hard dip on <15 value, and silence aren't always only cause of collapses, it can be if dataset length is too short, etc.

severe rapids
#

there's like a 1-2 second silences between sentences

proud elbow
#

you can do noise gate & truncate silence using Audacity tho

severe rapids
#

Audio is clean, there's no noises so I don't think I need noise gate

severe rapids
# proud elbow probably your training session gets interrupted and resumed? mode collapses are ...

why would it get interrupted?

INFO:saturn:Train Epoch: 57 [0%]
INFO:saturn:[3080, 9.927757679628145e-05]
INFO:saturn:loss_disc=nan, loss_gen=nan, loss_fm=nan,loss_mel=21.914, loss_kl=7.541
INFO:saturn:====> Epoch: 57 [2023-11-30 15:21:35] | (0:01:20.316749)
INFO:saturn:Train Epoch: 58 [2%]
INFO:saturn:[3136, 9.926516709918191e-05]
INFO:saturn:loss_disc=nan, loss_gen=nan, loss_fm=nan,loss_mel=21.711, loss_kl=8.367
INFO:saturn:====> Epoch: 58 [2023-11-30 15:22:59] | (0:01:24.328599)
INFO:saturn:Train Epoch: 59 [4%]
INFO:saturn:[3192, 9.92527589532945e-05]
INFO:saturn:loss_disc=nan, loss_gen=nan, loss_fm=nan,loss_mel=21.252, loss_kl=8.694
INFO:saturn:====> Epoch: 59 [2023-11-30 15:24:24] | (0:01:24.975206)
INFO:saturn:Train Epoch: 60 [5%]
INFO:saturn:[3248, 9.924035235842533e-05]
INFO:saturn:loss_disc=nan, loss_gen=nan, loss_fm=nan,loss_mel=22.173, loss_kl=7.824

Console logs

severe rapids
proud elbow
severe rapids
#

mangio

proud elbow
#

I remember having all nan values issue when before upgrading GPU from 1660 super to 4070 and before updating the driver, but now it works completely fine

severe rapids
#

I have a 1060 lol

proud elbow
#

how about try original RVC or applio, the former uses a bit less VRAM for the same settings and batch size tho

severe rapids
#

Okay, i'll try

sand iris
#

Fix is turning off "half precision" or "fp16_run". Unsure if you'd be able to train on a 6GB card because of that though

#

@severe rapids

#

I don't recall how you're supposed to do it though

severe rapids
#

In stable diffusion I believe you use --no-half-precision or something similiar

proud elbow
severe rapids
#

Will probably be the same

sand iris
#

Any gtx 1xxx series cards so yes

sand iris
#

Or straight up edit all of them at once, but I can't check where the files are atm

severe rapids
#

I setted is_half to false, will try now

sand iris
#

I believe it'll be reset according to settings defined in other .json files

#

Also that might not even be the correct toggle but unsure

azure marshBOT
#

RVC Guides (How to Make AI Cover)

Documentation
🇺🇸 English (main)

Translation by country

🇧🇷 Brasil (PT-BR)
severe rapids
#

It's been working fine for the past 10 minutes, I guess I'll just start over with the new dataset and see how it goes

short linden
#

So how do you import models from weights.gg to huggingface?

proper shale
#

u can upload them to your drive and copy that share link (make sure everyone w link can see else it won't work) and it should work

severe rapids
#

nvm, it's at it again....

proper shale
severe rapids
#

yeah

rare gobletBOT
#

Ayo? @severe rapids level 3 !!! lfg

severe rapids
glad zealot
#

wtf

#

is that

short linden
proper shale
#

wait is it ur model?

short linden
proper shale
#

no need to upload it here then

short linden
proper shale
#

You could ask the model maker to upload it then

hot violet
#

why i do get this in RVC V2 Disconnected

severe rapids
# proper shale is this g/total? what the hell

Basically, I have a 1060 and 10xx series don't support half precision.
Just now I found a way to turn it off, but I had to sacrfice 3 batch size down to 1 (maybe 2 might work, I'll try now).
Now it looks like it works fine, yet instead of a minute, it takes 2 per epoch

hot violet
distant turtle
#

the program has been updated, it's the first time I've used it, I want to use it for TTS, but it's changed, does anyone know how to start it?

proper shale
distant turtle
proper shale
#
  • Applio is not a TTS - RVC needs sample audio to work
proper shale
distant turtle
proper shale
#

I guess you should try reinstalling Applio, delete the files on the tts folder and then try again

#

Might work now

distant turtle
#

I'll try, thank you very much!

proper shale
#

Np! ;)

severe rapids
#

@proper shale Is it suppose to flatline that soon? It's been training only for 1h, 34 epochs

gaunt ruin
#

does Ilaria still work?

mellow roost
#

are mac (silicone) users cursed with RVC? only trying to do inference as colab seems better for training then doing it local on mac.
52634615_730845017316044_3880792

#

is W-Okada the way to go in this case?

molten pecan
#

Why are you asking?

#

Having some trouble?

violet heron
proper shale
proper shale
#

depending on batch_size

mellow roost
# proper shale for inferencing, no.

hmm so there is a mac silicone version on the w-okada page. its not good/working?
i may just try it and see. if it works are there any drawbacks from using the webUI inference? maybe less tweakability?

proper shale
#

therefore, an inferencing fork would be better, like mangi

#

mangio

mellow roost
proper shale
mellow roost
#

no sir

severe rapids
#

How good is it for the first model?
How can I improve it?
Should include more of a different pitch in the dataset?

languid swift
#

im having error

rare gobletBOT
#

Ayo? @languid swift level 1 !!! lfg

languid swift
distant turtle
#

it gives me this error, with go-applio-manager.bat I activated the resolution of the problem but it remains the same, I even reinstalled everything and updated the app, does anyone know the problem?, I have amd, intel as the processor,

ancient sinew
#
[VCClient] waiting for the web server...200 http://127.0.0.1:18888 /
[VCClient] waiting for the web server...210 http://127.0.0.1:18888/
Backtracking (last call):
File "MMVCServerSIO.py ", line 258, in <module>
 File "subprocess.py ", line 1209, File in standby
mode "subprocess.py ", line 1506, in _wait
KeyboardInterrupt
[10884] The 'MMVCServerSIO' script could not be executed due to an unhandled exception!
Wrap the package filling [Y(yes)/N(no)]?```
help :(
rare gobletBOT
#

Ayo? @ancient sinew level 1 !!! lfg

plucky geyser
#

is it known whether or not setting is_half to false in the configs actually makes everything run in full precision or just the training?

#

training a model locally vs google colab on the same dataset to the same epochs, the locally trained model sounds worse and has a really thick accent

severe rapids
#

It might actually run inference in full precision and leave training as is

#

Just a guess tho

cobalt stag
#

Hello, is there a version of the voice changer for windows 11?

noble dawn
#

How do I not let rvc disconnect when training

#

A model

#

Cuz I don’t want it to disconnect when AFK

frozen sandal
#

any good and fast rvc colabs for inferences? i was trying to use Ilaria's but it takes too long to inference, like 10 minutes for a 2 minute inference?

noble dawn
#

Bet

noble dawn
#

Or before

low shard
noble dawn
#

Thank u 🙏

low shard
#

doesnt really matter when you do it, its an autoclicker in javascript lol

#

your welcome

low shard
noble dawn
#

Thank u

#

Wait did I do it ?

low shard
noble dawn
#

Do close it now ?

low shard
#

and then do enter

noble dawn
#

Oh lol 😂

#

Can u help me out rq?

#

Damn I think it’s to late

#

😭

low shard
#

did the model disconnect already?

noble dawn
#

Nah

#

It’s still going

low shard
#

then its not too late

noble dawn
#

Bet

#

How I do it ?

low shard
# noble dawn How I do it ?

Ctrl+ Shift + i to open inspector view . Then goto console and paste:
function ClickConnect(){
console.log("Working");
document.querySelector("colab-toolbar-button#connect").click()
}
setInterval(ClickConnect,60000)

#

after u pasted it js do enter

noble dawn
#

I think I did it

low shard
#

okay good then

noble dawn
low shard
#

good luck with your model bro

#

yea its working

noble dawn
#

I clicked enter

noble dawn
#

Hopefully this model comes out good

low shard
noble dawn
#

Fs Fs

rare gobletBOT
#

Ayo? @noble dawn level 3 !!! lfg

noble dawn
#

Gotchu

plucky geyser
limber steeple
#

hey guys how can i use a downloaded model in rvc?

radiant flare
#

pth in weights index in logs

#

attempting applio cli

Traceback (most recent call last):
  File "/home/default/Applio-RVC-Fork/infer-web.py", line 1686, in cli_navigation_loop
    execute_command(command)
  File "/home/default/Applio-RVC-Fork/infer-web.py", line 1667, in execute_command
    cli_infer(com)
  File "/home/default/Applio-RVC-Fork/infer-web.py", line 1377, in cli_infer
    split_audio = True if (com[16] == 1) else False
IndexError: list index out of range

You are currently in 'INFER':```
no idea what next
next oxide
#

hey does anyone know what this voice model is?

limber steeple
radiant flare
#

idk about that

#

use some other index?

boreal grotto
#

I cannot figure out if whether the roboticness is due to not enough training, or something in okada

limber steeple
radiant flare
radiant flare
#

or here

boreal grotto
#

lemme change that since idk if that link is good or not lol

radiant flare
#

it is

#

except for login bs 😄

boreal grotto
#

it didn't embed.

radiant flare
mellow roost
#

so i got magio running on a mac and am trying to infer but it sounds cursed af. its getting the pitch right but something is scuffed af. does this sound like a familiar artefact of any bug or wrong settings?

limber steeple
#

is it free

rare gobletBOT
#

Ayo? @limber steeple level 1 !!! lfg

mellow roost
#

trigger warning 💀 squidward bussing a nut

boreal grotto
#

or not

half mountain
#

GUYS

radiant flare
#

DOLLS

half mountain
#

DOES A TEXT TO SPEECH EXIST?

#

To anime voices

radiant flare
#

for many years it has

half mountain
#

I can do it on phone?

boreal grotto
half mountain
#

😭😭😭

limber steeple
boreal grotto
#

Most weights.gg models are actually from Hugging face

#

just posted on both sites

radiant flare
#

weights come in 2 varietys, wysiwyg and diy

half mountain
#

Elevenlabs @radiant flare

#

I can do it on phone

#

But i need for anime voices

#

In french

boreal grotto
#

did the a.... show up for me for everyone. or does it show my full thing to you

limber steeple
#

isnt it better just to train ur own model?

boreal grotto
#

mind you i am a male

limber steeple
#

hmm, doesnt sound good imo

#

I trained my own model and it sounded better

boreal grotto
#

You can hear the roboticness yes. and it would be best to do it yourself

#

my only problem. i don't know how lol

boreal grotto
#

do you have a sample of the one you tested? i just want to hear how different it sounds.

limber steeple
#

I just started out but yes sure. The quality depends on the quality of the input files as well as the settings you use

boreal grotto
#

I see

limber steeple
#

so

boreal grotto
limber steeple
#

i took this video and changed the voice of it to Vladimir from league of lgends

rare gobletBOT
#

Ayo? @limber steeple level 2 !!! lfg

limber steeple
#

here is the result

boreal grotto
#

nvm im dumb

limber steeple
#

it's pretty good tbh

boreal grotto
# limber steeple

I dont know why the first thing that came to mind from there was "DECEPTICON"

limber steeple
#

what do u think of the quality / resemblance?

boreal grotto
#

But now i hear the source it sounds good. it sounds a little higher pitch if that makes any sense

limber steeple
#

yeah

#

u must use good quality audio files to train the model for best results

boreal grotto
#

True. true

#

God damn nvidia. i get it. you want an update

rare gobletBOT
#

Ayo? @boreal grotto level 4 !!! lfg

boreal grotto
tranquil cliff
#

Hi

#

does anyone need assistance?

boreal grotto
#

yk how to add normal powershell?

limber steeple
boreal grotto
#

crap.

#

i dont know if installing this under my Administrator user profile while I use a default user profile will cause problems

radiant flare
#

attempting / fighting with applio cli
go infer
logs/weights/TomWaitsRV.pth assets/audios/vocals.wav wav logs/added_IVF558_Flat_nprobe_1.index 0 -2 harvest 160 3 0 1 0.95 0.33 True 8.0 1.2 1 0 50 1000
error of the moment
FileNotFoundError: [Errno 2] No such file or directory: 'assets\\rmvpe/rmvpe.pt'

honest veldt
violet heron
honest veldt
remote verge
#

@proper shale Any recommendations brother?

rare gobletBOT
#

Ayo? @honest veldt level 1 !!! lfg

boreal grotto
#

Now i have problems

#

Trying to just test out the removal of vocals

#

Keep getting an output error

fierce isle
#

what do i need to download on uvr5 to separte harmonies and background singers in vocals

rare gobletBOT
#

Ayo? @fierce isle level 1 !!! lfg

remote verge
proper shale
#

Mel and KL haven't gotten down in a while

#

200-220k steps is good I'd say

boreal grotto
#

Here is the output error. Any ideas?

rap god eminem clip for testkdqgole6.aac.reformatted.wav->Traceback (most recent call last):
  File "C:\Python310\lib\site-packages\librosa\core\audio.py", line 155, in load
    context = sf.SoundFile(path)
  File "C:\Python310\lib\site-packages\soundfile.py", line 658, in __init__
    self._file = self._open(file, mode_int, closefd)
  File "C:\Python310\lib\site-packages\soundfile.py", line 1216, in _open
    raise LibsndfileError(err, prefix="Error opening {0!r}: ".format(self.name))
soundfile.LibsndfileError: Error opening 'D:\\AI\\RVC\\Retrieval-based-Voice-Conversion-WebUI\\TEMP/rap god eminem clip for testkdqgole6.aac.reformatted.wav': System error.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "D:\AI\RVC\Retrieval-based-Voice-Conversion-WebUI\infer-web.py", line 374, in uvr
    pre_fun._path_audio_(
  File "D:\AI\RVC\Retrieval-based-Voice-Conversion-WebUI\infer_uvr5.py", line 64, in _path_audio_
    ) = librosa.core.load(  # 理论上librosa读取可能对某些音频有bug,应该上ffmpeg读取,但是太麻烦了弃坑
  File "C:\Python310\lib\site-packages\librosa\util\decorators.py", line 104, in inner_f
    return f(**kwargs)
  File "C:\Python310\lib\site-packages\librosa\core\audio.py", line 174, in load
    y, sr_native = __audioread_load(path, offset, duration, dtype)
  File "C:\Python310\lib\site-packages\librosa\core\audio.py", line 198, in __audioread_load
    with audioread.audio_open(path) as input_file:
  File "C:\Python310\lib\site-packages\audioread\__init__.py", line 127, in audio_open
    return BackendClass(path)
  File "C:\Python310\lib\site-packages\audioread\rawread.py", line 59, in __init__
    self._fh = open(filename, 'rb')
FileNotFoundError: [Errno 2] No such file or directory: 'D:\\AI\\RVC\\Retrieval-based-Voice-Conversion-WebUI\\TEMP/rap god eminem clip for testkdqgole6.aac.reformatted.wav'

(dont mind the song name its what i had on hand)

#

is it because i have spaces in my filename?

fierce isle
#

what do i need to download on uvr5 to separte harmonies and background singers in vocals ( if you cant get rid of them lmk aswell)

remote verge
proper shale
#

but i wouldnt say thst would solve it

boreal grotto
oak widget
#

What is the most straightforward way to make these AI voice models into like a song for example?

honest veldt
boreal grotto
#

Yep. tjat was it

violet heron
proper shale
proper shale
#

in the pins of the channel, the one posted by @glad zealot

oak widget
honest veldt
oak widget
rare gobletBOT
#

Ayo? @oak widget level 1 !!! lfg

violet heron
azure marshBOT
#

Suggestions for @honest veldt

All working colabs / spaces ☁️
Need some help? 🤔

You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.

proper shale
honest veldt
radiant flare
#

Applio cli / go infer :
FileNotFoundError: [Errno 2] No such file or directory: 'assets\rmvpe/rmvpe.pt'

rmvpe.pt file exists in assets/rmvpe/

#

no clue what to fix this

boreal grotto
#

bruh

#

i closed rvc

#

how do i reopen it

#

i forgor

eager vine
#

anyone can help me? "contains nan"

oak widget
#

how to i turn a hugginface zipfile to a link?

radiant flare
#

@daark how long is the sample?

#

something about updating a py and changing a bit of the python code?

#

¯_(ツ)_/¯

eager vine
radiant flare
#

idk if that is long enough 😄 i just know the posted issue says 6 minutes is too short

eager vine
eager vine
#

i use python 3.9 from microsoft store

radiant flare
#

¯_(ツ)_/¯

oak widget
radiant flare
#

how do you mean link?

radiant flare
#

so you need to upload it

#

or serve it

#

the zip is a model?

#

to be uploaded to some collection?

low shard
keen rivet
#

hi there ❤️ I was curious if any of you have found a program that works on Mac. My PC is busted atm, so when it's fixed I'll just use RVC, but I'd like to practice with something in the meantime.

eager vine
#

When I go to do my extractions, I end up detecting empty audio. giving the error "countains nan". Can someone help me?

#

I tested it on Mangio and Applio, both are giving the same error

radiant flare
#

did you try the code change?

mellow roost
#

reinstalled mangio 2 times (mac) and all outputs still have this weird disgusting sound 💀 its like the pitch is translated well but everything else not. what could be the issue, this is weird af since it seems to be running well?

eager vine
radiant flare
#

its what i would try

elfin bay
#

is there a guide for RVC-HF, by r3gf

#

to make models

violet heron
#

You have to use google colab

elfin bay
#

Ah

violet heron
#

-rvc view the guide for making a model with RVC Disconnected

azure marshBOT
#
Documentation
🇺🇸 English (main)

Translation by country

🇧🇷 Brasil (PT-BR)
azure marshBOT
#
All working colabs / spaces ☁️
Need some help? 🤔

You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.

violet heron
half cove
#

rvc on deez nuts

placid talon
#

i need a copy of the old applio notebook before it got deleted

#

and I need it asap from anyone

violet heron
#

Google will detect gradio and remove your runtime

placid talon
#

why not im not informed and this ngrok shit is pissing me off

glad zealot
#

There's a working version of applio colab but idk when they are gonna release it

violet heron
#

-colab

azure marshBOT
#
All working colabs / spaces ☁️
Need some help? 🤔

You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.

violet heron
#

Or use hinas

glad zealot
#

Colab is weird

placid talon
glad zealot
glad zealot
#

Found my applio colab

placid talon
# glad zealot Wanna try the kaggle version?

im a simple man that does not like to relearn something when i have feasible and simple things to use, local stuff is fine but never gives a good quality from when I used it, kraggle is so bloody confusing, i only really need to infrence as opposed to making models cuz i really made what i need ngl

rare gobletBOT
#

Ayo? @placid talon level 1 !!! lfg

placid talon
#

its just by prefrence

placid talon
#

im a visual guy not words

placid talon
#

when i tried it a few months back

#

i didnt grasp shit

glad zealot
placid talon
#

how do i do that

#

is there a guide?

glad zealot
#

Just go to the ngrok website and register, on the top left there should be "your auth token" just press that and copy the token it gives then put it on the part where it asks in colag

boreal grotto
#

What Gpus are compatible? I have a MSi Rtx 3050

boreal grotto
#

I was hoping i could have used my GPU because this thing is going to kill my CPU

#

and my AIO cant keep up

sand iris
#

It should be detecting it

boreal grotto
#

Im separating vocals rn

#

Im not training yet

#

ok the vocals separated but it passed some crap to console

#

nvm i know why

sand iris
#

pip install torch without telling it to fetch from the cuda releases ?

boreal grotto
#

opt/instrument_My no such file or directory

#

but it worked. And the reason for that is i gave it a sample that had a filename of My Video.mp3

#

anyway. How can i get training to work if it refuses to take my GPU 0 as valid

#

For context I have a RTX 3050

eager vine
#

what is the best python version for the applio or mangio?

rare gobletBOT
#

Ayo? @eager vine level 2 !!! lfg

short thorn
#

anyone know why my volume is showing up as 0.0000? im using the correct imput device.

short thorn
violet heron
short thorn
#

oh i fixed it

#

thanks

boreal grotto
#

I thought that a 3050 which is still fairly recent would be supported

boreal grotto
#

How come it tells me unsupported

#

And swaps to CPU upon startup

violet heron
boreal grotto
#

Is there a way to fix it?

boreal grotto
violet heron
#

Should work then?

#

Not really sure, wait till someone else comes

boreal grotto
#

As soon as I go to train and select "0" as the GPU. It says unsupported

violet heron
boreal grotto
#

Thanks for trying to help though. Maybe it's due to complications where I have RVC installed under a Data drive and not the C drive.

boreal grotto
#

Although process explorer doesn't recognize a GPU at all

violet heron
#

I don’t know why it’s not working

boreal grotto
#

Nope. It just says I supported and launches in CPU mode. And the site says it is unsupported

rare gobletBOT
#

Ayo? @boreal grotto level 5 !!! lfg

boreal grotto
#

Maybe I should just try installing it again?

violet heron
boreal grotto
#

There was something that I got recommended to downloading. nvcNN

#

I didn't since I'm not part of that program to be able to install it

#

Even though I'm part of Nvidia developers

crude siren
#

Hi I need some help with appilo. I keep getting this error when trying to make a model

#

t like...shows that little "Error" oval in the window where the output should be.

boreal grotto
#

Can I ask though. Do your sample voice files have spaces in them?

rare gobletBOT
#

Ayo? @crude siren level 2 !!! lfg

boreal grotto
#

Maybe try that and relaunch.

crude siren
#

kk we'll give it a shot

#

Okay I got rid of the spaces, but now the Train process is throwing a TypeError, and the ouput box on the site just says it finished training.

#

And I don't know where train.log is.

boreal grotto
#

Is there a folder called training where you installed RVC?

#

I don't have applio if I remember

crude siren
#

checking

#

The Applio-RVC-Fork folder that was installed contains a folder called "train", but not one called "training."

boreal grotto
#

Check that folder

crude siren
#

k

#

I don't think there is anything useful in here.

#

Unless there's something in one of these files that gives more information on what the error was about.

raw bear
#

i can't find it anywhere !

crude siren
#

@boreal grotto hello?

boreal grotto
#

I don't know much about Applio to be able to help more. It could be in the models folder or something. I would just use File explorer to search the entire Applio-RVC-Fork folder for a file called train.log

brittle wing
#

Hey guys. I need some help here. I want to get the model maker role so I can get access to submit my model on voice models. But something is wrong. When I put in my audio file in the demo part, It does this. It’s not working for some reason. Can I please have help?

violet heron
glad zealot
violet heron
novel wadi
#

@violet heron heres what im working with

#

ive uploaded like 5 models and they all sound like complete robots

violet heron
#

Change crepe to RVMPE

novel wadi
#

is this the worse program

violet heron
#

RVC is for covers

novel wadi
#

ah gotcha

#

ok i switched it to rmvpe

#

anything else i should change?

violet heron
#

Note really

raw bear
raw bear
eager vine
#

anyone can help me?

#

".wav-contains nan"

molten pecan
#

The space digit

eager vine
molten pecan
eager vine
molten pecan
#

After the word

#

Check if it's like "Test "

timber girder
#

Hey guys um

eager vine
molten pecan
eager vine
#

it's 30 minutes

molten pecan
#

Well that might be the problem

timber girder
#

Is there a Google Collab link which I can use to make a model for RVC?

rare gobletBOT
#

Ayo? @timber girder level 1 !!! lfg

molten pecan
#

I would trim it in various files of 5 minutes each

eager vine
#

oh

azure marshBOT
#

Suggestions for @timber girder

All working colabs / spaces ☁️
Need some help? 🤔

You can find more info on the #1159513888199540817 channel. If you can't find your answer, feel free to ask for help in #✨│ai-help. Credits to Faze Masta and Antasma for compiling these links.

molten pecan
#

RVC disconnected is for models I think

#

Good luck!

timber girder
#

oh ok

#

I just remember the old one made both models and covers, before it was banned
Thanks

molten pecan
#

You are welcome

eager vine
molten pecan
#

Try using another one

#

If not then reboot the computer

#

I'll good to bed now

eager vine
eager vine
eager vine
violet heron
#

All my datasets are just 1 big audio clip

eager vine
sonic agate
#

how can i make models with pitch guidance disabled?

eager vine
#

Can anyone tell me about any RVC that is working free via colab today??

glad zealot
radiant flare
#

portersona, you have a great name

rare gobletBOT
#

Ayo? @radiant flare level 5 !!! lfg

glad zealot
sonic agate
glad zealot
eager vine
#

I've been trying to fix it for 3 days

glad zealot
glad zealot
sonic agate
glad zealot
#

That's probably on your dataset

sonic agate
glad zealot
#

You also removed the silence right?

sonic agate
glad zealot
#

How many epochs did you train it with and how long is the dataset

sonic agate
glad zealot
#

Weird no clue ngl

tame mica
#

hmm maybe overtrain?

sonic agate
rare gobletBOT
#

Ayo? @sonic agate level 2 !!! lfg

glad zealot
sonic agate
#

I tried 2 months ago but it wouldn't let me disable the pitch guidance

glad zealot
warm tulip
glad zealot
#

somehow

warm tulip
glad zealot
#

stuff

warm tulip
#

can u give me link for ur colab?

glad zealot
warm tulip
#

thanks

sage geyser
#

what f0 method do y’all prefer? harvest or crepe

#

for conversion

glad zealot
#

rmvpe

urban viper
#

any way to fix nan loss during training? ive had this randomly with certain datasets and it seems to cause the model's output audio to become horribly distorted

#

i assumed it might be due to the current version of mangio RVC fork being a bit buggy, bc it didnt happen when i still used the june 18 version

#

im using a gtx 1080

sand iris
#

Since gtx 1xxx cards have issues with it

#

not sure where though

urban viper
#

yeah i already did that and it still shows nan loss occasionally

rare gobletBOT
#

Ayo? @urban viper level 1 !!! lfg

sand iris
#

Honestly I think you are better off running on colab at this point

#

the performance loss from not being able to run fp16 on a card that's drastically slower than the tesla t4 is probably not worth bothering

urban viper
#

like i changed all the fp16_run to false in all of hte JSON files in the config folder, and it still does that

yeah ik the 1080 is kinda slow but i can live with that. it takes like 2-3 hours on average to train a model until the loss starts to crawl back up

#

planning to sell the card and get a 3060 12gb instead but that's still months away

glad zealot
#

google drive link doesnt work

#

needs to be a direct link to the download, like when you press it, it automatically downloads

delicate wharf
#

uploaded to huggingface and resolved

cinder roost
#

is it ok if i use audio sample with duration more than 3 hours? or just less but represent the 3 hours?

urban viper
#

IMO 15-60 mins is the sweet spot. i hardly have issues with audio files with that length.

urban viper
#

i actually only tested it with 3 epochs bc the dataset was so long that it took so many steps just for a single epoch to finish

cinder roost
#

is it true too much epoch cant do out of the sample?

rare gobletBOT
#

Ayo? @cinder roost level 1 !!! lfg

urban viper
#

it already hit 1755 steps on the 3rd epoch

#

if you overtrain it (i.e. too many epochs) then the model simply doesnt get any better. usually i just stop as soon as the loss goes back up (use tensorboard to check)

cinder roost
#

is there any sweet spot for steps?

urban viper
#

steps per epoch are determined by the audio duration. IIRC for 15 mins of audio you just have to do 200 epochs, 30 mins you can do 100, and so on

rare gobletBOT
#

Ayo? @urban viper level 2 !!! lfg

cinder roost
#

no wonder its better when i tried 60 epochs with 1hour+ than 500 epoch with 10mins sample

proud elbow
urban viper
#

i tend to just eyeball it really, i dont usually worry too much about overtraining bc if you save every like 10 epochs or so, and it starts to overtrain too much you can just fall back to the previous one before it overtrained and use that model

urban viper
distant turtle
#

does anyone know a TTS for local vrc?, I have this: Applio-RVC-Fork, but booting doesn't work, do you know another one? Thank you

proper shale
#

and for TTS just use ElevenLabs

distant turtle
proper shale
#

np

#

Got anymore questions?

distant turtle
#

no, I'm just trying to have some voices to dub a film I made myself, Lego Movie style, but my voice isn't beautiful and doesn't fit any model I cloned, so I try to use tts, thanks

proper shale
#

oh ok, got it

#

good luck on it then! :)

distant turtle
#

Thank you

cinder roost
urban viper
#

check pinned

verbal relic
#

how to use rvc nvidia?

proper shale
eager vine
meager terrace
#

Any idea why I get this?

glad zealot
meager terrace
glad zealot
#

imma try adding it soon ig

glad zealot
#

doesnt automatically download as a zip file

meager terrace
#

If not a google drive link?

glad zealot
meager terrace
#

never done that before

meager terrace
glad zealot
#

imma go try add google drive support

#

try

meager terrace
eager vine
#

anyone can help?

rare gobletBOT
#

Ayo? @eager vine level 3 !!! lfg

glad zealot
eager vine
meager terrace
glad zealot
#

@meager terrace can you send the google drive link? need to test

glad zealot
meager terrace
glad zealot
#

G_233333

meager terrace
#

?

glad zealot
#

no

#

it should be in the weights folder

#

this is what the files should look like

golden karma
#

whats the best model for removing background vocals in uvr?

glad zealot
#

vocal FT i think

meager terrace
glad zealot
#

look for a weights folder

meager terrace
golden karma
queen gust
#

Why is it so glitchy

ember crow
#

do you mean RVC

queen gust
#

How do you even use it

ember crow
#

w-okada?

queen gust
ember crow
#

the voice changer

queen gust
#

This thing

queen gust
#

When I move the tune

#

It takes like 20 secodns to load

ember crow
#

yeah i think that's w-okada

queen gust
ember crow
#

it's not very "realtime"

#

i tested it

queen gust
#

And this thing

#

Is like so bugged

#

I said something

#

And it came out as nothing

ember crow
#

wait for it

#

as i said

#

it's not very realtime

queen gust
half cove
boreal grotto
#

ANyone know why? (GPU is MSI RTX 3050)

brittle wing
#

rvc gui just opened steam?????? whuh

#

this one

boreal grotto
brittle wing
#

me

boreal grotto
#

How the absolute hell

brittle wing
#

I HAVE NO IDEA

#

either i like missclicked and steam opened

#

but like

proper shale
brittle wing
#

idk

boreal grotto
proper shale
#

don't use that

boreal grotto
#

Should i just start reinstalling with the script i did?

brittle wing
boreal grotto
#

like full reinstall

#

?

proper shale
boreal grotto
#

kk

boreal grotto
#

so Cuda was reinstalled. But what the hell is cuDNN

proper shale
#

I don't really know what that is, but I think you should

boreal grotto
#

NVIDIA CUDA Deep Neural Network (cuDNN) is a GPU-accelerated library of primitives for deep neural networks. It provides highly tuned implementations of routines arising frequently in DNN applications.

#

so its Somethign from Nvidia for Neural networks

#

Huh. access denied

#

nvm I just applied for DOCA and got it

proper shale
#

huh

#

oh

boreal grotto
#

But yeah. This honest is a Very straightforward process from the video I was recomended to

remote verge
#

yo @proper shale how much do you reckon batch size affects quality? For example like batch size 4 vs 20?

rare gobletBOT
#

Ayo? @remote verge level 5 !!! lfg

remote verge
#

im currently doing a 15min hq dataset with 20 batch size and getting like 9 seconds per epoch, so it is definitely a massive upgrade on speed, but not sure if this'll make the overall model turn out like shit

sand iris
#

might be borked

boreal grotto
#

Open powershell as admin

#

iex (irm rvc.tc.ht)

boreal grotto
#

any more ideas?

sand iris
#

I'm guessing cuda version mismatch ?

#

script downloads 11.7

#

You have 11.8

#

But damn that script looks good

#

Except for the run as admin part

boreal grotto
#

The script installs 11.8...

boreal grotto
sand iris
#

Which expects 11.7

#

I know how you could maybe fix it, requires modifying a few lines by hand

boreal grotto
#

I saw this in the FAQ and decided to open config.py...

rare gobletBOT
#

Ayo? @boreal grotto level 7 !!! lfg

boreal grotto
#

maybe that is my problem

sand iris
#

Nah it is not

boreal grotto
#

Im still a bit new to python but if I am reading that properly. That is searching for allocated GPUs with the numbers of 16?

#

or am i dumb

sand iris
#

It fails at the start which just attempts to see if there are any nvidia cards that it can support

#

It does that by first checking if CUDA is operational and then if there are any devices

#

If it fails either checks it'll print out "No supported nvidia gpu found"

#

the "16" check is for some other stuff. GTX 1XXX cards have issues if something isn't changed

boreal grotto
#

Ah that makes sense. Since in the video Im watching the person is using a 30 series card as well (RTX 3080 ti to be specific)

#

so should i just force an install for cuda 11.7 with npm?

#

want me to send it to you? it is partly in Chinese

sand iris
#

Sure

boreal grotto
#

actually wait here is the github

sand iris
#

Oh yeah

#

Alternatively, you can just get the releases from the project directly

#

without any third party scritps

#

Link is found under the "spaces" icon on the github repo, it has some of the files and also releases for people to download and run

boreal grotto
#

So that would be what I need?

#

and if I were to install stuff there. would it cause problems?

boreal grotto
sand iris
boreal grotto
#

4.8gb

proper shale
#

but if g/total has a big drop its okay ig

sand iris
#

The zip contains the code, as well as all the python libraries, and AI model files

proper shale
#

I'd recommend using lower though

boreal grotto
#

nvm. 14.1gb lol

#

but again. Would it work if i tried just installing cuda 11.7?

sand iris
#

Yes but I would suggest not doing so

#

Since it's no longer supported

boreal grotto
boreal grotto
#

Im assuming this is where you saw it and what you were talking about?

sand iris
#

That's something also different

#

That's the code related to the displayed GPU list in the UI

#

It does some weird stuff to see if it thinks a GPU is supported, and if it decides it isn't, it won't display it. But the GPU can still be used, it's just that it refuses to display it in the UI

boreal grotto
#

Ah. but i believve my issue is the problem where it refuses to even launch and run with a gpu

sand iris
#

Yeah

boreal grotto
#

upon startup it says that no GPU is available

sand iris
#

Even on the provided archive ?

#

Or on the same script as before ?

boreal grotto
#

Same script. I havent tried the other oen yet as it is still downloading

sand iris
#

I believe the script has to be updated

#

Hadn't even heard of it before

eager vine
#

Can anyone send me a complete step-by-step tutorial?

boreal grotto
#

I hate my "1gbps" cable speeds when i only get 10mbps from it

copper pebble
eager vine
copper pebble
boreal grotto
#

my router gets 400 mbps. and my modem returns 1200 mbps. and i get 10mbps over supposed "wifi 6" wireless

boreal grotto
#

as it was recommended to me

#

I just opened the requirements text file

#
joblib>=1.1.0
numba==0.56.4
numpy==1.23.5
scipy==1.9.3
librosa==0.9.1
llvmlite==0.39.0
fairseq==0.12.2
faiss-cpu==1.7.3
gradio==3.14.0
Cython
pydub>=0.25.1
soundfile>=0.12.1
ffmpeg-python>=0.2.0
tensorboardX
Jinja2>=3.1.2
json5
Markdown
matplotlib>=3.7.0
matplotlib-inline>=0.1.3
praat-parselmouth>=0.4.2
Pillow>=9.1.1
resampy>=0.4.2
scikit-learn
starlette>=0.25.0
tensorboard
tqdm>=4.63.1
tornado>=6.1
Werkzeug>=2.2.3
uc-micro-py>=1.0.1
sympy>=1.11.1
tabulate>=0.8.10
PyYAML>=6.0
pyasn1>=0.4.8
pyasn1-modules>=0.2.8
fsspec>=2022.11.0
absl-py>=1.2.0
audioread
uvicorn>=0.21.1
colorama>=0.4.5
pyworld>=0.3.2
httpx==0.23.0
#onnxruntime-gpu
torchcrepe==0.0.20
#

huh

fluid horizon
#

where do you see the fusion model in the applio UI? or any collab link? i want to try it out 😄

boreal grotto
sand iris
#

Unless you really want to fix the script

fluid horizon
rare gobletBOT
#

Ayo? @fluid horizon level 2 !!! lfg

boreal grotto
sand iris
#

It does

#

it's the exact same software

boreal grotto
#

alr. time to uninstall and reinstall lol

proper shale
sand iris
#

It's just that you are using the developers' own release rather than a script someone made

boreal grotto
#

Yeah. so bugs are apparent if it is actively being commited on right

#

Bruh. im going from my C drive to my Data drive. an M.2 to a SSD

cinder roost
#

which better?

#

theres no crepe

boreal grotto
#

rmvpe

cinder roost
#

without GPU?

boreal grotto
#

Yeah. I dont know what the rmvpe_gpu is but what I assume it is, is that it will run across multiple gpus instead of one

cinder roost
#

ok lets try

boreal grotto
#

Any pointers?

#

ima keep going

sand iris
#

huh

#

I guess just ignore it

#

Weirdly I have never seen that error

boreal grotto
#

My god

#

i just got like 40 of those messages for all different files

#

mainly .pyd files

sand iris
#

Could it be that the windows archiver for 7z is broken ?

boreal grotto
#

I dont even know when I installed 7zip

sand iris
#

Yeah that's the default windows support for it then

#

It's fairly new, it might be broken

boreal grotto
#

Oh wait. I think im unzipping through windows and not 7zip

#

yeah I wasn't I was using default windows file explorer

#

Abort and retry with 7zip?

sand iris
#

Yeah

boreal grotto
#

Def broken

#

it decides to say a .dll is unsupported

sand iris
#

Is this 7zip or still windows explorer ?

boreal grotto
#

windows

#

Open archive or Extract files.

#

I have never used 7zip before

sand iris
#

extract files

fluid horizon
vast magnet
#

hey i have a rvc voice model, how can i make my voice recording sound like it??

brittle wing
#

which pitch algorithim sounds the best?

half cove
#

rmvpe

languid swift
#

@molten pecan

boreal grotto
molten pecan
#

-egirl

azure marshBOT
# molten pecan -egirl
Searching for the perfect RVC e-girl voice model?

Look no further – we've got you covered! Our RVC e-girl voicemodel is in high demand, and we've streamlined the process to make it effortlessly accessible for you.

boreal grotto
#

And also how do I run the above link i sent for that distro