#✨│ai-help

1 messages · Page 170 of 1

brittle wing
#

nvm its not working

#

can anyone help?

brittle wing
#

can anyone help me with delay issue? idk if its the matter of microphone itself or settings

teal dome
#

when i try using fcpe in applio, i now always get a “FCPEF0Predictor” error saying “unexpected argument ‘sample_rate’” which wasn’t happening before. is something wrong with applio or is it me? the other algos work fine

rare gobletBOT
#

Ayo? @teal dome level 7 !!! lfg

brittle wing
#

-sovits

#

me irl

low shard
pastel oak
rare gobletBOT
#

Ayo? @brittle wing level 3 !!! lfg

deep oasis
#

can someone teach me how to generate an AI kpop voice? i’ve been trying a lot but I failed 🥹

low shard
#

If you got a good pc use Mainline or Applio, else use the Google Colabs, or for a better free cloud experience but harder read the Mainline Kaggle Guide

void dome
#

guy i need help

#

voiche chger dont work

#

he makes sounds but not what i say

pastel oak
proven hill
brittle wing
#

doesnt let me post a picture but ineed help

#

getting this error

#

content/voice-changer/server

ModuleNotFoundError Traceback (most recent call last)
<ipython-input-6-c4b2eef38e98> in <cell line: 24>()
22 get_ipython().run_line_magic('cd', '/content/voice-changer/server')
23
---> 24 from pyngrok import conf, ngrok
25 MyConfig = conf.PyngrokConfig()
26 MyConfig.auth_token = Token

ModuleNotFoundError: No module named 'pyngrok'


NOTE: If your import is failing due to a missing package, you can
manually install dependencies using either !pip or !apt.

To view examples of installing some common dependencies, click the
"Open Examples" button below.

brittle wing
low shard
azure marshBOT
# low shard -rt

This interaction has expired, use the command /guides realtime if you wish to see it again.

unborn sigil
#

should I keep training or stop?

glad zealot
#

looks good, you can save that model but still keep training if it goes down more

#

but it does already look good tho

unborn sigil
#

to keep training it, I would set all the setting in applio again, then jsut press begin training right?

#

no need to do other buttons

proven hill
#

yes

glad zealot
#

yup, just make sure its the original logs file, name, settings

unborn sigil
#

and like what step in the graph should I use to inference with?

#

or weight

#

i'm thinking right around here

glad zealot
#

try the latest one

unborn sigil
#

oh ok

glad zealot
#

since thats seems to be your lowest point

unborn sigil
#

the guide says not to do that, but that sounds like a better idea

#

oh! i read it wrong. Thanks

brittle wing
#

do i upload the python file into model or index>

proven hill
#

what

glad zealot
#

???

brittle wing
#

i cant send it here

proven hill
#

im confused ngl

unborn sigil
#

wow! all those hours of training worked out. I made a model from all my past vocal takes and it sounds decent. Thanks again y'all

glad zealot
proven hill
brittle wing
#

whys the voice uploading keep changing from 5% to 17% to 29% back down to 16%

rare gobletBOT
#

Ayo? @brittle wing level 1 !!! lfg

dry bane
#

does anyone know how to separate overlapping vocals from the acapella of one song? (say someone sings another lyric while the other person sings)

pastel oak
#

Theres some others to try, someone else might know more

#

Same applies on UVR

dry bane
#

got this, thank you!

brittle wing
#

i dont understand how to use the voice on discord or something

odd shale
pastel oak
# brittle wing i dont understand how to use the voice on discord or something

download Virtual Cable, explained on the guide step 6

https://rentry.co/VoiceChangerGuide

Afterwards you do this on the screenshot

languid grove
#

hello all, im new to this discord and would lke to ask how to instal the voice files on this server into so-vits-svc-fork to get a real time voice changer, can anyone lend a hand?

pastel oak
rare gobletBOT
#

Ayo? @pastel oak level 55 !!! lfg

pastel oak
#

only rvc models

languid grove
#

oh ok thank you, ill look into that right now

unborn eagle
#

Why I have this problem?

Ignoring faiss-cpu: markers 'sys_platform == "darwin"' don't match your environment
Collecting numba==0.56.4 (from -r requirements.txt (line 1))
  Using cached numba-0.56.4.tar.gz (2.4 MB)
  Preparing metadata (setup.py) ... error
  error: subprocess-exited-with-error

  × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [8 lines of output]
      Traceback (most recent call last):
        File "<string>", line 2, in <module>
        File "<pip-setuptools-caller>", line 34, in <module>
        File "C:\Users\wojte\AppData\Local\Temp\pip-install-3yy19jri\numba_299a783593934f0f9abf5b4ef7faca8b\setup.py", line 51, in <module>
          _guard_py_ver()
        File "C:\Users\wojte\AppData\Local\Temp\pip-install-3yy19jri\numba_299a783593934f0f9abf5b4ef7faca8b\setup.py", line 48, in _guard_py_ver
          raise RuntimeError(msg.format(cur_py, min_py, max_py))
      RuntimeError: Cannot install on Python version 3.11.4; only versions >=3.7,<3.11 are supported.
      [end of output]

  note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed

× Encountered error while generating package metadata.
╰─> See above for output.

note: This is an issue with the package mentioned above, not pip.
hint: See above for details.

C:\Users\wojte\OneDrive\Dokumenty\RVC>```
fiery raptor
#

whats happening with applio

#

its been incredibly slow

odd shale
fiery raptor
#

thats exactly what ive been doing

#

usually it takes it several seconds to convert my audios

#

but now it can take so long that the colab stops

odd shale
fiery raptor
#

its never been this slow

odd shale
fiery raptor
#

I'll try

#

ive uploaded longer audios for shorter times

#

but ill try

odd shale
fiery raptor
#

its still slow

#

how weird

odd shale
fiery raptor
#

reminds me of the early ilaria huggingface rvc

#

those things would take hours to load

#

oh wait i think something's happening

knotty moth
knotty moth
proven hill
rare gobletBOT
#

Ayo? @unborn eagle level 1 !!! lfg

unborn eagle
fallow crown
#

@steel forge i need help with rvc

rare gobletBOT
#

Ayo? @fallow crown level 1 !!! lfg

regal temple
#

Does Mangio run on linux?

fallow crown
#

im having trouble

last cove
#

Every body i have one question

#

English or Spanish

#

Whoever type First is gay

violet heron
violet heron
pastel oak
pastel oak
odd shale
regal temple
rare gobletBOT
#

Ayo? @regal temple level 2 !!! lfg

unborn eagle
frozen monolith
#

pls help

frozen monolith
misty elk
#

-hf

azure marshBOT
finite pumice
#

how to fix this

unborn eagle
# pastel oak Yes

C:\Users\wojte\Downloads\RVC>python -m pip install -U pip setuptools wheel
Nie mo
Why?

last cove
unborn eagle
violet heron
violet heron
#

I think you should be able to download the version depending on your GPU and open the sh file inside

regal temple
frozen monolith
#

when i run rvc it crashes everytime i try to convert

timid valve
#

technical question because i'm curious.. how are accents stored in the index file? and how does it extract them so fast? it only takes seconds to train the index file whereas it can take several hours to train the main pth, depending on the size of the dataset

finite pumice
#

@acoustic scarab what should i put the vac in? input or output?

acoustic scarab
#

vac in goes to out, vac out goes to in

formal tartan
#

while i'm talking it's sounds like robot how can i fix this i want make it closest to girl voice

analog obsidian
#

index doesn't really store the accent of the model, the accent is in the model itself, what index does is to learn the characteristics of the audios, this can be how the speaker pronounces certain consonants or how the speaker talks, so for example if you have a raspy voice model, decreasing index will decrease the raspy voice a bit but not fully remove it

#

so thats why is faster

violet heron
timid valve
#

super interesting

violet heron
hot ledge
#

guys m having problem with applio its said omegaconf 2.0.6 has a non-standard dependency specifier

#

does someone know how to fix it

glad zealot
#

Depends on your dataset

analog obsidian
#

values are more stable and the model sticks more to the dataset, leading to having less versatility
batch size is not really a quality setting, but a stability setting

#

8 or 16 for 7-10 minutes and above
4 for 5 minutes and below

however like hina said, it depends on* your dataset

#

16 is more accurate to the dataset

austere tartan
#

can someone come to the vc i am in and help me a lil bit setting up the a voice mod

knotty moth
indigo crater
#

How do I prove that the model works?

odd shale
#

Whenever you finish training a model, for testing it you can place any kind of audio sample (clean ofc with no instrumental or noise) on the "audios" folder, and then you put the path to the audio you wanna test on the Inference tab

maiden remnant
#

how do i downlaod rvc

#

i ahve inidia

dense cape
#

does rvc work with amd?
cuz i tried two version(MMVCServerSIO_win_onnxdirectML-cuda_v.1.5.3.18a) and (vcclient_win_cuda_2.0.58-alpha) and they both said that i have an outdated cuda driver.

#

torch\nn\utils\weight_norm.py:28: UserWarning: torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.
warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.")
2024-08-08 04:31:50.6181048 [E:onnxruntime:Default, cuda_call.cc:116 onnxruntime::CudaCall] CUDA failure 35: CUDA driver version is insufficient for CUDA runtime version ; GPU=-1364851804 ; hostname=ASUS2 ; file=D:\a_work\1\s\onnxruntime\core\providers\cuda\cuda_execution_provider_info.cc ; line=125 ; expr=cudaGetDeviceCount(&num_devices);
*************** EP Error ***************
EP Error D:\a_work\1\s\onnxruntime\core\providers\cuda\cuda_execution_provider_info.cc:125 onnxruntime::CUDAExecutionProviderInfo::FromProviderOptions [ONNXRuntimeError] : 1 : FAIL : provider_options_utils.h:153 onnxruntime::ProviderOptionsParser::Parse Failed to parse provider option "device_id": CUDA failure 35: CUDA driver version is insufficient for CUDA runtime version ; GPU=-1364851804 ; hostname=ASUS2 ; file=D:\a_work\1\s\onnxruntime\core\providers\cuda\cuda_execution_provider_info.cc ; line=125 ; expr=cudaGetDeviceCount(&num_devices);
when using ['CUDAExecutionProvider']
Falling back to ['CUDAExecutionProvider', 'CPUExecutionProvider'] and retrying.

crude flame
dense cape
#

what is zluda?

crude flame
dense cape
#

does amd work w-okada?

#

i tried it and also cuda error

crude flame
dense cape
#

(RVC1006AMD_Intel) i have this and t doesn't work

rare gobletBOT
#

Ayo? @dense cape level 1 !!! lfg

crude flame
dense cape
#

i have an rx 6600

regal temple
#

I am so sorry for being such a nub, what .sh file should I be running on linux in the mainline distribution in order to be able to run RVC?

crude flame
#

You only need zluda if you want to train a model

dense cape
#

oh ok thank you ❤️

proper mesa
#

i just downloaded the voice changer and i can hear myself fine but it wont work in game or anything like that and i am not sure how to get it to work can anyone help me?

dense cape
#

thx it worked

regal temple
#

I'm getting this error when trying to run RVC in linux:

run.sh: line 54: ./tools/dlmodels.sh: Permission denied

What could be causing this?

violet heron
azure marshBOT
merry eagle
#

how to fix this error
when i press download button i am getting this

solar thunder
#

Help me to dowload the whole thing
i have a macbook
im so confused

solar thunder
#

i dont even know how to start it

#

or where

violet heron
#

Like let’s say you went a restaurant, and you ask for food

#

They don’t know what food you want

solar thunder
#

ai

rare gobletBOT
#

Ayo? @solar thunder level 1 !!! lfg

solar thunder
#

from the github

violet heron
solar thunder
#

but he has a window

#

and i have mac

#

and im lost

violet heron
azure marshBOT
hidden crow
#

on mac

#

it gives u windows

solar thunder
hidden crow
#

search up how to get bootcamp on mac

solar thunder
#

it didnt work

#

im gonna buy windows soon

hidden crow
#

u have to do alot of stuff though ts hard gng

solar thunder
#

so i just left it as it is

hidden crow
#

😭

violet heron
#

(Apple silicon is like M1/M2/M3)

solar thunder
#

it says

#

potential

#

damage

#

careful

violet heron
hidden crow
#

@violet heron whats download link for windows?

violet heron
hidden crow
solar thunder
#

imma just give up

violet heron
solar thunder
#

bye

hidden crow
solar thunder
#

thanks for trying to help

hidden crow
#

uhh lemme see

violet heron
# solar thunder bye

Mac users when they have to do something that isn’t dragging the file to the applications folder

solar thunder
#

i like windows a lot

violet heron
solar thunder
#

everything of it

violet heron
solar thunder
#

idk why i choose this

#

i could have gotten windows

violet heron
solar thunder
#

MY ADOBE STUFF COULD HAVE BEEN EASY THOSE SCHOOL STUFF

#

idk why

#

i bought

#

this damn mac

violet heron
solar thunder
#

i have m2

#

chip

hidden crow
#

@violet heron radeon pro 555x

rare gobletBOT
#

Ayo? @hidden crow level 1 !!! lfg

hidden crow
#

is my gpu

violet heron
#

Which means your underage

solar thunder
#

this

#

that was 2 years

#

ago

#

my memeory

#

si not memeorying

solar thunder
rare gobletBOT
#

Ayo? @solar thunder level 2 !!! lfg

solar thunder
#

if i was i would not even know these

violet heron
azure marshBOT
solar thunder
hidden crow
#

deadass

solar thunder
#

im not 8

hidden crow
#

deadass

solar thunder
#

im 13

#

with a short term memory loss

violet heron
hidden crow
#

alr bro

solar thunder
#

i have my passport

#

to verify

#

again

#

i been banned on this acc a lot

violet heron
solar thunder
#

all for the same stuff

#

but i verified it

#

with my passport

final sentinel
#

I have edited the audio in bandlab after extracting the audio with Uvr5 ui and changing the voice with ilaliarvc, but this time there is a definite discrepancy, what should I do?

knotty moth
final sentinel
#

When I put two audio data together in bandlab(daw), one with vocals only and the other without vocals, there is definitely a discrepancy between the two audio files.

#

It looks like this.

knotty moth
# final sentinel It looks like this.

misaligned tempo, right? or you should probably do proper mixing
テンポがずれてるよね?あるいは適切なミキシングを行う必要があるでしょう

pastel oak
pastel oak
# frozen monolith

Set (MME) at the end for input and output, you selected two different types

#

any other stuff on rvc realtime check here

https://rentry.co/RVCRealtimeGuide

brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
rare gobletBOT
#

Ayo? @brittle wing level 2 !!! lfg

opaque spoke
#

how do i start training models?

rare gobletBOT
#

Ayo? @opaque spoke level 1 !!! lfg

pastel oak
#

If you need cloud option, check pinned message and open Ilaria RVC the one for training, thats also a good one to use for training

opaque spoke
#

ok thanks

#

im using nvidia

#

holy that download is huge

#

how long do you guys think it will take to make a model?

knotty moth
opaque spoke
#

how to do it

knotty moth
opaque spoke
knotty moth
opaque spoke
#

how long does it normally take??

rare gobletBOT
#

Ayo? @opaque spoke level 2 !!! lfg

opaque spoke
#

i was just wondering how much time I would need to dedicate for this

knotty moth
pastel oak
#

Learning to clean a dataset is more important which the basics are explained and you can use right away
Training you do with a few clicks, the actual training process you monitor over with Tensorboard which is explained there too. training can take a few hours

#

You dont have to do anything while it trains

main tinsel
#

Why my training go back to epoch 1????

qte14_t3 | epoch=10 | step=3800 | time=20:27:54 | training_speed=0:02:16 | lowest_value=15.088 (epoch 10 and step 3687)
Training has been successfully completed with 10 epoch, 3800 steps and 34.271 loss gen.
Lowest generator loss: 15.088 at epoch 10, step 3687
Saved model 'C:\Users\dmg03\Downloads\Applio-main\logs\qte14_t3\qte14_t3_10e_3800s.pth' (epoch 10 and step 3800)
Successfully synchronized graphs!
Starting training...
Loaded pretrained (G) 'rvc\pretraineds\pretraineds_custom\G_KLM42_T4_40k.pth'
Loaded pretrained (D) 'rvc\pretraineds\pretraineds_custom\D_KLM42_T4_40k.pth'
0%| | 0/380 [00:00<?, ?it/s]C:\Users\dmg03\Downloads\Applio-main\env\lib\site-packages\torch\autograd_init_.py:251: UserWarning: Grad strides do not match bucket view strides. This may indicate grad was not created according to the gradient layout contract, or that the param's strides changed since DDP was constructed. This is not an error, but may impair performance.
grad.sizes() = [64, 1, 4], strides() = [4, 1, 1]
bucket_view.sizes() = [64, 1, 4], strides() = [4, 4, 1] (Triggered internally at C:\actions-runner_work\pytorch\pytorch\builder\windows\pytorch\torch\csrc\distributed\c10d\reducer.cpp:334.)
Variable._execution_engine.run_backward( # Calls into the C++ engine to run the backward pass
qte14_t3 | epoch=1 | step=380 | time=20:30:29 | training_speed=0:02:13
qte14_t3 | epoch=2 | step=760 | time=20:32:38 | training_speed=0:02:08 | lowest_value=15.547 (epoch 2 and step 535)

This is what I reproduced. When I resume training, all D and G files will be deleted and go back to 1 epoch. crazy

molten marlin
#

Will the voice changer work on the rx?

#

570

pastel oak
#

download amd Version

molten marlin
#

with no delay?

pastel oak
#

with rx 570 maybe 1 second not sure

molten marlin
#

thanks

rare gobletBOT
#

Ayo? @molten marlin level 1 !!! lfg

molten marlin
#

I will try

brittle wing
#

the voices are so choppy and not clear at all for some reason

odd shale
knotty moth
brittle wing
rare gobletBOT
#

Ayo? @brittle wing level 1 !!! lfg

proper shale
#

@brittle wing send your okada settings

#

let us see what's up

odd shale
#

Show your settings so MJ can help you.

odd shale
proper shale
odd shale
proper shale
#

how have u been

proper shale
odd shale
#

Hiding like a turtle.

proper shale
#

i see

brittle wing
molten marlin
#

colab still working?

odd shale
molten marlin
#

can i have it

proper shale
brittle wing
#

everything else is fine?

proper shale
#

ig mess with chunk & extra to see what works best for you

#

and maybe use sup2 if u have BG noise

brittle wing
#

alr tq

jagged marsh
#

Hi how can I make the voice on w-okada sound like the samples? Any voice sounds nothing like the samples

pastel oak
jagged marsh
#

Yea I wanted to sound like Trevor Phillips lol

#

Or Obama, but the voice sounds like a white person

pastel oak
#

Maybe using index a bit can help

#

id guess Obama works fine since its mostly taken from speech

analog obsidian
#

Models not sounding like the original voice in realtime are because of multiple factors, but the main one is when your voice is too different from the original one, for example my voice is very high pitched and i talk very differently compared to the model i am using, this causing the model sounding weird despite being well trained

#

I noticed increasing index to 0.5 ish forces the model to pronounce consonants like the og voice and mimic more the way the original voice talks (still not perfect, the change is minimal, some models can do 0.7 index in realtime but not all),
but this can affect the model pronunciation (gets worse, it makes the model pronounce certain consonants like the original voice but can make even simple words have pronunciation issues) and sometimes even adds artifacts

#

So the best u can do for characters/known people is to actually mimic the way they speak
Sometimes the model may be undertrained, sadly u can’t fix that if u didn’t made the model

pastel oak
#

i love you lyery

abstract vortex
#

Hi guys, how long does this process usually take? Making an Ai cover btw

rare gobletBOT
#

Ayo? @abstract vortex level 1 !!! lfg

odd shale
odd shale
#

Pth file's will always only have 50+ MBs of size

#

But the .index size will always depend.

abstract vortex
#

Ohh okay, thank you! I havent made ai covers in a year and I was so used to using the easy gui haha everythings so different now

brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

pale rock
#

INSTALL_Mangio-RVC-v23.7.0_INFER where does this install what location?

brave garnetBOT
pastel oak
brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

misty elk
#

-kaggle

azure marshBOT
misty elk
#

-overtraining

azure marshBOT
# misty elk -overtraining
Overtraining

You can detect if a model is overtraining if the TensorBoard graph starts to rise and never comes back down. An overtrained model will sound robotic, muffled, and won't be able to articulate words well.

Check these resources to learn more about this topic

abstract vortex
#

here r my settings thingy

brittle wing
#

Cannot connect to GPU backend
You cannot currently connect to a GPU due to usage limits in Colab. Learn more
To get more access to GPUs, consider purchasing Colab compute units with Pay As You Go.

#

any way to fix without paying

golden karma
#

/guides uvr

#

-uvr

azure marshBOT
# golden karma -uvr
Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tool.

pastel oak
brittle wing
#

what does rvc even mean

pastel oak
#

it does not mean real voice changer

brittle wing
#

hm

#

that’s enough for me tbh

golden karma
#

how to separate two voices in uvr?

rare gobletBOT
#

Ayo? @golden karma level 2 !!! lfg

pastel oak
golden karma
#

okay thanks

radiant vortex
#

Please wait

#

run with the runtime pythong

#

Or something

#

What the hell do I do=

pastel oak
#

What

stiff helm
#

How can I get RVC inference running locally on my Mac? I was using the Applio WebUI, but now I want to run it locally, without a WebUI. I want to be able to run it in my VSCode terminal. I just need to run inference, and be able to adjust the settings. I am familiar with Docker if I need to use that.

sharp solstice
#

-rvc

azure marshBOT
rare gobletBOT
#

Ayo? @sharp solstice level 1 !!! lfg

stiff helm
stiff helm
#

I want to be able to run it in VScode, without the webui

violet heron
regal temple
#

w/ RVC AIO is it possible to train more than 1000 epochs?

stiff helm
#

But I can check ine of the RVC repos

vivid pewter
#

-rvc

azure marshBOT
rare gobletBOT
#

Ayo? @vivid pewter level 1 !!! lfg

vivid pewter
#

no module named gradio

#

why isnt it working

timid valve
#

stereo or mono? does it even make any difference?

analog obsidian
timid valve
#

ah alright

stiff helm
stiff helm
vivid pewter
#

-uvr

azure marshBOT
# vivid pewter -uvr
Ultimate Vocal Remover

One of the best free and open source vocal and instrumental isolation tool.

analog obsidian
#

@glad zealot this happen when trying to use mainline colab cryingmyreefer

opaque spoke
#

is there a guide on how you should merge models and stuff

glad zealot
#

werd i just started it and it works no problem

analog obsidian
opaque spoke
#

uh i have no clue what theyre saying lol

#

I'm like new and I was trying to see how I could make a realistic model

open ivy
#

need help with RVC, downloaded it and it shows me 6 options and wont show me any of the default voice models

rare gobletBOT
#

Ayo? @stiff helm level 8 !!! lfg

analog obsidian
thin stump
#

322 was best g/total and best mel/kl 442 epoch

golden karma
#

what's better crepe or rmvpe

thin stump
rare gobletBOT
#

Ayo? @thin stump level 35 !!! lfg

analog obsidian
golden karma
#

realtime conversion

analog obsidian
thin stump
opaque spoke
opaque spoke
thin stump
opaque spoke
thin stump
#

first of all get started on your model first before you worry about tensorboards

thin stump
golden karma
#

what's the best model in uvr for separating harmonies?

opaque spoke
thin stump
#

Again, it's a clarity boost for the overall model. But it wouldn't matter much if your dataset is noisy so please read the guides

analog obsidian
opaque spoke
thin stump
analog obsidian
opaque spoke
analog obsidian
#

it will help it understand the way u speak more

#

but has the risks of sounding worse actually

opaque spoke
#

or is it not possible?

analog obsidian
#

in realtime mic quality affects how clear your model sounds

#

and locally (rvc) the audio of the inference has to be clear as well

#

if your model doens't sound clear despite having both a good mic or a good audio, is a model problem

thin stump
opaque spoke
rare gobletBOT
#

Ayo? @opaque spoke level 3 !!! lfg

opaque spoke
#

oh ok

#

hmmm

analog obsidian
#

basically for realtime you need a clear mic, low amount of background noise, and good volume mic

opaque spoke
#

do you have any reccomendations?

analog obsidian
#

if its sounds clear there but not clear in realtime, is the mic

analog obsidian
#

follow the tutorial

opaque spoke
#

ok

#

thanks

analog obsidian
#

if your pc can't run the software, you can use a cloud solution (colab, kaggle)

#

but it should work
edit: btw i don't recommend training rvc models in a laptop, it can potentially overheat them, but inference should be fine (just converting audio files)

opaque spoke
analog obsidian
#

for training? yes

opaque spoke
#

would that be enough?

analog obsidian
#

training is very intensive for laptops

opaque spoke
#

blow it with a fan???

#

or what

analog obsidian
#

idk

#

i just told u about that in case u notice high temperatures while training in your laptop

opaque spoke
#

like do i stop it if it gets too high or smth

analog obsidian
opaque spoke
#

can you have pauses in between training your model??

analog obsidian
opaque spoke
#

oh ok

vestal cradle
#

I'm trying to integrate rvc into a python script for tts I see this https://github.com/blaisewf/rvc-cli and it seems to be what I need but I can not seem to get it to work in either Linux or Windows. I want it to do the tts and inference from inside of a python script and generate and output.wav does anyone have any experience with this or an implementation of RVC that can do this?

dreamy berry
#

how do i run beatrice v2? i downloaded the zip and ngl idk what to do next

hushed steppe
#

Can someone explain this

#

I've tried it in mp3 and wav it won't work
Even putting the name in lower case then reuploading it

#

Tag me if anything

round quartz
#

I'm currently using very high quality datasets to train models and i am wondering which pretrains would be the best to use? just the standard RVC V2?

thin stump
#

AIhub were all about pretrains just for the clarity boost and values during training. So the pendulum has switched around because at the end of the day, Hifigan is crap

round quartz
rare gobletBOT
#

Ayo? @round quartz level 1 !!! lfg

vestal cradle
#

I've trained several convincing sounding models but i still haven't figured out how to read the tensorboard

thin stump
final sentinel
#

What is the setting for separating two or more hams in a hagginface uvr?

hidden zodiac
#

any fixes for voice cutting?

odd shale
odd shale
final sentinel
#

How do I set up Uvr5 ui to separate multiple hamori parts?

odd shale
#

You can try using the karaoke models.

#

Also, there's UVR online site which got Mel Karaoke.

final sentinel
#

メインボーカルを除く複数のバックボーカルがある設定を教えてくれませんか❓

odd shale
knotty moth
misty elk
#

-colab

azure marshBOT
# misty elk -colab
☁️ Google Colabs
tame wharf
#

hi

#

i have this error in google colab

#

can't connect to gpu backend

pseudo flint
#

best settings for uvr5

#

roformer models included beta

odd shale
#

You can solve it by just using an alt google account

tough fiber
#

~paperspace

#

-paperspace

azure marshBOT
odd shale
rare gobletBOT
#

Ayo? @tame wharf level 4 !!! lfg

tough fiber
wispy lodge
#

That I dunno, I don't use paperspace

tough fiber
tough fiber
odd shale
#

RVC is incompatible with Paperspace since months ago.

tough fiber
#

paperspace

#

they do refund or?

odd shale
tough fiber
#

-kaggle

azure marshBOT
odd shale
tough fiber
odd shale
tough fiber
#

making a model is annoying

#

ill try on my local

#

really

odd shale
tough fiber
#

i need place like paperspace

odd shale
#

But there are no up to date tutorials or notebooks for running RVC on runpod or vast.

tough fiber
#

-applio

azure marshBOT
tough fiber
#

@odd shale i want to do perfect voice model. i have 3 hours good data
How long should the data last for best efficiency?

proud pecan
#

Hey Guys, what does this error in the colab version mean??

odd shale
#

It's not mandatory to use various hours.

proud pecan
# proud pecan Hey Guys, what does this error in the colab version mean??

Building wheel for gin (setup.py) ... done
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
xgboost 2.1.1 requires nvidia-nccl-cu12; platform_system == "Linux" and platform_machine != "aarch64", which is not installed.
albucore 0.0.13 requires numpy<2,>=1.24.4, but you have numpy 1.23.5 which is incompatible.
albumentations 1.4.13 requires numpy>=1.24.4, but you have numpy 1.23.5 which is incompatible.
albumentations 1.4.13 requires pydantic>=2.7.0, but you have pydantic 1.10.17 which is incompatible.
chex 0.1.86 requires numpy>=1.24.1, but you have numpy 1.23.5 which is incompatible.
pandas-stubs 2.1.4.231227 requires numpy>=1.26.0; python_version < "3.13", but you have numpy 1.23.5 which is incompatible.
torchtext 0.18.0 requires torch>=2.3.0, but you have torch 2.0.1 which is incompatible.
torchvision 0.18.1+cu121 requires torch==2.3.1, but you have torch 2.0.1 which is incompatible.

Successfully installed all packages!

proud pecan
#

ok

pastel oak
#

-colab

azure marshBOT
# pastel oak -colab
☁️ Google Colabs
pastel oak
#

or check pinned message for ilaria stuff

#

they good

proud pecan
#

maybe i shoul up date my things too 😅

proud pecan
zinc kraken
#

Can someone help me I'm using local applio to train models. Does training take up a lot of space?

#

@odd shale ,

low shard
pastel oak
#

i was p sure that errorlog was aicovergen being broken

#

Guess i was wrong

tough fiber
#

which one should i choose

#

custom?

pastel oak
#

dont ask me

tough fiber
#

well okay

odd shale
#

Don't change it

odd shale
# pastel oak dont ask me

Shad, if someone makes you a question about which embedder to use, just say "don't alter it, keep it on contentvec"

pastel oak
odd shale
low shard
# tough fiber <@277821614345945089> yoo emoji do u know how can i setup applio or mangio on p...

You could try https://docs.google.com/document/d/1ooG2hJrfNNLUln0reTKKIOpNBjp53G0joak50H_sQhE/edit that is made by a user here, but no one really uses paperspace here anymore so i dont think anyon can help you if it goes wrong

#

made by @plush dew , but idk if they still work

plush dew
#

if anyone still uses it i'll update

upbeat cypress
#

is there a specific program that you can use to use RVC models on silly tavern. I just have been using alltalk but was curious if I could actually load the RVC models since I know the extras api is been depracated.

vestal cradle
#

Is anyone familiar with https://github.com/blaisewf/rvc-cli I'm building a voice assistant python script that reads the response from openais api. I got it to do inference after modifying a function call in the rvc_cli.py but I never managed to get it to do tts. Are there any options for RVC TTS that can be called without loading a GUI?

low shard
low shard
#

i thought silly tavern was mostly about llms and tts

upbeat cypress
#

Yeah your correct. looks like the alltalk beta might have some rvc conversion though.

plush dew
low shard
#

u still use paperspace?

upbeat cypress
#

checking it out now

inland cobalt
#

does anyone know good settings for a women? (ex: index rate: 9)

low shard
#

seems interesting

upbeat cypress
#

seems to work towards improving the xtts model if nothing else.

proven hill
proven hill
pastel oak
brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
proven hill
#

2 months and no need to fix or update kek

proud goblet
#

I installed RVC, but this is what I get:
**Launching Retrieval-based Voice Conversion WebUI...

C:\Users\pozde\anaconda3\envs\rvc\python.exe: can't open file 'C:\WINDOWS\system32\Retrieval-based-Voice-Conversion-W ebUI\infer-web.py': [Errno 2] No such file or directory

TroubleChute One-Line installer:

(rvc) PS C:\WINDOWS\system32\Retrieval-based-Voice-Conversion-WebUI>**
what should I do to be able to launch it?

rare gobletBOT
#

Ayo? @proud goblet level 2 !!! lfg

odd shale
#

(you)

proven hill
worthy tulip
#

sorry to ask but does anyone have the working rvc v2 disconnected colab the one i used to use is now offline

naive spire
#

Tiraram minha permissão de falar

rare gobletBOT
#

Ayo? @quiet cairn level 1 !!! lfg

timid valve
#

what training algorithm tolerates bad audio quality best?

#

i dont even want it to necessarily improve audio quality

#

i just want to know which one replicates it best

analog obsidian
timid valve
#

oh they don't? alright

#

do you advise against using custom pretrains?

#

aren't those just an extension of the OG pretrain?

analog obsidian
#

rmvpe is more robust (clear) and is more pitch accurate
mangio-crepe is less robust (less clear) and is slightly less accurate (by a very small amount, like 1% less), capturates more details than rmvpe, but is highly sensitive to noise

analog obsidian
#

original already does everything

timid valve
timid valve
#

oh alright

analog obsidian
#

but also can be any type of noise, mouse clicking, a sfx residual, everything

#

mangio is very sensitive to that

timid valve
#

i see

analog obsidian
#

rmvpe handles those better

proud anvil
#

Please drop the google colab that works

proud anvil
analog obsidian
#

unlike rmvpe which likes to make your model sound a bit metallic/robotic

timid valve
#

oh okay

#

that all makes so much more sense now

#

thank you for the fast responses btw, you are always such a big help

#

like genuinely, i mean it

analog obsidian
#

no problem, in simple terms, if you want to capturate every detail about your audio, use mangio, making sure no residual noise is present

#

noise sensitive means is gonna add that specific sound in the whole model no matter what

#

hifigan is very bad at cloning noise profile sadly

timid valve
#

got it 👍

rare gobletBOT
#

Ayo? @timid valve level 7 !!! lfg

timid valve
#

anyway, i used to be a pretty well-known modder in some other community and dealing with a lot of questions could really get on your nerves, especially when people weren't very specific about what they wanted to know or when there was a language barrier

#

so i really appreciate what you're doing

#

and your patience

#

what are the downsides to setting the hop length as low as possible? since that does make it more pitch accurate, allegedly?

proud anvil
#

Please drop Google colab, a neural network, to create covers that will work

azure marshBOT
# tame mica -hf

Suggestions for @proud anvil

<:huggingface:1179800228946268270> Hugginface Spaces
tame mica
#

second one

#

whar

proud anvil
timid valve
#

could it by any chance affect the quality of the model in a negative manner?

analog obsidian
timid valve
#

ahh

#

im going to train it on a hop-length of 32 for now since that's already half of what the default is

analog obsidian
#

setting the hop length below that is fine xD

#

however i remember below 64 you can't really hear the change

timid valve
#

oh okay

analog obsidian
#

is not bad to set it below 64
i even trained with a hop length of 8 just fine xD

timid valve
#

it's just finished inferring so ig it doesn't matter at this point

timid valve
#

that must've taken forever

analog obsidian
timid valve
#

ooohh

#

👍

analog obsidian
#

remember that crepe (mangio) is less pitch accurate than rmvpe, so is not very good at speech

knotty moth
analog obsidian
#

yea because is more pitch accurate

#

so less voice cracks and notes wrong

analog obsidian
#

also remember that inferencing a model in crepe is not going to add the benefits of crepe (removing the metallic sound of rmvpe)

#

to do that you have to train a mangio model and inference mangio

#

then you can notice the change

#

rmvpe models can inference crepe but ofc they perfom better inferencing rmvpe

#

they simply don't get the benefits of crepe

timid valve
#

so i should just use rmvpe when inferencing no matter what

analog obsidian
#

mangio models sound better inferencing mangio, not because is more pitch accurate for them but because the audio get more smooth for them

timid valve
#

that makes sense

knotty moth
analog obsidian
#

yes, always

#

speech models always have to use rmvpe, mangio perfoms poorly on speech

#

asmr voices, soft voices can hide that fact tho

timid valve
#

yea i've noticed a lot of pitch inconsistencies

#

completely unrelated bujt something i noticed is how much faster training is once i disable the webui by turning off my browser..like pretty much twice as fast than with the webUI running

#

im using firefox and it eats up so much ram

analog obsidian
timid valve
#

it shoouuuld be main line

#

but i'm not 100% certain

#

how do i know?

analog obsidian
timid valve
#

ah no main line hten

analog obsidian
#

mainline has some chinese text in the training tab

timid valve
#

yes

#

main line then

#

which one's better anyhow?

analog obsidian
timid valve
#

i'm on the right path for once then

#

yay

analog obsidian
#

yea is true that closing the webui can save some resources

timid valve
#

okay so i'm not just imagining things xd

#

good

#

why are so many models roughly the same size, even when trained on much bigger data sets? they all seem to be around 54MB for me

#

i once trained a refined model with twice the amount of data than the first model and the size of model stayed roughly the same

knotty moth
timid valve
#

oh so that's what mostly determines what size the model will have?

timid valve
#

yea that's where i noticed the highest discrepancies

timid valve
#

once again, thanks for all the help, i learn so much from you

proud anvil
#

Please drop a google colab that will work, a neural network

hybrid wagon
#

-rvc

azure marshBOT
ocean quail
#

I got Error
"'NoneType' object has no attribute 'setdefault on AiCoverGen

#

How do I fix it?

pastel oak
ocean quail
#

I have problems with that as well

pastel oak
#

What kind of problems?

#

What did you open (link), and whats the error

ocean quail
#

GPU crash out something like that

rare gobletBOT
#

Ayo? @ocean quail level 1 !!! lfg

final sentinel
#

なんかずっとやってもエラーばっかりなんですけどどうすればいいですか?

dim helm
#

here it says error

pastel oak
# dim helm

Maybe it lost connection? It works for me, so perhaps restart, and then scroll back to top when you convert and check the top right for the updates on connecting to gpu etc.

opaque spoke
#

im trying to train a model. but where do i find the 32k sample rate option??

pastel oak
#

is bugged

opaque spoke
#

what should i do?

opaque spoke
#

is there a recommended settings to train my voice model with? is there like a guide or something?

#

like what do i put here?

brittle wing
#

1

tame mica
opaque spoke
#

ok

opaque spoke
proud pecan
#

Hey Guys, does anyone know why cant I uplod models to the colab verion of the vc?

rare gobletBOT
#

Ayo? @proud pecan level 1 !!! lfg

wanton bane
#

there's Extraction type named rmvpe but i only got crepe and harvest in my RVC GUI
do i need to download rmvpe or need to download another version of rvc ?

proud pecan
proud pecan
opaque spoke
#

do you guys know what this error is?

rare gobletBOT
#

Ayo? @opaque spoke level 4 !!! lfg

opaque spoke
#

oh nvm

#

i am stupid

#

i didnt add extension

final sentinel
#

I keep trying and keep getting errors, what should I do?

opaque spoke
opaque spoke
#

how long does it normally take to train a model with a 13 minute dataset

knotty moth
proud pecan
#

My Model is failing to import does someone know why?

knotty moth
proud pecan
#

the Collab v2

knotty moth
#

what Collab v2

proud pecan
#

this one

knotty moth
#

if it were local okada, I would be able to answer

rare gobletBOT
#

Ayo? @knotty moth level 28 !!! lfg

proud pecan
#

what?

pastel oak
proud pecan
pastel oak
#

Can you link it instead of screenshot

rare gobletBOT
#

Ayo? @proud pecan level 2 !!! lfg

pastel oak
#

ah this is colab of the new alpha version

#

First time seeing it for me 😭

knotty moth
#

ah I see nails

proud pecan
pastel oak
#

the discord msg i linked links to that already

proud pecan
#

and the error is "ran out of input" btw

#

and how does this part work?

pastel oak
#

says so on the msg i linked

#

you upload models on the app when it opens

proud pecan
#

oh ok

#

thx again

final sentinel
#

I have exceeded the GPU utilization of the UVR5UI, what should I do? What can I do to avoid exceeding the gpu utilization as much as possible?

knotty moth
proud pecan
low shard
# proud pecan i tried it like 15-20 times and it wont even start because it cant connect with ...

If you finished the Google colab GPU,
You can:

  • Use alt google acc
  • Use the Kaggle version (a bit harder than colab and requires a phone number)
  • Wait till tmr
  • Pay for colab
knotty moth
steep mantle
#

I want to know how to make a model and put it in the program

rare gobletBOT
#

Ayo? @steep mantle level 1 !!! lfg

steep mantle
final sentinel
#

さっきまたuvr5やったんですけど設定変えたらできてこれってモデルが良くないってこちですよね?

low shard
final sentinel
#

Is there any setting other than karaoke model or uvr-bve that separates the background vocals from the main vocals?

steep mantle
low shard
#

You can't train with that locally

#

You have to do it on cloud

#

For colab (4 hours of daily gpu for free, not much hours, but easy to use):

Last update: June 15, 2024

Last update: Mar 8, 2024

knotty moth
proud pecan
cursive flare
odd shale
#

So you don't have to wait till tomorrow.

hidden zodiac
#

what does gpu0 gpu1 ect mean

summer lantern
#

i have one good working <333 after 4 months

hidden zodiac
#

i have issue with my own voice cutting any solutions?

pastel oak
misty elk
#

-colab

azure marshBOT
# misty elk -colab
☁️ Google Colabs
teal dome
#

will fcpe ever be fixed for applio

#

it was really useful for pronunciation differences

glad veldt
#

how do you denoise an instrumental?

rare gobletBOT
#

Ayo? @glad veldt level 5 !!! lfg

worldly totem
#

where i can get onnx voice?

#

help me someone pls

pastel oak
rare gobletBOT
#

Ayo? @worldly totem level 3 !!! lfg

worldly totem
pastel oak
#

Whats the problem rn why do you need onnx

worldly totem
worldly totem
pastel oak
#

I dont know why it does that but why do you need onnx anyway? for what

#

Maybe theres an alternative

grave mortar
#

Hi everyone! I have a problem installing fairseq through pip. Does anybody has the same issue

knotty moth
grave mortar
unborn geyser
#

where i can get snowie? pretrain

worldly totem
pastel oak
pastel oak
glad veldt
wild vapor
#

-colab

azure marshBOT
# wild vapor -colab
☁️ Google Colabs
rare gobletBOT
#

Ayo? @wild vapor level 2 !!! lfg

grave mortar
rare gobletBOT
#

Ayo? @grave mortar level 1 !!! lfg

grave mortar
#

-rvc

azure marshBOT
hasty crow
#

-audio

azure marshBOT
low shard
low shard
grave mortar
#

locally, on my pc

low shard
grave mortar
#

or troubleshoot using the given link?

odd shale
grave mortar
misty elk
#

-colab

azure marshBOT
# misty elk -colab
☁️ Google Colabs
grave mortar
low shard
glacial rapids
#

how do I calculate the processing time of a conversion? says 642/0.9 atm and still going / am running a local conversion

naive spire
#

They took away my permission to speak, bro

#

Just because I spoke my native language on a call that wasn't in my language

proper shale
#

better training colab

#

easygui is straightforward tho, if u wanna keep using that, but no guide for it

narrow sentinel
proper shale
narrow sentinel
proper shale
#

guide + link

rare gobletBOT
#

Ayo? @narrow sentinel level 1 !!! lfg

proper shale
proper shale
glacial rapids
#

im using whatever RVC was available on the ".wtf" guide for local

proper shale
proper shale
#

applio, mainline, mangio

glacial rapids
#

mainline

rare gobletBOT
#

Ayo? @glacial rapids level 1 !!! lfg

proper shale
#

mm

#

yeah that's probably the gpu in this case

glacial rapids
#

the 1060 does good enough work for me, ill give it another year before i need to upgrade. will probably scale the entire build up again

gaunt moss
#

my rvc app keeps crashing everytime i launch it

viral crater
#

Does anybody know how to fix
'No model for model_common loaded. Please confirm the model uploaded.'

gaunt moss
#

like something about tmp

pastel oak
pastel oak
pastel oak
viral crater
pastel oak
viral crater
#

i uploaded the index file and pth file

#

but when i start the console says: 'No model for model_common loaded. Please confirm the model uploaded.'

rare gobletBOT
#

Ayo? @viral crater level 1 !!! lfg

pastel oak
viral crater
#

blue box?

pastel oak
#

where the voice model is uploaded in

#

you have to select the voice model after you uploaded it, is it selected?

viral crater
#

i see this in my program

pastel oak
#

Oh brother

#

this version is old

viral crater
#

hm?

#

oh

pastel oak
#

How do people keep downloading the old version lately haha

viral crater
pastel oak
#

has latest versions

viral crater
#

so do i just delete it and reinstall latest version?

pastel oak
pastel oak
viral crater
#

how to know what gpu i have?

pastel oak
viral crater
#

uhh where gpu

#

im not sure i think i have nvidia tho

pastel oak
#

if you scroll down

viral crater
#

no

pastel oak
#

Then you have no gpu

viral crater
#

O_O

pastel oak
#

CPU voice changing is not recommended, youll damage your pc in the long run probably if you play a game with it too and cant do much with it xd

#

If your internet is good, you can try the online hosted alternative

wispy lodge
#

There cannot be no gpu, there should be at least some gpu

viral crater
rare gobletBOT
#

Ayo? @viral crater level 2 !!! lfg

wispy lodge
#

Could you check device manager maybe?

pastel oak
#

I forgot 💀

viral crater
#

ok i will just download nvidia ig

#

i have nvidia drivers so it makes sense to download nvidia

pastel oak
#

Yeaa

#

Did you check device manager tho?

viral crater
#

what section in dm?

pastel oak
#

Device Manager > Graphics Card

viral crater
#

._.

#

i dont have a gpu do i

pastel oak
#

You should have one

viral crater
#

ok then windows is broken lol

#

god does this program ever take long to download

odd shale
viral crater
#

i use external drive

grave mortar
#

Sup everyone! Is there a program to create an rvc dataset for me woth given .wav files? All programs that i saw online is buggy/not working at all. Thanks in advance

rare gobletBOT
#

Ayo? @grave mortar level 2 !!! lfg

viral crater
#

help the client doesnt reopen

viral crater
#

it says 'Web Server Launch Exception, Expecting value: line 1 column 1 (char 0)' and then never opens

odd shale
#

The best way to make a dataset is manually

#

Gather at least 12-15-20 mins of dataset, split them on clips of 5-10 seconds and place them on a folder.

pastel oak
viral crater