#✨│ai-help

1 messages · Page 184 of 1

analog obsidian
#

😭

prisma grove
simple ore
#

lil longer

#

30-60 seconds?

prisma grove
#

aren't rvc samples supposed to be 10sec or less?

simple ore
#

you dont have a normal speech?

prisma grove
#

no? it's miku

analog obsidian
#

she is from a vocal synthetizer meant for singing

prisma grove
#

I'll try to find a human cover of karma

simple ore
analog obsidian
prisma grove
#

uvr goes brrr

#

it's taking a while tho

analog obsidian
#

mvsep bsroformer is better trolley

prisma grove
#

I'm using replay, it's only got uvr

analog obsidian
prisma grove
#

and it didn't separate the vocal for some reason

analog obsidian
#

anyways, how slower the training is going to be with this lr and fp32?
i got my lowest point in 2 hours with this dataset using the default lr and fp16

simple ore
#

depends on gpu

analog obsidian
simple ore
#

can probably even do BF16 with it

analog obsidian
simple ore
#

I made a script to use accelerate and a chosen precision

#

well, just to try

rare gobletBOT
#

Ayo? @simple ore level 18 !!! lfg

simple ore
#

with AMD BF16 is super slow

analog obsidian
#

if u want i can try it but i have 0 coding knowledge so idk if i need to do some magic trick besides moving files

simple ore
#

it is for Applio

prisma grove
#

ok so

analog obsidian
prisma grove
analog obsidian
#

if thats the case i can try it

prisma grove
#

jubyphonic cover → uvr (kim vocal2) → miku rvc

simple ore
#

nah, just dropping the file, installing one package, configuring it, then running the training from command line

prisma grove
prisma grove
#

she has no problem pronouncing "r" (like in the word "dark")

#

she is pronouncing "th" (like in "the") as "d" though

simple ore
#

that's because the trained set has no 'the'

#

so the inded looks for something similar and finds 'de'

prisma grove
#

no

#

she has many instances of the word "the"

#

this song (karma) is literally in the dataset

analog obsidian
#

i find funny last time i said applio training was slower for me but after i tried the new version now is faster than mainline trolley

simple ore
#

newer torch, better performance

prisma grove
analog obsidian
serene horizon
#

So I started again in Kaggle, but when I go to train, it still says “error” in gui.

But in Kaggle says it’s working and is going through the epochs.

🤷‍♂️

analog obsidian
#

the only thing i want to get know is a stable fm that doesn't overfits like at 15 minutes of training

prisma grove
#

I'm gonna try with a tts audio

#

the people at de party

#

it got it right once

turbid pagoda
#

Its been a while since I've used applio. but the voice model i want to use is not appearing after i copy and download the link does anyone know why?

simple ore
#

downloads have been broken since the website rework

#

download manually and unzip into logs

turbid pagoda
#

got ya thanks!

low shard
prisma grove
rare gobletBOT
#

Ayo? @prisma grove level 6 !!! lfg

prisma grove
#

imma try something in polish to see how that goes

#

it should do good considering there's miku v4 chinese in there too (the required sounds are pretty much all there)

#

[rvc folder]/assets/weights

serene horizon
prisma grove
#

the "y" is very good tho

#

I'm gonna make a 30 min dataset and see how that goes, I'll throw in some more v4 chinese too

#

actually I just noticed

#

is this any good?

#

also what's the deal with pretrain models?

serene horizon
#

Hmm. My 50 epoch training .pth I just did in Kaggle as a test works in inference, but in the resulting file, some lines are really clear/loud, while some lines very muffled/very quiet.

Any ideas what is causing this?

vagrant carbon
#

can someone help me... the voice ai is repeating after everyone in the vc on discord, is there any way to fix this?

pastel oak
vagrant carbon
#

it worked and its better now, thanks!

gusty scroll
prisma grove
brittle wing
#

Huggingface illaria rvc gives a error on inference @low shard

low shard
#

it should say smt like 'GPU task aborted' or smt depending on the error

#

example:

brittle wing
low shard
#

do you get any?

brittle wing
brittle wing
low shard
#

you can just post a screenshot of the one at the top right

brittle wing
#

im so confused samsungsob

low shard
#

see if it shows

#

an error

#

at the top right

#

of the space

brittle wing
low shard
#
low shard
#

but alr

brittle wing
#

Is there a colab for Illaria RVC? @low shard

low shard
#

Ilaria RVC on google colab is broken

#

and the ilaria rvc zero version is made specifically for ZeroGPU of huggingface space

#

could you please

#

just show the error at the top right when u click convert

#

it should be like:

  1. u click convert
  2. after sometime u get an error at the top right
  3. screenshot and show me that at the top right not the one in the middle of the interface
#

be careful cus the one at the top right dissapears after like 10 seconds

brisk grove
#

What are these information's about dataset and batch size and 32k and else?

solar parrot
#

What happen to the colab that would automaticially do the vocal seperation and take the source straight from a youtube link?

rare gobletBOT
#

Ayo? @solar parrot level 1 !!! lfg

pastel oak
brisk grove
pastel oak
brisk grove
simple ore
#

batch size has little to do with VRAM

molten merlin
#

Hi guys, I'm new on training AI Voice models, what does the D, DUR, and G stands for on the outputs? (i.e. D_1000.pth, DUR_1000.pth, G_1000.pth)

brittle wing
#

hi i have an issue that i dont have friends

brittle wing
#

Does anyone know if its possible to download the voice models that people use on character.ai for the voice calls? Theres so many made of this one I want on that website but they're not posted anywhere so im just wondering if its possible through f12 or another program to download it to use in rvc

#

Ive literally scowered the internet and their discord for answers but theres nobody asking online and nobody answering in the server and im surprised with the amount of trained voice models there are on there that nobody is interested in ripping the higher quality voice models on there to use for RVC

mild birch
#

does anyone know how to work dis shi?

#

how do i get my voice sounding like carti

low shard
#

whats ur pc gpu?

mild birch
#

3080

#

like to record a song

low shard
mild birch
#

yeee

low shard
# mild birch yeee

As you got a good PC, you can use RVC locally, you can choose between:

  • Applio: A fork of RVC with some extra features like Applio TTS, same quality tho
  • Mainline: The original RVC
#

if u want an easier way, there is ilaria rvc zero which is on cloud (remote good pc)

brave garnetBOT
low shard
#

but id suggest to do it locally as you got a good pc and will have no gpu limit

mild birch
#

imma be honest i have no iidea what all this means what even is a RVC? what is the first step i need to take??

rare gobletBOT
#

Ayo? @mild birch level 1 !!! lfg

low shard
low shard
#

locally = on ur pc, u download it and use ur pc gpu

cloud = remote good pc, used mostly for those who dont wanna download it locally or dont got a good pc

low shard
topaz nimbus
low shard
low shard
fading steeple
#

Goat thank you bro

mild birch
#

thank you bro fr fr

#

makes alot more sense now

low shard
#

your welcome both of yall

topaz nimbus
topaz nimbus
low shard
#

ofc its NOT SUGGESTED as a phone cpu will be super slow

#

but its doable lol

mild birch
#

i got a r9 and 3080

#

im good right

low shard
mild birch
#

got it

topaz nimbus
low shard
#

the super bare minimum is 6gb of ram

#

even if 8 or more is suggested

mild birch
#

so any voice in the channel named voice models is a rvc hugging face model?

low shard
#

u can see it from the model post tag, rvc models got the rvc one, gpt so vits got the gpt so vits tag

#

but like 80% of them are rvc, so dw

#

there are thousands of rvc models

mild birch
#

ok bet thank you

rare gobletBOT
#

Ayo? @topaz nimbus level 1 !!! lfg

low shard
#

so i really think if u try to do it the app would just crash, i mean u are free to try but 4gb are too low

#

Id suggest to use cloud, like ilaria rvc zero for inference

brave garnetBOT
topaz nimbus
low shard
#

none of the phones was harmed (i even used it on my own which has 8gb) but ur phone seems too weak

topaz nimbus
#

To lower the requirements

#

Might not be possible but worth a try

slim wadi
#

would a 4060 (8gb) be good enough to run rvc well at low delay? or atleast compared to a 1660 graphics card?

low shard
# topaz nimbus if a linux ps4 jailbreak comes out for 12.00, well then I'm using rvc on that, b...

would actually be interesting tbh, i never seen someone run ai on a linux ps4, but a reminder that ps4’s gpu is AMD, the guide would be kinda different, check https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/en/README.en.md

GitHub

Easily train a good VC model with voice data <= 10 mins! - RVC-Project/Retrieval-based-Voice-Conversion-WebUI

topaz nimbus
low shard
# topaz nimbus To lower the requirements

removing non inference features wouldnt really lower the requirements, it doesnt check ur pc gpu when u install it, like u could intall it and even if ur pc isnt good for training ud be able to inference as fine if its good enough for that

#

so u dont need to remove them as as long as u dont use training, running inference would be just fine

mild birch
#

where is the files and versions tap

#

im a little lost

low shard
slim wadi
low shard
low shard
#

-rt

azure marshBOT
# low shard -rt

This interaction has expired, use the command /guides realtime if you wish to see it again.

low shard
#

its the first link

I mean ofc u can just use wokada too, the fork could help for performance tho

low shard
slim wadi
low shard
#

i use cloud mostly too

topaz nimbus
low shard
slim wadi
mild birch
#

yo

low shard
#

i honestly have a ps4 too but i dont think im ever jailbreak it as i use it for gaming

low shard
slim wadi
topaz nimbus
rare gobletBOT
#

Ayo? @topaz nimbus level 2 !!! lfg

fading steeple
low shard
low shard
#

It should be explained in the guides i sent u, there are blue texts that redirect u to the guides

topaz nimbus
#

@low shard do you know what this could mean? run-install.sh: 51: Syntax error: "elif" unexpected root@localhost:~/Applio#

slim wadi
low shard
mild birch
#

like where can i access files and versions i dont know how you got to that page in ur ss

low shard
low shard
#

its not cloud (remote good pc), dw

topaz nimbus
low shard
mild birch
#

like which model? or method? or?

rare gobletBOT
#

Ayo? @mild birch level 2 !!! lfg

low shard
topaz nimbus
low shard
topaz nimbus
low shard
#

btw for everyone im helping, if i disappear its bc im on school ipad lol

low shard
mild birch
#

illaria rvc zero

low shard
low shard
# topaz nimbus same result

yea seems like they changed the .sh 2 days ago lol, i need to check a new way as its very different than before sorry for the issue

#

will update it when i get home

topaz nimbus
low shard
#

easygui on termux should work tho

mild birch
#

or was i suppose to download one from the site itself ?

low shard
mild birch
#

okay yes i just did that mb '

#

how do i unzip ?

topaz nimbus
#

if not either way, then I'm fine with the cloud

low shard
#

u can find it in the models list when u refresh the models

low shard
#

its kinda the same steps as appkio except u have to do one line to upgrade pip, and can optionally upgrade gradio to have a better looking ui even if its not needed

rare gobletBOT
#

Ayo? @polar tendon level 2 !!! lfg

low shard
brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
☁️ Google Colabs
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

haughty schooner
#

what does
[Voice Changer] Pipeline is not initalized...
[Voice Changer] Waiting generate pipline.
mean?

pastel oak
radiant loom
#

how can i change the model name?

simple ore
#

just rename them?

radiant loom
#

how?

#

when i right click on this, just dont figure the option to rename it

simple ore
#

win11 may be a bit retarded with right click options

radiant loom
#

nice

oblique bay
#

Hello, I'm currently using a NVIDIA Geforce GTX 750 Ti (Also tried installing CPU version of the voice changer but my cpu is Intel Core i5-8400 at 2.8 GHz and its worse than my graphics card).

The voice changer I'm downloading off of:
(https://huggingface.co/wok000/vcclient000/tree/main)

Is it even possible to make it sound smooth voice with my graphics card? I want to see if this graphics card can even handle voice changer. If any tips or solution please let me know. Thanks

pastel oak
low shard
gleaming river
low shard
#

anyways the quality between of rvc and its fork is always the same, that one is just made to be easier especially for ai cover making

tame mica
#

in a zip file

#

perchance

low shard
brittle wing
#

what do i do with config json

#

i couldnt find dynamike voice model anywhere

odd monolith
#

Is there tutorial how to make your own rvc ai voice?

brittle wing
#

can someone walk me through making an ai voice modell

fresh cairn
#

has anyone got this working on Linux?

low shard
# brittle wing i couldnt find dynamike voice model anywhere

You can search rvc ai voice model at:

if there isnt one, you can:

low shard
low shard
low shard
low shard
#

be sure to be using rmvpe as pitch extraction,
and that your dataset is clean, and to also use the tensorboard

#

If i may ask, what language is it in ?

#

sure if u want to

#

Also, are u using the tensorboard ?

#

yea loss/g/total

mystic crest
#

Guys does anyone know how to like clone a rapper voice to urs for example a rapper “vocals” but u turn to that you made it

low shard
#

or u mean making the model first

#

and, whats ur pc gpu?

mystic crest
low shard
mystic crest
#

i just wanna turn other vocals into mine

mystic crest
#

i got a asus

patent pasture
#

Is there any method to get exact pitch value by using some software?

low shard
#

just in case to check if u could do it locally (on ur pc) or need to use cloud (remote good pc)

radiant loom
patent pasture
radiant loom
#

yes

#

ty

mystic crest
#

sometimes when i be recording on fl ts lags

low shard
#

For Realtime Voice Changing for Calls on Cloud (remote good pc for those who don't have a good one, YOU CANT DO THIS ON MOBILE):

#

Its way better to use kaggle btw

mystic crest
#

i don’t even understand this gang like all im tryna do is the simple shit what i jus told u

rare gobletBOT
#

Ayo? @mystic crest level 1 !!! lfg

low shard
#

Google Colab and Kaggle are cloud computing services:
Google colab = easier but short gpu time
Kaggle = Harder but WAY more gpu time

mystic crest
#

so what im tryna do

low shard
#

however it wont have much gpu time, but the other links dont have a guide

serene horizon
rare gobletBOT
#

Ayo? @serene horizon level 4 !!! lfg

brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

low shard
brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

serene horizon
#

When using Kaggle is there anything different I need to do to look at the graphed in Tensorboard?

low shard
brittle wing
#

So i'm trying to train my first model based off a spoken (admittedly very short, 12 sec) sample, for use in Okada, but it just comes out like a buzzing sound? I tried 250 epochs but the tensorflow graph was just a single point. Anything obvious i'm doing wrong, considering I've seen models based off single sound effects

#

other models work fine, i just made a bad one

low shard
#

And also u need to use the tensorboard (not sure if u just misspelled it)

brittle wing
#

sorry yeah tensorboard. Quality seems good to be, very clean and crisp, no background noise

violet badger
#

Does this still work? It just says error every time I try to convert

brittle wing
# brittle wing So i'm trying to train my first model based off a spoken (admittedly very short,...

Found the issue! these steps fixed it, in case anyone else sees this

Applio recently updated their version of PyTorch, so recently trained models may not work (They have static instead of speech) in:

rare gobletBOT
#

Ayo? @brittle wing level 1 !!! lfg

violet badger
polar tendon
serene horizon
low shard
#

Use Ilaria RVC zero

low shard
low shard
#

Like not from the same notebook u made yesterday,but make a new one

serene horizon
low shard
glad zealot
#

Oh wait that's Ilaria RVc

#

Mainline kaggle seems to work no problem for me tho

serene horizon
glad zealot
serene horizon
glad zealot
#

It should be version 3

serene horizon
glad zealot
#

Download or use it for training?

serene horizon
glad zealot
#

What error it says?

serene horizon
#

To use to train.

snow hazel
#

How long till I can use X-Minus again?

serene horizon
arctic spear
#

hey I did the clone repository and install depndencies on the colab page and when I tried to start a server it said there was no such file or directory '/content/voice-changer/server'
/content

Is there somewhere I need to go to download these things?

glad zealot
glad zealot
#

It's the 3rd cell iirc

azure marshBOT
# glad zealot -colab

Suggestions for @arctic spear

☁️ Google Colabs
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

glad zealot
#

Use the modified voice changer

#

In this list

#

-realtime

azure marshBOT
arctic spear
#

thank you

serene horizon
glad zealot
#

Noice

brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

snow hazel
#

How long till I can use X-Minus again?

#

Cause I keep getting this

knotty moth
brave garnetBOT
#

Settings for Nvidia GPUs nvidiagpu

F0 Det.: rmvpe (suggested for all series)

RTX 40-series: 80-96 chunk | +16384 extra
RTX 30-series: 96-112 chunk | +16384 extra
RTX 20-series: 112-128 chunk | +16384 extra
GTX 16-series: 128-192 chunk | +8192 extra
GTX 10-series: 128-192 chunk | +8192 extra

Advanced Settings

Protocol : Sio or Rest
Crossfade: 4096 start 0.2 end 0.8
Trancate: 300
Silencefront: Off
Protect: 0.5
RVC Quality: Low

slim wadi
#

can somneone link me the download for the virtual audio cable?

polar tendon
unique stratus
#

is there any trick to generate output that is less likely to have voice cracks ? It looks like all the Gura models have the same issue with this one song. I don't know about the settings like Voice conversion options and Audio mixing options so I would really appreciate it if anyone can help me out in this part. Thank you

polar tendon
#

If it's at 0.5 then turn it down

unique stratus
polar tendon
#

Play around, see what what works

polar tendon
unique stratus
rare gobletBOT
#

Ayo? @unique stratus level 2 !!! lfg

polar tendon
unique stratus
#

There's a part after 1:05 that always crack

#

Can I send the audio directly here

polar tendon
#

Sure

unique stratus
#

The rest of the song is fine but that ONE part near 1:07 keeps cracking no matter what Gura model I try

polar tendon
rare gobletBOT
#

Ayo? @polar tendon level 3 !!! lfg

polar tendon
#

Like the model used

#

In the audio files

unique stratus
#

I actually tried various different ones actually, but all of them end up having the same crack in that one part

polar tendon
unique stratus
#

Thank you 🙏

knotty moth
unique stratus
polar tendon
unique stratus
#

Btw Sorry for my ignorance, I was doubting if there was an issue with the models, turns out it's the input's issue 🙏

polar tendon
#

And then merge them back into one audio file

#

Until you're only removing the back vocals and not the instrumental too

polar tendon
unique stratus
rare gobletBOT
#

Ayo? @unique stratus level 3 !!! lfg

fresh cairn
polar tendon
unique stratus
polar tendon
finite prawn
#

preprocessing gets stuck here. tried restarting applio and my laptop but still stuck here

#

just realized what it could be, actually.... gonna test that theory... yeah nvm its not what i thought it was. has nothing to do with the model name replacing an already existing name

polar tendon
knotty moth
finite prawn
#

oh i found out what it was. im too damn impatient lmfao. the progress bar wasn't moving but it actually WAS preprocessing. just took a while presumably because i have a huge dataset

fresh cairn
#

Okok I will try in morning then

polar tendon
finite prawn
#

8gb

polar tendon
finite prawn
#

wait they let you use 15gb vram over on the cloud side? that's so generous lol

rare gobletBOT
#

Ayo? @finite prawn level 4 !!! lfg

knotty moth
# finite prawn

make sure you are using a folder containing wav files exported through audacity (if it doesnt work)

original rvc should work tho

polar tendon
finite prawn
#

already am using audacity exported .wav's

low shard
polar tendon
low shard
knotty moth
snow hazel
polar tendon
low shard
fresh cairn
low shard
# fresh cairn The one that can be used for discord and such

So realtime voice changer for calls on discord, yeah wokada, there is a Linux part in the Wokada fork: https://rentry.co/forkvoicechangerguide

knotty moth
low shard
fresh cairn
polar tendon
low shard
low shard
knotty moth
polar tendon
rare gobletBOT
#

Ayo? @polar tendon level 4 !!! lfg

snow hazel
#

Do they both have ways to seperate harmonies?

polar tendon
knotty moth
low shard
snow hazel
rare gobletBOT
#

Ayo? @snow hazel level 11 !!! lfg

snow hazel
#

?

#

Oh on mvsep

#

Which vocal type?

finite prawn
#

should it be considered normal for training with an hour-long dataset to take around an hour for every epoch? I have an RTX 4070 with 8GB VRAM, and all of the models I've trained before had pretty fast epochs (but they did have 15-20 minute datasets)

#

oops

#

my forgetful ass forgot i had the same issue and fixed it a long time ago

#

disable cache dataset in GPU

knotty moth
finite prawn
#

my settings were fine except for 'cache dataset in GPU' being enabled. encountered this problem before but i just forgot the solution

#

just disabled it and retrained and now it's running smoothly

knotty moth
finite prawn
#

i just made an educated guess with my lack of computer knowledge that enabling it would be beneficial

snow hazel
knotty moth
brittle wing
#

@low shard I stil get the error in the illara rvc, i did not see the Gpu task aborted message so what is going on!?

brittle wing
low shard
brittle wing
brittle wing
low shard
low shard
#

Yea pls do that so it's easier for me to understand the exact issue u are having

#

My guess would be that it's the GPU quota but you would get a "retry in.." with the time at the top right

#

So that's why I'm asking much for the error, cus you have to do different things based on the error you get

brittle wing
#

i waited 24 hours for the gpu quota

#

@low shard

low shard
# brittle wing here

Before clicking convert, be sure it shows the waveform of the audio you upload which means the audio is successfully uploaded

low shard
#

(the waveform is that thing shown in the image, it seems like you convert before it fully loads the audio)

#

Try waiting first for that to appear, and then click convert

brittle wing
low shard
brittle wing
#

i give up

#

@low shard

low shard
brittle wing
low shard
#

it has the trained index, instead of the added index

low shard
brittle wing
low shard
mild coral
#

where is gpu

low shard
# mild coral where is gpu

oh idk why but in ur pc is called 'graphic processor' which is the gpu, that seems an integrated graphics gpu which is bad, meaning you won't be able to train voice models locally (on ur pc)

#

As you dont got a good PC, its better you use cloud (remote good pc) for training an RVC Voice Model:

  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
    • Mainline (UI, No guide as of right now)
    • Applio (UI, No guide as of right now)
#

Google Colab = Easy but low gpu time
Kaaggle = Harder but MUCH Gpu time

low shard
low shard
#

where did you train?

brittle wing
low shard
#

let me know if u find it

low shard
#

an example

#

u should download that, then zip it with the .pth and the model should be working now

spiral notch
#

Hi, I'm re-adjusting to using RVC after it's migration from Google, how do I change the octave the voice is singing in?

#

I realized I put a male voice on a female singer and he sounds.... Bad. Lol.

low shard
spiral notch
#

I'm using the downloadable version off github, I was under the impression google took RVC down so it migrated lol!

low shard
#

What's ur pc gpu?

serene horizon
#

Is there a place where I can do inference convertions online like training in Kaggle?

low shard
#

however, for inference the fastest one is ilaria rvc zero

brave garnetBOT
cloud lance
#

implementing RVC for an app. if anyone knows optimal block_time settings please lmk

spiral notch
rare gobletBOT
#

Ayo? @spiral notch level 1 !!! lfg

low shard
# spiral notch My dad built the thing so I had to ask him, an AMD RX6600

Btw You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU
(telling u just in case u ever forget lol)

However, for the Mainline RVC (which is prolly what u used) needs linux for AMD GPUs, for windows its better u use Applio (an rvc fork, modified version, the quality doesnt change, only the UI which is the interface): https://docs.applio.org/getting-started/installation#amd-gpu-support-windows

Documentation for a high-quality, open-source speech conversion ecosystem designed for simplicity and optimized performance

spiral notch
noble dawn
#

Yooo

#

Anyone here expert at training in local ai

#

?

#

My stuff be not sounding good when I have a Nevida gpu 3060 Ti

low shard
spiral notch
#

filename is RVC1006Nvidia

elfin dome
#

hi guys ,. i just have now mmvc_client_v0.4.3_x86_64_win and this one need a ONNX models
how can i get ONNX models
thx u

low shard
low shard
stark wadi
#

Have a question. Certain words sound like they are speaking in like a japanese accent. Would a pretrained starting model with english singers help with this? I'm using RVC mainline and the pretrained weight that comes with it and training off that. I assume that because this was made by a non native English speaker that some pronunciation issues are to be expected, because the initial training data was not English. Is there anything I can do to rectify this?

simple ore
stark wadi
#

The index is english. But some words, like hard G sounds trigger it.

#

I meant, the training data is english I meant.

arctic spear
#

Hey, I noticed that when I'm playing a game like Cs2, the voice changer glitches out more. Is there a way I can stop this problem?

rare gobletBOT
#

Ayo? @arctic spear level 1 !!! lfg

simple ore
#

limit fps?

raw bridge
#

hello

#

im trying to do a text-to-speech program with a cloned voice

sterile sparrow
#

can anyone help?

#

its not working for me

#

at all

low shard
patent trellisBOT
# low shard !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
stiff moat
#

How do I use the model i made? I remember there was a local website for it and now I can't find it

low shard
#

And do u mean using it for pre-recorded audios or realtime for calls

stiff moat
stiff moat
#

although i think i've found the page i was looking for

#

thank you for replying though

rare gobletBOT
#

Ayo? @stiff moat level 1 !!! lfg

low shard
# stiff moat 1650

you would be able to do inference (using models) only locally (on ur pc)

#

but wouldn't be very fast, i would honestly recommend cloud (remote good pc)

low shard
stiff moat
low shard
stiff moat
#

yeah

#

i think so

low shard
# stiff moat yeah

I'd suggest you Ilaria RVC Zero, it's faster as its using ZeroGPU (A100, faster than Google Colab's T4)

brave garnetBOT
stiff moat
#

thank you

#

exactly what i was looking for

low shard
#

yw

brittle wing
#

I can't use Ilaria RVC mainline, I keep getting errors

#

-hf

azure marshBOT
humble valve
#

what is the best one to use for chatting i want it to use for a rpg character

low shard
#

It might be u finished the quota

brittle wing
#

I closed runtime

#

But

#

I can't convert

low shard
brittle wing
#

That's the problem

low shard
#

Yeah it's broken

brittle wing
#

Ouch 💔

low shard
brave garnetBOT
low shard
#

Use Ilaria RVC zero instead

#

It's a ZeroGPU HuggingFace Space which has faster GPU than Google colab

brittle wing
#

I already inferenced with applio but I wanted to check how the results will be because it's slightly different with every Colab

knotty moth
brittle wing
low shard
brittle wing
low shard
brittle wing
#

In some more realistic

#

Idk

low shard
#

Are you really really sure? Might be a placebo

#

The only difference is basically the UI, interface

brittle wing
#

Okay where is the newest ai cover generator Colab it's too slow

brittle wing
knotty moth
low shard
#

ZeroGPU Is faster

#

And easier to setup ye

brittle wing
brittle wing
low shard
brittle wing
#

And it still works

#

I use applio tho

#

It's the fastest for me

brittle wing
low shard
#

They pay people to promote their shit, used to do it alot on tiktok

#

They ALL use RVC

brittle wing
#

Wait

#

Is it the same as any other

low shard
#

They pay people to promote their shit and make others use their site, that's prolly why u heard that lol

brittle wing
# low shard Yes...

Then why did that one person tell me it's the best I mean I've inferenced the same stuff with applio and the one from jammable has better sound

low shard
low shard
#

If you really think there is, might be the audio wasn't cleaned well and in jammable they just cleaned it a bit more, other than that there is no other way as it's just RVC

low shard
#

There's tons of those ai covers site

#

At the end they all use RVC

brittle wing
#

But it sounds more realistic than what I get

#

Like I can't get such realistic results I even use the exact same models

low shard
#

The reason why u can't import is not bc it's different, it's just like kits.ai

brittle wing
#

Idk

#

The sound is more realistic

knotty moth
low shard
brittle wing
#

The person I literally messaged and talked to literally has credited the models and creators!

low shard
#

U ofc need to get a good quality RVC model for having good quality conversion

brittle wing
#

I use the best ones

low shard
#

You are just comparing 2 different RVC models

brittle wing
#

Five of them are literally the same I use

#

See?

#

And the cover is still better than what I can convert

knotty moth
brittle wing
low shard
#

They used to steal covers on TikTok and put the voicify logo on it too

#

Bc people get paid to promo that shit

knotty moth
#

who is this bot @brittle wing ? shocked

low shard
# brittle wing

In the jammables site it doesn't show that u can import external RVC modela

brittle wing
low shard
brittle wing
#

Then why did this person literally credit the creators with the exact same models I use

#

Do the developers steal models from here and profit off them for money?

low shard
#

But the models can't be imported any other way as I just checked

brittle wing
#

My two old models are probably here hahaha

#

They look like they got them from weights...

low shard
brittle wing
#

The models aren't that good but

rare gobletBOT
#

Ayo? @brittle wing level 7 !!! lfg

brittle wing
#

How do they sound realistic here

low shard
#

You need to properly clean the vocals

brittle wing
#

Cause there's no longer need for that Melband karaoke doesn't generate noise

low shard
brittle wing
#

I'm just lazy

low shard
#

Then rip

brittle wing
#

But Melband karaoke doesn't generate noise when removing the Reverb it's not MDX or UVR architecture
Roformer is noise free

low shard
# brittle wing What-

i mean that ofc you need a good quality model to generate good results, i seen u said u overtrained

brittle wing
#

But isn't overtrained

knotty moth
brittle wing
#

Don't ask me too much questions I'm not stupid I know how to train models

#

Well it was last year the dataset was shorter than what is supposed to be 750 epochs I probably looked at the tensorboard or I didn't I don't remember

#

Is RMVPE+ better than RMVPE? Harmonify uses that too

#

Ilaria gives better results than applio

#

Clearer vocals

#

But tends to decrease vocals' volume

knotty moth
brittle wing
#

How do I fix that

#

Volume envelope should be default

plush compass
#

Is it okay to use Hubert while using a pre-trained model?

rare oriole
#

Google drive doesnt seem to like RVC

#

crap i cant send an image

#

but it basically said I executed code that's "not allowed" in the free tier

cloud finch
#

Whenever i do applio i get this

#

When i convert

#

it gives me nothing

#

and gives that error

dusty plume
#

Can someone when i try to convert vocals it gives me errors

carmine hearth
#

-howtoask

azure marshBOT
carmine hearth
#

!howtoask

patent trellisBOT
# carmine hearth !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
dusty plume
#

does anyone know how to terminate the .bat file after your done

stone lintel
#

anyone know whats causing a weird g/fm graph on tensorboard?
my other graphs look fine but this one bouncing around i dont think its a good sign

sudden lintel
#

i have no idea how to download the RVC can someone help me ?

still phoenix
#

what to do?

brave garnetBOT
glacial pollen
#

as that's the most important characteristics-type metric for models

#

also, pro tip. Don't use " ignore outliers " or whatever was it called

#

instead, keep the graph as it is + smoothing at 0.2 to 0.6

#

Ignoring outliers serves practically no purpose in case of rvc

glacial pollen
# glacial pollen FM graph will oscillate

To have it more stable in general, it's either you have to take care of samples' not being too diverse / too dirty or just increase the batch size

I've talked about it too many times so won't go full in-detail but, smaller batch_size = more noisy and non-stable graphs whereas higher batch_size gives you more stable and " flat like " graphs

glacial pollen
low shard
# glacial pollen It is the same " under the hood", just the functionality differs a bit. The only...

Different types of RMVPE Pitch Extraction:

  • rmvpe: A Robust Model for Vocal Pitch Estimation in Polyphonic Music, the normal version of the best pitch extraction, it's robust and not sensible to noise
  • rmvpe+: has a pitch threshold, it limits the max and minimum pitch possible, basically deleting f0 values below and above certain thresholds
  • rmvpe-gpu: Training ONLY, uses your gpu for the feature extraction process, using more gpu so making training faster
  • rmvpe-onnx: Wokada ONLY, its a must for AMD Users who use ONNX models

Its basically the same as rmvpe except it uses a min and max pitch

fleet fiber
rare gobletBOT
#

Ayo? @fleet fiber level 1 !!! lfg

brittle wing
#

!howtoask

patent trellisBOT
# brittle wing !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
fleet fiber
low shard
#

since when ?

fleet fiber
low shard
#

its one of those scam ai sites

#

just use RVC, that's what every of those sites use, its Open Source

fleet fiber
#

It's not, go and try it urself? i tried kits.ai before and it didn't work that well but Jammable is so good like? the results i get are way better that using any other site

fleet fiber
low shard
low shard
#

its literally just rvc

fleet fiber
fleet fiber
low shard
low shard
#

rvc is literally easy to use

fleet fiber
#

it's not a scam tho, why u keep saying that? If the quality was bad and the results were bad then i wouldn't pay for it

low shard
fleet fiber
low shard
fleet fiber
low shard
fleet fiber
rare gobletBOT
#

Ayo? @fleet fiber level 2 !!! lfg

fleet fiber
low shard
low shard
fleet fiber
fleet fiber
low shard
fleet fiber
#

Also, on applio, the site always stops working so I have to start all over again, that's why i switched to Jammable

odd shale
low shard
low shard
# fleet fiber Also, on applio, the site always stops working so I have to start all over again...

Also, on applio, the site always stops working so I have to start all over again
Pheraps you mean applio colab? that's not a bug, Google Colab is a cloud computing service, it cant give free gpus forever so ofc u can't let it run 24/7

Ilaria RVC Zero is already set up, and uses ZeroGPU from huggingface (an A100 is way faster than google colab's T4 GPU)

fleet fiber
fleet fiber
low shard
fleet fiber
low shard
#

its like talking to a wall atp

odd shale
# fleet fiber Yes

Ofc that occurs because you exceeded the usage limit. That's how colab works.

#

Including the fact Google got very strict with RVC colabs.

fleet fiber
low shard
fleet fiber
odd shale
#

That was what i'm talking about.

fleet fiber
low shard
fleet fiber
fleet fiber
low shard
low shard
#

its the same logic as running DOOM on 2 same laptops in terms of hardware with the same OS, the results won't change

fleet fiber
fleet fiber
low shard
#

i cant really help without u telling whats the error message tho, it depends on what step u did wrong or if u downloaded a model with unnecessary files or if u finished gpu quota, or etc

brittle wing
low shard
quasi dagger
fleet fiber
fleet fiber
low shard
fleet fiber
fleet fiber
brittle wing
low shard
fleet fiber
#

same or not, again it's better in the sense of quality

rare gobletBOT
#

Ayo? @fleet fiber level 3 !!! lfg

fleet fiber
#

Level three of what huh

odd shale
low shard
#

it is the same.. u can literally import the RVC models

brittle wing
low shard
#

if it wasn't the same u wouldn't be able to do that, it would end up like trying to load a GPT So Vits Model in RVC, which wouldn't work

fleet fiber
fleet fiber
low shard
brittle wing
fleet fiber
brittle wing
#

I said that applio and harmonify outputs have a little difference in sound

low shard
brittle wing
#

Bro you aren't deaf come on

fleet fiber
#

Now, you're not trying to listen at all

#

ur just trying to prove ur point when it's literally wrong

brittle wing
#

@fleet fiber 💜

low shard
#

i literally explained yall how those sites work

#

its not like they have magic servers that make it run with better quality

brittle wing
#

And... that's why it's paid cause it's better

low shard
#

its the same program

#

which is literally Open Source

brittle wing
#

Itsss the same program okay but the sound is more realistic

#

Idk what kind of coding they set here

low shard
#

its like saying iZip is better than Winrar just because you are forced to buy an iZip license

fleet fiber
low shard
quasi dagger
brittle wing
#

Also Mel-roformer karaoke model on X-Minus sounds better than the one on MVSEP.

#

That's an example

brittle wing
#

Explain this

fleet fiber
quasi dagger
low shard
odd shale
low shard
brittle wing
#

Have you tried them all

brittle wing
fleet fiber
brittle wing
#

Well currently imo the best thing is Ilaria with 0 gpu

odd shale
brittle wing
#

That says enough
Yes personally I wouldn't pay money too but the result is realistic.On each their own

odd shale
fleet fiber
glacial pollen
#

¯_(ツ)_/¯

brittle wing
brittle wing
odd shale
#

There is no difference.

glacial pollen
#

btw, here, catch 🥬

analog obsidian
glacial pollen
#

All metrics are supposed to go down

odd shale
#

Thank you Cody..

glacial pollen
#

just that FM from what I've noticed, at least in those rvc builds I tested including mine, usually is reversed

#

( the graph )

#

yet the value on it's own should go down

analog obsidian
#

so is like a visual bug?

glacial pollen
#

either a bug or just a quirk devs left in the code

#

tho I myself couldn't spot what causes it last time I checked the code ( might be I didn't notice it but ye

analog obsidian
#

uhhh so how we can tell if fm is fine?

glacial pollen
#

just look at the value itself

#

rather than visual graph

#

if it's reversed in ur case, as in, up = down ( based on what values tell you )

#

then ideally it should be going up ( virtually, down )

analog obsidian
#

Hmm what if is going down instead?

#

then is bad?

glacial pollen
#

and the values increase?

#

as in, fm loss

analog obsidian
#

They decreased for me

glacial pollen
#

then if values decrease and graph is descending, it means it's all good

#

and just how it should be

#

values down + graph down = good ( normal behavior
values up + graph up = bad ( normal behavior

values down + graph up = common bug / quirk of rvc but it just still = good
values up + graph down = common bug / quirk of rvc = bad

analog obsidian
#

You said higher batch sizes helps fm to be stable, when i do that visually the graphs goes up very fast at the start of the training
Yesterday i tried a small batch size and it went down with a few fluctuations but it did not go up forever

#

why that happen?

odd shale
glacial pollen
#

higher batch size contributes to more stable models because more data " at once " ( too complex to explain in easy words so imma skip it ) is used

#

however, it can lead to too flat converging or just overfitting quicker / more often

#

there's also that thing if your dataset is too diverse

#

for instance, weird balance of speech / singing / screaming

#

and you use high batch size, model might confuse the stuff during inferencing so it's best to, in that case, go for lower batch or balance the ratio

analog obsidian
#

hmm interesting i see

glacial pollen
#

as for lower batch sizes

odd shale
glacial pollen
#

it's either your set is small ( yet balance is kept in the set ) so that's what it is
or your set's balance is kinda weird or broken, then it should help

glacial pollen
#

in terms of characteristics

#

i.e.: screaming, shouting, whispering etc

odd shale
glacial pollen
#

if it's rather uniform and balanced
high batch size should be good ( for ur case leo ) but sometimes you can decrease it to improve results / generalization

analog obsidian
odd shale
#

Gracias Cody.

analog obsidian
glacial pollen
#

alternatively

#

if one wants to maintain the batch size but not tinker too much with the set

#

just add more " normal samples " and keep the " diverse / outlier ones / unusual ones " in 2-5% whereas 95-98% is your standard ones

analog obsidian
#

too diverse can cause fm to be unstable? The more you know

glacial pollen
#

matching of features

#

if there's too high diversity, model is confused a lil

#

on what is the " right " features' distribution

#

more or less, simplified

Because, well, I talked about it a lot as it's crucial

Diversity is fine and sometimes welcomed yet uniformness and so, a bunch of data that can keep it's identity is the most important

analog obsidian
#

also other thing that happen to me is that rmvpe fm graph always went up regardless of batch size but when i switched to mangio crepe hop 32 and batch 2 made the graph not going up visually but instead looked like the one i sent

#

the end result was fine like u would expect to sound like

red kayak
brittle wing
# odd shale

I used to have two water turtles as a kid and turtles were my favorite animal

glacial pollen
#

it isn't always the set on it's own

#

can be just

#

well, voice's complexity and it's nature

brittle wing
glacial pollen
#

Now pardon me folks, gotta go back to sorting samples.
Ripped all data from danmachi game lol, over 30k samples to sort 💀
take care!

red kayak
#

the way it comes out the website

brittle wing
#

I haven't paid for jammable

red kayak
brittle wing
#

But the person who uses jammable has messaged me and I compared both

rare gobletBOT
#

Ayo? @brittle wing level 8 !!! lfg

red kayak
brittle wing
#

Hm I gave to ask this person for a sample first

red kayak
brittle wing
#

But it has reverb and music

red kayak
analog obsidian
glacial pollen
#

@analog obsidian Sure thing! If you need more technical info, please search for my msgs on this server. I do leave explanations or stuff here n there, incld. metrics etc.

brittle wing
#

I'm waiting for the person to reply

red kayak
brittle wing
#

Ohhhhh

#

It's clear

red kayak
#

the clean up vocals and then enchance their clarit

brittle wing
#

Bandlab has a mastering feature

red kayak
#

yeah u can mix and master with bandlab

brittle wing
glacial pollen
#

gimme a brief tl;dr

red kayak
#

both models were trained on different datasets

brittle wing
red kayak
brittle wing
#

It's the same exact model

#

Again

analog obsidian
#

Bro if they were two different things u wouldn’t be able to use your rvc model there

red kayak
#

simple as that

brittle wing
#

Can I manually do that

glacial pollen
#

How can Litsa know what was the exact post processing chain they used

red kayak
# brittle wing What post processing?

well they could've used some sort of de esser or eq to smooth out some frequencies and make them more natural and yes u can manually do that you just have to learn how to. Now i don't know what exactly they run the audio files through since i dont have the code they use at hand so i cant tell u anything thats accurate to that :p

brittle wing
glacial pollen
#

de-essing, eq, compression, saturation, there's tons of what can be done

red kayak
#

yeah what codename said

brittle wing
analog obsidian
glacial pollen
#

Might be you could try to reverse the impulse maybe?

brittle wing
glacial pollen
#

Read on irs / convolvers

red kayak
analog obsidian
#

Magical husky dog

glacial pollen
#

get the exact same input they used and make sure it's the same model

#

get an irs response out of it effectively replicating the effects' chain

#

and apply it over ur model's output
tho, I won't teach you how it's done as that's not my specialization, just proposing an idea ~

brittle wing
brittle wing
red kayak
#

just keep that in mind

analog obsidian
#

Eqing usually makes rvc models to sound realistic

#

U can decrease the volume of the artifacts