#✨│ai-help


thorn abyss
#

-audio

azure marshBOT
thorn abyss
#

-colab

azure marshBOT
# thorn abyss -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

thorn abyss
#

-overtrain

azure marshBOT
quiet crystal
#

yo can someone have one of those

#

like download links

#

to rvc

#

and okada

#

pls

proven hill
azure marshBOT
hallow thistle
proven hill
#

-rt

azure marshBOT
azure marshBOT
hallow thistle
proven hill
hallow thistle
#

That's crazy. You wanna run both Applio (the RVC fork) and W-Okada at the same time?

heady gorge
#

where is chat

hallow thistle
valid spruce
#

What sample rate should I use?

proven hill
lean grail
#

I have been out of the loop for a while, whats the best way to use speech to speech ai now?

craggy brook
#

My voice sounds so bad in audio recording applications and audio files that I'm embarrassed to listen to it and turn it off immediately. Even though I sound normal, my voice sounds incredibly deep on the recording. Is there an application I can use with my own voice that detects it automatically and makes it sound better and more realistic, without distortion?

proven hill
# lean grail I have been out of the loop for a while, whats the best way to use speech to spe...
lean grail
#

Ah thank you

proven hill
#

no probs

swift zephyr
#

anybody here got a kacy hill voice model?

swift zephyr
#

gonna give up

craggy brook
proven hill
proven hill
craggy brook
#

yes

proven hill
craggy brook
#

Is there a shorter and faster training guide, as a video? I don't speak English anyway.

craggy brook
#

F***K

lean grail
proven hill
lean grail
#

tytyty

craggy brook
# craggy brook F***K

And people say don't use it, they say it is very difficult to differentiate. I want to make another one with my own voice, nothing comes out other than sounding like a robot.

lean grail
#

Bruh, the guide tells me that a 3090 can do 20-40 ms chunk plus 2.7 but my 4090 needs to be above 100 ms chunk for it to not lag

#

Any place i can sort voice models by ranking or popularity?

golden rune
#

guys I'm using colab for real time voice changer

#

how can I bind my output voice to discord or any other software that uses my mic

glad mountain
glad mountain
#

so somethings wrong lmao

brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

median finch
#

is there an easy way to upsample and downsample my audio stream on windows to pin it to the sample rate that different models use?

proven hill
#

its useless

#

just check the sample rate with spek
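
If you'd rather check from a script than open Spek, here is a minimal sketch using Python's standard `wave` module. It only reads uncompressed WAV files, and the function names are made up for illustration, not taken from any RVC tool.

```python
import wave


def wav_sample_rate(path):
    """Return the sample rate (in Hz) stored in a WAV file's header."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getframerate()


def matches_model_rate(path, model_rate_hz):
    """True when the file's sample rate equals the rate the model expects."""
    return wav_sample_rate(path) == model_rate_hz
```

For example, `matches_model_rate("my_clip.wav", 40000)` would tell you whether a (hypothetical) `my_clip.wav` is already at 40 kHz.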

fickle dust
#

Can u use okada live rvc on discord voice messages?

median finch
sudden tree
#

anyone know when 3.2.9 or 3.3 comes out for applio

#

i failed at custom installing the damn package update

simple ore
#

just get a compiled build, sheesh

sudden tree
#

what? ppl yesterday were saying compiled build was outdated even though i installed 12/25

#

they said g/loss/total graphs were not correct

lone nest
#

Does RVC Disconnected separate the vocal stem from the input file?

#

My data is already a clean single-person voice, so if it runs any step to separate vocals from background sounds or from another person's voice, I want to skip that step.

tight frost
#

I managed to train my model, but the voice turns robotic at the end of sentences. I tried many epoch counts (250-500). Do I need to train it more? How do I set it up correctly?
The dataset is 23 mins, totally clean, with silence reduction done in Audacity.

proven hill
#

-docs

azure marshBOT
proven hill
#

first link

brittle wing
#

Will it be okay if I include laughs & speech in my dataset of the same person aside from vocals?

tight frost
#

Thank you, so let's summarize:
1. download Applio and a pretrained model
2. train it correctly

proven hill
brittle wing
proven hill
brittle wing
#

How many minutes of laugh and speech?

proven hill
#

use the og pretrain

proven hill
brittle wing
proven hill
brittle wing
proven hill
brittle wing
proven hill
proven hill
#

wdym no

brittle wing
#

Pretrains exist to ease model training also to improve the model results

proven hill
brittle wing
proven hill
#

make sense

brittle wing
#

Nothing to do w you

brittle wing
#

In what sense I think pretrains are a modern variant to ease model training and all

proven hill
brittle wing
proven hill
#

its a fine tune

brittle wing
#

Also a little noise doesn't matter that much

proven hill
low shard
proven hill
low shard
#

It all depends on ur dataset language and length

low shard
proven hill
#

ofc im assuming hes training in english

low shard
#

imagine u want an anime image, would u use an anime or a realistic image model?

proven hill
#

this doesnt make any sense 😭

low shard
proven hill
low shard
#

presumably

low shard
brittle wing
simple ore
tender basalt
#

What is the best version for mmvc
And where I can download it

low shard
hallow thistle
low shard
#

dw

sudden tree
simple ore
#

no

#

colab should always be an official release

hallow thistle
#

Some files that have been uploaded to Ilaria RVC from Hugging Face are all saved somewhere in %temp%, I found out that today.

hallow thistle
brittle wing
brittle wing
proven hill
brittle wing
#

Also I'm the one making the model so I decide for myself

proven hill
#

no one is arguing here

brittle wing
#

Okay does KLM x4 handle speech and laughs

brittle wing
knotty moth
hallow thistle
#

Finally, it indeed went well.

hallow thistle
#

It took an hour to finish because of how slow my laptop CPU is. It would be faster with a better PC and an NVIDIA GPU.

lone nest
#

Huh

#

I just made a 300 MB .index file

#

Is that normal?

proven hill
#

depends how long is the dataset

lone nest
#

40 mins

proven hill
#

completely normal

lone nest
#

Meanwhile the 4 hours dataset gives 30 MB .index

proven hill
#

…how

knotty moth
#

with the normal faiss it would be ~2 GB

hallow thistle
knotty moth
#

applio has the option whether to use faiss or k-means

lone nest
#

if big_npy.shape[0] > 2e5 or force_mbkm: oh makes sense

#

It was on RVC disconnected

lone nest
hallow thistle
#

Imagine that Ilaria RVC from Hugging Face being developed to run locally.

lone nest
#

Thinking of changing the code to force it to use faiss but maybe Colab usage will run out first lol

knotty moth
hallow thistle
proven hill
#

lol

simple ore
#

after ~4000 starts shuffling and showing a text output as it attempts to reshuffle pieces that are somewhat unique

#

with 200k chunks the extracted features get grouped into clusters and most of them get discarded

#

that again brings the index to a somewhat manageable size
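
To put rough numbers on the index sizes mentioned above, here is a back-of-the-envelope sketch. Everything in it is an assumption for illustration: about 50 feature frames per second of audio, 768-dimensional float32 features (v2-style), the 2e5 row threshold quoted from the code earlier, and 10,000 k-means centers kept once clustering kicks in. Real index files also carry some overhead, and silence slicing changes the frame count, so treat the output as an order-of-magnitude estimate only.

```python
# All constants below are assumptions, not values read from a real run.
FRAMES_PER_SECOND = 50       # assumed feature frames per second of audio
FEATURE_DIM = 768            # assumed feature width (v2-style features)
BYTES_PER_FLOAT = 4          # float32
KMEANS_THRESHOLD = 200_000   # the "2e5" row check quoted in the chat
KMEANS_CENTERS = 10_000      # assumed number of cluster centers kept


def stored_vectors(minutes, use_kmeans=True, force_kmeans=False):
    """Estimate how many feature vectors end up stored in the index."""
    frames = int(minutes * 60 * FRAMES_PER_SECOND)
    if use_kmeans and (frames > KMEANS_THRESHOLD or force_kmeans):
        # Clustering keeps only the centers and discards the rest.
        return min(frames, KMEANS_CENTERS)
    return frames


def estimated_index_mb(minutes, use_kmeans=True, force_kmeans=False):
    """Estimate raw vector storage in megabytes (index overhead ignored)."""
    count = stored_vectors(minutes, use_kmeans, force_kmeans)
    return count * FEATURE_DIM * BYTES_PER_FLOAT / 1e6
```

Under these assumptions, a 40-minute dataset stays below the threshold and comes out at roughly 370 MB, a 4-hour dataset gets clustered down to roughly 31 MB, and the same 4 hours without clustering (`use_kmeans=False`) would be roughly 2.2 GB. That lines up with the 300 MB, 30 MB and ~2 GB figures in the conversation.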

verbal moon
#

Bro i have delay and my voice is very distorted, can anyone fix this?

red violet
#

Could anyone help me create an AI cover? I have the model and isolated vocals ready!

azure marshBOT
red violet
#

RTX3050 laptop

past ibex
#

lary over

low shard
# red violet RTX3050 laptop

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

  • Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
    • Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
    • Mainline: The original RVC
  • Cloud (remote good pc, easier and faster than ur PC but it's limited):
    • Ilaria RVC Zero: fastest and simplest that you can get for free
    • Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
    • Applio Colab: max 4 hours of GPU daily, not guaranteed

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

low shard
red violet
agile isle
#

hey i need a app to run my Models and i simply cant find any :c

trim sparrow
#

Is Applio broken? I get a linear graph when training on 20 minutes of audio (with Google Colab). Does anyone have a solution? Here are my parameters: sampling rate: 40k; pitch extraction: Crepe; batch size: 15; 500 epochs in total. I kept the batch size at 15 because it generally gives me better results. Hope you can help me. (The graph is loss/g/total.)

trim sparrow
#

Ohh ok thanks a lot !

narrow nova
#

Is RVC v2 Disconnected down?

tight frost
#

What is the issue?

#

pretrains from DMR razer doesnt work ;c

simple ore
#

513 - 32k model

mellow flax
#

sorry for tagging after so long, but where i can download these models? when i unfold vr or mdx these models are not there

viscid moss
#

and u also need UVR5 beta for that

mellow flax
quartz holly
mellow flax
#

Do you know where I can download them?

viscid moss
#

here's the doc

mellow flax
#

whoa, that's mate!

#

I appreciate it very much

viscid moss
#

ur welcome

tight frost
#

is it ok if my dataset is 1hour and 32 mins? XD

mellow flax
tight frost
mellow flax
#

time will tell

narrow nova
valid spruce
#

Help

  File "/usr/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrap
    self.run()
  File "/usr/lib/python3.10/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
  File "/content/Mangio-RVC-Fork/train_nsf_sim_cache_sid_load_pretrain.py", line 254, in run
    train_and_evaluate(
  File "/content/Mangio-RVC-Fork/train_nsf_sim_cache_sid_load_pretrain.py", line 438, in train_and_evaluate
    "slice/mel_org": utils.plot_spectrogram_to_numpy(y_mel[0].data.cpu().numpy()),
  File "/content/Mangio-RVC-Fork/train/utils.py", line 231, in plot_spectrogram_to_numpy
    data = np.fromstring(fig.canvas.tostring_rgb(), dtype=np.uint8, sep="")
AttributeError: 'FigureCanvasAgg' object has no attribute 'tostring_rgb'

pastel oak
valid spruce
#

What?, I used it earlier today and it was working perfectly!

#

I did the procedure I always do on all my models!

#

Could it be that I created a new account?

narrow nova
valid spruce
#

This just happened now @narrow nova

valid spruce
#

It always disconnects by itself and I end up losing everything

pastel oak
valid spruce
#

What is isnt?

#

Sorry I don't know English

pastel oak
#

forget it

#

You are using colab?

narrow nova
#

yes colab

valid spruce
pastel oak
#

Ok cant help wait for someone else

valid spruce
#

I create my models on my cell phone

pastel oak
#

But rvc disconnected & mangio rvc are really not recommended these days

#

i cant stop you from using it so goodluck

valid spruce
#

Unfortunately Applio is very unstable and always disconnects

#

@pastel oak Is there any version of Applio without UI?

cyan torrent
#

why do i need discord to use applio

#

@pastel oak

pastel oak
cyan torrent
sterile saddle
#

-colab

azure marshBOT
# sterile saddle -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

sterile saddle
#

-train

azure marshBOT
# sterile saddle -train
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

unique rock
#

Help!

simple ore
viscid moss
simple ore
#

or pip install matplotlib==3.7.0
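
For context on why the pin works: the traceback above fails on `fig.canvas.tostring_rgb()`, which newer Matplotlib releases no longer provide, so going back to an older Matplotlib brings it back. If pinning isn't an option, another route is to patch the plotting helper to read the canvas through `buffer_rgba()` instead. The sketch below is a guess at what such a patch could look like, not the fork's actual code. The figure size and `imshow` settings are illustrative assumptions.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend, as on Colab

import matplotlib.pyplot as plt
import numpy as np


def plot_spectrogram_to_numpy(spectrogram):
    """Render a 2-D spectrogram and return it as an (H, W, 3) uint8 image."""
    fig, ax = plt.subplots(figsize=(10, 2))
    image = ax.imshow(spectrogram, aspect="auto", origin="lower",
                      interpolation="none")
    fig.colorbar(image, ax=ax)
    fig.canvas.draw()
    # buffer_rgba() exposes the rendered canvas as RGBA pixels;
    # drop the alpha channel to get the RGB array the old call produced.
    rgba = np.asarray(fig.canvas.buffer_rgba())
    rgb = rgba[..., :3].copy()
    plt.close(fig)
    return rgb
```

The copy before closing the figure is deliberate, so the returned array doesn't keep pointing at the canvas buffer.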

unique rock
#

I train on a cell phone and it's a bit confusing. Can you explain the process to me in more detail?

viscid moss
unique rock
viscid moss
unique rock
mild sleet
#

Man when using weights the models are eh

lone nest
#

Is it used in the training step? i.e. will it affect the .pth file?

opal cobalt
mortal stirrup
#

can somebody help me i wanna train my voice in applio

carmine siren
#

-collab

#

-colab

azure marshBOT
# carmine siren -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

opal kelp
#

Could anyone help me? training does not start

upper furnace
#

Help

modest thunder
#

guys what is "epochs 300"?

azure marshBOT
opal kelp
upper furnace
upper furnace
opal kelp
#

-colab

azure marshBOT
# opal kelp -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

flint solar
knotty moth
knotty moth
# upper furnace

you two pls read the pinned message about matplotlib fix here, or you can try my tweaked version here: #1159290752195633273 message
but anyway I'd recommend the latest Applio 3.2.8 colab/kaggle with more features and improvements

hallow thistle
#

-gui

low shard
#

People never check pins

viscid moss
#

can u do it? till Kit Lemonfoot fix it

low shard
viscid moss
low shard
viscid moss
low shard
#

Bc I feel like telling them to upload the file will just make it harder for the average user

opal cobalt
viscid moss
low shard
#

I genuinely thought fewer people would be using RVC Disconnected since there's Kaggle

unique rock
low shard
low shard
unique rock
valid spruce
low shard
viscid moss
#

ye so that's strange

#

there are also other tunneling methods

low shard
#

Did you have that issue recently or heard it from others?

#

Because that used to happen before, but shouldn't anymore

valid spruce
#

I use my cell phone so that must be why

valid spruce
viscid moss
#

cause on phone u need to switch the browser tabs on UI ones

valid spruce
#

I usually leave the tabs like this

low shard
#

disallowed code execution? Inactivity?

valid spruce
valid spruce
#

When the Colab reconnected I had to reload my gradio

nova rock
#

yo can someone help me rq?

azure marshBOT
low shard
#

if it was for the UI, it wouldn’t automatically reconnect

nova rock
#

!howtoask'

#

!howtoask

patent trellisBOT
# nova rock !howtoask

How To Troubleshoot

GIVE CONTEXT. 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
BE POLITE.
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
BE PRODUCTIVE. 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
low shard
viscid moss
# unique rock Yeah

u face a warning message right? If so press cancel. Warning says "U need to restart or something"

#

And continue

nova rock
low shard
#

what error

#

You have to be specific

valid spruce
low shard
#

what pc gpu?

nova rock
#

like the voice changer isnt working, i can hear myself but the voice changer isnt applying

low shard
nova rock
#

oh mb

#

thanks

viscid moss
viscid moss
#

xd

uneven relic
#

Hi, I have a question: is the Hugging Face Pro plan not useful for using Ilaria_RVC?

unique rock
#

Now it comes out like this, without needing to put: !pip install matplotlib==3.7.0

low shard
unique rock
uneven relic
viscid moss
#

I saw ur ticket and it's kinda weird how ur Zero GPU time is full

uneven relic
viscid moss
#

and still can't use it

viscid moss
hallow thistle
#

Unless the Hugging Face system is just glitching out.

viscid moss
#

lemme try rq

#

Before...

hallow thistle
#

I realized 2 FFmpeg .exe files were missing from this repository, so I added them using ones from Ilaria RVC from GitHub. The first time launching on my laptop, it just picked up FFmpeg from C:\FFmpeg rather than from inside the Ilaria RVC folder itself.

uneven relic
reef tusk
#

does anyone have the links from cloud RVC?

hallow thistle
hallow thistle
viscid moss
azure marshBOT
# viscid moss -colab

Suggestions for @reef tusk

📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

reef tusk
#

what about kaggles

#

-kaggles

#

nope

viscid moss
#

-kaggle

azure marshBOT
# viscid moss -kaggle
📘 Kaggle Notebooks

Note: Kaggle limits GPU usage to 30 hours per week.

reef tusk
#

oh

hallow thistle
#

The Ilaria RVC version I'm talking about is the one that is used in its Hugging Face space, and I downloaded it in a dirty way.

uneven relic
viscid moss
#

per account if u re out of GPU just switch acc

uneven relic
viscid moss
#

in terms of inference time ye

hallow thistle
low shard
#

if you meant about having more time to use it, or uploading longer audio files, then yes

#

but zerogpu huggingface time is faster

#

unless its a cpu one

uneven relic
low shard
low shard
#

why don’t you do it locally on your pc instead?

#

huggingface space and google colab are meant only for people who got a bad pc, they are cloud computing services

uneven relic
low shard
#

locally, it will be harder to setup and won’t be as fast as huggingface for example, but you will have unlimited time

low shard
#

i would suggest you applio

uneven relic
low shard
meager lava
#

Does anyone know what this means and how to fix it?

2025-01-09 [E:onnxruntime:Default, cuda_call.cc:116 onnxruntime::CudaCall] CUDA failure 35: CUDA driver version is insufficient for CUDA runtime version ; GPU=1040 ; hostname=DESKTOP; file=D:\a_work\1\s\onnxruntime\core\providers\cuda\cuda_execution_provider_info.cc ; line=125 ; expr=cudaGetDeviceCount(&num_devices);

simple torrent
#

Yes you can, you just need a pre-compiled fork and VxKex to run it

simple torrent
latent kettle
#

Does anyone know which TTS is best?

simple ore
low shard
meager lava
low shard
#

Are you on windows? Which GPU exactly

ancient current
#

what does "AttributeError: 'NoneType' object has no attribute 'tobytes'" mean

#

and ""HTTP/1.1 500 Internal Server Error""

meager lava
#

i tried another version (std instead of cuda) and it works now

low shard
low shard
#

this is the wrong channel, RVC is not realtime voice changer, wokada is

wise cove
#

What's the difference between training model services and personal AI Voice model? ^^

worthy crest
#

what else do i get

#

do i need mmvcserver?

pastel oak
#

like what do you need what do you want

wise cove
worthy crest
#

but ye i haven't used this for some time, forgot what i needed to install

wise cove
#

Third one down I believe

worthy crest
#

i initially installed the other one yk

wise cove
#

They changed it a bit, but it's still technically the same thing, I assume

worthy crest
#

ye i installed it

worthy crest
#

where do i post a photo

#

can u tell me if my advanced settings are alr lol

pastel oak
pastel oak
#

youre talking in the wrong channel

worthy crest
sand bison
#

Hello, is there anyone who can help me with the error: FileNotFoundError: [Errno 2] No such file or directory: '/content/Mangio-RVC-Fork/logs/

pastel oak
#

I didnt even read the error I just read mangio rvc fork and assumed its the same error, though you might have to do it either way

#

it cant find the folder, so the folder doesnt exist. check again

sand bison
#

The folder is created and the index is created, what isn't created is the pth.

hallow void
#

is there a way to use realtime rvc gui on roblox?

#

like so people on voice chat could hear?

low shard
#

Old

#

Wokada deiteris fork is better

hallow void
#

is that realtime

low shard
#

And yes it can do that

#

Yes

hallow void
#

where can i download pls?

low shard
sand bison
#

my error is:
FileNotFoundError                         Traceback (most recent call last)
<ipython-input-18-5d26394eab49> in <cell line: 86>()
     86     set([name.split(".")[0] for name in os.listdir(gt_wavs_dir)])
     87     & set([name.split(".")[0] for name in os.listdir(feature_dir)])
---> 88     & set([name.split(".")[0] for name in os.listdir(f0_dir)])
     89     & set([name.split(".")[0] for name in os.listdir(f0nsf_dir)])
     90 )

FileNotFoundError: [Errno 2] No such file or directory: '/content/Mangio-RVC-Fork/logs
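
The traceback stops at `os.listdir(f0_dir)`, so at least one folder the filelist cell expects is missing under the experiment's `logs` directory. Here is a small sketch for checking which ones exist before running that cell. The subfolder names (`0_gt_wavs`, `3_feature768`, `2a_f0`, `2b-f0nsf`) are an assumption about the usual RVC v2 layout, and the experiment path is whatever your run uses, since the path in the error is cut off.

```python
import os

# Assumed RVC v2 preprocessing outputs; adjust if your fork names them differently.
EXPECTED_SUBDIRS = ("0_gt_wavs", "3_feature768", "2a_f0", "2b-f0nsf")


def missing_training_dirs(exp_dir, expected=EXPECTED_SUBDIRS):
    """Return the expected subfolders that are missing or empty in exp_dir."""
    problems = []
    for name in expected:
        path = os.path.join(exp_dir, name)
        if not os.path.isdir(path) or not os.listdir(path):
            problems.append(name)
    return problems
```

If this reports `2a_f0` or `2b-f0nsf`, the pitch extraction step most likely did not run or failed, and re-running it before the training cell is the first thing to try.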

hallow void
low shard
#

-rt

azure marshBOT
low shard
#

1st guide

hallow void
#

tysm

low shard
sand bison
#

I don't know.

low shard
sand bison
#

ok

#

This is the first time this error has occurred

sand bison
#

What is the latest version of rvc?

sudden tree
#

hey if i have raw studio session audio files but you can hear the backing instrumental through headphone leak what mvsep model would you use to clear that

#

also what settings in rx 11 to clear noise pre train

brazen sparrow
#

yo

#

can anyone help me with a good girl voice

latent kettle
maiden wadi
#

Hi, is it not possible to download rvc anymore?

hollow thunder
#

I keep getting this error, but everytime I restart the runtime it happens again, and it always disconnects before I get to do anything. I'm using Hina_Mod_AICoverGen_colab.

latent kettle
jaunty mason
#

anyone have sunoAI ?

lone nest
honest canyon
#

how can i make my own voice a model that i can use

low shard
honest canyon
low shard
brittle wing
#

What will happen if I include a few seconds of breathing in my dataset?

glacial pollen
knotty moth
#

the old guide that suggests removing breaths is kinda misleading

glacial pollen
#

^

brittle wing
glacial pollen
#

well, the deal is always balance but if we speak of misc such as breathing, there's no rule
same goes for laughs and speech tbh

brittle wing
#

Cause I want to include so many things also does KLM x4 support that?

glacial pollen
#

Key is balance so the model does not bias towards one and not the other

#

Not sure about KLM's behavior exactly as I do not maintain it ( aka, ain't sure what it does support and what not. I haven't tested it )

brittle wing
glacial pollen
#

It depends man, in an ideal scenario, you want to have equal balance for each subtype of content

#

we can't predict how things come out really

glacial pollen
#

well, laughing, breathing, moaning, spitting whatever really, is a subtype

#

I can only give you an example of how I'd manage it
5 mins of diverse talking, 5 mins rapping, 5-7 mins of singing, 1 min of breathing ( good quality )

brittle wing
glacial pollen
#

was just an example

#

by no means was it in " sequential manner "

#

not that it'd matter anyway, all gets chopped inside and mixed / shuffled

brittle wing
#

In my case some rap verses 4 vocal songs

brittle wing
#

Will the training support that

#

Deeper, higher

#

I don't mean pitch

glacial pollen
#

wdym deeper / higher if it's not pitch

brittle wing
#

5 minutes laughing, 10 speech?

glacial pollen
#

well no, that's a bad idea

brittle wing
glacial pollen
#

Laughing should be treated as misc, as an extra

#

not the primary purpose of the model

brittle wing
#

A person can go from higher and louder tones

glacial pollen
#

well ye, that is a pitch

brittle wing
#

And lower deeper

glacial pollen
#

" higher, lower, tone "
that's a pitch

brittle wing
#

Hmm yes will the training support that stuff as well

glacial pollen
#

It supports anything you throw in
as long as it's all balanced and there's enough of it ( not too little, not too much ) it'll do fine

brittle wing
#

And when it goes lower the voice will sound like the person's lower tone?

#

Deeper

glacial pollen
#

In ideal situation, yea

#

All comes down to how well you train it and how good is the set

brittle wing
#

Cause I'm thinking of making ICP covers the laugh is required

glacial pollen
#

No idea man, as I said, neither we nor anybody else in here can give you any estimates

glacial pollen
#

machine learning like this is not deterministic where you can estimate how much of X is needed

#

better if you just trained a few times and see how it goes in various set proportions

brittle wing
#

Yes but I wanna include the person laughing

brittle wing
#

And talking

glacial pollen
#

seriously man, people used to make damn chainsaw or water drop models

brittle wing
#

You say not too little, not too much so 5 mins of laughing?

#

And how many of talking

glacial pollen
#

no..
too little can be 1 min of laughing vs 10-15 mins of speech

#

you get the deal

#

if you overdo one type of content, model will bias towards it

brittle wing
glacial pollen
#

and potentially suck in the other

#

that's all I can say

brittle wing
#

I understand

glacial pollen
#

Yea so just try some proportions, do a few training runs and see how it goes

#

Making a good model needs some testing naturally

brittle wing
#

5 mins of laughing 10 of speech?

glacial pollen
#

I suppose you can try ^
if that fails, try 2-3 of laughing and 8-10 of speech
and if that failed too, 5 of laughing, 6-7 of speech

brittle wing
#

Because when I convert into RVC the laughs sound so unnatural like aehehdjdo

glacial pollen
#

Then you won't get your " perfect results "

glacial pollen
#

My best model was a matter of 2~ months worth of experiments

#

you can't toss all in, do a 1 run of training and expect miracles.. that's just the truth

brittle wing
glacial pollen
#

What's the problem of giving 40 mins to 1.5h of training

brittle wing
#

It's not my first time training either

glacial pollen
#

Well, then best of luck m8

#

I've said all I can to help ya

#

rest is up to you

brittle wing
#

Can I send you the dataset one day?

glacial pollen
#

for?

#

as in, y

#

Cause like, I don't really do datasets inspection
( Neither It'd help the case in any way. )

brittle wing
glacial pollen
#

All you gotta know is that I haven't made and don't maintain custom pretrains ( which may or may not support laughing or whatever )
the core concept of rvc is to handle speech or singing, that's the og premise. Whatever customs do is out of my reach.
Now, if you intend to perfect or nail experimental stuff, or things that are known to fail most of the time, you gotta put some time and effort in

brittle wing
#

And tell me what's unnecessary or not

glacial pollen
#

Well, I don't really think it's necessary
You said it's not your first time training, right? so you should know the deal

#

and yeah, I can't tell you what to include and what not, without experiments you can't really estimate anything or guess how the model'd perform

#

If you don't trust your skills or are short on time, just commission me or anyone else, see what'll come out of it
else give it some time and work, nail it and be happy, ye

#

no pain no gain as they say lol

brittle wing
#

Mhm I'll just send you

#

So you can tell if it's enough data

glacial pollen
#

Sorry man, that's not something I am up to

brittle wing
#

@tight ether hi does KLM x4 and 5.0 support laughs & speech

#

Or just rap and vocals

brittle wing
uneven relic
#

Hi, on Applio, can we not record vocal longer than 15 seconds? Every time I try to make a vocal longer than 15 seconds, this error pops up

simple ore
#

at least use Audacity

latent kettle
#

@simple ore I want best tts for English and hindi language. I want to use it locally. I don't want to use any website or any software like that. I need a fully open source project. Do you know any tool ?

simple ore
#

for english the usual recommendation - f5 tts, fish speech, gpt-sovits (needs training)

#

for hindi it is tricky, perhaps the fairseq vits from facebook

#

or you need to train the models above

brittle wing
#

its not opening i got python and shit

latent kettle
simple ore
#

not for fairseq

latent kettle
low shard
patent trellisBOT
# low shard !howtoask

How To Troubleshoot

GIVE CONTEXT. 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
BE POLITE.
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
BE PRODUCTIVE. 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
unique rock
#

Does RefineGan work on rvc disconnected?

unique rock
tame mica
#

what

unique rock
tame mica
#

you mean colab?

unique rock
tame mica
little pier
#

is there a Brazilian (Portuguese) tutorial for downloading the recorder?

rain solar
#

hey guys!

#

im very confused on how to make an ai model, i got the vocals and all those things

#

but idk if i have to download a program that uses gpu or alot of cpu

#

to train it

jaunty anvil
#

honestly i was actually wondering where you even put in the models

#

is it elevenlabs?

#

sorry that i'm not helping i'm just hoping someone knows

bold yarrow
#

you mean where you store them?

#

you store your models in huggingface

jaunty anvil
#

i just wanna know which plan is the best now

#

or

#

are you talking to him

bold yarrow
#

sorry?

jaunty anvil
#

the best plan for elevenlabs

bold yarrow
#

idk about elevenlabs

jaunty anvil
#

oh mb

bold yarrow
#

i mean i know it, i've used it

#

but i dont think its good for ai

jaunty anvil
#

hm

#

what else do you recommend then

bold yarrow
#

RVC

jaunty anvil
#

does that work with tts?

bold yarrow
#

yeah but the tts sounds kinda robotic

jaunty anvil
#

yeah i'm mostly looking for tts

bold yarrow
#

doesnt elevenlabs have free models that you can use?

#

generic ones

jaunty anvil
#

yeah

#

but i mean custom tts

bold yarrow
#

oh

#

custom tts, no i dont think so

#

many custom tts i've seen sound like throat cancer

jaunty anvil
#

rip

bold yarrow
#

yeah, who knows what the guy behind 15.ai is doing

#

its been offline for like a year now

jaunty anvil
#

honestly i'm just trying to find a good tts

bold yarrow
#

there arent many in the field if im gonna be honest

#

the most you can do

#

is use elevenlabs to generate generic speech

#

and then use RVC like ilaria rvc

jaunty anvil
#

hm

#

alright

bold yarrow
#

to turn it into a character you wish it to be

low shard
# jaunty anvil yeah i'm mostly looking for tts

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: RVC isn't as good as GPT-SoVITS for TTS, but GPT-SoVITS (few-shot TTS, meaning models need just a little training) can't use RVC models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check out GPT-SoVITS instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS

FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site

With RVC Models:

RVC is natively for Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS. They use Microsoft Edge TTS to generate the TTS audio and then convert it with RVC, so I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use.

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

  • You can get Applio in our docs
  • While Ilaria RVC Mainline here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

  • Ilaria RVC Zero (running on an A100 GPU, the fastest free RVC on cloud) and the guide

  • Use Applio UI Colab (with google colab T4 free daily limit gpu)

  • if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc

jaunty anvil
#

thanks for helping

low shard
#

this might help

bold yarrow
#

nvm

#

this guy destroyed every argument i had

jaunty anvil
#

lmao

low shard
#

except that it's paid

bold yarrow
#

11labs is good tts

#

but its not good at imitating

jaunty anvil
#

nah i've got the money

bold yarrow
#

it makes the voice sound a bit different

#

and you cant even control which accent it develops

jaunty anvil
#

it's nice hearing all of the sides

#

pros and cons

low shard
low shard
jaunty anvil
#

alright

#

so i'm guessing you can't tell me which plan is best either?

#

...sorry that came off rude

low shard
#

I don't want to give you bad info ofc

bold yarrow
#

hey nick

#

is rvc disconnected still available for training?

low shard
low shard
bold yarrow
#

alr cool

#

im trying to make a heavy 2007 ai model

#

but idk what to do with steps and epochs

#

epochs im guessing i can figure out using tensorboard, but ive never used it

#

and as for steps, idk what they are
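
For reference, a step is usually one batch processed by the trainer, and an epoch is one full pass over the dataset. A rough sketch of how the two relate, assuming every dataset segment is used once per epoch (the exact count in a given RVC fork can differ, for example if it groups segments by length or drops some); the function names are made up for illustration:

```python
import math


def steps_per_epoch(num_segments, batch_size):
    """Number of batches needed to see every segment once."""
    return math.ceil(num_segments / batch_size)


def total_steps(num_segments, batch_size, epochs):
    """Step count that a TensorBoard x-axis would roughly reach after `epochs`."""
    return steps_per_epoch(num_segments, batch_size) * epochs


# Example: 500 sliced segments with batch size 8 trained for 150 epochs.
print(steps_per_epoch(500, 8))   # 63
print(total_steps(500, 8, 150))  # 9450
```

So steps are not a separate setting to tune: they follow from the dataset size, the batch size and the number of epochs.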

low shard
bold yarrow
#

i thought there was

#

im trying to make my model not undertrained but not overtrained either

low shard
low shard
#

btw don't share datasets here

bold yarrow
#

ok

bold yarrow
#

copyright?

low shard
brittle wing
#

Should I load dataset in rvc Disconnected when resuming?

austere stream
#

question, is there a guide out there that suggests a general total training time/epochs for a dataset length? like "if you have a 20 minute dataset, train for x minutes/epochs"

unique rock
#

To retrain, skip all the cells inside the "preprocessing" section and go straight to the train one.

worn breach
#

whats a website that has real time voice changers?

simple ore
turbid tiger
#

Is there any RVC Text to speech program out there that doesn't require a ton of github usage and python stuff?

#

I can't always afford to talk into it.

hallow thistle
#

And also Ilaria RVC on Hugging Face can do this.

turbid tiger
#

Thank you for being so quick, gah damn.

hallow thistle
#

Voice.ai also has a realtime voice changer service, but it's a real scam and using it is never advised.

bitter kite
#

Which voice model should I choose to insert the .json file?

hallow thistle
#

If it's some kind of RVC fork program that uses .json file for a voice model to display its information, I don't know. But Applio doesn't seem to do that.

bitter kite
#

I wish I could upload a photo

hallow thistle
#

If it's W-Okada, the only .json file found inside a voice model folder is params.json.

hallow thistle
knotty moth
turbid tiger
hallow thistle
turbid tiger
#

It loaded eventually, cmds are weird sometimes.

hallow thistle
#

Can you screenshot that?

turbid tiger
#

There's nothing to screenshot, it loaded.

hallow thistle
#

Really? That's fine.

turbid tiger
#

Where do you put weights/logs for this btw

turbid tiger
#

ah, aight

radiant quartz
#

o/

i was on my way to train a model using google colab, but two combined datasets have 12 and 15 minutes. however, when i'm training using "rvc v2 disconnected", it keeps on writing

TF-TRT Warning: Could not find TensorRT

what is the recommended setting for training a good model?

(can't send the dataset in question because copyrekt moment. total of these voicelines when separated are 500 files)

carmine siren
#

-colab

azure marshBOT
# carmine siren -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

analog onyx
#

if my audio stops working

#

does that mean i got the wrong file?

rustic slate
#

how to configure work for a video card?

hallow thistle
rustic slate
#

1 min

hallow thistle
#

Sorry, but I don't accept direct messages from random users unless it's something private. To talk about W-Okada, go to #🔍│help-w-okada.

rustic slate
#

ok ok

carmine siren
#

i am getting this error while making cover

carmine siren
rough plinth
#
2025-01-11 10:58:57.4020271 [E:onnxruntime:, sequential_executor.cc:514 onnxruntime::ExecuteKernel] Non-zero status code returned while running Pad node. Name:'/rmvpe/mel_extractor/Pad' Status Message: CUDA error cudaErrorNoKernelImageForDevice:no kernel image is available for execution on the device

have solution?

quasi ginkgo
#

How can i use these models for text to speech locally?

quasi ginkgo
#

Okay thank you!

#

can i get the github link

hallow thistle
azure marshBOT
quasi ginkgo
#

great thank you!

brittle wing
#

@tight ether I keep getting errors when trying to train with KLM x4

brittle wing
#

Also I can't resume

#

Why

tight ether
#

does the error occur during training?

brittle wing
tight ether
#

It seems like it might not recognize multiple speakers... I haven't tested it on disconnected, so I'm not sure.

quasi ginkgo
brittle wing
#

Yesterday worked but I couldn't resume

tight ether
#

huh

brittle wing
#

I should use Applio or what

hallow thistle
#

Ckpt and pth files? RVC voice models only use pth and index files.

tight ether
quasi ginkgo
#

Idk i am using applio for first time

#

so where i do place the files

brittle wing
hallow thistle
#

I'm not sure about RVC Disconnected, but "RVC GUI" fork is indeed outdated.

tight ether
#

not really, but we can't know all the updates the developers made, so it’s safer to use a fork that has already been tested.

quasi ginkgo
#

okay thank you!

tight ether
#

x4 was trained with the latest ver of applio so.. yeah

brittle wing
tight ether
azure marshBOT
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
brittle wing
#

-guides

azure marshBOT
brittle wing
#

How do I give the dataset path?

brittle wing
tight ether
brittle wing
tight ether
#

colab?

brittle wing
#

Yes

tight ether
#

I've never used colab before.

brittle wing
#

Uhhh

knotty moth
# brittle wing

you should train with either og pretrain or KLM using applio

brittle wing
knotty moth
brittle wing
#

I still haven't started training

#

Also I asked how do I set the correct dataset path

knotty moth
brittle wing
knotty moth
brittle wing
#

Can you link the guide

#

Also how do I set the right dataset path

radiant quartz
#

i have read the guide but rvc v2 disconnected does not work, and it's not because it exceeded the time limit

brittle wing
#

-colab

azure marshBOT
# brittle wing -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

radiant quartz
#

(well, dataset path was even right but seems like it doesn't work.)

#

i was lost about how many good epochs to train for that large dataset

#

12 and 15 minute dataset; then it's 28 minutes in total

simple ore
#

aka not running feature extraction

#

@brittle wing make sure you're not skipping steps

brittle wing
# simple ore it is a skill problem

Hello, how do I import the right dataset path on Applio colab? Also how do I resume training and ... where do I input the pretrain links on Applio Colab?

brittle wing
#

Yesterday It worked idk what happened now

radiant quartz
#

🫠
tensorboard looks broken af

lilac raptor
#

can somebody help me out? Am trying to troll my friend that am a girl in an alt account , but he doesnt believe me until i say "hi im evelin and this is my voice" , can somebody help me out with an AI girl voice saying that 💀

simple ore
#

if you trained with something else, you are breaking new horizons then

#

but dont worry, it is simple

brittle wing
simple ore
#

prepare dataset, extract features, then put d/g weights from rvc disconnected into the model's folder

brittle wing
simple ore
#

applio colab or what?

#

ui/no ui?

#

pretrain is a starting set of weights, if you are resuming training, it is no longer needed - resume starts with D_xxxx.pth/G_xxxx.pth
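
A minimal sketch of what "resume starts with D_xxxx.pth/G_xxxx.pth" means in practice, assuming the checkpoints sit in the model's folder and are named `G_<number>.pth` / `D_<number>.pth`. The helper `latest_checkpoint` is made up for illustration and is not part of Applio or RVC Disconnected:

```python
import re
from pathlib import Path


def latest_checkpoint(model_dir, prefix):
    """Return the highest-numbered <prefix>_<number>.pth in model_dir, or None."""
    pattern = re.compile(rf"^{re.escape(prefix)}_(\d+)\.pth$")
    best = None  # (number, path) of the newest checkpoint seen so far
    for path in Path(model_dir).iterdir():
        match = pattern.match(path.name)
        if match:
            number = int(match.group(1))
            if best is None or number > best[0]:
                best = (number, path)
    return best[1] if best else None
```

If both `latest_checkpoint(folder, "G")` and `latest_checkpoint(folder, "D")` return a file, there is something to resume from and the pretrain is not needed; if either returns `None`, training would have to start from the pretrain again.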

brittle wing
#

Is this already overtrained I can't believe it

#

I mean

#

12 minutes data w KLM 4.3 x4 Pretrain

#

The other graphs decreased idk

#

What's this

radiant quartz
brittle wing
radiant quartz
#

nice, thanks

#

i think i'll spend the whole night finding my way to train a model with that large set

radiant quartz
#

😭

#

but tensorboard says it's all blank

#

censored the name because it's for my oc/private purposes

hallow thistle
radiant quartz
#

it's fine

tame mica
radiant quartz
#

(i don't have time then i have 1 hr and 40 minutes)

#

i'll make a help forum post for this tomorrow

hallow thistle
radiant quartz
#

rounding off to the nearest is 44000

brittle wing
#

Overtraining?

radiant quartz
#

praying for my model tho

#

i have 500 files of dataset

#

then i separated it into 250

radiant quartz
odd shale
#

Have you cleaned it properly?

brittle wing
radiant quartz
brittle wing
radiant quartz
radiant quartz
odd shale
brittle wing
#

Hi I was just asking if my model is finished

radiant quartz
odd shale
#

With a bit of luck the model may come out decent anyway if these noises and sfx aren't too present in the dataset

radiant quartz
#

how many epochs for 12 to 15 minutes?

#

i asked for a friend, then he told me like 450-500 epochs

odd shale
#

That's wrong.

brittle wing
knotty moth
brittle wing
#

Sir, I'm sorry to disturb but

brittle wing
odd shale
brittle wing
odd shale
#

🐢

brittle wing
radiant quartz
#

really?

odd shale
#

All will depend on the g/total graph

radiant quartz
#

i did 128 instead of 64 in crepe hop length

knotty moth
odd shale
radiant quartz
brittle wing
knotty moth
radiant quartz
#

oh

analog obsidian
tame mica
analog obsidian
#

but has 0 effects if you used rmvpe

odd shale
brittle wing
radiant quartz
brittle wing
#

Nah I used a pretrain

tame mica
brittle wing
odd shale
brittle wing
#

I'm asking if the model is finished or not

#

The noise doesn't interrupt training

odd shale
brittle wing
#

Also there's almost no noise

analog obsidian
brittle wing
radiant quartz
brittle wing
#

Wdym

odd shale
tame mica
brittle wing
odd shale
analog obsidian
odd shale
analog obsidian
#

if you dont want to clean the dataset use rmvpe for training

radiant quartz
brittle wing
analog obsidian
odd shale
analog obsidian
#

tho one thing worth mentioning is that if you really want the model to sound more like the dataset you can slightly increase it, but this comes with the problem of the model having weird pronunciation

#

bc it loses generalization

brittle wing
radiant quartz
#

i need to finalize the pitch extraction tho

#

if i'm new to training models, without cleaning it (like only the dataset.zip placed aside experiment folder), can i use rmvpe?

analog obsidian
#

Safest options

radiant quartz
#

how many recommended crepe hop length (scrolling back, is it 64?)

odd shale
analog obsidian
#

only affects crepe

radiant quartz
#

i thought it's mangio crepe

brittle wing
odd shale
analog obsidian
knotty moth
brittle wing
#

Then why is the line going like...

analog obsidian
#

If u want to use mangio set a hop length of 64

brittle wing
#

Look

#

Maybe after 150 epochs it will overtrain

radiant quartz
#

150 epochs in my experience: 20% sloppy
450 in my opinion: 95% accurate since it's completely becoming human because of pronunciation

radiant quartz
#

might train tomorrow, thank u all for the advice

odd shale
#

So it will be always random.

radiant quartz
#

current save frequency rn is 25 so i'm unsure

#

can i reduce it to 10 or smth

rustic arch
brittle wing
#

Started going up after 5k!!!

#

170 epochs the end?

analog obsidian
#

i usually wait 1 hour to confirm it stopped improving

#

overtraining is pretty obvious when you see it in a graph
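
A rough way to do the same check on numbers instead of by eye, assuming you have exported the g/total loss values and their steps from TensorBoard into two lists. The function names and the smoothing factor are made up for illustration; this is a sketch of the idea, not how any RVC fork decides when to stop:

```python
def ema(values, alpha=0.1):
    """Exponential moving average; a smaller alpha smooths more heavily."""
    smoothed, prev = [], None
    for value in values:
        prev = value if prev is None else alpha * value + (1 - alpha) * prev
        smoothed.append(prev)
    return smoothed


def lowest_point(steps, losses, alpha=0.1):
    """Return (step, smoothed_loss) where the smoothed curve is lowest."""
    smoothed = ema(losses, alpha)
    index = min(range(len(smoothed)), key=smoothed.__getitem__)
    return steps[index], smoothed[index]
```

If the lowest point stays at the same step while training keeps running for a good while longer, the curve has stopped improving, and the checkpoint saved closest to that step is the one worth testing first.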

brittle wing
analog obsidian
brittle wing
#

I tested it at 150 and it already sounds like the person

analog obsidian
brittle wing
#

It already looks like this...

#

After 170-180

#

I downloaded the 170e weight

analog obsidian
#

if ur happy with the result then stop the training

obsidian niche
#

yo guys, im new to this, whats the issue with this where it keeps glitching?

brittle wing
obsidian niche
#

im using rvc

low shard
analog obsidian
#

don't inference audio with harmonies

obsidian niche
#

i want to get minecraft steve to sing Die With A Smile

#

There's a lot of harmonies (i think thats what it is) in that song

analog obsidian
obsidian niche
sand path
#

whats the best sample rate between 32, 40 and 48k for the most accurate result with a voice changer, using rvc v2 and a dataset of about 35 mins? (my ai knowledge is from about 1 year ago or more) also i use the Mangio-RVC-Fork, so i dont know if its putting me at a disadvantage.

#

and should i turn pitch guidance off or on if im using a talking dataset

brittle wing
craggy brook
#

How can I easily find the voice dialogues used by a game character? In tones and emotions

brittle wing
#

Overtraining?

crude flame
brittle wing
#

Thank you for finally accepting me

#

It's a short dataset

crude flame
brittle wing
#

1:30 minutes w titan pretrain

#

Yeah but the line is going like...collapse

crude flame
brittle wing
brittle wing
crude flame
#

Do you have the avg graphs?

#

Is this applio or codenames fork?

brittle wing
brittle wing