#✨│ai-help

1 messages · Page 219 of 1

knotty moth
#

-gui

median island
#

Ah

#

What should I be using?

median island
#

I was suggested to use this, but I don't know what else I should be using

knotty moth
karmic oliveBOT
median island
#

Thx

median island
#

I'm now on the new one, but I'm getting this error and I have no idea what this could mean
AttributeError: 'NoneType' object has no attribute 'dtype'

median island
#

If anyone could help me with resolving the error I'd be grateful

simple ore
#

full error message

median island
#

if data.dtype in [np.float64, np.float32, np.float16]:
AttributeError: 'NoneType' object has no attribute 'dtype'

#

Is that good?

simple ore
#

i mean the full error stack

median island
#

I'm slow at this 😭 what do you mean?

#

Sorry

simple ore
#

the entire error message that shows the entire chain of the failing modules

median island
#

I really only see that as the error area, but I could send a ss rq

simple ore
#

your conversion ran out of memory, then it failed to process the empty output

median island
#

Oh shoot

#

What

#

Hold on

#

I swear I already solved that

hallow thistle
#

Are you sure your RAM is large enough to handle that?

knotty moth
# median island

depending on your ram capacity, try processing a shorter file, or split it first

primal tree
#

Hey guys! I'm looking to find/create an rvc voice that is free to commercially use. Idc if I have to collect massive sets of text-to-speech voice lines in order to create the rvc, I just want to know what options are out there.

knotty moth
simple ore
primal tree
analog obsidian
knotty moth
analog obsidian
#

it's safer to pay some random fiverr vas

primal tree
#

icic

#

thank you

primal tree
analog obsidian
#

good luck

#

🦈 🤙

primal tree
#

thank you.

knotty moth
primal tree
#

I understand

#

Like I said, I just know there are options out there like voicevox or elevenlabs voice to voice stuff

crude flame
primal tree
#

was wondering if anyone else had other options

analog obsidian
analog obsidian
knotty moth
analog obsidian
#

yeah

analog obsidian
cedar hazel
#

yo

#

my error is VB cable not working for me

hallow thistle
hallow thistle
#

That's why I told him to go there.

knotty moth
#

posting the same question in different channels confuses ppl tho

hollow thunder
#

Is the Hina mod colab not working anymore?

hallow thistle
hollow thunder
#

I'm trying to make covers, not ai voices

hallow thistle
#

Has Hina ever made the "RVC" Colab? Never heard of this information. I've only heard of Hina making W-Okada Colab notebook.

#

Still. If you mean by RVC that can do AI cover, there's another better one available.

hollow thunder
#

Which is what RVC

hallow thistle
#

What is your PC GPU?

hollow thunder
#

I'm using my phone

#

I always have

hallow thistle
#

What does that mean? You only have a phone and no PC?

hollow thunder
#

I have a laptop

#

No pc

hallow thistle
#

Laptop is still a PC.

hollow thunder
#

Oh, thought they were considered different

knotty moth
hallow thistle
#

Using a Colab notebook on mobile phone is much harder than using it on desktop/laptop PC, you know.

hollow thunder
#

I know

#

But most of the time I'm not at home, and when I am I pass out from exhaustion

#

So I multi task and do what I can on my phone, so I was using hina mod

#

Since it was easier, but it's not working now

low shard
#

Which are you talking about? Link?

#

And what's your PC GPU?

#

And what do you want to do?

hallow thistle
#

He said he has a laptop, aside from using phone, but never tell the name of GPU of it.

hollow thunder
knotty moth
#

or the laptop package with spec details

low shard
#

always check that channel

low shard
#

Cloud )remote good pc) services like google colab should be the 2nd option only if you got a bad pc, not the first

broken tulip
#

Now that I messed around with the voices a bit, how do I train my own voice?

low shard
broken tulip
low shard
low shard
#

i personally suggest applio

knotty moth
low shard
solar token
#

Yo can anyone send me the link to download RVC?!?

#

I cant find it

hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
solar token
hallow thistle
solar token
hallow thistle
#

You can run RVC with only CPU, but it will be hella slow.

solar token
#

sadspongebob welp its over

solar token
knotty moth
solar token
#

Didnt know just having to change your voice could be so demanding 😔

hallow thistle
solar token
hallow thistle
#

It is best RVC program out there.

solar token
#

it'll probably lag

#

but ill check it out

hallow thistle
#

For better performance, I'd suggest to run on a cloud website instead.

knotty moth
broken tulip
low shard
primal tree
#

Hey guys! I made a post about this on their Reddit, but I never got a response. Wondering if anyone here would have an answer.

I was wondering if it is within elevenlabs terms of service to record a large set of audio from a voice I made on their site, to then turn it into a RVC model that I can use in real time with a program like w-okada. Other than having to stay subscribed, would this go against their TOS at all?

(also sorry if this is confusing at all, I'm still very new to AI voices and am unsure if I'm using the correct terms here lol)

simple ore
#

if you're the one who only crosses the road at a designated pedestrian crossing, then violating ToS is very bad

#

same as using ablock on youtube

#

/s

primal tree
#

Haha I appreciate that.

#

Didn’t think about using a synthetic voice and how that might effect tho

analog obsidian
analog obsidian
#

but ye, converting it to a rvc model might be against their tos, and even if isnt, the rvc results will be worse actually

jaunty shale
#

I love how I do my recording with my new mic with 48k sample rate and voice ends up being pixely as always.

tiny idol
#

i need help

#

how i can upload a voice

#

idk if i did somethiong wrong

low shard
#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

hollow thunder
#

Intel(R) Iris(R) Xe Graphics

#

@low shard

#

Is my gpu

low shard
#

you got any other gpu?

hollow thunder
#

I told you I have a laptop, not a desktop

hollow thunder
low shard
#

you can use RVC AI Cover Maker UI instead

small timber
#

What is the best progeram to clone voice to then use in Applio?

tiny idol
#

this one good ?

low shard
tiny idol
#

yh i forgot

blazing solar
#

-colab

karmic oliveBOT
# blazing solar -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

patent trellisBOT
# blazing solar -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

shadow sparrow
#

anyone else getting the problem with applio google colab right now? everytime i try to click on the public url, it just takes me to a website error

blazing solar
#

I was just enjoying it an hour ago

#

But then erorr

shadow sparrow
#

huh, thats weird

blazing solar
#

Ik right..

#

I dont know how to use the rest either

#

-rvc

karmic oliveBOT
blazing solar
#

-audio

karmic oliveBOT
white bough
#

The problem with Colab seems to be coming from pydantic. Before clicking on "Start Applio", run this:

!pip install pydantic==2.10.6

And it should work

rain mulch
#

most models I put sound really robotic... I tried so many models, even adjusted my settings over and over again

#

but nothing

#

at this point... I think its just my voice

#

I use the w-okada fork voice changer

knotty moth
viral mason
#

got this error in applio kaggle

#

prob gonna make a new acc on kaggle to test

simple ore
#

make a new cell

#

!pip install pydantic==2.10.6

viral mason
#

idk how to do all that

viral mason
#

can the kaggle space be updated so it doesn't do that, literally didn't do that yesterday

#

als remove the old bugfix code

abstract gulch
#

are hugging face links still currently used by users or is there a newer way to?

knotty moth
abstract gulch
hasty idol
#

im available to commission for a voice model or smth, need it for my vtuber project

#

where do i look for artists?

simple ore
#

welcome to python development

viral mason
#

Sure, is it possible to fix the error or should I just wait?

#

dms ok?

simple ore
#

as I said, you make a new code cell using +Code button

#

paste the pip install command, then run the cell

viral mason
simple ore
viral mason
#

which cell do I put that before

simple ore
#

the one that +Code creates

viral mason
simple ore
#

Ive provided all the require information for you to do it

unique rock
#

applio is working now?

knotty moth
unique rock
simple ore
#

create what

blazing solar
unique rock
blazing solar
# simple ore

I alr follow the instruction but nothing seem to work

#

It says TypeError: argument of type 'bool' is not iterable

simple ore
blazing solar
prisma kettle
#

-realtime

karmic oliveBOT
prisma kettle
#

Do I need to set cable as default for realtime? or no?

#

nothing is coming thru cable atm

simple ore
#

use different virtual cable

#

no need to set defaut, you pick the device in the app

prisma kettle
#

Glad to be rid of it so soon again. What cable should I be using?

#

Nvm I see which one to use

#

They need to update the guide. I was wary of using VB because it’s been pretty shitty in the past

hallow thistle
prisma kettle
#

I'm not using Okada

#

Do I need to restart PC after installing muzychenko virtual cable

#

I'm not getting any voice through it rn

hallow thistle
#

Do you wanna use the virtual cable with RVC or something?

prisma kettle
#

Yea, realtime rvc

#

I have the gui

#

I'm just getting no sound thru it

#

wait

#

im dumb

hallow thistle
#

Realtime mode of old RVC is so far outdated than W-Okada.

prisma kettle
#

oh... haha

#

I just followed the guide... ill get okada tomorrow...

hallow thistle
#

W-Okada is literally the realtime voice changer.

prisma kettle
#

shit

#

ill get it tonight fuck it

hallow thistle
prisma kettle
#

the guide didnt say that!!!

analog obsidian
#

rvc realtime gui is pretty outdated

hallow thistle
#

You were too confident in yourself on this one.

prisma kettle
#

Cut me some slack...

analog obsidian
#

anyways thats wokada

prisma kettle
#

this is all I got.

hallow thistle
#

There's a better one available. That's why I'm telling you to go to #🔍│help-w-okada for the realtime voice changer.

prisma kettle
#

I will, thank you

#

ooh... where did my virtual cable install

#

found it

signal mauve
#

how do i test out my model using sample?

verbal oasis
#

hi guys any alternative to applio to generate text to speech and lbe able to download and load custom models from huggingface?

#

is applio not working also for you on colab at the moment?

jaunty shale
#

what settings are recommended for FL Studio Export for dataset (I used it to fix harsh frequencies).

jaunty shale
#

dunno which sample rate to use

low shard
# verbal oasis hi guys any alternative to applio to generate text to speech and lbe able to dow...

There are different Text To Speech (TTS) AIs:

GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium 11labs: Easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS

FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site

You can check TTS in our tts index

With RVC Models:

RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

  • You can get Applio in our docs
  • While Ilaria RVC Mainline here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

low shard
flint solar
low shard
#

@simple ore another day, another colab broken (applio)

#

Google colab breaks soooo much

simple ore
#

UI colab?

#

the fix I gave yesterday should work

low shard
flint solar
simple ore
#

yes

#

err... I guess it needs to be !uv pip install pydantic==2.10.6

#

since that was changed to uv

flint solar
#

U should ping dev updates do that everyone sees it

unique rock
simple ore
#

reread

verbal oasis
simple ore
#

for UI colab, yes

verbal oasis
simple ore
#

yes

low shard
verbal oasis
#

working again

#

THANKZZZZ

unique rock
#

it's no use to me

simple ore
#

show the full output

#

of the new cell

#

i've tested it and it works

orchid flint
#

This isn’t a question about RVC so much as it is about the weights bot. All of the models that I have uploaded to the server have never been credited to me on weights. Am I doing something wrong? I have no idea why or what to do.

unique rock
brave garnet
#

I need to help, I have been using cover AI for 1 month and now when I use it again, it is a mistake

pastel oak
hallow thistle
#

!howtoask

patent trellisBOT
# hallow thistle !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
hallow thistle
#

That still doesn't give enough context of the problem you encountered.

brave garnet
#

Sorry but my laptop can't send pictures, does it matter if I send it to you separately

hallow thistle
#

No need to hop into my direct message to send that. A mod here will give you the image permission.

brave garnet
#

Can I write?

hallow thistle
#

Can you tell me the name of RVC program name you're trying to use?

brave garnet
#

the rvc i use is Hina_Mod_AICoverGen_colab.ipynb

#

The error reads as follows: "Traceback (most recent call last):
File "/content/Hina_RVC/src/webui.py", line 8, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'"

hallow thistle
#

This one is guite too abandoned at this point. What is your PC GPU?

brave garnet
#

GPU is my pc or gg colab's?

hallow thistle
#

GPU = Graphics processing unit. It is a graphic card in your PC.

brave garnet
#

ohh ok

#

wait a 1 minute

hallow thistle
#

To check your GPU name, open Task Manager, go to Performance tab, and spot where GPU 0 or GPU 1 is in the left side.

brave garnet
#

My GPU is 0

hallow thistle
brave garnet
#

It's called Intell HD Graphic 5500

#

isn't it suitable for RVC anymore?

low shard
hallow thistle
low shard
#

Check what I told u

brave garnet
#

yes, I know my GPU is not suitable so I use online data for example gg colab

brave garnet
brave garnet
#

I really like AI training but the AI ​​training website I often use is gone. Is there any solution for the server?

low shard
# brave garnet I really like AI training but the AI ​​training website I often use is gone. Is ...

Train (make) RVC Models on cloud:

  1. Prepare the Dataset
  2. Setup RVC:
    Choose a cloud way to use RVC,
  • Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
  • Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
  • Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
  1. Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.com/ which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

brave garnet
#

wow

#

I struggled all day and when I entered this server everything was easy, thank Nick088 by Weights and the server very much

mystic tangle
#

how many epoch should i do if i have 55 minutes of talking no silence only words and no background noise

low shard
#

Use the tensorboard

mystic tangle
#

wheres that

low shard
#

I hope not YouTube tut

mystic tangle
#

local mainline

#

the one i found in guide channel

#

first one

#

it doesnt mention how many epochs i should do for my dataset

low shard
mystic tangle
#

a link to the guide im using?

#

wdym

#

my gpu is a rtx 3060 kinda buns

hallow thistle
mystic tangle
#

yea im trying to train my own

#

but im wondering how much epochs i should use for my dataset

#

i dont wanna overtrain it or anything

low shard
mystic tangle
#

alr

#

this one

low shard
#

Also I would suggest Applio personally

mystic tangle
#

is it better

#

or just more simple

hallow thistle
amber skiff
#

Starting preprocess with 16 processes...
100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:38<00:00, 38.59s/it]
Preprocess completed in 38.60 seconds on 00:23:34 seconds of audio.
Starting pitch extraction with 16 cores on cuda:0 using rmvpe...
0%| | 0/1 [00:00<?, ?it/s]An error occurred extracting file C:\Applio\logs\tep piseth AI\sliced_audios_16k\0_0_0.wav on cuda:0: cuDNN error: CUDNN_STATUS_NOT_SUPPORTED. This error may appear if you passed in a non-contiguous input.
100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:24<00:00, 24.75s/it]
Pitch extraction completed in 31.58 seconds.
Starting embedding extraction with 16 cores on cuda:0...
100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:03<00:00, 3.52s/it]
Embedding extraction completed in 10.07 seconds.
Starting training...
Loaded pretrained (G) 'rvc/models/pretraineds/pretrained_v2/f0G40k.pth'
Loaded pretrained (D) 'rvc/models/pretraineds/pretrained_v2/f0D40k.pth'
Process Process-1:
Traceback (most recent call last):
File "C:\Applio\env\lib\multiprocessing\process.py", line 314, in _bootstrap
self.run()
File "C:\Applio\env\lib\multiprocessing\process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "C:\Applio\rvc\train\train.py", line 497, in run
reference,
UnboundLocalError: local variable 'reference' referenced before assignment
An error occurred extracting the index: need at least one array to concatenate
If you are running this code in a virtual environment, make sure you have enough GPU available to generate the Index file.

@low shard

low shard
mystic tangle
#

okay thanks 👍

amber skiff
amber skiff
amber skiff
#

yessir the latest one

#

Applio 3.2.8

hallow thistle
#

Have you checked if your GPU driver is up to date?

low shard
mystic tangle
#

hi how do i make my model show up in tensor board

#

do i have click train model then it will show up?

low shard
hallow thistle
mystic tangle
#

no like

#

how do i get a pth file

#

do i have to start training for tensor board to work

#

ik where tensorboard is but theres nothing here when i train will something appear?

#

i already have a dataset and i processed it do i just train now

#

then it will appear in tensorboard?

knotty moth
mystic tangle
#

oh okay

#

thanks

#

doe sit matter what i have the epoch at

#

i left it at 500

#

how many epoch should i let it run for

knotty moth
#

they should also appear in the logs folder

tulip cloak
mystic tangle
#

oh my bad i was just finishing up on this

mystic tangle
hallow thistle
#

This is the TensorBoard.

mystic tangle
#

What does this mean

#

i have the smoothing at 0.999

fast scarab
#

hi i got an error with codename rvc and i have no idea what to do 😭
it says:
TypeError: argument of type 'bool' is not iterable
and also
unable to find files for the specified model(s)

rain kelp
#

So, I know this isn't exactly an RVC issue, but I've recently decided to try out "f5tts" model in the AllTalk web-ui, and uh... the actual voice just generates gibberish. Like, it sound as if it's speaking in reverse at an increased speed or something. Any idea what's going on there?

simple ore
#

f5tts does not support long texts

rain kelp
#

Well, just a short sentence produces the same result.

unique rock
frail umbra
#

mmm my rvc webui doesnt even attempt to make an index

#

it just spits out a couple of these

simple ore
unique rock
frail umbra
#

im trying applio now, looks betta

cinder wyvern
#

does the applio gradio google colab not work anymore?

#

every time i try to run it i get an error

cinder wyvern
cinder wyvern
# unique rock aid

click 'show code' under the install applio section, and then add !uv pip install Pydantic==2.10.6 after all the other pip install lines, then run the install again

#

then when you run the start applio section it should give you the gradio links without errors

mystic tangle
#

i trained till 450 epoch should i keep going

knotty moth
tropic garden
knotty moth
tropic garden
#

thanks i'll try again

weak cipher
#

Can the STT api do real time?

amber skiff
#

When creating a dataset, should I upload the audio as one file or separate it into multiple files of 10 seconds each?

#

@knotty moth

verbal oasis
#

hi guys, is there a way to make a generated audio more human?
I'm using applio, I generated my audio file and I'd like to make it sound more human after I generated it.
Is it possible?

low shard
knotty moth
simple ore
vapid mantle
#

Tam anlamadım 32K mı? 40K mı? 42K mı? hangisi?

simple ore
#

32k

vapid mantle
#

thx

bleak nymph
#

how good is laughing in a dataset?

#

would that ruin the model

#

i recorded my voice in audacity while playing cards against humanity and was laughing so much

#

it's very expressive this one unlike my other dataset

analog obsidian
bleak nymph
#

yeah ik that i'm just wondering if it'll ruin the dataset and if i should remove it

#

like if it'll make the model worse

analog obsidian
#

as long it's not excessive, should be fine

#

but idk what could happen if the dataset has excessive laughing

bleak nymph
#

how much is excessive

analog obsidian
bleak nymph
#

eh i'll just experiment

bleak nymph
analog obsidian
#

what matters in the set is using the whole vocal range, i confirmed that yesterday when i trained a model of my voice

#

10 mins and was using my whole voice range without problems

#

also helps speech models have decent singing

bleak nymph
#

wow

#

i was speaking very expressively

analog obsidian
#

yuh, go for 30 minutes of expressive audio and you'll be fine

#

10 mins still felt a bit unnatural

#

looks like the sweestpot for a natural model is rlly 30 mins

#

rvc does not stand for realtime voice changer, it means retrieval-based-conversion

#

rvc and w-okada are two separate things

#

rvc is an ai voice cloning software
w-okada allows the usage of rvc models in realtime

#

one is for training and inference of audio recordings
the other is for realtime inference

bleak nymph
#

WOAHHH

#

i started training

#

and it shows previous model graph too

#

that's so cool

#

bro why is the new model (blue) taking so much longer 💀

analog obsidian
#

the new one

bleak nymph
#

but shouldn't be THAT much

#

22 mins

#

total

analog obsidian
#

check model_info.json of the old model

bleak nymph
#

it was originally over 3 hours but i cut it shorter

analog obsidian
bleak nymph
#

total_dataset_duration": "00:29:53"

#

30 mins

#

the newer one is shorter

#

hol up

#

old dataset

#

new dataset

analog obsidian
#

both are batch size 8?

bleak nymph
analog obsidian
#

idk then lol

bleak nymph
#

😭

#

that's so weird

#

anyways i remember i put me singing thru my ai model cause i was curious

#

it made me have an american accent

#

like a country accent

#

i do not have an american accent why'd it give me one

analog obsidian
#

this is the voice of the original pretrain, if you're curious
your model is training using this as a base

#

the point of training using a pretrain is to change this voice ^ to the one in your dataset

bleak nymph
#

ohhh interesting

#

i see

#

wait so how do i train a pretrain or whatever

analog obsidian
bleak nymph
#

oh hell naw

#

😭

analog obsidian
#

then you have to do a pretty weird folder structure since every speaker require it's own folder

bleak nymph
#

right

analog obsidian
#

and train for like 1 month idk

bleak nymph
#

make a british one 🗣️

analog obsidian
#

well contentvec was trained using multiple languages

#

but it had more english data

bleak nymph
#

oh

#

so the english data contains british accents too? or nah

#

i'm assuming it had a lot more american accents

analog obsidian
#

i have no idea lol

edgy tangle
#

Hii

#

Im using kaggle on applio

#

this is a model with a dataset of 7 minutes and 4 batch size

#

but it gives me just 12 steps

#

On local training it gives me about 43 steps

#

but my pc explodes cat_blep

#

same dataset, same slicing, same pitch extraction

simple ore
#

kaggle is 2 gpus

#

so batch 4 means 2x4

#

7 minutes = 7 x 20 x 3s segments... so it should be ~105 steps with batch 4

#

so perhaps you did not slice audio properly and a bunch of segments are too big or too small

edgy tangle
#

I used simple slicing

#

3s

analog obsidian
# edgy tangle I used simple slicing

batch size 4 in kaggle it's batch size 8 in local
because every gpu runs the same batch size

so if you put batch size 4 in kaggle, the two gpus will run at batch size 4, which makes it bs 8

#

try batch size 2

#

in kaggle

#

thats it's the equivalent of batch size 4 in local

edgy tangle
#

oh

#

Ok ill try it

#

thx!

edgy tangle
#

I already sliced the audio and extracted features

simple ore
#

it is a simple check - if the train loader has less than 3 samples it means you f'd something up

#

show the extract features log

#

and preprocess too for a good measure @edgy tangle

edgy tangle
#

wait me a sec

#

I did everything again

#

still not working

karmic oliveBOT
#
☁️ Google Colabs
simple ore
edgy tangle
#

uh

#

sorry

#

what do you mean by logs

#

console or files?

#

im not native english speaker

#

anyways, ill send both

simple ore
#

that seems like 5s slices were used?

edgy tangle
#

ough

#

i forgot

#

but if I put 3s slices the problem still happens

simple ore
#

the training script should handle that, but it is weird

edgy tangle
#

Idk what is happening

simple ore
#

start anew

edgy tangle
#

Should I delete the code?

#

maybe I messed it up without noticing it

simple ore
#

I've tested myself

#

5s slices

edgy tangle
#

Ok

simple ore
#

but you should not use 5s, it is too much

edgy tangle
#

I didn't

#

Just forgot to set it at 3s

#

hehe

#

I don't know what I did

#

but I deleted the notebook and created a new one

#

And now is working

#

hehe

#

Anyways, thanks for the help!

#

now with 2 batch size gives me 22 steps

#

I think it is better(?)

bleak nymph
#

@analog obsidian hi

analog obsidian
#

you're still using 5s slicing

bleak nymph
edgy tangle
bleak nymph
#

or overtrained or need more training

edgy tangle
#

im using this config

analog obsidian
#

because from what i looking at, looks like it got stuck

#

but could be because smoothing 0 + the second box enabled

#

something like this

bleak nymph
analog obsidian
#

very flat line pattern

#

should be like mine

#

can you

#

disable the second box

#

and click the third one

#

maybe it's just that

#

if still looks like a flat line

#

then uhh the model got stuck

bleak nymph
analog obsidian
bleak nymph
#

oh

analog obsidian
#

bro

bleak nymph
#

😭

analog obsidian
#

disable this sheet

bleak nymph
#

i sent the same image twice before accidentally

analog obsidian
#

yeah the model got stuck

bleak nymph
#

what do i do then

analog obsidian
#

ideally g/total has to go down
but yours got stuck in a flat line

#

hmm

bleak nymph
#

do i change settings? delete model? restart?

analog obsidian
#

honestly i have no idea, last time i had that issue was because my set had eq and compression filters

#

but yours didn't had those

#

i know that too high batch size may cause the model to get stuck

#

the dataset was too hard for rvc

#

probably caused by the excessive laughing

#

but im not 100% sure

#

there are rare cases where rvc really hates one voice in specific

#

and refuses to learn it

#

could be a lot of things really

bleak nymph
#

hmm

bleak nymph
#

not many voice ai's can learn my voice

#

i was really expressive in this dataset

analog obsidian
#

maybe you can try increasing the batch size

#

9, 10, 11, 12, etc

bleak nymph
#

hm i'll do it another day

analog obsidian
#

see if the dataset is damaged in a way too

bleak nymph
#

would i need to start from scratch

analog obsidian
#

maybe the denoise was too much

bleak nymph
analog obsidian
#

things like that

bleak nymph
#

using de echo aggressive in uvr

#

normal de echo dont work that nice

analog obsidian
#

probably the dataset got too damaged during the cleaning process

bleak nymph
#

really?

analog obsidian
#

and that could be also the real problem

#

yeah

#

eq and compressors kills rvc training because they remove vital information needed

#

excessive cleaning do that as well

bleak nymph
#

right

analog obsidian
#

you can actually train a noisy dataset just fine

#

it'll require more epochs

analog obsidian
#

and the model will of course sound noisy

bleak nymph
#

yeah

analog obsidian
#

but it will work

bleak nymph
#

having mic reverb in it will be funny

#

it drives my friends MAD

analog obsidian
#

lol

bleak nymph
#

idk why it happens either, £5 mic stand fault ig

analog obsidian
#

i think probably something went wrong during the cleaning process

bleak nymph
#

natural plate reverb whenever i shout too 🔥

analog obsidian
#

oor it could be that the set was too expressive and hard for rvc to learn

bleak nymph
#

it was difficult to get rid of

#

i had to use an aggressive model for it

#

not many models recognised the reverb noise

analog obsidian
#

rvc can also train room reverb

bleak nymph
#

im gonna try the model anyway lmao, even knowing it won't be good

analog obsidian
#

distorts the voice too much

#

it can't really handle room reverb

#

in my testings of training noisy datasets, natural noise also randomly adds clicks to the output

#

but in very rare ocassions

analog obsidian
#

i trained that with this

#

and rvc outputs something like this

#

i recorded my dataset with my cheap 20$ usd mic
so far the only problem i noticed is that the model sounds metallic

im gonna add more samples to the dataset and if that doesn't fix it, it could be the noise thats causing the problem

bleak nymph
#

HOLD ON

#

ok some parts

#

it sounds really weird

#

but when i first clicked play i was blown away

#

sounded just like me

#

and then it sounded weird after that

#

like

#

some words are good

fast scarab
#

Hi everyone, I need some help please.
I'm trying to run Codename-RVC-Fork v3.0.4 on my PC, but I’m getting errors when launching it.
The terminal shows this error:
TypeError: argument of type 'bool' is not iterable
It also says:
Information: unable to find files for the specified model(s)
Then a bunch of errors follow related to Gradio and FastAPI, and the app crashes.
It also tells me: "When localhost is not accessible, a shareable link must be created. Please set share=True or check your proxy settings to allow access to localhost."
Does anyone know what could be causing this? I'm a bit stuck.
Thanks in advance.

analog obsidian
# bleak nymph some words are good

imo the best you can do is to record the dataset while reading a script and pretending to speaking to someone else, using your whole voice range

#

thats what i did, and it's sounds like me in every audio i use

bleak nymph
#

oh woah

analog obsidian
bleak nymph
#

lmao

#

alr just generate a script?

#

what's the prompt u used

analog obsidian
#

and then pretending to talk to someone

#

maybe ask to generate a dialogue between to people

#

and pretend you're them lol

bleak nymph
#

lmao i just listened to the model

#

it has my mic noises

#

the de echo didn't remove it 😭

analog obsidian
#

rx11 dialogue isolate does

#

actually now i think that the reason why the model got stuck is because the dataset had room reverb

karmic oliveBOT
#
☁️ Google Colabs
bleak nymph
#

it's mic stand

#

it genuinely creates reverb noise 😭

#

my room is tiny

edgy tangle
#

sounds like room reverb

fast scarab
#

?

odd shale
simple ore
fast scarab
simple ore
#

yeah, one of the libraries it depends on got updates last week and broke 5.13.x

#

newer gradio works fine

simple ore
fast scarab
simple ore
#

if code name is using conda, like applio

#

then it is simply

fast scarab
simple ore
#

env\python -m pip install gradio==5.23.1 from the app folder from cmd

rancid jay
#

hi

#

omg sorry worg cht

analog obsidian
#

he tested it and in fact they had a problem

knotty moth
#

tbh im not sure how much rvc tolerates against little reverb that may be left by anvuew mono derverb or RX dialogue isolate

knotty moth
bleak nymph
#

wait how can i use different pretrains

#

still not fully sure what a pretrain is tbh

weary stream
#

a

south orchid
#

I need a ai hub website so I can use the ai

hallow thistle
#

Pretrained RVC voice models are voice models that have been trained with many audio datas, typically larger than average RVC voice models (53MB).

hallow thistle
karmic oliveBOT
quasi condor
#

click the second one

hallow thistle
#

Duh, you simply just called it "the AI" again. Momoisnap

tropic garden
#

Talking about pretrains, for the newer Refinegan models can we use the cloud Applio?

formal wind
#

So is the lowest point on the graph or lowest spike?

knotty moth
formal wind
#

I can't do ts by ear. My hearings Ass 😭

knotty moth
knotty moth
simple ore
formal wind
#

Small is just a "little" understatement yt_nails

scarlet mesa
#

hey, so I'm having a bit of a problem
I just started this stuff up, and I'm not sure why, but the voice I imported doesn't seem to be working. It's just playing my voice through my headphones.

#

I'm not quite sure what I'm doing wrong, or if passtru is what's screwing me, but still

#

This is the error I seem to be getting on repeat
To the point where the client itself is telling me that it's getting frequent errors

[Voice Changer] VC PROCESSING EXCEPTION!!! Argument #4: Padding size should be less than the corresponding input dimension, but got: padding (0, 20) at dimension 2 of input [1, 128, 12]```
#

It wants me to check if the model is loaded, which I'm pretty damn sure it is

#

Fuck it, I'll just figure this out in the morning
This is giving me a headache

knotty moth
buoyant frost
#

is it possible to even run rvc gui on mac?

knotty moth
buoyant frost
#

what do you recommend i use?

knotty moth
buoyant frost
#

I thought that applio is only for windows

knotty moth
#

I suppose it should be similar to the installation for linux

simple ore
#

I use pyenv on linux, so it is pretty straightforward

#

install pyton 3.10 or 3.11, activate local env, pip install requirements

buoyant frost
#

finally got it to work thanks to chatgpt and terminal

hallow thistle
#

Only this time, but you can't expect ChatGPT to help you everything about the RVC program itself. Blue_lobster

buoyant frost
#

trying to get applio to work, finally got a voice model and an audio file uploaded but the conversion seems to have a problem. How long does it normally take to convert?

knotty moth
keen crescent
#

-colab

patent trellisBOT
# keen crescent -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

karmic oliveBOT
# keen crescent -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

hallow thistle
#

Is this the correct channel you're asking about W-Okada?

clear carbon
#

Eh... Thought w-okada was rvc (real time voice changer)
sorry bout that

analog obsidian
odd shale
#

W-okada is for realtime usage of models.

#

RVC is for model usage with prerecorded audios/inference

analog obsidian
clear carbon
glass rivet
#

is there a colab notebook where I can input audio files to train an rvc model of a particular singer?

patent trellisBOT
# knotty moth -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

karmic oliveBOT
# knotty moth -colab

Suggestions for @glass rivet

📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

knotty moth
#

(the applio link)

glass rivet
#

Thank you so much. omg

vale osprey
#

-colab

karmic oliveBOT
# vale osprey -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

patent trellisBOT
# vale osprey -colab
📒 Google Colab Notebooks

Google Colab is a Cloud (Remote Good PC) Service. While the Free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

• **Applio**

by IA Hispano
Google Colab

• **RVC Mainline**

by Hina
Google Colab

• **UVR5 NO UI**

by Eddy
Google Colab

• **UVR5 UI**

by Eddy
Google Colab

• **Wokada Deiteris Fork**

by Deiteris & Hina
Google Colab

• **Hina's Modified Original Wokada**
• **RVC-AI-Cover-Maker-WebUI

by Shiro & Eddy
Google Colab

• **FaceFusion UI**

by Nick088
Google Colab

• **FaceFusion NO UI**

by Nick088
Google Colab

• **Music Source Separation Training (Inference)**

by Jarredou & Makidanye
Google Colab

empty knoll
#

someone knows why the kaggle notebook doesn't work, I get some kind of error with gradio and local tunel

knotty moth
#

!howtoask

patent trellisBOT
# knotty moth !howtoask

How To Troubleshoot AIHC_WaitWhat

__**GIVE CONTEXT.**__ 📝
  • Don't simply mention your issue, like "my rvc is not working".
  • Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
  • The more context, the better.
__**BE POLITE.**__ <:matsuripray:1159685390156967936>
  • Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
  • It's okay if you're frustrated, but don't take it into this server.
  • Don't DM without prior consent.
__**BE PRODUCTIVE.**__ 🤝
  • Don't ask for every little instruction. Put your own effort & test things by yourself.
  • Don't ask to ask.
  • Check if your answer is a Google search away/on our guides website.
hallow thistle
#

Also make sure you screenshot the problem you encountered somewhere on Kaggle fror more context. The "Kaggle notebook" doesn't refer to a specific RVC notebook link.

low shard
#

@empty knoll what kaggle link? What issue? What u want to do and what’s ur pc gpu?

frail umbra
#

im training this green model and it just keeps getting better

knotty moth
frail umbra
#

I did give it like almost 2 hours of material

#

and it was all split into sentences by hand

knotty moth
frail umbra
#

idk how

knotty moth
frail umbra
#

line higher = mo' betta

#

I think it'll come down soon

knotty moth
#

what kind of misinformation did you get from?

frail umbra
#

I got a feeling it's gonna be good

#

the red one was a lil lackluster, I didnt use any pretrain though

#

it wasnt bad

knotty moth
#

how do you know it without the actual testing by inference?

#

you have baked a cake, how would you think it is "good" without tasting it?

shadow sparrow
#

Hey everyone, I'm training an AI model with Applio and ran into a TensorBoard error on the Colab page that says:

'Data could not be loaded. The TensorBoard server may be down or inaccessible. Last reload: Mar 31, 2025, 2:27:49 PM. Log directory: logs.'

I'm not sure if this is a solution or not, but since I can't access TensorBoard, I stopped the training on Applio, enabled 'Overtraining Detector,' and set it to stop after 50 epochs without improvement—just so the training doesn't run indefinitely.

Does anyone know what might be causing this issue or how to fix it?

analog obsidian
knotty moth
#

also don't lose the tfevents file in model's eval folder

flint solar
#

i was training a model the other day and my fm graph was at 2

#

like wtf lmfao

frail umbra
#

no I said the red

#

it has no pretrain

flint solar
#

its quite the opposite

frail umbra
#

nah nah nah I can tell something's cookin

#

a legendary model is manifesting itself

flint solar
frail umbra
#

yeah but this isn't a conventional voice

simple ore
#

you need to look at other chart to see which one is messing the whole thing up

#

fm and mel

flint solar
#

its probably fm

shadow sparrow
frail umbra
#

idk what these are

flint solar
#

😭

knotty moth
frail umbra
#

im not at a checkpoint for another hour

flint solar
#

why arent u using a pretrain

frail umbra
#

wait that was the wrong one

#

I forgot it was green

flint solar
frail umbra
#

the pretrain has an accent

knotty moth
# frail umbra

d/total approaching zero means it suffers mode collapse, which means the model would produce relatively same results, i.e. static noise

frail umbra
#

that's good

knotty moth
#

that's because you didn't use pretrain for the green graph model too

frail umbra
#

no I did

frail umbra
#

it's 2x as fast

knotty moth
frail umbra
#

it means the pretrainer is active

knotty moth
#

you still haven't figured out what's wrong, what are you smoking of? PekoCry

simple ore
#

This is so weird.. even without pretrain fm and mel should go down

#

this is complete fuckery

#

like trying 32 batch maybe with a small set

flint solar
#

funny ass graphs

knotty moth
#

with default mainline configurations and insufficient dataset for training from scratch, it will never be good, the results will be the same static noise, causing the discriminator easily identify the generated samples as fake (hence the d/loss keeps approaching zero)

frail umbra
#

it has pretrain

#

8 batch

#

somewhere between 90min-120min of content

#

and 2000+ files separated into sentences (by hand)

analog obsidian
#

i see the old graphs but could be also the compiled version of applio
disabling the pretrain in mainline is a bit tricky (just dont put anything in the pretrain path misc_trolley )

knotty moth
#

that's why you should use an existing pretrain, and the process is called finetuning

training from scratch would need massive amount of dataset containing several speakers (for example, VCTK dataset)

analog obsidian
frail umbra
#

the green literally has pretrain

simple ore
#

you have to go to advanced options and uncheck it

#

okay, so using pretrain and 2 hours of audio

frail umbra
#

my expert analysis just tells me we have to push this to 2000 epochs or beyond

simple ore
#

can you send a piece of dataset to check?

#

with a pretrain training to 2000 epochs would wipe most of the pretrain's knowledge, especially if you got 2hr set

frail umbra
flint solar
#

what the fuck is this bullshit

frail umbra
#

it's my uncle

flint solar
#

na bro nah 😭😭

#

u cant train ts

simple ore
#

bro

#

i dont even

flint solar
#

are these recordings from the 60's

#

and why is it .wav.mp3

#

😭

#

so much shit going on

frail umbra
#

the audiofiles all been converted a couple times...

simple ore
#

6khz from shitty chinese answering machine

knotty moth
simple ore
#

i mean... this is real

frail umbra
simple ore
#

i've not tried to run training with shitty recording like that

frail umbra
#

exactly this is a new frontier

#

AI pioneers

crude flame
#

💀

flint solar
#

bro recorded his dreams

#

😭

simple ore
#

no, it is from an answering machine

#

but it is not even 12k

#

i've resampled my set to 12k and it still sounds better than this shit

knotty moth
#

the worst thing I ever had was when trying to train a dataset from some mobile games in 96/128 kbps mp3 quality, yea it sounded robotic as hell but the graph pattern with the default mainline pretrain was not really much different

simple ore
frail umbra
#

it's un-reproducible

#

exquisite audio

simple ore
#

I think it would be better to just get good 30-60s and then use that as reference for XTTS

#

and make some tts outputs

frail umbra
#

I had a really good model but I accidentally deleted all my AI stuff

#

😢

#

it was GOOD

#

excellent

#

i dont remember how I made it really

knotty moth
simple ore
#

i mean to clone the voice

#

it will be less shitty, even considering that synthetic audio is not good for training

frail umbra
#

it needs the grime for flavor

knotty moth
#

rvc miku is going real

knotty moth
knotty moth
hallow thistle
#

I'm fine, thank you.

#

What are you looking for? Is it RVC?

steep stirrup
#

yes

idle osprey
#

whats this

hallow thistle
idle osprey
#

what do i click on the other tab

#

oh were on rvc

hallow thistle
hallow thistle
steep stirrup
#

I used three fork models
Mangio-RVC
applio
RVC1006Nvidia

hallow thistle
#

What is your PC GPU?

steep stirrup
hallow thistle
#

That's good enough.

#

Did you install the latest version of Applio?

steep stirrup
#

ApplioV3.2.8

#

Yes, but the problem was that the need for embedders, which cannot be found in our language, was an example of the use of which the sound became extremely awful.

simple ore
#

while the embedder is mostly trained on english, your sibilant and breathing problem is not related to that

steep stirrup
crude flame
#

the sibilants sounding bad are prob due to them being overfitted or you only have those bas esses in the dataset

steep stirrup
#

I don't know what to do, I'm confused

knotty moth
flint solar
empty knoll
low shard
empty knoll
#

I had already used it a week ago but for whatever reason now it doesn't work peepoban

low shard
#

Also, have you tried deleting the Kaggle notebook and refolloeing the guide steps?

frail umbra
#

OK U GUYS WERE RIGHT THE MODEL IS NOTHING BUT SINE WAVES

#

I think I mismatched the sample rate or something

flint solar
flint solar
frail umbra
#

thats at 100epoch too lol

flint solar
# frail umbra

In my 2 years of making rvc models I never seen anything like this before

flint solar
crude flame
frail umbra
#

yeah it shits out between 20 and 40 epoch

flint solar
#

How could it be THAT ass

crude flame
frail umbra
#

ok I resampled them all at 32khz

#

gonna run it again

flint solar
#

Oops

#

I said the n word …

#

I forgot this is ai hub

frail umbra
#

I didnt see it

flint solar
#

I think I’m getting timed out lmfao

flint solar
frail umbra
#

ehhh I made them all wav's

#

and now it's really big

crude flame
flint solar
frail umbra
#

and I deleted the mp3s...

flint solar
frail umbra
#

ts?

flint solar
crude flame
#

just how

flint solar
#

4k

flint solar
frail umbra
#

I trained a good one a few months ago, I just gotta re-figure it out

#

it's duncan

crude flame
#

how does someone get this bad audio

frail umbra
flint solar
#

6k audio is funny asf