brittle wing Feb 18, 2025, 12:54 PM

#

oh, sorry

serene musk Feb 18, 2025, 2:50 PM

#

-rvc

karmic oliveBOT Feb 18, 2025, 2:50 PM

#

serene musk -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

hallow wasp Feb 18, 2025, 3:27 PM

#

-rvc

karmic oliveBOT Feb 18, 2025, 3:27 PM

#

hallow wasp -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

warm oriole Feb 18, 2025, 5:33 PM

#

Is there any local alternative for suno ai ?

low shard Feb 18, 2025, 5:33 PM

#

warm oriole Is there any local alternative for suno ai ?

don't think so yet

stoic viper Feb 18, 2025, 5:34 PM

#

Hey everyone,
l have enabled the overtraining threshold in Applio so that training stops automatically if no improvement is detected.
In this case, do I still need to use TensorBoard to monitor the training, or is the threshold enough on its own to prevent overtraining?

crude flame Feb 18, 2025, 5:35 PM

#

stoic viper Hey everyone, l have enabled the overtraining threshold in Applio so that traini...

Dont use the overtraining detector because its inaccurate

#

tensorboard is much better

#

even more so if you have the avg graphs

stoic viper Feb 18, 2025, 5:38 PM

#

crude flame Dont use the overtraining detector because its inaccurate

Thanks for the advice! I’ll stick with TensorBoard then.

brittle wing Feb 18, 2025, 7:04 PM

#

i don't think so, g/total isn't rising up

#

keep training until g/total starts rising up and never goes down

fair ivy Feb 18, 2025, 7:16 PM

#

Help I can't seem to download okada the online way

low shard Feb 18, 2025, 7:17 PM

#

fair ivy Help I can't seem to download okada the online way

Wrong channel

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

Tell ur PC GPU in #🔍│help-w-okada first

edgy tangle Feb 18, 2025, 7:26 PM

#

brittle wing i don't think so, g/total isn't rising up

I'll try to resume training, i just put it to 250 epochs

brittle wing Feb 18, 2025, 7:27 PM

#

edgy tangle I'll try to resume training, i just put it to 250 epochs

next time you can try putting 1000, so you can see better the lowest point of the tensorboard

#

and when the overtrain actually starts

#

(you know, g/total rising up forever)

edgy tangle Feb 18, 2025, 8:26 PM

#

brittle wing next time you can try putting 1000, so you can see better the lowest point of th...

Yeah… im using local training because i dont have money for colab pro, and i dont have time for not being afk kicked by colab

#

5 minutes per epoch

simple ore Feb 18, 2025, 8:29 PM

#

training for more may not bring any improvement and can actually make things worse

edgy tangle Feb 18, 2025, 8:34 PM

#

I think so, my dataset is ≈1h

brittle wing Feb 18, 2025, 10:11 PM

#

simple ore training for more may not bring any improvement and can actually make things wor...

you're right, it causes overtraining

#

what i want to suggest the user is train 1000 epochs, but stop when g/total actually overtrains

simple ore Feb 18, 2025, 10:13 PM

#

g/total may not even go up

#

but the model still go to shit and lose all the knowledge from pretrain

brittle wing Feb 18, 2025, 10:13 PM

#

yea on long datasets overtraining is rare to see

simple ore Feb 18, 2025, 10:13 PM

#

5min audio, 1700e - lost all ability to sing

brittle wing Feb 18, 2025, 10:13 PM

#

simple ore but the model still go to shit and lose all the knowledge from pretrain

you mean with the og pretrain?

brittle wing Feb 18, 2025, 10:14 PM

#

simple ore 5min audio, 1700e - lost all ability to sing

because of overtraining, right?

#

1700 epochs is crazy for a 5 minute dataset

simple ore Feb 18, 2025, 10:14 PM

#

but it did not become overtrained = robot voice

#

and total g did not go up

#

brittle wing Feb 18, 2025, 10:15 PM

#

simple ore

50k and g/total didn't die

#

maybe the model will sound shit at some point

simple ore Feb 18, 2025, 10:15 PM

#

no, it was fine for speaking

brittle wing Feb 18, 2025, 10:15 PM

#

oh nice then, maybe the dataset is just speaking

#

that's why it may be bad at singing

simple ore Feb 18, 2025, 10:16 PM

#

it could sing at 500e, not could not at 1700

#

the trained model retains pretrain features, overtraining in most common sense is training a model so much it forgets the previous training

brittle wing Feb 18, 2025, 10:31 PM

#

simple ore the trained model retains pretrain features, overtraining in most common sense i...

so more epochs = less ability to sing?

simple ore Feb 18, 2025, 10:48 PM

#

more epoch = more of learning from the dataset, more of losing previous knowledge

brittle wing Feb 18, 2025, 11:00 PM

#

simple ore more epoch = more of learning from the dataset, more of losing previous knowledg...

so more overtraining makes the model forget about previous information/knowledge

#

and that's why models lose ability to sing, right?

simple ore Feb 18, 2025, 11:01 PM

#

pretty much. the ability to generate higher harmonics was pushed out by some other realignment

brittle wing Feb 18, 2025, 11:12 PM

#

interesting about the realignment thing in the harmonics

#

wdym by realignment anyways?

simple ore Feb 18, 2025, 11:19 PM

#

I dont know specific parts of the model that change this way, there are many millions of parameters responsible for the waveform generation after all

#

inference uses speaker latents and noise to generate a predicted spectrogram

#

so with overtraining it fails to make one with higher harmonics

hallow thistle Feb 19, 2025, 12:42 PM

#

fair ivy Help I can't seem to download okada the online way

For W-Okada, go to #🔍│help-w-okada . This channel #✨│ai-help isn't where you asking where to download a working W-Okada program.

vale oak Feb 19, 2025, 1:49 PM

#

Why is it changed in Weights to get songs on YouTube??? It is way easier and idk how to get songs in audio files 💔

hallow thistle Feb 19, 2025, 1:55 PM

#

vale oak Why is it changed in Weights to get songs on YouTube??? It is way easier and idk...

More likely they had problems trying to get YouTube downloader to work again. But most of audio files that have been downloaded from YouTube before the removal are still there in their database.

vale oak Feb 19, 2025, 1:56 PM

#

Is it gonna come back?

hallow thistle Feb 19, 2025, 1:58 PM

#

I don't know, but better look out for more information in Weights' Discord server. Although you won't be able to submit any YouTube link there on Weights, you can still do this with an already AI converted track that used an audio from YouTube to convert.

#

These two tracks were made after the removal of YouTube link feature on Weights, but can still see the YouTube icon marked on both.

edgy tangle Feb 19, 2025, 2:02 PM

#

simple ore but it did not become overtrained = robot voice

literally my ai voice rn

#

I'll create another dataset then

hallow thistle Feb 19, 2025, 2:16 PM

#

https://tenor.com/view/i-saw-what-you-deleted-cat-gif-25407007

Tenor

fast scarab Feb 19, 2025, 2:21 PM

#

Hey everyone,
I think my model is overtraining because my loss/g/total is increasing sharply after 46k steps.
Should I stop training now, or is there something I can do to fix this?
Also, if someone could help me understand this better, I would really appreciate it because I'm a bit lost.
Here’s my TensorBoard screenshot for reference.

simple ore Feb 19, 2025, 2:57 PM

#

fast scarab Hey everyone, I think my model is overtraining because my loss/g/total is increa...

go to Scalars tab and show the whole chart

#

increase sharply is not the increase from 34 to 34.5

fast scarab Feb 19, 2025, 3:05 PM

#

simple ore go to Scalars tab and show the whole chart

Hey @Noobies,
Here’s the full chart from the Scalars tab. Does this confirm overtraining, or is there something else I should look at?

simple ore Feb 19, 2025, 3:19 PM

#

this is not scalars tab

#

you're showing Time Series

#

fast scarab Feb 19, 2025, 3:35 PM

#

simple ore this is not scalars tab

simple ore Feb 19, 2025, 3:37 PM

#

how hard is to click 'SCALARS' ???

hallow thistle Feb 19, 2025, 3:40 PM

#

That doesn't look like what the Scalars tab should be. The button is supposed to be orange, not gray.

fast scarab Feb 19, 2025, 3:40 PM

#

simple ore how hard is to click 'SCALARS' ???

SORRY 😂

hallow thistle Feb 19, 2025, 3:41 PM

#

https://cdn.discordapp.com/emojis/1015914956971057213.webp?size=48

simple ore Feb 19, 2025, 3:43 PM

#

loss?

#

anyway, gradients look like crap, fm going up from the start, not good at all

urban wasp Feb 19, 2025, 3:45 PM

#

-collab

#

-colab

karmic oliveBOT Feb 19, 2025, 3:45 PM

#

urban wasp -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

fast scarab Feb 19, 2025, 3:46 PM

#

simple ore loss?

simple ore Feb 19, 2025, 3:50 PM

#

how big is the dataset?

fast scarab Feb 19, 2025, 3:53 PM

#

simple ore how big is the dataset?

My dataset is about 34 minutes long.

simple ore Feb 19, 2025, 3:55 PM

#

batch size?

fast scarab Feb 19, 2025, 3:56 PM

#

simple ore batch size?

My batch size is 20

crude flame Feb 19, 2025, 3:57 PM

#

fast scarab My batch size is 20

lower that to 8 and try training again

20 is way to high

simple ore Feb 19, 2025, 3:59 PM

#

even to 4

#

FM has to go down

fast scarab Feb 19, 2025, 4:00 PM

#

crude flame lower that to 8 and try training again 20 is way to high

Okay, I'll set the batch size to 8 and restart training. Thanks!

crude flame Feb 19, 2025, 4:01 PM

#

simple ore FM has to go down

that fm is pretty normal imo, usually it goes down for several epochs then goes up infinitely

analog obsidian Feb 19, 2025, 4:04 PM

#

crude flame that fm is pretty normal imo, usually it goes down for several epochs then goes ...

doesn't go that up for me unless im overtraining it

#

its like a slow rising

fast scarab Feb 19, 2025, 4:07 PM

#

I'll try that, thanks for your help!

edgy tangle Feb 19, 2025, 4:17 PM

#

I just optimized my dataset and the time per epoch went from 4¿ minutes to 2 minutes

craggy brook Feb 19, 2025, 4:20 PM

#

Why is there this error?

knotty moth Feb 19, 2025, 4:21 PM

#

craggy brook Why is there this error?

-gui

karmic oliveBOT Feb 19, 2025, 4:21 PM

#

knotty moth -gui

https://cdn.discordapp.com/attachments/1122285248844144733/1203460490475343953/caption.gif?ex=65d12cec&is=65beb7ec&hm=bd2fb8d010006dd7c6e3c1c67d3ae846fd1478e1a3124c544c31b43086fe54aa&

knotty moth Feb 19, 2025, 4:22 PM

#

you should use one of these newer ones

#

-rvc

karmic oliveBOT Feb 19, 2025, 4:22 PM

#

knotty moth -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

craggy brook Feb 19, 2025, 4:22 PM

#

knotty moth -gui

what?

knotty moth Feb 19, 2025, 4:24 PM

#

do you think rvc could have loras?

tame mica Feb 19, 2025, 4:27 PM

#

ai image help is obviously in #🔍│help-w-okada

gloomy lynx Feb 19, 2025, 5:09 PM

#

guid

low shard Feb 19, 2025, 7:42 PM

#

Realtime voice changer for calls?

#

this is the wrong channel then

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

@autumn viper tell your PC GPU in #🔍│help-w-okada

#

so, you just want to use it on pre-recorded audios right?

#

then let's use this channel, what's your pc gpu?

#

that's too weak, it could run locally on CPU if you got enough ram and good enough cpu, but it would be extremely slow anyways so not worth to run locally

#

Train (make) RVC Models on cloud:

Prepare the Dataset
Setup RVC:
Choose a cloud way to use RVC,

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.gg: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio (ui)

#

Via cloud, you will run it on a remote good pc

molten fog Feb 19, 2025, 9:35 PM

#

#

why is only the eval model in my tensorboard on lightning ai?

simple ore Feb 19, 2025, 10:02 PM

#

eval is the folder for logs

low shard Feb 19, 2025, 10:14 PM

#

@warm glacier #🔍│help-w-okada message

#

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

warm glacier Feb 19, 2025, 10:17 PM

#

thanks for the information

low shard Feb 19, 2025, 10:19 PM

#

warm glacier thanks for the information

Yw and lmk

quaint tusk Feb 19, 2025, 10:20 PM

#

Hey, everyone, can I ask for help on an error here or is it on the making-models channel?

#

It happened when trying to reload a model in a new session, in colab applio

simple ore Feb 19, 2025, 10:37 PM

#

@jovial pollen if you need something, ask here

jovial pollen Feb 19, 2025, 10:46 PM

#

simple ore <@274540091630813185> if you need something, ask here

My apologises I wanted to know if you had any test model to be able to test on the new realtime

#

For refinegan

#

Since the other ones doesn’t work due to being old

#

Of course if u don’t have anything im sorry for disturbing then

simple ore Feb 19, 2025, 10:52 PM

#

jovial pollen My apologises I wanted to know if you had any test model to be able to test on t...

https://huggingface.co/Aznamir/test/blob/main/Finetune_44k_500e_15500s.pth

jovial pollen Feb 19, 2025, 10:54 PM

#

simple ore https://huggingface.co/Aznamir/test/blob/main/Finetune_44k_500e_15500s.pth

Lord thank you

jovial pollen Feb 19, 2025, 11:08 PM

#

simple ore https://huggingface.co/Aznamir/test/blob/main/Finetune_44k_500e_15500s.pth

hmmm just wanted to know this is a test model right ? Not made to sound good ?

#

overall it sound amazing!

#

on the low and the high the voice doesn't break anymore

simple ore Feb 19, 2025, 11:12 PM

#

it is a model made with a very small dataset, 5:30s

jovial pollen Feb 19, 2025, 11:12 PM

#

simple ore it is a model made with a very small dataset, 5:30s

doesn't break, do you hava female per chance ?

#

(maybe too much to ask 🙏)

simple ore Feb 19, 2025, 11:13 PM

#

no

jovial pollen Feb 19, 2025, 11:13 PM

#

simple ore no

aight welp thanks a lot!

edgy tangle Feb 19, 2025, 11:20 PM

#

#

Is this overtrained?

gleaming chasm Feb 19, 2025, 11:31 PM

#

Is there any way wo make the AI voice sound more crisp by doing some adjustements to your mic?

analog obsidian Feb 19, 2025, 11:33 PM

#

gleaming chasm Is there any way wo make the AI voice sound more crisp by doing some adjustement...

no
also voice changer questions belongs in the voice changer help channel -> #🔍│help-w-okada

analog obsidian Feb 19, 2025, 11:34 PM

#

edgy tangle

graphs aren't very accurate in showing overtraining, you just have to hear the epochs

#

its possible for a model start to overtrain even when the g/total graph is still going down
the model forgets most of what it learned from the pretrain if you train it for too long
overtrained models are pretty obvious, the model sounds robotic/disorted and it struggles to inference any audio, tho every model overtrains differently, some overtrained models still are able to do some stuff despite forgetting things

crude flame Feb 19, 2025, 11:47 PM

#

analog obsidian its possible for a model start to overtrain even when the g/total graph is still...

Does this mean we should try making lower epoch models so the models doesnt forget stuff the pretrain taught it

analog obsidian Feb 19, 2025, 11:48 PM

#

crude flame Does this mean we should try making lower epoch models so the models doesnt forg...

no idea, im actually interested in trying this

#

i did it with the jeff model
after e64 the model forgot how to sing

simple ore Feb 19, 2025, 11:48 PM

#

well, mainly dont run 1000+ epochs

edgy tangle Feb 20, 2025, 1:04 AM

#

analog obsidian graphs aren't very accurate in showing overtraining, you just have to hear the e...

Well, i’ll test it later

mossy crest Feb 20, 2025, 9:44 AM

#

Hello, I want a voice similar to children chorus. Is there any?

brittle wing Feb 20, 2025, 10:33 AM

#

Hey all, been outta the loop since the initial ai boom and need to brush up my knowledge.
Is the RVC client by w-okada still the best tool for the job, or have people moved to something else by now?
Thanks.

#

Ah, I see there's guides sections in here, I'll poke around a bit.

knotty moth Feb 20, 2025, 10:36 AM

#

brittle wing Hey all, been outta the loop since the initial ai boom and need to brush up my k...

please go to #🔍│help-w-okada and read the pinned guide there

agile storm Feb 20, 2025, 12:01 PM

#

guys, is there any working rvc training colab?

low shard Feb 20, 2025, 12:03 PM

#

mossy crest Hello, I want a voice similar to children chorus. Is there any?

It's prohibited to share models about kids.

low shard Feb 20, 2025, 12:03 PM

#

agile storm guys, is there any working rvc training colab?

First, did you try checking your PC GPU in case it's good enough?

neat glacier Feb 20, 2025, 12:22 PM

#

joe_shhh

fast scarab Feb 20, 2025, 12:38 PM

#

Hello,
I set the batch size to 4 as recommended, but now the graphs seem to be stagnating. Here are the charts:
I’m not sure if this is normal or if it indicates a problem. Could someone tell me if this is okay or if I need to adjust something else?
Thanks for your help!

#

knotty moth Feb 20, 2025, 12:42 PM

#

fast scarab

click this damn thing

fast scarab Feb 20, 2025, 12:47 PM

#

knotty moth click this damn thing

Alright, thanks! I’ll try that right now.

neat glacier Feb 20, 2025, 12:48 PM

#

fast scarab Alright, thanks! I’ll try that right now.

am i geeked or is this graph absolutely fucked

fast scarab Feb 20, 2025, 12:55 PM

#

neat glacier am i geeked or is this graph absolutely fucked

Yeah, it doesn’t look good. Any advice on how to fix it?

neat glacier Feb 20, 2025, 12:58 PM

#

what does your dataset look like? is it nice and clean, and how long is it?

fast scarab Feb 20, 2025, 1:02 PM

#

neat glacier what does your dataset look like? is it nice and clean, and how long is it?

Yes, it was processed with UVR to remove noise, and it’s an MP3 file.

neat glacier Feb 20, 2025, 1:03 PM

#

some basic questions

what's the learning rate
what pretrain (cuz that's a thing now)

#

that is some INSANE levels of mode collapse

low shard Feb 20, 2025, 1:08 PM

#

neat glacier <:joe_shhh:1169303487964786709>

Baffled

craggy brook Feb 20, 2025, 1:14 PM

#

I will be doing this voice as Meggy Spletzer. How can I adjust it in the best way that is realistic and not robotic? Or can you send it to me?

fast scarab Feb 20, 2025, 1:33 PM

#

neat glacier some basic questions - what's the learning rate - what pretrain (cuz that's a th...

Is there a tutorial somewhere instead?

simple ore Feb 20, 2025, 2:34 PM

#

fast scarab Hello, I set the batch size to 4 as recommended, but now the graphs seem to be s...

you're not actually training anything

#

i mean.. 2 steps per epoch?

#

you f'd up preprocess/extract features

knotty moth Feb 20, 2025, 3:47 PM

#

there's no way it's 0.1 steps per epoch, perhaps it means it reached 12k epochs

knotty moth Feb 20, 2025, 3:48 PM

#

neat glacier some basic questions - what's the learning rate - what pretrain (cuz that's a th...

the default learning rate is 1e-4 and there's no reason to touch it for most cases in rvc

neat glacier Feb 20, 2025, 3:52 PM

#

knotty moth the default learning rate is 1e-4 and there's no reason to touch it for most cas...

why?

knotty moth Feb 20, 2025, 3:52 PM

#

neat glacier that is some INSANE levels of mode collapse

I don't think so, it could possibly happen if the loss disc keeps approaching zero, for example

neat glacier Feb 20, 2025, 3:52 PM

#

i remember the original Ilaria RVC had that as an option

#

also how come we can't change activation functions?

#

rvc has not aged well

crude flame Feb 20, 2025, 3:53 PM

#

neat glacier that is some INSANE levels of mode collapse

Mode collapses aren’t really a thing, those large jumps are just rvc learning silence which is good

neat glacier Feb 20, 2025, 3:54 PM

#

what happened between interval 10 and 12?

#

looks like a collapse to me

knotty moth Feb 20, 2025, 3:55 PM

#

neat glacier also how come we can't change activation functions?

it's proven that changing it may make things worse or take more resources/vram

neat glacier Feb 20, 2025, 3:56 PM

#

that's interesting

#

gelu should theoreticall be better than relu?

knotty moth Feb 20, 2025, 3:57 PM

#

neat glacier gelu should theoreticall be better than relu?

check noobies' comment on this #🔊│ai-development message

neat glacier Feb 20, 2025, 3:58 PM

#

no access

knotty moth Feb 20, 2025, 3:59 PM

#

activate the ai testing role here

neat glacier Feb 20, 2025, 4:00 PM

#

knotty moth check noobies' comment on this https://discord.com/channels/1159260121998827560/...

interesting

craggy brook Feb 20, 2025, 4:16 PM

#

craggy brook I will be doing this voice as Meggy Spletzer. How can I adjust it in the best wa...

How can I make Meggy's voice realistic?

ember hollow Feb 20, 2025, 4:41 PM

#

#

@low shard

#

When I try to put a mic

knotty moth Feb 20, 2025, 4:43 PM

#

ember hollow

go to #🔍│help-w-okada and read the pinned guide there for some troubleshooting

simple ore Feb 20, 2025, 4:48 PM

#

neat glacier what happened between interval 10 and 12?

There's no way for tensorboard to show such X axis... unless you're training on 2 mute files and all your unsliced audios were tossed out

low shard Feb 20, 2025, 5:09 PM

#

ember hollow

wrong channel, tell me ur browser in #🔍│help-w-okada

craggy brook Feb 20, 2025, 5:23 PM

#

I did these as examples, but none of them turned out as good as I wanted.

remote karma Feb 20, 2025, 5:29 PM

#

How do I download RVC V2?

low shard Feb 20, 2025, 5:29 PM

#

remote karma How do I download RVC V2?

what's ur pc gpu
what do you want to do?

remote karma Feb 20, 2025, 5:30 PM

#

4060 ti and i just wanna troll with a female voice

low shard Feb 20, 2025, 5:30 PM

#

remote karma 4060 ti and i just wanna troll with a female voice

ttroll in calls, meaning realtime voice changer for calls?

remote karma Feb 20, 2025, 5:30 PM

#

Yes

low shard Feb 20, 2025, 5:30 PM

#

then this is the wrong program and channel

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

@remote karma ur pc is good, I will ping u in #🔍│help-w-okada

remote karma Feb 20, 2025, 5:31 PM

#

Sounds good thanks

craggy brook Feb 20, 2025, 6:07 PM

#

craggy brook I did these as examples, but none of them turned out as good as I wanted.

...?

woeful cave Feb 20, 2025, 8:56 PM

#

is applio inference not working?

craggy brook Feb 20, 2025, 9:03 PM

#

craggy brook I did these as examples, but none of them turned out as good as I wanted.

It works but the settings are not correct and the voice sounds like a robot.

coarse crest Feb 20, 2025, 11:09 PM

#

does anyone know how to create your own RVC model?

fast scarab Feb 20, 2025, 11:18 PM

#

Hey everyone,
I finally managed to get the training to work! However, I’m having trouble locating the checkpoint files in RVC. I want to use the best checkpoint I found, but I can’t seem to find where it’s saved.
Does anyone know the exact folder or path where RVC saves the checkpoint files? I checked a few places but no luck so far. Any help would be appreciated!

#

simple ore Feb 21, 2025, 12:01 AM

#

fast scarab Hey everyone, I finally managed to get the training to work! However, I’m having...

assets/weights

#

for mainline RVC

chilly slate Feb 21, 2025, 12:40 AM

#

#✨│ai-help #🔍│help-ai-art Hey I was trying to use voice model to change my ai assistant voice but couldn't do it bcz of some error is there anything who can help?

fast scarab Feb 21, 2025, 2:22 AM

#

simple ore assets/weights

I didn’t activate this.

hallow thistle Feb 21, 2025, 5:47 AM

#

!howtoask

patent trellisBOT Feb 21, 2025, 5:47 AM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

opal hill Feb 21, 2025, 7:05 AM

#

heya, what does crepe hop do for RVC f0 method crepe?

grand pecan Feb 21, 2025, 7:45 AM

#

Hi guys quick question, is this overtraining / converging? From the point I think it's the lowest it has started going kind of flat. Ty
https://ibb.co/RT4S1HLY

simple ore Feb 21, 2025, 10:13 AM

#

grand pecan Hi guys quick question, is this overtraining / converging? From the point I thin...

check other charts, fm and mel

pastel oak Feb 21, 2025, 10:13 AM

#

opal hill heya, what does crepe hop do for RVC f0 method crepe?

stone dawn Feb 21, 2025, 1:45 PM

#

Hi! Is there an RVC that works for conversion voice only?

craggy brook Feb 21, 2025, 2:25 PM

#

help

simple ore Feb 21, 2025, 4:06 PM

#

stone dawn Hi! Is there an RVC that works for conversion voice only?

it is called RVC (Retrieval-based-Voice-Conversion)

tame mica Feb 21, 2025, 5:06 PM

#

stone dawn Hi! Is there an RVC that works for conversion voice only?

ilaria rvc

graceful patio Feb 21, 2025, 5:08 PM

#

Can anyone help me on this

#

So im training a model that has

#

a dataset with about 10-15 mins

#

and its taking 40 steps per epoch

#

on a batch size of 8

#

is this a good sign or a bad sign?

#

using refinegan btw

#

using noobies base train 44k sample rate

jovial pollen Feb 21, 2025, 5:23 PM

#

graceful patio using refinegan btw

sorry to ask, i'm not a helper but you're doing a refinegan model as we speak ?

graceful patio Feb 21, 2025, 5:37 PM

#

jovial pollen sorry to ask, i'm not a helper but you're doing a refinegan model as we speak ?

Yeah its all good, im currently using refinegan right now

#

its training

graceful patio Feb 21, 2025, 5:38 PM

#

jovial pollen sorry to ask, i'm not a helper but you're doing a refinegan model as we speak ?

using https://discord.com/channels/1159260121998827560/1327283027776245810

jovial pollen Feb 21, 2025, 5:38 PM

#

graceful patio Yeah its all good, im currently using refinegan right now

do you plan on making it public or no if u don't mind telling me ? I wanted to see how well refinegan performs on realtime

graceful patio Feb 21, 2025, 5:38 PM

#

jovial pollen do you plan on making it public or no if u don't mind telling me ? I wanted to s...

im planning on posting it yes

#

its an artist

#

that im training on rn

jovial pollen Feb 21, 2025, 5:39 PM

#

graceful patio that im training on rn

english ?

graceful patio Feb 21, 2025, 5:39 PM

#

jovial pollen english ?

yep all english dataset

#

i can ping you when results are ready

#

probably needs another hour or so

#

because of the steps its taking a while (to make progress)

#

well right now i am actually at 2k steps for every 50 epochs

#

let me see if i can get a sample right now

jovial pollen Feb 21, 2025, 5:42 PM

#

graceful patio let me see if i can get a sample right now

many thanks!

graceful patio Feb 21, 2025, 5:44 PM

#

jovial pollen many thanks!

hmmm seems like it needs more training but im still gonna send you a sample (this is a rapper model / lil uzi vert) and you can hear some progress but i heard that it needs like way more epochs to get an actual really good result

jovial pollen Feb 21, 2025, 5:44 PM

#

graceful patio hmmm seems like it needs more training but im still gonna send you a sample (thi...

aighty thanks!

graceful patio Feb 21, 2025, 5:44 PM

#

hm its not robotic you can tell

#

its just very like airy weird

jovial pollen Feb 21, 2025, 5:45 PM

#

yeah weird

graceful patio Feb 21, 2025, 5:45 PM

#

its only 50 epochs rn

jovial pollen Feb 21, 2025, 5:45 PM

#

that explains haha

graceful patio Feb 21, 2025, 5:45 PM

#

heard you need to reach 200-500 to hear any noticeable difference lmao

#

since this one is more advanced

jovial pollen Feb 21, 2025, 5:45 PM

#

lets see around that then

graceful patio Feb 21, 2025, 5:46 PM

#

jovial pollen lets see around that then

alright i will ping you when ready then smokesalute

jovial pollen Feb 21, 2025, 5:53 PM

#

graceful patio alright i will ping you when ready then <:smokesalute:1178450188780712118>

bet

earnest knoll Feb 21, 2025, 6:10 PM

#

might be a stupid question, but how do I have applio generate the index file at a specific epoch once i've found the lowest g/loss value on the tensorboard, or do I have to restart training and stop at that epoch to generate the index?

royal marsh Feb 21, 2025, 6:18 PM

#

Hello anyone have problems with w-okada like gpu goes to 100% usage and freeze totally the game you playing?

opal hill Feb 21, 2025, 6:26 PM

#

pastel oak

cheers

low shard Feb 21, 2025, 6:35 PM

#

royal marsh Hello anyone have problems with w-okada like gpu goes to 100% usage and freeze t...

this is the wrong channel

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

elaborate:

your pc gpu
what guide link did you follow
a screenshot of your wokada

in #🔍│help-w-okada

red kayak Feb 21, 2025, 6:57 PM

#

graceful patio

thats very likely ur dataset messing with the learning process. are you using studio recording or are you using uvr isoalated vocals

graceful patio Feb 21, 2025, 8:21 PM

#

red kayak thats very likely ur dataset messing with the learning process. are you using st...

im using uvr isolated vocals

#

cleanest though

#

its a lil uzi vert model in the making rn

#

i can send you a sample

red kayak Feb 21, 2025, 8:22 PM

#

graceful patio im using uvr isolated vocals

send a 3 sec sample here

graceful patio Feb 21, 2025, 8:22 PM

#

of the dataset

red kayak Feb 21, 2025, 8:22 PM

#

yeah

graceful patio Feb 21, 2025, 8:22 PM

#

minor double vocal (but not included in all dataset)

#

i heard it just cleans it out its very minimal

simple ore Feb 21, 2025, 8:23 PM

#

shit quality

graceful patio Feb 21, 2025, 8:23 PM

#

simple ore shit quality

whys that? its ai isolated

simple ore Feb 21, 2025, 8:24 PM

#

bad isolation

red kayak Feb 21, 2025, 8:24 PM

#

well there are so many issues with this

graceful patio Feb 21, 2025, 8:24 PM

#

red kayak well there are so many issues with this

whys that ? as long as there clarity in the vocal

#

thats what should matter right?

red kayak Feb 21, 2025, 8:24 PM

#

graceful patio whys that ? as long as there clarity in the vocal

you see

#

you have backing vocals in there

#

thats bad

#

causes vocal doubling and confuses the model lots

#

2nd theres instrumental bleed

#

also very bad practice for training

graceful patio Feb 21, 2025, 8:27 PM

#

red kayak you see

and just to mention i dont have izotope 11 so i cant clean it as good as usuing ai isolation

#

nor dont want to crack it bc of virsues and stuff

#

so what would be best in ai isolation

#

to clean it

red kayak Feb 21, 2025, 8:27 PM

#

a lot of frequencies between 3000 and 6000 were butcher, which may impair perforamance and training stability

jovial pollen Feb 21, 2025, 8:27 PM

#

refinegan is also very sensitive to dataset

red kayak Feb 21, 2025, 8:27 PM

#

graceful patio and just to mention i dont have izotope 11 so i cant clean it as good as usuing ...

izotope isnt whats really needed here, you can get away with other plugins just fine

red kayak Feb 21, 2025, 8:28 PM

#

jovial pollen refinegan is also very sensitive to dataset

gans arent really the main issue here, ideally you will need to effectively clean out your dataset first b4 anything

jovial pollen Feb 21, 2025, 8:28 PM

#

red kayak gans arent really the main issue here, ideally you will need to effectively clea...

aaaa aight aight

graceful patio Feb 21, 2025, 8:29 PM

#

red kayak izotope isnt whats really needed here, you can get away with other plugins just ...

what can be used then? and whats an alternaitve to izotope to keep my dataset clean?

red kayak Feb 21, 2025, 8:29 PM

#

you even have left over reverb residues which is bad ofc

graceful patio Feb 21, 2025, 8:29 PM

#

red kayak you even have left over reverb residues which is bad ofc

i used mel reformer dereverb by avenue since its more aggressive and better

#

heard its better than the old fox joy model

red kayak Feb 21, 2025, 8:31 PM

#

graceful patio i used mel reformer dereverb by avenue since its more aggressive and better

yeah but still u can have some of it left over so you will need to delete it urself

graceful patio Feb 21, 2025, 8:31 PM

#

red kayak yeah but still u can have some of it left over so you will need to delete it urs...

how can i do that tho?

red kayak Feb 21, 2025, 8:31 PM

#

graceful patio how can i do that tho?

with a noise gate

#

and by manually silencing that part

graceful patio Feb 21, 2025, 8:33 PM

#

i do hear oddly some clipping tho is there any way to fix those artifacts

#

i get that sometimes

#

like that mild clipping

red kayak Feb 21, 2025, 8:33 PM

#

graceful patio i do hear oddly some clipping tho is there any way to fix those artifacts

which audio are you talking abt

graceful patio Feb 21, 2025, 8:34 PM

#

im hearing it in this audio you sent me / has some sort of clipping in it like bleed?

#

kind of like a distortion

#

to your sample

#

like very low

red kayak Feb 21, 2025, 8:34 PM

#

no no, as you fore mentioned, thats distortion

#

clipping is different

graceful patio Feb 21, 2025, 8:35 PM

#

red kayak no no, as you fore mentioned, thats distortion

then i think thats the term , is it even possible to fix the distortion

red kayak Feb 21, 2025, 8:35 PM

#

and this type of distortion is usually to the loudness of the vocals + the instrumental removal which damages the vocals

red kayak Feb 21, 2025, 8:35 PM

#

graceful patio then i think thats the term , is it even possible to fix the distortion

not really no

graceful patio Feb 21, 2025, 8:37 PM

#

red kayak not really no

any way you can walk me through my cleaning process just quickly to see if there is anywhere wrong? i use mel reformer for vocal, and then dereverb with either fox joy or the mel reformer dereverb, de-echo if needed, denoise , and then if theres any backing vocals i use melband karakoe / then open audacity, apply a noise gate/trunacate silences and then normalize

#

what you seen (the sample i sent) was what i used and did

red kayak Feb 21, 2025, 8:39 PM

#

u can just do mel roformer and de reverb

#

not fox joy though

crude flame Feb 21, 2025, 8:40 PM

#

Anvuew mel dereverb v2 is the best btw

graceful patio Feb 21, 2025, 8:42 PM

#

red kayak u can just do mel roformer and de reverb

should i get izotope btw

#

is that reccommended

#

if i want super clean vocals

red kayak Feb 21, 2025, 8:42 PM

#

graceful patio should i get izotope btw

yes since its a great audio repair tool

#

it'll help with annoying clicks, left over reverb residues, noise removal and more

crude flame Feb 21, 2025, 8:43 PM

#

graceful patio should i get izotope btw

do you need the "funds" to get it?

red kayak Feb 21, 2025, 8:43 PM

#

crude flame do you need the "funds" to get it?

i think bro needs the "funds"

#

hook my mans up

low shard Feb 21, 2025, 8:43 PM

#

fuck yes the "funds" 🙏

crude flame Feb 21, 2025, 8:44 PM

#

graceful patio should i get izotope btw

sent you the "funds"

graceful patio Feb 21, 2025, 8:44 PM

#

red kayak it'll help with annoying clicks, left over reverb residues, noise removal and mo...

getting it

graceful patio Feb 21, 2025, 8:44 PM

#

crude flame sent you the "funds"

appercaite it

#

thank you for these funds

ruby vector Feb 21, 2025, 8:54 PM

#

how do i input audio for voice training in weights.com?

#

is it broken? because there's nowhere to import voiceclips

#

earnest knoll Feb 21, 2025, 8:56 PM

#

ruby vector

you have to press next

ruby vector Feb 21, 2025, 8:56 PM

#

oh

earnest knoll Feb 21, 2025, 8:59 PM

#

ruby vector oh

also, I wouldn't recommend training on weights, you can set up rvc in the cloud by following this tutorial if your computer can't handle it:https://docs.aihub.gg/rvc/cloud/applio-kaggle/

graceful patio Feb 21, 2025, 9:04 PM

#

red kayak i think bro needs the "funds"

by the way do esemble models like bs reformer + mel reformer combined

#

do better?

#

then the normal model itself?

red kayak Feb 21, 2025, 9:05 PM

#

not really

#

thats over kill

#

and wont really help

graceful patio Feb 21, 2025, 9:06 PM

#

red kayak not really

but has a better sdr and should perform better tho? dont you thin

#

since its combined with the best models

red kayak Feb 21, 2025, 9:06 PM

#

graceful patio but has a better sdr and should perform better tho? dont you thin

sdr is heavily inaccurate

#

especially for creating models

#

mel band has a lower sdr than bs roformer but it isnt always better

#

always look at the spectrograms and compare

ruby vector Feb 21, 2025, 9:10 PM

#

earnest knoll also, I wouldn't recommend training on weights, you can set up rvc in the cloud ...

i tried to install but i gave up halfway through because the tutorial is not very clear

#

or maybe i'm just autistic and couldn't understand it

graceful patio Feb 21, 2025, 9:25 PM

#

how does this sound

#

ill unsend

#

its just a sample of what i have

#

it does have minor echo residue

low shard Feb 21, 2025, 9:28 PM

#

@simple ore btw does Applio work with Python 3.12.3? asking because I'm updating the termux guide for ubuntu24.04 which doesn't support 3.10 anymore, I'm git cloning and running the run-install.sh

#

seems like it works until it gets to numpy 1.23.5, so I feel like there's a dependency problem

ruby vector Feb 21, 2025, 9:31 PM

#

do you have to use a vocal only version of a song for an ai cover in weights.com?

simple ore Feb 21, 2025, 9:32 PM

#

low shard <@155030383648440320> btw does Applio work with Python 3.12.3? asking because I'...

you can always use virtual environment

#

pyenv

low shard Feb 21, 2025, 9:34 PM

#

simple ore you can always use virtual environment

yeah that's right, but I was asking if you ever tested it yourself on python3.12 since you're one of the devs, just to check if it's an issue on my end lol

#

doesn't run-install already make a virtual environment btw?

low shard Feb 21, 2025, 9:36 PM

#

ruby vector do you have to use a vocal only version of a song for an ai cover in weights.com...

you can use either, they automatically separate vocals and instrumentals, or if you want upload your vocal only file

ruby vector Feb 21, 2025, 9:36 PM

#

oh i already found it out

simple ore Feb 21, 2025, 10:06 PM

#

low shard yeah that's right, but I was asking if you ever tested it yourself on python3.12...

not testing, no plans for that

low shard Feb 21, 2025, 10:06 PM

#

alright, thanks

simple ore Feb 21, 2025, 10:06 PM

#

there are some differences, not sure what needs to change

low shard Feb 21, 2025, 10:06 PM

#

gonna use pyenv

low shard Feb 21, 2025, 10:07 PM

#

simple ore there are some differences, not sure what needs to change

the difference I can confirm you is upgrading numpy

#

after that the installation didn't work so I dunno if other packages would need other changes too

simple ore Feb 21, 2025, 10:08 PM

#

should work fine with numpy 1.26.4, <2.0

low shard Feb 21, 2025, 10:08 PM

#

should

#

https://tenor.com/view/do-not-run-python-python-computer-python-coding-coding-funny-coding-meme-gif-10365831290651691441

Tenor

simple ore Feb 21, 2025, 10:18 PM

#

I can try installing python 3.12

#

but...

brittle wing Feb 22, 2025, 12:18 AM

#

when i try to use it on discord then test it i cant hear myself can someone help?

silver dirge Feb 22, 2025, 1:32 AM

#

I created a song with my own voice a long time ago but I forgot it, it's been a long time, how can I do it now?

silver dirge Feb 22, 2025, 2:08 AM

#

where can i train my model

vagrant bolt Feb 22, 2025, 5:28 AM

#

Hello, i have trouble with the voice training, every time i try to use it there is an error : AttributeError: 'FigureCanvasAgg' object has no attribute 'tostring_rgb'
Can someone helps me with it ?

weak cipher Feb 22, 2025, 6:38 AM

#

guys how to add model +tts

tough shuttle Feb 22, 2025, 6:48 AM

#

how to trainnn

simple ore Feb 22, 2025, 7:58 AM

#

vagrant bolt Hello, i have trouble with the voice training, every time i try to use it there ...

use updated colab/local app

low shard Feb 22, 2025, 9:06 AM

#

brittle wing when i try to use it on discord then test it i cant hear myself can someone help...

Wrong channel

#

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

#

elaborate:

ur PC GPU
the guide link u are using
a screenshot of ur discord and wokada settings

in #🔍│help-w-okada

low shard Feb 22, 2025, 9:07 AM

#

silver dirge where can i train my model

What's ur PC GPU

silver dirge Feb 22, 2025, 9:07 AM

#

ryzen 5 5500

low shard Feb 22, 2025, 9:07 AM

#

vagrant bolt Hello, i have trouble with the voice training, every time i try to use it there ...

Elaborate ur PC GPU and what guides link u are using

low shard Feb 22, 2025, 9:07 AM

#

silver dirge ryzen 5 5500

That's not a GPU

#

That's a CPU

silver dirge Feb 22, 2025, 9:08 AM

#

right

low shard Feb 22, 2025, 9:08 AM

#

weak cipher guys how to add model +tts

There are different Text To Speech (TTS) AIs:

GPT So Vits: RVC isn't as good as GPT So Vits for tts, but gpt so vits (few shot tts, which means needs just a lil training for models) can't use rvc models (and viceversa), and its only limited to: english, chinese & japanese, if you wanna check gpt so vits instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/, you can't use RVC model on this but its a mostly premium easy way for good quality TTS

FishSpeech: FishSpeech is a 0 shot (no explicit training needed) TTS, if you got a good pc you can use it locally else use their site

With RVC Models:

RVC is natively for Speech To Speech, but forks such as ilaria rvc mainline & applio have built in tts (using Microsoft Edge TTS to make a generated tts audio, which i suggest you to choose a tts model that is the same gender and language of the rvc model you wanna use, and then convert it with rvc)

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

You can get Applio in our docs
While Ilaria RVC Mainline here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

Ilaria RVC Zero (Running on A100 GPU, free fasted rvc on cloud) and the guide
Use Applio UI Colab (with google colab T4 free daily limit gpu)
if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc

silver dirge Feb 22, 2025, 9:08 AM

#

rx 6600

low shard Feb 22, 2025, 9:08 AM

#

tough shuttle how to trainnn

What's your PC GPU

low shard Feb 22, 2025, 9:08 AM

#

silver dirge rx 6600

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio (AMD Windows) : A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline (AMD Linux/Windows) : The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

I would suggest you Applio with Zluda

silver dirge Feb 22, 2025, 9:09 AM

#

thanks

low shard Feb 22, 2025, 9:09 AM

#

silver dirge thanks

Yw and lmk

simple ore Feb 22, 2025, 9:12 AM

#

silver dirge thanks

with 6600 you'll be able to train 30-45m dataset model overnight

low shard Feb 22, 2025, 9:13 AM

#

simple ore with 6600 you'll be able to train 30-45m dataset model overnight

Damn I thought it would be faster

low shard Feb 22, 2025, 9:13 AM

#

simple ore but...

Python moment boohooh

silver dirge Feb 22, 2025, 9:13 AM

#

I forgot everything, I need to learn it again sometime. Is there a video you can recommend on the internet that would make it easier?

low shard Feb 22, 2025, 9:14 AM

#

silver dirge I forgot everything, I need to learn it again sometime. Is there a video you can...

Nope all videos are extremely outdated

#

The only updated guides are the written ones, which I have sent you the link

simple ore Feb 22, 2025, 9:14 AM

#

if the video is 6+ month old, it is likely outdated

silver dirge Feb 22, 2025, 9:14 AM

#

sad

low shard Feb 22, 2025, 9:15 AM

#

silver dirge sad

https://docs.aihub.gg/essentials/how-to-make-voice-models/ you could also check this but AMD tutorials aren't included, however it's the same program

How to Make Voice Models

In the context of RVC, the dataset is an audio file containing the voice the model will replicate. It can be either speaking or singing.

weak cipher Feb 22, 2025, 9:41 AM

#

low shard There are different Text To Speech (TTS) AIs: GPT So Vits: RVC isn't as good a...

I want to make a live2d model using reactjs and tts but it doesn't run locally. Specifically, when I text, the model will move and speak after I finish, I will deploy it using the web so the phone can use it, but the problem is that reactjs can't do tts or I don't know how to do it, can you show me?

#

My computer is good enough but I want to do it on the web only.

hallow thistle Feb 22, 2025, 9:42 AM

#

weak cipher I want to make a live2d model using reactjs and tts but it doesn't run locally. ...

How does the TTS even work with Live2D?

weak cipher Feb 22, 2025, 9:44 AM

#

hallow thistle How does the TTS even work with Live2D?

https://www.youtube.com/watch?v=oCFm-rXI6HU&t=24s I want this but pure code with reactjs and can run on phone

weak cipher Feb 22, 2025, 9:45 AM

#

hallow thistle How does the TTS even work with Live2D?

I don't know what else to do

hallow thistle Feb 22, 2025, 9:45 AM

#

weak cipher Feb 22, 2025, 9:46 AM

#

need to find someone who can help me

#

when i finish i will build it through expo into apk

#

skullfacedistorted

hallow thistle Feb 22, 2025, 9:48 AM

#

Don't know about this one. I'm only here for basic RVC the audio changer and W-Okada the realtime audio changer steps.

weak cipher Feb 22, 2025, 9:49 AM

#

hallow thistle Don't know about this one. I'm only here for basic RVC the audio changer and W-O...

bro u knw fire base?

#

or github

#

cvt to cdn

hallow thistle Feb 22, 2025, 9:49 AM

#

Um.

#

Firebase is a cloud-based database API, that's what I know.

weak cipher Feb 22, 2025, 9:50 AM

#

what if i put the full song on github will it exceed the limit?

low shard Feb 22, 2025, 9:51 AM

#

weak cipher I want to make a live2d model using reactjs and tts but it doesn't run locally. ...

I don’t use reactjs

#

Do you really need the tts model to run locally?

#

you could maybe try adding edge tts api, It works on python so should too on js

weak cipher Feb 22, 2025, 9:51 AM

#

low shard I don’t use reactjs

javascript like that

low shard Feb 22, 2025, 9:52 AM

#

yeah I don’t use js

hallow thistle Feb 22, 2025, 9:52 AM

#

weak cipher what if i put the full song on github will it exceed the limit?

GitHub is a place to upload anything about code, not a music service like SoundCloud.

low shard Feb 22, 2025, 9:52 AM

#

weak cipher when i finish i will build it through expo into apk

so you want to make an apk that has live2d with custom 3d vtuber, tts and rvc locally?

weak cipher Feb 22, 2025, 9:52 AM

#

low shard yeah I don’t use js

u use python?

weak cipher Feb 22, 2025, 9:52 AM

#

low shard so you want to make an apk that has live2d with custom 3d vtuber, tts and rvc lo...

YES

hallow thistle Feb 22, 2025, 9:52 AM

#

weak cipher u use python?

Of course.

low shard Feb 22, 2025, 9:52 AM

#

weak cipher YES

that’s gonna run slow

weak cipher Feb 22, 2025, 9:53 AM

#

hallow thistle Of course.

call too much api bro

low shard Feb 22, 2025, 9:53 AM

#

it won’t work for realtime

weak cipher Feb 22, 2025, 9:53 AM

#

low shard it won’t work for realtime

why bro

low shard Feb 22, 2025, 9:53 AM

#

it would run on the phone CPU literally

hallow thistle Feb 22, 2025, 9:53 AM

#

Don't expect any AI to run that fast on mobile smartphone locally.

low shard Feb 22, 2025, 9:53 AM

#

Yes it would work on cpu, but it’s going to take time

weak cipher Feb 22, 2025, 9:54 AM

#

maybe try deploy on web and run?

hallow thistle Feb 22, 2025, 9:54 AM

#

But if you host the service on cloud, and make it as a hybrid-web application, that would work.

low shard Feb 22, 2025, 9:55 AM

#

I used RVC Applio on Termux on my honor 90 lite, it took 69 secs to inference 8 seconds

weak cipher Feb 22, 2025, 9:55 AM

#

make it into a web and display it on your phone=))

hallow thistle Feb 22, 2025, 9:55 AM

#

low shard I used RVC Applio on Termux on my honor 90 lite, it took 69 secs to inference 8 ...

Baffled

low shard Feb 22, 2025, 9:56 AM

#

weak cipher make it into a web and display it on your phone=))

so, you mean making a website that phones can use? Meaning the AI would run on cloud

weak cipher Feb 22, 2025, 9:56 AM

#

low shard so, you mean making a website that phones can use? Meaning the AI would run on c...

yes bro

low shard Feb 22, 2025, 9:57 AM

#

weak cipher yes bro

if it’s going to run on cloud (remote good pc), rather than the phone’s power, then yeah it would work

weak cipher Feb 22, 2025, 9:57 AM

#

low shard if it’s going to run on cloud (remote good pc), rather than the phone’s power, t...

how to use idk bro

hallow thistle Feb 22, 2025, 9:58 AM

#

But you'll have to pay the cloud service for that.

weak cipher Feb 22, 2025, 9:58 AM

#

hallow thistle But you'll have to pay the cloud service for that.

maybe try hungface?

#

free

hallow thistle Feb 22, 2025, 9:58 AM

#

Yes, you can make a space on Hugging Face.

low shard Feb 22, 2025, 9:58 AM

#

weak cipher u use python?

python, html, Jupyter, some assembly x86 and some C

low shard Feb 22, 2025, 9:59 AM

#

weak cipher free

huggingface gives just CPU for free, it wouldn’t be that good for realtime

weak cipher Feb 22, 2025, 10:00 AM

#

low shard python, html, Jupyter, some assembly x86 and some C

maybe only reactjs?

hallow thistle Feb 22, 2025, 10:00 AM

#

But if you wanna have GPU for that, you'll still have to pay for that one.

knotty moth Feb 22, 2025, 10:01 AM

#

weak cipher I don't know what else to do

if this is what you're developing, sorry no one but your own dev team could help you
you can also try asking chatgpt or claude to help fixing some code issues

low shard Feb 22, 2025, 10:01 AM

#

weak cipher maybe only reactjs?

wdym

weak cipher Feb 22, 2025, 12:12 PM

#

knotty moth if this is what you're developing, sorry no one but your own dev team could help...

No team bro

knotty moth Feb 22, 2025, 12:26 PM

#

weak cipher No team bro

I mean how could you get anyone's help unless you open source the project?

weak cipher Feb 22, 2025, 12:29 PM

#

knotty moth I mean how could you get anyone's help unless you open source the project?

show the code and will help?

knotty moth Feb 22, 2025, 12:38 PM

#

weak cipher show the code and will help?

too bad I've lost my interest on the dev shit, but I suppose you could use chatgpt/claude anyway

chilly slate Feb 22, 2025, 12:56 PM

#

#✨│ai-help hello is anyone here who is good in setup and use of RVC i was trying to use but i cannot i don't know how can anyone please help me its urgent

low shard Feb 22, 2025, 1:06 PM

#

chilly slate <#1159290139609137264> hello is anyone here who is good in setup and use of RVC ...

Elaborate your issue

chilly slate Feb 22, 2025, 1:08 PM

#

low shard Elaborate your issue

rvc when i input my sample voice model.pth and index.index it shows error of missing hubert file downloaded and aaded then statted showing another like that

low shard Feb 22, 2025, 1:09 PM

#

chilly slate rvc when i input my sample voice model.pth and index.index it shows error of mis...

What's your PC GPU? What guide/RVC link are you using? Show a screenshot of the error too

hallow thistle Feb 22, 2025, 1:10 PM

#

chilly slate <#1159290139609137264> hello is anyone here who is good in setup and use of RVC ...

Which RVC program are you trying to use? RVC GUI, Applio or anything else?

chilly slate Feb 22, 2025, 1:10 PM

#

low shard What's your PC GPU? What guide/RVC link are you using? Show a screenshot of the ...

gpu nvdia 1650 don't knw abt guide error of ss can uh text me personally

low shard Feb 22, 2025, 1:11 PM

#

chilly slate gpu nvdia 1650 don't knw abt guide error of ss can uh text me personally

Which rvc download link did you use?

chilly slate Feb 22, 2025, 1:11 PM

#

hallow thistle Which RVC program are you trying to use? RVC GUI, Applio or anything else?

don't knw i searched on chat gpt and it gave me a github link and i used that

low shard Feb 22, 2025, 1:11 PM

#

Also you have permission to send SS, is there a reason why you want to do it in DMS?

hallow thistle Feb 22, 2025, 1:11 PM

#

chilly slate don't knw i searched on chat gpt and it gave me a github link and i used that

No. Never trust anything about RVC from ChatGPT.

low shard Feb 22, 2025, 1:11 PM

#

chilly slate don't knw i searched on chat gpt and it gave me a github link and i used that

ChatGPT can't know much about RVC/Wokada

chilly slate Feb 22, 2025, 1:11 PM

#

low shard Also you have permission to send SS, is there a reason why you want to do it in ...

nahh i will send here i thought i don't have permission

chilly slate Feb 22, 2025, 1:12 PM

#

hallow thistle No. Never trust anything about RVC from ChatGPT.

i guess thats right

low shard Feb 22, 2025, 1:12 PM

#

chilly slate nahh i will send here i thought i don't have permission

You should be able to send them since you have permission

chilly slate Feb 22, 2025, 1:12 PM

#

low shard You should be able to send them since you have permission

okey lemme start rvc again and then send uh

hallow thistle Feb 22, 2025, 1:13 PM

#

You can screenshot the folder of it.

#

I can identify the RVC program by folder name.

low shard Feb 22, 2025, 1:13 PM

#

I mean even the UI is recognizable

hallow thistle Feb 22, 2025, 1:14 PM

#

true matsuripray

chilly slate Feb 22, 2025, 1:14 PM

#

hallow thistle I can identify the RVC program by folder name.

ok

chilly slate Feb 22, 2025, 1:15 PM

#

hallow thistle I can identify the RVC program by folder name.

Retrieval-based-Voice-Conversion-WebUI-main

chilly slate Feb 22, 2025, 1:15 PM

#

low shard I mean even the UI is recognizable

Retrieval-based-Voice-Conversion-WebUI-main

hallow thistle Feb 22, 2025, 1:15 PM

#

chilly slate Retrieval-based-Voice-Conversion-WebUI-main

-gui

karmic oliveBOT Feb 22, 2025, 1:15 PM

#

hallow thistle -gui

https://cdn.discordapp.com/attachments/1122285248844144733/1203460490475343953/caption.gif?ex=65d12cec&is=65beb7ec&hm=bd2fb8d010006dd7c6e3c1c67d3ae846fd1478e1a3124c544c31b43086fe54aa&

chilly slate Feb 22, 2025, 1:15 PM

#

hallow thistle -gui

low shard Feb 22, 2025, 1:15 PM

#

chilly slate Retrieval-based-Voice-Conversion-WebUI-main

https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI

GitHub

GitHub - RVC-Project/Retrieval-based-Voice-Conversion-WebUI: Easily...

Easily train a good VC model with voice data <= 10 mins! - RVC-Project/Retrieval-based-Voice-Conversion-WebUI

hallow thistle Feb 22, 2025, 1:15 PM

#

Damn, I guessed it right. The RVC GUI is too old now.

low shard Feb 22, 2025, 1:16 PM

#

hallow thistle -gui

That's not RVC GUI

#

RVC GUI is another fork made by t1g3r

#

This is Mainline RVC

chilly slate Feb 22, 2025, 1:17 PM

#

hallow thistle Damn, I guessed it right. The RVC GUI is too old now.

fuck man i wasted my night in this shit

low shard Feb 22, 2025, 1:17 PM

#

chilly slate

Show a screenshot of the error too, also is it a public model which you can send the model download link?

hallow thistle Feb 22, 2025, 1:17 PM

#

Not the same one as RVC GUI, but still sounds like it.

low shard Feb 22, 2025, 1:17 PM

#

chilly slate fuck man i wasted my night in this shit

It's not RVC GUI, it's the original/mainline rvc, it isn't much updated as Applio (a fork) but it still can be used

hallow thistle Feb 22, 2025, 1:18 PM

#

chilly slate Feb 22, 2025, 1:18 PM

#

low shard Show a screenshot of the error too, also is it a public model which you can send...

uh sent the model remember

low shard Feb 22, 2025, 1:19 PM

#

chilly slate uh sent the model remember

Can you re-send it? And also show the SS of the error

chilly slate Feb 22, 2025, 1:21 PM

#

low shard Show a screenshot of the error too, also is it a public model which you can send...

tats it

hallow thistle Feb 22, 2025, 1:21 PM

#

Rmvpe model file is missing?

#

Are you running RVC from an installed Python and not compiled-portable one?

chilly slate Feb 22, 2025, 1:23 PM

#

low shard Can you re-send it? And also show the SS of the error

https://www.weights.gg/models/clroz1aic012sjmfug54yft0u , https://huggingface.co/legitdark/psych2go-By-Dan/resolve/main/psych2go-By-Dan.zip

these twop

chilly slate Feb 22, 2025, 1:24 PM

#

hallow thistle Are you running RVC from an installed Python and not compiled-portable one?

what uh mean first it was saying hubert_base.pt is missing i downloaded it from google and added then this rmvpe

hallow thistle Feb 22, 2025, 1:25 PM

#

That's so far messed up.

chilly slate Feb 22, 2025, 1:27 PM

#

hallow thistle That's so far messed up.

ohh belive me i know that very well

chilly slate Feb 22, 2025, 1:27 PM

#

hallow thistle That's so far messed up.

can uh guide me to download nd install from start i hope that should do the trick

hallow thistle Feb 22, 2025, 1:28 PM

#

Typical folder path for the already compiled RVC isn't supposed to be inside an installed Python folder, which is found in C:\Users\"your username"\ or Program Files, unless you're trying to develop a fork RVC by yourself.

low shard Feb 22, 2025, 1:28 PM

#

chilly slate tats it

I feel like you missed some steps of the manual installation

#

you can delete that mainline you got

#

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

the local links I sent are guides for the precompiled version, if you want to do it locally

hallow thistle Feb 22, 2025, 1:31 PM

#

You can't say you know everything on how to install a Python program when you struggle to get it to work by hands. boohooh

chilly slate Feb 22, 2025, 1:38 PM

#

low shard Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), ...

dude can uh guide me step by step?

chilly slate Feb 22, 2025, 1:39 PM

#

low shard I feel like you missed some steps of the manual installation

may b donno

chilly slate Feb 22, 2025, 1:39 PM

#

low shard Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), ...

i am not a veteran so i didn't get this at all

hallow thistle Feb 22, 2025, 1:40 PM

#

chilly slate dude can uh guide me step by step?

Never ask someone to teach you every too little step.

#

!howtoask

patent trellisBOT Feb 22, 2025, 1:40 PM

#

hallow thistle !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

weak cipher Feb 22, 2025, 1:40 PM

#

too slow chat guys =))

hallow thistle Feb 22, 2025, 1:41 PM

#

Now make it to speak in English. skullfacedistorted

weak cipher Feb 22, 2025, 1:41 PM

#

I don't know if coqui still supports it guys.

low shard Feb 22, 2025, 1:41 PM

#

chilly slate dude can uh guide me step by step?

I already gave you all the links with step by step guides, is there a specific issue?

chilly slate Feb 22, 2025, 1:42 PM

#

low shard I already gave you all the links with step by step guides, is there a specific i...

yeah while instaliling man it suckes so much

low shard Feb 22, 2025, 1:42 PM

#

chilly slate yeah while instaliling man it suckes so much

what's the issue?

weak cipher Feb 22, 2025, 1:42 PM

#

hallow thistle Now make it to speak in English. <:skullfacedistorted:1159654720735035392>

trash bro angry

low shard Feb 22, 2025, 1:42 PM

#

I told you to uninstall the one you got, and to choose one of the working links

weak cipher Feb 22, 2025, 1:42 PM

#

trash meta

low shard Feb 22, 2025, 1:42 PM

#

I also explained you the difference between them

chilly slate Feb 22, 2025, 1:42 PM

#

low shard what's the issue?

are uh available i will install it later if i get any issue can i count on uh?

chilly slate Feb 22, 2025, 1:42 PM

#

low shard I told you to uninstall the one you got, and to choose one of the working links

sure

hallow thistle Feb 22, 2025, 1:43 PM

#

weak cipher trash bro <:angry:1169302699699875891>

Sounds like you didn't make the English-speaking variant of it. That's what I know.

weak cipher Feb 22, 2025, 1:44 PM

#

hallow thistle Sounds like you didn't make the English-speaking variant of it. That's what I kn...

Is there a limit to the number of chats in one session?

low shard Feb 22, 2025, 1:45 PM

#

chilly slate are uh available i will install it later if i get any issue can i count on uh?

you can ping me for any issue

hallow thistle Feb 22, 2025, 1:46 PM

#

weak cipher Is there a limit to the number of chats in one session?

I can still see some Vietnamese texts in the code section.

chilly slate Feb 22, 2025, 1:46 PM

#

low shard you can ping me for any issue

for now can uh tell me which one should i use my goal is to achive a voice that can express emotion perfectly

weak cipher Feb 22, 2025, 1:47 PM

#

hallow thistle I can still see some Vietnamese texts in the code section.

limit ? =)))

hallow thistle Feb 22, 2025, 1:47 PM

#

chilly slate for now can uh tell me which one should i use my goal is to achive a voice that ...

A voice model that can do the most emotional expression? Don't think that's a thing.

low shard Feb 22, 2025, 1:48 PM

#

chilly slate for now can uh tell me which one should i use my goal is to achive a voice that ...

any of the ones I said can be good quality, if you train it yourself good

#

also it has some limits like it can't laugh well

chilly slate Feb 22, 2025, 1:48 PM

#

low shard any of the ones I said can be good quality, if you train it yourself good

i don't have good voice samples so i have to go with pre trained for now

low shard Feb 22, 2025, 1:48 PM

#

I'm not sure what you mean exactly with emotional expression though

low shard Feb 22, 2025, 1:49 PM

#

chilly slate i don't have good voice samples so i have to go with pre trained for now

then the quality depends on the model you use

chilly slate Feb 22, 2025, 1:49 PM

#

low shard I'm not sure what you mean exactly with emotional expression though

wait lemme send uh a link my goal is to achive a voice like that

hallow thistle Feb 22, 2025, 1:49 PM

#

weak cipher limit ? =)))

Well, the code you made can do much just that.

chilly slate Feb 22, 2025, 1:49 PM

#

low shard I'm not sure what you mean exactly with emotional expression though

https://youtu.be/E8EMBCzmvdc?si=W-pzMt0zlrQouHKN

crude flame Feb 22, 2025, 1:50 PM

#

low shard also it has some limits like it can't laugh well

Models can laugh you just need enough laughing in the set and maybe it needs it's own only laughing slices

low shard Feb 22, 2025, 1:51 PM

#

chilly slate https://youtu.be/E8EMBCzmvdc?si=W-pzMt0zlrQouHKN

I think that would be possible, might need some voice acting too

weak cipher Feb 22, 2025, 1:51 PM

#

hallow thistle Well, the code you made can do much just that.

use api 🐧

chilly slate Feb 22, 2025, 1:51 PM

#

low shard I think that would be possible, might need some voice acting too

so ?

low shard Feb 22, 2025, 1:51 PM

#

crude flame Models *can* laugh you just need enough laughing in the set and maybe it needs i...

fair point

low shard Feb 22, 2025, 1:52 PM

#

chilly slate so ?

so you can choose any

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

local = runs on your pc

weak cipher Feb 22, 2025, 1:52 PM

#

don't know how to remove its limit or it has some error ❓

low shard Feb 22, 2025, 1:52 PM

#

cloud = remote good pc

#

as you got a gtx 1650, cloud will be faster, but you will have limited gpu time on cloud

chilly slate Feb 22, 2025, 1:54 PM

#

low shard as you got a gtx 1650, cloud will be faster, but you will have limited gpu time ...

cloud will be paid if i use free one i won't get a good one

low shard Feb 22, 2025, 1:54 PM

#

chilly slate cloud will be paid if i use free one i won't get a good one

you will get a good one, it's the same program you would run locally and paid

chilly slate Feb 22, 2025, 1:55 PM

#

low shard you will get a good one, it's the same program you would run locally and paid

did't get that

low shard Feb 22, 2025, 1:55 PM

#

you would either use google colab or kaggle, which are basically remote good pc borrowed by google that run the code

#

it still runs the same RVC program

#

the quality doesn't change if you use it on cloud for free or locally

#

what changes is if you pay for cloud it will be even more faster

chilly slate Feb 22, 2025, 1:56 PM

#

how much time limit of this google cloud?

hallow thistle Feb 22, 2025, 1:56 PM

#

The quality of voice model depends on how you train it. The settings, audio dataset you use.

low shard Feb 22, 2025, 1:56 PM

#

chilly slate how much time limit of this google cloud?

google colab gives random 4 hours max daily, it can be random

kaggle gives 30 hours weekly of better gpus than colab

#

this is the free limit

hallow thistle Feb 22, 2025, 1:57 PM

#

chilly slate how much time limit of this google cloud?

4 hours average a day if running with GPU on Google Colab.

chilly slate Feb 22, 2025, 1:57 PM

#

low shard google colab gives random 4 hours max daily, it can be random kaggle gives 30 h...

if i m gonna use pre trained isnt running locally is a good choice

weak cipher Feb 22, 2025, 1:58 PM

#

using hugface and no gpu how long will it take =)))

hallow thistle Feb 22, 2025, 1:59 PM

#

Even if I had no experience of development in the past, I still know how to install RVC and any other Python related program locally. nails

crude flame Feb 22, 2025, 1:59 PM

#

weak cipher using hugface and no gpu how long will it take =)))

Training will take weeks, inference couple minutes

weak cipher Feb 22, 2025, 2:00 PM

#

crude flame Training will take weeks, inference couple minutes

real time how long?

hallow thistle Feb 22, 2025, 2:01 PM

#

Despite how weak my laptop is, RVC and Applio even worked, but took hours to finishing a single audio.

crude flame Feb 22, 2025, 2:02 PM

#

weak cipher real time how long?

Idk depends on chunk and extra

low shard Feb 22, 2025, 2:02 PM

#

chilly slate if i m gonna use pre trained isnt running locally is a good choice

it will work, just won't be as fast as cloud, but it won't be time limited

#

it's your choice

chilly slate Feb 22, 2025, 2:03 PM

#

low shard it will work, just won't be as fast as cloud, but it won't be time limited

little speed issue can be compromised but limit is a issue

low shard Feb 22, 2025, 2:04 PM

#

chilly slate little speed issue can be compromised but limit is a issue

alright then, if you go with local, I suggest you Applio

chilly slate Feb 22, 2025, 2:04 PM

#

low shard it's your choice

so what should i choose Applio or Mainline?

low shard Feb 22, 2025, 2:05 PM

#

weak cipher real time how long?

inference = use models

realtime inference could take a while too to the point it's not realtime anymore

chilly slate Feb 22, 2025, 2:05 PM

#

low shard alright then, if you go with local, I suggest you Applio

okey thanks do uh knw after installation how to integrate it with my ai assistant

low shard Feb 22, 2025, 2:05 PM

#

chilly slate so what should i choose Applio or Mainline?

I would uggest Applio as it got more updates and easier user interface

chilly slate Feb 22, 2025, 2:07 PM

#

chilly slate okey thanks do uh knw after installation how to integrate it with my ai assistan...

@low shard

low shard Feb 22, 2025, 2:08 PM

#

chilly slate okey thanks do uh knw after installation how to integrate it with my ai assistan...

what AI assistant

chilly slate Feb 22, 2025, 2:08 PM

#

low shard what AI assistant

ai ? like the one uh saw in video

weak cipher Feb 22, 2025, 2:09 PM

#

low shard inference = use models realtime inference could take a while too to the point ...

so will pay for gpu brud so trash

#

why not free boohooh

low shard Feb 22, 2025, 2:12 PM

#

chilly slate ai ? like the one uh saw in video

that seems like a mix of an LLM + TTS, RVC is STS, the only way to use RVC as a TTS would be to generate an audio with another TTS first then use it as an input in rvc

#

TTS = Text to Speech

#

STS = Speech to Speech

#

LLM = Large Language Model, like chatbots

chilly slate Feb 22, 2025, 2:13 PM

#

low shard LLM = Large Language Model, like chatbots

yes that what i am hoping to achive

weak cipher Feb 22, 2025, 2:16 PM

#

boohooh

low shard Feb 22, 2025, 2:18 PM

#

chilly slate yes that what i am hoping to achive

Ig the only way for you to do that would be using either SillyTavern or OpenWebUI with Ollama

#

they have also a Speech to text integration

#

but you wouldn't be able to use it with RVC models

#

https://docs.sillytavern.app/extensions/rvc/ actually I think you can with SillyTavern

Retrieval-based Voice Conversion (RVC) | docs.ST.app

This guide will walk you through using RVC, a technique that allows transferring voice features from one audio clip to another, enabling voices to...

#

I don't really use SillyTavern though

weak cipher Feb 22, 2025, 2:22 PM

#

https://github.com/jaywalnut310/vits guys how to use this

low shard Feb 22, 2025, 2:22 PM

#

weak cipher why not free <:boohooh:1176674698629750975>

GPUs are expensive, why would someone give 24/7 unlimited free gpus?

weak cipher Feb 22, 2025, 2:23 PM

#

low shard GPUs are expensive, why would someone give 24/7 unlimited free gpus?

does it work on hugface with free account?

weak cipher Feb 22, 2025, 2:23 PM

#

weak cipher https://github.com/jaywalnut310/vits guys how to use this

this

weak cipher Feb 22, 2025, 2:23 PM

#

low shard GPUs are expensive, why would someone give 24/7 unlimited free gpus?

yeah why 🤣

low shard Feb 22, 2025, 2:24 PM

#

weak cipher https://github.com/jaywalnut310/vits guys how to use this

that's a TTS, not a STS like RVC

#

it won't work with rvc models

#

tts = text to speech

#

sts = speech to speech

weak cipher Feb 22, 2025, 2:25 PM

#

low shard that's a TTS, not a STS like RVC

yeah i mean run that tts vits on hugface and let the live2d model speak ok =))

#

and to stt for another free cpu service?

low shard Feb 22, 2025, 2:27 PM

#

weak cipher and to stt for another free cpu service?

CPUs are slow, they aren't that good for AI

#

a good stt is whisper-v3-large

low shard Feb 22, 2025, 2:29 PM

#

weak cipher yeah i mean run that tts vits on hugface and let the live2d model speak ok =))

i'm not sure how fast it would be on cpu

weak cipher Feb 22, 2025, 2:30 PM

#

with just one model i think it will be ok

weak cipher Feb 22, 2025, 2:30 PM

#

low shard a good stt is whisper-v3-large

whisper-v3-large what is that

low shard Feb 22, 2025, 2:30 PM

#

weak cipher with just one model i think it will be ok

it's still just a CPU

low shard Feb 22, 2025, 2:30 PM

#

weak cipher whisper-v3-large what is that

https://huggingface.co/openai/whisper-large-v3

openai/whisper-large-v3 · Hugging Face

weak cipher Feb 22, 2025, 2:32 PM

#

brud

#

english only

low shard Feb 22, 2025, 2:33 PM

#

weak cipher brud

read the whole thing

weak cipher Feb 22, 2025, 2:35 PM

#

oh see

weak cipher Feb 22, 2025, 2:37 PM

#

low shard read the whole thing

if made of coqui is it optimized to eat cpu?

#

I see him using this much capacity with coqui

chilly slate Feb 22, 2025, 2:47 PM

#

low shard Ig the only way for you to do that would be using either SillyTavern or OpenWebU...

can uh guide me how i can achive this??

chilly slate Feb 22, 2025, 2:48 PM

#

weak cipher I see him using this much capacity with coqui

dude can uh share link of this vid?

low shard Feb 22, 2025, 2:48 PM

#

weak cipher I see him using this much capacity with coqui

that is using a GPU

warm garnet Feb 22, 2025, 2:49 PM

#

i need help

low shard Feb 22, 2025, 2:49 PM

#

chilly slate can uh guide me how i can achive this??

Sorry but I don't use SillyTavern myself so I can't help you on that

low shard Feb 22, 2025, 2:50 PM

#

warm garnet i need help

!howtoask

patent trellisBOT Feb 22, 2025, 2:50 PM

#

low shard !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

low shard Feb 22, 2025, 2:50 PM

#

Please, elaborate your help request so helpers can help you

warm garnet Feb 22, 2025, 2:50 PM

#

everytime i try to open start_http it wont let me open

low shard Feb 22, 2025, 2:51 PM

#

for example, tell:

your pc gpu
the link of the guide you're following
what's the issue

low shard Feb 22, 2025, 2:51 PM

#

warm garnet everytime i try to open start_http it wont let me open

oh, you're following an outdated youtube tutorial about wokada for realtime voice changing

#

that file is only on that program

#

youtube tuts are old

#

don't follow them

warm garnet Feb 22, 2025, 2:51 PM

#

https://huggingface.co/wok000/vcclient000/tree/main i download from this one

weak cipher Feb 22, 2025, 2:51 PM

#

chilly slate dude can uh share link of this vid?

no free =)) trash bro

warm garnet Feb 22, 2025, 2:51 PM

#

warm garnet https://huggingface.co/wok000/vcclient000/tree/main i download from this one

18a

#

12 months ago

chilly slate Feb 22, 2025, 2:52 PM

#

low shard Sorry but I don't use SillyTavern myself so I can't help you on that

oh its okey

low shard Feb 22, 2025, 2:52 PM

#

warm garnet 18a

that's an old version of the original wokada, it has worse performance and worse quality

weak cipher Feb 22, 2025, 2:52 PM

#

low shard that is using a GPU

optimized to eat cpu ❓

low shard Feb 22, 2025, 2:52 PM

#

also this is the wrong channel

warm garnet Feb 22, 2025, 2:52 PM

#

oh wrong channel

#

so where the right one

low shard Feb 22, 2025, 2:52 PM

#

warm garnet oh wrong channel

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-reocrded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

pls tell ur pc gpu in #🔍│help-w-okada

chilly slate Feb 22, 2025, 2:52 PM

#

weak cipher no free =)) trash bro

i found out myself lol

low shard Feb 22, 2025, 2:53 PM

#

weak cipher optimized to eat cpu ❓

to eat cpu? you mean optimized to use cpu? I'm not sure how fas thtat will be, also coqui has shutdown it's site and their TTS are discontinued btw

weak cipher Feb 22, 2025, 2:53 PM

#

chilly slate i found out myself lol

Can you make it to display on the web?

chilly slate Feb 22, 2025, 2:54 PM

#

weak cipher Can you make it to display on the web?

dude i found out vdo that i was asking uh to send link

weak cipher Feb 22, 2025, 2:54 PM

#

low shard to eat cpu? you mean optimized to *use* cpu? I'm not sure how fas thtat will be,...

so sad

weak cipher Feb 22, 2025, 2:54 PM

#

chilly slate dude i found out vdo that i was asking uh to send link

https://www.youtube.com/watch?v=oCFm-rXI6HU&t=24s this?

chilly slate Feb 22, 2025, 2:54 PM

#

weak cipher https://www.youtube.com/watch?v=oCFm-rXI6HU&t=24s this?

yeah but i already foundd it

weak cipher Feb 22, 2025, 2:55 PM

#

chilly slate yeah but i already foundd it

can you do it =))

chilly slate Feb 22, 2025, 2:56 PM

#

weak cipher can you do it =))

what exactly showing 2d model on screen ? or a ai assistant that have 2d model ?

weak cipher Feb 22, 2025, 2:57 PM

#

chilly slate what exactly showing 2d model on screen ? or a ai assistant that have 2d model ?

it is just a model

chilly slate Feb 22, 2025, 2:57 PM

#

weak cipher it is just a model

uh can do pleanty with just a mdel if uh how

weak cipher Feb 22, 2025, 2:57 PM

#

and they used coqui llma to make it speak and convert text to speech

chilly slate Feb 22, 2025, 2:58 PM

#

weak cipher and they used coqui llma to make it speak and convert text to speech

yeah

weak cipher Feb 22, 2025, 2:58 PM

#

exactly that's what i wanted to do but it works on web brud

#

i think you know how to do

chilly slate Feb 22, 2025, 3:00 PM

#

weak cipher exactly that's what i wanted to do but it works on web brud

i also aming same thing i didn't quite reached to that state but i do have some knowledge

weak cipher Feb 22, 2025, 3:01 PM

#

distress

chilly slate Feb 22, 2025, 3:03 PM

#

weak cipher <:distress:1159577413244686407>

uh should use VSeeFace + VRM Model

#

uh can create 3d model with it

weak cipher Feb 22, 2025, 3:06 PM

#

chilly slate uh should use VSeeFace + VRM Model

I have a live2d model, I don't like 3d very much, now I just don't know how to make tts and status on the web.

knotty moth Feb 22, 2025, 3:09 PM

#

chilly slate uh can create 3d model with it

live2d and 3d models are different thing, not all vtubers would want to use the latter

weak cipher Feb 22, 2025, 3:10 PM

#

knotty moth live2d and 3d models are different thing, not all vtubers would want to use the ...

yeah

chilly slate Feb 22, 2025, 3:10 PM

#

weak cipher I have a live2d model, I don't like 3d very much, now I just don't know how to m...

use vtuber studiio

weak cipher Feb 22, 2025, 3:11 PM

#

chilly slate use vtuber studiio

then how to make it speak and convert speech to text brud

chilly slate Feb 22, 2025, 3:12 PM

#

weak cipher then how to make it speak and convert speech to text brud

search on youtube vtube studio tutorial uh will get many vdos its ezy

weak cipher Feb 22, 2025, 3:13 PM

#

I mean I want to be like him bro =))

knotty moth Feb 22, 2025, 3:14 PM

#

weak cipher then how to make it speak and convert speech to text brud

the lipsync and model rigging part is out of my knowledge, but I think you should try integrating the live2d models and stuffs in a unity project

weak cipher Feb 22, 2025, 3:16 PM

#

knotty moth the lipsync and model rigging part is out of my knowledge, but I think you shoul...

unity is game design and i want to do web

knotty moth Feb 22, 2025, 3:20 PM

#

weak cipher unity is game design and i want to do web

fun fact: the vtube studio application is actually made using unity

weak cipher Feb 22, 2025, 3:21 PM

#

knotty moth fun fact: the vtube studio application is actually made using unity

🐧 it will probably take up a lot of ram

knotty moth Feb 22, 2025, 3:22 PM

#

weak cipher 🐧 it will probably take up a lot of ram

yea on a 2008 era laptop

weak cipher Feb 22, 2025, 3:22 PM

#

Either way, it has to convert speech to text and vice versa.

weak cipher Feb 22, 2025, 3:22 PM

#

knotty moth yea on a 2008 era laptop

do you think this is enough

#

I only have 1 64g bar left lol

knotty moth Feb 22, 2025, 3:26 PM

#

weak cipher do you think this is enough

dont even think it is more demanding than genshin impact

potent helm Feb 22, 2025, 3:27 PM

#

Is the a Applio Collab that's able to use the new hifigan pretrains?

weak cipher Feb 22, 2025, 3:27 PM

#

knotty moth dont even think it is more demanding than genshin impact

u play?

low shard Feb 22, 2025, 3:33 PM

#

potent helm Is the a Applio Collab that's able to use the new hifigan pretrains?

u already got answered in #🧬│ai-chat

fast scarab Feb 22, 2025, 3:46 PM

#

Hello, I have a question about using Kaggle for training. If I turn off my PC, will the training process stop, or does Kaggle continue running the notebook in the cloud? Is there any way to keep it running even if I close my computer?

weak cipher Feb 22, 2025, 3:48 PM

#

too slow

stone wyvern Feb 22, 2025, 4:17 PM

#

can someone please help me ?

weak cipher Feb 22, 2025, 4:18 PM

#

stone wyvern can someone please help me ?

help?

stone wyvern Feb 22, 2025, 4:18 PM

#

yeah kinda

low shard Feb 22, 2025, 4:18 PM

#

stone wyvern can someone please help me ?

!howtoask

patent trellisBOT Feb 22, 2025, 4:18 PM

#

low shard !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ <:matsuripray:1159685390156967936>

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

low shard Feb 22, 2025, 4:18 PM

#

please elaborate your request

stone wyvern Feb 22, 2025, 4:19 PM

#

i dont know how to use these AIs to change my voice into my fav voice model

low shard Feb 22, 2025, 4:19 PM

#

stone wyvern i dont know how to use these AIs to change my voice into my fav voice model

do you want to do it on realtime or pre-recorded audios

#

also what's your PC GPU

stone wyvern Feb 22, 2025, 4:19 PM

#

pre-recorded

stone wyvern Feb 22, 2025, 4:19 PM

#

low shard also what's your PC GPU

its integrated one

#

i dont have a dedicated GPU

low shard Feb 22, 2025, 4:20 PM

#

stone wyvern its integrated one

damn, I'm guessing you don't have another GPU so your only way is via cloud (remote good pc with a daily gpu limit)

#

Train (make) RVC Models on cloud:

Prepare the Dataset
Setup RVC:
Choose a cloud way to use RVC,

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):
- Applio (ui)
- Mainline (UI)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest way and for free, is using https://weights.gg which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.gg: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio (ui)

stone wyvern Feb 22, 2025, 4:20 PM

#

idk how to use these

#

...

low shard Feb 22, 2025, 4:25 PM

#

stone wyvern idk how to use these

for each one, I sent an hyperlink, blue text that when clicked redirect u to a link, in this case, a step to step guide

#

you just have to read it

stone wyvern Feb 22, 2025, 4:26 PM

#

ook

low shard Feb 22, 2025, 4:26 PM

#

if you just want the easiest way ever possible to do inference, try weights.gg

stone wyvern Feb 22, 2025, 4:26 PM

#

and what about the hugging face one ?

#

i tried that once

#

that was good

low shard Feb 22, 2025, 4:27 PM

#

stone wyvern and what about the hugging face one ?

it's good too, just that you have to manually separate the vocals and instrumentals https://docs.aihub.gg/rvc/resources/dataset-isolation

Dataset & Isolation

Last update: Dec 24, 2024

stone wyvern Feb 22, 2025, 4:27 PM

#

low shard it's good too, just that you have to manually separate the vocals and instrument...

noo

#

the reason why i need AI is

#

that

#

i want to change my pre recorded vocals into some famous rappers or singers

low shard Feb 22, 2025, 4:30 PM

#

stone wyvern i want to change my pre recorded vocals into some famous rappers or singers

yeah then you can use either, they both will work

stone wyvern Feb 22, 2025, 4:30 PM

#

okk

weak cipher Feb 22, 2025, 5:08 PM

#

hugface gives 100GB for free, so if I upload 100GB of photos and transfer to CDN, will it violate their policy?

echo oasis Feb 22, 2025, 7:17 PM

#

hey guys do you have any recommended free site for vocal isolation?

tame mica Feb 22, 2025, 7:32 PM

#

echo oasis hey guys do you have any recommended free site for vocal isolation?

mvsep, x-minus, or this colab notebook https://colab.research.google.com/github/jarredou/Music-Source-Separation-Training-Colab-Inference/blob/main/Music_Source_Separation_Training_(Colab_Inference).ipynb

trim dune Feb 22, 2025, 7:36 PM

#

How to use voices zip to convert into ai voice

brittle forge Feb 22, 2025, 8:16 PM

#

what to do in Realtime Voice Changer when I speak, my voice is choppy?

stoic viper Feb 22, 2025, 9:59 PM

#

Hi,
I’m having issues using custom pre-trained models on Applio with RVC. Default models work fine, but with custom models (like KLM5 RefineGAN), I get this error:
“The parameters of the pretrain model such as the sample rate or architecture do not match the selected model.

low shard Feb 22, 2025, 10:23 PM

#

stoic viper Hi, I’m having issues using custom pre-trained models on Applio with RVC. Defaul...

did you use the main branch?

low shard Feb 22, 2025, 10:24 PM

#

trim dune How to use voices zip to convert into ai voice

what's ur pc gpu? what do u want to do exactly?

low shard Feb 22, 2025, 10:24 PM

#

brittle forge what to do in Realtime Voice Changer when I speak, my voice is choppy?

wrong channel

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

elaborate the issue, your pc gpu, and the guide link you followed in #🔍│help-w-okada

stoic viper Feb 22, 2025, 11:11 PM

#

low shard did you use the main branch?

Yes, I’m using the main branch and the latest version of Applio (3.2.8 bugfix), but I’m still facing the issue. Default models work perfectly, but with custom pre-trained models like KLM5 RefineGAN, I get the error:
“The parameters of the pretrain model such as the sample rate or architecture do not match the selected model.”

low shard Feb 22, 2025, 11:12 PM

#

stoic viper Yes, I’m using the main branch and the latest version of Applio (3.2.8 bugfix), ...

I’m using the main branch and the latest version of Applio (3.2.8 bugfix)
those are 2 different versions

#

are you doing it locally or on colab ui?

stoic viper Feb 22, 2025, 11:19 PM

#

low shard > I’m using the main branch and the latest version of Applio (3.2.8 bugfix) thos...

Yes, I’m using the main branch and running it locally on my PC.

midnight crater Feb 23, 2025, 1:35 AM

#

Hello, I couldn't find the answer to this question searching through the chat history, so hopefully this isn't a repeat question with a really obvious answer. I'm trying to do a song cover of a female voice singing Fuck Her Gently by Tenacious D. Most the song goes just fine, but at the end when he rises into his falsetto the model becomes staticy and robotic. I'm using the base RVC program, and I've tried messing with the different settings with no improvement. Any advice or guidance is welcome.

vagrant marsh Feb 23, 2025, 2:17 AM

#

0%? I guess something is wrong

Screenshot_2025-02-22-23-16-09-053_com.android.chrome.jpg

#

What are these percentages on the epoch?

#

Btw

tough shuttle Feb 23, 2025, 3:10 AM

#

how do i train from one of my save points

green shell Feb 23, 2025, 3:24 AM

#

where do i get rvc

hallow thistle Feb 23, 2025, 3:41 AM

#

green shell where do i get rvc

What is your PC GPU? Applio the RVC is one of the easiest RVC program you can install locally, and it also runs with GPU.

hallow thistle Feb 23, 2025, 3:42 AM

#

weak cipher hugface gives 100GB for free, so if I upload 100GB of photos and transfer to CDN...

Why did you ask like this?

#

imdead

hallow thistle Feb 23, 2025, 3:44 AM

#

echo oasis hey guys do you have any recommended free site for vocal isolation?

Most of websites for audio separation are paid to use. It would be better to go for Google Colab instead.

molten fog Feb 23, 2025, 3:54 AM

#

#

what sampling rate would i use in this case?

brittle wing Feb 23, 2025, 3:59 AM

#

-colab

karmic oliveBOT Feb 23, 2025, 3:59 AM

#

brittle wing -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

knotty moth Feb 23, 2025, 4:15 AM

#

molten fog what sampling rate would i use in this case?

https://cdn.discordapp.com/attachments/1159290193619189821/1339226216560984064/image.png?ex=67bbcadb&is=67ba795b&hm=ebd7245e947e7433ed50664198a216a0516ec296267670c36e1f4e4263b21d24&

molten fog Feb 23, 2025, 4:19 AM

#

knotty moth https://cdn.discordapp.com/attachments/1159290193619189821/1339226216560984064/i...

spek shows the khz in increments of 2

#

so i cant tell what im actually at

weak cipher Feb 23, 2025, 4:22 AM

#

hallow thistle Why did you ask like this?

I want to do that to display images on the web boohooh

molten fog Feb 23, 2025, 4:30 AM

#

knotty moth https://cdn.discordapp.com/attachments/1159290193619189821/1339226216560984064/i...

just checked in a diff analyzer mines around 35k when doubled, do i round to 40 or 32

knotty moth Feb 23, 2025, 4:52 AM

#

molten fog spek shows the khz in increments of 2

it'd be better to not let the model learn the missing frequencies

molten fog Feb 23, 2025, 4:55 AM

#

knotty moth it'd be better to not let the model learn the missing frequencies

so round to 32 kz instead of 40

weak cipher Feb 23, 2025, 8:55 AM

#

knotty moth it'd be better to not let the model learn the missing frequencies

Love u 🐧

chilly slate Feb 23, 2025, 8:56 AM

#

#✨│ai-help will anyone help me in setup of Applio and then integrate it with ai assistant?

carmine siren Feb 23, 2025, 10:19 AM

#

-kaggle

karmic oliveBOT Feb 23, 2025, 10:19 AM

#

carmine siren -kaggle

📘 Kaggle Notebooks

Applio Notebook, by Vidal Kaggle
Applio Notebook, by Shirou Kaggle
Music Source Separation, by Shirou Kaggle
UVR5 NO UI, by Eddy Kaggle
Original W-Okada's Voice Changer, Kaggle
Modified W-Okada's Voice Changer, Kaggle
🆕 UVR5 UI, by Eddy, ArisDev & Nick088 Kaggle
🆕 RVC AI Cover Maker UI, by Shirou & ArisDev Kaggle
📖 How to use RVC Mainline on Kaggle by Cauthess

Note: Kaggle limits GPU usage to 30 hours per week.

weak cipher Feb 23, 2025, 11:36 AM

#

Guys I have skill issue, where can I get hugface api?

molten fog Feb 23, 2025, 12:03 PM

#

#

is this ot or keep going?

thin edge Feb 23, 2025, 1:21 PM

#

how to get higher "khz" for dataset ?

#

im stuck with 15 khz voice

simple ore Feb 23, 2025, 1:42 PM

#

thin edge how to get higher "khz" for dataset ?

most speaking audio is around 15-16KHz

thin edge Feb 23, 2025, 1:43 PM

#

simple ore most speaking audio is around 15-16KHz

but I see an option for 42K where did that come from?

simple ore Feb 23, 2025, 1:48 PM

#

????

thin edge Feb 23, 2025, 1:49 PM

#

simple ore ????

You know... just forget about it,

What happens if I increase the gpu batch size ?

simple ore Feb 23, 2025, 1:49 PM

#

i mean.. where did you find 42KHz?

thin edge Feb 23, 2025, 1:49 PM

#

simple ore i mean.. where did you find 42KHz?

yeah

simple ore Feb 23, 2025, 1:50 PM

#

there's 32k.. or 44k

#

not 42k

thin edge Feb 23, 2025, 1:50 PM

#

simple ore there's 32k.. or 44k

Yes, that I forgot

simple ore Feb 23, 2025, 1:51 PM

#

as for batch size - larger the batch size is, larger the stride it makes each step

#

could reach the goal faster, but can also miss it

thin edge Feb 23, 2025, 1:53 PM

#

simple ore as for batch size - larger the batch size is, larger the stride it makes each st...

I see... so what batch size do you recommend if I save every 10 E for a total of 350E?

simple ore Feb 23, 2025, 1:54 PM

#

depends on the dataset size

thin edge Feb 23, 2025, 1:54 PM

#

simple ore depends on the dataset size

as in minute ?

simple ore Feb 23, 2025, 1:54 PM

#

yes

thin edge Feb 23, 2025, 1:55 PM

#

simple ore yes

6:30

#

6 and half minute

simple ore Feb 23, 2025, 1:57 PM

#

4

thin edge Feb 23, 2025, 1:57 PM

#

simple ore 4

aight thanks

#

and i find this too, what should i do ?

simple ore Feb 23, 2025, 1:58 PM

#

you're using some old software

thin edge Feb 23, 2025, 2:02 PM

#

matsuripray

chilly slate Feb 23, 2025, 2:14 PM

#

will anyone help me in setup of Applio and then integrate it with ai assistant? @low shard

hallow thistle Feb 23, 2025, 3:43 PM

#

chilly slate will anyone help me in setup of Applio and then integrate it with ai assistant? ...

Why would you need an AI assistant for Applio?

low shard Feb 23, 2025, 3:48 PM

#

chilly slate will anyone help me in setup of Applio and then integrate it with ai assistant? ...

#✨│ai-help message sorry but i dunno how else u could make that other than reading the sillytavern docs

chilly slate Feb 23, 2025, 4:00 PM

#

hallow thistle Why would you need an AI assistant for Applio?

I want to change my ai assistant voice I need a voice that is like a human and can synthesize emotions perfectly

hallow thistle Feb 23, 2025, 4:01 PM

#

chilly slate I want to change my ai assistant voice I need a voice that is like a human and c...

You mean TTS or W-Okada the realtime voice changer?

chilly slate Feb 23, 2025, 4:01 PM

#

low shard https://discord.com/channels/1159260121998827560/1159290139609137264/13428705082...

But uh knw how to do with RVC right so why not use that

chilly slate Feb 23, 2025, 4:01 PM

#

hallow thistle You mean TTS or W-Okada the realtime voice changer?

Nick told me that it have integrated tts so I can use it for real time

low shard Feb 23, 2025, 4:02 PM

#

chilly slate But uh knw how to do with RVC right so why not use that

so, you want to know how to use rvc generally? bc i can't help for making your own LLM+STT+STS+TTS

low shard Feb 23, 2025, 4:03 PM

#

chilly slate Nick told me that it have integrated tts so I can use it for real time

RVC is STS natively, I told you the only way to use it for TTS is by using another TTS like edge tts then using that audio as an input in rvc, like what Applio, which is an RVC fork, does

weak cipher Feb 23, 2025, 4:06 PM

#

low shard RVC is STS natively, I told you the only way to use it for TTS is by using anoth...

bro

low shard Feb 23, 2025, 4:06 PM

#

?

weak cipher Feb 23, 2025, 4:07 PM

#

I have tts and stt, now how do I create a chat bot?

chilly slate Feb 23, 2025, 4:08 PM

#

low shard RVC is STS natively, I told you the only way to use it for TTS is by using anoth...

Well can uh chk and suggest me some tutorial of applio

low shard Feb 23, 2025, 4:19 PM

#

chilly slate Well can uh chk and suggest me some tutorial of applio

you want to do it locally (runs on ur gtx 1650) or on cloud (remote good pc)?

honest dew Feb 23, 2025, 4:24 PM

#

yo

low shard Feb 23, 2025, 4:29 PM

#

honest dew yo

do you need any help?

molten fog Feb 23, 2025, 4:53 PM

#

low shard do you need any help?

i need help

#

#

where in this would be considered the overtraining point?

#

#

im thinking at that third notch before it indefinitely rises but im not sure if im correct or not

low shard Feb 23, 2025, 4:56 PM

#

molten fog

set the smoothing to the max

crude flame Feb 23, 2025, 5:15 PM

#

molten fog i need help

look at the avg graphs if you have those because they are more accurate

simple ore Feb 23, 2025, 5:16 PM

#

molten fog im thinking at that third notch before it indefinitely rises but im not sure if ...

it seems that there's much variation in your dataset, and untrimmed silence?

molten fog Feb 23, 2025, 5:17 PM

#

simple ore it seems that there's much variation in your dataset, and untrimmed silence?

lots of variation yes untrimmed silence not that i know of

simple ore Feb 23, 2025, 5:21 PM

#

show fm and mel charts too

molten fog Feb 23, 2025, 5:24 PM

#

simple ore show fm and mel charts too

#

#

lmfao i accidentally refreshed my kaggle page so i mightve just botched the training altogether but no big deal tbh

simple ore Feb 23, 2025, 5:28 PM

#

fm ooof

#

that's way too much. what's dataset size / batch size ?

tropic phoenix Feb 23, 2025, 6:52 PM

#

weak cipher I have tts and stt, now how do I create a chat bot?

Check out rvc-chat. That was a tech demo I did for a voiced chatbot before gpt-4o was released.

cold tinsel Feb 23, 2025, 8:55 PM

#

ello. is a batch size of 8 for 15min of data a good place to start?

lone temple Feb 23, 2025, 10:57 PM

#

is there any tool that allows phonetic tts with rvc models?

analog obsidian Feb 23, 2025, 11:03 PM

#

cold tinsel ello. is a batch size of 8 for 15min of data a good place to start?

i would use 4 but 8 works too in your case

vestal cloud Feb 23, 2025, 11:40 PM

#

how to i get Unwa's big mel roformer beta 4 on uvr?

simple ore Feb 24, 2025, 12:09 AM

#

lone temple is there any tool that allows phonetic tts with rvc models?

no, but you can run rvc voice changing on top of tts output

lone temple Feb 24, 2025, 12:24 AM

#

simple ore no, but you can run rvc voice changing on top of tts output

oh you're right lol thanks

knotty moth Feb 24, 2025, 1:34 AM

#

lone temple is there any tool that allows phonetic tts with rvc models?

text doesn't carry voice features like pitch, intonation, non-verbal things, and rvc does copy all of those, not guess them

simple ore Feb 24, 2025, 1:37 AM

#

knotty moth text doesn't carry voice features like pitch, intonation, non-verbal things, and...

an LLM-based TTS model can generally do such things

#

a non-LLM based TTS model can use phonetics to produce the audio, although it will be relatively bland

#

unless the model uses both phonetics and an LLM/GPT engine

knotty moth Feb 24, 2025, 1:47 AM

#

simple ore a non-LLM based TTS model can use phonetics to produce the audio, although it wi...

basically vocaloid/utau, though I think it may be somehow possible or still needs a base voicebank behind it

hallow thistle Feb 24, 2025, 2:00 AM

#

lone temple is there any tool that allows phonetic tts with rvc models?

RVC the voice changer and TTS program are two different things. You can use any TTS program and let RVC to process the audio. Applio has this feature.

crimson depot Feb 24, 2025, 3:05 AM

#

One question. Is index file mandatory for IA covers?

glossy grove Feb 24, 2025, 4:41 AM

#

@crimson depotno it's not. Sometimes it can help, sometimes it can hurt.

crimson depot Feb 24, 2025, 4:43 AM

#

glossy grove <@806611115059445830>no it's not. Sometimes it can help, sometimes it can hurt.

Pth si enough?

glossy grove Feb 24, 2025, 4:43 AM

#

sometimes

crimson depot Feb 24, 2025, 4:44 AM

#

For diferent language

weak cipher Feb 24, 2025, 6:15 AM

#

tropic phoenix Check out rvc-chat. That was a tech demo I did for a voiced chatbot before gpt-4...

where bro I can't find

dusty ivy Feb 24, 2025, 8:06 AM

#

-overtrain

karmic oliveBOT Feb 24, 2025, 8:06 AM

#

dusty ivy -overtrain

Moved to /faq command.

royal marsh Feb 24, 2025, 12:46 PM

#

anyone know how to change hz in steel series engine?

paper finch Feb 24, 2025, 1:17 PM

#

While trying to change the voice with Harvest, RVC shuts down, and Python says 'press any key.' When I press a key, it closes without giving any error code.

simple ore Feb 24, 2025, 1:29 PM

#

paper finch While trying to change the voice with Harvest, RVC shuts down, and Python says '...

check event viewer, most likey just ran out of memory

#

and python crashed

distant tundra Feb 24, 2025, 2:39 PM

#

Hi, is there any software like RVC but it can clone a voice just with some seconds of a wav file?

tame mica Feb 24, 2025, 2:39 PM

#

distant tundra Hi, is there any software like RVC but it can clone a voice just with some secon...

looking for zero shot ?

#

seed-vc does that

#

though it's really not that good

distant tundra Feb 24, 2025, 2:40 PM

#

tame mica though it's really not that good

I have only 3 minutes of the voice I want to clone 😦

distant tundra Feb 24, 2025, 2:49 PM

#

tame mica looking for zero shot ?

I guess this is what I'm looking for

distant turtle Feb 24, 2025, 3:40 PM

#

-colab

karmic oliveBOT Feb 24, 2025, 3:40 PM

#

distant turtle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

chilly slate Feb 24, 2025, 3:59 PM

#

low shard you want to do it locally (runs on ur gtx 1650) or on cloud (remote good pc)?

train on gloud if need be but other it will be locally

low shard Feb 24, 2025, 4:03 PM

#

chilly slate train on gloud if need be but other it will be locally

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC

I suggest Applio

chilly slate Feb 24, 2025, 4:04 PM

#

low shard - Locally (runs on your pc so the speed depends on that, you will have to set it...

where i can find some tutorial on Applio?

low shard Feb 24, 2025, 4:05 PM

#

chilly slate where i can find some tutorial on Applio?

that's the tutorial for applio

#

hyperlink is a blue text, that when clicked will redirect you to the link

chilly slate Feb 24, 2025, 4:05 PM

#

low shard hyperlink is a blue text, that when clicked will redirect you to the link

oh oky

low shard Feb 24, 2025, 4:05 PM

#

if you don't understand what I'm saying, either click the lil blue Applio text, or directly go to https://docs.ai-hub.wtf/rvc/local/applio/

Applio

Last update: Apr 01, 2024

chilly slate Feb 24, 2025, 4:06 PM

#

low shard if you don't understand what I'm saying, either click the lil blue Applio text, ...

nahh i understood

distant tundra Feb 24, 2025, 4:29 PM

#

tame mica seed-vc does that

Got it working and the quality is not that bad, probably on par with RVC

stoic viper Feb 24, 2025, 4:43 PM

#

Hey everyone,
I’m training a voice AI model with a 35-minute recording, but I heard that using too many Epochs can make the voice sound robotic.
Any advice on how many Epochs I should use to keep it sounding natural? Also, if anyone can explain how Epochs really work, I’d appreciate it!
Thanks!

distant turtle Feb 24, 2025, 4:43 PM

#

-colab

karmic oliveBOT Feb 24, 2025, 4:43 PM

#

distant turtle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

polar tendon Feb 24, 2025, 4:46 PM

#

royal marsh anyone know how to change hz in steel series engine?

I guess the audio device doesn't provide any other sample rate

tame mica Feb 24, 2025, 4:51 PM

#

distant tundra Got it working and the quality is not that bad, probably on par with RVC

aight goodluck

low shard Feb 24, 2025, 5:54 PM

#

stoic viper Hey everyone, I’m training a voice AI model with a 35-minute recording, but I he...

U should use the tensorboard and high quality audios

nocturne pine Feb 24, 2025, 6:20 PM

#

Is RVC GUI still viable?

low shard Feb 24, 2025, 7:23 PM

#

nocturne pine Is RVC GUI still viable?

Nope it's really old and outdated

#

Don't follow yt tuts

#

What's ur PC GPU and what do u want to do

brittle gazelle Feb 24, 2025, 9:11 PM

#

Hey does rvc runs in the 5000 series ?

simple ore Feb 24, 2025, 9:26 PM

#

brittle gazelle Hey does rvc runs in the 5000 series ?

not right now, still waiting for windows wheels for pytorch and torchaudio

#

there's pytorch, but no torchaudio

fast scarab Feb 24, 2025, 10:41 PM

#

Hey, when should I stop the training? It looks like it's starting to overfit. Thanks!

distant turtle Feb 24, 2025, 11:48 PM

#

-colab

karmic oliveBOT Feb 24, 2025, 11:48 PM

#

distant turtle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

stray raven Feb 25, 2025, 12:16 AM

#

yo dudes getting a rtx 5080 card soon, i understand from previous messages here that its not working with the voice changer atm? is thats still the case?

simple ore Feb 25, 2025, 12:21 AM

#

still no updates

knotty moth Feb 25, 2025, 12:41 AM

#

stray raven yo dudes getting a rtx 5080 card soon, i understand from previous messages here ...

50-series gpus have been getting several issues, even some 5080s have also missing ROPs. imo 4080 super is still a better choice for similar spec

crimson depot Feb 25, 2025, 6:35 AM

#

Local CoverGen????

#

I need one

proud hound Feb 25, 2025, 6:54 AM

#

-hf

karmic oliveBOT Feb 25, 2025, 6:54 AM

#

proud hound -hf

🤗 Huggingface Spaces

UVR5 UI, by Eddy and Ilaria Huggingface Spaces
Ilaria RVC Zero, by thestingerx Huggingface Spaces
RVC⚡ZERO, by r3gm Huggingface Spaces
Applio, by IA Hispano Huggingface Spaces

glass igloo Feb 25, 2025, 11:46 AM

#

Share a link to the official github of rvc2 and maybe there is some modern tutorial how to install and train the model? I found a tutorial video on youtube but there is a link to github where the last update is 2023.

low shard Feb 25, 2025, 12:02 PM

#

glass igloo Share a link to the official github of rvc2 and maybe there is some modern tutor...

YouTube tuts are very old

#

Tell your PC GPU and what you want to do first

glass igloo Feb 25, 2025, 12:05 PM

#

low shard YouTube tuts are very old

I've written a story set in the witcher world, and I'm currently making a youtube video where I voice characters with voices from the game. I have trained models for so-vits-svc, but they work badly, with speech defects. I want to train models for RVC2 in hopes that it will work better. I have a GTX 1080 video card.

low shard Feb 25, 2025, 12:06 PM

#

glass igloo I've written a story set in the witcher world, and I'm currently making a youtub...

Yeah so vits SVC has been replaced by RVC since 2 years

#

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), not the best to train (make models) even if still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours daily, not granted, of GPU

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

hallow thistle Feb 25, 2025, 12:07 PM

#

SVC is too old now.

glass igloo Feb 25, 2025, 12:09 PM

#

low shard Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), ...

Thank you for your reply. I'm on my way to look into it

low shard Feb 25, 2025, 12:21 PM

#

glass igloo Thank you for your reply. I'm on my way to look into it

Yw and lmk

slim mauve Feb 25, 2025, 1:43 PM

#

Hi guys! Do you know how can I create my own voice model for RVC-Gui? Do I have to take a 10 minutes recording and the renaming it with the .pth extension?

low shard Feb 25, 2025, 2:17 PM

#

slim mauve Hi guys! Do you know how can I create my own voice model for RVC-Gui? Do I have ...

rvc gui is outdated

#

delete it

#

don’t use youtube tuts

#

and no model training isn’t just renaming the extension

#

what’s ur pc gpu

slim mauve Feb 25, 2025, 2:24 PM

#

Thank you! My gpu is NVIDIA GeForce RTX 3060

low shard Feb 25, 2025, 2:27 PM

#

slim mauve Thank you! My gpu is NVIDIA GeForce RTX 3060

As you got a good PC, you can use RVC locally, you can choose between:

Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
Mainline: The original RVC

#

With Applio/Mainline you can do both training and inference on pre-recorded audios, they are more updated RVCs

slim mauve Feb 25, 2025, 2:31 PM

#

Thank you!

low shard Feb 25, 2025, 2:41 PM

#

slim mauve Thank you!

yw lmk

distant tundra Feb 25, 2025, 9:08 PM

#

Hi, where should I put the .index file of a model (using RVC here)

zinc raft Feb 25, 2025, 9:51 PM

#

-colab

karmic oliveBOT Feb 25, 2025, 9:51 PM

#

zinc raft -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

distant tundra Feb 25, 2025, 10:14 PM

#

ok I installed applio, dunno where to put the models

simple ore Feb 25, 2025, 10:37 PM

#

distant tundra ok I installed applio, dunno where to put the models

logs/modelname, click refresh on inference screen

next karma Feb 25, 2025, 11:17 PM

#

hello can you help me with voice.ai

simple ore Feb 25, 2025, 11:24 PM

#

voice.ai ➡️ 💩

low shard Feb 25, 2025, 11:28 PM

#

next karma hello can you help me with voice.ai

Voice.ai sucks, don't use it, if you want realtime voice changer, tell ur PC GPU in #🔍│help-w-okada

hallow thistle Feb 26, 2025, 12:19 AM

#

next karma hello can you help me with voice.ai

Never go for the Voice.ai again.

knotty moth Feb 26, 2025, 12:20 AM

#

next karma hello can you help me with voice.ai

cold tinsel Feb 26, 2025, 12:37 AM

#

hmm, which sampling rate is best to choose in applio?

hallow thistle Feb 26, 2025, 12:52 AM

#

Either 40000 or 48000, but 48000 gives better quality.

knotty moth Feb 26, 2025, 12:58 AM

#

cold tinsel hmm, which sampling rate is best to choose in applio?

cold tinsel Feb 26, 2025, 1:07 AM

#

ok ty. i think i’ll go with 40k

simple ore Feb 26, 2025, 1:15 AM

#

cold tinsel ok ty. i think i’ll go with 40k

that output looks like a mix of bunch of stuff

#

32k probably

#

but you can go with 48 as well

knotty moth Feb 26, 2025, 2:11 AM

#

consistency is more desirable, not ideal to mix up from multiple sources that may have different cutoffs, but at least you can go 32k

simple ore Feb 26, 2025, 12:10 PM

#

knotty moth consistency is more desirable, not ideal to mix up from multiple sources that ma...

and if it is from separate recordings there's likely volume / effect differences as well

#

different room / different reverb

glass igloo Feb 26, 2025, 3:04 PM

#

Hi. Can I change the dataset when training a model? For example, I originally had a file 30 minutes long and I want to change it to 1 hour. How do I do that? Do I just change the file in the dataset folder or do I have to do something else?

final arch Feb 26, 2025, 5:36 PM

#

hey im having a problem that i havnt gotting into since today, it seems like my inputs get filtered alot when i convert into ai, doesnt matter which voice im running into. Anyone with some good settings to help me out with?

unique rock Feb 26, 2025, 6:26 PM

#

How do you enable the Use RefineGAN options in Applio colab? + KLM 5?

glass igloo Feb 26, 2025, 6:51 PM

#

Hi, can you please tell me where the audio file is saved after conversion? In the interface there is an option to download the generated file but it takes too much time, I would like to be able to work directly with the generated audio.

#

It would be even better if you could specify a place to save the converted file automatically.

white bough Feb 26, 2025, 7:26 PM

#

Is anyone else having problems with the Applio ColabUI? It cannot find the gradio module. It was working an hour ago

#

Traceback (most recent call last):
File "/content/program_ml/app.py", line 1, in <module>
import gradio as gr
ModuleNotFoundError: No module named 'gradio'

simple ore Feb 26, 2025, 7:39 PM

#

virtual environment ded

#

reinstall

white bough Feb 26, 2025, 7:42 PM

#

simple ore reinstall

Are you talking about Google Colab, or something else?

simple ore Feb 26, 2025, 7:43 PM

#

yes

white bough Feb 26, 2025, 7:43 PM

#

What should I do to make it work? How do I "reinstall"?

simple ore Feb 26, 2025, 7:44 PM

#

i guess?

#

someone else may know for sure, I dont use colab

#

the error you got is a missing requirement

white bough Feb 26, 2025, 7:46 PM

#

Oh I did reinstall. Looks like it's a Colab problem. Not sure

simple ore Feb 26, 2025, 7:46 PM

#

I assume when you discronnect from colab it deletes some installed stuff or something

distant mulch Feb 26, 2025, 8:01 PM

#

is there any way to make ai covers with my phone

lucid glacier Feb 26, 2025, 8:08 PM

#

Help

#

The collab named "CoverGen_NO_UI_v2_en.ipynb" appears with this section: The installation takes about 3 minutes, if it takes much longer ping me at AI HUB

Pitch_Change:
12

#

What I do?

#

@cyan hare

simple ore Feb 26, 2025, 8:14 PM

#

uv is a replacement for pip install

white bough Feb 26, 2025, 8:18 PM

#

Sorry deleted my message, but indeed replacing !uv run by !python works, although I see some other warnings now. But it seems to work

#

It works for now. So that's good enough for me ^^

lucid glacier Feb 26, 2025, 8:25 PM

#

El Colab denominada "CoverGen_NO_UI_v2_en.ipynb" aparece con esta sección: La instalación demora alrededor de 3 minutos. Si demora mucho más, envíeme un mensaje a AI HUB

Cambio de tono:
12
¿Qué hago?

void ravine Feb 26, 2025, 9:09 PM

#

my colab's not workiing

#

does anyone have a working google colab link

shadow moth Feb 26, 2025, 10:02 PM

#

white bough Traceback (most recent call last): File "/content/program_ml/app.py", line 1, ...

Same error

white bough Feb 26, 2025, 10:03 PM

#

I replaced uv run by python in the cell and it worked

shadow moth Feb 26, 2025, 10:08 PM

#

Thanks a lot!

weary peak Feb 27, 2025, 12:05 AM

#

When starting to train a model I get an error at 5.3 seconds no matter what I run it own, I can seem to get an error code either. To be fair I'm stupid, idk if it has to do with pytorch and python dont work 100% of the time on my PC

simple ore Feb 27, 2025, 12:10 AM

#

weary peak When starting to train a model I get an error at 5.3 seconds no matter what I ru...

colab or local?

distant hamlet Feb 27, 2025, 12:15 AM

#

anyone getting this on mainline colab? it was working perfectly fine just this morning 😑

weary peak Feb 27, 2025, 12:17 AM

#

simple ore colab or local?

i assume local, using RVC WebUI

simple ore Feb 27, 2025, 12:19 AM

#

there should be output in the console window with the error

weary peak Feb 27, 2025, 12:22 AM

#

is colab an easier process, found one error msg AttributeError: 'FigureCanvasAgg' object has no attribute 'tostring_rgb'

simple ore Feb 27, 2025, 12:27 AM

#

old colab

formal wind Feb 27, 2025, 2:10 AM

#

is kaggle or applio no ui better?

tame mica Feb 27, 2025, 2:11 AM

#

kaggle would be better but if you need the latest branch use noui colab

woeful cave Feb 27, 2025, 2:30 AM

#

applio isnt working?

simple ore Feb 27, 2025, 2:43 AM

#

if you mean colab, it seem there's an issue with requirements install

woeful cave Feb 27, 2025, 2:44 AM

#

aww man

simple ore Feb 27, 2025, 2:44 AM

#

replacing uv with python may work in the install cells

formal wind Feb 27, 2025, 3:53 AM

#

tame mica kaggle would be better but if you need the latest branch use noui colab

Wdym by latest branch boohooh

tame mica Feb 27, 2025, 3:53 AM

#

like

#

the branch that could use the non hifigan vocoders

formal wind Feb 27, 2025, 3:56 AM

#

I have no idea what tf any of that means kar

#

Ill try my best to figure it out ig

tame mica Feb 27, 2025, 4:19 AM

#

oh 😭

#

like basically the latest version of applio

formal wind Feb 27, 2025, 4:24 AM

#

Thank you lol 🙏 (Mb for being stupid)

tame mica Feb 27, 2025, 5:01 AM

#

nono dw its fine xd

weak cipher Feb 27, 2025, 5:55 AM

#

Hey guys have you used Whisper Tiny yet?

frozen ledge Feb 27, 2025, 6:33 AM

#

-colab

karmic oliveBOT Feb 27, 2025, 6:33 AM

#

frozen ledge -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

tame mica Feb 27, 2025, 7:08 AM

#

frozen ledge -colab

omg nene !!!!

#

https://media.discordapp.net/stickers/1315369651093373030.png?size=160&name=ネネぬいぐるみ

#

https://cdn.discordapp.com/emojis/1317142759790088232.webp?size=48&name=scarynene

proper mountain Feb 27, 2025, 9:25 AM

#

ModuleNotFoundError: No module named 'gradio'

what should I do? it's on Hina's Mod AICoverGen. it was fine last night 😌

white bough Feb 27, 2025, 9:50 AM

#

From what I understand, the command "uv run" was supposed to call "python", but it does not anymore. So replace all the "uv run" instances by "python" and check if it works

tawdry spade Feb 27, 2025, 11:04 AM

#

Is there a solution for Mainline Google colab?

low shard Feb 27, 2025, 11:05 AM

#

tawdry spade Is there a solution for Mainline Google colab?

What's the issue?

glass igloo Feb 27, 2025, 11:15 AM

#

Can anyone know why in applio, when I start the continuation of model training, the epoch time increases several times? When I start training from the beginning the epoch lasts about a minute, but if I finish training and then continue, the epoch takes 5 minutes.

tawdry spade Feb 27, 2025, 11:17 AM

#

low shard What's the issue?

unique rock Feb 27, 2025, 11:56 AM

#

What error does Applio Colab present at this time?

fading relic Feb 27, 2025, 1:13 PM

#

how do i stop delayed voice

vapid mantle Feb 27, 2025, 1:20 PM

#

-colab

karmic oliveBOT Feb 27, 2025, 1:20 PM

#

vapid mantle -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
Hina's Mod AICoverGen WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Hina's Modified Original W-Okada's Realtime Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab
🆕 Music Source Separation Training (Inference), by Jarredou & Makidanye Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

low shard Feb 27, 2025, 1:49 PM

#

fading relic how do i stop delayed voice

This is the wrong channel

RVC = Retrieval-based-Voice-Conversion, the best Speech To Speech AI Models (on v2), Inferences (use models) pre-recorded audio (ai covers) and train (make) models

Wokada = uses RVC for realtime inference

Tell your PC GPU, the guide link you're using and a screenshot of your wokada in #🔍│help-w-okada

hallow thistle Feb 27, 2025, 2:15 PM

#

unique rock What error does Applio Colab present at this time?

What kind of Colab notebook Applio error are you talking about?