gray rover Jan 6, 2025, 9:33 PM

#

Oh, so unwa's then

ancient swan Jan 6, 2025, 9:33 PM

#

but

gray rover Jan 6, 2025, 9:33 PM

#

So what's up with that? available anywhere or, still premium

ancient swan Jan 6, 2025, 9:33 PM

#

they are pretty noisy

gray rover Jan 6, 2025, 9:33 PM

#

the buzzy / after-sep noise?

ancient swan Jan 6, 2025, 9:33 PM

#

https://huggingface.co/pcunwa

pcunwa (unwa)

ancient swan Jan 6, 2025, 9:34 PM

#

gray rover the buzzy / after-sep noise?

sort of yeah

gray rover Jan 6, 2025, 9:34 PM

#

well fuck

#

in that case gotta resign from em

ancient swan Jan 6, 2025, 9:34 PM

#

https://huggingface.co/becruily

becruily (-)

gray rover Jan 6, 2025, 9:34 PM

#

Need to build a set for rvc so ye

#

Any you guys know that don't have aftereffects but still outperform mvsep's bsrofo?

#

think I'd be satisfied enough with such

ancient swan Jan 6, 2025, 9:34 PM

#

but they are insanely good at fully extracting vocals from the song

gray rover Jan 6, 2025, 9:35 PM

#

Hmmm.. guess I'll have to test stuff around then

ancient swan Jan 6, 2025, 9:35 PM

#

gray rover Any you guys know that don't have aftereffects but still outperform mvsep's bsro...

you could try syhft v3

gray rover Jan 6, 2025, 9:35 PM

#

what are they run on? uvr? or some other framework?
if uvr, which ver

ancient swan Jan 6, 2025, 9:35 PM

#

https://huggingface.co/SYH99999

SYH99999 (E-L @ YH)

ancient swan Jan 6, 2025, 9:35 PM

#

gray rover what are they run on? uvr? or some other framework? if uvr, which ver

uvr and msst script

gray rover Jan 6, 2025, 9:36 PM

#

got any links?

ancient swan Jan 6, 2025, 9:36 PM

#

https://github.com/ZFTurbo/Music-Source-Separation-Training/

GitHub

GitHub - ZFTurbo/Music-Source-Separation-Training: Repository for t...

Repository for training models for music source separation. - ZFTurbo/Music-Source-Separation-Training

#

gui-wx.py launches the gui where you can set up the models and their configs

#

and it has download models thingy

gray rover Jan 6, 2025, 9:36 PM

#

Bet

ancient swan Jan 6, 2025, 9:36 PM

#

for uvr you will need to look into audio sep server

gray rover Jan 6, 2025, 9:37 PM

#

oh yea, cause for uvr I see there's 5.6

#

I still run on first beta that supports bsrofo lol

ancient swan Jan 6, 2025, 9:37 PM

#

as anjok has released new patches only in the server currently but he's preparing a big new update

gray rover Jan 6, 2025, 9:37 PM

#

I see.
Well, thanks for help man

#

Gotta catch up with it all asap

ancient swan Jan 6, 2025, 9:37 PM

#

np

#

i personally recommend trying big beta 4 by unwa if you want "full" vocals but without the static noise that is created by models that are trained with emphasis on some metric

gray rover Jan 6, 2025, 9:38 PM

#

ye actually wanted to but

#

Got a lil lost

#

I suppose it's not this one

#

or is it, but uses different internal naming? / ver indexing ?

#

I suppose not cause of the date

ancient swan Jan 6, 2025, 9:39 PM

#

so

#

he has big beta models those are for vocals, the models with "e" are models with emphasis

#

he also made kim ft models which are basically slightly improved versions of kim's melband roformer

gray rover Jan 6, 2025, 9:41 PM

#

links on audio sep?

#

or released for closed circle / premium?

ancient swan Jan 6, 2025, 9:41 PM

#

gray rover links on audio sep?

the server?

gray rover Jan 6, 2025, 9:41 PM

#

ye

#

as in, links somewhere

#

or is there a website / hosting I dunno of

ancient swan Jan 6, 2025, 9:42 PM

#

for unwa models?

gray rover Jan 6, 2025, 9:42 PM

#

big beta

ancient swan Jan 6, 2025, 9:42 PM

#

gray rover big beta

#

all of his models are on his hf

gray rover Jan 6, 2025, 9:42 PM

#

oh, there

ancient swan Jan 6, 2025, 9:42 PM

#

that i linked

gray rover Jan 6, 2025, 9:43 PM

#

Got it. That should be all

#

Thanks once again

#

( ps, sorry I asked so much. I just happen to now have limited bandwidth so, gotta be careful with what I potentially get rip me

#

So better ask than be sorry later

ancient swan Jan 6, 2025, 9:43 PM

#

also if you want to dereverb vocals use anvuew melband dereverb v2

#

it's the best dereverb model for singing, it literally eliminates 99.9% of the reverb somehow

#

and it even cleans up the bleed

gray rover Jan 6, 2025, 9:44 PM

#

you think It'll work for speech too? ( artificial reverb's artificial reverb but ye, guess I'll check

ancient swan Jan 6, 2025, 9:45 PM

#

gray rover ( ps, sorry I asked so much. I just happen to now have limited bandwidth so, got...

yeah no problem, man, ask whenever you want, i'm always glad to help

gray rover Jan 6, 2025, 9:45 PM

#

✨ 🙏

ancient swan Jan 6, 2025, 9:45 PM

#

gray rover you think It'll work for speech too? ( artificial reverb's artificial reverb but...

sadly no, for speech better to use dialogue dereverb

#

but it's the best for singing vocals

gray rover Jan 6, 2025, 9:46 PM

#

oh yea, in that case Imma stick to my ai vst

#

dialogue derev genuinely sucks ass ( all in rx that's " machine learning " pretty much

#

tho ye. Imma go off for some testing now, wish me luck

ancient swan Jan 6, 2025, 9:47 PM

#

gray rover dialogue derev genuinely sucks ass ( all in rx that's " machine learning " prett...

i like it in rx11

ancient swan Jan 6, 2025, 9:48 PM

#

gray rover oh yea, in that case Imma stick to my ai vst

what vst do you usually use?

gray rover Jan 6, 2025, 9:48 PM

#

For me personally it's too damaging

#

but then, I am quite sensitive for that

#

wave's dereverb pro

ancient swan Jan 6, 2025, 9:48 PM

#

ah

gray rover Jan 6, 2025, 9:48 PM

#

still rocks for me

#

good thing is it does magic on speech

ancient swan Jan 6, 2025, 9:48 PM

#

i tried it, didn't like it, leaves too much echo for my taste

gray rover Jan 6, 2025, 9:48 PM

#

and if you give it a lil bit of your input in rx then it shines
else ye, for users not willing to work a lil on the output, might not be the best

#

as you pointed it out

#

leaves a bit of trails to remove manually

#

But good thing about it is, it doesn't damage the spectrum on its own so, for audio geeks like me that like to play around, it's perfect

ancient swan Jan 6, 2025, 9:50 PM

#

i usually try acon's deverberate first, imo it does a pretty good job of being balanced between aggressiveness and accuracy, and if it fails i just do rx 11 dialogue dereverb with low sensitivity and it works for me

#

sensitivity put to 10 is pretty bad most of the time, so i put 4-5

gray rover Jan 6, 2025, 9:51 PM

#

mmm

#

also, just in case
big beta 4's for gui-wx as well?

#

about to test it in a bit

ancient swan Jan 6, 2025, 9:52 PM

#

gray rover also, just in case big beta 4's for gui-wx as well?

yeah

gray rover Jan 6, 2025, 9:52 PM

#

alr

ancient swan Jan 6, 2025, 9:52 PM

#

msst script supports all architectures and models

#

the gui is just bas curtiz's lil script that makes the usage of the script a lil bit more convenient
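
(Side note on the msst script mentioned above: a minimal sketch of calling it without the gui, assuming you run it from inside a clone of the ZFTurbo repo. The flag names follow the repo's README as far as I know; the model type value, config, checkpoint and folder paths are placeholders, not real files.)

```python
import subprocess
import sys


def msst_inference_cmd(model_type, config_path, checkpoint, input_folder, store_dir):
    """Build the command line for the repo's inference.py script."""
    return [
        sys.executable, "inference.py",
        "--model_type", model_type,          # architecture name, e.g. a Mel-Band RoFormer
        "--config_path", config_path,        # YAML config that ships with the model
        "--start_check_point", checkpoint,   # downloaded .ckpt weights
        "--input_folder", input_folder,      # folder with the songs to separate
        "--store_dir", store_dir,            # where the stems get written
    ]


def run_msst(cmd, repo_dir):
    """Run the command from inside the cloned repo (assumes dependencies are installed)."""
    subprocess.run(cmd, check=True, cwd=repo_dir)


if __name__ == "__main__":
    # Placeholder paths: swap in the config and checkpoint you actually downloaded.
    cmd = msst_inference_cmd(
        "mel_band_roformer",
        "configs/my_model_config.yaml",
        "models/my_model.ckpt",
        "input/",
        "output/",
    )
    print(" ".join(cmd))
```

(The gui fills in the same fields for you, so this only matters if you want to script batches.)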

gray rover Jan 6, 2025, 9:54 PM

#

got it

ancient swan Jan 6, 2025, 9:54 PM

#

gray rover alr

have you ever looked into apollo models?

gray rover Jan 6, 2025, 9:54 PM

#

not really

#

past the time when bsrofo was the king and mel roformer was just barely usable, I stopped using separators tbh

#

so, got tons of new things to try really

ancient swan Jan 6, 2025, 9:55 PM

#

gray rover not really

it's really cool, maybe not super useful for rvc, but still

#

basically an apollo model aims to improve low quality (32kbps-128kbps) audio into lossless quality

#

the original model was already doing pretty well, but it was small and trained on low amount of data

#

but the guy called lew on audio sep trained 2 different apollo models

#

first one is vocal enhancer, which tries to fix the muddiness of bs roformer vocals

gray rover Jan 6, 2025, 9:57 PM

#

ah that

#

I remember it now

#

is it based on the premise of audio super res? / upon its core ?

#

as in, diffusion-based ?

ancient swan Jan 6, 2025, 9:58 PM

#

and it does it pretty well but is pretty noisy, the second version is better though

ancient swan Jan 6, 2025, 9:58 PM

#

gray rover as in, diffusion-based ?

i have zero clue, but it works in a slightly different way than audio super resolution

#

i think it's some kind of generative model, i'm not sure

#

and the second model that lew made is called universal enhancer, and it's so fucking amazing

#

it handles low quality compression artifacts really well

#

works for vocals too but also adds noise, but you can denoise it easily

gray rover Jan 6, 2025, 10:00 PM

#

Hmmm.. I could see how it does one time when enhancing the visual novel based audios

#

some novels are sadly.. well yea, using trashy compression

#

some obscure crap codecs

ancient swan Jan 6, 2025, 10:01 PM

#

basically that model has bigger dataset, more params and aims at a wider variety of different lossy formats

gray rover Jan 6, 2025, 10:04 PM

#

aaa

#

I suppose folks that wanna revive some old collections of mp3s or aac can thrive now

ancient swan Jan 6, 2025, 10:15 PM

#

absolutely

elder willow Jan 6, 2025, 11:12 PM

#

-colab

rare sorrelBOT Jan 6, 2025, 11:12 PM

#

elder willow -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

ancient zenith Jan 7, 2025, 12:34 AM

#

Onuro AI > Cursor AI

unkempt current Jan 7, 2025, 12:36 AM

#

What is the best way to isolate vocals, and leave it clean (without mvsep, it takes too long)

stray mulch Jan 7, 2025, 12:37 AM

#

unkempt current What is the best way to isolate vocals, and leave it clean (without mvsep, it ta...

you can use a website for this lemme get it rq

#

https://vocalremover.org/splitter-ai

#

you can do this

#

also has other options

ancient zenith Jan 7, 2025, 12:37 AM

#

https://www.whatplugin.ai/reviews/best-ai-vocal-remover

polar flax Jan 7, 2025, 12:37 AM

#

stray mulch https://vocalremover.org/splitter-ai

ew spleeter in 2025

stray mulch Jan 7, 2025, 12:38 AM

#

polar flax ew spleeter in 2025

eh gets the job done

#

personally i use it for instrumentals

polar flax Jan 7, 2025, 12:39 AM

#

stray mulch eh gets the job done

I'd recommend this colab notebook for best roformer models, much much better than that spleeter model

#

unwa's mel roformer inst v1/v1e is the best for instrumental stem

#

https://colab.research.google.com/github/jarredou/Music-Source-Separation-Training-Colab-Inference/blob/main/Music_Source_Separation_Training_(Colab_Inference).ipynb

Google Colab

stray mulch Jan 7, 2025, 12:40 AM

#

polar flax I'd recommend this colab notebook for best roformer models, much much better tha...

i....have no idea what half of those words mean

polar flax Jan 7, 2025, 12:41 AM

#

stray mulch i....have no idea what half of those words mean

you typed when I forgor to include link

#

https://tenor.com/view/spongebob-punch-boxing-boxing-glove-punching-myself-gif-21654165

Tenor

stray mulch Jan 7, 2025, 12:41 AM

#

polar flax you typed when I forgor to include link

if this link gives me a virus ill send an airstrike your way

#

god i hate mac

#

why is windows just so much better

polar flax Jan 7, 2025, 12:42 AM

#

stray mulch if this link gives me a virus ill send an airstrike your way

it's literally a colab link, why do you suspect it?

stray mulch Jan 7, 2025, 12:43 AM

#

im a noob/have next to no experience when it comes to technology

#

literally only joined this server so i could get ai voice models to do a funny bit for a video of mine

#

but imam try and see if i can actually learn this stuff

#

lowkey sounds interesting

unkempt current Jan 7, 2025, 12:46 AM

#

stray mulch https://vocalremover.org/splitter-ai

It helped so much!

unkempt current Jan 7, 2025, 12:46 AM

#

polar flax https://colab.research.google.com/github/jarredou/Music-Source-Separation-Traini...

THATS SO CLEAN THANK YOU SO MUCH

unkempt current Jan 7, 2025, 12:48 AM

#

polar flax I'd recommend this colab notebook for best roformer models, much much better tha...

Hey @polar flax, can u tell me what model separates lead vocals from background vocals?

#

And a model that removes reverb and delays

#

If u can pls :3

stray mulch Jan 7, 2025, 12:48 AM

#

unkempt current THATS SO CLEAN THANK YOU SO MUCH

yeah use this instead

unkempt current Jan 7, 2025, 12:49 AM

#

stray mulch yeah use this instead

you helped me too, thanks

#

wtf why does my autocorrect keep correcting "helped" to "help-desk"

ancient swan Jan 7, 2025, 12:50 AM

#

https://github.com/Chenglin-Yang/1.58bit.flux @covert lake

GitHub

GitHub - Chenglin-Yang/1.58bit.flux

Contribute to Chenglin-Yang/1.58bit.flux development by creating an account on GitHub.

#

https://www.arxiv.org/abs/2412.18653

arXiv.org

1.58-bit FLUX

We present 1.58-bit FLUX, the first successful approach to quantizing the state-of-the-art text-to-image generation model, FLUX.1-dev, using 1.58-bit weights (i.e., values in {-1, 0, +1}) while maintaining comparable performance for generating 1024 x 1024 images. Notably, our quantization method operates without access to image data, relying sol...

#

this paper says that it will basically reduce the storage size and inference memory usage of flux dev by factors of 7.7x and 5.1x respectively without losing much of the quality

#

3 gb flux models coming soon ig

#

that may produce good quality
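
(Back-of-the-envelope check of the "3 gb" guess: a rough sketch that assumes FLUX.1-dev has about 12 billion parameters stored at 16 bits each. Both numbers are my assumptions, only the 7.7x factor comes from the message above.)

```python
def checkpoint_size_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate checkpoint size in GB (1 GB = 1e9 bytes)."""
    return params_billion * bytes_per_param


def after_reduction(size_gb: float, factor: float) -> float:
    """Size left after shrinking by the given reduction factor."""
    return size_gb / factor


# Assumed baseline: ~12B parameters at 2 bytes each (16-bit weights).
baseline = checkpoint_size_gb(12.0, 2.0)
# 7.7x is the storage reduction quoted from the paper.
compressed = after_reduction(baseline, 7.7)

print(f"{baseline:.1f} GB -> {compressed:.1f} GB")  # 24.0 GB -> 3.1 GB
```

(So roughly 3 GB on disk under those assumptions. The 5.1x figure is for inference memory, which needs a measured baseline, so it isn't estimated here.)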

covert lake Jan 7, 2025, 1:23 AM

#

ancient swan https://github.com/Chenglin-Yang/1.58bit.flux @covert lake

Ye I have heard of it

#

I feel like there will be deffo a big quality loss tho

ancient swan Jan 7, 2025, 1:24 AM

#

maybe, but if it'll be like q8 quality or even better then it's massive

covert lake Jan 7, 2025, 1:25 AM

#

ancient swan maybe, but if it'll be like q8 quality or even better then it's massive

If it's less than that, prob not worth it

ancient swan Jan 7, 2025, 1:26 AM

#

covert lake If it's less than that, prob not worth it

i mean, q4 and q6 still look decent, so if it'll match the quality of q8 it'll totally be worth it

#

who knows maybe it'll also improve inference speed

covert lake Jan 7, 2025, 1:28 AM

#

ancient swan i mean, q4 and q6 still look decent, so if it'll match the quality of q8 it'll t...

We just gotta wait

#

I'm glad people are finding ways to run good ai on not super expensive cards

ancient swan Jan 7, 2025, 1:29 AM

#

covert lake I'm glad people are finding ways to run good ai on not super expensive cards

i mean, not only that, but it also might give more room for further improvements

covert lake Jan 7, 2025, 1:29 AM

#

#1159290752195633273

ancient swan Jan 7, 2025, 1:30 AM

#

imagine someone just makes gigantic model with huge amount of params and then just scales it down with that thing

covert lake Jan 7, 2025, 1:30 AM

#

ancient swan i mean, not only that, but it also might give more room for further improvements

Yea

#

SLM are getting good too

#

Btw, did u ever train a LoRA locally?

ancient swan Jan 7, 2025, 1:31 AM

#

covert lake Btw, did u ever train a LoRA locally?

no

lean ridge Jan 7, 2025, 3:28 AM

#

its been way too long

#

whats new

solar torrent Jan 7, 2025, 3:30 AM

#

lean ridge whats new

AI Hub by Weights. boykissercat

polar flax Jan 7, 2025, 3:31 AM

#

unkempt current Hey @polar flax , can u say me whats the model that separate lead voca...

melroformer karaoke (by aufr)
mvsep male/female separation (for duets, better than Sucial's but may not be perfect)
melroformer dereverb (by anvuew)

btw u can try the tweaked version by me: https://colab.research.google.com/drive/1IC6Q1hLF55_tK6mhky0SWYKGVF9T5WsY?usp=drive_link#scrollTo=vKOCPJkyw9yh

Google Colab

barren kiln Jan 7, 2025, 4:23 AM

#

why didn't it work for me, I can stupidly hear my real voice

#

Please help me

plain cove Jan 7, 2025, 4:30 AM

#

@barren kiln -> #🌏│русский

barren kiln Jan 7, 2025, 4:30 AM

#

plain cove @barren kiln -> #🌏│русский

ok

plain cove Jan 7, 2025, 4:30 AM

#

aah

barren kiln Jan 7, 2025, 4:31 AM

#

I've already written there.

plain cove Jan 7, 2025, 4:31 AM

#

or u can write #🔍│help-w-okada

plain cove Jan 7, 2025, 4:31 AM

#

barren kiln I've already written there.

so i saw sowwy

#

Maybe you have "monitor" turned on or something like that + You need a virtual cable

#

that's all i know 😅

barren kiln Jan 7, 2025, 4:39 AM

#

plain cove + Maybe you have "monitor" turned on or something like that + You need a virtual...

should I put a virtual cable in the "monitor" option?

plain cove Jan 7, 2025, 4:40 AM

#

no, monitor is for you to listen to yourself

#

Honestly, I'm not talking about this 😅

#

I'm not a master

barren kiln Jan 7, 2025, 4:41 AM

#

plain cove no, monitor is for you to listen to yourself

Like there's a dash in the "monitor" option?

barren kiln Jan 7, 2025, 4:41 AM

#

plain cove Honestly, I'm not talking about this 😅

(

plain cove Jan 7, 2025, 4:41 AM

#

Hm..

#

I think I had instructions somewhere

#

wait

#

https://rentry.co/VoiceChangerGuide ? 😅

Guide for W-Okada's RealTimeVoiceChangerClient

Guide Written by:
Github - VTArcelia
Discord User - https://discord.com/users/824922747423031359 aka VTArcelia
Thanks to the following people : lusbert, poopmaster, felt, fazemasta, antasma, shadictl, x_hina, sushi
thanks are for anything added to guide, taken from any talks, settings added when...

#

#

barren kiln Jan 7, 2025, 4:53 AM

#

plain cove

what should I do if I have already clicked on the "passthru" button?

plain cove Jan 7, 2025, 4:54 AM

#

honestly im not sure sowwy

#

wait for the others ok?

barren kiln Jan 7, 2025, 4:55 AM

#

plain cove wait for the others ok?

ok

plain cove Jan 7, 2025, 4:55 AM

#

barren kiln ok

https://tenor.com/view/furry-gif-21204972

Tenor

solar torrent Jan 7, 2025, 5:43 AM

#

barren kiln what should I do if I have already clicked on the "passthru" button?

The "passthru" button outputs your actual microphone audio instead of the converted voice audio.

tacit isle Jan 7, 2025, 6:32 AM

#

can you download ur weights.gg models you trained? thanks for not answering, yes you can

spiral timber Jan 7, 2025, 10:53 AM

#

https://www.theverge.com/2025/1/6/24337530/nvidia-ces-digits-super-computer-ai

pliant laurel Jan 7, 2025, 11:45 AM

#

Does anyone know if you can train a UVR model for a specific artist?

hollow tangle Jan 7, 2025, 11:45 AM

#

pliant laurel Does anyone know if you can train a UVR model for a specific artist?

no

pliant laurel Jan 7, 2025, 11:45 AM

#

hollow tangle no

Ah dang

hollow tangle Jan 7, 2025, 11:45 AM

#

https://tenor.com/view/ah-dang-it-butters-stotch-south-park-s14e2-scrotie-mcboogerballs-gif-22863672

Tenor

pliant laurel Jan 7, 2025, 11:46 AM

#

hollow tangle no

What about just training your own uvr model?

#

can you do that?

hollow tangle Jan 7, 2025, 11:46 AM

#

ofc you can do that but its super complex

alpine granite Jan 7, 2025, 12:03 PM

#

pliant laurel Does anyone know if you can train a UVR model for a specific artist?

you're better off asking the audio separation server though yeah you can

hollow tangle Jan 7, 2025, 12:03 PM

#

alpine granite you're better off asking the audio separation server though yeah you can

but it will be shit

alpine granite Jan 7, 2025, 12:03 PM

#

theres different architectures though

#

i tried doing a vocaloid one like almost half a decade ago and it was decent lol

hollow tangle Jan 7, 2025, 12:04 PM

#

because youre skilled

alpine granite Jan 7, 2025, 12:04 PM

#

too bad i lost the data. probably would be better off trained on newer architectures anyway

hollow tangle Jan 7, 2025, 12:05 PM

#

v6

pliant laurel Jan 7, 2025, 12:05 PM

#

alpine granite you're better off asking the audio separation server though yeah you can

thank you, do you have a link for it?

alpine granite Jan 7, 2025, 12:07 PM

#

pliant laurel thank you, do you have a link for it?

AVbr9KV3kw

#

just put that on here

still nacelle Jan 7, 2025, 12:12 PM

#

is this main chat

ancient swan Jan 7, 2025, 12:13 PM

#

yes

still nacelle Jan 7, 2025, 12:13 PM

#

k

alpine granite Jan 7, 2025, 12:16 PM

#

maybe

still nacelle Jan 7, 2025, 12:33 PM

#

Do you guys know how to Change the sound files in Minecraft

hollow tangle Jan 7, 2025, 12:34 PM

#

still nacelle Do you guys know how to Change the sound files in Minecraft

with a pack

still nacelle Jan 7, 2025, 12:34 PM

#

hm

#

I'm going to try to make a villager say some random things

buoyant escarp Jan 7, 2025, 12:57 PM

#

i am wondering, currently ai voice models don't do well on non-vocal things (cough, sneeze, moans, etc). Is it because of the lack of data on those sounds? Would they be able to do it if we provided enough data of those when training the model? like 5-10 minutes dedicated to only noises like those?

hollow tangle Jan 7, 2025, 12:58 PM

#

buoyant escarp i am wondering, currently ai voice models doesn't do good on non-vocal things (c...

maybe its too much, but also depends on the pretrain, probably not much data present in that

buoyant escarp Jan 7, 2025, 12:58 PM

#

hollow tangle maybe its too much, but also depends on the pretrain, probably not many data pre...

so around 2-3 mins?

#

so like 10 mins of speaking

hollow tangle Jan 7, 2025, 12:58 PM

#

maybe even less

buoyant escarp Jan 7, 2025, 12:58 PM

#

huh

hollow tangle Jan 7, 2025, 12:59 PM

#

i usually do 3 cough, 5 laugh and 3 sneeze if i can

#

not minutes, literal ones

buoyant escarp Jan 7, 2025, 1:00 PM

#

huh

#

ooh

#

is that enough for the ai's to do coughs and sneeze? owo

hollow tangle Jan 7, 2025, 1:00 PM

#

in my experience

buoyant escarp Jan 7, 2025, 1:01 PM

#

do you have any examples that i can listen to if i may ask?

hollow tangle Jan 7, 2025, 1:01 PM

#

not public, sorry

buoyant escarp Jan 7, 2025, 1:01 PM

#

ahh okay

hollow tangle Jan 7, 2025, 1:01 PM

#

i usually dont give people models away

buoyant escarp Jan 7, 2025, 1:01 PM

#

oh its a comms?

hollow tangle Jan 7, 2025, 1:01 PM

#

yup

buoyant escarp Jan 7, 2025, 1:02 PM

#

nono im not asking for the models, i was asking for a sample short output (mp3s?)of the models coughing/sneezing but i think thats still private property hahah

hollow tangle Jan 7, 2025, 1:02 PM

#

ohhhhhh

buoyant escarp Jan 7, 2025, 1:02 PM

#

yes

hollow tangle Jan 7, 2025, 1:02 PM

#

no i dont save those

buoyant escarp Jan 7, 2025, 1:02 PM

#

ahh

#

okioki

#

then can i ask about your training settings? is mangio crepe 32 hop length still the best?

#

or did the community discover something better?

hollow tangle Jan 7, 2025, 1:04 PM

#

nah use rmvpe

buoyant escarp Jan 7, 2025, 1:04 PM

#

rmvpe is better?

#

is it better for talking only or in general use model like singing too

hollow tangle Jan 7, 2025, 1:04 PM

#

in my opinion

#

it works best

buoyant escarp Jan 7, 2025, 1:04 PM

#

how many steps usually and how long is the data you usually use?

hollow tangle Jan 7, 2025, 1:05 PM

#

depends on how much data i have

#

less than 10 minutes 300 epochs, less than 20 500 and then i check if everything is okay

buoyant escarp Jan 7, 2025, 1:05 PM

#

optimally how much / how many minutes do you prefer

#

because i usually do 10 mins 600 steps

hollow tangle Jan 7, 2025, 1:05 PM

#

depends for the use

#

do you want for realtime?

buoyant escarp Jan 7, 2025, 1:06 PM

#

both usage but mostly for inference not realtime

#

do you have different settings for both?

#

or is there a good setting for both usage at once?

hollow tangle Jan 7, 2025, 1:07 PM

#

for normal inference id say 10 minutes is enough for a good result, sure not PERFECT but you still have a good result

buoyant escarp Jan 7, 2025, 1:07 PM

#

sorry for the many questions, the way i use these are the old ways like around a year ago , i don't know if there are much findings hahah

hollow tangle Jan 7, 2025, 1:07 PM

#

no prob i love answering when i can :)

buoyant escarp Jan 7, 2025, 1:07 PM

#

hollow tangle for normal inference id say 10 minutes is enough for a good result, sure not PER...

10 mins with 500 steps?

hollow tangle Jan 7, 2025, 1:08 PM

#

300 otherwise theres overtraining issue, also epochs not steps

buoyant escarp Jan 7, 2025, 1:08 PM

#

i want to make a model where it is good for both singing and speaking(inference, not realtime) , and is capable of non-vocal noises like coughs

hollow tangle Jan 7, 2025, 1:08 PM

#

and btw if you want really better results you should learn to use the tensorboard

buoyant escarp Jan 7, 2025, 1:08 PM

#

to see the overtraining?

hollow tangle Jan 7, 2025, 1:08 PM

#

buoyant escarp i want to make a model where it is good for both singing and speaking(inference,...

important thing is to check if the model is clean, the more clean it is the more you can do basically

hollow tangle Jan 7, 2025, 1:09 PM

#

buoyant escarp to see the overtraining?

or undertraining

buoyant escarp Jan 7, 2025, 1:09 PM

#

hollow tangle 300 otherwise theres overtraining issue, also epochs not steps

isnt epoch like, 10 epoch 200 steps meaning 1 epoch is 20 steps?

hollow tangle Jan 7, 2025, 1:09 PM

#

i think it changes everytime, i dont check steps tbh
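
(On epochs vs steps: one epoch is one full pass over the dataset, so the number of steps per epoch depends on how many training segments you have and on the batch size, which is why it changes from model to model. A minimal sketch with made-up numbers picked to match the "10 epochs, 200 steps" example above. Whether a trainer rounds the last partial batch up or drops it varies, rounding up is an assumption here.)

```python
import math


def steps_per_epoch(num_segments: int, batch_size: int) -> int:
    """Batches needed to see every training segment once (last partial batch kept)."""
    return math.ceil(num_segments / batch_size)


def total_steps(num_segments: int, batch_size: int, epochs: int) -> int:
    """Total optimizer steps after the given number of epochs."""
    return steps_per_epoch(num_segments, batch_size) * epochs


# Hypothetical dataset: 160 sliced segments, batch size 8.
print(steps_per_epoch(160, 8))   # 20
print(total_steps(160, 8, 10))   # 200
```

(Same epoch count with a bigger dataset or a smaller batch size means more steps, so comparing step counts between models only makes sense if those match.)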

buoyant escarp Jan 7, 2025, 1:09 PM

#

huh

#

thats weird-

hollow tangle Jan 7, 2025, 1:10 PM

#

again, im not sure, i suffer from severe memory loss :)

still nacelle Jan 7, 2025, 1:10 PM

#

yall know how to give villagers guns?

hollow tangle Jan 7, 2025, 1:10 PM

#

i used to code rvc but now i dont remember anything basically

hollow tangle Jan 7, 2025, 1:10 PM

#

still nacelle yall know how to give villagers guns?

not the right server for that

still nacelle Jan 7, 2025, 1:10 PM

#

oh

ionic pumice Jan 7, 2025, 1:20 PM

#

mods

solar torrent Jan 7, 2025, 1:21 PM

#

Believe it or not, I have the entire repository of Ilaria RVC Hugging Face space downloaded from Hugging Face. huggingface Baffled

hollow tangle Jan 7, 2025, 1:22 PM

#

why

solar torrent Jan 7, 2025, 1:22 PM

#

I thought this repo was the only repo of Ilaria RVC available. imdead

#

I tried to launch it with run.bat, but it won't launch. boohooh

hollow tangle Jan 7, 2025, 1:24 PM

#

LMFAO

#

you need to create a venv, install the dependencies and then python launch the app
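
(The three steps above, sketched with Python's standard library. The file names requirements.txt and app.py and the .venv folder name are assumptions, check what the repo you downloaded actually uses.)

```python
import os
import subprocess
import venv
from pathlib import Path


def venv_python(venv_dir: Path) -> Path:
    """Path to the interpreter inside the venv (layout differs on Windows)."""
    sub = "Scripts/python.exe" if os.name == "nt" else "bin/python"
    return venv_dir / sub


def launch_commands(repo_dir, requirements="requirements.txt", entry="app.py"):
    """Commands to install the dependencies and then start the app."""
    repo = Path(repo_dir).resolve()
    py = str(venv_python(repo / ".venv"))
    return [
        [py, "-m", "pip", "install", "-r", str(repo / requirements)],
        [py, str(repo / entry)],
    ]


def setup_and_launch(repo_dir):
    """Create the venv, install dependencies, launch the app."""
    repo = Path(repo_dir).resolve()
    venv.EnvBuilder(with_pip=True).create(repo / ".venv")
    for cmd in launch_commands(repo):
        subprocess.run(cmd, check=True, cwd=repo)
```

(Call setup_and_launch with the folder of the downloaded repo. Using the venv's own interpreter for every command means nothing gets installed into the system Python.)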

solar torrent Jan 7, 2025, 1:27 PM

#

Oh wait, I think I have an idea on how to launch this Ilaria RVC repo with the already installed Python instead of the one from C:/Program Files/Python311. monka

hollow tangle Jan 7, 2025, 1:28 PM

#

update us

buoyant escarp Jan 7, 2025, 1:32 PM

#

what is the difference between ilaria rvc and normal rvc

#

@.@

hollow tangle Jan 7, 2025, 1:32 PM

#

ilaria rvc is made by me

buoyant escarp Jan 7, 2025, 1:34 PM

#

hollow tangle ilaria rvc is made by me

well

#

Baffled

hollow tangle Jan 7, 2025, 1:35 PM

#

@covert lake

ancient swan Jan 7, 2025, 1:36 PM

#

ugh, i would ban but i'm so lazy to pull off that mods ban guideline thingy

hollow tangle Jan 7, 2025, 1:36 PM

#

wdym

#

i know yall lazy but i wasnt told why

solar torrent Jan 7, 2025, 1:37 PM

#

Finally, real.

hollow tangle Jan 7, 2025, 1:37 PM

#

wow

ancient swan Jan 7, 2025, 1:37 PM

#

hollow tangle i know yall lazy but i wasnt told why

basically now we should copy and paste official looking reasoning from the pins of specific channel

solar torrent Jan 7, 2025, 1:39 PM

#

Inspired by that environment.bat file inside Automatic1111 folder, so this is how I get the virtual environment path to Python done. voidblep2

hollow tangle Jan 7, 2025, 1:39 PM

#

ancient swan basically now we should copy and paste official looking reasoning from the pins ...

dang thats lame now i understand why reasonings are "not professional"

hollow tangle Jan 7, 2025, 1:39 PM

#

solar torrent Inspired by that `environment.bat` file inside Automatic1111 folder, so this is ...

ilaria rvc zero local is real

covert lake Jan 7, 2025, 1:40 PM

#

hollow tangle @covert lake

Huh what happened

hollow tangle Jan 7, 2025, 1:40 PM

#

deleted

#

the solution is pretty simple too, ill talk to vj

solar torrent Jan 7, 2025, 1:42 PM

#

hollow tangle ilaria rvc zero local is real

matsuripray

rare sorrelBOT Jan 7, 2025, 1:42 PM

#

✍ Suggestions

Search for it in AI HUB Docs or Applio Docs. You will probably find your answer there 📚
Ask for help in #🔍│help-w-okada if it's related to real time voice changing but make sure to read #1297207135469305866 first
Ask for help in #✨│ai-help for general help, but use the command !howtoask first to learn how to structure your question properly and increase your chances of getting a reply
Last but not least, ask for help in #🔍│help-ai-art if it's related to AI images.

solar torrent Jan 7, 2025, 1:42 PM

#

rare sorrel

hollow tangle Jan 7, 2025, 1:43 PM

#

solar torrent matsuripray

try to make it a precompiled trolley

solar torrent Jan 7, 2025, 1:43 PM

#

hollow tangle try to make it a precompiled trolley

Lmao. skullfacedistorted

tired patrol Jan 7, 2025, 1:52 PM

#

Hi guys, I'm trying to do something in rvc but I don't want to install the program again, so I'm using mimicpc. To use the voice I want, it says I should paste a link to a voice model to download. Does anyone know how to do this? I pasted one from github, for example, and it didn't work

hollow tangle Jan 7, 2025, 1:52 PM

#

tired patrol Hi guys, im trying to do something in rvc but i dont want to install again the p...

if you want to do it online i suggest ilaria rvc

#

https://docs.google.com/document/d/1YbXcLFPaGjhOdG5NFkK3QrucCEpHZBwFUxkeMO8aB18/edit?tab=t.0

Google Docs

Ilaria RVC Zero

Table Of Contents Introduction (with website link) Model Loader (Download & Upload) Inference (use RVC AI Voice Models) Ilaria TTS Settings (Inference) Vocal Separator (UVR) Extra Model Fusion Troubleshooting “No GPU is currently available for you after 60 seconds” “Where can I see my ZeroGPU ...

solar torrent Jan 7, 2025, 1:54 PM

#

I just got that Ilaria RVC done locally lmao. joe_cool

tired patrol Jan 7, 2025, 1:54 PM

#

hollow tangle https://docs.google.com/document/d/1YbXcLFPaGjhOdG5NFkK3QrucCEpHZBwFUxkeMO8aB18/...

THANK YOU!!!!!! I will try this

hollow tangle Jan 7, 2025, 1:55 PM

#

tired patrol THANK YOU!!!!!! I will try this

no problem!

solar torrent Jan 7, 2025, 1:59 PM

#

So not only did I download the Hugging Face one, I also downloaded another one from GitHub, which everyone here calls the mainline Ilaria RVC. This repo was made to run locally, yet it has been long outdated.

hollow tangle Jan 7, 2025, 2:02 PM

#

solar torrent So not only I downloaded the Hugging Face one, I've downloaded another one from ...

rest in development hell

tired patrol Jan 7, 2025, 2:11 PM

#

So after getting an error at first and then reloading the page it finally worked, Thanks 👍

hollow tangle Jan 7, 2025, 2:11 PM

#

tired patrol So after getting an error at first and then reloading the page it finally worked...

happy to hear that! no problem and have fun!

solar torrent Jan 7, 2025, 2:12 PM

#

Then I run two different Ilaria RVC GUIs at the same time. fefe

hollow tangle Jan 7, 2025, 2:14 PM

#

i haven't seen that ui in forever

swift berry Jan 7, 2025, 2:14 PM

#

does ilaria rvc still have the rmvpe+ for inference?

hollow tangle Jan 7, 2025, 2:15 PM

#

swift berry does ilaria rvc still have rmvpe+ for inference?

normal rmvpe

swift berry Jan 7, 2025, 2:15 PM

#

ah ok ok

solar torrent Jan 7, 2025, 2:17 PM

#

I just revived the Ilaria RVC project. trolley

hollow tangle Jan 7, 2025, 2:17 PM

#

solar torrent I just revived the Ilaria RVC project. trolley

resurrecting the deads

solar torrent Jan 7, 2025, 2:19 PM

#

If you're wondering how I got those files from Hugging Face even though there's no "download the entire repo" option to be found, I used huggingface_hub in Python to do it. catblush
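For anyone who wants to try the same thing, here is a minimal sketch, assuming the `huggingface_hub` package is installed (`pip install huggingface_hub`). `snapshot_download` is that library's function for fetching every file in a repo; the repo id and folder name in the usage comment are placeholders, not the actual Ilaria RVC Space, and the helper names are made up for illustration.

```python
from pathlib import Path


def build_snapshot_args(repo_id: str, local_dir: str, repo_type: str = "space") -> dict:
    """Collect the keyword arguments passed to huggingface_hub.snapshot_download."""
    allowed = {"model", "dataset", "space"}
    if repo_type not in allowed:
        raise ValueError(f"repo_type must be one of {sorted(allowed)}")
    return {
        "repo_id": repo_id,
        "repo_type": repo_type,
        "local_dir": str(Path(local_dir)),
    }


def download_whole_repo(repo_id: str, local_dir: str, repo_type: str = "space") -> str:
    """Download every file of a Hugging Face repo and return the local folder path."""
    # Imported lazily so the helper above can be used without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**build_snapshot_args(repo_id, local_dir, repo_type))


# Usage (placeholder repo id, needs network access):
# download_whole_repo("some-user/some-space", "ilaria_rvc_local")
```

Spaces need `repo_type="space"`; for a model repo, pass `repo_type="model"` instead.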

solar torrent Jan 7, 2025, 2:20 PM

#

hollow tangle resurrecting the deads

ionic pumice Jan 7, 2025, 2:30 PM

#

solar torrent Then I run two different Ilaria RVC GUIs at the same time. <:fefe:11595769110103...

is this not ila rvc old wtf\

solar torrent Jan 7, 2025, 2:41 PM

#

ionic pumice is this not ila rvc old wtf\

Yeah, that's why I said the mainline Ilaria RVC repo is currently outdated.

ionic pumice Jan 7, 2025, 2:48 PM

#

ah

#

lmfao

hollow tangle Jan 7, 2025, 2:49 PM

#

ionic pumice is this not ila rvc old wtf\

older than you

ionic pumice Jan 7, 2025, 2:50 PM

#

mikuv2

still nacelle Jan 7, 2025, 3:39 PM

#

a

hollow tangle Jan 7, 2025, 3:40 PM

#

b

glad nebula Jan 7, 2025, 3:41 PM

#

c

still nacelle Jan 7, 2025, 3:42 PM

#

d

plain cove Jan 7, 2025, 3:44 PM

#

E

still nacelle Jan 7, 2025, 3:47 PM

#

f

night lake Jan 7, 2025, 3:49 PM

#

g

still nacelle Jan 7, 2025, 3:50 PM

#

h

quaint jewel Jan 7, 2025, 3:54 PM

#

guys, any good AI agent that can read e-mails and sort/respond them with trained content ? (i am not very proficient in this field lmao)

#

the content would be, previous e-mail

stark scarab Jan 7, 2025, 3:58 PM

#

@covert lake sorry for ping but i think this is kinda important

HF only deducts the time the inference actually took, even if the requested GPU time is 60 seconds. My inferences take just 30s

covert lake Jan 7, 2025, 4:00 PM

#

stark scarab @covert lake sorry for ping but i think this is kinda important HF onl...

yoo that’s good

stark scarab Jan 7, 2025, 4:00 PM

#

yep

covert lake Jan 7, 2025, 4:00 PM

#

🔥

broken axle Jan 7, 2025, 4:19 PM

#

hi

stark scarab Jan 7, 2025, 4:20 PM

#

covert lake 🔥

The maximum file length to separate for UVR5 UI on HF seems to be 21-22 min (with the model already loaded). That takes 57-59 seconds to separate; file size doesn't matter (my test was 250MB)

#

still nacelle Jan 7, 2025, 4:21 PM

#

broken axle hi

we aint chat gpt

stark scarab Jan 7, 2025, 4:21 PM

#

I think that's insane ngl, 21-22 min of audio takes just 1 min to separate
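As a rough back-of-the-envelope check of those numbers, the sketch below turns them into a speed-relative-to-realtime figure. It assumes separation time scales linearly with audio length and ignores model loading; the function names are made up for illustration.

```python
def realtime_factor(audio_minutes: float, processing_seconds: float) -> float:
    """How many seconds of audio are processed per second of compute."""
    return (audio_minutes * 60.0) / processing_seconds


def max_audio_minutes(budget_seconds: float, factor: float) -> float:
    """Longest audio (in minutes) that fits in a time budget at a given speed."""
    return budget_seconds * factor / 60.0


# Midpoints of the figures reported above: ~21.5 min of audio in ~58 s.
factor = realtime_factor(21.5, 58.0)
print(f"about {factor:.1f}x realtime")
print(f"about {max_audio_minutes(60.0, factor):.1f} min fits in a 60 s request")
```

With these inputs the speed comes out at a bit over 22x realtime, which is consistent with a file of roughly 22 minutes being the ceiling for a 60-second request.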

#

hollow tangle Jan 7, 2025, 4:21 PM

#

stark scarab

eddyyyyy

#

please check #🔍│help-w-okada

#

people cant send images

stark scarab Jan 7, 2025, 4:23 PM

#

It was the lvl i think

hollow tangle Jan 7, 2025, 4:24 PM

#

probs

#

but should be fixed anyway

turbid cypress Jan 7, 2025, 5:49 PM

#

still nacelle h

i

stark scarab Jan 7, 2025, 6:31 PM

#

j

tired patrol Jan 7, 2025, 6:41 PM

#

How much does RVC take in disk space , I dont remember I think it was like 30gb (?

hollow tangle Jan 7, 2025, 6:47 PM

#

tired patrol How much does RVC take in disk space , I dont remember I think it was like 30gb...

mainline is 8GB iirc

tired patrol Jan 7, 2025, 6:47 PM

#

Are we talking about the same RVC from GitHub? the one with 25k stars?

hollow tangle Jan 7, 2025, 6:48 PM

#

tired patrol Are we talking about the same RVC from GitHub? the one with 25k stars?

i think so

tired patrol Jan 7, 2025, 6:51 PM

#

Well here we go again 😔

still nacelle Jan 7, 2025, 7:27 PM

#

k

quaint jewel Jan 7, 2025, 8:32 PM

#

chat

#

have you ever used Crew AI ?

#

if so

#

is it useful ?

empty swift Jan 7, 2025, 8:52 PM

#

I put a movie reference into a project I'm working on.
Output:

```
Audio shape: (32000,)
Transcribing audio...
Wake word detected. Ready to assist.
Listening...
Listening...
Transcribing audio...
You said:  What do you see when you close your eyes?
Stop phrase detected. Exiting.
```

topaz lake Jan 7, 2025, 9:27 PM

#

Does anyone know how to use Dolby Atmos stems to separate vocals? I searched on YouTube and couldn't find any tutorials.

thorn cedar Jan 7, 2025, 9:51 PM

#

where can i find a program to train my own model?

hollow tangle Jan 7, 2025, 9:51 PM

#

thorn cedar where can i find a program to train my own model?

what gpu do you have?

thorn cedar Jan 7, 2025, 9:52 PM

#

hollow tangle what gpu do you have?

rtx 3060ti 12 vram

hollow tangle Jan 7, 2025, 9:52 PM

#

thorn cedar rtx 3060ti 12 vram

use applio

thorn cedar Jan 7, 2025, 9:52 PM

#

thanks ;>

hollow tangle Jan 7, 2025, 9:53 PM

#

no problem

cedar crown Jan 7, 2025, 10:08 PM

#

Is there a place I can send my AI vídeos to you guys?

hollow tangle Jan 7, 2025, 10:08 PM

#

cedar crown Is there a place I can send my AI vídeos to you guys?

#🌇│ai-videos

have fun :)

river verge Jan 7, 2025, 10:17 PM

#

DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Such Gossip (Drum model no. 565)

drifting nacelle Jan 7, 2025, 10:40 PM

#

For anyone who thinks AGI is not available to us right now: please tell me how, if we gave for example Copilot current internet access, a Tesla bot body and a yottabyte (YB) of storage, it would not attain AGI within a few years?

#

What are some of the roadblocks that would prevent that?

tepid basin Jan 7, 2025, 10:52 PM

#

drifting nacelle For anyone that thinks AGI is not available to us right now. Please tell me how ...

It's not that we don't "think", it's that "it's literally not possible" 😭

unkempt current Jan 7, 2025, 11:33 PM

#

Hi! I have a question: is there any way to separate two voices that are singing together at the same time, like in a small two-person choir? I really wanted to do this for a specific song. I tried using a backing vocals separation model, imagining that one person's voice would be on the lead vocals stem and the other on the backing vocals, but I realized that this only works well when there is a clear separation between the main and backing vocals. When both people are singing together, with almost equal emphasis, I couldn't separate them. Do y'all know if there is any tool or technique that works in this case?

covert lake Jan 7, 2025, 11:34 PM

#

@ancient swan https://www.reddit.com/r/StableDiffusion/s/5dkRwW9WX3

From the StableDiffusion community on Reddit: Nvidia Compared RTX 5...

Explore this post and more from the StableDiffusion community

unkempt current Jan 7, 2025, 11:43 PM

#

unkempt current Hi! I have a question: is there any way to separate two voices that are singing ...

If anyone can help me just reply to this message

north remnant Jan 7, 2025, 11:53 PM

#

guys does anyone know anything about crayo ai

brisk gust Jan 8, 2025, 1:36 AM

#

anyone here ever used openrouter? does it have a minimum payment amount or can i add like a single dollar to it?

#

openrouter wont tell me and from that im assuming i could pay like a cent

polar flax Jan 8, 2025, 2:17 AM

#

brisk gust anyone here ever used openrouter? does it have a minimum paymen amount or can i ...

haven't heard of that random service

~~3840x2160 with DP2.1 UHBR20 is better than 5120x2880 with DP1.4 lmao https://www.tomshardware.com/monitors/gaming-monitors/acers-predator-x323qx-elevates-16-9-gaming-to-5k-at-144hz~~

ancient swan Jan 8, 2025, 2:18 AM

#

covert lake @ancient swan https://www.reddit.com/r/StableDiffusion/s/5dkRwW9WX3

yeah, that's why i'm saying that it's better to wait till youtubers do their benchmarks

covert lake Jan 8, 2025, 6:27 AM

#

ancient swan yeah, that's why i'm saying that it's better to wait till youtubers do their ben...

Yup

hollow tangle Jan 8, 2025, 9:08 AM

#

unkempt current Hi! I have a question: is there any way to separate two voices that are singing ...

you can try uvr5

#

or uvr ui

hollow tangle Jan 8, 2025, 9:10 AM

#

north remnant guys does anyone know anything about crayo ai

it seems the most lazy thing ever

solar torrent Jan 8, 2025, 2:16 PM

#

To promote your YouTube, go to #1159290752195633273.

covert lake Jan 8, 2025, 2:35 PM

#

#1159290752195633273

finite imp Jan 8, 2025, 2:48 PM

#

could anyone link me to uvr beta?

#

nvm found it

jagged juniper Jan 8, 2025, 4:53 PM

#

hey guys I am trying my luck with AI influencing but am pretty new to all the AI and Stable diffusion stuff. If anyone has worked around this, can you dm me ? I need some help

sand surge Jan 8, 2025, 5:01 PM

#

Hi. I can help you @jagged juniper

trail lake Jan 8, 2025, 5:04 PM

#

does anyone know how many songs you can make on weights.gg for free or is it unlimited?

stuck crane Jan 8, 2025, 5:56 PM

#

Does anyone have a collection of celebrity/character TTS weights for sale or anywhere to download them? Not voice to voice but Text to voice

stiff drum Jan 8, 2025, 6:43 PM

#

when it comes to training RVC models do i want to use the Mangio-RVC fork or some other software?

chilly lake Jan 8, 2025, 7:00 PM

#

Applio, unless you really want an outdated and unsupported Mangio fork

wild coral Jan 8, 2025, 7:03 PM

#

Best memes with ass

#

https://tenor.com/view/ice-age-scratch-nut-acorn-gif-6377058886590820641

dreamy ibex Jan 8, 2025, 9:16 PM

#

C

feral apex Jan 8, 2025, 10:21 PM

#

I hate you

unkempt current Jan 8, 2025, 11:36 PM

#

polar flax - melroformer karaoke (by aufr) - mvsep male/female separation (for duets, bette...

@polar flax sorry for pinging, but does the male/female separation model only apply to duets where one voice is male and the other is female? Or can it also be applied to a duet where both voices are female or both voices are male?

tepid basin Jan 9, 2025, 1:24 AM

#

@radiant canyon #1159290752195633273 please

solar torrent Jan 9, 2025, 2:58 AM

#

jagged juniper hey guys I am trying my luck with AI influencing but am pretty new to all the AI...

To get some help about using Stable Diffusion, you can go to #🔍│help-ai-art

hollow terrace Jan 9, 2025, 4:28 AM

#

phew i took the 3 good usernames

polar flax Jan 9, 2025, 5:14 AM

#

hollow terrace phew i took the 3 good usernames

yea dont use inappropriate names

granite onyx Jan 9, 2025, 8:17 AM

#

How can i download applio?

#

and train?

solar torrent Jan 9, 2025, 10:06 AM

#

granite onyx How can i download applio?

-rvc

rare sorrelBOT Jan 9, 2025, 10:06 AM

#

solar torrent -rvc

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

oblique marsh Jan 9, 2025, 11:21 AM

#

guys where to put epochs in voice changer?

chilly lake Jan 9, 2025, 11:28 AM

#

oblique marsh guys where to put epochs in voice changer?

same place you put 87 gas in a car engine

solar torrent Jan 9, 2025, 11:30 AM

#

trolley

oblique marsh Jan 9, 2025, 11:31 AM

#

but actually where?

chilly lake Jan 9, 2025, 11:32 AM

#

epoch is a property of a model.. how long it was trained

#

voice changer does not give a f

#

it is for you to decide whether the model is good to use or not

#

"300 epochs" means nothing without knowing how big the dataset was. 300 epoch on 10 hour set is crazy high, 300 epoch on 20 sec set is polishing a turd
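To put rough numbers on that point: one epoch is one full pass over the dataset, so the amount of audio the trainer has processed is the epoch count times the dataset length. The sketch below only illustrates why an epoch count is not comparable across datasets; it says nothing about the resulting quality, and the function name is made up for illustration.

```python
def total_audio_seen_hours(epochs: int, dataset_seconds: float) -> float:
    """Total audio processed during training: passes over the set times its length."""
    return epochs * dataset_seconds / 3600.0


long_set = total_audio_seen_hours(300, 10 * 3600)  # 300 epochs on a 10-hour dataset
short_set = total_audio_seen_hours(300, 20)        # 300 epochs on a 20-second dataset

print(f"10 h set:  {long_set:.0f} hours of audio processed")
print(f"20 s set:  {short_set * 60:.0f} minutes of audio processed")
```

The same "300 epochs" works out to 3000 hours of audio in one case and 100 minutes in the other, which is why the dataset size has to be known before the number means anything.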

wet frigate Jan 9, 2025, 1:07 PM

#

did any changes happen for AMD gpu users?

or should i still use the Online Realtime one

#

last time i tried to do realtime ai voice changer it went horrible xd or not that great

quaint jewel Jan 9, 2025, 1:14 PM

#

chat, wtf is huggingface ?

solar torrent Jan 9, 2025, 1:16 PM

#

https://cdn.discordapp.com/emojis/1016021422809817160.webp?size=48

#

Hugging Face is a website that hosts repositories (models, datasets and apps called Spaces), similar to GitHub.

quaint jewel Jan 9, 2025, 1:17 PM

#

got it, tks

worthy coyote Jan 9, 2025, 2:06 PM

#

facehugger

exotic burrow Jan 9, 2025, 3:29 PM

#

lime raven Jan 9, 2025, 3:49 PM

#

its been so long since ive seen a real queue in weights.gg

#

what happened? did some of the servers get put offline?

jovial veldt Jan 9, 2025, 4:52 PM

#

lime raven its been so long since ive seen a real queue in weights.gg

Lol me too. Today I created a model, and later it showed me these same queues for other AI covers. I thought it was my fault for creating a model earlier, but now I can see that I'm not the only one with this problem

#

Oh, also: Weights removed its shop option and the vocal remover option. Will they come back?

pseudo swallow Jan 9, 2025, 5:45 PM

#

hi

quaint jewel Jan 9, 2025, 5:58 PM

#

chat, do you know models that are able to generate product photos ? like levels photoai, but focused in products ?

heady helm Jan 9, 2025, 6:26 PM

#

quaint jewel chat, do you know models that are able to generate product photos ? like levels ...

chatgpt LOL

quaint jewel Jan 9, 2025, 6:27 PM

#

it works with existing products ? cause everytime i give it something it hallucinates and return some variation of what i gave

queen kernel Jan 9, 2025, 6:40 PM

#

pseudo swallow hi

Hlo

drifting trellis Jan 9, 2025, 8:27 PM

#

Uhh

drifting trellis Jan 9, 2025, 8:28 PM

#

jovial veldt Lol me too, today i created a model, and later show me this same queques for the...

Hmm

viscid cypress Jan 9, 2025, 9:41 PM

#

drifting trellis Uhh

https://tenor.com/view/ahem-excuseme-rihanna-what-wut-gif-21329285

safe flume Jan 9, 2025, 11:07 PM

#

@hidden grotto

hidden grottoBOT Jan 9, 2025, 11:07 PM

#

safe flume @hidden grotto

:wave: @safe flume, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

polar flax Jan 10, 2025, 12:18 AM

#

viscid cypress https://tenor.com/view/ahem-excuseme-rihanna-what-wut-gif-21329285

https://tenor.com/view/valsfavoritegifs-gif-16718928798951018101

Tenor

ruby bronze Jan 10, 2025, 1:04 AM

#

does anyone have the cosmic kid AI?

trim pecan Jan 10, 2025, 1:04 AM

#

@hidden grotto

hidden grottoBOT Jan 10, 2025, 1:04 AM

#

trim pecan @hidden grotto

:wave: @trim pecan, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

ruby bronze Jan 10, 2025, 1:05 AM

#

trim pecan <@1138318590760718416>

thanks, ty

solar torrent Jan 10, 2025, 1:23 AM

#

quaint jewel chat, do you know models that are able to generate product photos ? like levels ...

Stable Diffusion

still nacelle Jan 10, 2025, 1:41 AM

#

I made the villager sing "fly me to the moon" 😭

minor crystal Jan 10, 2025, 1:49 AM

#

hey, just to be clear, okada is the real time voice changer and rvc is speech to speech? sorry if this is dumb im new to all of this

sharp lotus Jan 10, 2025, 2:17 AM

#

@hidden grotto

hidden grottoBOT Jan 10, 2025, 2:17 AM

#

sharp lotus @hidden grotto

:wave: @sharp lotus, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

weak plinth Jan 10, 2025, 3:08 AM

#

so i want to do AI covers again and i found UVR Roformer, what do you recommend for separating vocals?

UVR 5.6 i use MDX stuff

weak plinth Jan 10, 2025, 3:09 AM

#

minor crystal hey, just to be clear, okada is the real time voice changer and rvc is speech to...

RVC also has realtime VC, but RVC just like have both VC & Inference (not Applio)

minor crystal Jan 10, 2025, 3:32 AM

#

weak plinth RVC also has realtime VC, but RVC just like have both VC & Inference (not Applio...

so i need to find an rvc app where i can locally transfer an mp3 file of my original voiceover to the new v/o?

stark scarab Jan 10, 2025, 4:06 AM

#

weak plinth so i want do AI Cover again and then i found UVR Roformer, what reccomend to use...

#

https://docs.ai-hub.wtf/rvc/resources/dataset-isolation/#the-best-models-for-uvr-are

Dataset & Isolation

Last update: Dec 24, 2024

polar flax Jan 10, 2025, 4:23 AM

#

stark scarab

you can also consider:

vocals: unwa's beta 5e or becruily's vocals
inst: v1/v1e or becruily's

spark agate Jan 10, 2025, 4:27 AM

#

would someone here help me de reverb something?

stark scarab Jan 10, 2025, 5:01 AM

#

polar flax you can also consider: - vocals: unwa's beta 5e or becruily's vocals - inst: v1/...

Do Becruily's models need the updated bs/mel roformer scripts?

polar flax Jan 10, 2025, 5:05 AM

#

stark scarab Do Becruily's models need the updated bs/mel roformer scripts?

nothing different from other roformer models

buoyant escarp Jan 10, 2025, 5:53 AM

#

is there any model that can split songs that have multiple singers now in UVR? like 2 lead vocals?

buoyant escarp Jan 10, 2025, 6:33 AM

#

Hello again, i tried installing roformer model to UVR but there is no Roformer Model check button to click when choosing the model param.
I have installed Anjok's https://github.com/Anjok07/ultimatevocalremovergui/tree/v5.6.0_roformer_add%2Bdirectml?tab=readme-ov-file

#

I have tried reinstalling too @.@

polar flax Jan 10, 2025, 6:37 AM

#

buoyant escarp Hello again, i tried installing roformer model to UVR but there is no Roformer M...

full install: https://github.com/Anjok07/ultimatevocalremovergui/releases/download/v5.6/UVR_12_8_24_23_30_BETA_rofo_full_install.exe
patch only if you have the base one: https://github.com/TRvlvr/model_repo/releases/download/uvr_update_patches/UVR_Patch_12_3_24_1_18_BETA_small_patch_rofo.exe

buoyant escarp Jan 10, 2025, 6:38 AM

#

polar flax full install: https://github.com/Anjok07/ultimatevocalremovergui/releases/downlo...

i tried installing the patch and it resulted in an error, cannot import name rename_privateuse1_backend

#

ill try fully installing

dim crater Jan 10, 2025, 6:39 AM

#

hlo my name is madhav i want to lern about ai things can you help me please

elder willow Jan 10, 2025, 7:03 AM

#

dim crater hlo my name is madhav i want to lern about ai things can you help me please

I want to learn *inserts a broad topic without specifying details* can you help me please

#

Wdym by ai things 💀

buoyant escarp Jan 10, 2025, 7:09 AM

#

hello, i have problem with Anvuew mel dereverb v2 where it gives an error that says
RuntimeError: "The size of tensor a (352768) must match the size of tensor b (352800) at non-singleton dimension 1"

and where can i find Mel roformer karaoke model?

gilded sierra Jan 10, 2025, 7:52 AM

#

someone covers songs from a live song to a sunoAI cover

alpine granite Jan 10, 2025, 8:20 AM

#

lol

#

it is funny that it ends up beating the actual repo on google search

covert lake Jan 10, 2025, 8:24 AM

#

alpine granite it is funny that it ends up beating the actual repo on google search

L

rapid dragon Jan 10, 2025, 9:18 AM

#

so im stuck with the file for the voice changer

gray rover Jan 10, 2025, 9:19 AM

#

rapid dragon so im stuck with the file for the voice changer

Stuck in what way?

#

You gotta provide more details

#

In case you're not using it, you should definitely go for the fork;
https://rentry.co/ForkVoiceChangerGuide

The website has all the information on how to use it and so on.

Guide for deiteris' optimized W-Okada RealTime Voice Changer Client...

Guide style is in the same as Blanc_dot's. Thanks Blanc_dot for corrections. Most technical information comes from deiteris.
Last update December 12: NEW UPDATE VERSION b2332
Translations added for:
German: https://rentry.co/ForkVoiceChangerGuide_de
Turkish: https://rentry.co/ForkVoiceChangerGuid...

#

Other than that, you should check #🔍│help-w-okada if it's about real-time voice changer support.

river verge Jan 10, 2025, 11:13 AM

#

DIlly ding, dilly dong! A new RegalHyperus drum model just released!
Skyfall (Drum model no. 566)

solar torrent Jan 10, 2025, 11:23 AM

#

alpine granite it is funny that it ends up beating the actual repo on google search

Minecraft Fandom over Minecraft Wiki moment. skullfacedistorted

west oyster Jan 10, 2025, 11:36 AM

#

does anyone know how to turn long form scripts into an AI voiceover?

elder willow Jan 10, 2025, 12:13 PM

#

west oyster does anyone know how to turn long form scripts into an AI voiceover?

Maybe break the script down into parts and then use ai on each one ChillBar_shrug i never used ai in development btw so idk

solar torrent Jan 10, 2025, 12:26 PM

#

west oyster does anyone know how to turn long form scripts into an AI voiceover?

Do you mean text to speech (TTS) or some kind of AI program that generates an AI voice?

west oyster Jan 10, 2025, 12:27 PM

#

i usually use elevenlabs but the text is just so long that it's not feasible to break it down into parts or anything like that, so im looking for ways or software that can make this work.

Just need my script to be readout by an AI voice

polar flax Jan 10, 2025, 12:30 PM

#

west oyster i usually use elevenlabs but the text is just so long so its not feasible to bre...

c'mon that's not so hard to break it down into each voiceline, and there's nothing better than doing it manually

west oyster Jan 10, 2025, 12:31 PM

#

polar flax c'mon that's not so hard to break it down into each voiceline, and there's nothi...

its 3 hr script 🙂
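For a script that long, the splitting itself can be automated. A minimal sketch, assuming the TTS tool accepts a limited number of characters per request: it cuts the text at sentence boundaries and packs sentences into chunks. The 2500-character default is a made-up placeholder (check the real limit of whatever service is used), and `split_script` is a name chosen for illustration.

```python
import re


def split_script(text: str, max_chars: int = 2500) -> list[str]:
    """Split a long script into chunks of at most max_chars, cutting at sentence ends.

    A single sentence longer than max_chars is kept whole rather than cut mid-sentence.
    """
    # Split after '.', '!' or '?' followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


# Tiny demonstration with an artificially small limit.
for number, chunk in enumerate(split_script("One. Two! Three? Four.", max_chars=10), start=1):
    print(number, chunk)
```

Each chunk can then be sent to the TTS one at a time and the resulting audio files joined in order in any audio editor.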

stark scarab Jan 10, 2025, 1:56 PM

#

polar flax nothing different from other roformer models

Do u know about that update on bs/mel roformer scripts? What's it for?

polar flax Jan 10, 2025, 1:59 PM

#

stark scarab Do u know about that update on bs/mel roformer scripts? What's it for?

perhaps compatibility with the phantom center model or something

stark scarab Jan 10, 2025, 1:59 PM

#

I see.

ancient swan Jan 10, 2025, 4:21 PM

#

stark scarab Do u know about that update on bs/mel roformer scripts? What's it for?

if you're talking about .py files then there are some new features to it, but so far no models that are trained with them yet

stark scarab Jan 10, 2025, 4:23 PM

#

ancient swan if you're talking about .py files then there are some new features to it, but so...

Oh, I thought new models needed to update those files to work

iron gazelle Jan 10, 2025, 4:46 PM

#

Hey guys, nice to meet you! My name is Ruan, and I'm from Brazil. I'm not a programmer, but I'd like to create a female voice to be my virtual assistant on WhatsApp. Can you help me figure out how to get a voice that speaks Portuguese without sounding like an AI?

gilded sierra Jan 10, 2025, 5:12 PM

#

anyone sunoai ?

brittle summit Jan 10, 2025, 5:53 PM

#

hey, anyone in here wants to connect in a call this evening? I'm a full time marketer looking to get into coding (nextjs/ts)

thin kite Jan 10, 2025, 6:07 PM

#

https://garticphone.com/ru/?c=2402c720c0

#

go play

summer valley Jan 10, 2025, 7:02 PM

#

Hello

queen kernel Jan 10, 2025, 8:06 PM

#

summer valley Hello

Hii

cloud python Jan 10, 2025, 9:41 PM

#

@opal marsh

opal marshBOT Jan 10, 2025, 9:41 PM

#

thatrandomdude.exe, My prefix is g.
I can't read messages here

cloud python Jan 10, 2025, 9:41 PM

#

wat

ionic pumice Jan 10, 2025, 9:53 PM

#

?

covert lake Jan 10, 2025, 10:19 PM

#

https://www.wired.com/story/bytedance-intern-best-paper-neurips/

WIRED

Former ByteDance Intern Accused of Sabotage Among Winners of Presti...

Keyu Tian and his coauthors won the Best Paper Award at the annual NeurIPS machine-learning conference for their work on a new technique for generating images. Some have objected to the decision.

leaden cave Jan 10, 2025, 10:55 PM

#

What wrong with this code

#

from datetime import datetime


class Task:
    def __init__(self, title, description, due_date):
        self.title = title
        self.description = description
        self.due_date = datetime.strptime(due_date, '%Y-%m-%d')
        self.completed = False

    def mark_completed(self):
        self.completed = True

    def __str__(self):
        status = "Completed" if self.completed else "Pending"
        return f"Task: {self.title} | Due: {self.due_date.date()} | Status: {status}"


class TaskTracker:
    def __init__(self):
        self.tasks = []

    def add_task(self, task):
        self.tasks.append(task)

    def remove_task(self, task_title):
        # Remove only the first task with a matching title.
        for task in self.tasks:
            if task.title == task_title:
                self.tasks.remove(task)
                break

    def get_pending_tasks(self):
        return [task for task in self.tasks if not task.completed]

    def get_overdue_tasks(self):
        today = datetime.today()
        # Deliberate mistake: Should be `task.due_date < today`
        return [task for task in self.tasks if task.due_date > today and not task.completed]

    def display_tasks(self):
        if not self.tasks:
            print("No tasks available.")
        else:
            for task in self.tasks:
                print(task)


# Example usage
if __name__ == "__main__":
    tracker = TaskTracker()

    # Adding tasks
    tracker.add_task(Task("Finish Project", "Complete the pending project module", "2025-01-15"))
    tracker.add_task(Task("Team Meeting", "Discuss project updates with the team", "2025-01-10"))
    tracker.add_task(Task("Submit Report", "Send the project report to the manager", "2025-01-05"))

    print("All Tasks:")
    tracker.display_tasks()

    print("\nPending Tasks:")
    for task in tracker.get_pending_tasks():
        print(task)

    print("\nOverdue Tasks:")
    for task in tracker.get_overdue_tasks():
        print(task)
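The comment inside `get_overdue_tasks` already names the logic problem: the filter uses `>` and therefore returns tasks that are due in the future. A minimal corrected sketch follows, trimmed down to the parts that matter. The optional `today` argument is an addition for illustration so the result does not depend on the day the code is run.

```python
from datetime import datetime


class Task:
    def __init__(self, title, due_date):
        self.title = title
        self.due_date = datetime.strptime(due_date, "%Y-%m-%d")
        self.completed = False


def get_overdue_tasks(tasks, today=None):
    """Return unfinished tasks whose due date is already in the past."""
    today = today or datetime.today()
    # Overdue means the due date is earlier than now, hence `<` rather than `>`.
    return [task for task in tasks if task.due_date < today and not task.completed]


tasks = [
    Task("Finish Project", "2025-01-15"),
    Task("Submit Report", "2025-01-05"),
]
reference_day = datetime(2025, 1, 10)  # fixed date so the output is reproducible
print([task.title for task in get_overdue_tasks(tasks, reference_day)])
# prints ['Submit Report']
```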

chilly lake Jan 11, 2025, 12:17 AM

#

leaden cave What wrong with this code

ask chatgpt, sheesh

solar torrent Jan 11, 2025, 2:02 AM

#

stiff ridge Jan 11, 2025, 2:05 AM

#

is weights.gg a real time or just a generator?

#

chat legit deader than my dog

#

no grammar

#

ahem

#

my bad

solar torrent Jan 11, 2025, 2:28 AM

#

stiff ridge is weights.gg a real time or just a generator?

#

Weights.gg uses RVC audio conversion. It is not a real-time voice changer service.

stark scarab Jan 11, 2025, 3:14 AM

#

solar torrent Weights.gg uses RVC audio conversion. It is not a real-time voice changer se...

not yet

#

solar torrent Jan 11, 2025, 3:22 AM

#

gray rover Jan 11, 2025, 4:48 AM

#

kittystare

night lake Jan 11, 2025, 4:51 AM

#

https://tenor.com/view/cat-flashbang-gif-1379499439817330079

Tenor

gray rover Jan 11, 2025, 5:00 AM

#

nails

polar flax Jan 11, 2025, 5:37 AM

#

https://tenor.com/view/emoji-12-emoji12-cat-stare-stare-cat-uhh-gif-9567425571520721039

Tenor

vagrant trout Jan 11, 2025, 7:31 AM

#

any voicechanger suggestions?

solar torrent Jan 11, 2025, 7:33 AM

#

vagrant trout any voicechanger suggestions?

Fork W-Okada.

vagrant trout Jan 11, 2025, 7:35 AM

#

solar torrent Fork W-Okada.

i dont rly understand where to go directly tho once im on the site i usually dont use github

solar torrent Jan 11, 2025, 7:35 AM

#

vagrant trout i dont rly understand where to go directly tho once im on the site i usually don...

Uh.

#

-realtime

rare sorrelBOT Jan 11, 2025, 7:35 AM

#

solar torrent -realtime

Interaction has expired, use the command again for a new interaction.

💻 Local Realtime RVC

solar torrent Jan 11, 2025, 7:35 AM

#

First link is fork W-Okada. This version runs better than the second link.

vagrant trout Jan 11, 2025, 7:38 AM

#

solar torrent First link is fork W-Okada. This version runs better than the second link.

oh ok where do i download

solar torrent Jan 11, 2025, 7:39 AM

#

vagrant trout oh ok where do i download

What? Download links and recommended settings are all in the guide.

#

If you have any problem about W-Okada, you can go to #🔍│help-w-okada.

vagrant trout Jan 11, 2025, 7:41 AM

#

solar torrent What? Download links and recommended settings are all in the guide.

oh ok ty for your time!

solar torrent Jan 11, 2025, 7:41 AM

#

vagrant trout oh ok ty for your time!

#

You're welcome. doggowave

last hull Jan 11, 2025, 9:20 AM

#

somebody make me a blair waldorf voice model and ill paypal u

gray rover Jan 11, 2025, 9:28 AM

#

last hull somebody make me a blair waldorf voice model and ill paypal u

You just gotta commission someone #1191429836321849435

solar torrent Jan 11, 2025, 9:28 AM

#

last hull somebody make me a blair waldorf voice model and ill paypal u

You can #1159289738314919936 here.

ancient swan Jan 11, 2025, 9:28 AM

#

last hull somebody make me a blair waldorf voice model and ill paypal u

#1159289738314919936 put a request here and make sure to get the model master to work on it

last hull Jan 11, 2025, 9:29 AM

#

okay, thanks everyone

solar torrent Jan 11, 2025, 9:31 AM

#

river sparrow Jan 11, 2025, 9:31 AM

#

How can i use these models for text to speech locally?

ancient swan Jan 11, 2025, 10:02 AM

#

river sparrow How can i use these models for text to speech locally?

they're not text to speech models

#

but applio has built in edge tts

river sparrow Jan 11, 2025, 10:41 AM

#

Can i use gpt sovits model in this

ionic pumice Jan 11, 2025, 10:41 AM

#

no

edgy bloomBOT Jan 11, 2025, 10:41 AM

#

Congratulations kar !!!!

Your Samurott is now level 50!

ionic pumice Jan 11, 2025, 10:41 AM

#

gpt sovits is it's own tts

river sparrow Jan 11, 2025, 10:43 AM

#

how to use it for tts tho

white schooner Jan 11, 2025, 10:46 AM

#

hello fellow humans

#

is my first time in gay chat

river verge Jan 11, 2025, 10:54 AM

#

DIlly ding, dilly dong! A new RegalHyperus drum model just released!
TV Off (Drum model no. 567)

onyx summit Jan 11, 2025, 11:35 AM

#

does anyone know a decently fast tts model that sounds decently realistic that i can use with ollama or smtg

crude delta Jan 11, 2025, 12:55 PM

#

how do i use the voice models

solar torrent Jan 11, 2025, 12:58 PM

#

crude delta how do i use the voice models

#

W-Okada the realtime voice changer or RVC the audio conversion program?

crude delta Jan 11, 2025, 1:00 PM

#

solar torrent W-Okada the realtime voice changer or RVC the audio conversion program?

so i downloaded the model

#

but unsure how to use it from there

fresh cave Jan 11, 2025, 1:01 PM

#

Hi

solar torrent Jan 11, 2025, 1:01 PM

#

crude delta but unsure how to use it from there

Realtime voice changer or the audio conversion program?

crude delta Jan 11, 2025, 1:01 PM

#

uhh

solar torrent Jan 11, 2025, 1:01 PM

#

Answer one.

crude delta Jan 11, 2025, 1:01 PM

#

looking for a tts

#

so prob audio conversion program

#

it's for a video

solar torrent Jan 11, 2025, 1:02 PM

#

crude delta Jan 11, 2025, 1:02 PM

#

if i need to talk to have it converted that's fine as well

solar torrent Jan 11, 2025, 1:02 PM

#

crude delta if i need to talk to have it converted that's fine as well

Applio can do TTS.

elder willow Jan 11, 2025, 1:26 PM

#

We need something like the photo model on weights.gg, where you can upload photos to get an exact likeness, but for music. You’d upload a singer’s songs, and the model would create new tracks that sound exactly like their style. Imagine hearing brand-new music from artists who’ve been gone for years. It’s just an idea, and there might be copyright stuff to figure out, but it would be incredible!

ancient swan Jan 11, 2025, 1:27 PM

#

elder willow We need something like the photo model on weights.gg, where you can upload photo...

that would create copyright issues

elder willow Jan 11, 2025, 1:27 PM

#

True :/

#

But it would've been cool

fiery grove Jan 11, 2025, 1:48 PM

#

covert lake Jan 11, 2025, 1:57 PM

#

crude delta looking for a tts

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: better than RVC for TTS. It's few-shot (models need just a lil training), but it can't use RVC models (and vice versa) and it's limited to English, Chinese & Japanese. If you wanna check it out, read https://docs.ai-hub.wtf/tts/gpt-sovits/

ElevenLabs (freemium): an easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it and it's mostly premium, but it's an easy way to get good quality TTS

FishSpeech: a zero-shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site

With RVC Models:

RVC is natively Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS: they use Microsoft Edge TTS to generate the audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

You can get Applio in our docs
Ilaria RVC Mainline is here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

Ilaria RVC Zero (running on an A100 GPU, fastest free RVC on cloud) and the guide
Applio UI Colab (with the Google Colab T4 GPU, free with a daily limit)
If you don't wanna use Edge TTS, you could try another TTS AI from our TTS index and use its output as the input in RVC
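
The advice above about matching the TTS voice to the RVC model's gender and language can be sketched roughly like this. It's only an illustration: `pick_tts_voice` and the `VOICES` list are made up for this example, and the dict keys (`ShortName`, `Gender`, `Locale`) are an assumption about how an Edge TTS voice list is shaped, so check them against whatever tool you actually use.

```python
# Hypothetical voice catalogue for illustration only; not a complete list.
VOICES = [
    {"ShortName": "en-US-GuyNeural", "Gender": "Male", "Locale": "en-US"},
    {"ShortName": "en-US-JennyNeural", "Gender": "Female", "Locale": "en-US"},
    {"ShortName": "ja-JP-NanamiNeural", "Gender": "Female", "Locale": "ja-JP"},
]


def pick_tts_voice(voices, language, gender):
    """Return the first voice whose locale starts with `language`
    and whose gender matches, or None if nothing fits."""
    for voice in voices:
        same_language = voice["Locale"].lower().startswith(language.lower())
        same_gender = voice["Gender"].lower() == gender.lower()
        if same_language and same_gender:
            return voice["ShortName"]
    return None


# Example: the RVC model is an English-speaking female voice.
print(pick_tts_voice(VOICES, "en", "female"))
```

The TTS audio generated with the chosen voice would then go through RVC as a normal speech-to-speech conversion.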

#

this could help wisemysticaltree

covert lake Jan 11, 2025, 1:59 PM

#

onyx summit does anyone know a decently fast tts model that sounds decently realistic that i...

well, maybe could try fish speech or F5

#

you can also check the index above ^^

#

There's also PiperTTS which is fast and lightweight, but wasn't added to the index since it's not really good quality

covert lake Jan 11, 2025, 2:01 PM

#

river sparrow How can i use these models for text to speech locally?

#🧬│ai-chat message

covert lake Jan 11, 2025, 2:02 PM

#

stiff ridge is weights.gg a real time or just a generator?

weights.gg uses RVC for inference (using models) on pre-recorded audio and for training (making) models, so no realtime yet

#

are you looking for realtime?

#

I could help you setup realtime if u tell me your PC GPU

regal gate Jan 11, 2025, 4:51 PM

#

Hiii I’m looking for the best option to voicechange pre-recorded audio

#

im very new to this

#

but would be so cool to be able to sing something with my own voice and then just change the voice to any voice model

#

and is there a way to do it directly in Ableton Live? like as a VST

minor blade Jan 11, 2025, 5:00 PM

#

regal gate and is there a way to do it directly in Ableton Live? like as a VST

Nope, that's not possible.

minor blade Jan 11, 2025, 5:01 PM

#

regal gate but would be so cool to be able to sing something with my own voice and then jus...

But in fact this is possible with RVC.

#

Lemme hand you the docs.

regal gate Jan 11, 2025, 5:01 PM

#

Okay thankk you so much

minor blade Jan 11, 2025, 5:01 PM

#

regal gate Okay thankk you so much

A question, what's your GPU?

regal gate Jan 11, 2025, 5:01 PM

#

also would love to know where is the best place to find models for it

#

are they called RVC models?

minor blade Jan 11, 2025, 5:02 PM

#

regal gate are they called RVC models?

Yep.

minor blade Jan 11, 2025, 5:02 PM

#

regal gate also would love to know where is the best place to find models for it

You can use the #1175430844685484042 channel.

#

what

untold nebula Jan 11, 2025, 5:03 PM

#

aint it a ai pic generator?

minor blade Jan 11, 2025, 5:03 PM

#

regal gate Okay thankk you so much

https://docs.ai-hub.wtf/

Home

Last update: Oct 21, 2024

#

There you have the guides. You got either the option of installing RVC locally (if you got a nice GPU) or just using online colab/kaggle versions.

untold nebula Jan 11, 2025, 5:03 PM

#

okay thank you

regal gate Jan 11, 2025, 5:04 PM

#

my GPU is 2060S Dual Evo

#

is it a nice GPU

minor blade Jan 11, 2025, 5:06 PM

#

regal gate is it a nice GPU

Welp, i guess you can try installing RVC locally.

#

Either install mainline or applio.

regal gate Jan 11, 2025, 5:06 PM

#

minor blade Either install mainline or applio.

what is that?

#

is it free

minor blade Jan 11, 2025, 5:06 PM

#

regal gate what is that?

RVC forks. Welp, the og version and an alt version with various extra features.

minor blade Jan 11, 2025, 5:06 PM

#

regal gate is it free

Yes.

regal gate Jan 11, 2025, 5:06 PM

#

amazing

#

Thanks!

#

is there a way to search somewhere for specific types of voice models?

#

like search keywords

#

like “cute”

#

female vocal

minor blade Jan 11, 2025, 5:07 PM

#

regal gate like “cute”

Umm.. nop.

regal gate Jan 11, 2025, 5:07 PM

#

deep male vocal

#

No oki

minor blade Jan 11, 2025, 5:08 PM

#

regal gate deep male vocal

Welp you can try.

#

If there isn't a model of the voice you're looking for, you can request it by making a post in #1159289738314919936 or commission it from any model master in #1191429836321849435

regal gate Jan 11, 2025, 5:11 PM

#

Thank you! This helps a lot

flat urchin Jan 11, 2025, 6:04 PM

#

@dry lotus

#

i actually despise you

#

for stealing my name

cursive tangle Jan 11, 2025, 6:10 PM

#

hi

regal gate Jan 11, 2025, 6:24 PM

#

minor blade Yes.

and is there a way to do all these things when im only on my iPhone?

worthy coyote Jan 11, 2025, 6:24 PM

#

no

regal gate Jan 11, 2025, 7:12 PM

#

Oki

minor blade Jan 11, 2025, 10:12 PM

#

regal gate and is there a way to do all these things when im only on my iPhone?

I don't think so..

gentle trench Jan 11, 2025, 11:44 PM

#

minor blade I don't think so..

what is this and what do I click

#

I have never seen this before

lusty yarrow Jan 12, 2025, 1:13 AM

#

can anyone help with how to use rvc on discord?

elder willow Jan 12, 2025, 1:17 AM

#

Hello

#

does anyone know how to make AI models?

regal gate Jan 12, 2025, 1:23 AM

#

elder willow does anyone know how to make AI models?

I want to know this too

wind stratus Jan 12, 2025, 1:36 AM

#

elder willow does anyone know how to make AI models?

rvc?

#

-guides

rare sorrelBOT Jan 12, 2025, 1:36 AM

#

wind stratus -guides

📚 Documentation

AI HUB Docs

https://docs.ai-hub.wtf

🍏 Applio Docs

https://docs.applio.org/

✨ More guides

How to use RVC Mainline Colab by Cauthess
AICoverGen Colab Guide by Eddy (Spanish Helper)
Create a model with RVC disconnected (colab) by Angetyde

wind stratus Jan 12, 2025, 1:37 AM

#

if it's too complicated, I'd go straight to weights.gg to make the model, it's free

elder willow Jan 12, 2025, 1:45 AM

#

but the model I need is for singing

#

with my voice

#

not for talking

#

does anyone know?

#

I'll pay whoever knows

steady surge Jan 12, 2025, 2:30 AM

#

Hello bitches

#

is mainline or applio the way to go rn?

wind stratus Jan 12, 2025, 2:45 AM

#

elder willow but the model I need is for singing

the guides and the option on weights are exactly for that

steady surge Jan 12, 2025, 2:45 AM

#

marlbro

stark scarab Jan 12, 2025, 2:52 AM

#

Spanish chat lfg

#

@elder willow @regal gate check the guides they sent you

#

They explain the steps to do it locally or in the cloud

elder willow Jan 12, 2025, 3:03 AM

#

ok thanks

steady surge Jan 12, 2025, 3:05 AM

#

stark scarab Spanish chat lfg

Is Applio or RVC better right now, fat horse

#

?

stark scarab Jan 12, 2025, 3:09 AM

#

steady surge Is Applio or RVC better right now, fat horse

Applio ig

stark scarab Jan 12, 2025, 3:09 AM

#

elder willow ok thanks

If you have any questions, just let me know

steady surge Jan 12, 2025, 3:10 AM

#

guessing isnt enough, i need my rvc cover to be clean enough to caress and lick

ionic pumice Jan 12, 2025, 3:12 AM

#

https://cdn.discordapp.com/emojis/1311429177840369664.webp?size=48&name=hur

solar torrent Jan 12, 2025, 3:12 AM

#

https://cdn.discordapp.com/emojis/1015964749470650400.webp?size=48

velvet drift Jan 12, 2025, 7:45 AM

#

does anyone know where to find really high quality rvc voices? i just need the .pth files. i've tried hugging face rvcmodels and this discord but i can't find one like david attenborough. if anyone were to try to use a voice similar to david attenborough or any of the famous documentary narrators, how would they go about it without obviously using eleven labs?
Currently i have my voice but i couldn't find any good voices with australian accents to combine with mine

solar torrent Jan 12, 2025, 7:46 AM

#

velvet drift does anyone know where to find really high quality rvc voice's i just need the ....

That's too long to read. But I think you wanna like find the best voice model here? skullfacedistorted

velvet drift Jan 12, 2025, 7:47 AM

#

sort of. by voice model do you mean like mangio, ilaria, or a specific voice? i'm looking for a particular voice that is just high quality, but yes i guess if there is a good voice model i could use that instead

solar torrent Jan 12, 2025, 7:48 AM

#

I don't know any generic voice model that sounds better for your needs. Voice models here are full of fictional and famous people, all of them are fanmade.

velvet drift Jan 12, 2025, 7:49 AM

#

oh right yeah, well i was trying to find one that was high quality in the voice models section, but a lot of them were bad and had glitches and bg audio. i thought there would be like an area that has more high quality ones, but if not i guess i can just keep looking to hopefully find a good quality one

vast lantern Jan 12, 2025, 7:53 AM

#

what is the best ai for video. is there any that do it locally through some kind of software? i just feel like they all cost too much

dapper ginkgo Jan 12, 2025, 10:37 AM

#

vast lantern what is the best ai for video. is there any that do it locally through some kind...

If you want local it can get expensive as fuck

#

But there are free sites that do video generation but you have a very long queue and mostly only limited per day

polar flax Jan 12, 2025, 10:41 AM

#

vast lantern what is the best ai for video. is there any that do it locally through some kind...

it is too demanding for consumer gpus. RTX 3090/4090 may be the bare minimum with reduced settings, but workstation gpus with over 24 GB are recommended

#

so better option is to rent some A100/H100, or yea even video gen services have long queue cuz too many ppl using it

solar torrent Jan 12, 2025, 10:46 AM

#

Some NVIDIA (Quadro) RTX GPU models have more VRAM than GeForce GPUs and can be used for high-performance AI tasks, but they usually cost more. NVIDIA A100 and H100 are even more expensive than Quadro RTX GPUs. These GPUs are way over your budget, so it would be better to get a GeForce RTX 4090 or wait for the RTX 5090 to generate video locally.

river sparrow Jan 12, 2025, 11:27 AM

#

covert lake weights.gg uses RVC for inference (use models) on pre-recorded audios and Traini...

Not real time i just want to do tts, not Audio to audio

#

also why isn't there an open source equivalent of elevenlabs? eleven labs has been around for more than a year and we still have the shitty turtle tts skullsob

ancient swan Jan 12, 2025, 12:09 PM

#

vast lantern what is the best ai for video. is there any that do it locally through some kind...

i did mochi through comfyui. pretty good but takes a long time to process

covert lake Jan 12, 2025, 12:32 PM

#

river sparrow Not real time i just want to do tts, not Audio to audio

U could have specified that since Generator is vague

covert lake Jan 12, 2025, 12:32 PM

#

river sparrow also why isnt there an open source of elevenlabs, eleven labs is around for more...

Turtle tts? Tortoise is old ASF and already has better forks

#

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: better than RVC for TTS. It's few-shot (models need just a lil training), but it can't use RVC models (and vice versa) and it's limited to English, Chinese & Japanese. If you wanna check it out, read https://docs.ai-hub.wtf/tts/gpt-sovits/

ElevenLabs (freemium): an easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it and it's mostly premium, but it's an easy way to get good quality TTS

FishSpeech: a zero-shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site

With RVC Models:

RVC is natively Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS: they use Microsoft Edge TTS to generate the audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

You can get Applio in our docs
Ilaria RVC Mainline is here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

Ilaria RVC Zero (running on an A100 GPU, fastest free RVC on cloud) and the guide
Applio UI Colab (with the Google Colab T4 GPU, free with a daily limit)
If you don't wanna use Edge TTS, you could try another TTS AI from our TTS index and use its output as the input in RVC

cloud thistle Jan 12, 2025, 12:51 PM

#

who can help me with a problem in RVC

solar torrent Jan 12, 2025, 12:51 PM

#

cloud thistle who can help me with a problem in RVC

Can you elaborate in detail?

cloud thistle Jan 12, 2025, 12:52 PM

#

I say the program does not output sound

solar torrent Jan 12, 2025, 12:55 PM

#

cloud thistle I say the program does not output sound

If you have any problem using W-Okada, go to #🔍│help-w-okada. I don't think RVC would do "output sound" like that.

elder cobalt Jan 12, 2025, 1:00 PM

#

hey there, i'm trying to build a text-to-song generator, but i'm confused about where to start. should i focus on implementing research papers like musicgen, or should i use apis from services like suno or elevenlabs? one problem i've noticed with udio and suno is that they don't offer official apis, and the unofficial ones aren't very reliable. also, please keep in mind that my pc's specs are quite poor and it doesn't even have a gpu. it would be really helpful if anyone could offer some suggestions or guidance.

cloud thistle Jan 12, 2025, 1:03 PM

#

who can help me with a problem in RVC

twilit peak Jan 12, 2025, 1:16 PM

#

Is index rate in weights 0?

#

It says when uploading a model that the index file is optional

#

Or does it use a different index rate when the index file is given?

chilly lake Jan 12, 2025, 1:46 PM

#

twilit peak Is index rate in weights 0?

index file is optional and missing index is the same as setting index rate to 0

#

also you can use any index with any model for extra fun
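
Why a missing index behaves like index rate 0 is easier to see with a small sketch. As far as I understand it, the index rate is a linear mix between the features from your input audio and the features retrieved from the index. The function below is a simplified illustration with plain lists, not RVC's actual code, and `blend_features` is a made-up name.

```python
def blend_features(original, retrieved, index_rate):
    """Mix retrieved features into the original ones.

    index_rate = 0 keeps the original features untouched,
    index_rate = 1 uses only the features retrieved from the index.
    """
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    if retrieved is None:
        # No index file: there is nothing to mix in,
        # which gives the same result as index_rate = 0.
        return list(original)
    return [
        index_rate * r + (1.0 - index_rate) * o
        for o, r in zip(original, retrieved)
    ]


feats = [1.0, 2.0]
print(blend_features(feats, None, 0.75))        # no index uploaded
print(blend_features(feats, [3.0, 4.0], 0.0))   # index present, rate 0
print(blend_features(feats, [3.0, 4.0], 0.5))   # halfway mix
```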

twilit peak Jan 12, 2025, 1:46 PM

#

Yeah but what if I upload my index file

chilly lake Jan 12, 2025, 1:47 PM

#

go ahead

twilit peak Jan 12, 2025, 1:47 PM

#

There's no index rate setting

chilly lake Jan 12, 2025, 1:47 PM

#

probably default 0.75

lyric elk Jan 12, 2025, 2:05 PM

#

Hi

covert lake Jan 12, 2025, 2:30 PM

#

cloud thistle who can help me with a problem in RVC

!howtoask

pine acornBOT Jan 12, 2025, 2:30 PM

#

covert lake !howtoask

How To Troubleshoot

__**GIVE CONTEXT.**__ 📝

Don't simply mention your issue, like "my rvc is not working".
Describe the step you are on, what you're trying to do, the RVC you're using, a screenshot, etc.
The more context, the better.

__**BE POLITE.**__ :matsuripray:

Don't be desperate. You can ping a Helper, but if they ignore, they aren't available/don't know the answer.
It's okay if you're frustrated, but don't take it into this server.
Don't DM without prior consent.

__**BE PRODUCTIVE.**__ 🤝

Don't ask for every little instruction. Put your own effort & test things by yourself.
Don't ask to ask.
Check if your answer is a Google search away/on our guides website.

ancient swan Jan 12, 2025, 4:37 PM

#

@covert lake https://www.youtube.com/watch?v=_qQwSVzYNpA have you tried veo 2?

YouTube

Two Minute Papers

DeepMind’s Veo2 AI - The New King Is Here!

❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/papers

Try Veo2 here (Notes: likely USA only so far and there may be a waitlist):
https://deepmind.google/technologies/veo/veo-2/

📝 My paper on simulations that look almost like reality is available for free here:
https://rdcu.be/cWPfD

Or this is the orig. Natur...

covert lake Jan 12, 2025, 4:57 PM

#

ancient swan @covert lake https://www.youtube.com/watch?v=_qQwSVzYNpA have you tried...

Nope I didn't

#

Oh veo 2

#

I was looking at smt to run locally tbh

late wolf Jan 12, 2025, 5:16 PM

#

is there any way someone can copy someone's voice for me?

#

for ai

pastel nymph Jan 12, 2025, 5:19 PM

#

Any Russian speakers here? Please help with a problem

covert lake Jan 12, 2025, 5:25 PM

#

late wolf is there any way someone can copy someone's voice for me?

#1159289738314919936 or #1191429836321849435

#

or do it urself

late wolf Jan 12, 2025, 5:26 PM

#

covert lake #1159289738314919936 or #1191429836321849435

can i pay someone to do it?

#

is that possible

covert lake Jan 12, 2025, 5:26 PM

#

late wolf can i pay someone to do it?

yes u can do a paid request or check one of the model masters shop

late wolf Jan 12, 2025, 5:27 PM

#

covert lake yes u can do a paid request or check one of the model masters shop

and how do i know who's trusted?

#

or are they all?

covert lake Jan 12, 2025, 5:27 PM

#

late wolf and how do i know who's trusted?

model masters, they went through an application to be able to take paid commissions

late wolf Jan 12, 2025, 5:28 PM

#

okay thanks

covert lake Jan 12, 2025, 5:28 PM

#

yw

worthy plume Jan 12, 2025, 5:31 PM

#

guys

#

is RTX 4070Ti enough to train models?

#

i mean small models

#

or fine tuning with my dataset?

covert lake Jan 12, 2025, 5:33 PM

#

worthy plume is RTX 4070Ti enough to train models?

yeah dw

river verge Jan 12, 2025, 5:52 PM

#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Walking the Wire (Drum model no. 568)

river verge Jan 12, 2025, 6:24 PM

#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Dancing on the Moon (Drum model no. 569)

twilit pond Jan 12, 2025, 7:03 PM

#

Taking free model requests, if you have a dataset ready already DM AIHC_Heart ( human vocals only no instruments)

stuck mason Jan 12, 2025, 7:45 PM

#

where i can get the models

ember current Jan 12, 2025, 8:02 PM

#

im trying to make my own model and it keeps saying this zip needs a .pth file

#

what is that and how do i get that
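
For context: a `.pth` file is the trained model weights that RVC training produces, so an upload page that asks for one wants a finished model, not a dataset. A quick way to see what a zip actually contains is sketched below. `check_model_zip` is a hypothetical helper written for this example and is not part of any RVC tool; the `.index` check reflects the optional index file mentioned earlier in the chat.

```python
import zipfile


def check_model_zip(path):
    """Report whether a zip holds a .pth weights file and an optional .index file."""
    with zipfile.ZipFile(path) as archive:
        names = [name.lower() for name in archive.namelist()]
    return {
        "has_pth": any(name.endswith(".pth") for name in names),
        "has_index": any(name.endswith(".index") for name in names),
    }
```

If `has_pth` comes back `False`, the zip is probably a dataset (audio files) rather than a trained model, and the model still has to be trained first.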

covert lake Jan 12, 2025, 8:04 PM

#

stuck mason where i can get the models

You can search rvc ai voice models at:

#1175430844685484042
In #🔍│find-models , Do /find with @hidden grotto
https://weights.gg/ (login required)
https://huggingface.co/models (but watch out, Hugging Face doesn't only host RVC AI voice models)
https://voice-models.com/
https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

#1159289738314919936
#1191429836321849435
make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/

hidden grottoBOT Jan 12, 2025, 8:04 PM

#

covert lake You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

:wave: @covert lake, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

covert lake Jan 12, 2025, 8:05 PM

#

ember current im trying to make my own model and it keeps saying this zip needs a .pth file

what's ur pc gpu

ember current Jan 12, 2025, 8:05 PM

#

1650

#

why?

covert lake Jan 12, 2025, 8:07 PM

#

ember current 1650

Your Nvidia GPU is good enough to do inference (use models) locally (on ur pc), but it's not the best for training (making models), even if that's still possible

You can:

Locally (runs on your pc so the speed depends on that, you will have to set it up with the guides):
- Applio: A fork of RVC with some extra features like Applio TTS, kinda faster and simpler but same quality tho
- Mainline: The original RVC
Cloud (remote good pc, easier and faster than ur PC but it's limited):
- Ilaria RVC Zero: fastest and simplest that you can get for free
- Weights.gg: Partnered with AI Hub, lets u do them easily but u may be in a queue
- Applio Colab: max 4 hours of GPU daily, not guaranteed

Easiest possible (automatically separates vocals & instrumentals) : weights.gg
easiest cloud: Ilaria rvc zero
easiest local: Applio

#

U could do it locally but small models, idk how much to suggest that

#

Train (make) RVC Models on cloud:

Prepare the Dataset
Setup RVC:
Choose a cloud way to use RVC,

Google Colabs (4 hours of daily gpu for free, not many hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only free 15 credits monthly):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest free way, use https://weights.gg, which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.gg: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio (ui)

#

I would suggest kaggle training

regal gate Jan 13, 2025, 1:15 AM

#

Does anyone here know how to make the lo fi text to speech that’s been used a lot in the gungame servers on counter strike source ?

polar flax Jan 13, 2025, 1:18 AM

#

regal gate Does anyone here know how to make the lo fi text to speech that’s been used a lot ...

just ur usual TTS with low pass filter as post processing
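
A low-pass filter as post-processing can be sketched in a few lines. This is a minimal one-pole filter in plain Python, just to show the idea; the cutoff value is a guess for a "lo fi" sound, and a real workflow would more likely use an effect plugin or an audio library.

```python
import math


def low_pass(samples, cutoff_hz, sample_rate):
    """One-pole low-pass filter: keeps content below cutoff_hz and
    progressively attenuates everything above it."""
    dt = 1.0 / sample_rate
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    alpha = dt / (rc + dt)
    filtered = []
    previous = 0.0
    for sample in samples:
        # Move a fraction of the way toward the new sample each step.
        previous = previous + alpha * (sample - previous)
        filtered.append(previous)
    return filtered


# Example: a harsh signal flipping between +1 and -1 every sample
# comes out much quieter after filtering at 1 kHz.
harsh = [1.0 if i % 2 == 0 else -1.0 for i in range(200)]
softened = low_pass(harsh, cutoff_hz=1000, sample_rate=16000)
print(max(abs(v) for v in softened[50:]))
```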

regal gate Jan 13, 2025, 1:18 AM

#

nooo like it’s a specific robotic kinda tts

#

very specific oneeee

polar flax Jan 13, 2025, 1:19 AM

#

it's all just post processing, mate

#

just google some effect vsts

regal gate Jan 13, 2025, 1:21 AM

#

but do u know which voice im talking about?

#

bc yes I know I can put effects on and make smth similar, but I’m looking for that specific tts

#

bc I’m almost sure that they just used a tts where u can choose that specific robotic voice

polar flax Jan 13, 2025, 1:23 AM

#

regal gate bc yes I know I can put effects on and make smth similar, but I’m looking for th...

try cloning it using zero shot tts like fishspeech 1.4, F5, etc

regal gate Jan 13, 2025, 1:26 AM

#

Thing is I cannot find it now

#

but i think it says like “welcome to gungame server blablablab”

#

and also “someone is on knife level”

#

that’s just what I remember

twilit pond Jan 13, 2025, 1:30 AM

#

crazy nobody actually dmed me

#

Taking free model requests, if you have a dataset ready already DM AIHC_Heart ( human vocals only no instruments)

#

( i think nobody who wants a free model has a dataset ready, lol )

violet marten Jan 13, 2025, 3:05 AM

#

how do i fix delay

twilit pond Jan 13, 2025, 3:11 AM

#

violet marten how do i fix delay

if you're talking about w-okada, you can't fix the delay. you can reduce chunk, which reduces the delay but also the voice quality
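
The chunk/delay tradeoff comes down to simple arithmetic: the converter has to wait for a full chunk of audio before it can process it. The sketch below assumes the chunk size is expressed in samples, which may not match how W-Okada labels its chunk setting, and it only gives a lower bound because model inference time is added on top.

```python
def chunk_latency_ms(chunk_samples, sample_rate_hz):
    """Time needed just to fill one chunk with audio, in milliseconds."""
    if chunk_samples <= 0 or sample_rate_hz <= 0:
        raise ValueError("chunk_samples and sample_rate_hz must be positive")
    return 1000.0 * chunk_samples / sample_rate_hz


# Smaller chunks mean less waiting, but the model gets less context per chunk.
for chunk in (24000, 12000, 4800):
    print(chunk, "samples ->", chunk_latency_ms(chunk, 48000), "ms")
```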

elder cobalt Jan 13, 2025, 3:38 AM

#

hey there, i'm trying to build a text-to-song generator, but i'm confused about where to start. should i focus on implementing research papers like musicgen, or should i use apis from services like suno or elevenlabs? one problem i've noticed with udio and suno is that they don't offer official apis, and the unofficial ones aren't very reliable. also, please keep in mind that my pc's specs are quite poor and it doesn't even have a gpu. it would be really helpful if anyone could offer some suggestions or guidance.

frozen egret Jan 13, 2025, 8:21 AM

#

guys what's the best voice separator to use?

twilit pond Jan 13, 2025, 8:50 AM

#

frozen egret guys what's the best voice separator to use?

i recommend ultimate vocal remover

#

https://github.com/Anjok07/ultimatevocalremovergui

GitHub

GitHub - Anjok07/ultimatevocalremovergui: GUI for a Vocal Remover t...

GUI for a Vocal Remover that uses Deep Neural Networks. - Anjok07/ultimatevocalremovergui

frozen egret Jan 13, 2025, 9:41 AM

#

twilit pond i recommend ultimate vocal remover

does it use a lot of gpu?

covert lake Jan 13, 2025, 10:09 AM

#

violet marten how do i fix delay

tell me which tutorial you followed (link) and what pc gpu u have in #🔍│help-w-okada

covert lake Jan 13, 2025, 10:09 AM

#

frozen egret does it use a lot of gpu?

what's ur pc gpu

covert lake Jan 13, 2025, 10:10 AM

#

regal gate Thing is I cannot find it now

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: better than RVC for TTS. It's few-shot (models need just a lil training), but it can't use RVC models (and vice versa) and it's limited to English, Chinese & Japanese. If you wanna check it out, read https://docs.ai-hub.wtf/tts/gpt-sovits/

ElevenLabs (freemium): an easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it and it's mostly premium, but it's an easy way to get good quality TTS

FishSpeech: a zero-shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site

With RVC Models:

RVC is natively Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS: they use Microsoft Edge TTS to generate the audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

You can get Applio in our docs
Ilaria RVC Mainline is here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

Ilaria RVC Zero (running on an A100 GPU, fastest free RVC on cloud) and the guide
Applio UI Colab (with the Google Colab T4 GPU, free with a daily limit)
If you don't wanna use Edge TTS, you could try another TTS AI from our TTS index and use its output as the input in RVC

#

the most robotic one would be google translator ig

frozen egret Jan 13, 2025, 10:10 AM

#

covert lake what's ur pc gpu

i use a laptop..

covert lake Jan 13, 2025, 10:10 AM

#

u could also try adding effects, not sure

solar torrent Jan 13, 2025, 10:10 AM

#

frozen egret does it use a lot of gpu?

UVR5, RVC and AI tasks benefit from GPU for their faster processing speed and such.

covert lake Jan 13, 2025, 10:10 AM

#

frozen egret i use a laptop..

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

#

hopefully it has gpu 0 and gpu 1

#

meaning it doesn't have only integrated gpu but also dedicated
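
If you'd rather check from a script than from Task Manager, something like this works as a rough test for a dedicated NVIDIA GPU. It relies on the `nvidia-smi` tool that ships with NVIDIA drivers; `list_nvidia_gpus` is a helper name made up for this example, and an empty result only means no NVIDIA GPU was detected this way.

```python
import shutil
import subprocess


def list_nvidia_gpus():
    """Return NVIDIA GPU names reported by nvidia-smi,
    or an empty list if the tool is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            timeout=10,
            check=True,
        )
    except (subprocess.SubprocessError, OSError):
        return []
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


gpus = list_nvidia_gpus()
print(gpus if gpus else "No NVIDIA GPU detected (integrated graphics only?)")
```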

frozen egret Jan 13, 2025, 10:11 AM

#

covert lake You can check your pc gpu via: ctrl+shift+esc (task manager) -> Performance tab ...

HD Graphics 520

solar torrent Jan 13, 2025, 10:11 AM

#

frozen egret HD Graphics 520

Only that?

covert lake Jan 13, 2025, 10:11 AM

#

frozen egret HD Graphics 520

nvm

#

integrated gpu

#

old laptop boohooh

frozen egret Jan 13, 2025, 10:12 AM

#

yea dude it's like a 9 years old computer

covert lake Jan 13, 2025, 10:12 AM

#

ur laptop could run it on cpu locally, but it would take many many hours if it even runs man

frozen egret Jan 13, 2025, 10:12 AM

#

8-9 years old

covert lake Jan 13, 2025, 10:12 AM

#

upgrade 🙏

#

The only thing you can do is either upgrade or use cloud (remote good pc)

frozen egret Jan 13, 2025, 10:12 AM

#

covert lake upgrade 🙏

i'm 14 you think i can easily ask my dad to change my gpu 😭

solar torrent Jan 13, 2025, 10:12 AM

#

It would be better to buy a laptop made within the last four years.

solar torrent Jan 13, 2025, 10:13 AM

#

frozen egret i'm 14 you think i can easily ask my dad to change my gpu 😭

What the fuck even is that age?

frozen egret Jan 13, 2025, 10:13 AM

#

solar torrent What the fuck even is that age?

The fuck are you talking about

#

Hell, the fuck is your PFP?

#

Screw it, the fu-

covert lake Jan 13, 2025, 10:14 AM

#

frozen egret i'm 14 you think i can easily ask my dad to change my gpu 😭

alright got your point

frozen egret Jan 13, 2025, 10:14 AM

#

solar torrent What the fuck even is that age?

wdym by that tho you tryna groom me or smth🤨

solar torrent Jan 13, 2025, 10:14 AM

#

Unlike desktop, not all laptops are upgradable.

covert lake Jan 13, 2025, 10:14 AM

#

You're just looking to separate 2 voices in a song right?

frozen egret Jan 13, 2025, 10:14 AM

#

covert lake You're just looking to separate 2 voices in a song right?

Multi-voices

solar torrent Jan 13, 2025, 10:14 AM

#

frozen egret wdym by that tho you tryna groom me or smth🤨

I don't groom a child. I like women better.

covert lake Jan 13, 2025, 10:14 AM

#

solar torrent What the fuck even is that age?

alr no need to say that man

solar torrent Jan 13, 2025, 10:14 AM

#

covert lake Jan 13, 2025, 10:14 AM

#

No need for arguing about this

#

he's just 14, I'm just 16 lol

frozen egret Jan 13, 2025, 10:15 AM

#

solar torrent

yea eat them cookies cookieman

#

anyways

frozen egret Jan 13, 2025, 10:15 AM

#

covert lake You're just looking to separate 2 voices in a song right?

I wanna separate voices like in RnB

#

rnb songs have a lot of voices overlapping so

solar torrent Jan 13, 2025, 10:16 AM

#

You wanna separate the harmonies / chorus vocals in an audio, right?

#

There are some UVR5 models that can do that, but I don't remember the name. goofy

covert lake Jan 13, 2025, 10:18 AM

#

frozen egret I wanna separate voices like in RnB

You could try using Cloud UVR https://docs.ai-hub.wtf/rvc/resources/dataset-isolation/#colab and using the 6_HP-Karaoke model

Dataset & Isolation

Last update: Dec 24, 2024

#

basically, separate the vocals and instrumentals first, then use the vocals as input to that model and you should be able to separate the voices

solar torrent Jan 13, 2025, 10:20 AM

#

https://cdn.discordapp.com/attachments/1159289354439626772/1314945169979609158/image.png

stark scarab Jan 13, 2025, 12:47 PM

#

Mel Karaoke could work too

stark scarab Jan 13, 2025, 12:51 PM

#

solar torrent https://cdn.discordapp.com/attachments/1159289354439626772/1314945169979609158/i...

This should be pinned

polar flax Jan 13, 2025, 12:52 PM

#

stark scarab This should be pinned

#🔥│model-maker-chat message

stark scarab Jan 13, 2025, 12:53 PM

#

polar flax https://discord.com/channels/1159260121998827560/1159290096458149938/13137778402...

There is an updated one

#

#

Found it

#

I'll replace it in a bit

polar flax Jan 13, 2025, 1:01 PM

#

stark scarab

and also this https://docs.google.com/spreadsheets/d/1pPEJpu4tZjTkjPh_F5YjtIyHq8v0SxLnBydfUBUNlbI/edit?usp=sharing

Google Docs

Bleedless vs Fullness | by Bas Curtiz

stark scarab Jan 13, 2025, 1:01 PM

#

Okke

twin garden Jan 13, 2025, 6:12 PM

#

I am looking for good software that lets me accurately transcribe video files to text. Any good/free suggestions?

Thank you!

regal drum Jan 13, 2025, 6:25 PM

#

Is there any good free AI video generator

covert lake Jan 13, 2025, 6:31 PM

#

regal drum Is there any good free AI video generator

text/image to video AIs:

Locally (runs on ur pc):
- pyramid flow (Image/Text to Video)
- cogvideox 1.5 5b: Image to Video, Text to Video
Cloud (remote good pc, running on an online website for example, easier to setup):
- Weights.gg (paid only)
- pyramid flow (Image/Text to Video) (HuggingFace Space)
- OpenAI Sora (paid only, in some countries)
- lumalabs
- Hailuo AI

twin garden Jan 13, 2025, 6:33 PM

#

covert lake text/image to video AIs: - Locally (runs on ur pc): - [pyramid flow (Image/Te...

Do you know of any good free video to text transcribers?

covert lake Jan 13, 2025, 6:34 PM

#

twin garden Do you know of any good free video to text transcribers that are free?

like, subtitles?

polar flax Jan 13, 2025, 6:34 PM

#

twin garden Do you know of any good free video to text transcribers that are free?

whisper

#

though extracting the voices might give better results

covert lake Jan 13, 2025, 6:36 PM

#

https://huggingface.co/spaces/Nick088/Fast-Subtitle-Maker could help too, it uses whisper, however it's made for subtitles rather than normal txt output

Fast Subtitle Maker - a Hugging Face Space by Nick088

jade estuary Jan 13, 2025, 6:38 PM

#

can you do this on a macbook

covert lake Jan 13, 2025, 6:40 PM

#

jade estuary can you do this on a macbook

do what

twin garden Jan 13, 2025, 6:48 PM

#

polar flax though extracting the voices might give better results

what do you mean by "extracting voice" would work better?

twin garden Jan 13, 2025, 6:48 PM

#

covert lake like, subtitles?

Nope. I need it to transcribe audio from video files to text

jade estuary Jan 13, 2025, 6:56 PM

#

covert lake do what

i know my windows pc is super bad and my mac has a m2 chip

#

i want to try make a rvc model of myself so i can do text to speech but its hard

eager cipher Jan 13, 2025, 7:04 PM

#

what program can run LLM models that has text to ai

ancient swan Jan 13, 2025, 7:11 PM

#

eager cipher what program can run LLM models that has text to ai

what

#

what do you mean by text to ai?

eager cipher Jan 13, 2025, 7:12 PM

#

ancient swan what do you mean by text to ai?

ai voice. i want to take a model from #1175430844685484042 and type out a prompt

#

and have it speak whatever i type

onyx bluff Jan 13, 2025, 7:17 PM

#

Not cool man.

ancient swan Jan 13, 2025, 7:17 PM

#

eager cipher ai voice. i want to take a model from <#1175430844685484042> and type out a pro...

it's voice to voice not text to speech

covert lake Jan 13, 2025, 7:18 PM

#

onyx bluff

wait tf did the tos change

onyx bluff Jan 13, 2025, 7:18 PM

#

covert lake wait tf did the tos change

Nah

covert lake Jan 13, 2025, 7:18 PM

#

you used to get a strike before rather than just a warning

onyx bluff Jan 13, 2025, 7:18 PM

#

it was.

covert lake Jan 13, 2025, 7:18 PM

#

it isn't a strike

eager cipher Jan 13, 2025, 7:18 PM

#

ancient swan it's voice to voice not text to speech

mk, do you know one then?

onyx bluff Jan 13, 2025, 7:19 PM

#

I used polish singers yet.

ancient swan Jan 13, 2025, 7:19 PM

#

eager cipher mk, do you know one then?

elevenlabs but it's paid

onyx bluff Jan 13, 2025, 7:19 PM

#

I never did western songs.

eager cipher Jan 13, 2025, 7:19 PM

#

ancient swan elevenlabs but it's paid

one thats locally run

onyx bluff Jan 13, 2025, 7:19 PM

#

What the heck happen?

ancient swan Jan 13, 2025, 7:19 PM

#

eager cipher one thats locally run

fishspeech maybe, but i haven't used it

covert lake Jan 13, 2025, 7:19 PM

#

twin garden Nope. I need it to transcribe audio from video files to text

you could check out whisper, maybe u can run it locally https://github.com/jhj0517/Whisper-WebUI?tab=readme-ov-file

GitHub

GitHub - jhj0517/Whisper-WebUI: A Web UI for easy subtitle using wh...

A Web UI for easy subtitle using whisper model. Contribute to jhj0517/Whisper-WebUI development by creating an account on GitHub.

#

u can prob find a lot of web uis of whisper
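
For plain text output rather than subtitles, a minimal local sketch with the open-source `openai-whisper` Python package could look like the one below. It assumes the package and ffmpeg are installed. The helper names `segments_to_text` and `transcribe_to_txt` and the `base` model size are illustrative choices, not something from the chat.

```python
from pathlib import Path


def segments_to_text(segments):
    """Join Whisper-style segments (dicts with a "text" key) into plain text, one line each."""
    lines = (seg["text"].strip() for seg in segments)
    return "\n".join(line for line in lines if line)


def transcribe_to_txt(media_path, model_name="base"):
    """Transcribe one media file and write the transcript next to it as a .txt file."""
    # Imported lazily so the helper above works without the package installed.
    import whisper  # pip install openai-whisper (ffmpeg must be on PATH)

    model = whisper.load_model(model_name)  # "base" is an assumed default; larger models are slower
    result = model.transcribe(str(media_path))
    out_path = Path(media_path).with_suffix(".txt")
    out_path.write_text(segments_to_text(result["segments"]), encoding="utf-8")
    return out_path


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(transcribe_to_txt(sys.argv[1]))
```

Whisper decodes its input through ffmpeg, so common video containers can usually be passed in directly. If a file is rejected, extracting the audio track first is the fallback.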

covert lake Jan 13, 2025, 7:21 PM

#

jade estuary i know my windows pc is super bad and my mac has a m2 chip

could you tell me your windows pc gpu just to check?

#

because mac isn't the best for rvc training either

#

I don't think I ever saw someone successfully train a model on mac since the speed is similar to CPU

onyx bluff Jan 13, 2025, 7:22 PM

#

Did anyone have UMG strike on youtube?

edgy bloomBOT Jan 13, 2025, 7:22 PM

#

Congratulations CrimsonZockt (by eMuzyka)!

Your Charmander is now level 2!

onyx bluff Jan 13, 2025, 7:22 PM

#

edgy bloom

Yes charmander!

covert lake Jan 13, 2025, 7:22 PM

#

jade estuary i want to try make a rvc model of myself so i can do text to speech but its hard

btw RVC is Speech To Speech natively

#

even if it can be used for TTS

covert lake Jan 13, 2025, 7:22 PM

#

onyx bluff Did anyone have UMG strike on youtube?

yes basically everyone

#

https://www.youtube.com/watch?v=mkF_uCX-KYU

YouTube

Nick088

AI Covers can TERMINATE your CHANNEL

AI Covers published on youtube without permission nor a license, can get you strikes and get your account terminated. Do it at your own risk
#aicover #aicoversongs #aicovers #ai #artificialintelligence

▶ Play video

#

I got 2 strikes

#

which is why I deleted all my covers

#

also, no privating/unlisting won't work

raven cargo Jan 13, 2025, 7:23 PM

#

Hello, can you send me the current colab link?

#

-colab

rare sorrelBOT Jan 13, 2025, 7:23 PM

#

raven cargo -colab

📒 Google Colab Notebooks

Applio, by IA Hispano Google Colab
RVC Disconnected, by Kit Lemonfoot Google Colab
RVC Mainline, by Hina Google Colab
AICoverGen-WebUI, by Hina Google Colab
AICoverGen-NoWebUI [English], by Ardha, fixed by Eddy, Hina and Gdr Google Colab
AICoverGen-NoWebUI [Spanish], by Eddy, Hina and Gdr Google Colab
UVR5 NO UI, by Eddy Google Colab
UVR5 UI, by Eddy Google Colab
Modified W-Okada's Voice Changer, Google Colab
FaceFusion UI, by Nick088 Google Colab
FaceFusion NO UI, by Nick088 Google Colab
EasyGUI, by Rejekts Google Colab

ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

covert lake Jan 13, 2025, 7:24 PM

#

raven cargo Hello, can you send me the current colab link?

current colab link of what

raven cargo Jan 13, 2025, 7:24 PM

#

covert lake current colab link of what

RVC Disconnected V2

covert lake Jan 13, 2025, 7:24 PM

#

what u tryna do? and could u tell me ur pc gpu first to see if it's powerful enough?

raven cargo Jan 13, 2025, 7:24 PM

#

covert lake what u tryna do? and could u tell me ur pc gpu first to see if it's powerful eno...

Intel i7

covert lake Jan 13, 2025, 7:24 PM

#

raven cargo Intel i7

that's a CPU

raven cargo Jan 13, 2025, 7:25 PM

#

yeah

covert lake Jan 13, 2025, 7:25 PM

#

google colab is a cloud computing service only for people with a bad pc btw, I'm asking your GPU since some people use colab because they don't even know their GPU

covert lake Jan 13, 2025, 7:25 PM

#

raven cargo yeah

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

#

CPU = Central Processing Unit
GPU = Graphics Processing Unit
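
The Task Manager check above can also be done from a script. This is a rough sketch that only covers NVIDIA cards and assumes the `nvidia-smi` tool that ships with the NVIDIA driver is on PATH. The function names are made up for illustration.

```python
import shutil
import subprocess


def parse_gpu_lines(csv_text):
    """Parse 'name, memory' CSV lines from nvidia-smi into (name, memory) tuples."""
    gpus = []
    for line in csv_text.splitlines():
        if not line.strip():
            continue
        # Split on the last comma so GPU names stay intact.
        name, _, memory = line.rpartition(",")
        gpus.append((name.strip(), memory.strip()))
    return gpus


def list_nvidia_gpus():
    """Return [(name, total_memory)], or [] if nvidia-smi is missing or fails."""
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        out = subprocess.run(
            ["nvidia-smi", "--query-gpu=name,memory.total", "--format=csv,noheader"],
            capture_output=True, text=True, check=True,
        ).stdout
    except (subprocess.CalledProcessError, OSError):
        return []
    return parse_gpu_lines(out)


if __name__ == "__main__":
    gpus = list_nvidia_gpus()
    if gpus:
        for name, memory in gpus:
            print(f"{name} ({memory})")
    else:
        print("No NVIDIA GPU detected (or nvidia-smi is not installed)")
```

An empty result means no NVIDIA GPU was detected. AMD or Intel graphics would need a different check.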

raven cargo Jan 13, 2025, 7:26 PM

#

covert lake CPU = Central Processing Unit GPU = Graphics Processing Unit

#

covert lake Jan 13, 2025, 7:27 PM

#

raven cargo

nvm it's a bad pc

#

Train (make) RVC Models on cloud:

Prepare the Dataset
Set up RVC:
Choose a cloud way to use RVC:

Google Colabs (4 hours of daily GPU for free, not many hours, but easy to use):
- Applio (ui)
- Mainline (UI)
- RVCDISCONNECTED (no ui)
Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus):
- Mainline (UI)
- Applio by Vidal (UI, no guide as of right now)
- Applio by Shirou (UI, no guide as of right now)
Lightning.ai (Kinda hard, needs login, no issue with web uis or anything, but only 15 free credits monthly):
- Applio (UI)
- Mainline (UI, No guide as of right now)

Be sure to know about the tensorboard

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more gpu time
If you are looking for the easiest free way, use https://weights.gg, which ofc uses RVC

RVC Inference (use models) on pre-recorded audio on Cloud

You can use either:

Weights.gg: Easiest Possible Ever Automatic
Ilaria RVC Zero: Fastest free on cloud
Applio (ui)

#

I also linked RVC Disconnected ^

onyx bluff Jan 13, 2025, 7:28 PM

#

covert lake https://www.youtube.com/watch?v=mkF_uCX-KYU

And this is why i never did sanah AI Covers!

raven cargo Jan 13, 2025, 7:30 PM

#

covert lake nvm it's a bad pc

It's like I wanted my computer to be bad. Anyway

onyx bluff Jan 13, 2025, 7:30 PM

#

I loved to being sobbed.

covert lake Jan 13, 2025, 7:30 PM

#

onyx bluff And this is why i never did sanah AI Covers!

u can't do much about it

covert lake Jan 13, 2025, 7:31 PM

#

raven cargo It's like I wanted my computer to be bad. Anyway

I had to check it, there were some people with a 2k pc literally using colab 😭

#

people need to always check their gpu first before using cloud to see if it's powerful enough

#

btw u could also upgrade to a good desktop pc if u want

onyx bluff Jan 13, 2025, 7:32 PM

#

covert lake u can't do much about it

What? She Joined UMG?

kindred kelp Jan 13, 2025, 7:35 PM

#

Hi ppl

#

Hi crimson

river sparrow Jan 13, 2025, 7:37 PM

#

covert lake There are different Text To Speech (TTS) AIs: GPT So Vits: RVC isn't as good a...

Great thank you! I will use gpt sovits as the model sounds much better than rvc

kindred kelp Jan 13, 2025, 7:37 PM

#

You guys code?

river sparrow Jan 13, 2025, 7:37 PM

#

I do a little bit😅

kindred kelp Jan 13, 2025, 7:37 PM

#

Same. JavaScript

river sparrow Jan 13, 2025, 7:38 PM

#

Yes javascript

covert lake Jan 13, 2025, 7:40 PM

#

onyx bluff What? She Joined UMG?

idk her

onyx bluff Jan 13, 2025, 7:51 PM

#

covert lake idk her

She was born in Poland

#

I remember.

split blade Jan 13, 2025, 8:01 PM

#

Hello, I'm trying to create a model file, but when I run the RVC v2 Disconnected Google Colab that I usually use, the training ends without the epochs running. Even though I set the epochs to 1000, the training stops after only 35 seconds. I thought the Colab might be blocked, so I tried using a Colab from a YouTube video, but the training also ends without running any epochs in that Colab as well. What should I do in this case?
The Colabs I used are RVC v2 Disconnected and Bahaa AI. Both of them end without running any epochs.

dry karma Jan 13, 2025, 8:23 PM

#

Hola1

regal gate Jan 13, 2025, 8:52 PM

#

Hi, does anyone here know some of the early stage tts used in like around 2008

#

simple free ones

wide moth Jan 13, 2025, 9:24 PM

#

yo

#

what site should i use for online covers

plain cove Jan 13, 2025, 9:26 PM

#

https://www.weights.gg/

Weights

Weights | Create with AI for Free

Create with our AI tools for free. Generate AI voice covers, text-to-speech, and more. Join our community of creators sharing RVC and AI voice models.

covert lake Jan 13, 2025, 9:32 PM

#

wide moth what site should i use for online covers

did u try checking ur pc gpu first if it's good enough

twin garden Jan 13, 2025, 10:48 PM

#

covert lake u can prob find a lot of web uis of whisper

that let you upload videos and turn it into text?

covert lake Jan 13, 2025, 11:24 PM

#

twin garden that let you upload videos and turn it into text?

I think u have to convert the video to an audio

river verge Jan 13, 2025, 11:35 PM

#

covert lake https://www.youtube.com/watch?v=mkF_uCX-KYU

Heck, and I rarely even publish AI covers!

covert lake Jan 13, 2025, 11:44 PM

#

river verge Heck, and I rarely even publish AI covers!

I almost lost my channel

river verge Jan 13, 2025, 11:51 PM

#

covert lake I almost lost my channel

yeah, got 1 strike active, 2 total

#

i feel ya

#

I haven't gotten the orange card... yet

twin garden Jan 13, 2025, 11:58 PM

#

covert lake I think u have to convert the video to an audio

Any good software to do that?

covert lake Jan 13, 2025, 11:59 PM

#

river verge yeah, got 1 strike active, 2 total

Real

#

I got 2 in 4 hours

covert lake Jan 14, 2025, 12:00 AM

#

twin garden Any good software to do that?

U can just google "MP4 to FLAC" or "mp4 to MP3" and find hundreds of sites that convert file types
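
An offline alternative to converter sites is ffmpeg. This is a small sketch assuming the `ffmpeg` binary is installed and on PATH. The function names and the FLAC default are illustrative, not something from the chat.

```python
import subprocess
from pathlib import Path


def build_extract_command(video_path, audio_ext=".flac"):
    """Build an ffmpeg command that drops the video stream and writes audio only."""
    src = Path(video_path)
    dst = src.with_suffix(audio_ext)
    # -y overwrites an existing output file, -vn disables video;
    # ffmpeg picks the audio encoder from the output extension.
    return ["ffmpeg", "-y", "-i", str(src), "-vn", str(dst)]


def extract_audio(video_path, audio_ext=".flac"):
    """Run ffmpeg and return the path of the extracted audio file."""
    cmd = build_extract_command(video_path, audio_ext)
    subprocess.run(cmd, check=True)  # raises CalledProcessError if ffmpeg fails
    return Path(cmd[-1])


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(extract_audio(sys.argv[1]))
```

Passing `".mp3"` as `audio_ext` gives an MP3 instead, provided the local ffmpeg build includes an MP3 encoder.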

plain cove Jan 14, 2025, 12:01 AM

#

@covert lake It will sound stupid, but will it change the situation if the Voice is from the video game? :3
I think your answer will be no 💀

covert lake Jan 14, 2025, 12:02 AM

#

plain cove @covert lake It will sound stupid, but will it change the situation if ...

Huh what situation

plain cove Jan 14, 2025, 12:02 AM

#

covert lake Huh what situation

ai covers on yt

plain cove Jan 14, 2025, 12:04 AM

#

covert lake https://www.youtube.com/watch?v=mkF_uCX-KYU

!

river verge Jan 14, 2025, 12:10 AM

#

thing is, my first strike expired a long time ago

covert lake Jan 14, 2025, 12:16 AM

#

plain cove ai covers on yt

I'm not sure

#

What mostly is the problem is the instrumentals

plain cove Jan 14, 2025, 12:35 AM

#

If the melody only resembles the original And the lyrics and the melody are not 100% similar to the original? 😮

bronze widget Jan 14, 2025, 12:48 AM

#

is there a program version instead of a website for nvidia-b2332?

rose salmon Jan 14, 2025, 1:42 AM

#

Hello good evening. I have a question. What program do you recommend using to play voices? I currently use voice ai but it's not very good.

polar flax Jan 14, 2025, 1:42 AM

#

twin garden what do you mean by "extracting voice" would work better?

nvm you dont have to do it if not sure

polar flax Jan 14, 2025, 1:50 AM

#

plain cove @covert lake It will sound stupid, but will it change the situation if ...

you still have to buy mechanical license of the music itself depending on its copyright holder, same thing goes for remixes

plain cove Jan 14, 2025, 1:51 AM

#

polar flax you still have to buy mechanical license of the music itself depending on its co...

https://tenor.com/view/no-no-no-no-ludwig-van-beethoven-beethoven-music-gif-26228263

Tenor

#

https://tenor.com/view/mozart-wink-wolf-gang-gif-10499251

Tenor

#

:3

royal citrus Jan 14, 2025, 3:12 AM

#

guys can anyone help me how to download voice changer client and use it

eager coral Jan 14, 2025, 3:37 AM

#

hi everyone

#

anyone know how to make consistent face while generating image?

#

i tried using seed number but its still random

chilly lake Jan 14, 2025, 3:43 AM

#

eager coral anyone know how to make consistent face while generating image?

using an image prompt

eager coral Jan 14, 2025, 3:44 AM

#

nails i think thats everyone do

chilly lake Jan 14, 2025, 3:44 AM

#

image prompt adapter i mean

#

https://stable-diffusion-art.com/ip-adapter/

eager coral Jan 14, 2025, 3:46 AM

#

ouch such a silly question how i use this? i usually using leonardo ai skullsob

ripe spade Jan 14, 2025, 6:33 AM

#

Question: is there anyway to make dubbing from one language to another with ai? like, english audio + voice model = audio in another language.

#

Going to bed rn, good night y'all

odd knot Jan 14, 2025, 6:34 AM

#

ripe spade Question: is there anyway to make dubbing from one language to another with ai? ...

hmmm

polar flax Jan 14, 2025, 7:11 AM

#

ripe spade Going to bed rn, good night y'all

tight canyon Jan 14, 2025, 7:14 AM

#

.

covert lake Jan 14, 2025, 11:24 AM

#

#1159290752195633273

jagged pecan Jan 14, 2025, 2:23 PM

#

is there any good website to train AI voice model

covert lake Jan 14, 2025, 2:36 PM

#

jagged pecan is there any good website to train AI voice model

what's ur pc gpu first

#

You can check your pc gpu via:
ctrl+shift+esc (task manager) -> Performance tab -> GPU

solar torrent Jan 14, 2025, 2:43 PM

#

What is undressing AI? You mean like some AI image generator tool that makes a character undressing itself?

plain cove Jan 14, 2025, 4:25 PM

#

https://tenor.com/view/furry-cute-flushed-white-gif-25357908

Tenor

vapid vale Jan 14, 2025, 4:42 PM

#

kittystare

chilly lake Jan 14, 2025, 4:57 PM

#

solar torrent What is undressing AI? You mean like some AI image generator tool that makes a c...

bunch of school kids are getting arrested every so often for 'nudifying' their classmates, very naughty

polar flax Jan 14, 2025, 4:59 PM

#

solar torrent What is undressing AI? You mean like some AI image generator tool that makes a c...

better ignore those perverts

river verge Jan 14, 2025, 5:51 PM

#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
That Is How I Roll! (Drum model no. 570)

ionic scroll Jan 14, 2025, 5:56 PM

#

Huh

ripe spade Jan 14, 2025, 6:19 PM

#

ripe spade Question: is there anyway to make dubbing from one language to another with ai? ...

just woke up, got back for some answers, got disappointed

#

I thought about gpt-sovits, but it's basically just for english and chinese

elder willow Jan 14, 2025, 6:22 PM

#

ım coming

#

Aaa

#

aahhh

#

aaaaaAAAaa

chilly lake Jan 14, 2025, 6:23 PM

#

ripe spade just woke up, got back for some answers, got disappointed

elevenlabs has dubbing, merlin clone as well

spring warren Jan 14, 2025, 6:52 PM

#

any accurate RVCs to make cover songs?

glass junco Jan 14, 2025, 7:15 PM

#

Oh snap https://www.facebook.com/reel/1535719697145052?fs=e&mibextid=0NULKw&fs=e&s=TIeQ9V

228K views · 5.2K reactions | Looks like Beans has been cookin up w...

Looks like Beans has been cookin up with some AI software and appears to have a new project coming 👀 Would yall want to hear a new beans album with the AI software to bring his old voice...

river verge Jan 14, 2025, 7:30 PM

#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Dancing in the Flames on the Moon (Hybrid) (Drum model no. 571)

ripe spade Jan 14, 2025, 8:51 PM

#

chilly lake elevenlabs has dubbing, merlin clone as well

I was looking for something local and that had support for rvc models, but thanks anyways

chilly lake Jan 14, 2025, 8:54 PM

#

I dont think there's anything close for local. The products I mentioned are big $$$

inner root Jan 14, 2025, 10:05 PM

#

hey

elder willow Jan 14, 2025, 10:31 PM

#

https://www.youtube.com/watch?v=A_kLk-bEKSA
is it possible to do it with other programs like coqui and whisper? if so, how?

YouTube

Adam Lucek

Speak Any Language With AI - Realtime Speech-to-Speech Translation ...

In this video we dive into real time speech to speech translation, speaking in one language, and having your own voice speak in a different language!

Resources -
Github Code: https://github.com/ALucek/speech2speech-translation
AssemblyAI Documentation: https://www.assemblyai.com/docs/guides/real-time-streaming-transcription
Elevenlabs Document...

▶ Play video

ionic pumice Jan 14, 2025, 11:18 PM

#

solar torrent What is undressing AI? You mean like some AI image generator tool that makes a c...

thats crazy

edgy bloomBOT Jan 14, 2025, 11:18 PM

#

Congratulations maheswari :33!

Your Samurott is now level 52!

ionic pumice Jan 14, 2025, 11:18 PM

#

wtf

ionic pumice Jan 14, 2025, 11:18 PM

#

edgy bloom

yay

icy steeple Jan 14, 2025, 11:25 PM

#

Is there a tool to batch convert multiple files with a RVC Model of choice??

stable wedge Jan 14, 2025, 11:30 PM

#

Dosnt there exists a Astorian Voice Mod/Filter?

covert lake Jan 14, 2025, 11:34 PM

#

stable wedge Dosnt there exists a Astorian Voice Mod/Filter?

You can search rvc ai voice models at:

#1175430844685484042
In #🔍│find-models, do /find with @hidden grotto
https://weights.gg/ (login required)
https://huggingface.co/models (but watch out cus in hugging face there arent only rvc ai voice models)
https://voice-models.com/
https://thevoicemodels.com/ (for Turkish Models, login required with discord and level 2 on their server)

if there isnt one, you can:

#1159289738314919936
#1191429836321849435
make it yourself with our docs guides https://docs.ai-hub.wtf/essentials/how-to-make-voice-models/

hidden grottoBOT Jan 14, 2025, 11:34 PM

#

covert lake You can search rvc ai voice models at: - <#1175430844685484042> - In <#11635920...

:wave: @covert lake, How can I help?

Available Commands:
• @weights find <query> or /find <query> - Search for RVC Voice Models
• /create - Create an AI Cover
• /image - Generate an Image

ancient dawn Jan 15, 2025, 12:23 AM

#

ai is artificial

#

mustarddd

#

lfg