#🧬│ai-chat

rare sorrelBOT
covert lake
#

1st link is the deiteris fork
2nd is the normal w-okada

solar torrent
#

That's a pretty decent setup your PC has.

covert lake
#

Ofc you will have to follow the Nvidia version as you got an RTX

covert lake
#

-# ofc u can get even better with a desktop

solar torrent
#

You can use Demucs or UVR5 to separate stems from an audio file. aismug

solar torrent
polar flax
solar torrent
polar flax
solar torrent
polar flax
#

idk the old UVR was like that

solar torrent
#

I have huge loads of music files that I've always wanted to separate in batch. Is there a good Google Colab notebook I can use for this, both for free and with compute units?

solar torrent
polar flax
covert lake
normal stream
#

Yeah

#

I installed it from some yt video. But it didnt show anything

#

It said to use the start.http

#

Or smth. But that didnt do anything

forest crypt
#

idk

solar torrent
normal stream
#

Yeah probably

#

😂😭

covert lake
#

Youtube videos are outdated about this topic

covert lake
solar torrent
#

-realtime

rare sorrelBOT
normal stream
covert lake
#

yw

wispy wind
#

yo

acoustic estuary
#

E

tired jasper
#

people were talking about BigVGAN and HiFi-GAN a few days ago

#

didn't test yet, but seems to be better than HiFi-GAN and more usable than BigVGAN

valid night
#

W

tired jasper
harsh rampart
#

Y

tired jasper
#

is there an AI similar to Apollo but better?

#

Apollo is introducing noise in the audio instead of cleaning it

#

Demucs is good at clearing audio noise, but it works by separating; i'm focusing on transforming audio into something else, like restoration work

elder willow
#

to see if it can test with it or whatsoever

valid night
#

also is there any way to train models here?

tired jasper
tired jasper
tired jasper
tired jasper
#

i'm downloading musdb, i will try to heavily compress the dataset and fine-tune on it to see if it can restore very degraded audio

tired jasper
#

found this model, BABE2, seems to be newer than Demucs

#

last change was 8 months ago, compared to demucs 1 year ago

#

very interesting results

#

MIT license compared to JusperLee Apollo CCNC

#

it's a diffusion model, seems to be what i'm looking for, capable of transforming the input into something else

#

so it acts as a restorer, not only a splitter

#

i'm interested in rebuilding Adobe Podcast as an open-source tool

#

I noticed that Adobe Podcast not only removes noise, it also tries to reconstruct damaged parts

#

it generates human-like noise when you input audio that contains only non-human noise

valid night
#

by unwa

tired jasper
#

roformer is demucs, no?

#

when i said demucs i meant the original project

#

by facebook

#

i guess all models from UVR are derived from demucs or just demucs finetuned

river verge
#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Break Up! (Drum model no. 549)

odd monolith
#

How to make a voice model for AI Cover

vague grail
#

hi, I`m new here... (already configured local setup so currently looking for ai models)

tired jasper
#

roformer = 🐢

astral matrix
#

guys, is the TITAN pretrain a fine-tuned pretrain or trained from scratch?

#

because if it's trained from scratch, it sounds very good for 11 hours

astral matrix
somber kindle
#

is there a realtime voice changer yet or do i have to come back in a few years

brazen moth
#

hi

vestal temple
#

Hi

tired jasper
#

You can make it work a bit well for characters if you can voice act a bit

#

but if you try any voice tone it will be wild

#

beatrice quality is pretty good for normal voices like celebrity or artist clones, but the model doesn't learn effected voices well

#

Also, many TTS models, e.g. F5 can clone pretty well

#

without any previous training

night lake
# somber kindle is there a realtime voice changer yet or do i have to come back in a few years
tired jasper
#

Also RVC can work in realtime with a little delay if you have a 3060 or lower

umbral heath
#

hey!

tired jasper
#

i need an audio transformer/diffusion that can convert audio-to-audio

#

like inpaint or image-to-image in Stable Diffusion

#

so, i could train with LQ and GT dataset

#

to be honest, should be a GAN, like SR GANs for image upscaling

tired jasper
#

i will try audioSR now

#

is HiFi-GAN just a vocoder? i don't understand its purpose yet

#

no VRAM to run audioSR

#

would try fine-tuning Apollo if the dataset were easy to prepare

high shale
#

i need some helppp

night lake
#

Take a read

high shale
#

in a call or something

high shale
#

okayyy ill try

night lake
#

Simple, explains everything

covert lake
high shale
#

i tried and im still very confused

#

i genuinely need help

elder willow
#

where can I get voices in Brazilian Portuguese?

solar torrent
solar torrent
elder willow
#

ty

versed fiber
#

yo

jaunty hearth
#

hello every one

viscid arch
#

every time i refresh i get set back in the queue on weights

paper viper
#

does anyone have a text humanizer that's free? all the "free" text humanizers out there are js scams

toxic spindle
#

are there ai music programs that can remaster songs and remix them? because I like to do this and have fun with it.

solar torrent
#

Racist people keep talking about the Fumo Project server to me again. I ain't even at that server either. So I told them to go FUCK themselves, this is AI Hub by Weights, not Fumo Project. lfg

#

These people are too blind looking for "Leave Server" option, sometimes. I've had enough of it.

queen flame
#

halo

solar torrent
#

Hi. doggowave

elder willow
#

hi weights by namari

spiral breach
#

Hi 👋

rain remnant
#

helllloo

forest crypt
#

Anyway, good night

#

I just realised that this is a chat that is not off-topic.

solar torrent
leaden flame
solar torrent
ionic pumice
#

if you don't know 15.ai while it was online then idk what to tell you my guy

solar torrent
#

Never heard of this site before. I've only heard about the Uberduck. shrug

ionic pumice
#

it was peak ai tts

solar torrent
#

Yeah, I've never heard of this site. imdead

valid night
#

weird how the chat is so dead even though this server has almost 500k members

valid night
solar torrent
valid night
#

my friend got hacked from it in the summer of 2021 lmao

smoky star
#

mmvc keeps closing the second it opens

covert lake
solar torrent
covert lake
solar torrent
#

He seems to respond too fast this time, I didn't even have my time to respond. nails

alpine granite
#

hawk tuah

solar torrent
#

This is AI Hub by Weights, not Scam Hub.

#

This is not even CryptoHub, boy. Your Talk Tuah lady just took down the whole value of $hawk. trol

#

Just so you know. I already have a small amount of coins in one of my wallets. I'm not the type to invest in something big. aismug

elder willow
#

It doesn’t?

analog cosmos
solar torrent
elder willow
#

lol

covert lake
tired jasper
#

audioSR is weird. The first load uses 12.3 GB of VRAM, but if you run a second job while the runtime is still up, it drops to 6 GB of VRAM

elder willow
#

it's because it may be heavy on resources ig

ionic pumice
#

if that's how it even works

tired jasper
#

not really heavy on resources, it's a 6 GB model

#

so we'd expect it to require approx. 6 GB of VRAM

#

i guess it can be optimized to require less on first loading

#

inference doesn't take much more than 6 GB, but loading the model requires double that

#

not an issue for Windows users

tidal shell
#

Give me the most realistic girl voice with settings

analog cosmos
elder willow
#

yes

night lake
# tidal shell Give me the most realistic girl voice with settings

In absolute darkness, carefully arrange four candles around your digital altar. Position them with precision, their flames casting ominous shadows across your PC screen aglow with w-okada's eldritch interface. Begin the arcane dance. Move with ritualistic intent, your steps a cryptic choreography. Whisper the name "realistic girl voice" not merely speak it, but breathe it into existence. Let your movements be both precise and erratic, dancing on the razor's edge between calculation and chaos. The candles tremble. The PC hums with sentient anticipation. Will the RVC gods deem you worthy, or cast you into the realm of failed voice model invocations? Proceed, bold technomancer. Not all rituals end as planned.

covert lake
#

./just_joking.sh

night lake
#

🤷

tepid basin
#

Wtf is that word that didn't get translated

covert lake
analog cosmos
rugged hawk
#

selem

tulip coral
#

yo

#

how do i use the rvc model i have

#

like how do i install rvc gui

covert lake
#

don't follow yt tuts

#

what's ur pc gpu and what are you looking to do

tulip coral
vagrant otter
#

Obligatory sorry if stupid question. If I have an isolated vocal that's way off the voice I want to convert onto it, should I edit the isolated vocal first to make it closer to the intended end voice? For example change the key in bandcamp first. Or is it better to just edit after.

tulip coral
#

already have lyrics written, already have the beat, already have (partial) uzi verse recorded

covert lake
tulip coral
#

its snippet only

#

so im using AI to make the song

covert lake
covert lake
#

you'd have to sing the lyrics urself, then use the ai over the audio where you recorded yourself singing

covert lake
#

that's true actually

#

yea u'd have to sing it

tulip coral
#

im rapping the song out, and then im gonna run it through the carti rvc

#

but i have ZERO idea how to use the rvc 😭

covert lake
#

You can choose to follow either one of those 2 guides

tulip coral
#

like ive just been using weights.gg for the uzi rvc but since this is a custom carti model that i had made for me i obviously cant use weights

tulip coral
covert lake
tulip coral
covert lake
#

you're welcome

tulip coral
covert lake
tulip coral
covert lake
#

in here u can find linux and windows versions

tulip coral
#

i downloaded appliov3.2.7.zip cuz it says last updated 21 days ago making it the most recent

covert lake
daring thunder
#

where can i find models with .jsons?

tulip coral
#

@covert lake yo bro its wraps

#

i cant send a ss but idek what file to open

chilly lake
tulip coral
chilly lake
#

then you messed up the previous step

#

or did not wait long enough for unzip to finish

stark tangle
#

Guys, how can I make a voice MODEL online?

#

Weights keeps messing it up

covert lake
covert lake
tulip coral
covert lake
tulip coral
#

i opened it and it just opened an empty command thing

#

oh wtf

#

it just opened to a website

covert lake
covert lake
tulip coral
#

alright @covert lake can i dm you a ss of the custom rvc i have so you can explain to me how to import it into the site

covert lake
#

It runs locally on ur PC tho

covert lake
stark tangle
covert lake
#

You can train RVC models in the cloud (a good remote PC):

  1. Prepare the Dataset
  2. Set up RVC:
    Choose a cloud way to use RVC,
  • Google Colab (4 hours of daily GPU for free, not many hours, but easy to use):
  • Kaggle (a bit harder to use and needs a phone number, but gives 30 hours weekly of better GPUs):
  • Lightning.ai (kinda hard, needs login, no issues with web UIs or anything, but only 15 free credits monthly):

Google Colab = Easier but risk of getting disconnected
Kaggle = Harder but way more GPU time

  3. Be sure to know about TensorBoard

If you are looking for the easiest free way, use https://weights.gg, which ofc uses RVC
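For a rough sense of how the free quotas above compare, here is a minimal sketch. It assumes the figures quoted in this message (4 hours a day on Colab, 30 hours a week on Kaggle) and that the full Colab quota is actually available every day, which may not hold in practice. The function name is just for illustration.

```python
# Free GPU quotas as quoted in the message above (assumptions, not official limits).
COLAB_HOURS_PER_DAY = 4
KAGGLE_HOURS_PER_WEEK = 30

def weekly_hours(hours_per_day: float, days: int = 7) -> float:
    """Upper bound on weekly hours if the daily quota is fully used every day."""
    return hours_per_day * days

colab_weekly = weekly_hours(COLAB_HOURS_PER_DAY)
print(f"Colab: up to {colab_weekly} h/week, Kaggle: {KAGGLE_HOURS_PER_WEEK} h/week")
```

Even in the best case the two weekly totals are close, so the practical difference is more about disconnects and GPU quality than raw hours.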

#

However I warn you it's harder on mobile

stark tangle
covert lake
#

Because the UI isn't very mobile-friendly

covert lake
#

I'd suggest u use kaggle applio

stark tangle
stark tangle
covert lake
#

Be also sure u got a cleaned dataset

stark tangle
#

Oh, applio is for downloading
Oh

elder sparrow
#

hiii

covert lake
#

It's on cloud

#

It downloads on a remote good pc

night lake
covert lake
tepid basin
#

@covert lake can we get a guide for voice changer

covert lake
#

Isn't there already the retry

tepid basin
#

I need girl voice

covert lake
#

Absolutely banned

tepid basin
#

Sadge

stoic obsidian
#

4

tired jasper
#

omG, still fighting to train F5

#

loss is decreasing, but slowly

#

caged in "almost state"

solar torrent
thorny drum
#

how do i create my own custom voice to use in deiteris RVC?

rare sorrelBOT
elder willow
#

read this guide @thorny drum

elder willow
polar flax
vagrant otter
#

If I have an isolated vocal that's way off the voice I want to convert onto it, should I edit the isolated vocal first to make it closer to the intended end voice? For example change the key in bandcamp first. Or is it better to just edit after.

thorny drum
#

in the tutorial, in the section for "uploading voice models", i can see "Only RVC models will work. If you have a gpt-sovits one or any other, they will not work"

#

i have a 5 minute recording in wav format of the voice i want to clone. would that work?

polar flax
elder willow
thorny drum
elder willow
#

if i'm not wrong, tl;dr means that

small snow
#

hi

elder willow
solar torrent
#

Some UVR5 and Demucs models can be used to extract background noises from an audio file. catblush

worthy coyote
#

have you guys ever heard of 'enormous'?

#

i have not...

#

maybe i read that somewhere but i just dont remember

stable portal
#

Hey, i need to ask you guys, i'm looking for a GPU for ai (mainly for w-okada vc). What's gonna be better? An RTX 4080 Super or an old used tesla series card? If tesla, which used tesla is gonna be better than an rtx 4080 super?

#

I can't afford a new Tesla, I'd have to sell a kidney

#

so i asked about used tesla only

polar flax
worthy coyote
#

June 26, 2024 11:00 PM

covert lake
#

Don't directly suggest cloud

covert lake
covert lake
#

Giving him cloud is useless as it's time-limited and he will have to deal with that and risk being disconnected

polar flax
wind harness
#

hi

covert lake
#

Good enough PC

#

While colab gives merely 4 hours a day which could be even less

polar flax
covert lake
green otter
#

hi

covert lake
#

hi

neon herald
#

What if your brand had its own AI voice or character?

solar torrent
#

An NVIDIA Tesla GPU isn't really worth it for your budget. This kind of GPU is meant for workstations/servers and sells for way more than GeForce RTX ones. Unless you're rich enough to buy a powerful workstation PC for AI. skullfacedistorted

#

The GeForce RTX GPU is a bit more affordable than NVIDIA Tesla, and it's faster at certain tasks like outputting video to your monitor, video processing and AI. If you pick the Tesla one, you can only do AI and that's about it.

polar flax
solar torrent
barren mauve
#

scam

covert lake
ionic pumice
#

@covert lake

covert lake
solar torrent
#

This link has your Weights invite code attached to it. goofy

vague locust
#

is there a better way? idk i just clicked share and copy link

solar torrent
vague locust
#

oooh cool thx my bad. i'll go there then

magic pendant
#

.

worthy coyote
#

good beforenoon

ionic pumice
#

wha

elder willow
#

@solar torrent Does the “weights” in ur displayed name refer to AI model weights?

solar torrent
elder willow
#

Oh lol
I overthought then

velvet drift
#

what is the best ai for script writing? can someone pls let me know

#

because i seriously don't think that, even with a good prompt, what chatgpt gives back is the limit of how good script writing can get, since i am trying to replicate a script from a youtuber

#

i heard mistral 7b is really good, has anyone had experience with it

wild radish
#

HI

hazy storm
#

Does someone know an AI TTS voice maker?

covert lake
# hazy storm Does someone know an AI TTS voice maker?

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: RVC isn't as good as GPT-SoVITS for TTS, but GPT-SoVITS (few-shot TTS, which means models need just a lil training) can't use RVC models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check out GPT-SoVITS instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium 11labs: An easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's an easy, mostly premium way to get good-quality TTS

FishSpeech: FishSpeech is a zero-shot (no explicit training needed) TTS. If you got a good pc you can use it locally, else use their site

With RVC Models:

RVC is natively for speech-to-speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS: they use Microsoft Edge TTS to generate the TTS audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use.

If you wanna do TTS locally with RVC Voice Models (if you got a good pc):

  • You can get Applio in our docs
  • While Ilaria RVC Mainline here (no guide as of right now)

If you don't have a good pc you can do TTS with RVC Voice Models on cloud:

  • Ilaria RVC Zero (running on an A100 GPU, fastest free RVC on cloud) and the guide

  • Use Applio UI Colab (with the Google Colab T4 free daily-limit GPU)

  • if you don't wanna use Edge TTS, you could try another TTS AI from our tts index and use the output as an input in RVC

hazy storm
hazy storm
#

ok

covert lake
#

I explained every single way to you

calm vapor
#

read bro

covert lake
#

Don't just read the 10 first words 😭

#

people gotta READ

covert lake
elder willow
covert lake
covert lake
#

it's not that hard to read it all

elder willow
#

i'm a goofball but not lazy enough to read a text lol

covert lake
#

idk much about that but FaceFusion has that

calm vapor
#

@elder willow we dont support kits.ai models anymore, dont post them

hazy storm
covert lake
calm vapor
#

u can run everything on the cloud

covert lake
hazy storm
#

ok

night lake
covert lake
#

but if I update the google docs one day, i will share it till the docs get updated too

elder willow
#

you're welcome

tight dragon
#

best girl voice? any1 know?

woven tendon
#

.

rugged junco
#

hello

bleak idol
#

hi

elder willow
#

holy shi

#

but yea the rtx 4090 thing is false, i just use it for example

thorny drum
gusty hedge
#

alo people

solar torrent
#

Joining, but not brave enough to talk.

quartz roost
odd knot
#

is the pitch handdrawn?

quartz roost
worthy coyote
#

heavy sound teto

spiral junco
#

have there been new nightshade antidotes? any other software to clean images?

wraith gyro
#

is there a realistic voice of any girl? In GTA 5 or cs go, it is very clear that I have AI, will you help me?

iron spire
#

.

wraith gyro
wraith gyro
clever sandal
#

🙌

solar torrent
wraith gyro
stable portal
#

i'm thinking about replacing my current gpu, an rx6950xt; i'd need lobotomy surgery to get ai working on an amd gpu

#

yup, i want a voice changer while gaming

#

And i have 2 options

#

buy a new gpu for gaming and ai (rtx 4080 super or rtx 4090)

#

or buy a used tesla series card for the second pci-e slot and not touch my amd gpu

elder topaz
polar flax
stable portal
#

The only thing i know is that tesla is for ai

stable portal
#

and go for rtx40xx series or old ai gpu

#

i have money, but not too much

polar flax
stable portal
stable portal
polar flax
solar torrent
#

I've responded to you about this particular GPU. Have you read all of that? No. Let's be real.

thorny drum
#

i am stuck with a blank page when i click the File Url: link that pops up on my tensorboard after i input my ngrok token. The imjoy elfinder doesn't load anything on my page

coral ridge
#

Hello everyone

solar torrent
#

OpenAI Sora got released? I thought it was gonna be another vaporware product. Baffled

austere pond
#

Hello everyone

covert lake
worthy coyote
#

does anyone here speak every single language?

analog cosmos
#

I doubt someone can speak 7k+ different languages trolley

covert lake
worthy coyote
#

language is consciousness

#

i mean personality

#

if you speak 2+ languages then you have multiple personalities

solar torrent
#

Bro thinks he's the tower of Babel. trolley

digital plover
#

Guys how can I make these SUPER realistic ai videos

#

Like these ones on instagram or TikTok

#

Ronaldo doing some weird shit or something

worthy coyote
#

ask chatgpt

digital plover
#

solar torrent
#

OpenAI Sora is now released to the public.

wet vessel
#

Hello

worthy coyote
#

Kanye

solar torrent
digital plover
#

I don’t think it’s good enough tho

covert lake
digital plover
solar torrent
#

Generated videos from the generated images on Weights. drayke

digital plover
covert lake
solar torrent
#

I'm not sure what was up with the older generated videos that were featured on the Weights front page. Those videos were glitchy and distorted, not looking as good as the ones I made. imdead

digital plover
#

Like this is DIABOLICALLY GOOD

#

It’s like perfect, how can I even do this???

covert lake
#

i didn't test them all out so idk

#

u can try playing with them tho

austere pond
covert lake
#

imagine knowing every single language

edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 41!

covert lake
#

@atomic vector speak to me using 20 different languages, every word has to be a different language

atomic vectorBOT
# covert lake @atomic vector speak to me using 20 different languages, every word has t...

hello (english) konnichiwa (japanese) hola (spanish) bonjour (french) ciao (italian) halo (indonesian) hei (norwegian) namaste (hindi) xin chào (vietnamese) olá (portuguese) zdravstvuyte (russian) merhaba (turkish) heihei (swedish) annyeonghaseyo (korean) sawatdee (thai) salam (arabic) shalom (hebrew) jambo (swahili) ni hao (mandarin chinese) kia ora (maori)

(note: i tried to use a greeting word from each language, but please keep in mind that some words might not be exact greetings, but rather a way to acknowledge someone)
-# AI-generated responses may be inaccurate; please verify important information.

austere pond
covert lake
atomic vectorBOT
# covert lake make me an actual complete long sentence with all the different languages you kn...

konnichiwa nick088, hola amigo, bonjour mon ami, ciao amico, hei hvordan går det, namaste kaise ho, xin chào bạn, olá tudo bem, zdravstvuyte kak dela, merhaba nasılsın, heihei hur mår du, annyeonghaseyo jal jinaeseyo, sawatdee sabai sabai, salam alaikum, shalom ma shlomkha, jambo habari, ni hao nǐ hǎo, kia ora kei te pai, i'm applio, your friendly virtual assistant here to help you with any questions you might have across many languages

covert lake
covert lake
atomic vectorBOT
austere pond
solar torrent
#

I guess team Weights has changed its video generator sometime before I got premium. nails

austere pond
glad nebula
atomic vectorBOT
atomic vectorBOT
# glad nebula Your name now is appliowo

heyy whats up i'm appliowo now lolz so whats on ur mind wanna chat bout sumthin

covert lake
covert lake
ionic pumice
#

thats crazy

lusty stump
#

Hi guys

worthy coyote
#

true

jolly dagger
#

hi

drowsy minnow
#

hello i want to ask

#

where i can install voice changer ?

elder willow
little edge
#

can someone help me with an AI model for Neon Hitch? pls i can't do that properly skullsob

elder willow
# drowsy minnow where i can install voice changer ?

welp since idk your gpu, you can use colab or kaggle. choose these options:

Google Colabs (4 hours of daily gpu for free, not much hours, but easy to use):

Kaggles (a bit harder to use and needs phone number but gives 30 hours weekly of better gpus, recommended to use P100):

#

tho next time please ask in help-w-okada channel

spring warren
#

how do i fix the ai from breaking when singing high notes
when i do ai covers

hoary void
ionic pumice
#

dataset/vocoder issue :p

knotty surge
mild bramble
#

How fast do ur epochs go?

#

Mine is 1 every 13 seconds (8 minute dataset)

forest crypt
#

@atomic vector you fought hard to survive, but I guess it's time to go :c

atomic vectorBOT
forest crypt
autumn turret
#

does anyone know of an AI i could use where I can give it hours students are available and how many hours they want each week, as well as a list of other constraints, to create a weekly schedule for them? I've tried ChatGPT, Gemini, Copilot and Claude and none of them have gotten it exactly right so far

autumn turret
# night lake What is thy prompt

This is a bunch of schedules for students who work in our lab. Along the left side are hours between 8 am and 5 pm, which is what we consider our lab's hours that it is open, in 15 minute increments. Along the top are the days of the week. In each schedule, the students have highlighted blocks of time that they are available to work in our lab, which are shaded in green. Blocks of time that are shaded white are times that they are not available to work in the lab, and blocks of time that are shaded yellow are times that we could have them work in the lab, but they are not preferred by the student. Based on their availabilities and the stipulations in the following paragraph, please construct a weekly schedule for the students, and give the amount of hours that you scheduled each student for so that I may double check that you did it correctly.

Here are the amount of hours each student needs to have (they should have exactly the amount that they asked for, not more or less): [Names and hours redacted for privacy]

Stipulations:

No shift should be less than 1.5 hours long. Most of the time, the shifts shouldn't be longer than 3 hours long but some exceptions are OK.
In general, the students' schedules should overlap as little as possible, and if they do, it should mostly be when one student is going to leave soon and another student is coming in. It is OK if [Redacted] and another student's time has to overlap.
The hours between 8:30 -- 11 am and 2-5 pm are more important than 11 am -- 2 pm, so priority should be given to these times. Each day must have someone starting work at 8:30 am and someone working until 5 pm.
Each shift should start on a new hour or a half hour (i.e. 2 pm or 2:30 pm, respectively).
Do not give me a script. Give me an actual schedule.

night lake
#

What the

autumn turret
night lake
cosmic scroll
#

.

autumn turret
night lake
#

Also maybe don't tell it colors and such
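Since the prompt asks for hours to double-check, the hard rules in it can also be verified mechanically once a schedule comes back. A minimal sketch, assuming shifts are written as (student, day, start, end) with 24-hour 'HH:MM' times. The function names, the tuple layout and the default day list are hypothetical, and the softer preferences (overlap, priority hours, the 3-hour guideline) are not checked:

```python
from collections import defaultdict

def hm(text):
    """Convert 'HH:MM' to minutes since midnight."""
    hours, minutes = text.split(":")
    return int(hours) * 60 + int(minutes)

def check_schedule(shifts, requested_hours, days=("Mon", "Tue", "Wed", "Thu", "Fri")):
    """Return a list of violations of the hard rules from the prompt.

    shifts: list of (student, day, start, end) tuples (assumed layout).
    requested_hours: dict mapping student -> exact weekly hours requested.
    days: days the lab is open (assumed to be Monday to Friday).
    """
    problems = []
    worked = defaultdict(int)   # minutes scheduled per student
    starts = defaultdict(set)   # day -> set of shift start times
    ends = defaultdict(set)     # day -> set of shift end times
    for student, day, start, end in shifts:
        s, e = hm(start), hm(end)
        if e - s < 90:
            problems.append(f"{student} {day}: shift shorter than 1.5 hours")
        if s % 30 != 0:
            problems.append(f"{student} {day}: shift does not start on the hour or half hour")
        if s < hm("08:00") or e > hm("17:00"):
            problems.append(f"{student} {day}: shift outside lab hours")
        worked[student] += e - s
        starts[day].add(s)
        ends[day].add(e)
    for student, hours in requested_hours.items():
        if worked[student] != hours * 60:
            problems.append(f"{student}: scheduled {worked[student] / 60} h, requested {hours} h")
    for day in days:
        if hm("08:30") not in starts[day]:
            problems.append(f"{day}: nobody starts at 8:30 am")
        if hm("17:00") not in ends[day]:
            problems.append(f"{day}: nobody works until 5 pm")
    return problems

good = [("A", "Mon", "08:30", "11:00"), ("B", "Mon", "14:00", "17:00")]
bad = [("A", "Mon", "09:15", "10:00")]
print(check_schedule(good, {"A": 2.5, "B": 3}, days=("Mon",)))
print(check_schedule(bad, {"A": 1}, days=("Mon",)))
```

An empty list means none of the checked rules were broken. Anything returned can be pasted back to the chatbot as corrections.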

atomic vectorBOT
#
Applio Bot

Applio is a VITS-based voice conversion tool focused on simplicity, quality and performance. If you need to know more about it, you can check the Website or GitHub.
Commands:

  • /search
  • /searchuser
  • /chat
    And More...
misty locust
#

o

unborn sage
#

Hello

broken ice
#

can someone help me with this?

#

im trying to convert an audio already recorded to another audio with an ai model already loaded

tepid basin
#

@atomic vector ily

atomic vectorBOT
tepid basin
#

-audio dataset

rare sorrelBOT
tepid basin
#

You'll wanna start with thinking about your dataset and gathering audio. The guides above should be able to help with that

#

Cleaning with pro tools like RX is optional

torpid sundial
#

yo

#

does anyone here know how to make em models to be realistic and sh like that i rlly wanna catfish i tried to do a voice girl i failed miserably i cld pay well

river verge
#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Happy (Drum model no. 550)

solar torrent
toxic spindle
#

why is weights thinking my model is n whatever.

#

help

#

nothing is bad its official art from the creators

tepid basin
#

thats weird

toxic spindle
#

ik

tepid basin
#

you should see if anyone can help you in the weights discord

solar torrent
toxic spindle
#

this photo was marked as nsfw

#

even tho its not

solar torrent
#

Maybe its NSFW detection system either flagged the generated image's prompt label as inappropriate or just messed up when looking at the image. skullfacedistorted

toxic spindle
#

contacted support

gray rover
#

lfg Totally not sus website

tepid basin
edgy bloomBOT
#
Congratulations UnitedShoes (by Weights)!

Your Ivysaur is now level 22!

worthy coyote
#

i urge you guys to buy razer synapse

#

it's the best mouse ever

chilly lake
#

I had razer mouse, it had complete garbage trash tier shitty microswitches that started double triggering in less than 6 months. Never ever again.

inland acorn
#

how do i create ai voice singing?

tepid basin
#

My Naga Trinity has been through absolute hell

#

But I can't recommend razer because they're overpriced and Synapse's features suck

chilly lake
#

that too

junior girder
#

Yo the chatGPT limit is annoying is there a different website without a limit that's free?

sonic steeple
#

What are the functions?

solar torrent
fresh marten
#

j

ionic fractal
#

The freak

wraith solstice
#

Hey guys, which chatbot is more interesting to chat with? Any recommendations?

atomic vectorBOT
#

hi, how can i help you today

#

yes, I can speak French. I'm here to help and answer your questions, whether they are in French or in another language. how can I help you today?

kindred kelp
#

I'm developing an MMORPG that uses Gemini to generate 2D sprites for characters, equipment, NPCs, and world stuff. You can even design your own spells anyone interested in this game? It will be open source

solar torrent
solar torrent
#

I don't have a server PC. I only have an old laptop that's currently used to type messages to you. voidblep2

covert lake
kindred kelp
#

Thanks nick

covert lake
#

Yw

elder willow
solar torrent
#

My laptop is too slow for this, holy shit. So I left this damn contest anyway. drayke

#

I was just talking about myself. Is there anything we can help?

#

I wish I could buy a newer PC, but I seem to be cursed to never win anything. Congrats to those with faster keyboards, not me. nails

worthy coyote
#

yours sincerely

solar torrent
#

I'd just make me to have a fake ass Nitro instead. dog

glad tulip
#

Hey guys, so I'm gonna get AI running on my own PC for the first time, and I was wondering: what AI model would you recommend? I am studying c#, so I want something that's good for that, but also general stuff

covert lake
forest quarry
#

c++ also is used

#

u thunk

#

i think

vagrant tundra
quartz nest
#

f

analog cosmos
hallow plover
#

could anyone teach me how to download the rt vc? I'm really confused by some Japanese GitHub page

glad nebula
covert lake
#

What's your PC GPU?

hallow plover
#

how do I check that

covert lake
hallow plover
#

AMD Ryzen 5 5600G

#

is that it?

short cobalt
#

-rt

rare sorrelBOT
short cobalt
#

Idk why i hear double or triple voices coming out of the voice afterwards

covert lake
#

not a GPU

halcyon venture
#

can anyone help me to download w-okada?

chilly lake
#

and CPU has half the cache

covert lake
covert lake
#

Btw while we are talking about CPUs

How's an i7 14700F

snow sedge
merry hamlet
#

hi

craggy berry
#

lmao... @stark scarab do what you know lol

stark scarab
#

lmao

craggy berry
stark scarab
formal hornet
#

Does anyone here have a means to keep a Google Colab model training page from timing out/doing the captchas?

chilly lake
#

there was a script for UI version to click stuff

formal hornet
#

I think the captcha pops up regardless of the mouse activity...

humble cape
#

hi

rustic marsh
#

does anyone here use Voicemeeter? i need help

polar flax
rustic marsh
#

ill pay someone to help me fix this

#

i dont understand what im doing wrong with voicemeeter

rigid wraith
#

sa

chilly lake
rustic marsh
fiery fog
#

N

keen peak
#

idk why but the voice changer takes like 10+ seconds to actually talk from when i talked, even tho the chunk is at 384

boreal shell
#

Can anyone help me in coding? Because I'm trying to make communication between agents, if anyone can help please reply

solar torrent
#

I'm not sure what kind of code you're having trouble with. Maybe try asking ChatGPT or Gemini if they can help you with the coding.

dapper ginkgo
solar torrent
boreal shell
solar torrent
boreal shell
#

I'm using Python

elder willow
#

no

boreal shell
chilly lake
#

try to expand what you mean by 'agents'

river verge
#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
Dilemma (Drum model no. 551)

edgy bloomBOT
#
Congratulations RegalHyperus by Weights!

Your Chespin is now level 3!

boreal shell
low sierra
#

hello

warm silo
#

hi

boreal shell
boreal shell
spiral breach
#

Hi 👋

hollow tangle
#

?

wind linden
#

Who would have an appointment with Guizmo the French rapper???

mossy spade
#

.

tardy zodiac
#

How to train and run inference with AMD GPUs?

nimble iron
gray rover
#

@ noobies5663 knows more about it so, in any case, you could ask them some time

analog current
silent marten
#

where do yall suggest me to make an ai joji song

#

i have a beat for it

#

i js need the vocal ai thing

spring sable
#

Chat someone make an ai cover of apt with okarun and momo ayase from dandadan

spring sable
charred gazelle
arctic tinsel
little lodge
#

Mri psx

tender vessel
#

.

covert lake
covert lake
wary vapor
#

😄

covert lake
#

@fiery crystal

rustic marsh
#

Hey yall! Quick question. RVC uses more of my GPU than CPU, correct? I wanna make sure that my i7-11700F won't slow down my 4080S

solar torrent
rustic marsh
#

I want to get the ms latency down, and before I do this I just wanna know: will my CPU affect it, or since it detects my GPU, will it use that instead?

solar torrent
#

Some parts of RVC, W-Okada and any Python program will still use your CPU and main RAM, usually for the Python program itself. But the GPU-side work, like PyTorch tensor operations, runs on the GPU.

rustic marsh
#

I apologize for such a late reply. I was driving. Is this good enough?

polar flax
rustic marsh
#

What does index0 do?

polar flax
rustic marsh
#

Hmm?

polar flax
#

pls disregard if you don't know "index rate"

rustic marsh
#

Can I get help from someone who knows what index rate is?

dapper ginkgo
#

The index rate is used to reduce/resolve the timbre leakage problem. If the index rate is set to 1, theoretically there is no timbre leakage from the inference source and the timbre quality is more biased towards the training set.

rustic marsh
#

Awesome! (Yeah,I just searched Google too Bfoops

dapper ginkgo
#

If the training set has a lower sound quality than the inference source, then a higher index rate may reduce the sound quality. Turning it down to 0 means retrieval blending is not used at all, so it no longer protects the training set's timbre.

rustic marsh
#

Thank you ^^

dapper ginkgo
#

If the training set has good audio quality and a long duration, turn up total_epoch. The model itself is then less likely to fall back on the inference source and the pretrained base model, there is little "tone leakage", and index_rate matters less; you can even skip creating/sharing the index file.

covert lake
dapper ginkgo
covert lake
dapper ginkgo
#

yes

covert lake
#

Index rate: how much is the index being used

polar flax
dapper ginkgo
rustic marsh
covert lake
dapper ginkgo
#

dont blame me for that LOL

glad nebula
covert lake
#

I have seen people who have an rtx 4080 and still use colab

It's the helper that needs to guide them to understand more what to do

edgy bloomBOT
#
Congratulations Nick088 [ITA/ENG] by Weights!

Your Charizard is now level 50!

polar flax
dapper ginkgo
dapper ginkgo
#

chatGPT would give me this LOL

chilly lake
#

It was mostly right. Index is a specific speaker's pronunciations

#

index rate is a blend between 0 (original audio) and 1 (speaker)
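
That blend can be sketched as a plain linear interpolation between the features extracted from the input audio and the features retrieved from the index. This is only an illustration with made-up names (`blend_features`, plain Python lists instead of real feature tensors), not the voice changer's actual code.

```python
def blend_features(original, retrieved, index_rate):
    """Mix input-audio features with index-retrieved features.

    index_rate = 0.0 keeps the original features untouched,
    index_rate = 1.0 uses only the features retrieved from the index.
    """
    if not 0.0 <= index_rate <= 1.0:
        raise ValueError("index_rate must be between 0 and 1")
    if len(original) != len(retrieved):
        raise ValueError("feature vectors must have the same length")
    return [
        index_rate * r + (1.0 - index_rate) * o
        for o, r in zip(original, retrieved)
    ]


if __name__ == "__main__":
    source = [0.0, 2.0]   # hypothetical features from the mic audio
    speaker = [1.0, 4.0]  # hypothetical features looked up in the index
    for rate in (0.0, 0.5, 1.0):
        print(rate, blend_features(source, speaker, rate))
```

At 0 the index contributes nothing, which is why the lookup can be skipped entirely at that setting.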

polar flax
#

though the point was that the index rate setting >0 in original voice changer would eat more cpu usage

chilly lake
#

yeah, since it uses shitty faiss-cpu library without acceleration (my guess)

glad nebula
#

but still better not use it in realtime

chilly lake
#

for extraction maybe, but for search I do not recall there being hardware accelerated speaker embedding retrieval

glad nebula
chilly lake
#

mic audio - hardware accelerated hubert - search for the match in the index file on cpu - infer
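
The "search for the match in the index file on cpu" step is a nearest-neighbour lookup. A minimal stand-in, assuming a tiny in-memory index and brute-force Euclidean distance, could look like the sketch below. Real setups use a faiss index file instead, and `nearest_rows` is a hypothetical helper, not a faiss call.

```python
import math


def nearest_rows(index_rows, query, k=1):
    """Return the positions of the k index rows closest to the query vector."""
    if k < 1:
        raise ValueError("k must be at least 1")
    # Brute force: measure the distance to every stored row, then sort.
    order = sorted(
        range(len(index_rows)),
        key=lambda i: math.dist(index_rows[i], query),
    )
    return order[:k]


if __name__ == "__main__":
    # Hypothetical 2-D "speaker features"; real features have far more dimensions.
    index_rows = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
    print(nearest_rows(index_rows, [0.9, 1.2], k=2))
```

Every query touches every row here, which hints at why the lookup costs CPU time once the index rate is above 0.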

#

hm.. nvidia one?

glad nebula
# chilly lake hm.. nvidia one?

He reduced the cpu index usage in both amd fork and nvidia, with the nvidia fork having the lowest cpu usage while using index

covert lake
glad nebula
#

The wokada fork dev is here u could ask him what he did (his name is emojikage and has a cat pfp)

polar flax
snow sedge
chilly lake
#

key word is Linux

#

vs "After the download, you run MMVCServerSIO.exe" 🙂

#

anyway, i did a lil test with 1 min files, speaker embedding took about a second with a training running as well in background

snow sedge
#

Platform was not in question. And embedding is still fast on Intel CPUs (thanks to Intel MKL) and fairly fast on AMD CPUs

solar torrent
#

Feared your CPU will slow down your W-Okada, but KNOW nothing about W-Okada huh.

glad nebula
#

Oh sorry it wasnt gpu accelerated index, it was gpu accelerated extra chunk skullsob skullsob im sleepy sorry

solar torrent
#

Please don't worry about your current PC specs since this is one of the decent setups you got.

dapper ginkgo
#

yea they are solid.

solar torrent
#

Unless you run two or more W-Okada programs on one PC, since they can slow down each other's performance. nails

covert lake
blazing berry
#

nice pon?

solar torrent
flat grail
#

hola

elder willow
#

Hoi

pale falcon
#

Hola

solar torrent
#

Hi. doggowave

elder willow
#

Hi everyone ^^

bright arch
elder willow
bright arch
elder willow
#

oh um... was helping a friend find a voice model, and also interested in AI art stuff

bright arch
#

Oh did you find the model

elder willow
#

yesh

#

this server changed a lot since i was last in it, months ago i think

bright arch
#

Good for you, now get interested in ai art

elder willow
#

XD

bright arch
#

Change is contagious

#

Did you like it then or do you like it now

elder willow
#

err, I was in here before it got hacked? or changed or something

elder willow
bright arch
polar flax
elder willow
polar flax
minor blade
elder willow
glad nebula
#

he said once batch size 1 is the most precise of all

elder willow
#

XD

covert lake
covert lake
minor blade
covert lake
#

oh lol

#

i never watched mha

minor blade
elder willow
lilac lion
#

ok guys how do i make it so that celebrities say stuff, it is probably a frequently asked question so direct me where to go

covert lake
#

And do you mean inference (use model) on pre-recorded audios or realtime for calls?

polar flax
covert lake
lilac lion
#

id be happy to use those

#

gpu is laptop version of 3070 ti

covert lake
# lilac lion tts but if there are viable sts

There are different Text To Speech (TTS) AIs:

GPT-SoVITS: RVC isn't as good as GPT-SoVITS for TTS, but GPT-SoVITS (few-shot TTS, meaning models need only a little training) can't use RVC models (and vice versa), and it's limited to English, Chinese & Japanese. If you wanna check out GPT-SoVITS instead, read https://docs.ai-hub.wtf/tts/gpt-sovits/

Freemium ElevenLabs: An easy way to do TTS is https://elevenlabs.io/. You can't use RVC models on it, but it's a mostly premium, easy way to get good-quality TTS

FishSpeech: FishSpeech is a zero-shot (no explicit training needed) TTS. If you've got a good PC you can use it locally, otherwise use their site

With RVC Models:

RVC is natively for Speech To Speech, but forks such as Ilaria RVC Mainline & Applio have built-in TTS. They use Microsoft Edge TTS to generate a TTS audio and then convert it with RVC. I suggest choosing a TTS voice with the same gender and language as the RVC model you wanna use

If you wanna do tts locally with RVC Voice Models (if you got a good pc):

  • You can get Applio in our docs
  • While Ilaria RVC Mainline here (no guide as of right now)

If you don't got a good pc you can do tts with RVC Voice Models on cloud:

  • Ilaria RVC Zero (running on an A100 GPU, the fastest free RVC on cloud) and the guide

  • Use Applio UI Colab (with google colab T4 free daily limit gpu)

  • if you don't wanna use edge tts, you could try another tts ai from our tts index and use the output as an input in rvc


polar flax
covert lake
#

ye even if laptop gpus are weaker than normal ones

lilac lion
#

if i do a real time one is it still good

solar torrent
lilac lion
cerulean bluff
#

.

analog current
tropic dawn
#

o

glass junco
#

Any site where we can upload our voice model and use it as a voice changer ? Real time

lilac lion
#

Guys I know it is a bad question and all. But, is real time voice changing really that bad? For me it works with many stutters and just overall unrealistic

#

If it isn't that bad how do you make it good

dapper ginkgo
#

You need good hardware

#

and you have to find a model which works good with your voice

lilac lion
#

Ok got it man

#

So just better hardware and experimentation with models

#

Ok

dapper ginkgo
#

what is your GPU?

covert lake
covert lake
#

Be sure to NEVER follow yt tuts

#

-rt

rare sorrelBOT
covert lake
#

Wokada is the program to use RVC, Retrieval-based-Voice-Conversion Speech To Speech Models, in realtime for calls

There's also a fork (modified version), the deiteris fork, which has better performance

covert lake
lilac lion
#

I received help from the other channel so I think it’s hardware

covert lake
glass junco
covert lake
#

Wait

#

Are you running a game with wokada

#

Like gta5

lilac lion
#

no

covert lake
dapper ginkgo
lilac lion
#

discord

dapper ginkgo
#

Laptop GPUs arent that powerful

lilac lion
#

yeah thats what i am saying. made a mistake of buying a gaming laptop

dapper ginkgo
#

hmm I see

narrow olive
#

sup

river verge
#

Dilly ding, dilly dong! A new RegalHyperus drum model just released!
The Hashira Unite - Into the Infinity Castle (Drum model no. 552)

covert lake
covert lake
covert lake
#

U can run it for sure

#

maybe @icy pendant can help u

dapper ginkgo
covert lake
# glass junco 2060 super i7 4.5 ghz 16gb ram.

Wokada is the program to use RVC (Retrieval-based-Voice-Conversion, Speech To Speech Models) in realtime for calls

There's also a fork (modified version), the deiteris fork, which has better performance

#

-rt

rare sorrelBOT
covert lake
#

1st link, the wokada fork

near root
#

checking in! - is there still no voice-conversion AI that has whispers, grunts, shouts, etc?

night lake
edgy bloomBOT
#
Congratulations Razer by Weights!

Your Grotle is now level 23!

wicked hemlock
#

Hey, how can i make my voice much more realistic? i'm a man, so i need a woman's voice

fierce tiger
#

hi

devout mason
#

hi

#

i need friend

#

s

lilac lion
#

You need friends

#

We all do, not new info

exotic rampart
#

hi

dapper ginkgo
shrewd charm
#

holaa

keen creek
#

Hey, I'm watching a tutorial on how to download and im wondering why i cant see the chart to download on github

#

does anyone have answers?

glass junco
#

man im been sleeping on applio hecka better than 27.3

gray rover
#

@tepid basin Hmmm, so what's up with " find models " page not existing in applio? as in, what's the situation? Is it temporary or not existent anymore

gray rover
night lake
gray rover
night lake
gray rover
#

Alrighty, thanks

chilly lake
tepid basin
#

I've never used find models button

gray rover
#

did yeet it already

tepid basin
#

Make them drag models manually

#

No baby shit

chilly lake
#

model upload/download is for colab with UI

#

can't really drag models over

winged wigeon
#

when applio 3.2.8 colab?

solar torrent
#

-colab

rare sorrelBOT
# solar torrent -colab
📒 Google Colab Notebooks
ℹ️ Note

While the Colab free plan provides up to 12 hours of daily usage, the GPU is typically available for only about 4 hours each day on average.

solar torrent
#

Normal 3.2.8 or the bug-fixed 3.2.8?

winged wigeon
#

the local version is different than the colab

polar flax
#

the former 3.2.8 has an issue where the trained models don't work on mainline rvc & the voice changer

chilly lake
winged wigeon
#

where can i find the bug-fixed applio kaggle?

solar torrent
#

-kaggle

rare sorrelBOT
# solar torrent -kaggle
📘 Kaggle Notebooks

Note: Kaggle limits GPU usage to 30 hours per week.

winged wigeon
#

1st or 2nd one?

solar torrent
#

Two Kaggle notebooks for Applio? nails

polar flax
winged wigeon
#

thanks

forest crypt
broken raft
#

Is there a Lightning AI notebook for Applio version 3.2.8 with the bug fix?

solar torrent
gray rover
#

Alr, theme fully finished

#

god I f hate how miniscule in details I am

#

imagine I do spend like 10-15 mins on picking a f shade that'd look nice to me x' ) that is, for a single color lol

ionic pumice
#

wtf 48k

gray rover
#

oh?

#

wut's so weird about it

ionic pumice
#

nothing i just wonder why you added that xd

gray rover
#

🤔 because it still is a part of rvc and like uhhh

#

you want to take away support of og pretrains or wut

#

or in general, 48khz

ionic pumice
#

nothing just wondering xd

gray rover
#

Aside, it wasn't " added " it was there ever since 48 came out

#

lmfao

polar flax
night lake
gray rover
#

Awesome

#

means mission's accomplished

elder willow
#

I have an audio dataset of 2:17, what's a good epoch?

gray rover
#

There are certain protocols and principles to follow ofc

#

But I suppose, in case you dunno how to use tensorboard nor wanna learn.. hmmm.. for 2 mins you could try to set batch_size to 3 or 4 and maybe train it for 20-50 epochs, if results are meh, try 60-80
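
To get a feel for what those numbers mean in training steps, here is a rough back-of-the-envelope sketch. The 3-second segment length is purely an assumption for illustration (the real slicing done by the preprocessing step may differ), and `training_steps` is a made-up helper name.

```python
import math


def training_steps(dataset_seconds, segment_seconds, batch_size, epochs):
    """Estimate (segments, steps per epoch, total steps) for a training run."""
    # Assumption: the dataset is cut into equal-length segments.
    segments = math.ceil(dataset_seconds / segment_seconds)
    steps_per_epoch = math.ceil(segments / batch_size)
    return segments, steps_per_epoch, steps_per_epoch * epochs


if __name__ == "__main__":
    # 2:17 of audio is 137 seconds; 3.0 s per segment is a hypothetical value.
    for epochs in (20, 50, 80):
        print(epochs, training_steps(137, 3.0, batch_size=4, epochs=epochs))
```

With such a small dataset each epoch is only a handful of steps, so retrying with a different epoch count is cheap.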

#

No idea, just experiment ¯_(ツ)_/¯ It won't be the world's end if you had to try again after all

elder willow
#

Thanks

gray rover
#

np, best of luck ✨

elder willow
#

That gives me an idea for a start

#

I'm absolutely new to this

gray rover
#

Ooooo, I understand then

#

In that case, do you know what's tensorboard? ever heard of it?

elder willow
#

Nope

gray rover
#

Want me to give you a quick sumup?

elder willow
#

Sure, I have time

gray rover
#

Essentially, it looks like this:

#

In there you have graphs that reflect how the training goes

#

For instance, total D loss and total G loss are the metrics for the 2 main components of RVC (which uses a GAN architecture):
the Generator (denoted as G) and the Discriminator (denoted as D)

#

In essence, the Discriminator tries to learn to tell fake (generated) samples from real ones (ground-truth samples, i.e. those you use for training)

#

The Generator, on the other hand, learns to generate samples that are as realistic as it can make them

#

Both want to minimize their " loss "

#

in this "minimax game"
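
For reference, the textbook form of that minimax game is written below, where x is a real sample and z is the generator's input. This is the generic GAN objective only. The losses actually logged during training are likely a variant of it (VITS-style models commonly use a least-squares form, plus the extra generator terms), so treat it as a sketch of the idea rather than the exact formula.

```latex
\min_G \max_D \;
  \mathbb{E}_{x}\big[\log D(x)\big]
  + \mathbb{E}_{z}\big[\log\big(1 - D(G(z))\big)\big]
```

D tries to push this value up by scoring real samples high and generated ones low, while G tries to push it down by making D(G(z)) large.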

#

Now that's that. The main thing you should focus for now, before and if you get any more advanced
is just " total G loss "

#

and few other losses

#

Mel, FM and KL are sub-losses of Generator

#

FM means feature matching, it's your main loss to keep an eye on.
In short, it reflects the Generator's ability to match the features of your target voice (the dataset), for instance phonetics handling, pitch, spatial features, timbre, prosody and so on. (The Generator is your model: the models we actually use in covers are 'snapshots' of the generator.)

MEL means mel spectrogram loss. You can think of it as " clarity " and " fidelity "

KL is the more complicated one, so for simplicity I'll just say it denotes stability. For example, if the KL loss stays roughly level along the time axis across a range of N epochs, then an epoch picked from that range will be quite stable

#

In case something's unclear, please let me know and I'll do my best to simplify it or use some abstraction

elder willow
#

I'm following

gray rover
#

Now, moving on to the next part.
How do we even estimate when it's time to stop the training?
It's pretty simple.
I'll draw you an example of 2 total d / g grads

#

to demonstrate few scenarios

barren mauve
gray rover
#

ohhh, or actually.. I do have some stuff I used to show new people some time ago

gray rover
barren mauve
gray rover
#

Tho, if you want the theme, just lemme know lel

gray rover
#

This should be alright. Made em quite a while ago but should be somewhat still accurate

#

And again, in case of any questions, lemme know

elder willow
#

So beware of large spikes and dips?

#

And also abnormal flat sections

gray rover
#

yes, but this shouldn't be an issue if your dataset isn't having issues with gaping silences

elder willow
#

Gotcha

gray rover
#

it quite likely means there's stagnation going on

elder willow
#

I'm gonna need to study all this lol

gray rover
#

but it can also mean the model's encountering hard things it still learns
So it's always a good idea to let it train for more

#

Best way to not worry is saving every epoch. 1, 2, 3, 4 ... etc instead of like every 5th or so
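
With a checkpoint saved for every epoch, one rough way to shortlist candidates is to smooth a logged loss curve and look for its lowest point. The sketch below uses a simple exponential moving average, similar in spirit to the smoothing slider in TensorBoard. The function names and the loss values are made up for illustration, and the lowest loss does not guarantee the best-sounding model, so still listen to a few neighbouring epochs.

```python
def smooth(values, weight=0.6):
    """Exponential moving average; a higher weight gives a smoother curve."""
    if not values:
        raise ValueError("need at least one loss value")
    smoothed = []
    last = values[0]
    for value in values:
        last = weight * last + (1.0 - weight) * value
        smoothed.append(last)
    return smoothed


def best_epoch(losses, weight=0.6):
    """Return the 1-based epoch whose smoothed loss is lowest."""
    smoothed = smooth(losses, weight)
    return min(range(len(smoothed)), key=smoothed.__getitem__) + 1


if __name__ == "__main__":
    # Hypothetical per-epoch loss values, one entry per saved checkpoint.
    fm_loss = [10.0, 8.0, 6.0, 5.0, 7.0, 9.0]
    print(best_epoch(fm_loss, weight=0.5))
```

Saving every epoch is what makes this useful: whichever epoch looks best, its checkpoint actually exists.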

#

In any case, for more support like that, you can ask in #✨│ai-help
Should be more suitable for advanced things

elder willow
#

Thanks

gray rover
#

Np man, best of luck on your new journey ✨

elder willow
#

I'll go in and see what happens

gray rover
#

btw, you use rvc or applio

elder willow
#

Applio rn

#

I use both, though

gray rover
#

you know how to open tensorboard?

elder willow
#

Yep

gray rover
#

Alrighty, neat

#

In that case, again, gluck!

elder willow
#

Thanks

elder willow
ebon nacelle
#

@gray rover One more thing re: our convo in the other channel: let's say I have say an hour worth of high quality training data for a male singer, including their very comfortable singing range, and I use a female input vocal that's an octave higher than the model's range and I shift down an octave.
Are you saying it's still understandable that the vocal can come out sounding lisp-y and unnatural?
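
For reference, "shifting down an octave" corresponds to a pitch setting of -12 semitones, which halves every frequency. A small sketch of that arithmetic, assuming equal temperament (the helper names are illustrative):

```python
def semitone_ratio(semitones):
    """Frequency ratio for a pitch shift in equal-tempered semitones."""
    return 2.0 ** (semitones / 12.0)


def shift_frequency(hz, semitones):
    """Frequency in Hz after shifting by the given number of semitones."""
    return hz * semitone_ratio(semitones)


if __name__ == "__main__":
    # A4 (440 Hz) shifted down one octave lands on A3.
    print(shift_frequency(440.0, -12))
```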

gray rover
gray rover
#

tho, aside from the dataset itself, pretrains do play a significant role. original pretrains aren't that good at handling a broad range, in fact they were trained on rather monotone speech samples and so on.
either way, you can dm me with an example sample I mentioned and we'll see what to do.

Gonna respond a bit later cause about to go sleep

ebon nacelle
#

Thanks! Appreciate the help.

winged wigeon
#

so i downloaded the latest applio release and i dont have the new " mrf " vocoder?

ebon nacelle
#

Anyone try ddsp-svc 6.0? Curious if it's worth training something

queen kernel
#

It works fine if the sample is completely dry.

winged wigeon
tepid basin
#

thanks roy

winged wigeon
#

@errant sphinx

#

you should make him support

tepid basin
#

we should make him owner

barren mauve
#

hmm

mellow fog
#

is voice.ai safe cuz some ppl say it is and some say it isnt

#

rlly confused 😭

polar flax
mellow fog
#

i need to prank my friends tonight