#💬｜general-chat | Stable Diffusion | Page 130

hot kettle Apr 16, 2024, 5:25 PM

#

I guess it doesn't work like that with the full model

karmic cedar Apr 16, 2024, 5:27 PM

#

you know how you walk into a very basic grocery store and it’s got aisles of redundant boxes, all with very clever marketing designs yet comprised of the same refined substances, etc.? it’s not like a higher end grocery, where there are more varieties of produce, more varieties of food in general…that’ll be the trade-off between the various parameter builds of SD3 that get released

#

in terms of aesthetics

#

(theory)

distant swift Apr 16, 2024, 5:29 PM

#

hot kettle I guess it doesn't work like that with the full model

The model needs to be trained for both image and text conditioning to be able to do image encoding without controlnets or IP-Adapters, but the SD3 arch specified in the paper uses a CLIP model and a T5 model simultaneously for text encoding, and has a separate CLIP model for image encoding

hot kettle Apr 16, 2024, 5:30 PM

#

karmic cedar you know how you walk into a very basic grocery store and it’s got aisles of red...

Parameter builds as in different parameter counts?

karmic cedar Apr 16, 2024, 5:30 PM

#

yes

static cape Apr 16, 2024, 5:30 PM

#

let's wish for some news to arrive this week

karmic cedar Apr 16, 2024, 5:31 PM

#

yeah i’m just being a pessimist, i love being wrong because it means progress was made where i didn’t think there would be

hot kettle Apr 16, 2024, 5:31 PM

#

karmic cedar yeah i’m just being a pessimist, i love being wrong because it means progress wa...

Well it's usually the case that with less parameters comes worse performance

karmic cedar Apr 16, 2024, 5:31 PM

#

yeah—just trying to imagine how that will translate visually

#

i think it will lead to a lot of syntactic homogenization

#

(which is…regrettably…also in the best interests of high profile marketers, etc.)

hot kettle Apr 16, 2024, 5:32 PM

#

static cape *let's wish for some news to arrive this week*

And we'll get a 1gb SSM model outperforming everything else

#

(not fact checked)

karmic cedar Apr 16, 2024, 5:33 PM

#

that would be sweet

distant swift Apr 16, 2024, 5:33 PM

#

I'm pretty sure people will start to make quantized versions of the biggest SD3 model similarly to what was done with LLaMa2, to have the same model used by most people

hot kettle Apr 16, 2024, 5:34 PM

#

karmic cedar that would be sweet

I don't think there was any use of mamba architecture for image generation yet

distant swift Apr 16, 2024, 5:34 PM

#

Though quantizing diffusion models isn't nearly as simple as LLMs

karmic cedar Apr 16, 2024, 5:34 PM

#

when you consider the differences between a quantized text model, low and high bit

#

quality-wise

#

what stands out?

#

because what stands out in a text model will translate visually

hot kettle Apr 16, 2024, 5:35 PM

#

distant swift Though quantizing diffusion models isn't nearly as simple as LLMs

Community will probably make a Turbo sdxl3 in less than a month

karmic cedar Apr 16, 2024, 5:35 PM

#

probably

#

unless they don’t have the compute lol

hot kettle Apr 16, 2024, 5:36 PM

#

karmic cedar because what stands out in a text model will translate visually

I'm not sure it works that way. Tokens and embeddings work way diffently in llms

the way they generate

sage reef Apr 16, 2024, 5:36 PM

#

distant swift The model needs to be trained for both image and text conditioning to be able to...

not sure what you mean by simultaneously, this sounds to me like you are saying it's necessary component, and yet from everything i gathered from the research paper and elsewhere, it seems T5 is an optional component

red lynx Apr 16, 2024, 5:36 PM

#

Omg what's with these new accounts today

karmic cedar Apr 16, 2024, 5:36 PM

#

hot kettle I'm not sure it works that way. Tokens and embeddings work way diffently in llms...

this is true!

#

how about tech support?

sage reef Apr 16, 2024, 5:37 PM

#

oh maybe you meant for training? i misread, cause i know for inference, t5 is optional

distant swift Apr 16, 2024, 5:38 PM

#

sage reef not sure what you mean by simultaneously, this sounds to me like you are saying ...

It isn't a necessary component, as the model can still go through the diffusion process with a singular text cond, but, it helps the model be more coherent and understand prompts better. Similarly to how a negative prompt can increase output quality but isn't a requirement

sage reef Apr 16, 2024, 5:39 PM

#

yea

#

it would be cool if the architecture is modular in a way that we can plug any variant of t5 we want and not just the one they are using

hot kettle Apr 16, 2024, 5:40 PM

#

I still can't get over the fact that we came a full circle back to GANs with turbo models

#

How is that even still diffusion

sage reef Apr 16, 2024, 5:41 PM

#

i personally dont want speed, i want quality :3

hot kettle Apr 16, 2024, 5:41 PM

#

sage reef i personally dont want speed, i want quality :3

I want quality and low VRAM usage
Only 6gb 1060

sage reef Apr 16, 2024, 5:42 PM

#

😮

#

well low vram is always nice yes

distant swift Apr 16, 2024, 5:42 PM

#

sage reef it would be cool if the architecture is modular in a way that we can plug any va...

It could be, SD3 won't be the first open source diffusion model to have a transformer model instead of a UNET and use T5, and that model could use T5 XXL and a smaller variant of T5

sage reef Apr 16, 2024, 5:42 PM

#

my god 1060 is so old right? when was that released

hot kettle Apr 16, 2024, 5:43 PM

#

sage reef my god 1060 is so old right? when was that released

No clue lol

karmic cedar Apr 16, 2024, 5:43 PM

#

context is what makes a field of red a rothko painting

sage reef Apr 16, 2024, 5:43 PM

#

well for example, the flan t5 pruned is only like 2.5GB, so very compact and still works nice

hot kettle Apr 16, 2024, 5:44 PM

#

If they use a pretrained one and the features are similar for both I'm sure it'd work

distant swift Apr 16, 2024, 5:44 PM

#

I remember a model called Pixart being able to use different variations of T5

karmic cedar Apr 16, 2024, 5:45 PM

#

pixart is still being updated afaik

#

good model

sage reef Apr 16, 2024, 5:45 PM

#

well i tried pixart sigma, but cant get it to work with flan t5, so rip

red lynx Apr 16, 2024, 5:45 PM

#

distant swift I remember a model called Pixart being able to use different variations of T5

Sigma recently came out, it is quite impressive

karmic cedar Apr 16, 2024, 5:46 PM

#

there’s an llm2vec model now that can take mistral7b and turn it into a text encoder lol

distant swift Apr 16, 2024, 5:46 PM

#

Cool, it's similar to SD3 arch wise, right? Transformer diffusion model with T5 text encoder

sage reef Apr 16, 2024, 5:47 PM

#

but wait, the first one was alpha right? so they skipped pixart beta, or pixart gamma? straight to sigma? lol

hot kettle Apr 16, 2024, 5:47 PM

#

karmic cedar there’s an llm2vec model now that can take mistral7b and turn it into a text enc...

Encoded features have to be similar to the training ones for it to work

#

So not all encoders will work

karmic cedar Apr 16, 2024, 5:47 PM

#

hot kettle Encoded features have to be similar to the training ones for it to work

i understand, but as a proof of concept i think that’s a fascinating direction

#

it also suggests we might see techniques applied inversely to diffusion models so that they are more adaptable to this framework

hot kettle Apr 16, 2024, 5:48 PM

#

karmic cedar i understand, but as a proof of concept i think that’s a fascinating direction

I was wondering whether I should take clip encoder from sdxl and fine tune it on some stuff but it was too much work

karmic cedar Apr 16, 2024, 5:49 PM

#

sage reef but wait, the first one was alpha right? so they skipped pixart beta, or pixart ...

i think they’re really just gunning for PixArt Omega: PixArt Forever

sage reef Apr 16, 2024, 5:50 PM

#

PixArt Infinity

karmic cedar Apr 16, 2024, 5:50 PM

#

i’ll premiere it on my fake youtube game comedy show series called “What’s With Those Latents?”

red lynx Apr 16, 2024, 5:50 PM

#

distant swift Cool, it's similar to SD3 arch wise, right? Transformer diffusion model with T5 ...

Pretty much yes. t5 does load up into ram so vram required just for the model itself

#

It is quite coherent

sage reef Apr 16, 2024, 5:52 PM

#

im still waiting for them to release some version of stable audio, musicgen is nice, but very limited in some aspects

distant swift Apr 16, 2024, 5:52 PM

#

red lynx Pretty much yes. t5 does load up into ram so vram required just for the model i...

That's probably also what stuff like ComfyUI will do for SD3 to run in a usable speed with typical GPUs, and we'll have optimizations like TensorRT to have it run faster without needing to loose quality

hot kettle Apr 16, 2024, 5:53 PM

#

I wonder if some other big diffusion player will join the 'market' in the near future

#

Hopefully opensourced too

sage reef Apr 16, 2024, 5:54 PM

#

the more competition the better

#

but yea hopefully open source :3

distant swift Apr 16, 2024, 5:54 PM

#

Pretty sure SAI said SD3 is their final diffusion model

karmic cedar Apr 16, 2024, 5:55 PM

#

it seems that the most proprietary and advanced platforms are being engineered to exist in a very narrow market, i.e. multimedia professionals who already have the equivalent of seed money from their hollywood success

distant swift Apr 16, 2024, 5:55 PM

#

Well, that's what was said last time I checked in here

karmic cedar Apr 16, 2024, 5:55 PM

#

it’s their final T2I model yes

sage reef Apr 16, 2024, 5:55 PM

#

why would they make it final? they dont want money anymore?

hot kettle Apr 16, 2024, 5:55 PM

#

sage reef the more competition the better

Not too long after playground 2.5v released claiming to have better color space, stability ai gave us CosXL. So I'll say yeah

karmic cedar Apr 16, 2024, 5:56 PM

#

they have set themselves up for a direct course with many different industries that are already reveling from other AI-related developments

sage reef Apr 16, 2024, 5:56 PM

#

cosxl is very cool, im using it to edit some pics, very cool results, but sometimes the output is a bit blurry it seems

hot kettle Apr 16, 2024, 5:56 PM

#

distant swift Pretty sure SAI said SD3 is their final diffusion model

Well then they better come up with a whole another generative model type

sage reef Apr 16, 2024, 5:57 PM

#

stable confusion 🙂

hot kettle Apr 16, 2024, 5:57 PM

#

Stable disfunction

ornate flame Apr 16, 2024, 5:57 PM

#

unstable corruption

sage reef Apr 16, 2024, 5:58 PM

#

and sd3 will have cosxl or editing capabilities too, so that's gonna be awesome as well

hot kettle Apr 16, 2024, 5:58 PM

#

Stoic scattering

ornate flame Apr 16, 2024, 5:58 PM

#

sd3 might never have a public release

sage reef Apr 16, 2024, 5:58 PM

#

no it will

distant swift Apr 16, 2024, 5:58 PM

#

SAI and the others will probably start working on multimodals now that we pretty much have a model for most stuff (3D, image, audio, etc..)

ornate flame Apr 16, 2024, 5:58 PM

#

a lot has been changing at SAI recently, they might just abandon public model weight releases

sage reef Apr 16, 2024, 5:58 PM

#

yea multimodal is the way it's going these days

hot kettle Apr 16, 2024, 5:59 PM

#

distant swift SAI and the others will probably start working on multimodals now that we pretty...

Then we wont get anything good for a while

sage reef Apr 16, 2024, 5:59 PM

#

or maybe happemad is gonna release some greatness in the future

hot kettle Apr 16, 2024, 5:59 PM

#

sage reef yea multimodal is the way it's going these days

I like how after a month we can start seeing a change in development direction and call it 'these days'

sage reef Apr 16, 2024, 6:00 PM

#

haha

ornate flame Apr 16, 2024, 6:00 PM

#

sage reef or maybe <:happemad:1012407616565149706> is gonna release some greatness in the...

i hope so but it might be hard for him to get funding now

#

and I doubt he can just buy H100s with his own pocket

sage reef Apr 16, 2024, 6:00 PM

#

😦

#

im sure Emad is a rich lad

hot kettle Apr 16, 2024, 6:00 PM

#

I'm still waiting for the Mixture of mambas cosine scheduler adversial contrastive slicing diffusion

#

Some paper names these days seem really absurd

sage reef Apr 16, 2024, 6:02 PM

#

yo you saying the word mixture, like the mixture of experts in LLM, maybe we will have mixture of diffusions (MoD) where one expert is good at hands (assuming you want to generate something with hands), another expert is with face, etc :3 lol, prob not...

hot kettle Apr 16, 2024, 6:02 PM

#

While it'd seem intuitive, MoE don't actually seem to divide the task the way humans would

#

It seems rather random, but improves the speed, so we just go with it

sage reef Apr 16, 2024, 6:05 PM

#

on the downside... MoE models are usually huge in size... so imagine something similar for image generators, i dont think it would be great for even folk with 24gb vram cards LOL

hot kettle Apr 16, 2024, 6:06 PM

#

sage reef on the downside... MoE models are usually huge in size... so imagine something s...

A 70b model compares to a 6x14b model in size

sage reef Apr 16, 2024, 6:06 PM

#

yea

hot kettle Apr 16, 2024, 6:06 PM

#

But for MoE only 14b params should be used at once

#

So who knows if no other new architecture comes maybe will get MoD

sage reef Apr 16, 2024, 6:07 PM

#

idk... i mean considering we have all these cool toys in 2024, and at the speed tech is moving... my goodness, imagine what we will have in just couple years from now.. but hopefully nvidia starts releasing cards with at least 32GB vram... cmon bruh

#

i mean we dont have the final vram confirmation for 5090 right? it's all rumours i think

hot kettle Apr 16, 2024, 6:08 PM

#

sage reef idk... i mean considering we have all these cool toys in 2024, and at the speed ...

Hopefully we'll focus on squeezing the most power out of every param, lowering the VRAM required

#

I ain't spending 2K for a new GPU

sage reef Apr 16, 2024, 6:09 PM

#

ye the other thing is for the image generators to somehow use less vram, using some new algos

hot kettle Apr 16, 2024, 6:09 PM

#

But yeah the next couple of years, especially if no law against open source small company models will be passed, is going to be crazy

sage reef Apr 16, 2024, 6:10 PM

#

one day, we will be able to make super mario 64 by clicking just one button happemad

hot kettle Apr 16, 2024, 6:10 PM

#

Maybe AI based game engines? Ai physics, rendering etc

sage reef Apr 16, 2024, 6:10 PM

#

yea idk

#

i mean technically it's possible

#

we already have these image to 3d models

hot kettle Apr 16, 2024, 6:11 PM

#

Yeah, also all nerfs and what nots

#

And we already use DLSS and fsr to improve performance

sage reef Apr 16, 2024, 6:12 PM

#

next we need a neural network trained on level design, and it can generate a level with 3d assets for you, and you prompt just what style or whatever you wanna see in the level :3

hot kettle Apr 16, 2024, 6:12 PM

#

hot kettle And we already use DLSS and fsr to improve performance

Although it's more like a addon ontop of current rendering methods

hot kettle Apr 16, 2024, 6:13 PM

#

sage reef next we need a neural network trained on level design, and it can generate a lev...

What a world we live in

#

Too late to explore the earth, too early to live on mars, but just in time to experience ai technology exploding

sage reef Apr 16, 2024, 6:14 PM

#

instead of image 2 image, you have game style 2 game style, so you input mario 64 and it will give you something similar design wise, 3d platformer :3 , or maybe you just textually prompt: in the style of mario 64

hot kettle Apr 16, 2024, 6:14 PM

#

We'll get brainwaves to image

sage reef Apr 16, 2024, 6:14 PM

#

lol

hot kettle Apr 16, 2024, 6:14 PM

#

Especially with the new stability ai research

sage reef Apr 16, 2024, 6:15 PM

#

one day we will have ai brain surgery: you prompt: fix brain and it fixes the brain 🙂

hot kettle Apr 16, 2024, 6:15 PM

#

Man I wish I was a part of this major ai development

sage reef Apr 16, 2024, 6:17 PM

#

welp, time for me to shave... and im lazy af

hot kettle Apr 16, 2024, 6:17 PM

#

Just use stable razor

sage reef Apr 16, 2024, 6:17 PM

#

haha

#

that's actually a cool product name

hot kettle Apr 16, 2024, 6:18 PM

#

That's gonna be stability ai merch rebrand after they drop ai

low moon Apr 16, 2024, 6:19 PM

#

So what do the lucky ones have that layman peasants lack that they were chosen to test this out?

hot kettle Apr 16, 2024, 6:21 PM

#

low moon So what do the lucky ones have that layman peasants lack that they were chosen t...

Stable coin is what they have

clear oyster Apr 16, 2024, 6:22 PM

#

which samping method should dreamshapers use?

low moon Apr 16, 2024, 6:25 PM

#

hot kettle Stable coin is what they have

I cna live without fkn incentive. I can;t wait for AI to destory the whole money thing. It never worked.

#

If all you get out of bed for is to make money might as well not get up. I welcome our AI overlord and they can have all our jobs and economy.

hot kettle Apr 16, 2024, 6:27 PM

#

What

low moon Apr 16, 2024, 6:27 PM

#

the ultra tall super empty skinny towers of NY tell the story

#

We are movign towards the Star Tek future folks

#

no more money

clear oyster Apr 16, 2024, 6:28 PM

#

hot kettle Stable coin is what they have

stable coin lol?

clear oyster Apr 16, 2024, 6:28 PM

#

low moon If all you get out of bed for is to make money might as well not get up. I welco...

wtf is wrong with you lol

low moon Apr 16, 2024, 6:28 PM

#

4 hour work week?

#

try no hour work week

clear oyster Apr 16, 2024, 6:28 PM

#

low moon no more money

youre high af

low moon Apr 16, 2024, 6:28 PM

#

i dont think so

#

i wish tho haha

#

i never smoekd weed

hot kettle Apr 16, 2024, 6:29 PM

#

clear oyster youre high af

Nah he just testing the prerelease Stable Schizophrenia XL

low moon Apr 16, 2024, 6:29 PM

#

she

clear oyster Apr 16, 2024, 6:29 PM

#

hot kettle Nah he just testing the prerelease Stable Schizophrenia XL

loool

clear oyster Apr 16, 2024, 6:29 PM

#

low moon she

makes sense looooool

low moon Apr 16, 2024, 6:30 PM

#

and no im not trianign checkpoints

#

tho i did some loras

#

i think they work

sage reef Apr 16, 2024, 6:51 PM

#

stable coin? she is using dodge coin obviously

hot kettle Apr 16, 2024, 7:00 PM

#

That's why she doesn't have access

sage reef Apr 16, 2024, 7:03 PM

#

dodge coin is very unstable

low moon Apr 16, 2024, 7:05 PM

#

Yeah :/

#

We all should buy gold and silver. Maybe AI won't take that from us. XD

sage reef Apr 16, 2024, 7:06 PM

#

buy? how bold of you to assume i have money... 😦

hot kettle Apr 16, 2024, 7:06 PM

#

She doesn't know about stable mining...

low moon Apr 16, 2024, 7:06 PM

#

Well no one has money.

sage reef Apr 16, 2024, 7:07 PM

#

stable miner is the new minecraft game

low moon Apr 16, 2024, 7:07 PM

#

Those who have it its mostly tied up in empty glass towers and other legalities.

#

Stonks and "Art" and off shore areas. Etc.

#

So the top 0.01% is busy stressing over whatever they have so even thay can;t enjoy it. Really.

#

Of course the party line says otherwise.

#

Everyone is happy on Facebook.

#

even the oil barons smell their end and are scaling back their absurd megaprojects.

sage reef Apr 16, 2024, 7:10 PM

#

there will be blood

low moon Apr 16, 2024, 7:11 PM

#

Well there is

#

you could argue we are in the middle of WW3

#

no one labeled it as such tho

sage reef Apr 16, 2024, 7:11 PM

#

history will label it in 20 years

low moon Apr 16, 2024, 7:11 PM

#

uhuh

sage reef Apr 16, 2024, 7:12 PM

#

il use my stable fork

low moon Apr 16, 2024, 7:13 PM

#

hope its made of gold

#

or silver

sage reef Apr 16, 2024, 7:13 PM

#

it's made of diamonds

low moon Apr 16, 2024, 7:14 PM

#

mm

clear oyster Apr 16, 2024, 7:17 PM

#

hot kettle She doesn't know about stable mining...

stable minig??

clear oyster Apr 16, 2024, 7:18 PM

#

sage reef stable miner is the new minecraft game

what lol

clear oyster Apr 16, 2024, 7:18 PM

#

sage reef il use my stable fork

stable fork?

clear oyster Apr 16, 2024, 7:18 PM

#

sage reef it's made of diamonds

everyone high af rn lol

sage reef Apr 16, 2024, 7:18 PM

#

did you just ping me 3 times

#

sigh

#

we are just having a laugh :3 (i think)

karmic cedar Apr 16, 2024, 7:21 PM

#

use your stable spoons ya’ll

#

the mind can bend

low moon Apr 16, 2024, 7:22 PM

#

there is no spoon

karmic cedar Apr 16, 2024, 7:22 PM

#

^^

sage reef Apr 16, 2024, 7:23 PM

#

@karmic cedar your about me says you are machine learning researcher? you publish papers? 😮 or just student/learning type

karmic cedar Apr 16, 2024, 7:23 PM

#

just a learning type! 😛

sage reef Apr 16, 2024, 7:23 PM

#

ah

#

maybe you will give us the next stable diffusion after sd3 🙂

karmic cedar Apr 16, 2024, 7:24 PM

#

i’m mostly an experimental artist

#

but i have a neuroscience background

sage reef Apr 16, 2024, 7:24 PM

#

cool

karmic cedar Apr 16, 2024, 7:24 PM

#

so i approach AI with that in mind

#

pun intended

sage reef Apr 16, 2024, 7:24 PM

#

yea mind, neural network, it all makes sense now :3

karmic cedar Apr 16, 2024, 7:25 PM

#

yeah it all runs together :_D

#

i love that people from different backgrounds can relate to AI based on its logic and structure

sage reef Apr 16, 2024, 7:25 PM

#

im working on a neural network from scratch, so im also learning i guess

karmic cedar Apr 16, 2024, 7:25 PM

#

nice!

#

transformer-based?

sage reef Apr 16, 2024, 7:29 PM

#

wow easy there cowboy... when i said from scratch and learning... it's really from scratch... and learning.. as in... i just started 🙂
so technically im learning the inner workings of neural nets, like simple concepts and moving forward, no idea what it will turn
out in the end. i ultimately want to create some sort of domain-to-domain network, so like maybe GAN or idk.. like you input something
and it outputs something within the same domain, so either image to image or music to music or idk.. but not like text to image, then
again depending where this journey takes me, i might try that too, but for now want to learn mostly image to image stuff, cause i want
to combine it with my other research field, with digital signal processing and combine it 🙂

karmic cedar Apr 16, 2024, 7:29 PM

#

nice!

#

i’m fascinated by fMRI2img

teal pagoda Apr 16, 2024, 7:29 PM

#

Anyone knowing the launch date of SD3?

sage reef Apr 16, 2024, 7:30 PM

#

april 26 happemad

teal pagoda Apr 16, 2024, 7:30 PM

#

yea, sure

sage reef Apr 16, 2024, 7:31 PM

#

i should probably stop saying this date cause people might start believing it and then when it turns out it's not, im gonna get like 100 pings

red lynx Apr 16, 2024, 7:31 PM

#

sage reef i should probably stop saying this date cause people might start believing it an...

too late, I got my hopes up

sage reef Apr 16, 2024, 7:32 PM

#

haha

#

on the other hand, i hope it's not april 26, cause then people will think im working for SAI lol

hot kettle Apr 16, 2024, 7:35 PM

#

sage reef wow easy there cowboy... when i said from scratch and learning... it's really fr...

I've tried 3 times to make GANs and failed every time lol

sage reef Apr 16, 2024, 7:35 PM

#

yea not easy stuff :3

hot kettle Apr 16, 2024, 7:35 PM

#

Only thing that somewhat worked from my models was SR

red lynx Apr 16, 2024, 7:35 PM

#

sage reef on the other hand, i hope it's not april 26, cause then people will think im wor...

Well obviously you don't work wink

hot kettle Apr 16, 2024, 7:36 PM

#

sage reef yea not easy stuff :3

GANs are one of the oldest models out there and I couldn't even make one work lol

sage reef Apr 16, 2024, 7:36 PM

#

i think GANs started in 2014 if i recall

hot kettle Apr 16, 2024, 7:36 PM

#

Well yeah, but with current rate of progress that's archaic technology

#

And easy to write from scratch using torch/tensorflow

sage reef Apr 16, 2024, 7:37 PM

#

oh im not using torch or anything, im literally doing it from scratch, limiting myself only to numpy for example and some plotting i guess, i really want to learn the inner workings

hot kettle Apr 16, 2024, 7:38 PM

#

I think I wrote a evolutionary neural net from scratch in c# a while ago

sage reef Apr 16, 2024, 7:38 PM

#

nice

hot kettle Apr 16, 2024, 7:38 PM

#

But I'd not dare to even try implementing SGD with just numpy

sage reef Apr 16, 2024, 7:38 PM

#

haha

#

it's not that bad :3

hot kettle Apr 16, 2024, 7:39 PM

#

Yeah except every source for anything ai related gives me some crazy equations with characters I've never seen lol

sage reef Apr 16, 2024, 7:40 PM

#

well i got help from wiki math when it comes to symbols im not familiar with, i know most of them

hot kettle Apr 16, 2024, 7:40 PM

#

Well good luck with that but I'll stick to joining torch blocks together

#

And even with that I barely understand anything from the last few years

sage reef Apr 16, 2024, 7:41 PM

#

i spent a lot of time reading technical research papers and sometimes they provide pseudo code within the paper and that helps piece the whole thing together and then i implemented it myself, for some of the projects (non ai projects), so you can learn, but some stuff is a bit too convoluted and the paper wont help you, unless they maybe decide to also release some code on github to learn from, but yea...

hot kettle Apr 16, 2024, 7:42 PM

#

Yeah I try to ready every bigger paper that comes out and some older ones I find interesting

#

But outside of the general mechanics they describe in text, most of the pseudo code or math equations don't make me understand it better

sage reef Apr 16, 2024, 7:44 PM

#

that's why i decided to start from scratch or just math (numpy) cause i really wanna grasp it, so i then have total understanding and control of what im doing and i know what im doing... as opposed to.. here are these legos... and make something... but how did they make the lego itself :3

#

and of course i take notes and comment my code a lot

hot kettle Apr 16, 2024, 7:45 PM

#

Well I understand those most basic basics, but I doubt I could code them without further reading

sage reef Apr 16, 2024, 7:45 PM

#

im actually super crazy when it comes to commenting code haha, i spend like paragraphs just on one line of code sometimes, cause i need to remind myself what this is doing and how it can be used if you alter it or whatnot

#

or if i write a custom function

hot kettle Apr 16, 2024, 7:47 PM

#

I make a comment every 200 lines if not more

#

Unless I specify the tensor sizes inside model parts

#

Or to segment my code

sage reef Apr 16, 2024, 7:48 PM

#

i remember during one project, i was stuck implementing a research paper, and i remembered i did something very similar and even commented it on another project and that saved me... and i completed the project

#

cause i had to understand the logic

#

im also working on a 3d game engine, got the renderer part (but not complete), the physics (but not everything), and now doing animations, but that is a pain in the thingy

hot kettle Apr 16, 2024, 7:51 PM

#

I never made an engine but I did make some game-ish projects, but that is way easier

sage reef Apr 16, 2024, 7:52 PM

#

im a programmer, so i like to try all sorts of projects 🙂

hot kettle Apr 16, 2024, 7:53 PM

#

I do some programming for fun but wouldn't call myself a programmer

narrow badger Apr 16, 2024, 7:54 PM

#

hey

charred mesa Apr 16, 2024, 7:54 PM

#

sage reef april 26 <:happemad:1012407616565149706>

very real

sage reef Apr 16, 2024, 7:54 PM

#

mhm

#

you know it

sage reef Apr 16, 2024, 7:55 PM

#

narrow badger hey

are you a bot too? cause they always say hey, or hello or something haha

loud solar Apr 16, 2024, 7:55 PM

#

Hello World!

narrow badger Apr 16, 2024, 7:55 PM

#

sage reef are you a bot too? cause they always say hey, or hello or something haha

nah just tryna figure out if there was a verification or not

sage reef Apr 16, 2024, 7:55 PM

#

ok if you are replying, you are not a bot :3

narrow badger Apr 16, 2024, 7:56 PM

#

just couldnt run stabilityai/stable-video-diffusion-img2vid-xt

sage reef Apr 16, 2024, 7:56 PM

#

ah

narrow badger Apr 16, 2024, 7:56 PM

#

maybe i find answers here

sage reef Apr 16, 2024, 7:56 PM

#

the xt version takes a lot of vram if i recall

hot kettle Apr 16, 2024, 7:56 PM

#

loud solar Hello World!

Hello MNIST!

narrow badger Apr 16, 2024, 7:56 PM

#

its been like 5 minutes

#

but its still on %0

#

i have 4070super

#

is it normal

sage reef Apr 16, 2024, 7:57 PM

#

i think you mean img2vid tho? and not img2img?

narrow badger Apr 16, 2024, 7:57 PM

#

img2vid

sage reef Apr 16, 2024, 7:57 PM

#

4070 super how much vram is that? i dont know all the models

narrow badger Apr 16, 2024, 7:57 PM

#

wow it says 3.5hours left

#

its my gpu

loud solar Apr 16, 2024, 7:57 PM

#

narrow badger its been like 5 minutes

Works fine here

sage reef Apr 16, 2024, 7:57 PM

#

yikes

#

but wait, how many frames are you generating?

loud solar Apr 16, 2024, 7:58 PM

#

Hopefully not more than 25 🙂

sage reef Apr 16, 2024, 7:58 PM

#

lol

#

gpu gonna explode

loud solar Apr 16, 2024, 7:58 PM

#

Model can't handle more ...

sage reef Apr 16, 2024, 7:58 PM

#

yea

narrow badger Apr 16, 2024, 7:59 PM

#

sage reef but wait, how many frames are you generating?

well i have no idea

rugged mirage Apr 16, 2024, 7:59 PM

#

are you definitely using your gpu? sounds like you might be using cpu/downloading some models from the workflow for the first time if it's hours

narrow badger Apr 16, 2024, 7:59 PM

#

from diffusers import DiffusionPipeline
from PIL import Image

print("-------------------- START -------------------")

pipeline = DiffusionPipeline.from_pretrained("stabilityai/stable-video-diffusion-img2vid-xt")
print("Pipeline loaded")

image_path = "image.png"
print("Image loaded")
image = Image.open(image_path)
print("Image opened")

result = pipeline(image)
print("Image passed to model")

result.save("output_video.mp4")

#

here is the code

narrow badger Apr 16, 2024, 7:59 PM

#

rugged mirage are you definitely using your gpu? sounds like you might be using cpu/downloadin...

no idea

rugged mirage Apr 16, 2024, 8:00 PM

#

Im guessing that downloads the models from that links, that can take adges currently

narrow badger Apr 16, 2024, 8:00 PM

#

first time just running it without ui

rugged mirage Apr 16, 2024, 8:00 PM

#

show the output not the code

sage reef Apr 16, 2024, 8:00 PM

#

is your pipeline using cuda?

rugged mirage Apr 16, 2024, 8:00 PM

#

and check in task manager or nvidia-smi or whatever if the gpu is being used

loud solar Apr 16, 2024, 8:00 PM

#

https://cdn.discordapp.com/attachments/1004159122335354970/1229884157325344810/20240415-094517-aambq_thm2_chf3_prob4.mp4?ex=66314de1&is=661ed8e1&hm=00272486642cb53c38ccbcfaec5525ace5a77c83ea9908f9e46fd5267dd679ad& my last video ...

narrow badger Apr 16, 2024, 8:01 PM

#

rugged mirage and check in task manager or nvidia-smi or whatever if the gpu is being used

ok iwill

sage reef Apr 16, 2024, 8:01 PM

#

@loud solar why is he looking me like that? :3

#

but nice stuff man

loud solar Apr 16, 2024, 8:01 PM

#

sage reef <@593876477129392139> why is he looking me like that? :3

Maybe he is into Anime?

narrow badger Apr 16, 2024, 8:01 PM

#

loud solar https://cdn.discordapp.com/attachments/1004159122335354970/1229884157325344810/2...

did u use a ui

loud solar Apr 16, 2024, 8:02 PM

#

narrow badger did u use a ui

Stable Diffusion Forge has SVD included and is pretty easy

sage reef Apr 16, 2024, 8:02 PM

#

i recommend you use comfy, it's very well optimized for memory and image2vid works nice in there

#

or forge i guess

loud solar Apr 16, 2024, 8:03 PM

#

Comfy is better if you don't need to learn everything 🙂

sage reef Apr 16, 2024, 8:03 PM

#

nah you dont need to learn a lot :3

narrow badger Apr 16, 2024, 8:04 PM

#

can you. guys provide me a doc or a link that would help a lot

#

actually im tryna implement ai to my app so i need an proper api

loud solar Apr 16, 2024, 8:04 PM

#

narrow badger can you. guys provide me a doc or a link that would help a lot

For Forge: https://github.com/lllyasviel/stable-diffusion-webui-forge

rugged mirage Apr 16, 2024, 8:05 PM

#

you can use comfy and forge in an api too

narrow badger Apr 16, 2024, 8:05 PM

#

thats why i tried it on python

loud solar Apr 16, 2024, 8:05 PM

#

just copy model and that's it ...

sage reef Apr 16, 2024, 8:05 PM

#

after you install comfy, you follow this for example: https://comfyanonymous.github.io/ComfyUI_examples/video/

narrow badger Apr 16, 2024, 8:05 PM

#

for api services which one do you guys suggest

sage reef Apr 16, 2024, 8:06 PM

#

i never used api services personally, they are very limited

narrow badger Apr 16, 2024, 8:06 PM

#

oh why

loud solar Apr 16, 2024, 8:06 PM

#

Same here

rugged mirage Apr 16, 2024, 8:06 PM

#

if your workflow is simple then yeh the way you are doing it with diffusers is best for API, if it's more complex then you can turn comfy into API

sage reef Apr 16, 2024, 8:07 PM

#

im gonna stay with comfy forever ❤️

narrow badger Apr 16, 2024, 8:07 PM

#

i just need to animate images in a very simple ways

#

no need configurations etc

sage reef Apr 16, 2024, 8:08 PM

#

so you just want the most easiest approach, in which case comfy is not for you

narrow badger Apr 16, 2024, 8:08 PM

#

but all the examples on internet are about UIs

narrow badger Apr 16, 2024, 8:08 PM

#

sage reef so you just want the most easiest approach, in which case comfy is not for you

yep i guess so

#

just a simple python code to run this model

#

i could not find any

sage reef Apr 16, 2024, 8:09 PM

#

the only problem with the most easiest approach is that it's most likely not gonna give you the best results, for example people literally have comfy workflows specifically tuned for svd to work and give nice results, which is not just the svd part, so yea.. you prob wont get the best results

narrow badger Apr 16, 2024, 8:10 PM

#

i see

#

then i will check them apis

#

i love you guys thx a lot

sage reef Apr 16, 2024, 8:10 PM

#

well good luck and always ask if you need help

narrow badger Apr 16, 2024, 8:10 PM

#

thxx

gusty oriole Apr 16, 2024, 8:13 PM

#

troll

loud solar Apr 16, 2024, 8:13 PM

#

Nicer to have some serious questions 🙂

sage reef Apr 16, 2024, 8:14 PM

#

as opposed to not serious questions

gusty oriole Apr 16, 2024, 8:14 PM

#

w00t

sage reef Apr 16, 2024, 8:14 PM

#

what up doc

loud solar Apr 16, 2024, 8:15 PM

#

sage reef as opposed to not serious questions

SD3 when? 😄

sage reef Apr 16, 2024, 8:15 PM

#

come on,.. that is serious haha

loud solar Apr 16, 2024, 8:15 PM

#

Got fresh brewed tea ... that's serious ^^

sage reef Apr 16, 2024, 8:16 PM

#

tea for two

#

i mean the question is serious, but the answers are not... for example april 26 🙂

loud solar Apr 16, 2024, 8:18 PM

#

Nobody here can give the answer so how serious can be the question?

sage reef Apr 16, 2024, 8:20 PM

#

well to be fair, not everyone is up to date with news perhaps, so technically speaking, they could have missed some actual news from SAI in which they did announce an actual date, and so the people asking sd3 when kinda makes sense in that regard

#

but yea most are just meming at this point... cause sd3 is a myth :3 lol

loud solar Apr 16, 2024, 8:22 PM

#

But even if somebody working for SD would be here ... I don't think he would know ...

sage reef Apr 16, 2024, 8:23 PM

#

i can only assume the lead dude knows, cause he gives the final order for release, the devs dont know when it will be released, they are just devs working on it

karmic cedar Apr 16, 2024, 8:23 PM

#

And let’s not forget who the devs report to

loud solar Apr 16, 2024, 8:24 PM

#

But the lead dude has other problems than chatting here 🙂

sage reef Apr 16, 2024, 8:24 PM

#

exactly

loud solar Apr 16, 2024, 8:24 PM

#

But he can DM me 😄

sage reef Apr 16, 2024, 8:25 PM

#

we are just the peanut gallery for the lead dude, he can't possibly be bothered by us down here

loud solar Apr 16, 2024, 8:25 PM

#

I still offer to watch the killswitch ^^

sage reef Apr 16, 2024, 8:25 PM

#

"look at these peasants waiting for sd3", he said calmly

trail lion Apr 16, 2024, 8:26 PM

#

look at these people thinking they are owed timelines and status updates

sage reef Apr 16, 2024, 8:26 PM

#

right

loud solar Apr 16, 2024, 8:28 PM

#

Maybe we should start a crowdfunding for earlier release? 🙂

sage reef Apr 16, 2024, 8:28 PM

#

happemad

karmic cedar Apr 16, 2024, 8:33 PM

#

happemad chad

sage reef Apr 16, 2024, 8:33 PM

#

im actually curious what Emad is gonna cook

karmic cedar Apr 16, 2024, 8:34 PM

#

him joining microsoft is poetic for the most part

#

i hope he works towards his own dreams while facilitating his role there

loud solar Apr 16, 2024, 8:37 PM

#

sage reef im actually curious what Emad is gonna cook

Spaghetti Monsters ...

karmic cedar Apr 16, 2024, 8:37 PM

#

i mean what if that was a clever name for a new diffusion model

#

since diffusion models are technically entangled spaghetti

sage reef Apr 16, 2024, 8:38 PM

#

@honest mica hey just have a question for you. i know you trained the CCTV loras, and that is technically a concept i guess? i want to try to train a concept lora, so wondering what are the best practices when training for a concept rather than just some object or thing? do concepts need something special? either parameter wise, amount of picture wise, or captions wise, or idk.. any tips? :3

steep timber Apr 16, 2024, 8:52 PM

#

what is the best model for converting a pixelated image to a realistic one?

sage reef Apr 16, 2024, 8:53 PM

#

SUPIR

#

unless you mean like an actual pixel style image, in which case, it can be almost any realistic model, cause you are doing image to image.

#

but if by pixelated you mean low degraded quality image to restored version, then SUPIR should help with that

#

and if the realistic model doesnt work, you can force it further with a lora like realistic slider, and put the slider to max strength and it should convert it

#

i did this to convert some anime pics to real and vice versa

karmic cedar Apr 16, 2024, 9:02 PM

#

magic image refiner could also do the trick

sage reef Apr 16, 2024, 9:07 PM

#

karmic cedar magic image refiner could also do the trick

is that available in comfy?

karmic cedar Apr 16, 2024, 9:08 PM

#

i believe it is a comfy workflow—let me try to source a link for ya

sage reef Apr 16, 2024, 9:08 PM

#

thx

karmic cedar Apr 16, 2024, 9:09 PM

#

https://github.com/BatouResearch/magic-image-refiner

#

chiggity-check

#

it be a cog.

obsidian viper Apr 16, 2024, 9:10 PM

#

hi, does anybody have a reference to a good image to video comfyui workflow?

karmic cedar Apr 16, 2024, 9:11 PM

#

https://comfyworkflows.com

#

they have lots

sage reef Apr 16, 2024, 9:11 PM

#

so wait im confused... cog? but it can used in comfy or no?

karmic cedar Apr 16, 2024, 9:12 PM

#

i’m gonna say no

#

i’m looking more closely at it

#

sorry for the confusion

#

it is a pretty good controlnet sandwich though

steep timber Apr 16, 2024, 9:13 PM

#

sage reef SUPIR

this one? https://huggingface.co/camenduru/SUPIR

karmic cedar Apr 16, 2024, 9:14 PM

#

oh hi WizardLM 2 8x22B holy $@^#$&^

#

https://github.com/victorsungo/WizardLM/tree/main/WizardLM-2

sage reef Apr 16, 2024, 9:16 PM

#

steep timber this one? https://huggingface.co/camenduru/SUPIR

i recommend the Kijai version https://github.com/kijai/ComfyUI-SUPIR but yea

sage reef Apr 16, 2024, 9:17 PM

#

karmic cedar oh hi WizardLM 2 8x22B holy $@^#$&^

yea Wizard LM 2 released

karmic cedar Apr 16, 2024, 9:17 PM

#

crazy performance looks like

steep timber Apr 16, 2024, 9:18 PM

#

sage reef i recommend the Kijai version https://github.com/kijai/ComfyUI-SUPIR but yea

i think i wasnt clear on what i wanted

#

i want a model that converts pixel art to realistic

#

that model looks like just an upscaler

sage reef Apr 16, 2024, 9:18 PM

#

it looks like they made a small mistake and released it a bit too early https://new.reddit.com/r/LocalLLaMA/comments/1c586rm/wizardlm2_was_deleted_because_they_forgot_to_test/

lol, so i guess the first version out there is less censored

karmic cedar Apr 16, 2024, 9:19 PM

#

steep timber that model looks like just an upscaler

i think they were under the impression you had a pixelated image. in your case i would recommend a good img2img controlnet-based solution

#

I enjoy Fooocus for its ease of use and consistency

sage reef Apr 16, 2024, 9:19 PM

#

yea cause i wasnt sure what you meant by the word pixelated

steep timber Apr 16, 2024, 9:19 PM

#

yea, sorry

#

language skill issue 🤓

sage reef Apr 16, 2024, 9:19 PM

#

it's ok 🙂

karmic cedar Apr 16, 2024, 9:19 PM

#

all good here

sage reef Apr 16, 2024, 9:21 PM

#

ye so you can use almost any realistic model, combine with optional lora like realistic slider, and then use image 2 image with strong denoise (cause it needs to convert it, so maybe 0.50 or above) and perhaps controlnet canny or whatever, depending if you care about the details

karmic cedar Apr 16, 2024, 9:22 PM

#

you could find a really complex workflow and then go Doc Brown on a DeLorean in ComfyUI

sage reef Apr 16, 2024, 9:22 PM

#

sometimes even simple workflows can work too, no need to go extra crazy 🙂

karmic cedar Apr 16, 2024, 9:22 PM

#

oh yeah, a little canny can go a long away.

#

way, even?

pearl ocean Apr 16, 2024, 9:25 PM

#

Forge is all you really need

#

bobagirl

nova zodiac Apr 16, 2024, 9:27 PM

#

pearl ocean Forge is all you really need

Lora merging still broken in Forge 😛

#

but adetailer, peturbed attention guidance, regional prompter, and controlnet all working awesome

pearl ocean Apr 16, 2024, 9:28 PM

#

nova zodiac but adetailer, peturbed attention guidance, regional prompter, and controlnet al...

Prompt Engineer

pearl ocean Apr 16, 2024, 9:29 PM

#

nova zodiac Lora merging still broken in Forge 😛

I tired that Connect Net, got some picture of someone in some pose, it was like 50% chance of the person to be in the pose given lol

#

catlurk

nova zodiac Apr 16, 2024, 9:56 PM

#

connect net or control net?

nova zodiac Apr 16, 2024, 9:56 PM

#

pearl ocean Prompt Engineer

What's prompt engineer?

pearl ocean Apr 16, 2024, 10:12 PM

#

nova zodiac What's prompt engineer?

Someone who writes prompts XD

pearl ocean Apr 16, 2024, 10:12 PM

#

nova zodiac connect net or control net?

Yes

sage reef Apr 16, 2024, 10:13 PM

#

you need to go to university to become a prompt engineer

nova zodiac Apr 16, 2024, 10:18 PM

#

sage reef you need to go to university to become a prompt engineer

I've heard some shit takes in my time, and that is one of them - a half hour on youtube and a couple of good links and you can be writing great prompts in no time

#

hell even just asking the bot on civitai for a prompt gives you a half decent starting point to work from

sage reef Apr 16, 2024, 10:20 PM

#

i hope you didnt take what i said actually seriously... lol, i meant it as a joke considering people call it "prompt engineering" and engineering is usually related to university studies :3

nova zodiac Apr 16, 2024, 10:20 PM

#

the /s was dropped

sage reef Apr 16, 2024, 10:21 PM

#

it's all good fam

next dawn Apr 16, 2024, 10:23 PM

#

Virtual assistant available here ✌🏻

sage reef Apr 16, 2024, 10:24 PM

#

virtual? im only interested in actual real assistants, sorry

karmic cedar Apr 16, 2024, 11:49 PM

#

when is DeepMind going to start playing Vampire Survivors and can I watch the stream plz

#

wow Cascade does sausages really well

#

is that because it’s a german architecture?

sage reef Apr 17, 2024, 12:12 AM

#

based on what i understood, Cascade is basically Wurschten v3, or however you spell that in german and it seems that word means sausage, so i guess it makes sense? lol

karmic cedar Apr 17, 2024, 12:16 AM

#

are you serious

#

i legit did not put that together

sage reef Apr 17, 2024, 12:20 AM

#

haha

worn aspen Apr 17, 2024, 12:54 AM

#

I feel dumb, but browsing the cascade sub, I'm looking for conversations and information, but it's mostly images. Is this an image generation bot channel thing like mid journey? I'm not seeing any indication in the rules channels.

karmic cedar Apr 17, 2024, 1:25 AM

#

speaking of midjourney what’s with all the mid finetunes claiming to match v6 performance lately?

#

small spoiler: they do not

shell tendon Apr 17, 2024, 1:31 AM

#

sage reef you need to go to university to become a prompt engineer

and it's gotta be a top master's program

sage reef Apr 17, 2024, 1:36 AM

#

for sure

heavy lark Apr 17, 2024, 1:49 AM

#

worn aspen I feel dumb, but browsing the cascade sub, I'm looking for conversations and inf...

it's because cascade is dead. it's been non-stop discussion for the few months since it came out, but ultimately most of what's being generated now is being put through an sdxl refiner because they know nothing new will ever come from cascade ever again. all the people who made it left the company, and nobody is making any finetunes because sd3 was announced within a week later.

shell tendon Apr 17, 2024, 1:51 AM

#

yep. unfortunate.

heavy lark Apr 17, 2024, 1:51 AM

#

i was using it for it's superior clipvision ability, but now that ipadapter for sdxl has been revamped, that's now way better than cascade was.

shell tendon Apr 17, 2024, 1:51 AM

#

worn aspen I feel dumb, but browsing the cascade sub, I'm looking for conversations and inf...

the really big issue with cascade is stage B

#

if you don't use enough steps, you have leftover noise. but if you have too many steps, you have oversampling noise of some kind.

#

generally, the number of steps that lay between those two areas is zero

nova zodiac Apr 17, 2024, 1:52 AM

#

sage reef based on what i understood, Cascade is basically Wurschten v3, or however you sp...

würstchen = tiny sausage

shell tendon Apr 17, 2024, 1:54 AM

#

heavy lark i was using it for it's superior clipvision ability, but now that ipadapter for ...

can you imagine how great the new IPA would be with cascade's CV?

#

too bad that's not a reality

pearl ocean Apr 17, 2024, 2:00 AM

#

shell tendon can you imagine how great the new IPA would be with cascade's CV?

Apply for a job, put it on your CV

#

catlurk

nova zodiac Apr 17, 2024, 2:09 AM

#

shell tendon can you imagine how great the new IPA would be with cascade's CV?

I mean a good beer and generating images is always a good time 😛

pearl ocean Apr 17, 2024, 2:35 AM

#

nova zodiac I mean a good beer and generating images is always a good time 😛

why not generate images of Beer?

#

🍻

warped shard Apr 17, 2024, 3:21 AM

#

hmm quick question, anyone know where .pt files go?

#

well.. where to find it

#

dropped it in the embedding file but cant seem to find out where to use it

trail lion Apr 17, 2024, 3:30 AM

#

warped shard hmm quick question, anyone know where .pt files go?

first make sure you have the right kind of model loaded. ie. a 1.5 model with an embedding trained for 1.5. if you have something like 2.1 or sdxl it wont be compatible. secondly, look for a tab called "textual inversion" on the main page of automatic1111, and you should see the embedding there

#

the .pt files go under stable-diffusion-webui/embeddings

warped shard Apr 17, 2024, 3:32 AM

#

trail lion first make sure you have the right kind of model loaded. ie. a 1.5 model with a...

hmm yeah i have it all in embedding but idk how to use them

trail lion Apr 17, 2024, 3:32 AM

#

you click on it and it puts a tag in your prompt

sage reef Apr 17, 2024, 3:38 AM

#

nova zodiac würstchen = tiny sausage

so they think my sausage is tiny eh?

warped shard Apr 17, 2024, 3:39 AM

#

trail lion you click on it and it puts a tag in your prompt

like i dont see anything for the embedding

#

#🏞｜general-with-images

nova zodiac Apr 17, 2024, 3:41 AM

#

sage reef so they think my sausage is tiny eh?

in a cute way

karmic cedar Apr 17, 2024, 3:43 AM

#

I’m going to finetune a lora of just action heroes busting through doors with guns and call it Embed This

sage reef Apr 17, 2024, 3:46 AM

#

do it

karmic cedar Apr 17, 2024, 4:03 AM

#

nah

warped shard Apr 17, 2024, 4:03 AM

#

but uh if anyone know how to use the embedding lmk

ornate flame Apr 17, 2024, 4:18 AM

#

sage reef so they think my sausage is tiny eh?

"General chat about all things Stable!"

karmic cedar Apr 17, 2024, 4:20 AM

#

“Democracy”

honest mica Apr 17, 2024, 4:22 AM

#

sage reef <@554031564913115136> hey just have a question for you. i know you trained the ...

Hi lodis. Is it though? I think you got confused with concept and style. Anyway, I just caption the whole image and put my keyword in front of the caption. Like CCTVfootage, {caption}. Amount of images I would recommend something in the range of 35 - 250. Parameters I could send you later, because I am not home at the moment.

thorny echo Apr 17, 2024, 4:28 AM

#

Has anyone been able to make pixel art by using a image as its style?

shell tendon Apr 17, 2024, 4:29 AM

#

thorny echo Has anyone been able to make pixel art by using a image as its style?

https://civitai.com/models/120096/pixel-art-xl

thorny echo Apr 17, 2024, 4:32 AM

#

shell tendon https://civitai.com/models/120096/pixel-art-xl

Thank you very much 😄

shell tendon Apr 17, 2024, 4:32 AM

#

np 🙂

thorny echo Apr 17, 2024, 4:33 AM

#

Just to double check it goes in the Lora folder right?

shell tendon Apr 17, 2024, 4:37 AM

#

yup

#

err

#

yes

#

that is a lora

#

had to check real quick cuz there's also a checkpoint that's kinda similar

thorny echo Apr 17, 2024, 4:39 AM

#

Doesnt seem to be showing on in the lora tab I am putting it in D:\sd.webui\webui\models\Lora

shell tendon Apr 17, 2024, 4:39 AM

#

might need to hit the refresh arrow

karmic cedar Apr 17, 2024, 4:40 AM

#

damn lora how u tune so fine

sage reef Apr 17, 2024, 4:41 AM

#

lora? barely knew her

thorny echo Apr 17, 2024, 4:41 AM

#

shell tendon might need to hit the refresh arrow

I just had to go in settings lol for some reason it doesnt show up on default

shell tendon Apr 17, 2024, 4:42 AM

#

restart it then

thorny echo Apr 17, 2024, 4:42 AM

#

yep It shows up now 👍

sage reef Apr 17, 2024, 4:42 AM

#

honest mica Hi lodis. Is it though? I think you got confused with concept and style. Anyway,...

ah cool, thx for the info, and yea send the params if you can ❤️

wide tendon Apr 17, 2024, 5:01 AM

#

do you think in the future we won't hire clothing models?

small cloak Apr 17, 2024, 5:03 AM

#

pearl ocean Apr 17, 2024, 5:11 AM

#

wide tendon do you think in the future we won't hire clothing models?

thomas

sage reef Apr 17, 2024, 5:18 AM

#

huh

fervent thunder Apr 17, 2024, 6:00 AM

#

What is a good way to turn 3d to anime

#

The opposite is usually much easier

warm bane Apr 17, 2024, 6:28 AM

#

Are there people that are considered to be experts in use of AI now? Not only image related but also other stuff? I have an interesting idea and was wondering if you guys could point me to 'experts' 😄

#

Like basically all the creative stuff like image generation, LoRAs, music, animation (Most important is image generation but you get what I mean)

nova zodiac Apr 17, 2024, 6:39 AM

#

I mean theres levels of experts - there are those that are aware of the wide variety of tools, there are those that know how to put those into a workflow to combine em, and then theres those that know how to build em

warm bane Apr 17, 2024, 6:57 AM

#

nova zodiac I mean theres levels of experts - there are those that are aware of the wide var...

Sure, I'm looking for people that are good at setting up the technical side of it. I feel like I got a good creative mind and a good feel for what works and want to get really good at it but the setting up process just kills me sometimes. Like I'd spend so much time on setting things up but it feels like it's almost endless

nova zodiac Apr 17, 2024, 6:57 AM

#

warm bane Sure, I'm looking for people that are good at setting up the technical side of i...

So a tech support expert - youll find those over in #🤝｜tech-support

warm bane Apr 17, 2024, 6:57 AM

#

most of the stuff doesn't even have proper explanation and/or even if you get to that point, there are always minor issues and it's so hard to understand why

unkempt hatch Apr 17, 2024, 7:06 AM

#

which model would you use for inpainting faces?

shell tendon Apr 17, 2024, 7:06 AM

#

yeah the documentation situation as a whole is horrendous

#

most of the time i've spent learning this shit was wasted combing through bad or nonexistent or wrong documentation

unkempt hatch Apr 17, 2024, 7:08 AM

#

e.g. I have a class picture of my students. One parent has requested their son's face be removed (e.g. a celebrity kid and they don't want that shared). I could inpaint and change his ID without wrecking the picture

warm bane Apr 17, 2024, 7:08 AM

#

shell tendon most of the time i've spent learning this shit was wasted combing through bad or...

I feel like this is super underrated in this field and would make someone bank if they manage to actually simplify things

warm bane Apr 17, 2024, 7:11 AM

#

shell tendon most of the time i've spent learning this shit was wasted combing through bad or...

hahahahah yeah I'd follow something super closely, literally do things 1:1 but then its missing important information or is straight up inconsistent or doesn't work. Then I'll try someone else but I already used stuff from someone else so then it just becomes a clown fiesta

#

I have so many random basic questions all the time, I don't even know where to start 😄

#

But then if I ask something simple and/or have an issue that's clearly not even my fault, a lot of people I talk to would be condescending

#

and I also really want to set things up but dont wanna bother someone 24/7 so idk

#

very frustrating dynamic

proper locust Apr 17, 2024, 7:13 AM

#

Baby Iron Man

loud solar Apr 17, 2024, 7:33 AM

#

YaY! My 5k ASUS coupon arrived 🙂

pearl ocean Apr 17, 2024, 7:36 AM

#

is EpicrealismXL the most realistic model??

#

catlurk

nova zodiac Apr 17, 2024, 7:41 AM

#

pearl ocean is EpicrealismXL the most realistic model??

ICBINP XL

#

I might be biased though

pearl ocean Apr 17, 2024, 7:41 AM

#

nova zodiac ICBINP XL

😮

nova zodiac Apr 17, 2024, 7:42 AM

#

but use that, dpm++ 2m karras, 3cfg, 832x1216, 30 steps, no hires fix, and pag scale of 0.6, adaptive scale 0.6

stone walrus Apr 17, 2024, 8:03 AM

#

This used to be easy, but i guess it changed. What syntax am i supposed to use to generate an image now? I tried # stable-cascade and it doesn't respond

nova zodiac Apr 17, 2024, 8:25 AM

#

stone walrus This used to be easy, but i guess it changed. What syntax am i supposed to use t...

none - #1047610792226340935

stone walrus Apr 17, 2024, 8:26 AM

#

why do i see other images that look recently generated?

nova zodiac Apr 17, 2024, 8:33 AM

#

because they would have been made with a different generator and copied in

gray vapor Apr 17, 2024, 9:01 AM

#

I'm working on a workflow for AI architectural renderings. So far my results are very promising and already very useful in my practice for design choices. My first goal was to have control over the materials in specific elements in the view, also avoiding prompt bleeding, and I have achieved it with the use of ipadapter+attention masking plus regional prompting/regional sampler over color masked exports.

My next goal is harder and has to do with being able to generate multiple renderings from different points of view while keeping consistency in the materials. Using the same references in ipadapter helps but the material is not exactly the same, details appear at different places etc. Geometry consistency is obvs achieved with controlnets.

#

For this I have considered those strategies:

1. Using the og color masks to cut the parts in the first generated image and using those cutouts as references for ipadapter in the next views, with the plus model and a high strength. The problem is that, for small elements, this cutouts can be quite small or far from square ratio. This could be solved with an upscaling but seems too inefficient. Also this wouldn't help with the position of specific details, only with the general material color and texture.
1. Given that I have the og geometry. Somehow transform the parts of the first image and place them in the correct spatial location in the next view, use this as a latent in a img2img setting (but it would have a lot of "non filled parts".
1. my fav: Considering AnimateDiff, This already tries to solve a problem of temporal consistency. I could export a video orbiting around the space, generate a full video maybe at low steps or low res (for efficiency) and then only choose the frames that are interesting to me to continue denoising and upscaling. I like this idea but also seems inefficient. I wonder if there is a way to "hack" the motion information from the module to use it directly without generating every frame in the middle. Also, having access to the geometry, maybe I could export the accurate motion vectors directly without relying on preprocessing.

I'm relatively new to SD and therefore I'm sure there are a lot of other ways to tackle this problem of point of view consistency. I'm really looking forward to hearing about your ideas.

#

thx in advance 🙂

sage reef Apr 17, 2024, 9:03 AM

#

stone walrus why do i see other images that look recently generated?

idk if it will help you, but this still works: https://huggingface.co/spaces/multimodalart/stable-cascade

pearl ocean Apr 17, 2024, 9:08 AM

#

Imagine using an A.I pin

#

catlurk

hallow nexus Apr 17, 2024, 10:30 AM

#

pearl ocean is EpicrealismXL the most realistic model??

From my experience at the moment they are these:
(1) REALVIS-XL V.4
(2) NEWREALITY-XL_ALLINONE V.2.1
(3) LEOSAMSHELLOWORLD_XL V.5

currently these 3 models are excellent for very photorealistic images

pearl ocean Apr 17, 2024, 10:34 AM

#

hallow nexus From my experience at the moment they are these: (1) REALVIS-XL V.4 (2) NEWREALI...

😮

pearl ocean Apr 17, 2024, 10:39 AM

#

hallow nexus From my experience at the moment they are these: (1) REALVIS-XL V.4 (2) NEWREALI...

I got real vis, I should try those others out

hallow nexus Apr 17, 2024, 10:39 AM

#

not just for portraits, they are excellent because they have many varieties of faces and postures. Obviously it depends a lot on the prompt and the samplers you use (I recommend DPM++ 2M SDE Karras, or DPM++ 3M SDE Exponential)

pearl ocean Apr 17, 2024, 10:42 AM

#

lol, I got@like 6 models already end dowmldoed for some reason XD

thorn quarry Apr 17, 2024, 10:43 AM

#

HI

#

BABY

#

what are you doing

#

sb

fair pewter Apr 17, 2024, 10:44 AM

#

im honestly mad

#

interesting

thorn quarry Apr 17, 2024, 10:46 AM

#

amazing

#

no you are not a honestly mad

wet dawn Apr 17, 2024, 11:02 AM

#

hi

ionic crown Apr 17, 2024, 11:20 AM

#

Are these bots not gonna work no more or what

#

Somebody clarify pls

loud solar Apr 17, 2024, 11:24 AM

#

ionic crown Are these bots not gonna work no more or what

Gone for now ...

trail lion Apr 17, 2024, 11:35 AM

#

hallow nexus From my experience at the moment they are these: (1) REALVIS-XL V.4 (2) NEWREALI...

I like these: fullyRealXL, cinematicredmond, juggernautXL

#

forget the bots, no way they're coming back

bleak matrix Apr 17, 2024, 11:41 AM

#

Good morning, everyone! How are we all today?

onyx hedge Apr 17, 2024, 12:50 PM

#

原神

worn aspen Apr 17, 2024, 12:58 PM

#

heavy lark it's because cascade is dead. it's been non-stop discussion for the few months s...

Well thank you for that information! I'll stop with the cascade and just use xl, so as not to waste time on an abandoned resource.

static cape Apr 17, 2024, 1:17 PM

#

While we are waiting for SD3... is there any way to use Pixart-Sigma with ComfyUI, Swarm or Forge?

heavy lark Apr 17, 2024, 1:22 PM

#

static cape While we are waiting for SD3... is there any way to use Pixart-Sigma with ComfyU...

https://github.com/city96/ComfyUI_ExtraModels yeah this page will get you going with comfy. i don't know about forge with it.

pine fiber Apr 17, 2024, 1:24 PM

#

static cape While we are waiting for SD3... is there any way to use Pixart-Sigma with ComfyU...

its not even that good is it

static cape Apr 17, 2024, 1:24 PM

#

pine fiber its not even that good is it

IDK - I just want to see for myself.

pine fiber Apr 17, 2024, 1:24 PM

#

use demo https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma

static cape Apr 17, 2024, 1:24 PM

#

300 token limit and color / prompt adherence sounds good.

pine fiber Apr 17, 2024, 1:26 PM

#

It’s pretty bad from what I tested

#

but the model was pretty small iirc

vernal saffron Apr 17, 2024, 1:30 PM

#

Hope the reason SD3 is taking time isn’t because they’re retraining without any copyrighted data.

trim magnet Apr 17, 2024, 1:31 PM

#

comedian eh

pine fiber Apr 17, 2024, 1:31 PM

#

why do u assume they trained with copyrighted data in the first place

vernal saffron Apr 17, 2024, 1:37 PM

#

I have to be able to generate pictures of Mario taking a shit 🚽

#

That’s my benchmark

karmic cedar Apr 17, 2024, 1:40 PM

#

pine fiber It’s pretty bad from what I tested

it’s behind, development-wise. but it’s not a Stability model, so already it has that going for it. lol

pine fiber Apr 17, 2024, 1:40 PM

#

thats a bad thing why?

karmic cedar Apr 17, 2024, 1:41 PM

#

it’s not a bad thing

#

we already know SD3 is the end of T2I for Stability devs

pine fiber Apr 17, 2024, 1:42 PM

#

you think they will stop after that? why

karmic cedar Apr 17, 2024, 1:42 PM

#

Stability has declared SD3 to be their last T2I model.

pine fiber Apr 17, 2024, 1:42 PM

#

I didnt know that, strange

#

wonder why then

karmic cedar Apr 17, 2024, 1:42 PM

#

it has a lot to do with Emad leaving, but it could also be some other factors involved as well.

pine fiber Apr 17, 2024, 1:43 PM

#

yeah well I wouldnt know enough about that

karmic cedar Apr 17, 2024, 1:43 PM

#

lots of politics involved there

pine fiber Apr 17, 2024, 1:43 PM

#

I dont know if people will adopt sd3 anyway

#

the inference times seem really bad

#

maybe with turbo

karmic cedar Apr 17, 2024, 1:43 PM

#

I’m certain they will, which is part of the complexity of the issue for sure.

pine fiber Apr 17, 2024, 1:43 PM

#

but it probably costs a ton more to train

karmic cedar Apr 17, 2024, 1:43 PM

#

cloud GPUs will step in on the timing part.

#

those will begin to take on more commercial representation as more investments are made

#

but prices will rise

#

the same logic Apple uses to deduce that consumers are okay with having a monthly plan for their smartphone as opposed to owning it outright is going to be the same logic used for cloud GPU, etc.

pine fiber Apr 17, 2024, 1:45 PM

#

personally I dont think diffusion transformers are very good right now.. I dont see the same kind of generalisation LLMs have in these DiT models

karmic cedar Apr 17, 2024, 1:46 PM

#

it’s all about attention guidance now

#

and structure

worn aspen Apr 17, 2024, 1:46 PM

#

Is forge better than comfy regarding memory management and avoiding out of memory errors? I like comfy and have no desire to go back to an A1111 style UI, unless memory management improvements are significant.

karmic cedar Apr 17, 2024, 1:47 PM

#

i think forge is memory king atm iirc

pine fiber Apr 17, 2024, 1:48 PM

#

karmic cedar it’s all about attention guidance now

people believe diffusers can go that much further just with better attention?

karmic cedar Apr 17, 2024, 1:48 PM

#

that’s also why i say structure

#

with more controlnets for different types of data

#

etc.

#

or rather, not data but syntactic detail

worn aspen Apr 17, 2024, 1:49 PM

#

Am I stubborn for liking the comfy interface above all other considerations?

karmic cedar Apr 17, 2024, 1:49 PM

#

nah not at all 😛

#

it’s a cool interface.

#

it represents the actual workflows that these models all use and that’s like operating a steam engine almost 😛

pine fiber Apr 17, 2024, 1:50 PM

#

I like it but it gets spaggheti

karmic cedar Apr 17, 2024, 1:50 PM

#

^

worn aspen Apr 17, 2024, 1:50 PM

#

I have no need to generate images, but it's fun, and comfy keeps me interested. I suppose I do need to try forge though.

pine fiber Apr 17, 2024, 1:50 PM

#

karmic cedar or rather, not data but syntactic detail

do we even have models that can identify features that are small/obscure?

karmic cedar Apr 17, 2024, 1:51 PM

#

not really—not to my standards at least!

pine fiber Apr 17, 2024, 1:51 PM

#

I cant think of a good example

karmic cedar Apr 17, 2024, 1:51 PM

#

they’re just not that granular…yet.

#

but they can be with enough code. 😄

pine fiber Apr 17, 2024, 1:51 PM

#

just like, does this piece of clothing make sense, should this thing look like this, etc

karmic cedar Apr 17, 2024, 1:51 PM

#

text encoding precision is also a key factor…obviously

#

text encoding is sort of like how the number of pixels in a raster window are defined

#

in my simplistic view lol

pine fiber Apr 17, 2024, 1:52 PM

#

its counter intuitive to me because I assume smaller prompts would work better for some reason

karmic cedar Apr 17, 2024, 1:52 PM

#

consider how much better SD 1.5 images tend to look when they use ELLA

pine fiber Apr 17, 2024, 1:52 PM

#

less stuff for the model to get wrong

karmic cedar Apr 17, 2024, 1:53 PM

#

which is just really nice text encoding for the most part

#

right right

pine fiber Apr 17, 2024, 1:53 PM

#

thats true

#

I just thought DiTs would be able to recognise its own mistakes more often but that doesnt seem like the case

karmic cedar Apr 17, 2024, 1:54 PM

#

perhaps there’s more potential for them to down the line, but the code doesn’t seem to support that function as much as it’s theorized at the moment.

pine fiber Apr 17, 2024, 1:54 PM

#

yeah

karmic cedar Apr 17, 2024, 1:56 PM

#

we have an instinctive tendency to approach models holistically, which is good, but we’ve managed to make older stuff shine more just by building in new functionality to preexisting architectures. this is going to continue to be a powerful thing since the sky’s the limit with creativity and AI.

#

and how it gets extended. it’s like digital putty.

#

IMO

pine fiber Apr 17, 2024, 1:57 PM

#

I agree

#

thats why I was on the fence about sd3 being "good enough" when 1.5 and sdxl are still getting better every day

karmic cedar Apr 17, 2024, 1:58 PM

#

i’m just getting caffeinated this morning so i’m already on my AI soapbox

#

I think it’ll be a really nice, polished sports car of a model. But we’ve got Honda Civics already that have plenty of potential for mileage. That’s how I see it. 😛

#

like Sora—that’s a lamborghini for sure.

pine fiber Apr 17, 2024, 1:59 PM

#

we get a sports car when we need an off roader

karmic cedar Apr 17, 2024, 1:59 PM

#

lol

#

oh….we don’t get the sports cars

#

hollywood gets those lol

#

j/k

#

not j/k

#

what’s really going to be interesting is when smartphones and other devices start to carry localized LLMs. for example the current iphone can run Mistral 7B

#

and others

#

When those types of models begin driving other functions of the device, that’ll be a game changer. Even local image diffusion will be a thing, the likes of Apple could even have their own proprietary diffusion algorithms baked into a future release

reef wing Apr 17, 2024, 2:05 PM

#

Sd3 api released

karmic cedar Apr 17, 2024, 2:06 PM

#

😮

reef wing Apr 17, 2024, 2:06 PM

#

Open weights with stability membership soon according to twitter

karmic cedar Apr 17, 2024, 2:07 PM

#

the 26th is seeming more realistic now

#

or that range of dates

shell tendon Apr 17, 2024, 2:12 PM

#

reef wing Sd3 api released

badass

karmic cedar Apr 17, 2024, 2:13 PM

#

oooo someone’s got an SVD multiview project going https://github.com/king159/svd-mv

charred mesa Apr 17, 2024, 2:13 PM

#

reef wing Sd3 api released

tweet or something? how

reef wing Apr 17, 2024, 2:13 PM

#

charred mesa tweet or something? how

@stabilityai

#

On X

charred mesa Apr 17, 2024, 2:14 PM

#

whoops

#

https://stability.ai/news/stable-diffusion-3-api

mortal delta Apr 17, 2024, 2:26 PM

#

charred mesa https://stability.ai/news/stable-diffusion-3-api

By chance does that cost money? and is that the only method to get stable diffusion 3?

charred mesa Apr 17, 2024, 2:26 PM

#

no

#

we'll get people invited to Stable Assitant where you can use SD3

#

and also we'll get the models themselves in the future

#

(model files + code)

mortal delta Apr 17, 2024, 2:27 PM

#

So now we just play the waiting game?

charred mesa Apr 17, 2024, 2:27 PM

#

like we have been all this time tbh

#

lol

#

but this means we're finally closer

#

they've been promising API "soon" for 3 weeks

#

thomas

mortal delta Apr 17, 2024, 2:28 PM

#

charred mesa but this means we're finally closer

I guess its better than nothing.

charred mesa Apr 17, 2024, 2:28 PM

#

yup

#

but all that matters is that we WILL get the models that we can use offline and etc

#

even if its like 2-3 weeks away

mortal delta Apr 17, 2024, 2:28 PM

#

I cant stand being impatient, it feels like ive been waiting for ever.

sudden ruin Apr 17, 2024, 2:29 PM

#

I wonder why people are so impatient, yes a few weeks is a lot of time in the AI world, but sometimes it feels like the only reason to live for some people is to complain about SD3 not being released already

charred mesa Apr 17, 2024, 2:30 PM

#

well yeah

#

people on the internet LITERALLY have nothing else to do

sudden ruin Apr 17, 2024, 2:30 PM

#

Poor souls

#

touchinggrass

mortal delta Apr 17, 2024, 2:30 PM

#

charred mesa people on the internet LITERALLY have nothing else to do

i can confirm this is true we also are depressed.

charred mesa Apr 17, 2024, 2:31 PM

#

see

#

now I do admit that announcing it so early was a stupid mistake

#

its been almost 2 months since they announced it

#

like thats the worst possible way to hype something up

rugged mirage Apr 17, 2024, 2:33 PM

#

I hope they create a #sd3 channel soon

charred mesa Apr 17, 2024, 2:33 PM

#

^

#

exactly

#

it would make sense

frigid wolf Apr 17, 2024, 2:33 PM

#

And I reading that announcement right, sounds like paid membership will be required for SD3 model weights even for personal/noncommercial usage?

charred mesa Apr 17, 2024, 2:33 PM

#

what

#

no

#

for non-commercial no

#

only $20 for commercial usage and that's it

rugged mirage Apr 17, 2024, 2:34 PM

#

didnt membership used to be <1m users is free, $20 is more? Now it seems to be 0 users for free, and <1m for $20? https://stability.ai/membership

frigid wolf Apr 17, 2024, 2:34 PM

#

Okay good, it's not overly clear in the announcement

mortal delta Apr 17, 2024, 2:34 PM

#

charred mesa no

well that stinks.

rugged mirage Apr 17, 2024, 2:34 PM

#

so you have to pay for membership even if you have 10 viewers and get 2 cents from youtube ads?

honest mica Apr 17, 2024, 2:34 PM

#

We aim to make the model weights available for self-hosting with a Stability AI Membership in the near future. kinda sus

charred mesa Apr 17, 2024, 2:35 PM

#

yeah they wrote it in a weird way lol

#

but model weights will come

mortal delta Apr 17, 2024, 2:35 PM

#

will they be open sourced by chance?

charred mesa Apr 17, 2024, 2:35 PM

#

the code yes, the models will have that license where its noncommercial

rugged mirage Apr 17, 2024, 2:35 PM

#

charred mesa the code yes, the models will have that license where its noncommercial

https://stability.ai/membership

charred mesa Apr 17, 2024, 2:36 PM

#

but everything offline available

mortal delta Apr 17, 2024, 2:36 PM

#

charred mesa the code yes, the models will have that license where its noncommercial

dang i was hoping to make a comic or something with the new models...

charred mesa Apr 17, 2024, 2:37 PM

#

if you dont make revenue, like its just for free you're fine

#

I mean I would not have the heart to make these images for money

#

I do these for fun

steel dome Apr 17, 2024, 2:38 PM

#

charred mesa I mean I would not have the heart to make these images for money

Why not?

rugged mirage Apr 17, 2024, 2:38 PM

#

Id like to make some videos where I can get at least some ad revenue if anything goes viral, but kind of a nonstarter if you have to pay every month even at first when you are making 3 cents

hasty badge Apr 17, 2024, 2:39 PM

#

from my tests, sd3 is laughably bad 😢 could they have some pre pre alpha in the api... that's hard to belief as well

charred mesa Apr 17, 2024, 2:39 PM

#

steel dome Why not?

well unlike these ai comic artists, I barely put in effort, and even with all the opted out artists there are a bunch of artists or studios that had their style left in so that would also make me feel guilty

hasty badge Apr 17, 2024, 2:40 PM

#

it can't even get text without errors while that was supposed the big thing

charred mesa Apr 17, 2024, 2:40 PM

#

if you put in a lot of effort and draw over it a lot and stuff then sure, I get it, you'd want some revenue for it cause you actually put in effort

frigid wolf Apr 17, 2024, 2:40 PM

#

@charred mesa same, I love seeing these models as open and free as possible as it fuels research and innovation. Like supir, omg that is incredible, especially for restoring old photos.

mortal delta Apr 17, 2024, 2:40 PM

#

charred mesa I mean I would not have the heart to make these images for money

Im not evens sure if i could even make money with stable diffusion.

polar roost Apr 17, 2024, 2:41 PM

#

honest mica `We aim to make the model weights available for self-hosting with a Stability AI...

I hope it's worded that way to advertise the memberships, but it's confusing for people waiting for the open non-commercial weights

frigid wolf Apr 17, 2024, 2:41 PM

#

There's just so much cool stuff that wouldn't exist if SD wasn't open

graceful spade Apr 17, 2024, 2:42 PM

#

Hi guys

#

Is there any steps can you suggest to change expressions in video using stable diffusion, (not faceswap to different person) just like my video is has a bored face and turn it to singing?

graceful spade Apr 17, 2024, 2:43 PM

#

graceful spade Is there any steps can you suggest to change expressions in video using stable d...

can anyone tell?

rugged mirage Apr 17, 2024, 2:43 PM

#

inpaint a different expression

graceful spade Apr 17, 2024, 2:44 PM

#

rugged mirage inpaint a different expression

in video?

#

can you please tell me more how to do it, if you can please

rugged mirage Apr 17, 2024, 2:44 PM

#

idk what you use for your video, but most of the approaches have an initial image you can provide

karmic cedar Apr 17, 2024, 2:44 PM

#

temporal video editing hasn’t really become a thing…yet

#

it’s getting there

#

but not quite

graceful spade Apr 17, 2024, 2:45 PM

#

can't we do it now? just try?

#

😐

mortal delta Apr 17, 2024, 2:47 PM

#

So just to make sure sd3 is free thru api at the moment but cant be used to profit off of? is that right?

karmic cedar Apr 17, 2024, 2:47 PM

#

it’s being made available via the developer API, yea

mortal delta Apr 17, 2024, 2:48 PM

#

karmic cedar it’s being made available via the developer API, yea

Ok i just wanted to make sure, and said api is free or is there a limit/paywall?

karmic cedar Apr 17, 2024, 2:49 PM

#

no clue

#

i think you need to have a membership at the very least

mortal delta Apr 17, 2024, 2:49 PM

#

Interesting...

karmic cedar Apr 17, 2024, 2:50 PM

#

they must be having some interesting internal conversations

mortal delta Apr 17, 2024, 2:52 PM

#

karmic cedar they must be having some interesting internal conversations

possibly?

steel dome Apr 17, 2024, 2:55 PM

#

So you will also need a membership to use the models on your machine? (comfyui/invoke/a1111)

rugged mirage Apr 17, 2024, 2:56 PM

#

there is a cost for sd3 in the api docs at least

#

so wouldnt think it's free

real zodiac Apr 17, 2024, 2:58 PM

#

steel dome So you will also need a membership to use the models on your machine? (comfyui/i...

the tweet makes it sound like that but I'd love for it to be clarified before it turns into a shitstorm

frigid wolf Apr 17, 2024, 2:58 PM

#

World definately be nice for stability to clarify this.

pine fiber Apr 17, 2024, 3:02 PM

#

steel dome So you will also need a membership to use the models on your machine? (comfyui/i...

membership is free lol

#

the only time you need a paid membership is for commercial usage and its $20 a month

charred mesa Apr 17, 2024, 3:04 PM

#

^

pine fiber Apr 17, 2024, 3:07 PM

#

dark your pfp looks like netero

steel dome Apr 17, 2024, 3:10 PM

#

pine fiber the only time you need a paid membership is for commercial usage and its $20 a m...

So not really free

#

Lol, i guess

pine fiber Apr 17, 2024, 3:10 PM

#

well if you thought you could use a million dollar model commercially for free thats basically stealing

#

I think $20 a month is a good compromise

charred mesa Apr 17, 2024, 3:11 PM

#

pine fiber dark your pfp looks like netero

what

#

NETERO??????

pine fiber Apr 17, 2024, 3:11 PM

#

yes

steel dome Apr 17, 2024, 3:12 PM

#

pine fiber I think $20 a month is a good compromise

It is if you do more than that in "revenue", not if you maybe you make a couple bucks once in a while. Anyway, I get the point

#

I still think there should be like a threshold that makes it "commercial use" kind of like Unity3d does

#

If you do less than X money, it's free

rugged mirage Apr 17, 2024, 3:16 PM

#

yeah I wish the $20 a month was at least only for if you do make more than $20 (or a bit more even)

#

though I guess they are unlikely to come at you if you are making $10 but still

mortal delta Apr 17, 2024, 3:17 PM

#

rugged mirage though I guess they are unlikely to come at you if you are making $10 but still

are you sure, also 20$ isint much.

trim magnet Apr 17, 2024, 3:17 PM

#

mortal delta are you sure, also 20$ isint much.

ok gimme 20$

steel dome Apr 17, 2024, 3:18 PM

#

mortal delta are you sure, also 20$ isint much.

Subscriptions tend to pile up...

rugged mirage Apr 17, 2024, 3:18 PM

#

eh, if you are just someone trying to make tiktoks or youtube videos, and you pay for a year trying to make it $240 while you make $0.34 back, on top of all the other stuff you use it's not great

mortal delta Apr 17, 2024, 3:18 PM

#

steel dome Subscriptions tend to pile up...

i cant tell if thats bad or good

rugged mirage Apr 17, 2024, 3:19 PM

#

especially since you also need a bunch more hardware, electricity etc. to generate compared to paying slightly more for sora or whatever

sudden ruin Apr 17, 2024, 3:19 PM

#

I remember times when you could buy stuff and simply own it

mortal delta Apr 17, 2024, 3:19 PM

#

rugged mirage eh, if you are just someone trying to make tiktoks or youtube videos, and you pa...

gpt costs 20 bucks a mouth and you get more features and isint such ai cheaper?

steel dome Apr 17, 2024, 3:20 PM

#

mortal delta i cant tell if thats bad or good

There's this, MJ, openai... But ok, not really their problem

timid bloom Apr 17, 2024, 3:20 PM

#

x20 cost of SDXL?

#

NICE.

#

xD

rugged mirage Apr 17, 2024, 3:20 PM

#

gpt costs a lot more to run, like you can never run it at home at that speed, so being same price while gpt4 has no hardware you need to purchase and runs on their hardware is a good comparison of how much better of a deal gpt4 is

mortal delta Apr 17, 2024, 3:20 PM

#

timid bloom x20 cost of SDXL?

I wish i could run sdxl but my hardware stinks.

timid bloom Apr 17, 2024, 3:20 PM

#

try with fp8

#

working nice on A1111 with 8gb 3070

trim nymph Apr 17, 2024, 3:21 PM

#

what are the sd 3 costs via api? can someone let me know real quick

mortal delta Apr 17, 2024, 3:21 PM

#

timid bloom try with fp8

where can i find this at? if i might ask? or like how does it work?

timid bloom Apr 17, 2024, 3:22 PM

#

A1111 now support fp8 mode

mortal delta Apr 17, 2024, 3:22 PM

#

timid bloom A1111 now support fp8 mode

what if i use forge?

timid bloom Apr 17, 2024, 3:22 PM

#

so if you download new version it should be right there in options under optimization

#

no idea

#

never used

mortal delta Apr 17, 2024, 3:22 PM

#

I guess ill research it then.

rugged mirage Apr 17, 2024, 3:23 PM

#

forge generally has lower requirements than a111 so definitely try it over a111 if that's your issue

timid bloom Apr 17, 2024, 3:23 PM

#

trim nymph what are the sd 3 costs via api? can someone let me know real quick

4 cents per image

mortal delta Apr 17, 2024, 3:24 PM

#

rugged mirage forge generally has lower requirements than a111 so definitely try it over a111 ...

I use forge im just not sure how i can run sdxl on low end hardware. ive been using sd 1.5....

rugged mirage Apr 17, 2024, 3:25 PM

#

you probably cant but 500 means their server is crashing from the request, so either it's a problem on their end or you are sending broken data - ilformatted or something

steel dome Apr 17, 2024, 3:26 PM

#

rugged mirage you probably cant but 500 means their server is crashing from the request, so ei...

Wouldn't that be a 4XX error though? (like 400 BAD REQUEST)

pine fiber Apr 17, 2024, 3:30 PM

#

timid bloom x20 cost of SDXL?

who said

#

oh

#

20x is crazy

#

scaling laws my ass

#

sounds like issue on their end

timid bloom Apr 17, 2024, 3:32 PM

#

yeah

#

its on dalle 3 price level

pine fiber Apr 17, 2024, 3:32 PM

#

it looks bad tbh

#

the anatomy is really messed up for some reason?

timid bloom Apr 17, 2024, 3:33 PM

#

no idea, didnt test it out

rugged mirage Apr 17, 2024, 3:34 PM

#

so exactly what I guessed and they've even told you what it is

#

youve put 'your account' instead of a token, presumably coppying the documentation directly instead of actually registering, getting a token and putting it where it tells you

balmy rune Apr 17, 2024, 3:34 PM

#

When can we expect the weights?

pine fiber Apr 17, 2024, 3:34 PM

#

rugged mirage youve put 'your account' instead of a token, presumably coppying the documentati...

oh lol

timid bloom Apr 17, 2024, 3:35 PM

#

soon TM

#

xD

pine fiber Apr 17, 2024, 3:35 PM

#

balmy rune When can we expect the weights?

my guess is by the end of the month

balmy rune Apr 17, 2024, 3:35 PM

#

I mean the model is essentially not released for as long as it's on the API.

rugged mirage Apr 17, 2024, 3:35 PM

#

my guess is next month

#

are you sure you didnt close/unclose some bracket or quote in your request

hasty badge Apr 17, 2024, 3:36 PM

#

rugged mirage youve put 'your account' instead of a token, presumably coppying the documentati...

nope, it really is an error on their end... that mess is the response

steep arrow Apr 17, 2024, 3:36 PM

#

I think we should focus on SDXL Ella implementation and getting text gen working in SDXL.

If SD3 is going to be this heavy, Stability is not considering local usecase

rugged mirage Apr 17, 2024, 3:36 PM

#

hm fair enough then

pine fiber Apr 17, 2024, 3:37 PM

#

steep arrow I think we should focus on SDXL Ella implementation and getting text gen working...

the implementation is there someone just needs to train it

hasty badge Apr 17, 2024, 3:37 PM

#

500 generally means internal server error

rugged mirage Apr 17, 2024, 3:37 PM

#

steep arrow I think we should focus on SDXL Ella implementation and getting text gen working...

there's multiple versions of sd3, some smaller than sdxl

steep arrow Apr 17, 2024, 3:37 PM

#

pine fiber the implementation is there someone just needs to train it

A few are already working on it.

I have SD1.5 Ella > IpAdpater SDXL running and it is great.

rugged mirage Apr 17, 2024, 3:37 PM

#

most servers arent that well configured, and can return a 500 in a ton of cases

woven panther Apr 17, 2024, 3:38 PM

#

not to take part in the current discussion, but as I wanted to try SD3 so made a quick node to use the API, you get 25 free credits anyway to try for the curious: https://github.com/kijai/ComfyUI-KJNodes/commit/22cf8d89968a47ce26be919f750f2311159145d1

pine fiber Apr 17, 2024, 3:38 PM

#

steep arrow A few are already working on it. I have SD1.5 Ella > IpAdpater SDXL running and...

good to know, what news on lavi bridge?

woven panther Apr 17, 2024, 3:38 PM

#

pine fiber good to know, what news on lavi bridge?

no news from what I've seen

#

I fixed the comfy native ELLA node btw

#

it never worked properly before today 😛

steep arrow Apr 17, 2024, 3:39 PM

#

pine fiber good to know, what news on lavi bridge?

I read the arvix paper but had not seen the github. I am really interested in checking this out, especially as the Llava models get a boost from WizardLM and Llama 3

steep arrow Apr 17, 2024, 3:40 PM

#

woven panther I fixed the comfy native ELLA node btw

I think I resorted to another deployment, I will wrap back around and see if I can get yours working.

pine fiber Apr 17, 2024, 3:40 PM

#

yeah because the ella models are trained on t5 arent they? bigger is always better imagine using llama 3 would be great

woven panther Apr 17, 2024, 3:40 PM

#

I have made a wrapper node for LaVi bridge as well, it's far worse than ELLA and there's no SDXL model for it either :/

woven panther Apr 17, 2024, 3:41 PM

#

steep arrow I think I resorted to another deployment, I will wrap back around and see if I c...

I made a PR for the ComfyUI_ELLA about it: https://github.com/ExponentialML/ComfyUI_ELLA/pull/25

steep arrow Apr 17, 2024, 3:42 PM

#

Ella 1.5 > Composition Adapter > SDXL
And then additional images for style control is really ace.

Unbelievably controlable on SDXL, and the CLIP seems to work better when composition weighting is involved.

pine fiber Apr 17, 2024, 3:42 PM

#

cool very nice

#

I would try it if I wasnt running amd

carmine herald Apr 17, 2024, 4:04 PM

#

I stopped messing with Stable diffusion for a few months and came back and now everyone's talking about ponies, what's going on

sudden ruin Apr 17, 2024, 4:05 PM

#

Pony is the go to anime Checkpoint right now if im not mistaken

carmine herald Apr 17, 2024, 4:05 PM

#

What makes it so much better?

sudden ruin Apr 17, 2024, 4:06 PM

#

No clue #🍥｜anime probably has more answers

carmine herald Apr 17, 2024, 4:07 PM

#

a terrifying location

sudden ruin Apr 17, 2024, 4:12 PM

#

Only nice people in there dogsmile

drifting knot Apr 17, 2024, 4:15 PM

#

Hi everyone, I've been wondering about is it possible to generate exact some person on different pictures, like some Tom(which is not real or celebrity). For example I want to create picture where Tom is cooking or walking dog.
How to make it? I need to describe Tom in prompt or use some special seed?

trail lion Apr 17, 2024, 4:18 PM

#

There's a guy in the images room showing off the preview, fwiw

pine fiber Apr 17, 2024, 4:18 PM

#

sudden ruin Pony is the go to anime Checkpoint right now if im not mistaken

is it better than nai v3 yet?

sudden ruin Apr 17, 2024, 4:19 PM

#

Im not the right Person to ask

pine fiber Apr 17, 2024, 4:19 PM

#

right XD

ornate flame Apr 17, 2024, 4:24 PM

#

I don't like the wording of the announcement

trail lion Apr 17, 2024, 4:25 PM

#

drifting knot Hi everyone, I've been wondering about is it possible to generate exact some per...

A few ways, you can create a pose collage, split it and use that to train a lora, there are also a variety of faces swap tools, such as reactor and I controlnet ip-adapter, instant id

#

It will take effort, but if can be done

drifting knot Apr 17, 2024, 4:26 PM

#

Is it possible to get it from default model, without training lora?

#

like any model from civitai

trail lion Apr 17, 2024, 4:27 PM

#

Yeah, with the controlnet method, but Lora will be more flexible

rain aurora Apr 17, 2024, 4:29 PM

#

(Masterpiece), (Best quality), (Ultra HD), (Super detail), (Whole body :1.2), 1 girl, Chibi, cute, smile, flowers, outdoors, holding the camera, sitting on the roof looking out into the distance, with mountains in the background, amber, warm yellow, sunset, artistic sense, Quadratic style, white clothes,

trail lion Apr 17, 2024, 4:31 PM

#

Eww, 1.5 prompt

charred mesa Apr 17, 2024, 4:32 PM

#

hehehe

#

opposite of natural prompting

rugged mirage Apr 17, 2024, 4:33 PM

#

tbf that's more natural to me, because that's how I search in google or whatever, and not writing prose with a bunch of obviously useless filler words

sage reef Apr 17, 2024, 4:40 PM

#

rain aurora (Masterpiece), (Best quality), (Ultra HD), (Super detail), (Whole body :1.2), 1 ...

i never heard "quadratic style" before

charred mesa Apr 17, 2024, 4:41 PM

#

well I got to try the SD3 api

#

and its almost over

#

only 25 credits, 4 credits per SD3 Turbo image

#

and thats not a bundle of 4 images or anything

#

just ONE image

#

so I guess I'll either wait for stable assistant or weights lolll

sage reef Apr 17, 2024, 4:43 PM

#

they need to make money before making it open weights, i guess it makes sense, but yea...

cerulean kraken Apr 17, 2024, 4:43 PM

#

So.. as just a normal user that wants to try SD3, any easy way I can use this API?

karmic cedar Apr 17, 2024, 4:44 PM

#

https://www.reddit.com/r/StableDiffusion/s/lUYMRFOvcF

#

Some comfy words from the source

sage reef Apr 17, 2024, 4:44 PM

#

happemad

#

cant wait

cerulean kraken Apr 17, 2024, 4:47 PM

#

Is there like a simple website I can go try this on?

sage reef Apr 17, 2024, 4:47 PM

#

so he said few weeks from now huh... that kinda passes my estimate of April 26 it seems :3

#

so maybe May 10

charred mesa Apr 17, 2024, 4:49 PM

#

cerulean kraken Is there like a simple website I can go try this on?

api only

#

so you have to figure it out

#

you could try this https://github.com/kijai/ComfyUI-KJNodes/

cerulean kraken Apr 17, 2024, 4:53 PM

#

Nothing I can just input my API key into and go off to the races?

rugged mirage Apr 17, 2024, 4:54 PM

#

so he says today (always API first, then a few weeks later weights), a few = 2+ so likely early may, possibly mid may

sage reef Apr 17, 2024, 4:55 PM

#

my new estimate is May 10 happemad

charred mesa Apr 17, 2024, 4:55 PM

#

cerulean kraken Nothing I can just input my API key into and go off to the races?

yeah with all your massive 25 credits

#

4 credits per one SD3 Turbo image

#

and 6.5 credits per one SD3 image

cerulean kraken Apr 17, 2024, 4:56 PM

#

charred mesa yeah with all your massive 25 credits

I have 567 credits

sage reef Apr 17, 2024, 4:56 PM

#

so how does the turbo version compare quality wise to base sd3?

charred mesa Apr 17, 2024, 4:56 PM

#

cerulean kraken I have 567 credits

nice

charred mesa Apr 17, 2024, 4:56 PM

#

sage reef so how does the turbo version compare quality wise to base sd3?

its pretty nice

#

could NOT test it cause I dont feel like spending 10$

sage reef Apr 17, 2024, 4:57 PM

#

understandable

honest spear Apr 17, 2024, 5:00 PM

#

so sd3 will require a membership for commercial use?

charred mesa Apr 17, 2024, 5:00 PM

#

yes

cerulean kraken Apr 17, 2024, 5:00 PM

#

nodders technically required for non-commercial use too but it's a free membership in that case

charred mesa Apr 17, 2024, 5:00 PM

#

thats only for the api

cerulean kraken Apr 17, 2024, 5:01 PM

#

charred mesa thats only for the api

that's not what the post says 🤷 "In keeping with our commitment to open generative AI, we aim to make the model weights available for self-hosting with a Stability AI Membership in the near future."

rugged mirage Apr 17, 2024, 5:01 PM

#

as far as I know sdxl also requires membership for commercial use

charred mesa Apr 17, 2024, 5:01 PM

#

they cannot be this stupid

charred mesa Apr 17, 2024, 5:02 PM

#

rugged mirage as far as I know sdxl also requires membership for commercial use

but isnt that openrail++

#

you sure you don't mean sdxl turbo

rugged mirage Apr 17, 2024, 5:02 PM

#

charred mesa but isnt that openrail++

well, the https://stability.ai/membership page has been like this since at least sdxl, and it says core models (which includes sdxl) are free for non-commercial, $20 for commercial same as now

#

nothing has changed there

#

also when you download most of their models through huggingface (I think including sdxl but idk anymore) you agree to the same non-commerical clause

cerulean kraken Apr 17, 2024, 5:03 PM

#

charred mesa they cannot be this stupid

I think it's just so people sign up and they can get user numbers to show to potential investors, and track engagement etc.

honest spear Apr 17, 2024, 5:03 PM

#

charred mesa but isnt that openrail++

exactly, sdxl is fine, 0.9 and turbo not

charred mesa Apr 17, 2024, 5:04 PM

#

rugged mirage also when you download most of their models through huggingface (I think includi...

that's stupid cause on huggingface sdxl still has openrail and sdxl turbo has sai-nc-community

honest spear Apr 17, 2024, 5:05 PM

#

well, whatever, I'll check sd3 licence at launch, if it's bad I'll just stay on sdxl or other open rail models that come out

rugged mirage Apr 17, 2024, 5:08 PM

#

I guess Ill worry about it later, I do hope this helps them keep existing at least

#

because currently the chances of ever getting sd4 are kind of grim

cerulean kraken Apr 17, 2024, 5:09 PM

#

woven panther not to take part in the current discussion, but as I wanted to try SD3 so made a...

trying to do this.. I'm stupid and can't figure it out. I cloned into custom_nodes and installed dependencies, but I don't see a SD3 node :/

charred mesa Apr 17, 2024, 5:14 PM

#

honest spear well, whatever, I'll check sd3 licence at launch, if it's bad I'll just stay on ...

yeah its gonna be sai-nc-community

#

10 times out of 10

#

guaranteed! 👍

rugged mirage Apr 17, 2024, 5:16 PM

#

I guess at least if someone trains using the code and architecture from scratch, but not their weights they can make it fully open

woven panther Apr 17, 2024, 5:17 PM

#

cerulean kraken trying to do this.. I'm stupid and can't figure it out. I cloned into custom_nod...

the nodepack is in the Manager too which is probably easiest to install, it should work with the steps you described though

charred mesa Apr 17, 2024, 5:17 PM

#

we have pixart-sigma which is openrail++, but that's 0.6B

#

similar prompt adherence, no text capabilities at all and somewhat cooked images (ESPECIALLY FOR PHOTOS)

sterile raven Apr 17, 2024, 5:18 PM

#

Let me know when someone makes a UI for the SD3 API

wheat anchor Apr 17, 2024, 5:18 PM

#

will sd3 not allow fine-tunes to be uploaded to e.g. hf and civitai because of its license?

charred mesa Apr 17, 2024, 5:18 PM

#

naaah

cerulean kraken Apr 17, 2024, 5:18 PM

#

woven panther the nodepack is in the Manager too which is probably easiest to install, it shou...

I figured it out, beginer mistake I've done before. Installed dependencies on the system python enviroment

charred mesa Apr 17, 2024, 5:18 PM

#

it will

woven panther Apr 17, 2024, 5:18 PM

#

cerulean kraken I figured it out, beginer mistake I've done before. Installed dependencies on th...

ah yeah, classic

wheat anchor Apr 17, 2024, 5:18 PM

#

hm

charred mesa Apr 17, 2024, 5:18 PM

#

its just that the finetuned models will have to be licensed the same

wheat anchor Apr 17, 2024, 5:19 PM

#

:/

charred mesa Apr 17, 2024, 5:19 PM

#

Also

#

Textual Inversions may come back into fashion

wheat anchor Apr 17, 2024, 5:19 PM

#

at least we now know why there was drama with people leaving stability ai

charred mesa Apr 17, 2024, 5:19 PM

#

those were researchers

wheat anchor Apr 17, 2024, 5:19 PM

#

ok...?

charred mesa Apr 17, 2024, 5:19 PM

#

and Emad

#

emad makes sense

sage reef Apr 17, 2024, 5:19 PM

#

hypemad

charred mesa Apr 17, 2024, 5:20 PM

#

cause emad left because of some opennes reasons or whatever

#

I forgot the exact reasion

wheat anchor Apr 17, 2024, 5:20 PM

#

and that is exactly what i refer to?

rugged mirage Apr 17, 2024, 5:21 PM

#

I mean, they've been too open anyway, it was impossible for it to last - it was basically burning VC money to give us free shit and that can only last so long before they run out of people giving them money

wheat anchor Apr 17, 2024, 5:21 PM

#

look up enshittification

rugged mirage Apr 17, 2024, 5:22 PM

#

sort of

wheat anchor Apr 17, 2024, 5:22 PM

#

its the definition of it

rugged mirage Apr 17, 2024, 5:23 PM

#

no, the definition of it is closer to milking users to increase profits

vague pond Apr 17, 2024, 5:23 PM

#

wheat anchor will sd3 not allow fine-tunes to be uploaded to e.g. hf and civitai because of i...

pretty sure civit and HF will pay for a commercial membership so they can do inference services on SD3

rugged mirage Apr 17, 2024, 5:23 PM

#

this is literally burning VC money for the users

charred mesa Apr 17, 2024, 5:23 PM

#

drhead do you think Textual Inversion will make a comeback

#

you can train on 24GB and it may work for all 4 models

#

a dev said that SDXL Textual Inversions may work on SD3

wheat anchor Apr 17, 2024, 5:24 PM

#

hopefully meta will join the diffusers game too and scare all commercial solutions like they did with llama2 and are gonna do again with llama3 next month

charred mesa Apr 17, 2024, 5:24 PM

#

that'd be sick

vague pond Apr 17, 2024, 5:24 PM

#

charred mesa drhead do you think Textual Inversion will make a comeback

textual inversion is already decent if you know what it does and what its limitations are, though I would imagine it might be difficult to make one work on multiple text encoders, and god knows how T5 will handle it

rugged mirage Apr 17, 2024, 5:25 PM

#

llama hasnt scared commercial solutions, it's been really cool for people but hasnt scared openai or anthropic one bit

wheat anchor Apr 17, 2024, 5:25 PM

#

rugged mirage llama hasnt scared commercial solutions, it's been really cool for people but ha...

stuff based on llama did, tho

#

look up mistrals latest open model

charred mesa Apr 17, 2024, 5:25 PM

#

vague pond textual inversion is already decent if you know what it does and what its limita...

yeah T5 might interfere

#

hmm

rugged mirage Apr 17, 2024, 5:25 PM

#

wheat anchor look up mistrals latest open model

the top open source currently is from cohere, cmd r+

#

but still none of it scares them as it can never really quite catch up

#

tho yeh supririnslgy close

steep arrow Apr 17, 2024, 5:26 PM

#

wheat anchor hopefully meta will join the diffusers game too and scare all commercial solutio...

Meta has Emu, they haven't released their image model 😢

charred mesa Apr 17, 2024, 5:26 PM

#

if only there would be another openrail++ t2i besides Pixart-Sigma

sage reef Apr 17, 2024, 5:26 PM

#

pixart too cooked for me

charred mesa Apr 17, 2024, 5:26 PM

#

^

#

sadly

wheat anchor Apr 17, 2024, 5:26 PM

#

rugged mirage the top open source currently is from cohere, cmd r+

let me guess, it somehow derives from llama as well?

charred mesa Apr 17, 2024, 5:26 PM

#

but it has great potential

sage reef Apr 17, 2024, 5:26 PM

#

yea potential is there

wheat anchor Apr 17, 2024, 5:26 PM

#

if you go back the chain to its root

vague pond Apr 17, 2024, 5:27 PM

#

like, as far as I know textual inversion is one of the safer ways to do "quality" alignment. when i was trying to make soyjak faces on one of the furry models, I noticed that all of the preference alignment loras I looked at made the outputs into just glossy sparkly generic stuff, and the negative embedding for boring images made higher quality soyjaks which is what I wanted

rugged mirage Apr 17, 2024, 5:27 PM

#

wheat anchor let me guess, it somehow derives from llama as well?

I dont believe it does

wheat anchor Apr 17, 2024, 5:27 PM

#

also I didnt know Command R+ is rated higher than some gpt4 versions lol

sage reef Apr 17, 2024, 5:27 PM

#

i wanted to try Emu Edit, but it's not open weights 😦

rugged mirage Apr 17, 2024, 5:27 PM

#

yes it's the first one that got higher than early gpt4s

wheat anchor Apr 17, 2024, 5:27 PM

#

I can only run Command R @ 4bits

#

R+ is just too slow :/

steep arrow Apr 17, 2024, 5:28 PM

#

sage reef i wanted to try Emu Edit, but it's not open weights 😦

Isn't CosXL similar? I wasn't super impressed with that model

charred mesa Apr 17, 2024, 5:28 PM

#

both emad (former stability ""dev"") and Lykon is saying that weights will come

sage reef Apr 17, 2024, 5:28 PM

#

yea similar but cosxl has some blurryness problem idk or it kinda deforms the output

charred mesa Apr 17, 2024, 5:28 PM

#

So to clarify, once you are done finalizing the architecture, will the model be released where people can download it for free for personal use?

Maybe even before, it's not up to me to decide.

steep arrow Apr 17, 2024, 5:28 PM

#

sage reef yea similar but cosxl has some blurryness problem idk or it kinda deforms the ou...

Its artifacts remind me of SD1.5

sage reef Apr 17, 2024, 5:29 PM

#

but cosxl technically is the "test" before sd3 edit model i guess, so hopefully the sd3 edit model will be better

charred mesa Apr 17, 2024, 5:29 PM

#

yeah cosxl edit was mid

#

#🆕｜sd3

steep arrow Apr 17, 2024, 5:30 PM

#

Even cosxl base was mid, I was hoping for different clip coherency.

Turns out IpAdapter Composition does way more to help things than a model switch did.

charred mesa Apr 17, 2024, 5:30 PM

#

^

sage reef Apr 17, 2024, 5:30 PM

#

then again... i didnt try to combo cosxl edit with sdxl refiner.. maybe it fixes the output

charred mesa Apr 17, 2024, 5:30 PM

#

oh I was hoping for more than a blurry finetune with just contrast being increased

vague pond Apr 17, 2024, 5:31 PM

#

I'm not concerned one bit about the non-commercial license tbh, I already am used to releasing my finetunes as non-commercial. I make those models for people to use, not for people to throw on some expensive cloud service or for people to put into some paywalled low-effort merge.

steep arrow Apr 17, 2024, 5:31 PM

#

sage reef then again... i didnt try to combo cosxl edit with sdxl refiner.. maybe it fixes...

I was getting very large oddities, I don't think the refiner would clean some of the mess i was getting.

I also never use the SDXL refiner ever, so... lol

charred mesa Apr 17, 2024, 5:31 PM

#

^ same

sage reef Apr 17, 2024, 5:31 PM

#

yea refiner i only used at the very start of sdxl, then finetunes came and it was kinda pointless

charred mesa Apr 17, 2024, 5:32 PM

#

vague pond I'm not concerned one bit about the non-commercial license tbh, I already am use...

not for people to throw on some expensive cloud service or for people to put into some paywalled low-effort merge
based

wheat anchor Apr 17, 2024, 5:32 PM

#

Re: Command-R #🏞｜general-with-images message

rugged mirage Apr 17, 2024, 5:32 PM

#

I used the refiner in like the first 2 weeks of sdxl, and I think maybe once in january

charred mesa Apr 17, 2024, 5:32 PM

#

yeah

steep arrow Apr 17, 2024, 5:32 PM

#

I just used hires mulitpass from the get go. I only played with the refiner right at release

charred mesa Apr 17, 2024, 5:32 PM

#

the refiner was deemed useless like 2-3 months after usage

#

if not less

vague pond Apr 17, 2024, 5:33 PM

#

they probably should have made the refiner go over the last 300-400 timesteps instead

steep arrow Apr 17, 2024, 5:33 PM

#

A bunch of the finetunes integrated refinger training stuff for a while too. I don't really pay too much attention once I get things working

rugged mirage Apr 17, 2024, 5:33 PM

#

the 2nd wave of finetunes were already exclusively saying to run without refiner

charred mesa Apr 17, 2024, 5:33 PM

#

exactly

vague pond Apr 17, 2024, 5:34 PM

#

rugged mirage the 2nd wave of finetunes were already exclusively saying to run without refiner

yes because they were too lazy to change the one line of code necessary to train it and invest an additional 20% compute in the training run

#

literally all you have to do is change the timestep selection from sampling [0, 1000) to [0, 200)

#

and since you're only training 1/5th of the timesteps the model should pick up on things about 5x as fast since it will only ever need to learn high frequency details

eternal ledge Apr 17, 2024, 5:37 PM

#

It's also just tedious to manage/publish 2 files rather than one 🤷‍♂️ and help confused users who applied them in the wrong order and so on

vague pond Apr 17, 2024, 5:38 PM

#

plus the refiner model is also differently structured:

"attention_head_dim": [
    6,
    12,
    24,
    24
  ],
  "block_out_channels": [
    384,
    768,
    1536,
    1536
  ],
  "transformer_layers_per_block": 4,
  "up_block_types": [
    "UpBlock2D",
    "CrossAttnUpBlock2D",
    "CrossAttnUpBlock2D",
    "UpBlock2D"
  ],

vs the base:

 "attention_head_dim": [
    5,
    10,
    20
  ],
  "block_out_channels": [
    320,
    640,
    1280
  ],
  "transformer_layers_per_block": [
    1,
    2,
    10
  ],
  "up_block_types": [
    "CrossAttnUpBlock2D",
    "CrossAttnUpBlock2D",
    "UpBlock2D"
  ],

rugged mirage Apr 17, 2024, 5:38 PM

#

vague pond yes because they were too lazy to change the one line of code necessary to train...

idk, there were some using it and some not using it, and those that didnt use it ended up rising to the top of civitai downloads, presumably for a reason

steep arrow Apr 17, 2024, 5:39 PM

#

Same reason cascade is not popular, people do not like to load a million things to do one thing.

rugged mirage Apr 17, 2024, 5:39 PM

#

I feel like most workflows require you to load a ton of things and people deal with it

sage reef Apr 17, 2024, 5:39 PM

#

i like cascade more for the image remixing part 🙂

rugged mirage Apr 17, 2024, 5:39 PM

#

if it was enough better people would've lived with it

steep arrow Apr 17, 2024, 5:40 PM

#

rugged mirage I feel like most workflows require you to load a ton of things and people deal w...

But each piece of the puzzle feels independent when crafting a flow. Having a segmented model is just awkward.

vague pond Apr 17, 2024, 5:40 PM

#

eternal ledge It's also just tedious to manage/publish 2 files rather than one 🤷‍♂️ and help ...

It also doesn't help that both ComfyUI and A1111 had incorrect implementations of the refiner. A1111 was switching over based on sampling step until I fixed it, and ComfyUI doesn't have an easy way to switch based on timestep that is built in.

steep arrow Apr 17, 2024, 5:41 PM

#

Look at civit rollout of cascade models, how was that ever going to make sense for sharing finetunes? lol

rugged mirage Apr 17, 2024, 5:41 PM

#

the a111 was really off yeh true

vague pond Apr 17, 2024, 5:41 PM

#

I think Diffusers is the only one that implemented it correctly