#🆕|sd3

1 messages · Page 6 of 1

viral plaza
#

0.o?

dusky thistle
#

the effect is greatest with the magic RES sampler i have, but yeah

#

we've been poking around over on the L2 server trying to pinpoint what's going on halfassedly

viral plaza
#

Wait can you confirm that fp32/fp16 are near identical with the same setup? ie it's specifically fp64 where something weird happens?

dusky thistle
#

fp32 and fp16 are also different but it's not as dramatic

jolly swan
#

Is there any chance you can get someone with the right paycheck to confirm this?

dusky thistle
#

i believe deliberately setting fp32 made no change

#

so it's not something that's accidentally fp8 somewhere

#

and a lot of us have replicated it

#

so it's not just my crazy setup

#

at fp32 or fp16 the images tend to be more creative - more combined objects, more transforms, and also more artifacts, broken fences, that kinda stuff

viral plaza
dusky thistle
#

fp64 things really come together

noble coyote
#

As somebody is talking about sigmas (noise scehdule?) - what exactly do they do?

dusky thistle
#

more detail, more cohesive images, and the it/s is unaffected

candid holly
#

One of these days after I build my new PC, I'm gonna have to sit down over a weekend and learn comfy...

dusky thistle
#

so it's something small somewhere

#

it's def not a bottleneck step

dusky thistle
noble coyote
#

ComfyUI is a doddle to learn! 😄

viral plaza
dusky thistle
#

sigmas are so much easier to understand imo

viral plaza
#

if you scroll up to clownshark's comfy screenshot you can see some graphs of his chaotic scheduling

patent acorn
#

so sigma

hallow talon
#

Will it be possible to train a lora for SD3 with the 2B model if I have a 4080?

dusky thistle
noble coyote
#

OK, so it is a curve which influences the noise-schedule and there is obviously a sigma-sweetspot to maximise image quality?

dusky thistle
#

and some other shit getting injected into the sampling process and mixed in with hard light

#

that thing is a monstrosity

viral plaza
# cobalt moon this right?

yes that there is christian, our CEO, and relevant final decision maker on the licensing topic, standing next to lisa su

static cairn
#

do you mind sharing what's being worked on for 8B? does it need more training before it feels worthy enough for a release? my team wants to train SD3 but we're debating whether we should wait for 4B/8B. 4B seems ideal for us

hallow talon
# viral plaza yeah

thanks! Looking forward to getting started with finetuning when it releases.

viral plaza
candid holly
# dusky thistle comfy is a lifestyle, not a weekend 🤣

with my new PC being able to iterate a new image in under 20 seconds should make learning by doing much easier. having to wait 40-80 seconds per image means that you spend your time waiting playing around with multiple things only to find in your next image, one of the tweaks you just did borked the whole thing

noble coyote
#

So as I eagerly anticipate June the 12th ... will having an 8Gb VRAM RTX 2070 ... will that disappoint?

torpid forge
#

alright, definitely looking better now that I'm not forcing the prompt to stay under 75 tokens

viral plaza
#

all the recent training work was on 2B

tiny lintel
patent acorn
#

example a T4 gpu

static cairn
viral plaza
patent acorn
#

ohh

dusky thistle
viral plaza
#

generally SD3-Medium is a bit faster and easier to run than SDXL is

noble coyote
patent acorn
#

its prob well lower resources than sdxl to me

#

yeah im righ

viral plaza
#

so whatever your SDXL perf is - expect a tad bit better than that

torpid forge
#

nice

candid holly
viral plaza
#

that's more than good enough yeah

dusky thistle
#

is 4b something we can expect this summer or should we buckle down and commit to building stuff for 2b?

patent acorn
#

what does the devs think about this though?

noble coyote
#

Not to spoil the party, but features like eyes, fingers and limbs are still poor - even for all the hype and chutzpah surrounding SD3?!

viral plaza
dusky thistle
#

great, thanks, appreciate it

viral plaza
#

Building against 2B is worthwhile imo regardless because most of the initial work is knowledge&tools, and those will transfer to the other scales

dusky thistle
#

okay great

#

good to hear!

#

the loras won't but if the tools will, perfect

viral plaza
#

not perfect

dusky thistle
#

get the training down with the fast train 2b

viral plaza
#

but like... RealVis-SD3 or whatever is gonna come out and you'll have awesome humans

dusky thistle
#

now where does 8b shine? is it in quality? diversity of concepts? transforms/creativity/combining stuff?

patent acorn
dusky thistle
#

my biggest desire is to have something that can combine concepts that sdxl tends to separate into discrete ones

viral plaza
dusky thistle
#

clown + shark = clownsshark, not clown + shark

noble coyote
#

There a guy @Glif named FABLAN - he is running SD3 generation for free?!

patent acorn
noble coyote
#

How can that be so?

cobalt moon
#

before ever Q6 or Q4

viral plaza
candid holly
opaque zodiac
#

will it be free?

dusky thistle
#

i've got my dora -> crazy noodle fiesta sampling psycho ward workflow pipeline down

#

i'm training on crappy images and getting razor sharp ones out now

noble coyote
#

I find that text and prose in SD3 works about 85% of the time ...

viral plaza
viral plaza
dusky thistle
noble coyote
#

About on a par with Harrlogos2 LoRA in SDXL

viral plaza
opaque zodiac
#

hey dear

#

will it be free?

dusky thistle
#

the API version has been able to do transforms that dalle3, mj, sdxl, etc all failed miserably at

#

is the api the 2b?

viral plaza
#

either now on API the 8B or june 12th with the 2B model itself

cobalt moon
#

oh wait

#

what's the thing with Core SDXL

viral plaza
viral plaza
cobalt moon
#

there have been a lot of misinformation of Core SDXL and SD3 API

patent acorn
#

for real someone should post all those information here to reddit

cobalt moon
trim arrow
#

weights are the uh... model thing, right?

cobalt moon
#

still writing

noble coyote
opaque zodiac
patent acorn
viral plaza
# cobalt moon what's the thing with Core SDXL

ImageCore is a workflow/finetune of SDXL, "ImageCore" is a placeholder to indicate "whatever the current best we have for general image generation" not including beta models like sd3

candid holly
cobalt moon
#

oh ok

viral plaza
trim arrow
noble coyote
#

"How long do we have to 'weight'?!" 😄

viral plaza
patent acorn
trim arrow
#

God it must suck to be you rn with how fast chat is going with all these questions

viral plaza
trim arrow
#

OHHHHHHH i missed the damn joke in there lmao

patent acorn
#

wait i didnt see it

viral plaza
patent acorn
#

the joke

viral plaza
viral plaza
#

I am putting my discord fingey muscles to the test

patent acorn
#

now test your type speed

cobalt moon
#

still writing...

#

i actually have my midyear exam tomorrow

#

so

#

good luck for me

patent acorn
#

good luck

viral plaza
opaque zodiac
patent acorn
viral plaza
# opaque zodiac what are the other versions?

Different sizes of the model, with different resource requirements and theoretically once trained different quality. So the small model for example uses half as much VRAM/time to generate, but theoretically gets about half the quality, vs large is double/double.
In practice the quality different is less precise, and mucked up by the fact that only Medium is trained well so Medium is best quality atm

gusty gale
#

By 'Stable Diffusion 3 Medium' they are referring to a version smaller than 8B?

viral plaza
#

Small is 1B, Medium 2B, large 4B, huge 8B

vernal ocean
#

From XL to medium? Big downgrade?

candid holly
# viral plaza sure why not

Okay, rewrote it in prose rather than tags:

In the Hall of the Thousands Doors, an awe-inspiring representation of the afterlife, the atmosphere is heavy with the scent of incense and damp stone. Quiet souls wander amidst dim torchlight, casting long shadows across the cool gray stone walls adorned with intricate carvings and symbols. Endless hallways stretch out from a central hub, each lined with doors featuring unique, ornate designs and precious materials. Whispers hint at the lives contained behind each door. In the center stands Aponyx, the Raven Goddess of Death, her majestic yet terrifying presence watching over her domain. Above her colossal throne, a mesmerizing portal shines with an ethereal light, offering a glimpse of the City of Heaven.

opaque zodiac
gusty gale
patent acorn
#

there is a comparison

#

i mpretty sure thats the ice cube thing on twitter

#

lykon did post one too but its a guess game

candid holly
viral plaza
viral plaza
patent acorn
#

nobody talked about the gens sd3 b2 will be 512x512..

viral plaza
#

we have some comparisons laying around somewhere to show the current state of thing

vernal ocean
noble coyote
#

SD3@Glif (FABLAN) ... free-to-use ... 9 seconds/generation!

viral plaza
#

That's on a 2B release candidate model

patent acorn
viral plaza
viral plaza
patent acorn
opaque zodiac
viral plaza
noble coyote
#

SD3 into i2i SDXL is a very powerful look!

patent acorn
#

iirc ppl on civitai discord, somewhere twitter said it being 512

gusty gale
viral plaza
viral plaza
viral plaza
patent acorn
#

fuck yea

#

!

viral plaza
patent acorn
#

theres so much noises

candid holly
viral plaza
cobalt moon
#

oh my god

vernal ocean
#

Is the prompt style changing again?

cobalt moon
#

that astraliteheart situation is long to write

noble coyote
#

I think this iteration of SD3 Medium should be nicknamed 'Hamlet'!!! 😄

patent acorn
#

he was typing out the yapping for 15 mins

#

wait what gender is astral

noble coyote
#

2B or not 2B ... that is the ?

patent acorn
#

keep it a sub version of SD3

opaque zodiac
#

artisan is sd3?

patent acorn
#

yeah?

candid holly
viral plaza
# opaque zodiac artisan is sd3?

Artisan is a discord bot with a variety of features - SD3 is the default result of /dream but you can also use Core or whatever

noble coyote
#

Hamlet x 4 - 8B or not 8B

patent acorn
#

nooo just keep it as sd3!!

viral plaza
patent acorn
#

it would be boring for an iteration of sub sd3 be called that

viral plaza
ocean lance
#

are you using clip vit-g and clip vit-l again? and are they the original models, or did you fine tune them?

noble coyote
patent acorn
muted dove
#

What's the story with Cascade? Why was it released and then abandoned, even on here?

viral plaza
viral plaza
noble coyote
#

Oh yeah - if it was down to me - "Torcello-2B! ™ "

ocean lance
patent acorn
dusky thistle
#

And I'm getting better quality images now with some sampling tricks than I've seen from any closed source model except sora

#

From sdxl

#

We've got the flexibility of custom tools, trainability... It's way more powerful than any closed source bs

patent acorn
#

i remember trying out Open-Sora-Plan and it generated worse than expected, wanna post it here

dusky thistle
#

I even have SDXL writing text

noble coyote
#

SD3@Glif

dusky thistle
#

It's far more capable than we think, just need tweaks and training on custom concepts

#

Doras are incredible

patent acorn
candid holly
#

Gonna have to go bug the guy I know with his $300,000 8 H100 setup to see if he'll let me use a bit of it's power when his team is taking a break from trying to make AI generated games...

hallow lion
#

lol i got the email too so apparently i wa son the list but i never got to access sd3 before lool

#

What's wrong with me Emad? XD

patent acorn
#

eh i got the email after losing a match on mobile legends anyways

hallow lion
#

loool

patent acorn
#

that was like 3 hours ago

#

i think

hallow lion
#

Photorealism: Overcomes common artifacts in hands and faces, delivering high-quality images without the need for complex workflows.

#

Bold claims.

patent acorn
viral plaza
cobalt moon
#

oh man

#

there's like

#

10 more question to go

#

I already write 6

viral plaza
patent acorn
#

if someone trained lora with danbooru tags on sd3 ill be dead out laughter

patent acorn
#

NOOOOOOOOO

cobalt moon
#

I mean it is inefficient to write CLIP tags for anime

#

where you could have thousands of detail to write

dusky thistle
patent acorn
viral plaza
cobalt moon
#

ask Lykon

patent acorn
dusky thistle
ocean lance
#

have they got a pull request ready to add the sd3 pipeline to diffusers?

viral plaza
#

beyond that idk

rain current
dusky thistle
viral plaza
#

@jolly swan I got a reply from one on the team managing the licensey stuff: they're still getting it sorted but are expecting to have a clear answer for commercial users before launch

patent acorn
#

magnificient

robust junco
dusky thistle
#

Dora the Mad Sampler

open plover
#

SD3 model will be dropped soon? Is that true?

viral plaza
sterile pendant
dusky thistle
#

and it's not just size

#

i've done some 1:1 testing recently

robust junco
dusky thistle
#

even though the charts show doras being marginally better at rank 64...

#

the results are distinctly better

#

there's a sharpness and a cohesiveness to the images that i've never gotten out of a lora/locon

muted dove
sterile pendant
ocean lance
#

how will the model be packaged? will the T5 files be separate?

sterile pendant
#

like when you use pixart sigma

dusky thistle
dusky thistle
#

what i found surprising was how significant the gap remains beyond that

sterile pendant
#

i'd have to look into it more

dusky thistle
#

it's not something you'd notice standing 10ft from the screen, but it's very much there

#

i'm getting images far better than i trained on in terms of quality

sterile pendant
dusky thistle
#

i had lots of blurry, kinda pixelated muddy stuff from cascade... came out great with the dora, whereas with locon and lora especially that crap carried over

viral plaza
muted dove
viral plaza
#

there's some disagreement internally about optimal format atm tho

viral plaza
# robust junco

"anime sucks"? I mean you can have your own preferences and whatever, but you can't deny millions of people love it

patent acorn
dusky thistle
#

any defects you see there are a result of the crappy lora

#

cuts down the incidence of misshapen eyes quite a bit

#

mutations, etc

#

it can do text

#

nodes need some cleanup then il'l throw em on github

muted dove
#

Great, thanks!

patent acorn
ocean lance
#

Will the workflow in comfy be similar to SDXL where you can send a different prompt to each text encoder? Does it still take the resolution, target, and crop parameters?

viral plaza
limber stream
#

and no point refiner right 😂

cobalt moon
#

ouch...

robust junco
ocean lance
#

Just checking, but does it support img2img and upscaling?

viral plaza
viral plaza
trail palm
#

Hey Alex! hope all is good on your side. Will you guys release controlnets as with sdxl?

cobalt moon
#

grammar stuff

viral plaza
dusky thistle
#

what kind of vram requirements are we talking to train one?

#

probably >24gb?

cobalt moon
#

( that someone from CivitAI )

viral plaza
viral plaza
dusky thistle
#

i presume like the loras, you'll need to train them separately for 2/4/8b etc

cobalt moon
viral plaza
bold socket
#

Does anyone have a link to any sort of information where i can inform myself about the status on the training of lora's on SD3

teal fossil
#

I'm so used to vaguely organizing my crazy noodle empires now...

patent acorn
dusky thistle
#

over 400,000 characterss of metadata

patent acorn
sterile pendant
viral plaza
sterile pendant
#

I know I'm just joking around lol

dusky thistle
#

things kinda went downhill with this one

#

and thank god i finally made a node to do this

cobalt moon
#

holy mother

#

..

#

what are you actually doing

patent acorn
#

NERD SHIT

dusky thistle
elfin plank
#

rofl, that looks like my students doing unreal engine visual scripting. Visual scripting is not a reason clean code should not be maintained! 😄

dusky thistle
#

the modulation continent^^

#

i call it greenland

patent acorn
#

NERDUI

dusky thistle
elfin plank
#

no, just no

dusky thistle
#

yeah this was my worst ever

#

this shit is what happens when i'm experimenting

dusky thistle
#

the contintents of sigma nodes now look like this

sterile pendant
#

so much so that i actually prefer noodles over code

dusky thistle
#

it's a balancing act

#

if your organizing is costing you more time than it saves digging through messes, you should stop organizing

#

if it's the other way around, it's time to organize

#

espec if you're going to be using it a lot

#

but experimenting? trying new ideas? make a mess.

#

all just about getting shit done

sterile pendant
#

yep, noodles are amazing for experimenting. granted, in UE4/5, you can just make a million functions to clean up the graphs, but still, early on, you deal with a lot of noodles while shits getting hammered out

torpid forge
#

I organize when my brain wants to rage quit

#

also, someimes what looks organized to me surely looks like a monstrosity to otthers

elfin plank
#

At least group and comment stuff :/ Don't teach yourself crap routines.

dusky thistle
#

current WF... organized as hell to me, but that's cuz i got the components memorized

elfin plank
#

Future you will appreciate the 5 mins you take to do that.

torpid forge
#

building those crap neural pathways up, bud

viral plaza
# dusky thistle

hey google can i ban someone from a discord just for inducing brain damage visually

dusky thistle
#

i'll be honest... that one became a practical joke

#

took every single node i had in all my open workflows

#

dumped them into one

#

and chained like... 12 samplers in a giant ring randomly combining everything

torpid forge
#

but why you do that?

sterile pendant
#

shit clown, i forgot about the plugin node i was talking about a while back where i was wanting to make some kind of bspline editor for doing the sigmas

#

when we were experimenting with stuff back then

dusky thistle
dusky thistle
torpid forge
#

man, I have had some monstrosities myself. but never started off with that intentions

#

they organically turn into monsters

sterile pendant
#

i just dont know the framework for comfy all that well, but in theory, it's pretty simple. i could probably whip one up in UE5 in maybe 30 minutes, 20 of which would be relearning equations and stuff lol

dusky thistle
#

here's the full workflow i made where i combined everything i could

#

in the most complex way possible

patent acorn
#

still nono

dusky thistle
#

it gave me this.

sterile pendant
dusky thistle
#

thanks

#

it took modifying the samplers to control it

#

or it just got blown out or muddy

sterile pendant
#

just havent been active that much lately to talk as much on here. busy with some projects

dusky thistle
#

the dora really helped push things over the edge too

#

work finally died down for me after an 1812 overture style bombastic end in early may

#

so my form of relaxing has been binging on making nodes i've wanted for a while

sterile pendant
#

i feel you there

sand jay
#

@viral plaza do you have more demo pics of the current SD 3 medium? 🙂

viral plaza
dull star
#

Yay

#

A set date

#

This is nice

teal fossil
#

Well, with weight loading we could prune those... but without Unet most LoRA's won't be worthwhile at all, so we should just ReTrain everything on the new tech with better caption understanding.

viral plaza
#

textenc based models are super powerful, i wish people trained them more often

dull star
#

Is it possible on 24GB

hallow lion
patent acorn
dusky thistle
#

before, after, and 1/4 the run time

#

that was trained on a number of images that had blurry edges

#

blown out contrast, etc

ocean lance
torpid forge
#

I think I know stuff until I come here, lol

hallow lion
#

There's always someone with a more complicated workflow...

dusky thistle
patent acorn
#

before sd3 comes out, is there a magic prompt node for comfy ui like ideogram?

dusky thistle
#

the 541 node one or whatever

torpid forge
#

no no no

viral plaza
dusky thistle
torpid forge
#

oh nice

dull star
viral plaza
#

oh, yes that too yes

dull star
#

Lessgoooo

#

Thanks

torpid forge
hallow lion
#

my worst

#

:)))

cobalt moon
hallow lion
#

ah yes

#

otherwise i go nuts

dull star
#

Oh wait this is 2B, even DiT training might be possible

torpid forge
#

I just put the boxes underneath, foret what those are called

#

the magnetic boards

dusky thistle
# hallow lion

you don't even have any nodes overlapping? live on the edge... try sticking a save image node underneath a ksampler

dull star
#

It's for 8B where training on 24GB might be finicky

dusky thistle
#

fingers crossed the 5090 is 32gb

#

if it's 28gb i will die

patent acorn
#

it says 28gb right

#

for 5090

cobalt moon
#

lol

dusky thistle
#

i mean really, it should be 80gb

#

nvidia is fucking everyone

cobalt moon
#

I feel like there is not need to keep adding up VRAM

#

like hell

dusky thistle
#

the fact AMD failed so horribly to offer real competition is killing our field

cobalt moon
#

at this point 5090 is going to be all-round GPU rather than gamer-orientated.

torpid forge
#

why has it been 24gb for like 5 years?

torpid forge
hallow lion
#

its not the complexity that kills it for me its the slowdown, a few ipadapters and maskign and it slowls to a crawl

#

😦

torpid forge
dusky thistle
#

by not offering more than 24gb, they can charge another $10k for the next 56gb

#

$180/gb

#

which currently costs like 3 bucks

torpid forge
#

I saw that picture of the h200 I believe?

dusky thistle
#

there's no equal to their high end cards, and there's no way around vram requirements for training, so they can charge pretty much anything they want

torpid forge
#

openai bro and nvidia bro and then some other guy

dusky thistle
#

there's really no reason we shouldn't be able to buy consumer equivalents of a100s

torpid forge
#

it's like a small fridge

#

or medium maybe

cobalt moon
viral plaza
cobalt moon
#

H100 also have SXM version

dusky thistle
cobalt moon
#

SXM is like CPU socket with no cooling whatsoever, and also NVLink each other more efficiently compared to PCIe version

sterile pendant
# torpid forge why has it been 24gb for like 5 years?

because contrary to popular opinion, gaming GPUs don't really need more than 16gb of vram right now, unless the devs are bad and don't know how to manage texture pools correctly. also, allocated vram != used vram. you can make a pixel game like minecraft allocate 85% of the VRAM if you want, even if it's only going to use 100mb of it, but if you check the gpu usage, it will show that a ton is "in use."

cobalt moon
#

basically cooling was settled by air conditioner inside server room who often have air circulation

teal fossil
viral plaza
#

CLIP is 77, T5 is 512 or whatever, but you can also just stack multiple CLIPs as-needed

dull star
#

I wonder what will happen if we plug longclip into it

#

In like comfyui

frail tulip
#

hi guys

ocean lance
#

I didn't like the results with longclip

patent acorn
frail tulip
#

oh hi there

patent acorn
#

ohh this guy

#

hes resistant to ELECTRIC!

patent acorn
#

no this channel is sd3 discussion not your jjk general

patent acorn
#

so hi

frail tulip
#

btw hi

ocean lance
#

less gifs, more sd3

patent acorn
#

hmm i think this would be fine

torpid forge
cobalt moon
#

yo

patent acorn
#

NO

frail tulip
#

yo

cobalt moon
#

this is not your usual interaction server

frail tulip
#

then?

patent acorn
#

welcome to the server where the world does not revolves around you!!!!

frail tulip
#

what do we do?

patent acorn
#

this is FUCKING SD3 DISCUSSION AND SOME AI SHIII!!!!!

#

sorry for being aggresive

frail tulip
#

ayo chill

cobalt moon
#

if you keep posting this you will get oof.

#

anyway

frail tulip
#

i am new here so i dont no anything

dull star
#

Oh will we get controlnets or not?

patent acorn
#

you only should discuse sd3 in this channel (either ai stuff because why not), not the usual interaction

frail tulip
#

waht is sd3

patent acorn
#

wait

#

how did you

#

even join

#

??

cobalt moon
patent acorn
#

who invited bro

#

??

cobalt moon
frail tulip
#

i want a channel where u discuss abot bots

cobalt moon
#

pretty sure you have heard of SD1.5 or SDXL

cobalt moon
#

it is a new member in the family

patent acorn
patent acorn
frail tulip
#

wait

cobalt moon
#

... actually depends how successful is the 2B

frail tulip
#

is this a family

patent acorn
#

No

frail tulip
patent acorn
#

that was a sarcasm

cobalt moon
#

what?

patent acorn
#

kinda like a new member to the server

cobalt moon
#

bro

frail tulip
#

patent acorn
#

@frail tulip who invited you anyway

sterile pendant
patent acorn
#

its like ur so clueless

#

about stable diffsuion

frail tulip
#

私は新しいです

#

no one invited me

patent acorn
#

ah my lil bro dont read rules

#

then

#

how did you came across this server?

frail tulip
#

i explore discoverable server

patent acorn
cobalt moon
#

calm down bro

patent acorn
#

go to anime hangout

#

not here

frail tulip
#

no

patent acorn
#

alright

frail tulip
#

i wanna learn smt abt chatbots

patent acorn
#

but

cobalt moon
#

oh chatbots?

cobalt moon
#

that's Large Language Models' job

patent acorn
#

the most chatbots are on Llama chatbot discord servers, stable diffusion is image

cobalt moon
#

or LLM for short

patent acorn
#

yea

cobalt moon
#

yeah Stable Diffusion is text2image

torpid forge
cobalt moon
patent acorn
#

im loving llama 3

frail tulip
#

ok

#

what do u even do here?

#

wtf

patent acorn
#

this one is opensource

#

the nowadays u see locally are Dalle 3

frail tulip
#

that is only what i am talking abt

torpid forge
#

you need to chill there, bud

frail tulip
#

i joined the right server then

cobalt moon
#

someone is damn new alright

#

DALL E 3?

cobalt moon
#

well DALL E 3 is intergrated into ChatGPT

patent acorn
#

if ur confused

frail tulip
#

i ma leave

patent acorn
#

goodbye

frail tulip
#

cya

torpid forge
#

whata is happening?

cobalt moon
#

before that DALL E 2 is a completely independent website on its own

frail tulip
#

dayum

sterile pendant
cobalt moon
#

no ChatGPT stuff

patent acorn
#

yup i do not like dalle

torpid forge
#

why are people tribal in regards to AI models?

cobalt moon
lavish pier
#

is the sd3 model which is releasing bad compared to the api one?

cobalt moon
#

you know when one product with revolutionary feature get introduced

torpid forge
#

yeah, tthat is the technical answer

obtuse fractal
cobalt moon
#

how will the people react

#

it's not just AI, but like, everything.

patent acorn
#

exactly just dont post anything AI stuff to artist discord server lol

cobalt moon
#

you show them boring stuff they dont even bother you whether you release the thing or not

cobalt moon
patent acorn
#

though twitter has misinformed them and leading them thinking its "theft"

sterile pendant
# torpid forge why are people tribal in regards to AI models?

why do people get tribal about their favorite sports team, their favorite music genre, their favorite foods, etc etc. it's just people doing people things and it's mostly just the vestiges of hundreds of millions of years of evolution that got us to this point.

torpid forge
#

people are only doing themselves a disservicee if they deprive themselves of learaning about things simply to feel like they're on a team

torpid forge
patent acorn
#

humanity moment

torpid forge
#

so many of these things that were once necessity

obtuse fractal
torpid forge
#

it was advantageous back when our ancestors were cavemen. I guess maybe it still is in some ways

#

I just like learning about different things

sterile pendant
# torpid forge yeah, that's something I've actually been reading about a lot recently

just observe our ape cousins, you'll see the same stuff. even in packs of other animals as well, dogs, cats, mice, etc etc. it's pretty common in the animal kingdom to not want to be the outcast, because that usually signifies something wrong with the individual and the urge to reject them is to keep the "broken genes" out of the pool. it's observed in pretty much every creature on the planet with a brain

#

tribalism is just an extension on all that

#

since we have more cognitive capacity and all

patent acorn
#

what about people calling anyone that uses ai "tech bro" (not ai bro in this case)

#

techphobic much!

torpid forge
#

well they're probably not comfortable with ai

#

makes them anxious

#

so they externalize it and call people tech bro

cobalt moon
#

remember those industrialist back in 1800s at England

obtuse fractal
cobalt moon
#

there always had peoples fighting against the idea of an industrialist and sought to protect their green land

#

it is the same

#

if you knew it

obtuse fractal
#

Early adopters of tech have always been mocked by the same people who later adopted it

torpid forge
#

People don't like the idea of the reality they grew comfortable in changing

#

It means the things they're good at might become less relevant

cobalt moon
#

this remind me one Tom Scott video

patent acorn
#

oh nvm

#

i misenterpreted that lol

#

anyone that mines ig

obtuse fractal
#

lol

#

Ive made good money with crypto

#

But crypto bros are annoying af

torpid forge
#

People also tend to dismiss things they don't understand

#

Crypto bros are like "influencers"

#

More sure of themselves than they should be

patent acorn
#

but are ai bros worse? i dont think so lol

obtuse fractal
dull star
obtuse fractal
#

$100 I’ll never get back 💀

patent acorn
#

YOU POOR SOUL.

cobalt moon
torpid forge
sterile pendant
cobalt moon
#

I remember correctly all ControlNet models was uploaded by Illyasviel

#

but is he a Stability dev/staff?

patent acorn
obtuse fractal
patent acorn
#

no thank you

torpid forge
patent acorn
#

id screenshot

#

;)

torpid forge
#

I don't even know about these dark humor nfts. I guess I haven't kept up

obtuse fractal
patent acorn
sterile pendant
patent acorn
sterile pendant
#

And therefore, they never learn or grow up

torpid forge
stray cedar
#

SD3 finally gets announced and this is what the convo turns into on the SD3 channel...

sterile pendant
torpid forge
stray cedar
#

hmmm how about... SD3

torpid forge
#

oh, by all means

#

go on

sterile pendant
#

what is there to talk about other than it comes out in a couple weeks?

#

and maybe see the same waifus spammed over and over again lol

patent acorn
patent acorn
#

there isa lready a reddit post for the infos

torpid forge
#

AI short circuits the idea to result path. it will facilitate an explosion of creativity

dusky thistle
patent acorn
torpid forge
#

oh well

#

they're on the wrong side of time

patent acorn
#

wait for 4 years and i will be able to reference AI stuff without getting harrassed

torpid forge
#

they're literally fighting against the idea of democratizing creativity

patent acorn
torpid forge
#

they won't be remembered in a positive light

sterile pendant
# dusky thistle Can't wait for the prompt zombie Armageddon

oh lord, and the complaining about them not looking good enough just because it isn't some hyperoverfit model they're used to using that spits out "adult women" with the facial proportions of a younger-than-adult-woman... oh and the bobs not being gud enuf

patent acorn
#

i will say they waste their time arguing on twitter, they will say i waste my time ai genning "slops"

dusky thistle
#

I can't wait to fulfill each other every request

sterile pendant
#

SAME lol

#

i'm going full troll with it using the typical old men in tiktok poses lol

torpid forge
#

well also people act like the models won't become more sophisticated. like they're these static things that will just be

patent acorn
#

whats a "sophisticated"

torpid forge
#

it's an adjective

patent acorn
#

whatever!

torpid forge
#

(of a machine, system, or technique) developed to a high degree of complexity.

hallow lion
#

We need AI to fix the real estate and the economy. 😄

patent acorn
#

AI Governments when

torpid forge
#

never

sterile pendant
torpid forge
#

who will train them?

patent acorn
#

hey man can you unban tax evasion
"im sorry, but"

torpid forge
hallow lion
#

I am going to Mars and throw your tea into space. I refuse to pay taxes to earth. Havea nice day.

#

Free Mars.

desert garnet
torpid forge
#

what about the 3 sea shells?

#

when's that happening?

sterile pendant
#

hah they dont know how to use the three sea shells

craggy ridge
#

So. Many. Messages. meow

dusky thistle
#

base is flexible

#

i like that

torpid forge
#

I use whatever I want

#

SD3 image for you guys

hallow lion
#

Im glad Dear Leader has been trained in.

patent acorn
hallow lion
#

All hail Kim.

sterile pendant
torpid forge
#

all the models have everyone in them. I got dalle3 to give me Xi

#

turns out the key is just spelling atrociously

dusky thistle
hallow lion
#

u just got winnie the pooh

dusky thistle
#

big waste of all the training ability we have

#

there's already trillions of pr0n pics online, do we really need ppls 4090s cranking out another 90k a day

torpid forge
#

nah, I got copilot to flip out about stuff microsoft did, censoring things for china. and then started in on making images to illustrate

hallow lion
#

did google fix the black pope divesity?

#

and asian nazis

noble coyote
#

SD3@Glif

sterile pendant
hallow lion
restive halo
#

didn't they originally say we are getting Controlnets with the release? It sounds like that's no longer the case?

hallow lion
#

BastardV1 is good for promt adehrence

#

it gets ANGLES right

#

top down, low angle etc

noble coyote
#

Knot Essdeethree

hallow lion
#

make one wiht XI and Kim kissing

patent acorn
dusky thistle
#

we just need the weights and the tools

restive halo
#

sorry, who cares if we have the tools, as long as we have the tools? what are you even saying

hallow lion
torpid forge
hallow lion
#

only famous artists were left out i guess

#

and getty/shitterstock watermakrks

#

famous people are fair game

restive halo
torpid forge
#

which artists were left out? I'll test that out

hallow lion
#

theyre in the public domain and attention whores anywayz

patent acorn
#

can sd3 generate this?

torpid forge
#

yes

sterile pendant
patent acorn
#

oh please

desert garnet
patent acorn
torpid forge
#

but can sd3 generate this?

noble coyote
#

SD3@Glif

restive halo
patent acorn
patent acorn
torpid forge
#

so can I. in fact I generated that image right there

patent acorn
#

oh shit fish gyatt 👀

noble coyote
#

SD3@Glif

torpid forge
#

can sd3 generate this?

#

yes. yes it can

noble coyote
#

A Coelacanth?!

teal fossil
#

That would also mean that we can use Longclip (once both versions are available).

sterile pendant
noble coyote
#

SD3@Glif

patent acorn
sterile pendant
#

i made it as a joke for my idiot fishing buddies that are obsessed with fish like they are attracted to them

noble coyote
teal fossil
patent acorn
#

i need to take a nap rn

noble coyote
patent acorn
#

men will see this and say hell yeeah

noble coyote
#

This is what all the hard work has produced!!! 😄

sterile pendant
sullen moss
#

12 June, almost two weeks 😅

noble coyote
lavish pier
desert garnet
teal fossil
noble coyote
#

Good prompt coherence

sullen moss
teal fossil
lucid swift
noble coyote
desert garnet
muted dove
noble coyote
#

2B or not 2B - hey, let's call it Hamlet instead of Medium?!

teal fossil
dusky thistle
teal fossil
noble coyote
#

Knot Essdeethree

hallow lion
#

looool

noble coyote
cerulean granite
#

a gnome

noble coyote
#

SD3@Glif

outer elm
#

@suchamazewow gut

rapid ivy
#

I thought because I paid for stable membership I'd get SD3 as a part of it but you're asking for commercial use for us to fill out a form.

Do you need that if you have stable membership?

torpid forge
hallow lion
#

Hmmm, number of legs questionable...

torpid forge
#

indeed, that's happened multiple times

hallow lion
#

oh no

#

before it was hands

#

now its legs!? :0

restive halo
#

so how exactly does the fully non-commercial license for sd3 work? Can you not even use it in youtube videos that have ads on them?

torpid forge
muted dove
restive halo
#

I wasn't sure if one of the staff who posts here hasn't said more

cobalt moon
#

they still sort it up

#

the license things

noble coyote
#

I suspect that SAI will make some money by selling Commercial Licenses ... but the Community can knock-itself-out for free ...

#

... as long as they're not selling stuff!

restive halo
#

the previous license was reasonable, you can use it commercially if you have less than X users (which presumably is the same as viewers for videos), and over that you pay

muted dove
restive halo
#

but in the email they implied you can't use it commercially at all for sd3, and you can't just pay you have to do a private deal

cobalt moon
#

dude

#

it doesnt mention it at all

restive halo
#

'If you would like to discuss a self-hosting license for commercial use of Stable Diffusion 3 please complete the form below and our team will be in touch shortly.'

cobalt moon
#

just the word non-commercial license

#

and form

#

read that again

restive halo
#

it implies you can only do commercial if you contact them and do a private deal

#

'SD3 Medium weights and code will be available for non-commercial use only.'

restive halo
#

only = at all

noble coyote
#

It would be very hard for SAI to regulate sales of SD3 material ... they'd have to have a "finger in every pie" to make it work!

cobalt moon
restive halo
#

what

noble coyote
#

In the old days, MJ said that you could sell material up to a certain level of remuneration. Anything above that, and MJ would want a cut, want a percentage

restive halo
#

that's similar to how the current SD license was, but in the email they say you can't do commercial 'at all' which is a change from the 100k users thing

cobalt moon
#

even though they did have some relation to each other.

noble coyote
#

Adobe do 'commercial' with Firefly: they allow so many free uses/month inside of Photoshop - then when that batch has finished - you must top-up, or wait until next month

cobalt moon
#

plus I am thinking "SD3 weight will be available for non-commercial use only" can imply those who paywalling their trained model.

hallow lion
#

Emad! That's 6 fingers!

lucid swift
teal fossil
teal fossil
hallow lion
#

On a scale of 1 to 10 how exhausted are you?

#

I am a solid 8.

#

C A F F E I N E is extinct.

low stone
finite hollow
cunning lintel
#

#🆕|sd3 message #🆕|sd3 message
I really don't like the 2B sample, way too contrasty with hard lines while nothing in the prompt calls for it, and while not so noisy, I'd call it smeary instead, fine detail got smoothed away. I kinda expected this effect was from overdone postproccessing when i saw that teaser tweets, but it seems not. I'd like the posted 8b one if only it weren't so unusable noisy.

Either way, tried the prompt in the API as well, it's different to both (follows the prompt less it seems, no goddess to be seen, but the overall aesthetic is way more similar to the 8b sample posted, not the weird 2b output), so that begs the question, what's running in the API? I'd suspect a first iteration with smaller text context and that that has been cranked up since.

torpid forge
remote holly
#

Sd3 for 12 june , the end of sd3 loop paradox lol

left parrot
#

Is it likely that we can run SD3 Medium with comfyUI on the 12th?

sterile pendant
# left parrot Is it likely that we can run SD3 Medium with comfyUI on the 12th?

Yes, comfyanon is a beast at getting new things up and running quickly. Even if he doesn't have it already ready, you can always just run the sample code from the hf page when it goes up, but you'll only really be able to do basic prompting

I'd imagine comfyui will have it ready before the models go live, or within 24h of the release.

plucky flax
#

Hey guys can you tell me which channel should I go to for help in using stable diffusion in local

gusty gale
coral sable
#

Fantastic news SD3

dull star
sterile pendant
wild remnant
low stone
woeful spindle
#

I wasn't lying when I said it's 2 weeks until the SD3 weights drop

dull star
#

SD3 when???
SD3 then

gusty trail
#

SD3 when-> SD3 Large When

dull star
#

So SD3 4B?

desert garnet
#

8b,i like them big

dull star
#

I like em chunky

sterile pendant
# dull star So SD3 4B?

Probably in a few months after they see what worked and what didn't work with the 2b model. Also, to give them time to pull in some revenue and to see if it's financially worth it to fully train the higher parameter models(or wait for some company to buy them out)

sullen moss
patent acorn
#

SD3 how

#

SD what

compact forge
#

So we got the announcement to announce the announcement on june 12th?

desert garnet
dull star
#

I am aware that it will take a lot of time

#

I bet we won't see 8B until July or August

#

when 2B comes out and its good, I'm gonna buy 10$ worth of credits as a donation to stability

gusty trail
dull star
dull star
#

Small, Medium, Large, Huge

gusty trail
#

SD3XL

dull star
#

yeahh

#

extra large

turbid grotto
#

extra huge

dull star
#

We're on track to release the SD3 models* (note the 's', there's multiple - small/1b, medium/2b, large/4b, huge/8b) for free as they get finished.

#

so yeah, 8B will come out

#

for free, offline, etc

wide pagoda
#

800M / 1B - "Small", same size as SD1.5
2B - "Medium", the first one that will be released
4B - "Large", same size as SDXL
8B - "Huge"

EDIT: this is wrong, apparently XL is 2B

dull star
#

basically

#

but 2B is as good as SDXL basically

sterile pendant
turbid grotto
wide pagoda
#

All except 2B are very much still WIP and not the current focus

dull star
#

800M might have to be their next target, so lower end people can get to work

tough oriole
#

Even if they gave us the 8B one they said it wasnt finished. It would take a while to even tune it so waiting is fine.

dull star
#

imagine FastSD-like implementation/app for SD3 running 2B-Turbo or 0.8B-Turbo or whatever

#

smart image generation democratized

#

its only T5 that has to be figured out

#

we need some quantization or ggml implementation so we can do stuff like IMatrix quantization at like 4-bit for T5

sterile pendant
#

2b will be massively more powerful than sdxl. The 2b size would be more like 8b in unet format. Sdxl has like ~3b in the unet, not counting the other model parts.

dull star
#

I remember 3.5B on the website, but I suppose that's the total and not just the Unet

#

idk why they didn't write that in the paper or the huggingface page

#

only in the announcement of SDXL

wide pagoda
turbid grotto
#

I checked base 1.5 and just shocked how much finetunes improved it, I think there won't be any quality problems with 2b version

sterile pendant
#

Kind of like how llama3 8b outperforms llama2 80b, it's kind of pointless to compare model sizes between different architectures

turbid grotto
wide pagoda
#

I'm mostly interested in what cards it actually runs on...

dull star
#

I can't wait to make Lora-like models for 2B

#

this is going to be so much fun

sterile pendant
#

So sd3 2b will likely be as if sdxl had 2-4x as many parameters, plus, don't forget we will have 16 channel vae

dull star
#

and Textual inversions might come back to fashion if we're going for ""cross-platform"" compatibility between the 800M and 2B models or whatever

#

just probably for styles, maybe not subjects

sterile pendant
#

It was just a bitch to pull off for image generation until more recently

#

Without having dog poop quality or paint drying speed

#

And without needing 9001gb of vram

turbid grotto
sterile pendant
#

It's in there, wanna say they call them mmdit blocks, but I can't remember off the top of my head

dull star
#

I suppose stability wants to do a llama moment where they actually train the model for a good time instead since now its more transformer based

#

if the results are gonna look like lykon's, I hope it will still have variety

sterile pendant
#

You can train it all you want, but if the captioning on the dataset is weak, you're just going to get more of the same. I know they used cogvlm for a good chunk of the dataset in sd3, but I don't know how well it works. I've used a ton of vlms and I know how much of a crapshoot they can be sometimes

dull star
#

yeah 50%

#

so that the lack of knowledge from cogvlm (pop culture or video games or etc) aren't as detrimental

turbid grotto
#

so there is still room for improvements

dull star
#

I'll make loras for video games and other stuff

#

its gonna be fun

#

when the model actually somewhat knows what you actually want

turbid grotto
#

I am curious, was dalle3 captioned by gpt4v?

dull star
#

think so

#

or whatever their in-house tool was for captioning

#

it is super detailed and smart

#

probably better than cogvlm

#

but cogvlm is still very good

turbid grotto
sterile pendant
#

The biggest room for improvement would be in the t5 portion of the model and its ability to map dumb prompting to magic in the network

dull star
#

man this looks photoreal

#

gave this image to some people and they thought it was real

wide pagoda
# turbid grotto now I am even more hyped, why didn't they use it as marketing... or I missed som...
noble coyote
#

SD3 into i2i Searge SDXL w/flow

dull star
#

people can just disable anything they want

desert garnet
dull star
#

like this looks good, and it doesn't have a dreamshaper or juggernaut feel to the face

sterile pendant
desert garnet
wide pagoda
#

It's still uncanny valley, just not the sameface of previous models (which you wouldn't expect it to be)

dull star
#

but yeah I wonder how much smarter it could have been if it was T5 only

sterile pendant
dull star
#

now this has a finetuned model feel to it though

dull star
#

its quite small

#

its still impressive for its size

#

and its better than cascade for example

sterile pendant
#

And look how fing powerful it is... I use it all the time for the prompt cohesion. So imagine if they had a much larger model...

turbid grotto
dull star
#

exactly

#

pixart is still the best model offline in my experience

#

for prompt adherence

sterile pendant
#

That's what sd3 could be if they went pure t5

#

But with actual image quality

dull star
#

for a 0.6B model, that like 1 stage and way smaller than cascade, this is really nice

sterile pendant
#

Yep

turbid grotto
#

yea pixart is crazy

#

are they cooking something else?

sterile pendant
#

Such a powerful little experimental model

#

I hope so

#

Competition drives innovation

#

There's also a couple of those Chinese dit models I think as well

dull star
#

HunyuanDiT

sterile pendant
#

Huendit or something

dull star
#

haven't tried Hunyuan but I think it wasn't that good

sterile pendant
#

And some other one that starts with an L I think

dull star
#

Lumina-T2I

#

I think

#

haven't tried that at all

sterile pendant
#

Haven't tried either either

#

Just know they're out there

dull star
#

the skin detail is nice on 2B

#

the eyes look nice too

#

I remember it looking a little buggy in the Goku images

#

it was undertrained back then

turbid grotto
#

hope finetunes won't get that 1.5 look sadcat

dull star
#

I wonder how much this "no refining, no upscaling" thing is true