#🆕|sd3

1 messages · Page 9 of 1

hallow lion
#

Comfyui nodes and workflow will arrive 5.2 seconds after the weights drop.

sterile pendant
#

I don't know about how long kohya will take to get it up and running with the UI, but I'm sure there will be some update to the accelerate and other python libraries that will support it and when that happens, you'll be able to do it manually with a CLI script (all koyha's interface really does is gives you a convenient UI to make the command, then runs it in the command prompt)

hallow talon
neon wagon
#

SD3 loras = 2 week after the weights
SD3 finetunes = 1 Month after weights

#

but good finetunes merges after weight need 3 month minimum.

dry wave
#

you don't need updates to accelerate (you don't even need accelerate if you use single gpu). You need updated on the training script

rain current
#

Where can I see the results of 2B?

bitter hearth
#

"the villain, a villainous villain making villain stuff, in a dark alleyway"

#

very villain

storm saffron
sterile pendant
# dry wave kohya sd-scripts *is* a cli script.

I know it is, I already said it's just a UI to make things easier to build the command. There's even a button to print the command in the command window so that you can use it without the interface if you desire. I said accelerate because that's literally what gets called when you run kohya, it sorts the rest out. But yeah, the other libraries, like I mentioned, will need to be updated before anything can propagate downstream from it.

bitter hearth
sterile pendant
#

Should only take a few days to weeks before you can train though.

vapid radish
dry wave
#

kohya_ss is the python script that does the training. It's not an ui

#

there are several UIs on top of kohya_ss, that is what you are referring to

#

accelerate had nothing to do with that. It's just a wrapper script that simplifies things like multi gpu training

sterile pendant
#

And accelerate can then call those other libraries. But yeah, I should have worded things a little clearer. Accelerate is like saying diffusers in the sense of it simplifies all the platform/hardware stuff and simplifies the calling of other libraries. Can you manually call those the hard way, sure. The training scripts will be updated pretty quickly, but won't work their ways into UIs until things like accelerate are updated, since you'll have people trying to train on a variety of PC environments

dry wave
#

yes and that is still wrong lol

#

you don't have to update accelerate

#

accelerate is really just a "simple" wrapper that allows to switch between different precision types and gpu setups.

sterile pendant
#

and you're 100% postive that it will be able to handle switching to the requirements that the DiT architecture has?

dry wave
#

doing that manually is not more effort or complicated than using accelerate. The idea of accelerate is that you can easily turn off these things without changing the code

#

yes

#

DiT is just a transformer architecture running on pytorch

#

and we had this discussion already but: DiT is using transformers, sdxl is also using transformers. You call the same pytorch kernels under the hood

cedar gale
sterile pendant
# dry wave doing that manually is not more effort or complicated than using accelerate. The...

i suppose we'll see, just figured there'd potentially be compatibility issues with SD3 due to all the components that will have to be shuffled around like if you're trying to train sd3+encoders+t5 at the same time. i imagine they will add in some kind of control over where to keep the various models, like having the T5 stay in ram instead of vram and so on. but yeah, at the core, they are all going to just call pytorch at the end of the day

dry wave
#

yes, but accelerate is not doing these kind of things

sterile pendant
#

i'll read through a bit more, but it definitely looks like it handles where shit sits

#

from a wrapper point of view

#

But anyways, again, other libraries are going to have to be updated, along with training scripts, etc etc, before you'll see UIs implment them. Your average user isn't going to want to go into command prompts with a scalpel to manually train. 99% of people are going to rely on things like koyha and onetrainer

#

So it will likely take days to weeks for all the stars to align to be able to just do it through a UI

storm saffron
sterile pendant
dry wave
#

stuff like cpu offloading, pre-conputing and so on have to be done by the library itself (diffusers or kohya_ss)

#

but you are right for sure that it will take some weeks until everything runs smoothly for the end-user

storm saffron
#

Yeah not everyone but the enthusiasts and early adopters will want to fiddle with cli scripts.

sterile pendant
# dry wave stuff like cpu offloading, pre-conputing and so on have to be done by the librar...

right, again, the main point is that people want to be able to do this in UIs like koyha and onetrainer, so all the various libraries used in those are going to need to be updated. there's a reason why linux only has a 2.3%(counting steamdecks) usage rate for home PCs, people hate CLI bullshit(even though modern versions don't make you do cli stuff really anymore, people still have the bad taste in their mouthes from when they tried linux ages ago lol)

storm saffron
#

We're just seeing new people starting to train pixart now OneTrainer added support.

sterile pendant
storm saffron
#

WSL does kinda help with that though if you truly wanted to train a lora the day SD3 comes out (assuming they release the LoRAs code same day)

sterile pendant
#

even if they don't release it day one, i'm sure people would try to frankenstein something up anyways

dry wave
#

if you have fine-tuning code going to lora is trivial

storm saffron
sterile pendant
#

it's not some new architecture or anything, llms have been using it for many years now, so a lot of the functions are already there. obviously, you just have to get the steps in the right order for the layout of how SD3s specific flavor works though

storm saffron
#

Wonder what happens if you give T5 and clip completely different prompts entirely that contradict each other when inferencing. 🤔

wide pagoda
#

Similar things have ready been done with SDXL
Anyway I assume clip would dominate

silver sluice
#

So after following the conversation, from my understanding, even tho people already have early access to the weights, none of those people have knowledge or ability to integrate or update the UIs

silver sluice
#

Seems like another missed opportunity by SAI, not giving early access to devs who integrate stuff and fine tuners, seems like a strong case for that VIP room

cobalt moon
#

Bro literally SAI staff

silver sluice
#

So we’ll have to rely on CLI scripts to generate anything after release? I realize someone said to expect workflow for comfy quickly after release but it seems prudent to just have given access to devs who would be making this work so it can be refined and ready for launch

cobalt moon
#

Lykon who also got access on SD3 Medium said he will train new Dreamshaper based of that model

silver sluice
cobalt moon
storm saffron
cobalt moon
#

SAI literally use Comfy to generate SD3 images yeah

sterile pendant
cerulean sphinx
#

Gradio is such easy to use that updating the UI to train SD3 should be quite quick once you've got a CLI script to train it.

silver sluice
silver sluice
cerulean sphinx
#

But personally I think using config files are easier in the long time than dealing with any UI. The only benefit the UI serves really is reasonable presets for training.

cobalt moon
cobalt moon
#

Everyone ( at least here ) hate regulation, but they must be there, for legal purpose

silver sluice
cobalt moon
#

You do not wanted people to easily generate porns with your face on and share it online

fiery wharf
cobalt moon
sterile pendant
cobalt moon
#

If you cannot show it then it is just a scaremongering

sterile pendant
#

(i was legit horrified to learn that it wasn't just a meme, that these were actual loras people trained...)

fiery wharf
sterile pendant
fiery wharf
#

reading comprehension 101

sterile pendant
#

this isn't checkers, it's chess

#

national safety is exponentially more imporant to a country than you playing with ai waifus

fiery wharf
#

if they truly cared about security they would regulate both

sterile pendant
sterile pendant
storm saffron
desert garnet
#

these reads like one of those evangelical clowns in america telling me why killin ppl is better and safer than sex

sterile pendant
#

and not doable by 99% of people that can type simple words out

sterile pendant
storm saffron
tiny otter
#

@storm saffron i need to create images where should i go

desert garnet
sterile pendant
#

it absolutely sucks and ideally should be zero

#

but modern warfare has cut it down astronomically

desert garnet
#

thats true,to increase that number we should train more models to automatically target ppl based on an algorithm

sterile pendant
#

i can dig up even more credible stats like this

desert garnet
#

perfect argument for ai robots in wars

sterile pendant
#

which is why im all for it in militaries. let the bots duke it out with the other bots and leave the people out of it. it would just be the modern version of siege warfare, minus wasting human lives

desert garnet
#

yea if your family dies because of a misplaced airstrike, its just collateral damage bro,man up

sterile pendant
#

the point is, that number will be far lower vs historical statistics

cobalt moon
#

Yeah, sarcasm 100%

sterile pendant
#

so yes, governments are going to continue to strengthen AI usage in their militaries and yes, they are also going to crack down on what citizens are allowed to do with AI usage. you can cry into the wind about it, but it is how it is

cobalt moon
#

We went from SD3 to whole AI military regulation and usage

desert garnet
sterile pendant
#

yeah, it was a dumb false equivalency fallacy thing that someone brought up, like usual

low stone
fair spruce
cedar gale
silver sluice
# low stone

can you do the same image but monochrome fuzz pets and instead of kids replace it with men in black suits? lol i dont know if you're taking requests or not but that one came out raelly nicely looks great

patent acorn
# cedar gale

i would better see the mice facing to the side instead

cedar gale
#

You can mount your hamster however you like.

patent acorn
#

no way

urban arch
#

Ribbed, for HER pleasure.

cedar gale
raven fern
frail shoal
#

i'm curious, is the SD3 prompt comprehension better than pixart sigma that has a 20GB T5 model ? Has anyone compared them both ? I'm currently using sigma with sdxl refiner and it can understand things like concepts, emotions and feelings and represent them visually like a real artist. Curious if SD3 text comprehension can do that

sterile pendant
# frail shoal i'm curious, is the SD3 prompt comprehension better than pixart sigma that has a...

The issue isn't so much the size of the t5, it's how well it maps to the image generation model. Think of t5 like chatgpt and how well it can handle and understand conversational input. Since it can better understand semantics like a red ball on a green cube on a blue table, it can then do a better job(in theory) of pairing those colors to those shapes when you go to generate it. That 20gb is it in fp32, there are also fp/bfp16 versions of it that are around 10gb. Pixart sigma is awesome, but is pretty small and not all that trained yet(in comparison to say sdxl finetune level image quality). SD3's t5 will likely be on par with pixart's, but the image creation half will blow it out of the water since it has a far larger dataset and more training to map the concepts

frail shoal
sterile pendant
#

Also keep in mind that SD3 has three input encoders: the same clips from sdxl and t5. I'd imagine you can run it with any combination of the three, maybe even pure T5, but I can't remember if they've verified that or not

#

yeah sigma is meant to be more of a research paper kind of model, they don't have the resources to fully "train" train it all the way

#

It costs a lot to make quality base models

#

but sigma is great for hashing a scene out to then be resampled with sdxl. it has kinda been the poorman's sd3 while we wait for sd3 to come out

silver sluice
#

how many gigabytes do you guys expect SD3 2B model to be? looking for a ballpark estimate

cedar gale
trail frost
#

Is there any open source image generation models other sdxl 1.0 and pixart

gusty trail
#

hunyuan-dit, lumina-t2i, Kandinsky 3.0

low stone
trail frost
frail shoal
frail shoal
#

and i need gpu room for sdxl refiner

#

i only have 6gb vram

lucid swift
#

living avian theropods

cerulean sphinx
sterile pendant
#

(well all birds are living descendants of therapods)

cedar gale
lucid swift
#

what the hell

cedar gale
#

Lol thats a dualtops.

lucid swift
cedar gale
#

😄

#

11 / 10.

lucid swift
#

😎

cedar gale
#

FBI catfish?

lucid swift
#

xD yes

#

the thinnest smartphone that exists android

#

the thinnest smartphone that exists android 1999

cunning lintel
fallen swallow
#

WTF ... this is awesome!

silver sluice
trail frost
#

Anyone here using Hunyuan dit if so can you please share some examples

sterile pendant
#

I need to reinstall it to mess around with it again

frail shoal
silver sluice
storm saffron
#

SDXL was 2.6B and ended up at 5Gb just for the UNET

#

Without VAE and CLIP

lucid swift
#

Lumina-Next-SFT promt: This image is a captivating digital artwork that portrays a surreal scene set in a misty, swamp-like environment. The dominant color palette is a deep, neon purple, which bathes the entire scene in an otherworldly glow. The atmosphere is thick with fog, obscuring the details of the trees and vegetation that surround the scene. The water in the foreground reflects the eerie light, creating a mirror-like effect that adds to the sense of depth and mystery.

In the center of the image, there is a figure seated on a motorcycle, poised as if ready to depart. The rider is clad in a dark, form-fitting outfit that merges with the shadows, and their helmet has a reflective visor that mirrors the intense purple hue of the surroundings. The motorcycle itself is sleek and modern, with a design that suggests speed and agility. The rider's posture exudes a sense of anticipation, as if they are waiting for the perfect moment to make their move.

Lightning bolts pierce through the dense fog, creating a stark contrast with the otherwise monochromatic scene. These bolts of electricity add a dynamic element to the image, suggesting a storm or a supernatural event. The lightning's jagged lines and the way they illuminate the mist create a dramatic and intense atmosphere, enhancing the overall sense of suspense and wonder.

The image is a blend of natural and surreal elements, creating a dreamlike quality that invites viewers to immerse themselves in its mysterious world. The neon purple color scheme and the ethereal lighting contribute to a feeling of otherworldliness, making the scene both captivating and unsettling." image

cedar gale
storm saffron
#

at least 8 GPUs are required for full fine-tuning of the Lumina-T2X 5B

Ouch.

lucid swift
#

at least it says to b in the web gui

storm saffron
#

There's a bunch of models and it's not that clear how to actually use any!

lucid swift
#

yes they shuld have sepereated models into diffrent githubs

#

but i guesse more stars or someting

storm saffron
#

no requirements.txt either for pip

lucid swift
#

💀

#

Lumina-Next-SFT

storm saffron
#

What's SFT?

lucid swift
storm saffron
#

Not just the base model safetensor's version?

lucid swift
storm saffron
#

Ahhhh ok

#

It doesn't look that fine tuned from the pic above. 😄

lucid swift
#

xD

storm saffron
#

What was your prompt?

lucid swift
#

i think it uderstands natural laguage better so my promt was not optimal "photograph of ghost special force agent, adorned in all-black human anthropomorphic furrsona fish in fursuiter at a con highly detailed, the interplanetary from "2001 a space odyssey"

storm saffron
#

I'm gonna compare it with my PixArt finetune.

#

Have you tried their 2K model?

lucid swift
storm saffron
#

There, my pixart FT

lucid swift
#

this one is more like a fish

#

"photograph of ghost special force agent, furrsona detailed, the interplanetary"

storm saffron
#

Anthropomorphic, ✅

turbid grotto
#

I have seen some disagreement about sd3 75 tokens due to clip or 512 due to T5 a while ago, is this still concern or it is not a problem?

storm saffron
#

OK now it's gone more furry since I changed furrsona to fursona

lucid swift
#

we are not sure how it will behave

lucid swift
storm saffron
#

Comfy needs a spellchecker node

turbid grotto
storm saffron
#

I guess it's going for Ghost Recon?

#

Considering it's a 600M param model that I badly finetuned, it's doing alright. 😄

lucid swift
#

yes thats smaller then 1.5

storm saffron
#

Well DiTs behave more like models 1.5 - 2x the size of their UNet counterparts of the same parameter count.

storm saffron
#

I mean they behave more like models bigger than the equivalent UNET model.

storm saffron
#

Hard to explain. A 2B parameter DiT is more like a 3-4B parameter UNET in capability

lucid swift
# lucid swift

vs sd3 but sd3 was only able to fit a small part of the promt

lucid swift
#

i think one problem of the lumina model is that they use sdxl vae

storm saffron
#

That could be an issue, especially with small faces and text.

#

Same problem with PixArt

#

IDK if I can be bothered to get Lumina working with SD3 just a few days away

lucid swift
#

lumina vs sd3 (but sd3 is like 8b and lumina is like 2b ) The image depicts an alien-like creature with a large, elongated head and dark, almond-shaped eyes. The skin appears textured and rough, reminiscent of reptilian or amphibian skin. The creature is sitting in a body of water, partially submerged, with its legs and lower torso hidden below the surface. The background is foggy, adding a sense of mystery, and features tall, thin reeds and barren trees, creating a marsh-like or swamp environment. The overall atmosphere is eerie and otherworldly, with muted colors and low light enhancing the creature's unsettling appearance.

silver sluice
storm saffron
lucid swift
#

yes

#

but the second one looks more like a photo

#

but i never said photo in the prompt

storm saffron
#

PixArt. (600M param)

lucid swift
storm saffron
#

I'll do a widescreen version, which I think would enhance it a bit

lucid swift
#

"The image shows a cozy, eclectic room with a vibrant, colorful ambiance. The ceiling is draped with multiple tapestries featuring intricate designs, including mandala patterns and depictions of plants and celestial motifs. The lighting is soft and atmospheric, with various sources contributing to the overall mood: Ceiling Lighting: There are red, pink, and purple lights that illuminate the tapestries, highlighting their patterns and adding a warm glow. String Lights: Multi-colored string lights are draped around the room, adding to the festive and relaxed atmosphere. Television: A flat-screen TV on the wall displays a scene from the animated show "The Simpsons"

lucid swift
storm saffron
#

Lumina seems pretty undertrained or not at all finetuned.

lucid swift
lucid swift
storm saffron
#

PixArt constantly impresses me for its size

lucid swift
#

did stabillety improve fursuits?! "The image shows a person dressed in a colorful fursuit against a plain pink background. The fursuit features a large, stylized head with prominent, pointed ears that are white on the inside and purple on the outside. The head is covered in bright red fur with a white stripe across the face and purple fur around the muzzle. The eyes are large, black, and oval-shaped. The person is wearing a black shirt with long sleeves and a purple skirt. The sleeves have blue and purple striped arm warmers extending to their paws, which are covered in purple fur. The lower part of the fursuit includes purple fur-covered legs and feet. The overall style is vibrant, playful, and highly stylized, typical of fursuits often seen in the furry community."

#

maby the lumina model wants diffrent promting or something

storm saffron
#

I guess they would have trained it on fursuits to please a certain demographic?

storm saffron
lucid swift
storm saffron
#

Mascot costume is not as bad, but sill not good. 😄

#

notes If I did a retrain on my finetune of pixart add more furries.

lucid swift
storm saffron
lucid swift
storm saffron
#

I'll try your prompt in it

lucid swift
#

the promot is just made with chat gpt so it might not be a good one

storm saffron
#

It'll probably miss out half of it due to being SDXL

lucid swift
#

xD

storm saffron
lucid swift
#

classic clip

#

its better then bais. but it does not look like a real fursuit xD

storm saffron
#

It's close, hold on.

#

Better?

#

A person in a grey wolf fursuit at a comicon

lucid swift
#

yes it looks better

storm saffron
#

that's dreamMODE CosXL on civit if you want it. 😄

lucid swift
#

i just asked chat gpt to generate the image with the promt and it made this

storm saffron
cedar gale
#

I have no idea I tested your prompt.

storm saffron
#

That seems like a lot of fire for it not being in the prompt. 😄

cedar gale
#

Ok ok I admit, I might have spiced it up a little.

storm saffron
#

did you just add "on fire"

lucid swift
#

storm saffron
#

felt puppet nightmare, night, fog

cedar gale
#

Noice.

#

Pikachu edition.

lucid swift
#

lumina cant do text

cedar gale
lucid swift
#

Lumina, idogram, sd3 "The image is a top-down view of a photograph placed on a wooden surface. The photograph appears to be in black and white and has a vintage quality. The edges are clean and straight. The subject of the photograph is a person standing and wearing a long dress with a high collar and buttons down the front, with their hands clasped together in front. The person is wearing a mask or headpiece resembling a goat's skull with large, curved horns. The background behind the subject is dark, suggesting the use of a flash which highlights the subject and makes them stand out against the darkness. The photograph itself is not dirty or damaged. Below the photograph, handwritten text reads: "Fear is weakness.""

storm saffron
#

Hands are no better on Lumina either

#

PixArt (once I added polaroid)

storm saffron
cedar gale
#

Tell me more.

sterile pendant
storm saffron
cedar gale
#

As u know me. 😉

storm saffron
#

Research is left to the reader. 😉

cedar gale
#

I do thank you kindly for the nice bat minion tho. 😄

sage burrow
vapid radish
bitter hearth
#

lmao the explosion became a christmas tree

silver sluice
#

can the SD3 API generate stuff that's like licensed/copyrighted? Like if you ask for it for super mario riding a skateboard on a sidewalk will it do that or will that throw an error?

silver sluice
#

oh wow so like fully unlocked lol cool thanks for the share

vapid radish
# bitter hearth gimme prompt for these 2

The Monster was:
In a foggy mountain pass, a repulsive creature emerges, but its form is indistinct and partially obscured by the dense mist. The winding road disappears into the thick fog, flanked by towering pines that create a tunnel-like effect. The fog muffles sound, creating an eerie silence and a sense of mystery and solitude. The creature's grotesque appearance is only vaguely visible through the haze, blending seamlessly with the eerie atmosphere of the journey through the unknown

The Cat:
cinematic photo film still of a little cat, (close shot), snow treading, rain dripping, fog filling, .shallow depth of field, vignette, highly detailed, high budget, bokeh, cinemascope, moody, epic, gorgeous, film grain, grainy, cinematic photorealistic, 8k uhd natural lighting, raw, rich, intricate details, key visual, atmospheric lighting, 35mm photograph, film, bokeh, professional, 4k, highly detailed

low stone
#

So they just released a new version of Lumina-Next-SFT which is kind of impressive and more prompt following. I prodded city96 to see if we can get comfy support for it.

bitter hearth
#

@vapid radish am I doing it right waow

vapid radish
bitter hearth
lucid swift
storm saffron
vapid radish
dull star
#

And fine-tuned models are yet to come

patent magnet
#

Side by side comparison: left is SD 3 Ultra raw, right is the Upscaled + Eye-Corrected version

#

Side by side comparison: left is SD 3 Ultra raw, right is the Upscaled + Eye-Corrected version

#

Side by side comparison: left is SD 3 Ultra raw, right is the Upscaled + Eye-Corrected version

#

Prompt: anime art, 1girl, kemomimi, harpy, curvy, [white|black] hair, looking at viewer, [yellow|aqua] eyes, hood, fur coat, underbust, corset, night, moonlight, castle, sitting, squatting, Large_wings, hands between legs, from below, rooftop, wide_shot

vapid radish
silver sluice
torpid forge
patent magnet
silver sluice
torpid forge
patent magnet
torpid forge
#

I'm just making little goats

#

and clowns

severe phoenix
brisk bay
#

a fat man 40 years drinking beer funny picture

bitter hearth
torpid forge
wild remnant
fading fiber
#

nice

fleet meteor
sick cedar
#

can anyone tell me what version of the T5 model SD3 uses?

patent acorn
#

yo sd3 make me an image of maniac laughing rick from rick and morty

low stone
# fleet meteor Amazing gens

Llama3 prompt (it's so creative) and SD3. June 12th is gonna be great: A grotesque Trump mask made from human skulls and twisted metal hangs upside down from a rusty chain suspended above a trash-strewn alleyway illuminated by flickering fluorescent lights casting eerie shadows on crumbling brick walls.

low stone
hallow lion
#

2 more days

#

😄

#

now it's not 2 weeks!

#

just 2 more days

desert garnet
#

after that we only need to wait 2 weeks for 8b 👍

bitter hearth
#

I miss the good old days

compact forge
desert garnet
compact forge
#

great, cant wait

bitter hearth
#

@desert garnet thomas since when did you become the sd3 ceo

bitter hearth
fallen swallow
#

^^

torpid forge
severe phoenix
#

are thee maade witth ideogram?? 😲

noble coyote
muted dove
hallow lion
remote holly
#

Have developers of tools like a1111, fooocus or comfyui received the sd3 models in advance or will we have to wait a bit for integration?

#

Or can we use the model directly?

jolly abyss
#

Oh and swarm obviously too.

remote holly
#

ah too bad, in any case I can't wait to see the workflows that it will be possible to do in comfyui

mild bramble
#

Request: Can anyone try to push sd3 to its maximum capacity for realism in ai images

lucid swift
low stone
sage burrow
turbid grotto
#

made a mistake in word "truck"

#

lykon is teasing with further trained 8b sponging

low stone
cedar gale
low stone
rain current
past flame
#

Ah yes

#

The Hort

patent acorn
#

could have some simpsons comic parodies in sd3

low stone
#

news reporter with a microphone standing in front of a chaotic scene where a hideous gigantic monster is destroying buildings and stores

lucid swift
noble coyote
#

SDXL, not SD3

lucid swift
patent acorn
#

who pinged?

sage burrow
fallen swallow
steep widget
#

New system for audioreactively generative geometries, intervened with various SD configs.

You can access this new TD patch and SD configs (3), plus many more systems, experiments, and tutorials, through: https://linktr.ee/uisato

#touchdesigner #stablediffusion #generativeart


0:00 - AI intervened - 1
0:07 - AI Intervened - 2
0:15 - AI Int...

▶ Play video
noble coyote
low stone
rain current
noble coyote
fair spruce
pseudo stone
fair spruce
raven fern
#

just like Naruto says in the dub: "Believe it!"

sterile pendant
#

What if it was a dyslexic mixup in d/m/y vs m/d/y and they actually meant December 6th?

fair spruce
cunning lintel
dull star
#

so close 🙏

#

its so good that we have a date

#

and 8B will come later and will be released as well

mortal mesa
#

is it known that its comfyui ready?

mild bramble
mild bramble
raven fern
#

or rather, available day 1

mortal mesa
raven fern
#

maybe ask comfy directly :3

ancient cape
#

same for stableswarm, if you wanna generate grids

dull star
#

exacly, comfyui and stableswarm are day 1 things

#

anything else is probably gonna take a few days if not weeks, don't know

#

we should also know that controlnets are not going to be day 1 things, but we might expect 1.5 quality if not better due to the MM part of MMDiT

#

it's just that it may take more time for people to do research about it first

#

as long as regional prompting is day 1 and they figure out a pos embed fix so that we can get proper highresfix, I'm perfectly satisfied for the time being

raven fern
#

i dont care about controlnets right away, il be happy enough with t2i and i2i for now 🙂

dull star
#

oh btw they might've already started training 8B

raven fern
#

😮

dull star
#

Lykon's been posting pics

turbid grotto
#

I dream about memes finetune

dull star
#

then again, these must've used Ultra's workflow with like highresfix and everything

dull star
#

I can't wait

#

and since its 2B, we can actually train it thomas

turbid grotto
dull star
#

no, with any Unet model to my knowledge

#

but with DiT models such as pixart and now SD3, highresfix is broken

dull star
#

for Pixart, the entire image is noisy and distroted, and with SD3, the area outside the base resolution part is a blurry mess

turbid grotto
dull star
#

if you go out of resolution range without fixing the positional embedding handling or using tiling, it does this (clear image in center, distortion on the outer edges)

#

from Alex (mcmonkey)

dull star
#

then agian, lykon's image must've been with upscaling, and they look wonderful

#

maybe low denoising is okay? 🤷‍♂️

#

I really hope its just a case of fixing a part of the code and not a limitation with DiT

turbid grotto
#

also tiled controlnet might be more effective

dull star
#

but the VAE is superior though

tropic plume
#

guys

dull star
#

this an image withotu highresfix and the woman's face doesn't look like a mess

#

the eyes look perfect

tropic plume
#

do i create images here or can i invite bot to my dms

dull star
tropic plume
#

thankyouuu

turbid grotto
dull star
#

no idea actually

turbid grotto
#

hope not 4 times slower waow

dull star
#

I hope its mostly vram difference and it can be solved with a tiled VAE

#

but I hope its not 4 times slower

tropic plume
#

it wont let me type anything in that channel

teal fossil
# dull star

Is he using the 2B we'll get or a version of the 8B?

dull star
#

these are 8B pictures

teal fossil
dull star
#

honestly doesn't look any better than 2B right now

hallow talon
#

Has there been any update on what time on Wednesday we should expect the model to be released?

dull star
#

don't recall

#

just simply expect the worst and you won't be disappointed

#

I would wager midday to night in the US

#

I'd be surprised if they released it around greenwich midday time

hallow talon
#

as long as it comes out at some point on wednesday as promised I'm happy lol.

#

thanks 🙂

dull star
#

it cannot possibly be delayed, the model is trained well now

hallow talon
#

oh yeah, wasn't expecting it to be! Might've worded that wrong. I was just curious what time to expect it 🙂

vapid radish
dull star
#

GUYS

#

YESSSS

#

YESS THANK YOU COMFY

#

excellent!!!!!

#

we can leave prompts emtpy to try it all out

bitter hearth
#

Comfy

#

(the UI)

hallow talon
#

hopefully someone will make a workflow cause I can use comfy but I don't know exactly how to connect everything together properly (especially now that it got more complicated with SD3)

remote holly
#

How many gb takes sd3 to memory including all components ?

gusty trail
#

cliploader cliptextencoder not a very good name for t5

dull star
#

yeah idk why its still cliptextencodesd3

#

when there's a T5 as well

#

but I suppose its for familiarity

dull star
vapid radish
#

Do we know how big the SD3 model file is going to be yet? I did just order another 4TB SSD just in case 😀

dull star
#

2B isn't going to be big, especially since its going to be at bf16 or fp16

#

it's T5 that's gonna be massive of course

#

which is 10 GB if you haven't installed it

turbid grotto
#

why T5 is so huge 💀

dull star
#

its a large LLM model (large large language model kek)

#

even if we are only using the encoder part of it

teal fossil
dull star
#

I mean if they train it more, I don't mind

#

okay maybe I do, cause I kinda want to try the model offline now 😔

teal fossil
dull star
#

no idea, but cascade also needed a shift node

#

yayy its alex!!!!1!

teal fossil
viral plaza
dull star
#

I think T5 will make a difference

teal fossil
dull star
viral plaza
#

great for getting set up and still lets you take the workflow itself and muck with it after at will

teal fossil
viral plaza
teal fossil
#

Hi Alex. 🙂

viral plaza
#

Also it's possible to store the model in FP8, making it only a 2GiB file when you do that

dull star
turbid grotto
dull star
#

it loads WAAAY faster than fp32

viral plaza
#

hypothetically if we did it should work fine and would be a ~2.5GiB file

dull star
#

I wonder if a ggml implementation of encoder models such as T5 would help 🤔 (I'm a broken record)

viral plaza
#

however you can just not use T5 at all as an option, imo probably the best launch-day option

#

yeah t5 works well in ggml 4bit, just need a convenient way to shove that into comfy

dull star
#

there's llama-cpp-python pip package or whatever its called

teal fossil
turbid grotto
teal fossil
#

Basically the difference between mere Dreambooth and Finetunes / multi-concept-LoRA's

viral plaza
teal fossil
dull star
#

well there's bnb4bit

#

we can load it in 4-bit, but storing is a different question

bitter hearth
#

no clue what im reading

viral plaza
teal fossil
viral plaza
#

and for reference when I say this, I'm basically personally the reason SDXL has tencs in the model lol, others wanted the tenc separated for XL but i fought to include it because training it is so worthwhile and the tencs are only 1 out of the 7 gigs of space the model takes anyway

#

balance is different for SD3 so I can't really argue it on this one

viral plaza
teal fossil
#

Textstream == the new Ddit architecture?

dull star
teal fossil
viral plaza
# teal fossil I've been hearing about Sigmas more and more for a few weeks, but apart from "so...

Uh so the short of it is:

  • in early diffusion models, we had timesteps 1-1000 exactly
  • in the modern era of diffusion, we now have dynamic steps (eg 20 step or 50 steps or etc) and that worked by converting to approx timesteps (so eg for 50 steps, you multiply your step by 50 to get a timestamp value) and adding the sigma value to represent the timesteps in a way the model can now actively process
  • the shift effectively curves the timestep space, so it can spend more time in the early (structural) steps or more time in the later (detail) steps

If you've ever used my Dynamic Thresholding toolkit in auto/comfy/swarm, the CFG Scheduler feature is very similar to sigma shift (albeit of course using the CFG rather than sigmas to push this preference schedule)

#

in the case of SD3 I think sigmas are basically just linear by default until you apply the shift (vs other models had more of an algorithm to em)

#

(the sigma goes through an embedder to turn into latent magic inside of the model)

teal fossil
#

So Sigma Shift 0.5 -> more time for structure Vs Shift 2 -> more time for detail?

viral plaza
teal fossil
#

Like we could (later down the road once it's fixed) go 0.5 Shift for the initial image and Shift 2 for the HiresFix / Ultimate Upscale / whatever?

viral plaza
viral plaza
rugged nova
dull star
#

this was 5 days ago, idk if something changed

#

but we can do tiled upscaling for the time being

viral plaza
#

SD3's default reference recommended shift is 3

#

it makes sense to play in the range of 1.5 to 3

#

you probably wouldn't ever do below 1

dull star
#

nice, this is a good demonstration

viral plaza
#

("RoPE is "rotary positional embeds" and "RoPE Scaling" is a technique to scale RoPE, SD3 has its own pos embedding logic)

teal fossil
viral plaza
#

TI embeddings have been awesome the whole time

#

they are incredible on XL

#

i wish people would use them more / write good tools for them

#

it's theoretically possible to train a small TI in a few seconds (some people even published code that does this before, then deleted it off github argh)

#

LoRAs are more flexible obviously but for single-concept training TIs are hard to beat for speed/quality/usability

dull star
#

also aren't they going to work across all 3 (or 4) model sizes?

#

since they use the same clip?

dull star
#

idk what happened, but stylistic TIs were really good in quality

#

we had like midjourney TIs and Greg Rutkowski TIs

#

they were nice and super small in filesize thanks to TI's nature

#

would making a TI for SD3 have nearly the same procedure as making a TI for SDXL?

dull star
#

eh but I'm inexperienced in training 😔

#

I'll have to wait for random rentry blogs lmao

#

all I did is use google colab and then later some super braindead easy GUI to train Loras for SDXL

fleet meteor
#

I think I was decent in training untill I tried training a lora / checkpoint with more than 30 images

#

My model broke

dull star
#

damn

turbid grotto
#

and convenient

dull star
#

ah, CEST has 1 day left

dry wave
#

I found TIs amazing in SD 2.1 but in SDXL they didn't worked well for me

#

also I had the feeling that TIs overfit much worse than unet training

wild remnant
viral plaza
viral plaza
wild remnant
raven fern
wild remnant
#

@raven fern lol

coral sable
#

hype is unreal, last day (or 2) of waitingcatroll cowroll pikaroll

raven fern
wild remnant
#

@raven fern

raven fern
#

❤️

coral sable
#

release will be as planed on schedule?

raven fern
#

all according to keikaku

coral sable
agile hornet
#

When it drops on huggingface I should be able to just grab the model and toss it in A1111 right?

low stone
low stone
#

There'll be comfy support at launch. Not a111

agile hornet
#

noooooo

low stone
#

Comfy is written by stability ai. The other guys are their own separate devs so they'll have to work on integrating afterwards.

fleet meteor
#

Aaa I can´t wait to try it

raven fern
#

132k lines changed 😮

molten valley
low stone
dull star
low stone
viral plaza
# agile hornet noooooo

you can use it in Swarm as well on day 1, which has a friendly interface like auto1111 does

viral plaza
viral plaza
viral plaza
low stone
viral plaza
dull star
#

excellent!

viral plaza
coral sable
#

updated and rdy ^^

viral plaza
low stone
#

Is there a new ksampler as well or still using the same one?

dull star
#

nope

#

all of the nodes I've shown you are the all of the nodes I've shown you

#

🔥

low stone
#

Ah

agile hornet
#

Not sure if I can run Swarm, I got a 2080TI with 8GB of ram and all the other ones work well but I heard swarm uses more recources to run is that true?

viral plaza
viral plaza
#

it's way more efficient

agile hornet
#

nice

#

im gonna set it up today then

dull star
#

can we use all samplers and schedulers?

#

how does the flow matching influence it

viral plaza
#

ancestral/stochastic are incompatible with flow

low stone
raven fern
#

go with the flow 🙂

dull star
#

we'll see, about that..........

viral plaza
#

AYS won't work out of the box, but can be retrained for SD3

raven fern
#

so nvidia has to release something new for sd3?

dull star
#

it's gonna be cpu by default?

viral plaza
#

alternate guidance options like SAG and PAG will probably just flat not work as-is and need new research to make things like em

raven fern
#

yea some new research

dull star
#

and freeu neither, cause its specifically for unets

viral plaza
raven fern
#

yea basically anything unet related wont work

viral plaza
#

as is cacheysamply :(

#

my precious baby, killed to death twice in a row

raven fern
#

i personally like the Inspire pack ksampler for ays, cause it's already built in, dont have to use a lot of nodes for AYS

dull star
#

at least we can experiment with stuff on day one, like messing around with text encoders, textual inversion embeddings, tiled upscaling

#

what about regional sampling?

coral sable
#

ok I'm convinced to try SwarmUI, isn't it on Pinokio, cant find

low stone
raven fern
#

wait, i just noticed comfy did a small fix to cosxl edit models, does this mean they fixed the artifacts? im gonna check 😮

dull star
#

oh shit I just realised

#

top-bottom memes will work perfectly due to regional sampling

carmine blaze
#

is API call the only way to test SD3 right now?

dull star
#

currently, yes

#

but just wait till 12th and you can use it offline

carmine blaze
#

sweet thx

dull star
#

you're welcome

raven fern
#

didnt fix the artifacts, welp

#

actually wait, il try with some other pics

dull star
#

hmm I wonder when we'll get inpaint models

#

ngl it would be perfect for SD3

raven fern
#

man greatness awaits 🙂

viral plaza
raven fern
#

nice

viral plaza
# coral sable ok I'm convinced to try SwarmUI, isn't it on Pinokio, cant find

swarm is super easy to install natively, you don't really need an install manager like some other uis need https://github.com/Stability-AI/StableSwarmUI?tab=readme-ov-file#installing-on-windows

GitHub

StableSwarmUI, A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. - Stability-AI/StableSwarmUI

low stone
# dull star what about regional sampling?

I think I was able to achieve just directly what I did with regional promoting with this one. One of the things that's so impressive about sd3 so far is the variation available between seeds. It's not just the same image from a slightly different view each time. Every one is very different which is great.

raven fern
#

@viral plaza do you know when we will get an edit model for sd3, like cosxl edit?

viral plaza
viral plaza
#

my lil terminator kitty looks awesome

#

i set the creativity a bit too high so there's some discoloration on the edges but not too bad unless you're looking really really close

#

this isn't some SD3 magic btw this is just Swarm's inpaint code working well lol

#

can do the same with XL

viral plaza
raven fern
#

yea

#

also, look at Alex flexing with the release candidate version 🙂

dull star
#

like yeah its 0.9 strength, but still

viral plaza
#

can of course also just do the mask better to prevent the discoloration

dull star
#

kek

raven fern
#

nice kitty

viral plaza
#

oh actually turn off Mask Shrink Grow and this looks really good

dull star
#

how come a1111 users didn't switch to stableswarm yet, it even has these convenient features similar to it

viral plaza
#

... anyway tldr point is, if you use a good UI like Swarm and fiddle settings a bit, you don't really need an inpaint model

viral plaza
#

most people once they try swarm never go back to anything else

dull star
#

its not 1:1 to a1111, so therefore it must be bad thomas

viral plaza
#

comfy users it's a 100% no-brainer to use swarm, auto webui there's differences to learn but ... like, most of the differences are improvements so lol

#

the one pain point for auto users is if you have old auto extensions you really like - you can usually find a comfy equivalent, but if you don't like the noodles, it's awkward

coral sable
#

waiting installation to finish :c

viral plaza
#

swarm is generally less reliant on extensions though, a lot of stuff is built in

#

you only need extensions when you're getting really really crazy

coral sable
#

meanwhile octopussy v2 (SD3)

raven fern
#

stable swarm seems very well built, il try it one day

coral sable
#

not sure if she be eating herself or that's how she is

#

when lambo sd3🤣

ocean lance
#

just saw yours is roughly the same size

viral plaza
#

that looks like it's the same thing just bf16 instead of fp16, so yeah

ocean lance
#

do you know why the team chose xxl over xl? xl seems to be more popular just from searching for fine tunes of xxl

viral plaza
ocean lance
#

bigger is better 😄

hallow lion
#

Prepare the consistency nodes for the arrival of SD3.

#

the only thing left to conquer - consistency.

#

we have it in lighting btw

#

😄

#

we just need shape/texture consistency now

#

1 day left

#

dont predict the arrival of the mesiah

#

well in this case we can

low stone
faint breach
#

can sd3 make it 2 more days from now though? can it time shift?

frozen lynx
#

guys we gotta think of something to make time go faster so it's 2 days from now

viral plaza
#

("anticausal" = model that's trained to solve in the reverse of the direction of time)

faint breach
#

using improbility fields? thonk

viral plaza
#

i was thinking MM-DeLorean

faint breach
#

this is heavy

frozen lynx
#

like .. if u move fast enough u time travel to the future right? we all just gotta get on a spaceship that can travel at near light speed

faint breach
#

naw. that'll just age us infinitely before we get back to the same moment

wild remnant
faint breach
#

if i eat that will it be 2 days from now?

ocean lance
#

maybe from a food induced coma

compact forge
#

whos ready for unexpected delay on release day like last time 😆

sterile pendant
compact forge
#

obviously if you delay something right as people expect it to come out

frozen lynx
#

I think I figured it out. so like time is a measurement of light moving, so if we can move at light speed, it'd be like pressing pause on a video game but then we could just skip to any point in time instantly so to get to SD3 launch all we gotta do is go fast like sonic

desert garnet
sterile pendant
compact forge
#

yeah man fucking leeches

#

i hate people so much

ocean lance
#

is this the first model that hasn't leaked?

desert garnet
sterile pendant
#

But for real, people will complain about anything. Even when it does come out on time, people will complain that they can't run it on their 2gb vram gpu and 8gb ram laptop from 10 years ago

#

Or that it "suck"

#

Because no bobs n vegine

compact forge
desert garnet
#

too many bobs and vegana,ban model pls

sterile pendant
#

They just happen to come from the two largest countries on the planet

compact forge
sterile pendant
#

It's illegal in both those countries.

desert garnet
compact forge
viral plaza
#

like the person that saw "stable audio open" uploaded to HF and stole it and reuploaded a "leak" with the word "open" removed

#

(i really wonder what the goal was. Was it, like, clout chasing? or... why tho?)

compact forge
viral plaza
#

vs XL for example was sent out to anyone with a .edu address

ocean lance
#

so why not anyone with .com address? ;p

#

any sensible training parameters to recommend for training a lora?

viral plaza
#

that'd be a @lavish osprey question

bitter hearth
#

Is there a free way to use sd3?

ocean lance
noble coyote
junior dune
#

What’s the estimated vram requirement for sd3?

#

12gb and 32gb ram?

junior dune
ocean lance
#

since T5 can be loaded in CPU ram

#

pixart is a 0.6B model, requires 1.5 GB of VRAM

#

so roughly 5gb for the mmdit

#

loading clip g and clip l uses 1.8gb of vram

viral plaza
#

you can just not use t5

#

at which point the reqs are <= sdxl's reqs

ocean lance
#

comfy only uses 3.6gb to run sdxl

#

does that make the 2B model excluding the clip models less than 1.8gb?

junior dune
#

Does it generate a link you can access on mobile on another network like A111 does?

viral plaza
viral plaza
#

mobile support is a liiil wonky atm

#

it works but not a great experience tbh

ocean lance
#

is comfy offloading the clip to cpu maybe?

viral plaza
ocean lance
#

so might be able to run on my 6gb laptop card then :>

viral plaza
#

yeah probably

ocean lance
ocean lance
viral plaza
visual hamlet
#

哈哈哈

teal fossil
#

@viral plaza Does Swarm's Inpaint put the whole image through the Vae (degrading the image over time) or is it using a stitch-approach to mitigate that?

viral plaza
teal fossil
ocean lance
#

does swarm have tiled upscale?

teal fossil
#

Kinda like KoboldCpp + SillyTavern.

remote holly
#

It will be possible to use sd3 online tomorrow ?

#

Online and free

haughty jasper
#

Where can I use SD3?

cobalt moon
#

ComfyUI just update its compatibility to SD3

#

the Midnight is coming

#

uhhhh... actually are they going to just do the license announcement along with open weight

viral plaza
#

(can do in comfy tab of course)

haughty jasper
#

Is SD3 available for download now? I thought I could experience it on Discord.

cobalt moon
haughty jasper
#

But I couldn't find the SD3 bot in Discord.

cobalt moon
haughty jasper
#

okk

remote holly
#

Sd3 on hugging face ?

viral plaza
cobalt moon
#

yeah

remote holly
#

Ha cool , i hope it will run correctly because stable cascade have lot issues with diffusers lib

ocean lance
torpid forge
viral plaza
ocean lance
storm saffron
#

@viral plaza do you know if SD3 will work with tensorRT (not sure anyone would have tried it yet)?

viral plaza
coral sable
#

bro Swarm has everything, you were right. I'm rdy for SD3w00t

bitter hearth
#

has cat waow

coral sable
bitter hearth
lavish osprey
#

at the moment we didn't test any style or character lora (well, we briefly did but then we had to move out to other stuff).
We did lots of aesthetic or "fix" low rank training

prisma zenith
#

So I read thru the history to try to understand. But I still have a few questions. Woudl appreciate if someone could fill me in 😄

SAI is releasing SD3 Medium, but there could possibly be a small and a big? Also what is the impact of the 2B vs the 800M vs the large? Is the impact more on the model's ability to translate text into images and understand what it's trying to create? Or will it have an impact on image quality too? (my understanding is that it's a bit of both because it will have less latent's to draw from right?)

Finally what is SD3 Ultra? is it a comfy UI workflow when using the SD3 Api? Or is it the large model?

Thanks to anyone who takes the time to answer!

lavish osprey
#

It's style aligned well enough to allow most types of lora training imo

bitter hearth
lavish osprey
coral sable
lavish osprey
#

that being said, 2B mmdit is likely all a human can ever need.

bitter hearth
coral sable
#

SD3, same prompt(lots of cat)

bitter hearth
lavish osprey
prisma zenith
#

What about what SD3 Ultra is? Is it a workflow? or the larger model?

lavish osprey
#

it's only "Ultra" in theory, not "SD3 Ultra"

prisma zenith
#

but what is it? a model? or a workflow to make the most of the output from the model currently on the API

lavish osprey
#

what's in Ultra is a trade secret, you can think about Core and Ultra in terms of msg, which is salt on crack, while base models are just salt.

prisma zenith
#

I see, so Ultra is basically the model that is available via API is that it?

#

or is that the one available via stable assistant?

bitter hearth
coral sable
lavish osprey
#

there is no model named ultra

prisma zenith
#

so what is ultra then? a workflow?

lavish osprey
#

ultra refers to an api endpoint, what's behind that is not for me to say

prisma zenith
prisma zenith
#

thanks!

bitter hearth
prisma zenith
coral sable
#

that's evil, I'm hungry

lavish osprey
bitter hearth
lavish osprey
lavish osprey
# bitter hearth

@viral plaza give me the yellow name so I can start banning people who put pinepple on cheese and tomato sauce

prisma zenith
#

got it! i'm looking at the api now. This one has Sketch & Canny as controlnets right?

coral sable
#

tested some MJ prompts with SD3

lavish osprey
bitter hearth
#

runs away

viral plaza
viral plaza
lavish osprey
#

let's just piss off reddit

teal fossil
#

And I wonder how long it'll be until Kohya & OneTrainer add manual Clip-G/L training for concepts.

#

And dedicated simple Embedding training.

lavish osprey
#

you don't really need to finetune TEs for concepts

#

historically NAI used SD1.4 TE without any change

#

and Pony finetuning TEs kind of destroyed them, to the point it lacks basic knowledge like "a bank"

#

the first 5-6 months of SDXL lora finetunes didn't touch the text encoders and they mostly work very well

#

even with made up words as activation

#

(as a matter of fact, I'd suggest to keep the tes untouched and just use them for preprocessing)

ocean lance
storm saffron
teal fossil
teal fossil
dry wave
#

that's how the original dreambooth paper worked: they used a "non-sensical" input token and trained on that

#

instead of using one token you can also just use a bunch of tokens (like your name)

#

as more rare (and non-sense) the token is as harder it is for the model to learn from it but there is also much less overfitting and damaging then

dry wave
lavish osprey
lavish osprey
# dry wave each token gets an embedding, yes

^ this is mostly correct. The model will produce an embedding of your prompt depending on the tokens. So there sre still going to be vectors that the Unet/DiT can "catch" to understand new concepts

#

with SD3 you also have more stuff you should check when training, like "did I ruin text understanding" or "did I create conflicts among the various text encoders"

amber nexus
#

If the SD3 2B is released tomorrow, will I be able to use the inpainting or upscaling features directly in ComfyUI?

outer cloak
#

A quick dumb question is there a fixed time for the release like 12PM PDT or something?

teal fossil
#

Another thing - I started pruning my LoRA's per Weights - for some concepts that was highly effective and countered the detrimental effects of my LoRA's on the base Models.

Will that be possible / necessary with the new architecture, or is it structured differently?

dry wave
#

what do you mean with "per weights"?

#

but loras work technically the same everywhere. They are not specific for diffusion methods

teal fossil
teal fossil
dry wave
#

each matrix has it's own pair of lora matrices. You can remove them individually, yes

low stone
dry wave
bitter hearth
fathom path
bitter hearth
vapid radish
#

It's funny that if you mess up the prompt with SD3 it really tries to follow it, with hilarious results.
"vampire with fangs, wearing a black cape and black with red lining inside shoes"

gusty gale
desert garnet
teal fossil
dry wave
#

I don't know what you mean 😅

#

if you train the TE then you should train it first

#

or you train it together with unet

#

but training unet first and then te sounds wrong

#

regarding lora and unet: I have no clue what this "strange" interaction should be

#

in the end a Lora is doing nothing strange, it's just a weight update

calm surge
dull star
bitter hearth
#

thomas why gen them

sick cedar
tropic aspen
#

So, just want to mention we're around 1 day away, and we still don't know how the licensing for SD3 works despite being told it would be explained before SD3 launches

bitter hearth
storm saffron
dull star
#

I don't like how this one looks

#

why did the AI dude use portrait mode 😔

dull star
#

generic ceiling light, no depth of field, just a dude smiling

storm saffron
#

There, badly lit, mildly out of focus, terrible selfie

dull star
#

this one's better than the last

#

but I still prefer SD3's selfies

storm saffron
#

I'd prefer to see examples of things that can't be done already in SDXL.

bitter hearth
bitter hearth
storm saffron
bitter hearth
#

Kek

#

Weapons are annoying indeed

#

Don't think this basic version available can do any better

storm saffron
#

It can do it, sometimes, a bit better

desert garnet
storm saffron
#

Or a woman talking to another woman in the street, one has blonde hair and is wearing a red dress, the other is wearing a purple dress and a baseball cap

#

I mean it was fairly ambiguous.. 😄

#

more specific (for SD3 purposes)
a woman talking to another woman in the street, one has blonde hair and is wearing a red dress and a fedora, the other has brown hair is wearing a purple dress and a baseball cap

#

SDXL has mixed hair colours now, and only baseball caps

lavish osprey
lavish osprey