#πŸ†•ο½œsd3

1 messages Β· Page 73 of 1

bitter hearth
#

I use British spellings

torn wharf
#

i think more people who want to diffuse images as a hobby should start studying art history. i could certainly do with more myself

bitter hearth
#

yeah history of photography would be good to learn

torn wharf
#

you could just prompt for retro, but if the model is over fit on a digital aesthetic, then well, you get a modern look to your retro style image.

bitter hearth
#

ye

#

since this SD3 you also gotta lower shift

#

if you are using shift

torn wharf
#

is this how you shift?

bitter hearth
#

ye

#

can you try a low CFG image

#

and see if you like it

#

oh I don't know about cartoon style
could you do a realism one

torn wharf
#

look at the meta data for this guy's images. he's posting sdxl syndey sweeny obsesssion love notes

#

marketing works too well on teens

bitter hearth
#

I don't rly judge how people use the models

#

I personally don't do any NSFW or celebrity stuff
but open source is open source I guess LOL

torn wharf
#

i do. also, this is #sd3 . what is the context for these images in this discussion? none of his posts are sd3 examples.

#

yeh i believe in freedom, but i'm gonna judgem still

bitter hearth
#

lol fair enough

torn wharf
#

whose posting sydney sweeny gens?

bitter hearth
#

the way I see it is
what % of the photography + photoshop industry is either celebrity or NSFW?
probably quite a high percentage

#

I'm in the UK its actually unlawful here, surprisingly enough

torn wharf
#

they're generated with newrealityxl and not #πŸ†•ο½œsd3 also, you really gonna ask what the problem with attractive celebrity generations are in a post swiftgate environment? k. just know that many like me exist. we'll see that and immediately know what you're doing

#

you do. same old bag of tricks

#

use google if you truly don'tknow. it was a big deal. you were maybe there even.

craggy crest
#

taylor swift and deepfakes - and the lawsuits

#

for months, yes

#

in fact, laws are being passed now that the mess caused

torn wharf
#

this image uses a higher cfg than your earlier iamge. meta data is all readable. you're clearly not asking for help here.

craggy crest
torn wharf
craggy crest
#

this is the SD3 channel. since you're not using SD3 you should be posting your images in some other channel

torn wharf
#

that's another with cfg of 7. join the 5 gang

craggy crest
torn wharf
#

you're using an ensd too. people don't understand that setting at all

#

when i see people using ensd, i judge so hard

craggy crest
#

as a workflow

torn wharf
#

popular models authors on civit would be like "I recommend using this exact value for the ensd!" but like, why?

torn wharf
bitter hearth
#

you can also lower the CFG feel
with FreeU and CADS

#

FreeU normally makes the image look higher CFG
but you can reverse it

#

its good both ways

torn wharf
#

freeu doesn't work on sd3 since we use transformer blocks over here. keep up.

bitter hearth
#

oh we're SD3 channel yeah

#

its a shame cos I've really gotten into FreeU

torn wharf
#

lol

bitter hearth
#

not sure about A1111

torn wharf
#

yeh it does but it's inefficient

craggy crest
#

comfyUI

torn wharf
#

if all you want to use is the 2 clip layers and no t5, auto is mostly fine

bitter hearth
#

I can't go without PAG any more

torn wharf
#

node graphs have a very high skill floor

#

but are functional af

bitter hearth
#

PAG is absurdly good
sometimes you can add as little as 0.3 PAG
and it will fix things up as if you had raised CFG by 3

craggy crest
#

please go away

torn wharf
#

that's a skill issue. work on your ego issues

#

i mean skill issues

bitter hearth
#

well diffusers is the best long term one in my view
so you can jump on diffusers now if you want
but it has maybe 1% of the features of comfy ecosystem

#

diffusers is made in the style of huggingface transformers
which is a top LLM library
its a much slower and more deliberate project
(but it will takes years to get good for that reason)

craggy crest
#

you are no longer welcome, all you've done here is strut around, try to prove you're someone special, insult all of us, and we're tired of you.

bitter hearth
#

its not rly fair to criticise comfy
because comfy developed along with the tech
newer libraries are coming into it with the full knowledge of the tech from the start

torn wharf
#

i'm not sure how you can live in a world where people accomplish so much with comfyui, and still believe that they're masters and it's the software that is the problem. it's a bad act. i miss legends like blood wizard. these shenaningans here are low effort and just taking advantage of a cut back mod team. i gotta go. things to do. this isn't impressive trolling one bit. didn't even strip meta data or try to pretend that was the sd3 vae.

#

you can't call other people kiddo when you have a crush on sydney sweeney. that's just expected.

#

πŸ’£

craggy crest
#

boom?

torn wharf
#

explodyballs

craggy crest
#

boom!

bitter hearth
#

I didn't realise they were actually increasing the CFG when they were saying they were decreasing it
that is definitely a novel form of trolling πŸ€”

real terrace
#

Hi, is there inpaint for SD3?

craggy crest
real terrace
bitter hearth
#

maybe powerpaint 2 node with brushnet?

#

I don't do inpainting so I am not sure

#

I was under the impression there were some model agnostic ones

sterile pendant
# real terrace Hi, is there inpaint for SD3?

Yes, use differential diffusion and blur the mask. Play with the denoise value, it will take far less than you're normally used to with sdxl, to make big scene changes(the sigma curve for sd3 stays high for a lot longer than sdxl sigma curves). You can use 1.0 denoise to completely change things as well

#

With diff-diff, you don't really need an actual inpaint model

#

The mask blurring is where it soft blends. I recommend 35-50 steps for best results

fleet meteor
hallow lion
#

I missed some popcorn moments

#

:/

torn wharf
#

just a guy pretending to post sd3 pics and looking for advice.. weird troll

sacred jewel
sacred jewel
craggy crest
sacred jewel
uncut river
#

ahh, I missed the trolling

craggy crest
uncut river
#

still, cant read it back

#

oh well

bitter hearth
frail shoal
#

Anyone figured it out how to finetune Sd3 for anatomy?

uncut river
#

i dont think so

#

What I tend to find in SD3, the more you hide, the better it gets!

craggy crest
uncut river
#

how to prompt correctly for a woman sitting in the grass, one leg flat on the ground, one knee up, hand on her knees, head leaning on her arm ... ?

#

Perhaps SD3 needs more of a story?

#

something like "Imagine a serene scene in a lush, green meadow on a warm afternoon. A woman finds a comfortable spot in the grass, feeling the soft blades tickle her skin. She decides to sit down, letting one leg rest flat on the ground, while she casually bends the other knee up towards her chest. With a gentle grace, she places her hand on her knee, feeling the warmth of the sun on her skin. Feeling relaxed, she leans her head against her arm, enjoying the tranquility of the moment as she gazes out at the peaceful landscape around her."

craggy crest
# uncut river Perhaps SD3 needs more of a story?

nope. there is a core issue with all subjects, not just a human femal or humans in general. the farther you get away from standing straight in front of the camera, the more the subject is affected. it begins to warp, things shorten or elongate, and the AI starts trying to draw the subject from multiple points of view. it affects everything - trucks, cars, birds, paint brushes - everything. and you can't prompt it out. it's in the core training.

uncut river
#

so, with "prompt correctly" you mean only prompt for things in a simple pose, standing up, front view, etc ... ?

craggy crest
warm sonnet
#

Don't waste time with prompts, anatomy errors belong to the model. If you generate an image without anatomy errors, it's just luck

hallow lion
#

until they fix the bewbs and so on I am more than happy to use it for backdrops.

#

it makes phenomenal backdrops

#

if they do bring the anatomy up to the level of the backgrounds and landscapes i will be impressed. that indeed would be almost like the last model we ever need

#

XD

frail shoal
#

i'm trying to create a logo, why does the background looks like that ? Also it seems that this is not the 16ch vae quality

bitter hearth
frail shoal
#

ah it was the perturbed verision of sd3

sacred jewel
frail shoal
dull star
#

the smaller model, schnell is 12B, its apache 2

#

but holy crap we might not be able to run this

frail shoal
sacred jewel
#

I am not rich... but I did splurge on a 4090 πŸ˜›

#

(I do not golf)

sacred jewel
frail shoal
#

so stability lost a lot of the people that worked on sdxl, svd and sd3 etc and they went and created their company ?

frail shoal
sacred jewel
frail shoal
frail shoal
frail shoal
sacred jewel
frail shoal
frail shoal
sacred jewel
#

I can;t find the .safetensors formatted checkpooints... only the sft from Huggingface

#

nevermind. Someone said .sft [is] .safetensors

#

and she runs...

fossil pagoda
#

Almost 24GB for a model file? bruh

frail shoal
frail shoal
frail shoal
#

also can it be run in int 8 ?

fossil pagoda
#

Havent found the comfy workflow yet

fleet meteor
#

WAIT WTF

fossil pagoda
#

Flux it seems

fleet meteor
#

I tought it was an llm that described the image, like llava

narrow river
#

so we can finally get woman on grass ?

sacred jewel
#

First generation

fossil pagoda
#

Gib workflow

sacred jewel
meager burrow
#

Yeah I can't figure out how to build the workflow. I dl'd the main .sft model and the ae.sft (maybe ae is vae?) but I keep gettting this error:

#

Updated comfy and all

sacred jewel
#

Rename checkpoints to .safetensors

Put the model under UNET and the VAE under VAE of course.

Use the SD3 ClipText encoders

frail shoal
frail shoal
sacred jewel
sacred jewel
frail shoal
narrow river
#

is the model good ? prompt adherence ?

frail shoal
sacred jewel
#

So far it looks amazing.

My two above are from Schnell. Trying Dev now.

fossil pagoda
#

Ehh

frail shoal
narrow river
fossil pagoda
#

Idk how you run this omegalul
Im still at clip encoding with my 4090

narrow river
fossil pagoda
#

I meant local kek

frail shoal
narrow river
#

Epic anime artwork of a wizard atop a mountain at night casting a cosmic spell into the dark sky that says "Stable Diffusion 3" made out of colorful energy

meager burrow
narrow river
#

its very good in producing text
`Write this poem with cursive text on a background that fits the words:

Sunshine is bright,
The sky is blue,
Birds sing sweet songs,
And so do you.`

fossil pagoda
# sacred jewel

I copied that already but getting stuck at clip encode unfortunately

sacred jewel
#

It

#

looks

#

great

#

so

#

far

#

WOW

sacred jewel
frail shoal
#

crying gpu tears

fossil pagoda
sacred jewel
#

She is very hungry...

#

This is using Flux Schnell

meager burrow
#

Photo of bionic robot man with glowing circuitry and complex gears. He has a grim expression. Background is futuristic cyberpunk world. His helmet has "62717" printed on it in blue letters. Highly detailed.

narrow river
fleet meteor
#

Cartoon, french frie fighting with a potato using forks

brisk stream
#

wait, what gpus are you guys getting this to run on?

meager burrow
#

3090ti

mortal mesa
#

whats SD3 turbo eh hehe

brisk stream
#

I got it running but- at 768x1024 max size and it was at like 23.8 GB

torn wharf
#

That's a lot of parameters that don't seem necessary. Smaller models will dominate the future the same way that massive double Bay 5.25 inch drives weren't the form factor that hard drives used for very long. Smaller is how tech goes.

#

I'm not even going to bother since I only have 16gb

meager burrow
narrow river
#

cctv footage, car crash into shop

#

lmao

fleet meteor
#

cctv footage, outside a shop a magician is throwing a lot of pizzas,
bad quality Image chaotic

fossil pagoda
#

This will be huge if we get it to run on lower hardware

narrow river
# frail shoal

holy sheet
Photo of Criminal in a ski mask making a phone call in front of a store. There is caption on the bottom of the image: "It's time to Counter the Strike...". There is a red arrow pointing towards the caption. The red arrow is from a Red circle which has an image of Halo Master Chief in it.

#

this make memes so much easier

dull star
#

if I get fp8 weights and a workflow im using it offline fr

#

especially since its a turbo model, it can be dog slow for all I care, its all 8 steps

lunar rivet
#

got it running on 3060, looks good happemad

steep forge
#

@lunar rivet

lunar rivet
#

Annuvin desu

narrow river
#

possible on 3060 ? wow

lunar rivet
#

it is if you don't mind your browser lagging

#

some more ram would fix it

fleet meteor
#

cctv footage, boat crash into the top of a building

narrow river
#

aparently this model is good with cctv footage LOL

dull star
lunar rivet
#

make sure you have a lot of swap if you don't have a lot of ram

#

I think the fp8 t5 might do better too

fleet meteor
dull star
#

what's a load transformer?

#

can you tell me what plugin is that?

lunar rivet
#

it's just the workflow from above

dull star
#

whaat

#

I even updated comfyui

lunar rivet
#

I was trying to figure out how to connect the nodes and renamed them

dull star
#

ohh

lunar rivet
#

this has the workflow

dull star
#

thank you

#

oh its the unet loader lmao

#

funny

narrow river
lunar rivet
#

that's difficult to say tbh, sampling isn't that bad, about 13 seconds for 4 steps, but the constant offloading adds a lot of time

fleet meteor
lunar rivet
#

the image above took 130 seconds total

fleet meteor
#

Are you on windows?

lunar rivet
#

if you're just doing sampling, as in already have the prompt encoded I think it would be pretty fast, going to try with fp8 t5 now

#

yeah, I'm on windows

dull star
#

are there any fp8 weights for the transformer itself?

lunar rivet
#

ah no wait, I'm blind, that was 1 minute 13 seconds for sampling, not 13 seconds

dull star
#

makes sense with all that offloading πŸ˜…

#

oh god my ram is running out

#

its reaching the 32GB

#

f****ckkk

lunar rivet
#

there's just that one checkpoint in the flux repo right now, give it some time ig

dull star
#

its going into swap

steep forge
#

need 64 RAM asap

dull star
#

yeah we'll need fp8 weights

#

its not even using a LITTLE BIT of vram

narrow river
#

possible we can get smaller version of the model ?

dull star
#

yeah fp8 would be the solution

#

or a further distillation, idk

#

huh, its unloading some of the memory

lunar rivet
#

this wasn't that bad

#

fp8 t5

narrow river
dull star
#

damn this is going to be so slow

#

lowvram mode πŸ˜”

#

huh? now its using the vram

#

its piling up

#

holy crap

#

we'll need fp8 weights immediatelly

#

ok 2s/it isn't that bad

steep forge
#

send help

dull star
#

fp8 t5 + fp8 weights might be viable

#

cause holy crap my computer went super slow mode because of it

#

discord was dying

fleet meteor
#

Anyone got a t5 fp8 link? I can´t find it on hugginface 😿

lunar rivet
#

it's in the sd3 repo

frail shoal
#

probably they have the text encoder baked into the model, otherwhise they would put it up for download. So you are already loading the TE and loading it again with sd3 clips

fossil pagoda
#

Just need to figure how to load it then

lunar rivet
#

true that, it shouldn't be too hard to extract the transformer part tho

#

actually, there's a commit in their repo for diffusers support

#

they have the model split up into parts there

#

not sure if comfy can load text encoder that's in like 4 pieces

dull star
#

so yeah the DiT weights are 24GB itself, which isn't so great for offline use

#

but we'll see how fp8 will perform

frail shoal
#

And if they use T5 as TE than that's pretty big already

#

sd3 2B it's 4 gb right ?

#

oh shit

dull star
frail shoal
lunar rivet
#

same as sd3 I think

#

openai/clip-vit-large-patch14
google/t5-v1_1-xxl

frail shoal
lunar rivet
#

there are 2 tes only, no?

#

I mean, in flux

frail shoal
#

do you have a link where you can see what TE flux is using ?

frail shoal
#

okay so they just use Clip L and T5

dull star
#

yes

frail shoal
#

that clip L might have artist names which is nice

lunar rivet
#

and did some fixes looking at the repo

dull star
#

okay fp8 t5 makes my computer not be a hog!!!!

#

I can multitask!

#

This is amazing!

dull star
#

3090

#

someone got it working on a 3060 too

#

but its like 8.6s/it

frail shoal
#

waiting for the quant myself.

fossil pagoda
#

Give it a day or two for people to test stuff and let the dust settle I guess

#

But this really came out of nowhere

dull star
#

with fp8 T5, I can actually comfortably use this now

#

this is amazing

#

fully offline, apache 2, fast

#

dream come true

meager burrow
fossil pagoda
#

Btw this is the SD3 channel, I think we should move to General hmmm

meager burrow
#

oh duh, I can't read. Ty!

fossil pagoda
dull star
#

yuh

lunar rivet
#

it's using sd3 encoders, it's like half sd3 happemad

fossil pagoda
#

Youre using them cause you dont know better yet

mortal mesa
lunar rivet
#

oai finally contributing something

mortal mesa
#

clip L and T5 ya

sacred jewel
frail shoal
#

is this also a 16ch vae ?

brisk stream
fleet meteor
meager burrow
dull star
#

sorry one last question, is flux schnell or dev good at cctv? cause schnell doesn't seem to get it right

#

its sad that dev is the non-distilled model and its not apache 2 as well

#

such a bummer

sacred jewel
dull star
#

I'm downloading Dev right now

dull star
#

but its just simply something like low quality, blurry, birds eye CCTV security camera photo of, a muscular homeless man with torn clothes and long beard is rampaging in a store. There are police men are around him trying to stop the homeless man

dull star
#

dev or schnell

tough oriole
#

dev

dull star
#

makes sense, thanks

#

schnell sucks at cctv sadly

sacred jewel
#

Schnell

sacred jewel
tough oriole
dull star
sacred jewel
#

This is straight from the model, no upscaling

1792x2304

torn wharf
torn wharf
sacred jewel
#

and this is same dimensions, but DEV

#

FLUX is the new baby...

brisk stream
#

so guys- you can actually just int8 quantize with torchao

#

(for those of us who dont want to use comfy)

#

~1.4 it/s on 4090

sterile pendant
dull star
#

spingebill in gta san andreas??? no way

brisk stream
dull star
#

Its just so worth to wait for images, especially when they're this intelligent and good

#

god dammit

#

its a dream come true

sterile pendant
#

for the ins and outs

brisk stream
#

I stand defiant, with a piece of corn balanced horizontally sideways on my head.

brisk stream
#

Also some things aren't really possible

sterile pendant
sterile pendant
brisk stream
brisk stream
sterile pendant
#

there are vlm nodes for prompt expansion, there is an api flag and dev workflow mode to create the workflow for simple api usage

#

99.99% sure it would be faster, i doubt you have hundreds to thousands of lines of code for things like smart memory management. diffusers would be the only other close alternative

brisk stream
#

considering that for mine I literally had to create a custom react app with multiple websocket webworkers so it could process data fast enough without lagging the UI.

dull star
#

happy wheels irl

brisk stream
#

Also it has a custom backend which uses torch.multiprocessing workers which it can round robin submit jobs to async over two gpus, and even the image encoding is done via nvjpeg on device so that the only thing that comes off the device is jpeg bytes.

#

I made a neat torch extension for that

sterile pendant
brisk stream
#

yes

sterile pendant
#

swarm based on comfy

brisk stream
#

mine is just swarm

sterile pendant
mortal mesa
#

we butter our bread with butter

brisk stream
#

Oh- ok, mine is docker swarm

dull star
#

stop arguing and look at this image

brisk stream
#

wow nice image

dull star
#

its the image of all time

#

πŸ”₯

brisk stream
#

I'm not arguing, or at least not angy-arguing

dull star
#

I understand

brisk stream
#

it is silly-mode argument. I know I might be wrong, but I want to be right, aPES_EvilPlan

torn wharf
tough oriole
sage burrow
#

Flux vs sd3 using a short no effort prompt. Though this us very case specific since sd3 doesn't really know the word anthropomorphic.

dull star
torn wharf
#

i wanna say i love how sd3 can do sd television screens with higher detail in the rest of the image. so smarticles.

But i'm afraid it's flux again

dull star
#

kek

tough oriole
#

I stole the tv prompt from dark and added a black and white comdey show

torn wharf
#

my paranoia has been validated. fk.

dull star
#

flux-dev truly spoiled me

#

its so good man

torn wharf
torn wharf
dull star
#

can it ball?

sage burrow
#

It can't do smut

torn wharf
#

does it get up and get down with the ballness?

sterile pendant
#

man, my poor 2080 is getting 25 s/it lol... schnell is the only way i can really play with it

torn wharf
#

there's nothing perverse about balls they're perfectly natural

tough oriole
torn wharf
#

1 side, no edges. no corners. can't hate

sterile pendant
#

dude works so damn fast

dull star
#

I get 1.8s/it πŸ˜”

#

its still fast

#

how much do you get

sterile pendant
#

i could live with that just fine, i was getting 25 second per iteration

dull star
#

WHA

sterile pendant
#

bout to try it again

sage burrow
lunar rivet
#

I barely managed to load the models and comfy already pushed commits πŸ˜”

dull star
#

I recommend fp8 t5

torn wharf
#

the license says you can't use anythign to circumvent limitations designed into flux

sterile pendant
torn wharf
#

it's a non commercial license but i think they might offer commercial licensing unlike df

dull star
#

2D pixelart Gameplay screenshot of Terraria. The character is in the middle of the screen where it is holding a pickaxe. There is a wooden house next to the character.
It's not terraria, but it got it right?

dull star
#

you have to email them and everything

#

you cant just buy it off of a website I think πŸ€·β€β™‚οΈ

torn wharf
#

stability is setting a trend here for commercial licensing weights and i dont think it's so bad. money in helps quality out. in some cases.

dull star
#

a price would be good to know though

#

WTF its so good

#

it doesn't know what diamond is so its not blue

#

not even dalle 3 could get this right!

torn wharf
#

hard to do when there's so many different use cases and the market is so new and fresh. like you'd license it to dreamworks for general production purpsoes a lot less than you'd license it to an independent creator for a specific use case. these kind of licenses usually require signing an NDA and sharing business plan data

#

i bet a self serve one would come. like the membership option for $9.99/m or whatever.

torn wharf
frail shoal
#

that probably people are going to finetune and make it better

dull star
#

it got it super close

torn wharf
#

confirmed ball capability

dull star
#

idk how they are gonna finetune a distilled model, but if its made possible, then we have hope

torn wharf
#

There is a ball with a wacky looking face on it, floating in space. A wooden sign hangs from 2 small chains connected to the underside of the ball. It says "Get Yer Balls Out" painted onto it like a moonshiner sign. There are colliding stars in the very distant background.

sterile pendant
dull star
torn wharf
#

yeh its not a bad model its pre good

dull star
#

is this half life 1?

#

or some other old game

torn wharf
#

half life 1 danger ball zone

fleet meteor
torn wharf
#

it kind of vibes source engine sometimes and goldsrc other times. but it's very effective.

dull star
#

lol

#

woah the texture looks super crisp

torn wharf
#

guidance cranked to 10

#

it wouldn't go to 11 😦

dull star
#

awesome 3D render man! what pbr textures did you use?

torn wharf
sterile pendant
#

yeah i see why it's taking so long, i just simply don't have enough vram, so it's using sysmem. noticed my cpu was sitting around 50% the whole time i diffused and my vram usage was only like 6-7gb

dull star
#

this aint half life 1 but holy crap

torn wharf
#

i'm reluctant to go to general with images, because other people will start using the replicate link to experiment then. it'll become super busy and queue times will get longer. SD3 cahnnel is safe because all the haters ignore it

sterile pendant
#

i really need to upgrade to something with 16+ gb vram

torn wharf
#

longer this stays under the radar longer i can crank out gens with it

torn wharf
dull star
#

I thought I didn't make the right choice with 24GB because of 2B fitting well anyway in 12GB

#

but auraflow and this model proven me otherwise

sacred jewel
#

I am finding from gen to gen, some are super fast and then the very next one is super slow :/

sterile pendant
#

only messing around with the distilled model

#

4 steps lol

torn wharf
#

actually thats not so bad

dull star
#

yeah the distilled model is surprisingly intelligent for a turbo-ish model

#

I recommend it

torn wharf
#

it knows teslas but not cybertruck

dull star
#

but now I got super spoiled by dev so I'll keep using that 😭

sterile pendant
#

yeah dev at like 30 steps would take an eon per image. assuming 15 seconds per it, that's like 7-8 minutes per image lol

dull star
#

oh I'm using 15-25 steps

#

at 15 I already got amazing images

#

25 might help a bit

#

also im using 1024x1024 type resolutions

sterile pendant
#

well even still, they'd be like 5+ minute generations on my pc

dull star
#

lower res images work too

sterile pendant
#

oh thats right, it can go WAY down if you want it to

dull star
#

yeah its unfortunate

#

I hope fp8 weights for it will come out

sterile pendant
#

like i think it can go down to like 0.1 MP even

dull star
#

it has to be democratized

torn wharf
#

these new transformer models hitting the streets lately have reinvigorated my prompt curiosity

dull star
#

auraflow was made working on like 12GB or lower right?

sterile pendant
#

lemme test 512x512 real quick

dull star
sterile pendant
#

8 seconds per it at 512

torn wharf
#

soon as i hit a few limits with clip style prompting, my idea of a prompt became pretty formulaic. i used to experiment with wacky stuff more often

sacred jewel
dull star
#

schnell?

torn wharf
#

dev with woman lying in grass test though....

sacred jewel
#

Dev

dull star
#

kek

fleet meteor
#

Does Dev use more vram than Schnell? πŸ€”

dull star
#

probably not

#

they are both 23.8GB in size

sterile pendant
#

not sure, think they're the same size

#

but they are definitely meant for 24gb cards

fleet meteor
#

Time to destroy my PC with the 3060, iΒ΄ll test it

sterile pendant
#

well like i just said, expect something around the 15 seconds per it range

#

15-25 s/it *

fleet meteor
sterile pendant
#

my cpu is a 13600kf and was pegged around 50% the entire time, ddr4 3200 ram

sacred jewel
sacred jewel
mortal mesa
#

its like generating pictures but with alot of lag

sacred jewel
torn wharf
#

break dancer doing a hand stand. wearing an adidas tracksuit with neon tie dye colors. on a cardboard mat outside of a retro style diner. The Diner sign reads "BALLS DINER"

mortal mesa
torn wharf
#

doesn't entirely pass the handstand test but it fails a lot lot less

turbid grotto
#

is there any chance to run on 12gb with offloading or something like that?

mortal mesa
#

crazy to call that an improvment, but it is haha

fleet meteor
turbid grotto
mortal mesa
#

i am with a 2080

torn wharf
fleet meteor
torn wharf
fleet meteor
#

Would it work offloading it if i have 16gb of ram?

mortal mesa
#

ya it is for me

torn wharf
dull star
#

flux can make female nipples. yes. I said it.

torn wharf
#

this is peak balls. this prompt was fun. it takes randomness very well and tunes details extraordinarly.

torn wharf
dull star
torn wharf
#

goons will over use the free sites available. we can stave off september

#

so far i cannot make it create nipples that look like asperagus, but i've got nipples that look like fingers

#

can't do carrot nipples either. unfortunate

dull star
torn wharf
#

bonzai buddy lolol

#

that's obscure knoweldge. fun to prompt

sacred jewel
#

close enough? πŸ˜›

torn wharf
#

lol it does have fails with body positioning but is undoubtedly improved

undone meteor
#

Anyone else getting an error β€œconv_in.weight” when trying out the flux example workflow?

dull star
#

I am taking this for granted

#

way too much

fleet meteor
#

😭 What version of comfyui are you guys using? I just get errors with the portable one

torn wharf
#

just confirmed they used the multi modal DiT structure too. 31m in funding makes them more of a powerhouse than stability if i'm not mistaken. uhoh stability. how soon before flux discussion gets seriously enforced upon ?

dull star
#

you need to use the Load Unet and not the load checkpoint

#

hmm sadly flux cant do impressionist paintings

#

it makes SUPER photorealistic paintings

#

to the point they look like photos

#

πŸ˜”

fleet meteor
undone meteor
#

Yes, using Unet loader and the flux models. I updated confyui. Weird. Still get the conv_in.weight error.

fleet meteor
#

Okey gotta reinstall from 0

dull star
#

bruh

fleet meteor
#

my comfy is ded

torn wharf
dull star
#

no this is the Dev model too

#

in fact, the distilled one actually makes paintings

torn wharf
#

dev is distilled pro

dull star
#

well not like turbo-type distilled, but yeah

#

its like distilling from a large model into a smaller one

#

and not making it perform faster at lower steps, no?

#

I imagine the pro model being massive

torn wharf
dull star
#

thank you

#

FLUX.1 [dev] obtains similar quality and prompt adherence capabilities, while being more efficient than a standard model of the same size

#

yeah it just wont make paintings at all sadly

#

they look good but they are too photoreal

torn wharf
#

yeah like it has one idea for "painting"

dull star
#

I even asked for large brush strokes

torn wharf
#

sometimes distillations can collapse a class like taht i've noticed. saw it on vodka sdxl models

#

different kind of distallation too

dull star
#

hmmm

#

at least everything else seems okay

dusky thistle
turbid grotto
#

My 3060 is still worthy! chad

#

It runs on 3060, 170s for 20steps on dev version, including text encoding on weak cpu. I am using fp8 T5 and lowram in comfy

sacred jewel
mortal mesa
#

been using fp16 T5 cuz both default to low vram regardless for me, seems the same time either way

fleet meteor
frail shoal
lucid swift
torn wharf
#

Honestly, this is insane. Truly. I think SD4 is the only move they can do at this point. Or do a nullsoft style thing and jump straight to SD5

fleet meteor
signal shuttle
torn wharf
dull star
#

those 2 pixels are incorrect, unacceptable

signal shuttle
fleet meteor
#

Is it normal that the gpu usage is 3%? (and only about 33Β°c)

torn wharf
#

I love auraflow team. Underdogs for sure. Woo woo for the underdog.

Blackforest have 33m funding

#

and expertise

dull star
torn wharf
#

nvidia tookon pixart team. who knows what happens there .. but this is truly proper. I hope lora training for these don't need 80gb

signal shuttle
torn wharf
#

i woudl say that they just detonated a mother of a depth charge and we will see the surface results of it quite soon

opal meteor
#

has this thread been hijacked for flux? cuz wow it's good! sword test actually... works?

dull star
#

I wonder how pixart and sd3.1 will perform next to flux

#

lol

signal shuttle
torn wharf
#

i saw that

dull star
#

pixart might be some 900m(?) model, and sd3.1 is gonna be 2b further pretrained

#

cant expect them to compete with a TWELVE BILLION paremeter model

#

but we'll see

torn wharf
#

not just a model but a state of the art model. state of the art being kling and sora

opal meteor
dull star
#

I hope pixart will be good

#

apparently they are working on text

#

and 16 channel vae

signal shuttle
#

Someone actually quantized Flux and made it run with only 16Gbs of vram

torn wharf
#

pixart got nvidia backing now. i think they're gonna be good. proabbly a ton of experience there they can access

signal shuttle
dull star
#

and maybe no more lowvram mode

torn wharf
dull star
#

but its like some separate script

#

not comfyui

torn wharf
#

they didn't put out a technical paper weeks in advanced notice that? now that the model is released then it'll come out soon

#

stability should take some notes

#

most projects should take notes maybe. "heres what you need to run it. more later"

signal shuttle
#

The most surprising thing about Flux release's is that it came out of nowhere, they didn't tease the model for like months or release the paper way ahead of the actual model release, they just released and where like "Ok here's good model, now we work on Text2vid bye"

dry wave
#

auraflow was funded, too, by fal

dry wave
#

in direct comparison to Flux I think it's quite clear which model is better

dry wave
torn wharf
#

i like to think of auraflow as the bobross team of ML. like, since they've started doing their thing, they've really sparked engagement i've notied. more people taking on tasks that seem like its something only corporations could do. then they do it in some form all alone. it's sorta like the college bowl of ML

#

auraflow i think will do more good there. spreading knowledge of how these things work. i don't think they stand a chance of competing with expertise like flux

dry wave
#

Flux is open source. They will also release a technical paper about it+

mortal mesa
#

just the one model is open sauce

dry wave
torn wharf
#

then told me i was triggered

dry wave
#

I said it was weird to call it multimodal, as on the one hand all text2image models are by definition multimodal and on the other hand it is not multimodal in the sense of,. say, the Meta Chameleon model. But anyways, I don't see why you want to start that discussion again.

torn wharf
#

you're here explaining the benefits of mmdit to me now is why. anyways. just reminding you of our existing repor, especially in this context. you need not tell me of the architecture of sd3. we've been here before.

dry wave
#

I don't explain the benefits of mmdit to anyone here πŸ˜‚ #

#

I said Flux is working great

#

Auraflow is not really good

#

both mmdit models

#

its not all about the architecture

dry wave
fleet meteor
#

After generating the image the model unloads from the vram, is there any way for it to not unload? it takes like 5 minutes to load on a 3060 😭

torn wharf
#

I was thinking embeddings should do fine. If the knowledge is in the latent structure, embeddings will pull it out. Seems like a good base model to pound out embeddings for. i haven't seen that kind of action since sd2

sullen moss
#

Oh, FLUX really surprised me in terms of quality; I didn't expect it. I'll be doing more tests.

dry wave
sullen moss
#

Best Test for model)

lucid swift
#

dark new long room, high water, room out of new white tiles, modern, with round lights on the walls, clean, the seling also has tiles, high water, high water, water is fooding the room
flux-schnell

dry wave
#

Flux is also really great with hands and hands holding stuff.

lucid swift
#

yes

#

cat in a frog costume

fleet meteor
#

So cute!

fleet meteor
lunar rivet
#

idk, it doesn't just count the sampling but also stuff like moving to vram

signal shuttle
#

Bro flux is so good i can't 😭

lunar rivet
#

dayum, that's some solid text

fleet meteor
prisma rampart
signal shuttle
lunar rivet
#

I think you need to download more ram, that's a bit excessive

#

even worst case from cold load I get only like 3 minutes, ok maybe 4

sullen moss
#

That's what it means to surprise the public. Without any unnecessary noise and advertising, the model is just amazing.

fleet meteor
#

IΒ΄ll try again using the ssd as swap instead of an hdd

lucid swift
#

but it seems like it does not know what pixel art is "a colorful pixel art rendered artwork with the words "I am a cat!!" in bold, white letters. The letters appear to be made of a glossy, liquid material that is dripping and splashing in various directions"

#

"a shrunken very small adult person sitting on the neck of a flying goose nils holgerson. the duck is flying in the sky"

#

maby

#

"This image showcases a vibrant and colorful furry character in the foreground. The character is anthropomorphic, with a fox-like appearance, and is adorned with a mix of green and purple fur. The character has large, expressive eyes and a playful, open-mouthed smile, suggesting a friendly demeanor. The character is wearing a badge or name tag that reads 'flux def', which could indicate their name or a specific role within the event they are attending. The background of the image is a dimly lit room with rows of empty chairs, suggesting an auditorium or a conference hall setting. There are a few people visible in the background, possibly attendees or participants of the event. The lighting in the room is focused on the character, creating a spotlight effect that draws attention to them. The style of the image is a photograph. The clarity and detail of the character and the environment, along with the shadows and lighting, are indicative of a real-life photograph taken indoors."

prisma rampart
#

"pixel art of a vampire castle under moonlit sky"

dry wave
lucid swift
#

interesting 3d pixel art

cunning lintel
#

oh, everyone's seen flux, i'm late to the party 🀣 now this is what SAI teased all the time, who could have known they meant it would be released by some of their old scientists but another company, just weird, but i'll take it 🀑

lucid swift
#

"A vintage-style black and white photograph, captured from a top-down view, placed on a worn wooden surface. The photograph features a mysterious figure standing with poise, dressed in an elegant long gown and a high collar. Their hands are clasped in front of them, and they wear a striking goat skull mask with curved horns. The background is a deep, enigmatic darkness, contrasting the subject's prominence, illuminated by the flash. Beneath the photograph, delicate handwritten text reads, "Fear is weakness," leaving a thought-provoking statement"

fleet meteor
#

okno

dry wave
dry wave
#

its SD3 with 12b

lucid swift
dry wave
lucid swift
#

The image shows a cozy, eclectic room with a vibrant, colorful ambiance. The ceiling is draped with multiple tapestries featuring intricate designs, including mandala patterns and depictions of plants and celestial motifs. The lighting is soft and atmospheric, with various sources contributing to the overall mood: Ceiling Lighting: There are red, pink, and purple lights that illuminate the tapestries, highlighting their patterns and adding a warm glow. String Lights: Multi-colored string lights are draped around the room, adding to the festive and relaxed atmosphere. Television: A flat-screen TV on the wall displays a scene from the animated show "The Simpsons"

sacred jewel
lucid swift
sacred jewel
lucid swift
#

The image depicts an alien-like creature with a large, elongated head and dark, almond-shaped eyes. The skin appears textured and rough, reminiscent of reptilian or amphibian skin. The creature is sitting in a body of water, partially submerged, with its legs and lower torso hidden below the surface. The background is foggy, adding a sense of mystery, and features tall, thin reeds and barren trees, creating a marsh-like or swamp environment. The overall atmosphere is eerie and otherworldly, with muted colors and low light enhancing the creature's unsettling appearance

signal shuttle
#

Its stupid how good Flux is

#

It even got north america right!

lucid swift
#

i dont see the full imAGE

prisma rampart
signal shuttle
dry wave
signal shuttle
dry wave
#

I think it can do pixel arts and games very well xD

lucid swift
#

A large painting in the style of Rembrandt depicts an alien-like creature with a large, elongated head and dark, almond-shaped eyes. The skin appears textured and rough, reminiscent of reptilian or amphibian skin. The creature is sitting in a body of water, partially submerged, with its legs and lower torso hidden below the surface. The background is foggy, adding a sense of mystery, and features tall, thin reeds and barren trees, creating a marsh-like or swamp environment. The overall atmosphere is eerie and otherworldly, with muted colors and low light enhancing the creature's unsettling appearance.

fleet meteor
#

Fresh comfy install and its even worse 😭 I give up

lucid swift
#

A large king painting in the style of Rembrandt of a alien

lucid swift
signal shuttle
#

the text abilities is so good

dry wave
lucid swift
# dry wave

what promt? i like that it does not have a perfect camerea

dry wave
#

actually it was "this star trek episodes features Micky Mouse on the bridge of the Enterprise, standing next to Jean-Luc Picard, both in their starfleet uniform."
But it seems to not know Picard.

lucid swift
#

interesting so it might conect star teck with lower camera quallaty

dry wave
#

I mean, if you ask for star trek discovery it might not so, dunno

lucid swift
#

cctv footage of a cat eating a burger

cunning lintel
#

A large, mystical snail-zebra chimera with an agitated demeanor crawls through a dense, shadowy forest, its plush, velvety fur coat transitioning from deep ebony to bright alabaster, creating a mesmerizing zebra-like pattern. The chimera's spiraled shell, reminiscent of a giant snail's, glistens with iridescent colors that contrast with the muted greens and browns of the overgrown foliage. Its wide mouth reveals sharp, pearlescent teeth, while large, expressive eyes convey curiosity and agitation. Antennae resembling delicate tendrils protrude from its head, enhancing its otherworldly charm. Low trees and delicate branches frame the scene, creating depth and enclosure. Dramatic lighting filters through the canopy, casting sharp contrasts and highlighting the chimera's intricate textures. The ground is littered with rich, earthy tones, and vibrant greens of the low vegetation evoke life and mystery. Captured as a movie still on a 35mm lens, the image showcases extraordinary detail, making the chimera appear lifelike amid the enchanting yet eerie atmosphere of the Schwarzwald.

sullen moss
#

It's a pity to admit that the SAI team is playing catch-up

#

flux anatomy the best

dry wave
#

it does not work reliably, but you can prompt it to do old camera

signal shuttle
dry wave
#

it's good in anatomy in general

cunning lintel
#

competition leads to progress, maybe irrationaly so, i still think SAI can createa killer model, their SDXL and cascade were soo good and 8b halfway trained not bad at all either

lucid swift
dry wave
#

I would say:

  • anatomy is awesome
    ~ prompt understanding is middle
  • style is not flexible enough yet
lucid swift
#

A delightful close-up photo of a charming, plush Pikachu. It has a blue and black body, with thick thighs that accentuate its cuteness. Its eyes are a vibrant red, adding to its adorable appearance. The overall atmosphere of the image is playful and sweet, making it perfect for a cuddly plush toy.

dry wave
signal shuttle
lucid swift
#

but i agree with everything else

cunning lintel
dry wave
uncut river
#

FLUX

modern anime style, woman standing on the grass, fur coat, thick winter coat, long bare legs, thick hips, dark skies, sunny, bare feet, gorgeous scene, anime screencap, stone temple altar, skulls

dull star
lucid swift
#

ideogram is still sometimes better or wait a sec this is the schnell model

prisma rampart
#

flux seems to do hands quite well, they definitely paid attention to anatomy

dry wave
#

Ideogram is way better in regards of prompt understanding. It looks horrible most of the times, though

#

but Ideogram -> controlnet -> SDXL was always a nice pipeline

lucid swift
dusky thistle
lucid swift
#

ok the flux def is more comparible to ideogram ideogram vs flux def

lucid swift
#

impressive

signal shuttle
#

Flux beats SD3 in planes

uncut river
#

seems they have closed some of the gaps that sd3 left

dull star
#

I don't even know if I want SD3 8B anymore

signal shuttle
#

Flux is all you need

uncut river
#

but i cant run it local!! 😦

signal shuttle
frail shoal
dull star
#

how come

#

same seed and everything?

frail shoal
dull star
#

yeah schnell makes paintings whilst dev somehow doesn't

frail shoal
#

a stunning painting depicting of an alluring sexy sith russian woman sitting in a futuristic black obsidian throne, with pale skin and glowing red eyes casting lightning from her hands towards the camera. The scene is dimly illuminated by the blue white lightning that blinds the camera. In the bottom of the image there is a text saying "I am Flux !". She is wearing a hood and is wearing a crop top black shirt with the empire logo from star wars movies on it. Ultra detailed, masterpiece, best quality

#

this is in this HF space

#

and these are in the fal playground. Do you see that the text was forgotter here and the images are somewhat more realistic

dull star
#

wait I just realised

#

we have CFG now?

frail shoal
#

yeah, oh in the HF space there is no cfg

dull star
#

I dont remember cfg on fal either though

#

and in comfyui

#

ohhhh

#

cause it was schnell πŸ˜…

fleet meteor
dusky thistle
#

so no CFG

signal shuttle
dusky thistle
dull star
#

it actually makes it slower

dusky thistle
#

neg conditioning causes oom on my system

#

and you can't set cfg with only positive conditioning

#

you need something else

dull star
#

also im no longer using lowvram due to fp8, but its not even faster lol

#

in fact, it might be ruining my image quality since its fp8

#

but idk how much

sullen moss
#

Well, what can I say? The guys did a great job, giving us hope. Now, for me, this is the top model among open-source ones. And just imagine what will happen when various ControlNets, Loras, and fine-tuned models start to appear for it. I'm eagerly waiting

dull star
#

but at least the VAE is loaded in at all times, so the overall generation is faster!

#

unipc seems to work

#

I suppose ODE samplers must work out of the box

dry wave
dull star
#

I tried the CFGGuider

mortal mesa
#

i changed the guider to cfgguider, yup about tripled generation time with bad results at 3 cfg

dull star
#

yup

dry wave
#

I use diffusers. cfg 4-5 works great on the dev model

dull star
#

euler

dusky thistle
dull star
#

unipc_bh2

dusky thistle
#

bosh3 is good

dull star
#

I'll try that too

#

oops I forgot how slow they are

dusky thistle
#

yeah there's lots of extra model calls

dull star
#

holy moly its so slow πŸ’€

#

I remember it being slow but its way worse with this

#

my gpu is melting rn

#

not bad, but not worth frying my gpu for it and waiting 2-3x longer

sterile pendant
dull star
#

makes sense

#

I just expected non-lowvram to perform a little better

sterile pendant
#

you just lose some precision. like we engineers get memed on for "pi=3"

dull star
#

PI equals THREE πŸ’€

sterile pendant
#

(realistically, all of us know and use 3.14159)

#

but i still love the meme

#

or physicists using 10 for gravity lol

signal shuttle
#

I love flux

dull star
#

πŸ’€

sterile pendant
#

lmao

dusky thistle
dull star
#

same

dusky thistle
#

first was with zeroed out negative conditioning, second with a traditional negative prompt

#

k

dull star
#

just blurry mess

dusky thistle
#

it may be designed to just be all in on positive conditioning

sterile pendant
# signal shuttle

it does work well for fast approximations though. like if you tell me some circle has a radius of 3 and you want me to tell you the area, 3 * 3 * 3=27 is pretty fast vs 28.274

dusky thistle
#

and the very concept of cfg might not make sense with it

foggy cloak
#

Flux is actually pretty damn impressive, dogs on SD3 for sure πŸ€”

past thicket
#

Yep it's good.

sullen moss
#

Reality vs anime style mix test

real terrace
dull star
#

We claim no ownership rights in and to the Outputs. You are solely responsible for the Outputs you generate and their subsequent uses in accordance with this License. You may use Output for any purpose (including for commercial purposes), except as expressly prohibited herein. You may not use the Output to train, fine-tune or distill a model that is competitive with the FLUX.1 [dev] Model.

#

I was wrong about Dev

sterile pendant
#

dev is strict about it and schnell is apache 2.0 iirc

dusky thistle
#

good luck enforcing that lol

dull star
#

its like openai restriction

#

"cant use it to train competitor grrr!!! 😠"

#

but basically, we can use the generated images for commercial purposes IF we want to!

#

so I'm happy about that

dusky thistle
#

it's like saying that students can't become teachers cuz they're competing with their old instructors lol

torn wharf
hallow lion
torn wharf
#

"show us the dataset" and then if they are caught hiding it , that's pretty bad

#

so then it becomes ok so we'll only use some but at what scale can they hide it and still be useful

dusky thistle
#

tbh, i don't see that happening anytime soon

uncut river
#

does flux share their full dataset?

dusky thistle
#

if you demand to have a dataset opened up for copyright reasons, well... you better hope your own doesn't have a single copyrighted image

torn wharf
#

its a small clause that we scoff at cause like we care, but for midjourney or stability maybe, they'll calculate that risk very carefully

hallow lion
dusky thistle
#

and i sincerely doubt any makers of any major models are that clean

torn wharf
dusky thistle
#

exactly

#

i don't see anyone trying to enforce it

#

it's got potential to be suicidal

torn wharf
#

oh yeah its abig ol mexican standoff

signal shuttle
hallow lion
#

can comfyui lower those requirements hehe

signal shuttle
torn wharf
#

within day of release

signal shuttle
dry wave
#

I looked into the source code of diffusers and it seems that "guidance_scale" is not CFG

torn wharf
#

dev has cfg i think

dry wave
#

its more an additional parameter you can condition the model to

torn wharf
#

its a different guidance? waiting on that technical paper. its gonna be illuminating

dry wave
#

like "aesthetics" score in SD refiner

#

it just behaves totally like CFG which is weird

signal shuttle
dry wave
#

it looks so in the code, but I might be wrong

proven pecan
#

this is not what i had in mind when asking for an impasto oil painting, sd3 can do this better

torn wharf
#

pro isn't though

proven pecan
torn wharf
#

dev gets a very generic painted look with all the different painting styles i've tried so far

bitter hearth
#

if company A sues company B for training on company A's data
company B can't in court demand that company A reveals their dataset
I don't rly get that idea

dull star
#

knife palette painting?

bitter hearth
#

in court you can't just flip who is defence and who is prosecution

torn wharf
#

but i don't see it working 2 ways

bitter hearth
#

yeah

#

company A can compel the dataset of company B to be revealed

#

but not the other way around

#

that would have to be a separate lawsuit filed by company B against A

torn wharf
#

or if potentially the models came out at the same time, and b convinces the judge that they both stole their data, it might become suddenly relevant to discovery

#

judge/arbitrator/whoever

#

not a lot of precedence here but i could see that

bitter hearth
#

I can't really see it

#

its not impossible though

torn wharf
#

discovery is a 2 way street, but you gotta have good arguments to see

bitter hearth
#

yeah I don't think the argument is strong
to see company A's data
but judges have so much discretion that it is not impossible

torn wharf
#

i'm not a lawyer. i just watch fictional ones on tv

cunning lintel
#

still think output is as generic as possible in the definition to prevent a distilled model or things like that, not as a fraction of training data, that's unenforcable, esp when all these ais exists solely because training on anything is fair game

proven pecan
#

There is a difference in steps though 4 steps vs 8 steps

torn wharf
#

after 4 steps it be like "i'll give it more realism i guess"

cunning lintel
#

weird that the pro model has another knob to turn: Interval is a setting that increases the variance in possible outputs letting the model be a tad more dynamic in what outputs it may produce in terms of composition, color, detail, and prompt interpretation. Setting this value low will ensure strong prompt following with more consistent outputs, setting it higher will produce more dynamic or varied outputs.

bitter hearth
#

done on their model

torn wharf
bitter hearth
#

by the way the CADS node does this
(gives more variety for lower quality)

#

its not gonna be quite the same

#

but I have loved using CADS this week

cunning lintel
#

yeah, wonder how much of pro is pipeline vs better because original weights

proven pecan
bitter hearth
#

usually these closed ones are pipelines now

#

like stability AI ultra

#

or midjourney

#

both very much seem to either have noise injection or an upscale followed by a downscale

#

or some other methods of getting the complexity up

sacred jewel
#

What kinfd of performance are you getting? On my 4090, dev is been dog slow ... got it to go fast maybe once or twice but now it is slow.

And it fluctuates a lot. Sometimes 3 sec/it and then some steps up to 20sec/it

hallow lion
#

Everyday you come here you never know what will blow your mind...

sacred jewel
#

This one took 300secs

DEV, 20 steps.

frail shoal
proven pecan
# frail shoal what's the prompt ?

Alcohol ink portrait of a flowerlady with a cat in front of a filmstudio in L.A. Fluid and vibrant colors, unpredictable patterns, organic textures, translucent layers, abstract compositions, ethereal and dreamy effects, free-flowing movement, expressive brushstrokes, contemporary aesthetic, wet textured paper.

bitter hearth
#

gonna try it in SD3 and compare

proven pecan
#

I tried 24 steps to see if it would give me the film studio, but no

bitter hearth
#

in the last week I started getting better outputs out of SD3
by combining PAG, CADS and vector sculpt

#

I'm not saying its skill issue but SD3 without added nodes is nowhere near as good
its like a completely different model

fleet meteor
#

Anyone know if there is a int4 t5? πŸ˜…

frail shoal
frail shoal
cunning lintel
bitter hearth
#

I thought PAG worked with SD3 but maybe I got confused
I use like 50 different workflows each day so I get confused all the time

frail shoal
#

shit, almost good hands

bitter hearth
#

I will send you a CADS and vector sculpt workflow

frail shoal
bitter hearth
proven pecan
cunning lintel
#

why can pixart do styles while other new models cant, bit of a shame

dry wave
#

looked into the source code a bit deeper

cunning lintel
#

of the new dit ones pixart is most reliable for me (so, despite flux being great and all, looking forward to pixart next too :p)

dry wave
#

so the DEV and SCHNELL model both are no CFG models

#

only the PRO model is

#

the other models are distilled variants of the pro model

#

they have a "cfg" parameter, but this parameter is only "simulating" cfg

#

so they do not really do classifier free guidance, but they try to generate images that look like what the PRO model would give with this cfg value

#

which also means: we cannot use negative prompts with Flux 😦

bitter hearth
#

uh I would be sceptical about that

sacred jewel
bitter hearth
#

is it in comfy yet?

dusky thistle
#

when you get real close to running out it slows down like fuckin crazy

turbid grotto
#

honestly negative is not so necessary with such quality and adherence

bitter hearth
#

did you know with PAG
you can use SDXL fairly well with zero CFG
its interesting style

cunning lintel
#

not cfg negatives, but wasn't there other negative guidance (remember a bit way way back about it)

turbid grotto
#

Flux seems to be offloading to system ram but it is not using shared memory at all, what a point of it and can it bring any benefits?

dry wave
bitter hearth
#

there are other things than CFG out there
like Perp-neg

#

if you use the Sampler Custom Advanced node
you can use any guidance you want
you could write a python script to do some weird custom sampling method
its not common but its perfectly possible

sacred jewel
hallow lion
#

so flux is 23 GB... remember the times when 2GB for SD15 seemed steep -_-

dusky thistle
#

23.3 is where on windows i find everything stops

#

total vram in use, not just comfyui etc

sacred jewel
#

SHUGAR

dusky thistle
#

if you see that shit happening, minimize to your desktop

#

do shit like close extra browsers, windows with your output folder open, etc

cunning lintel
#

faux cfg 3 vs cfg 5
maybe less realistic styles just need low cfg to get better (like low steps as Eloyse showed)

dusky thistle
#

but min to your desktop will usually bail you out

dry wave
cunning lintel
#

i meant immitated cfg as kaibioinfo saidd no real cfg in dev model

sacred jewel
dry wave
#

but yes, you can probably just do classifier free guidance - it's a hack anyways. No diffusion model was build with cfg support

bitter hearth
#

I saw it on the comfy discord

dusky thistle
#

you just need to have less shit open lol

dusky thistle
bitter hearth
#

it would have to have quite a funny architecture
for CFG to not work

dull star
#

give us your nectar

sacred jewel
# dusky thistle shouldn't be needed

Thanks... yeah, down to 20 seconds 20 steps on DEV default. Had to close Chrome and Photoshop... now it runs ... I was wondering why I was getting FAST, then slow then FAST randomlyu... ooof

Thanks for the tip.

dull star
dry wave
#

yeah

#

the model seems to "hallucinate" not so much

#

like in SD you usually get weird stuff everywhere

#

too many chims and windows on a house and so on

sacred jewel
dry wave
#

in Flux you can even look into the background and most of the time it looks logical and clean

dull star
#

indeed

#

I suppose this is what a large parameter gets you

dry wave
#

I don't know. We do not really have a smaller parameter model to compare with