#🏞|general-with-images

1 messages · Page 153 of 1

wispy nest
#

flux dev

languid pebble
#

It's pretty goood!

wispy nest
#

its super fast

#

do we know the spec of the server ?

pallid ruin
#

It doesn´t appear, but it probably has an a100

wispy nest
pallid ruin
#

flux dev

languid pebble
wispy nest
#

can u share prompt ?

pallid ruin
# wispy nest can u share prompt ?

Mine was : "Cinematic detailed macro photography, extreme details and complexity,
amazing quality, masterpiece, best quality, hyper detailed, ultra detailed, UHD, perfect anatomy, magic world,
(kitten and fish), fish in the air, spell magic to get fresh fish as food,( fish jumping from magic book), energy flow,
a full body of a cute kitten, kawaii, wearing witches robe, witches hat, holding magic book, magic book on one hand, spell magic"

#

(stolen from civitai)

wispy nest
languid pebble
#

Aliens need some love, too ^^

wispy nest
#

same prompt replaced fish with dragon.

pallid ruin
wispy nest
#

this one more cute

#

Cinematic detailed macro photography, extreme details and complexity, amazing quality, masterpiece, best quality, hyper detailed, ultra detailed, UHD, perfect anatomy, magic world,
cute dragon spiting smoke that says (Hello!) the smoke is shaped as the world (Hello)

#

maybe more work on prompts and we get a smokey hello?!

fleet aurora
wispy nest
languid pebble
wispy nest
#

lol probably not

languid pebble
#

Like the Enterprise 😄

#

It's to expensive ...

wispy nest
frigid stirrup
languid pebble
wispy nest
languid pebble
#

Flux is the fun we wanted to get with SD3 ...

wispy nest
#

lol yeah pretty much

#

its just really aesthetic even with joke prompts

#

like your average random prompt is better than most SDXL

unkempt spindle
gentle blaze
#

I prefer lower values

languid pebble
#

Has anyone tried how high we can go with the resolution with Flux?

languid pebble
jovial tiger
# languid pebble Has anyone tried how high we can go with the resolution with Flux?

1344x768 gives the most reliably sharp images. 1536x896 is also good, but that's where you start to get occasional images that are blurry. I've found the higher res you go after that, the higher the chances of getting blurry images. I don't know why, it's just what's happened over the course of tons of images. Now I just stay at 1344x768 or some other 1 megapixel total image and i get the best results. That said, 1792x1024 (like what dall-e hq mode renders) is often really good, just be ok with random blurry images.

#

I was also able to do multi-stage ultimate sd upscale with dev at 1.2x with 0.15 denoise (i kept having to lower it) and it outputs crazy sharp stuff, but it requires many stages and it takes forever.

flat sand
#

yeah, above 1mp and the image quality goes down

languid pebble
#

Yes ... looks like it doesn't create any background any longer ...

jovial tiger
languid pebble
#

Text needs luck here ...

jovial tiger
elfin marsh
languid pebble
fleet goblet
mystic marlin
#

DualSense!

#

Used Flux-Dev

flat sand
jovial tiger
cobalt isle
#

which base models generated these two images? it's two different ones, that's my only hint

fleet goblet
mellow wasp
#

scentsy max melts simple logo

visual hemlock
#

I am trying to do an model for a user and I cannot get the full view of the model.. What is wrong in this prompt?

a woman with light green eyes posing with her hair in a bun, (full shot:1.2), bright brown hair, (realistic shaded perfect face:0.7), an airbrush painting, pointillism, 1girl, bangs, closed mouth, eyelashes, lips, looking at the viewer, realistic, solo, (whole body:1.2), low amount of freckles, (face:1.2), full shot, wide shot

nimble mason
uneven quest
#

This isn't an SD image per se, but #💬|general-chat doesn't allow uploads. Does the A1111 Docker template on Vast.ai allow you to pass HF tokens for gated model, like the ComfyUI template does?

uneven quest
# nimble mason

Genesis G70 but it's from Temu. Kidding, of course. Nice work

#

@nimble mason Mind sharing the prompts/environment/PC specs?

royal monolith
visual hemlock
royal monolith
# visual hemlock square AR?

aspect ratio. you're using a square. switch to tall rectangle. and don't use words like 'portrait' in your prompt

uneven quest
uneven quest
#

I have a 12400F, A770 16GB LE (think Nvidia FE), running W10 Education. Haven't tested SD locally yet, Vladmandic's benchmarks have the A770 max out at 10-11 It/s for 1.5, and 7 It/s for XL Turbo

nocturne oak
uneven quest
nocturne oak
#

Still breaks records.

uneven quest
nimble mason
wispy nest
nocturne oak
wispy nest
#

jesus lord

#

haunted peach

#

damn flux is cool

viral frost
#

Wow, I'm amazed! Considering how much I struggled trying to get an upscaling workflow with SD3 (and never really got it to my satisfaction), Flux just works! Because the image quality seems to break down over 1 MP when doing straight t2i (and because of the generally high memory usage), I decided not to try a hiresfix-like workflow and instead used Ultimate SD Upscale. I was concerned about @jovial tiger's comment about needing to upscale only 1.2x at a time, but I did it straight at 3x and it works GREAT! And there are barely any hallucinations and no visible tile seams, even without any controlnet in the picture. I can't believe it was so easy! The attached are the 3x upscale, a 2x upscale I tried first, and the original generation.

#

I used the bosh3 sampler through the ODE Samplers custom node and used Ultimate SD Upscale (Custom Sampler) to use bosh3 with that as well. Because I got bad image quality when trying to use regular CFG and needing to bring the Flux Guidance node into the picture, I set the USDU CFG to 1. Maybe this is why it worked so well for me. Plus 4xUltrasharp for the intermediate upscale.

jovial tiger
viral frost
jovial tiger
viral frost
#

It's definitely not the fastest, and I often have to wait for bug fixes or do a lot of troubleshooting before things work, but having 64GB of memory to work with really helps.

#

Though sometimes I wish I spent the extra money for 128 GB (though not really, it was an extra $1000!).

jovial tiger
nimble mason
viral frost
#

I've got an M3 Max and it's ~80% of the way there to an M2 Ultra, so I'm not complaining.

jovial tiger
#

so the only negative i can see, is on the bridge of her nose. it took some dithering that was on her nose in the original and expanded on it.

#

i've been using uni_pc for everything.

#

i just started using this one. I was using siax 200k before that but the random dots etc was a lot worse.

jovial tiger
nimble mason
viral frost
#

Oh, nevermind. I see it. Looks like it's based on SwinIR. When I used A1111, I didn't really like that upscale model.

#

In my testing with SD3, Ultrasharp tended to produce better detail for me than rESRGAN. I didn't bother testing SwinIR.

#

I used an upscaling workflow with SDXL extensively and learned that, at least with that model, the method of intermediate upscaling was crucial to the quality of the second-stage output. I started just using Lanczos rescaling and got more blurry images than sharp ones. A lot of people seemed to just use nearest or something like that, but that always produced blocky artifacts for me. But when I switched to rESRGAN (I used Invoke, which didn't support arbitrary upscaling models back then), everything cleared up quite nicely.

jovial tiger
#

definitely, although in this case, I'm not using any intermediate, just ultimate.

viral frost
jovial tiger
#

oh ok right.

wispy nest
#

SwinIR and Ultrasharp are quite old, you can get nicer results from stuff like HAT and DAT

#

they are in some ways just more modern versions of SwinIR

#

its got star citizen in the training data

#

but for Battlestar Galactica it has this

jovial tiger
jovial tiger
#

swinir has massively more detail

viral frost
#

Really? I see exactly the opposite.

jovial tiger
#

teh ultrasharp one is all smoothed out. most of the texture that's in the swinir is gone.

viral frost
#

Where are you looking in the image?

#

I agree that ultrasharp gets rid of the sparkles on the face.

jovial tiger
#

her face in the ultrasharp looks smooth like a doll, airbrushed.

#

there's almost no texture on her face at all.

viral frost
#

But I wasn't so hung up on the sparkles.

#

I don't think ultrasharp has realistic skin texture, but neither does swinir.

#

The pauldron detail looks a lot better to me.

#

The cup still isn't very detailed, but looks a bit cleaner to me.

jovial tiger
#

it's definitely cleaner on the ultrasharp

viral frost
#

SwinIR also gave the out-of-focus background a gritty texture all over that I don't really like.

#

Ultrasharp shows some weird chroma artifacts in the top left, though.

wispy nest
viral frost
#

I've been out of the loop on upscaler models, so not sure what the recent advancements have been. I'll give this one a try.

jovial tiger
jovial tiger
viral frost
#

Has a patterning in the OOF areas and some chroma sparkles in the upper left. But I agree that the in-focus textures look good.

rain gazelle
jovial tiger
wispy nest
#

if you liked DAT then the best model currently is ATD but it is slower
https://openmodeldb.info/models/4x-RealWebPhoto-v3-atd

#

and this HAT one is also very strong
https://openmodeldb.info/models/4x-Nomos8kSCHAT-L

jovial tiger
#

so the v3 is better than the v4?

wispy nest
#

ATD is better than DAT yeah

#

these are different types of model they aren't versions of each other

jovial tiger
wispy nest
#

nice

jovial tiger
#

running atd now

rain gazelle
nimble mason
lavish garden
nimble mason
nimble mason
nimble mason
nimble mason
languid pebble
#

Good morning coffee!

nimble mason
celest sigil
languid pebble
nimble mason
languid pebble
nimble mason
sterile kiln
#

Wow, just tried Flux, its unbelievable

languid pebble
sterile kiln
#

@languid pebble do you know if LoRA of something similar is coming for Flux?

languid pebble
#

There has been a post about finetuning about an hour ago ...

languid pebble
nimble mason
languid pebble
nimble mason
deft bison
nimble mason
celest sigil
languid pebble
steep sleet
smoky vigil
steep sleet
smoky vigil
#

I wodner if flux uses parenthesis and all those things we used for other models...

languid pebble
steep sleet
#

Flux.Schnell

steep sleet
#

I must try it

smoky vigil
#

yeah those

steep sleet
#

.. gone shopping! L8rs

languid pebble
#

Is it possible to read out the words a model has been trained on?

#

And maybe some statistics about it?

hearty violet
#

I got a good result with inpaint but wach time I generate the image the the height of the image has decreasedm why? I take the dimension of image with the auto detect size

dire cloak
dire cloak
#

yea well he is still one person and he propably was hung up on the term "distilled"

celest sigil
#

I think Lykon also said SD3 would beat it because it wasn't trainable.

dire cloak
#

they may be known names, but that doesn't mean they have the technical knowledge to make such claims. Reddit has lots of technical people saying that it is likely trainable

celest sigil
#

Anyway, the link still stands that there are people actively trying to fine tune and create loras for it.

celest sigil
dire cloak
#

definetly yea, but we can't make a surefire call yet until we get official answer (or reverse engineered)

dire cloak
devout dagger
# celest sigil I think Lykon also said SD3 would beat it because it wasn't trainable.

I do think SD3 medium will get more finetunes then Flux due to the ease of accessibility of SD3 medium, because SD3 medium is 2B parameter model and with SD3.1 will become a 2.5B model, a lot of people will gravitate to SD3 due to its smaller size and Lower requirements now compare that to the behemoth 12B model known as Flux, which has a Higher requirement to use and while training SD3 is cheap and possible on consumer graphic cards, Flux requires a couple of H100s to train a full checkpoint

celest sigil
#

Once bitten...etc.

devout dagger
dire cloak
#

Flux isn't optimized at all yet, so we should wait. finetuning a 12b model should be possible with a single 80g card if it gets optimized

devout dagger
devout dagger
devout dagger
dire cloak
#

yea maybe, but flux is an alternative for people that want to get the best quality. I also hope flux devs will release a 4b 6b or whatever at some point

devout dagger
#

They don't plan on making any smaller versions of Flux

dire cloak
#

i said at some point.

#

because i don't trust SAI a single bit

devout dagger
dire cloak
#

Runpod? I mean how many images do you want to generate

#

you can generate 1000s of images for 1-2 dollars lol

#

48 gig is more than enough for inference, and the card is 0.49 cents per hour

devout dagger
dire cloak
#

and lora training is also possible with 24 gig cards on flux

dire cloak
#

and wanting to run AI on your own device would not mean you dont have the option to use other services in the end

#

especially if you get a lobotomized model as alternative

devout dagger
dire cloak
#

any of these releases wasn't nearly as bad as SD3. All those prior models didn't have a real concept of anatomy because of training data lack, but SD3 got trained to misunderstand anatomy due to deliberate training data

#

And the worst part about the release is that SAI still hasn't acknowledged their wrongdoings

#

just look in their discord, post an image of garbled mess

#

and get called low skill low idiot bob

devout dagger
dire cloak
#

no, comfy explained

#

they added data to poison

devout dagger
dire cloak
#

before he left

#

he explained how they censored it

#

there is a screenshot in here somewhere from a few months ago

#

Thats why you have to use some really nasty negative prompts to get a somewhat okayish anatomy

#

they used random gibberish to nuke nudity from the model but messed it up so the concept was assigned to anything female

devout dagger
# dire cloak no, comfy explained

Well then mcmonkey (previous SAI dev that left with comfy) said something completely different, here is a screenshot from mcmonkey showing a image of SD3 medium before any safety tuning

dire cloak
#

it is specifically only for woman lying on grass

#

this concept wasn't trained. But nudity has been nuked via training data

#

anatomy does not get destroyed just because you don't have nude bodies in your training data

#

Also pretrain only introduces basic concepts. It is not supposed to know poses and so on

#

abliteration is what its called

devout dagger
dire cloak
#

yea and it makes retraining very difficult

#

so if they do the same with 3.1 its pointless

#

i hope not

#

but we will see

devout dagger
#

the more competition the better right?

dire cloak
#

yea espcially if the competition is insanely good lol

devout dagger
dire cloak
#

We hope for the best, i hope we will get their 8b model aswell at some point

#

because that seems to be actually competitive

#

and can be finetuned on 48gig cards according to some tool creators

devout dagger
royal monolith
devout dagger
royal monolith
#

however - that's the latest update

devout dagger
royal monolith
devout dagger
royal monolith
devout dagger
royal monolith
#

possibly, yes

devout dagger
# royal monolith possibly, yes

This method had worked for LLMS, but SD3 uses a DiT architecture, so maybe they managed to translate it and make it work somehow?, but we will have to wait until the model releases to see

royal monolith
#

not sure, but since the push everywhere is to find a way to run the models on mobile devices, i'm sure one of the top of the list research is how to make it small and still get the results

nocturne oak
nimble mason
pallid ruin
languid pebble
#

The advertising part came out a little small 😄

deep rapids
#

not just the style, but the content

#

in either case, sending ppf noise with its own random seed at 90% denoise allows some nice variation 😄

frigid stirrup
#

I can't run juggernaut x.

#

It's just processing.

hearty violet
#

futurist eiffel tower

still rose
#

Flux looking goated

#

Now it's a question can I run the fp8 on my M2 pro macbook pro (16gb ram) KEKW

still rose
wispy nest
#

flux is crazy

still rose
wispy nest
#

looks like a dick

#

flux creatures 😂

languid pebble
wispy nest
wispy nest
nimble mason
candid surge
#

flux-schnell

flat sand
nimble mason
languid pebble
candid surge
#

flux-schnell is very good with text, but not perfect

nimble mason
cobalt isle
# jovial tiger thanks. while I was waiting, I tried doing what you're mentioning. here's the re...

I fed your image to JoyCaption on HuggingFace, and then prompted Kolors with the result, worked surprisingly well.
"This is a high-resolution photograph in a fantasy style, featuring a young woman with short, platinum blonde hair and pale skin. She is dressed in a shiny, metallic silver armor with intricate details, including a high collar and shoulder guards. Her expression is intense and focused, with her eyes wide and slightly glowing. She holds a translucent, glowing purple goblet in her right hand, which emits swirling, ethereal purple mist. The background depicts a dimly lit, medieval stone corridor with arched doorways and lit candles in sconces on the walls, casting a warm, flickering light. The walls are made of rough-hewn stone, adding to the ancient, mystical atmosphere. The lighting is a mix of natural and artificial, creating a moody, magical ambiance. The textures in the image are highly realistic, from the smooth, reflective armor to the rough, aged stone walls. The overall mood is one of mystery and enchantment, with the glowing purple elements contrasting sharply with the dark, stone surroundings."

nimble mason
jovial tiger
languid pebble
#

And what is SD3?

#

"Keep it for kittens .... "

#

If they don't perform with SD3.1 they are off the market ...

languid pebble
#

And I am really sorry to say that. But others showed they can peform more with less money ....

cobalt isle
languid pebble
#

Both are pretty cool ...

cobalt isle
#

I guess SD3 Medium does work with ONLY T5 though, I think I tried it once

#

can't remember how results were

jovial tiger
languid pebble
#

You don't work for yesterdays hardware ...

cobalt isle
languid pebble
cobalt isle
cobalt isle
languid pebble
#

I might sound like a bad guy ,,, but often I start working with a model ... telling people here how to use it ... and later they trend ,,,

fleet goblet
#

I like this one more.

scenic yew
cobalt isle
# jovial tiger Yeah kolors does a really good job of styled midjourney style which is great a l...

i dunno man, I've been throwing people's Flux prompts at Kolors all day now
my conclusion is steering rapidly towards "Flux is really not even close to as good as a model with 12B parameters actually should be in comparison to everything else"
https://civitai.com/posts/5075351 Peeposhrug
the gap should be a whole lot wider than it actually is if you ask me

A post by diffusionfanatic1173. Tagged with illustration, photography, character, and woman.

royal monolith
wispy nest
#

blurry, bokeh, depth of field, cartoon, anime, 2d, illustration, traditional media, sketch, painting \(medium\), watercolor \(medium\), painterly, worst quality, low quality, normal quality, lowres, unfinished, low res, pixelated, jpeg artifacts, scan artifacts, simple background, bad anatomy, bad composition, bad proportions, bad perspective, bad arm, bad leg, bad feet, bad hands, missing finger, extra digits, missing eye, closed eyes, asymmetrical irises, cross-eyed, lazy eye, disfigured, deformed, broken, ugly, missing limb, missing arm, missing leg, extra limbs, extra arms, extra legs```
#

LOL

jovial tiger
#

Kolors, just like midjourney is stylized to the point of the image collapsing.

#

it's really good for some things, but for a lot of stuff it's just "too much".

wispy nest
#

massive negative prompt is not a fair comparison anyway

#

its harming both models too much

royal monolith
wispy nest
#

I wouldn't quite go that far
but in the majority of cases they are very harmful yes

cobalt isle
wispy nest
#

would recommend an empty negative prompt 99% of the time

cobalt isle
wispy nest
#

Pony might be different yes

#

because it has actual tags

cobalt isle
wispy nest
#

whereas for a photograph
someone would not caption a photo as "bad image"

#

my advice in general doesn't rly apply to Pony

cobalt isle
#

I don't see how Kolors is "too midjourney" either TBH, if anything Flux strikes me as having Dreamshaper Girl Face a lot
SD3 Medium actually IMO is the most "hard realistic" model going ATM

#

it often produces stuff that really looks like unprocessed reality

wispy nest
#

SD3 Medium actually IMO is the most "hard realistic" model going ATM 100% agree

#

SD3 medium gave the most realistic people I have ever seen

#

at least to my eye, Kolors is MJ style though

unique condor
# wispy nest its harming both models too much

Yeah, neither model should even really be using negative prompts and if you do, you only put in something specific to hard steer it if the positive prompt is having a hard time reliably managing it.

royal monolith
unique condor
#

People have to get out of the 1.5 mindset lol

#

Even sdxl doesn't need more than a few tags in a negative prompt usually

#

Almost everything you've seen of mine on here has an empty negative prompt

wispy nest
#

its hard to make a blanket statement because there are workflows that use the negative prompt differently

royal monolith
#

and everything i post has an empty negative prompt

royal monolith
unique condor
#

But people can't break the dumb sd1.5 negative prompt superstitions lol

unique condor
royal monolith
#

did you try prompting for that?

wispy nest
#

i mean remove them and have crisp image

#

the CADS node injects gaussian, exponential, or normally distributed noise into the unconditional, for example

#

(unconditional means negative)

unique condor
#

but only if it's not cooperating with me

royal monolith
#

yes. and the other thing people that use negative prompts fail to understand is that the AI isn't going to understand them if it didn't see stuff labeled with their terms when it trained. how many images were in its data set that were labeled with "poorly drawn hands' for example?

unique condor
#

exactly

royal monolith
#

so it's either going to ignore that, or just get the data for each word and go down rabbit holes they don't want it to

unique condor
#

you'll hear me say shit like "and how many images in the dataset do you really think were captioned with ___________"

royal monolith
#

yup. i do the same. far too many people think the AI has all the world's knowledge and they can just make stuff up, and it'll know not only what it is but what they actually mean

wispy nest
unique condor
#

poorly drawn hand would just get autocaptioned to something like abstract hand

#

so youd want to use terms like abstract instead

royal monolith
#

i'll tell people 'think of the AI as if it were an artist that has lived all its life in a dark box and only knows what it was taught. nothing else. and it can't read your mind

wispy nest
#

I haven't got it to work yet but the Perpneg node is meant to help a lot
it forces the negative prompt vector to be perpendicular to the positive prompt vector
so the negative is less harmful

royal monolith
unique condor
#

doesn't remove them all, but tones down their values

royal monolith
#

just learn how to prompt right and you don't need negatives almost ever

unique condor
#

imagine it more like pos/neg magnetic polarities

royal monolith
#

i'll start people off by telling them "what do you think the AI thinks your phrase "xxxxxx" means? go ask it. just use that as the only prompt and see what it does by default

wispy nest
#

I also stopped using positive prompts so much
I turn off the positive for the first 10% and the last 30% sometimes

royal monolith
#

and they come back in shock that their pet terms got them garbage renders

royal monolith
unique condor
#

you can also set the cfg to zero and zero out the positive prompt to see what kind of anti-image you're actually making with the noise

wispy nest
#

ah yeah I love the fully unconditional generation its like dreaming

royal monolith
#

yeah, but usually i'm talking to peopel that are using sites where they can't change most of the settings

unique condor
#

ahh true true

wispy nest
#

when I use negative now I try to only have it on for only about 20% of the sigmas, somewhere in the middle
and sometimes only send it to one block

royal monolith
#

me "your fine tuned model there is overfit for anime girls with large chests" them: 'no it's not" me: "i used the prompt man riding a motorcycle and got a woman. explain that"

wispy nest
#

yeah so many models overfit on women

royal monolith
#

so many people making models that dont' know what they're doing, too

wispy nest
#

I am the biggest PAG fan
PAG is absurdly strong
even a tiny drop of PAG like 0.3 can fix a broken image

unique condor
#

yeah, that's why i don't pay much mind to overtrained models. sure, they can produce really amazing results for certain tasks, but i make all kinds of random shit. i don't want my space ship having an ahageo face flying out of a dripping orifice instead of a wormhole...

royal monolith
#

merging two merges that were made from merging two other merges...

wispy nest
#

i just notice that animatediff ignores , sharpness prompt while image generation apply it.

cobalt isle
# unique condor But people can't break the dumb sd1.5 negative prompt superstitions lol

a lot of them weren't superstitions, they were just incorrectly applied in some cases, e.g. "masterpiece, best quality, high quality, normal quality, low quality, worst quality" specifically originates from how NovelAI categorized the images in their original dataset, and has very important meaning in all 1.5 models that descend from NAI. There are also SDXL models that were trained with the same sort of categorization. So it's really always been model dependent
that one quote from the SAI employee about masterpiece not meaning anything in Base SD 1.5 was never relevant, it was always missing the context / point

wispy nest
#

I mostly stick with Juggernaut because its nice and general

#

but I recently switched to Zavychroma because its more aesthetic

royal monolith
cobalt isle
#

prompting any anime SD 1.5 model without them though gives significantly worse results

wispy nest
#

I think for models specifically derived from Novel AI you have a point yes

#

like I was saying to the guy earlier Pony is also different

cobalt isle
#

so again it's really always model dependent

wispy nest
#

some SDXL model makers do bake terms in yes

#

for example Dutch Angle in chinook

royal monolith
#

and only used if really needed

wispy nest
#

another thing is to not run control nets for all the sigmas

#

you can often turn them off after 30%

#

if you run it for less long then sometimes that allows you to run it stronger

#

I don't actually use control net that much though

#

I never really know where to get the input images for control net

royal monolith
wispy nest
#

haha yeah the QR code thing is cool

royal monolith
#

yes it is 🙂

cobalt isle
wispy nest
#

if I get blender I could make depth maps in blender I guess

wispy nest
royal monolith
wispy nest
#

nice

royal monolith
wispy nest
#

ah okay

royal monolith
# wispy nest ah okay

you can actually do a lot in krita, you should search youtube for krita stable diffusion

wispy nest
#

if u need a depth video converter use amuse 2.0 by AMD

#

I think depth maps on low strength might be good for me because I could get different shape spaceships

#

krita almost seems more popular than gimp these days

#

I haven't tried it

royal monolith
#

krita is far easier to use than gimp

#

particle effects, 3d rendering, real paint program as in how you would really paint with real brushes and physical media, scripting, and AI now

#

and priced at the terribly high price of less than 30 bucks

wispy nest
#

krita has ai plugin that is crazy good , it support rendering of animation

#

thanks I didn't know about Howler

royal monolith
wispy nest
#

take alook at krita ai!

#

its photoshop ai made better +free!

jovial tiger
# cobalt isle what was the prompt for this one BTW

Hordes of zombies with glowing eyes and decaying flesh fill a massive stadium. A lone rock star with a spiked mohawk and ripped leather jacket plays a flaming guitar on a raised platform. Neon spotlights illuminate the sea of undead, their rotting arms swaying in unison. Fireworks burst overhead, raining sparks onto the zombified audience. Zombie mosh pits form near the stage, with limbs flying and heads rolling. A giant banner stretches across the top, reading "Brains & Beats Fest" in pulsating, green letters. Zombie roadies with exposed bones push giant speakers. Fog machines spew green mist, creating an eerie atmosphere. Skeletal hands reach up from the crowd, grasping at the air. Some zombies crowd surf on top of their undead peers. The drummer, a half-decomposed corpse, pounds on a set of skull-shaped drums. Zombie backup singers with microphones groan into them rhythmically.

wispy nest
wispy nest
#

upscaler sampling has instane detail.

#

sd 1.5 btw

hearty violet
#

how can I get the dimension of this image and put there?

wispy nest
#

if u can save it as image then read them dimension and put them there

royal monolith
hearty violet
#

I will try

#

how can I put more then one controlnet here? When I make a copy and link the other one go out

flint grail
#

Guys I think I found a way to get flux to follow styles more, put a CFG < 1

nimble mason
wispy nest
nimble mason
wispy nest
#

faces are brilliant with FLUX ..

nimble mason
wispy nest
#

nah

#

tried cascade too

#

and kolors

#

this FLUX has huge potential but training will take a lot of resources

#

and im just using the schnell version of FLUX .. the Dev version has better quality from what ive seen

#

running dev on my system lags my pc

nimble mason
flint grail
#

it means that people will likely finetune this model instead of dev

#

for lora it could be fine I guess? because I believe that a schnell lora could be compatible with a dev lora? not sure about that though

lost holly
#

where can i use it

wispy nest
#

i wouldn't use the word worst to draw a comparison, schnell beats sd3 in many ways, but dev version has better detail

flint grail
#

it has better detail and better trivia too, it just knows more stuff and from what I've seen, people are only using dev right now, no one care about the "inferior" version

royal monolith
flint grail
#

if someone wants to finetune schnell, I think its main goal would be to get a "finetuned schnell" that would be genuinely better than "base dev"

wispy nest
#

inferior version?

#

the schnell has a pretty decent standards

royal monolith
flint grail
#

it is inferior

#

it's like sdxl and sdxl turbo

#

we all know turbo is worse

wispy nest
#

you are talking out of your ass dude

flint grail
#

never said it was bad

wispy nest
#

go sulk somehwere else

flint grail
#

lol are you serious?

#

you're the one saying a lot of bullshit right now

wispy nest
#

yeah absolutely

#

you are just spewing words w/o logic and blindly

flint grail
#

if schnell was "equal" to dev, people would use that one instead

#

because we can go for only 4 steps on schnell

wispy nest
#

did i say equal you are stupid

#

troll actually

#

stfu please

royal monolith
#

tha'ts the difference between them. they aren't inferior to each other. one is not better than the other.

wispy nest
royal monolith
#

there's the github page

wispy nest
#

and just to be fair yuri schnell is designed to work on pc that don't have high end gpu

royal monolith
#

and the distilled variant is in a lot of cases, better unless you are a serious researcher

flint grail
#

schnell is fast, but gives worse quality

wispy nest
#

there you go again spewing nonsense

#

worst .. inferior.. you are blunt

royal monolith
#

they ahve different uses

flint grail
#

in terms of overall quality, schnell is worse than dev, or else everyone would be using schnell right now, who would want to go for 20 steps when you can go for 4 on schnell?

#

there's a reason they decided to not give apache 2.0 licence to dev, it's the best local flux version of both two

royal monolith
#

@flint grail who do you actually work for and why are you actually here? this isn't the black forest discord. go cause a battle over there

wispy nest
#

i have a pretty good impression of your mindset yuri.. now stop spewing garbage

flint grail
#

what the fuck? I'm trying to give some arguments there, all your responses are ad-hominems

royal monolith
flint grail
#

you're the troll in this case

wispy nest
#

that's not a constructive argument

#

schell by it's low resource requirement still sets a decent standards

flint grail
#

stop acting like a child

royal monolith
#

almost no one's using dev right now. they're not even trying to figure out how to train it. they're all working on Schnell

wispy nest
#

im judging by result

#

while you are spouting blindly

#

somone must have triggered your vulnerable spot about flux models

flint grail
#

you're the one spouting blindly, I've tested the both of them and I find schnell just worse in quality

royal monolith
#

it really doesn't matter though, this is the stable diffusion discord, not the black forest discord. so if you have a real argument, take it to black forest

flint grail
#

then tell me why dev doesn't have the apache 2.0 licence?

royal monolith
#

or L2 where they're developing

wispy nest
#

the results how you are framing inferior is rather blunt headed

flint grail
#

why do they don't want us to finetune that model?

royal monolith
royal monolith
flint grail
#

that's my point

wispy nest
#

and tbh after testing flux for 2 days now i think schnell sets a pretty decent bar for high standards among most other models out there

flint grail
#

schnell is faster, that's a big advantage for everyone

#

that I can agree on

royal monolith
flint grail
#

I didn't know there was a flux discord, yeah I should check that out, sorry if I went off topic for too long

wispy nest
#

what i was telling you is dev has better detail than schnell but that's not to say schnell is inferior ... do you understand how that sounds when you call schnell inferior...lol

royal monolith
wispy nest
#

schnell beats even sdxl.

#

and we are talking about FLUX base model

flint grail
wispy nest
#

i dont agree with your bs one bit

#

you keep twisting word with crappy mindset

flint grail
#

and I don't agree with your bs one bit either

#

what now?

wispy nest
#

sure dont have to agree with me

royal monolith
wispy nest
#

and dont expect me to call schnell inferior

flint grail
#

I don't expect you anything

#

but I expect you to stop insulting people who disagree with you

wispy nest
#

good enough

flint grail
#

that's simple enough right?

wispy nest
#

dont butt in then

#

i didnt ask you come lay your bs over my post

flint grail
#

this is a public discord, not your house dude

#

people have the right to talk and disagee with your suggestions

lost holly
#

is there a free SD to use?thanks

wispy nest
#

sure .. but have some common sense when you go around calling schnell inferior that's bs and not acceptable

flint grail
#

why do you act like I insulted your mother or something? if I want to call schnell inferior and you're triggered by that, then you have a serious problem

wispy nest
#

there you go again

#

stfu yuri

flint grail
#

you stfu

wispy nest
#

you got a stenchy view of life im sure

flint grail
#

I'm not the one insulting people becaue they dared to disagree with my point of views

#

you have some serious anger issues

wispy nest
#

you talk horse shit and blame others

#

you need hard reality slap dude

lost holly
#

where can i find SD bot?

royal monolith
royal monolith
lost holly
royal monolith
lost holly
royal monolith
lost holly
royal monolith
wispy nest
royal monolith
wispy nest
unique condor
#

So they don't use negative prompts

wispy nest
unique condor
#

You tweak the guidance and the shift

#

Cfg should always be one if you're using a regular ksampler

#

You adjust the guidance on the node between the model and the ksampler

wispy nest
#

that's what im using set at 1 but not using guidance in this workflow altho i have other custom workflow with guidance set at 3.5

unique condor
#

Default guidance is 7 iirc, at least it is on their official version on diffusers

royal monolith
wispy nest
#

the workflow that i used sets guidance to 3.5

#

and that's intended for FLUX

#

is there a full paper out on Flux yet?

royal monolith
unique condor
#

Since it's distilled, you should think of it like a turbo or lightning model where they'd normally use a cfg of 1-2 and no negative prompt. That's the whole point of distilling them: to weed out garbage and not need a negative prompt to steer it away from the garbage

wispy nest
#

i have no complaints about negative either way... so far with FLUX and w/o negative results are pretty good

viral frost
# wispy nest and this HAT one is also very strong ```https://openmodeldb.info/models/4x-Nomos...

@jovial tiger After yesterday's discussion about intermediate upscaler models for Flux, I did a detailed comparison with my test image (with seed fixed on 2nd stage to keep things as consistent as possible). Note that this is a single image, though I find its combination of features to be useful, and most of my comments are nitpicky. If you saw each image in isolation, you might not have any objections.

I compared five models: 4x-UltraSharp.pth, 4xRealWebPhoto_v4_dat2.pth, 4xRealWebPhoto_v3_atd.pth, 4xNomos8kSCHAT-L.pth, and 4xNomosWebPhoto_RealPLKSR.pth. tl;dr, I still prefer ultrasharp, as I find its detail to be prominent, but controlled and realistic. The others still work mostly fine except for the DAT, for a specific reason I note below. I did my comparisons at 200% zoom to concentrate on the details.

4x-UltraSharp.pth
Low contrast areas are a bit featureless
Texture is convincing, not repetitive or smeared
Oversharpening of high-contrast edges (branches, man) is suppressed
Slight noise over entire image, not preferred but not objectionable
Retains details in silhouetted branches in upper right
Realistic detail in brush on lower left/right
Alien's right hand has deformed first finger

4xRealWebPhoto_v4_dat2.pth
Low contrast areas are featureless
Skin texture not so convincing, looks like a carpet
Oversharpening of high-contrast edges NOT suppressed
Noticeable noise over entire image, noise has vertical stripes -- objectionable
Silhouetted branches in upper left & right have no detail
Brush in lower left has slightly less depth (more "hair fan" than "hair ball")
Taller shrubs are denser
Dirt and rocks at the bottom are slightly grittier
Alien's right hand is a bit less deformed

#

4xRealWebPhoto_v3_atd.pth
Low contrast areas are a bit featureless, similar to Ultrasharp
Texture is convincing, a bit less pronounced than Ultrasharp
Oversharpening of high-contrast edges is pronounced
No noise over entire image! Very smooth.
Silhouetted branches in upper right look completely flat & sketchy/outlined
Brush in lower left has slightly less depth, but lower right has a bit more
Taller shrubs are too dense/twigs are chunky
Dirt and rocks at the bottom are slightly grittier
Alien's right hand is a bit less deformed, forearm is blurry
Right foot is a different interpretation, has a hole in it
Man's right hand is a bit garbled

4xNomos8kSCHAT-L.pth
Low contrast areas are a bit featureless, slightly better than Ultrasharp
Texture is okay, better than the DAT, but more generic than Ultrasharp
Oversharpening of high-contrast edges only slightly worse than Ultrasharp
Slightly more blanket noise than Ultrasharp, not objectionable
Silhouetted branches have no detail, but more coherent than ATD and DAT
Brush in lower corners is a bit flatter than Ultrasharp
Taller shrubs are denser, but twigs still thin -- looks natural
Dirt and rocks at the bottom are less contrasty than Ultrasharp
Alien's right hand is less deformed and arm is still in focus

#

4xNomosWebPhoto_RealPLKSR.pth
Low contrast areas have more detail
Texture is very generic/checkerboardy, too pronounced
Oversharpening is not suppressed
Pronounced blanket noise, objectionable course grittiness to fog & hills to right of alien
Silhouetted branches retain more detail, but less than Ultrasharp
Brush in lower left is flatter, but lower right is more contrasty
Taller shrubs are denser
Dirt and rocks at the bottom are slightly grittier
Alien's right hand is less deformed, arm in focus
ALien's right foot has a hole in it

wispy nest
#

also taken sdxl examples, the key difference i feel between sdxl and flux is that flux looks more organic and natural

unique condor
#

literally 6x the network size of sd3 2b. size doesn't always matter and all, but still, it's a lot more room to store information

wispy nest
#

sd3 had huge pontential too if it wasn't for botched up anatomy, but in general the quality was pretty vivid

#

what got me pissed about sd3 is that every few render with female models things got f*ed up

#

and this FLUX 0 cherry picking

royal monolith
unique condor
#

100%

#

the only issue with distilled versions though is that they get gimped on variety to a degree

wispy nest
#

FLUX also has a very high aesthetic standards for a basic base model

unique condor
#

but it's a good tradeoff

royal monolith
# wispy nest sd3 had huge pontential too if it wasn't for botched up anatomy, but in general ...

it's not anatomy. it's all subjects. the farther you get from straight in front of the camera, the more the subject warps, stretches, shrinks, and the AI starts trying to draw it from multiple points of view at the same time. the issue is in the core of the architecture and flux has it too. but flux is so huge it's almost not noticeable, where 2b medium is so small, it slams it in your face

wispy nest
royal monolith
wispy nest
#

although i dont have technical analysis from programmer's pov

viral frost
wispy nest
#

lol

royal monolith
wispy nest
#

the lexica dataset is AI generated
which might help with your golem thing

#

the photo upscalers expect a photo really

#

with transformer upscalers you can't go too far out of their training distribution

#

they are not as broad as diffusion

wispy nest
royal monolith
#

also look at the stretching on the wings of this bird, and it's leg and tail are so shortened they're almsot totally gone

#

as opposed to this bird that is directly in front of the camera

royal monolith
smoky vigil
#

SD3 failed on everything except speed and backgrounds/landscapes. Emad dropped the ball.

wispy nest
royal monolith
wispy nest
#

i think Emad had already left the company before SD3 was released

smoky vigil
#

I know but hell be their maskot forever.

royal monolith
smoky vigil
wispy nest
#

lol you have those gif still

smoky vigil
#

🤣

wispy nest
#

not fine tuned but they released a single checkpoint version of Flux Dev

royal monolith
wispy nest
#

well the size shrunk to 16gb

royal monolith
#

good 🙂 - wonder what they did

wispy nest
#

i was trying it out few hours ago but unfortunately it lags my pc

royal monolith
#

ah. it's fp8

wispy nest
#

cant offer technical detail but images came out pretty good

#

fp8 but dev version

royal monolith
#

they didn' create a model card, either.

wispy nest
#

no

royal monolith
#

probalby discussion about it on L2

wispy nest
#

probably they rushed the relased idk

royal monolith
smoky vigil
#

Flux flex.

#

No hype, all delivery, that would have been good even if we had to wait 2 weeks after the initial announcement.

#

Sora will also probably disappoint everyone when it drops, if it does.

pallid flax
unique condor
# royal monolith good 🙂 - wonder what they did

They just quantized the transformer to q8 with the en8fpztuehs w/e the hell format and then saved it. Comfy made a version for both flux models and packed the same q8 version of t5, clip L and the vae

#

So it's an all in one vs having to have models in the unet/clip/vae folders

#

oh and the nice thing about it is that it not only saves some disk space, it saves loading time if you plan on using it in q8 anyways. otherwise, you have to load the full fp16 model and quantize it on the fly, which takes up a bunch of ram and time.

wispy nest
nimble mason
pallid flax
nimble mason
celest sigil
#

Soon...

pallid flax
nimble mason
steep sleet
steep sleet
#

Flux.Schnell

nimble mason
steep sleet
nimble mason
steep sleet
#

... if I was a psychiatrist ...

#

A Satyr

nimble mason
celest sigil
nimble mason
gray canyon
celest sigil
nimble mason
nimble mason
nimble mason
rain river
#

"AI art is the future it's so good."
AI art:

nimble mason
languid pebble
#

Good morning coffee

celest sigil
celest sigil
wispy nest
celest sigil
languid pebble
celest sigil
celest sigil
shut sinew
sour quarry
#

Bedroom

sonic bay
wispy nest
candid surge
wispy nest
candid surge
wispy nest
royal monolith
#

someone needs to go eat

languid pebble
#

😄

wispy nest
#

food porn for the eyes, no need to eat

wispy nest
wispy nest
languid pebble
wispy nest
wispy nest
#

i now prompt based on real life event, i copy pasted a youtubr video title here.

spring mist
#

nice

wispy nest
wispy nest
#

such a massive noticeable increase in image quality on this channel as soon as Flux came out

languid pebble
frigid stirrup
wispy nest
#

if it looks good then its Flux Dev

royal monolith
wispy nest
frigid stirrup
analog goblet
#

any idea what FLUX wants here ?

wispy nest
#

its a rectified flow matching architecture

frank drift
candid surge
nimble mason
edgy shard
#

Flux is really impressing me

#

Above was simply a chunk of meteorite with ruby in it

earnest cargo
clever oar
nimble mason
#

emo lol

clever oar
#

its hybrid emo cat and rat 😃

mortal idol
#

Flux Dev/Comfy I've never used Comfy before and the learning curve is steep! But these turned out ok.

deft bison
wispy nest
deft bison
earnest cargo
clever oar
viral frost
deft bison
unique condor
# earnest cargo

Reminds me of a book I used to have when I was a little kid, but I can't quite remember which one it was

cinder arch
earnest cargo
earnest cargo
#

The model is the base sdxl

cinder arch
unique condor
jovial tiger
burnt parcel
wispy nest
#

im feeling too lazy to write up complex prompt

#

also these pretty model faces soothe my mind

burnt parcel
flint gust
languid pebble
#

Good morning coffee! 🙂

visual hemlock
#

This is my first prompt A girl walking a dog with rain coat on
but then I like the bottom right most and I press "vartiation"

#

I then get these images instead

#

Can I do something like this in stable diffusion automatic 1111?

#

When I use **Seed: -1 ** most of the seeds are just 1 number away from each other..

royal monolith
wispy nest
#

a1111 has variation seed feature

#

I think that might satisfy you

visual hemlock
#

Sorry, I'm running SD on my local PC

royal monolith
#

you'd want to use "cell shaded, line art, cartoon," at the very least

visual hemlock
wispy nest
#

could you use comfy UI?
this would be possible there
the structure of the image has already been set which means this was done via a noise injection after the first 30% of sigmas or so

#

noise injection can be done either via a ksampler (sometimes called unsampling, and sometimes called flipping the sigmas) or by just adding noise to the latent directly

royal monolith
visual hemlock
#

I made 2 images and there is only +1 different

#

Image 1: 978357606
Image 2: 978357607

royal monolith
#

it's automatic and with stable the seeds are rarely just one number different

visual hemlock
wispy nest
#

maybe trying putting 0 as the seed number

#

a lot of UI interpret 0 as "make seed fully random"

royal monolith
visual hemlock
royal monolith
visual hemlock
#

If I do with 4 images they are 0 - 1 - 2 and 3..

royal monolith
#

that sounds very odd for a1111

visual hemlock
#

So with -1 .. it will be a random seed but the next seed is just +1 from the previous

royal monolith
wispy nest
#

if I remember rightly this might actually be the default behaviour

#

its not a good default though

#

if you want the next image to be random you have to run it with -1 in the seed box

#

either -1 or 0 I can't remember which one

visual hemlock
wispy nest
#

yeah I think this might just be how it goes

#

if you tick the extras box you can try variation seeds
you might prefer that

visual hemlock
#

I tried playing around there but was the same, you know how it works?

wispy nest
#

last time I used A1111 was 12 months ago and its a pretty foggy memory now

languid pebble
steep sleet
wispy nest
#

ADetailer is good yeah

#

its a bit like face detailer in comfy impact pack

steep sleet
dry crow
steep sleet
wispy nest
#

yeah its quite a common feature

carmine isle
#

how to use it

wispy nest
#

there's a few different ways of doing it
tends to be a second latent added in but with the magnitudes set by the "variation" slider

#

but you can do it via noise injection at later sigmas also

wispy nest
visual hemlock
wispy nest
celest sigil
celest sigil
ornate coral
#

@celest sigil Astro Boy style cat

#

@wispy nest Astro Boy style cat

ancient bronze
#

Y'all make sum wild shit and I just make anime tiddies

languid pebble
#

Peace, Art and Rock'n'Roll ... 😄

languid pebble
#

Hi Dicordos

clever oar
languid pebble
#

CivitAI is advertising for Flux 😄

limpid lichen
#

in the new versions of reforge, there are schedulers called align your steps git, align your steps 11, align your steps 32. i'm trying to figure out wtf they are , but now my conspiracy theory that Google is borking it's search results on open source ai topics, is blazing hot.

#

okay, so i saw one comment on one post about align your steps for 10 step generations. i'm thinking thats what align your steps 11 would be for. lower step gens.

#

still jsut theorizing

spring mist
#

noise scheduler

limpid lichen
#

i know what noise schedulers are doing, and i know what align your steps is, but there are 3 other varients in reforge that are confuddling me

#

"ays 11" seems to be doing well on low steps. thinking thats that one so "ays 32" should do better on higher steps and bad on low steps.

#

nope. ays 11 and ays 32 actually produce identical results on the same seed

#

which are also identical to the base AYS scheduler too. so i'm thinkign ays, ays 11, and ays 32 are all the exact same code. AYS GITS though... that does differnt images on teh same seed

limpid lichen
#

nope not actually at high steps just low steps it does. so i'm just going to nix those from my ui and ignore it

#

think i only need the base AYS

limpid lichen
languid pebble
earnest cargo
languid pebble
frank drift
frank drift
#

I find it SUPER irritating when you get something beautiful but you're deliberatly trying to do something and it's not working. Like, this is great, and I love it, but. I didn't want this. So I am irritated.

deft bison
viral frost
#

Flux just looks sooo good.

wispy nest
#

wow I love this one

#

sci fi is my main use case

floral flicker
#

It’s too bad it is so damn heavy to run local

celest sigil
calm vortex
#

I really am new here and not familiar with the community rules. Please let me know if what I posted is inappropriate. I will delete it if so😀

#

Any idea is welcome~

unique condor
#

So you want us to help you help some company predatorially manipulate and con middle aged people with some bogus dating app...?

candid surge
#

I'm out, not helping here.

calm vortex
#

of course not…Just exploring how to apply the tech into ads field and ads care about CTR and conversion. Sorry for making this uncomfortable impression. I will delete it then

candid surge
#

oh, so you want to design clickbait

languid pebble
#

Good morning!

calm vortex
# candid surge oh, so you want to design clickbait

Not really, unless you say all ads are clickbait. I was trying to reproduce some pics that have been proven as good cases in ads. Different industries want pics of different styles. So I am curious how to make it work.

wispy nest
#
  1. training a lora
  2. canny edge and depth control nets
  3. a bunch of IP adapters chained together
#

I didn't see the question but that will be the answer anyway

calm vortex
#

Thanks~

candid surge
#

If you want realistic photo style of average persons, do NOT use flux as it usually gives highly smooth polished photo, like top model style

#

flux-schnell, online

abstract spade
calm vortex
abstract spade
#

they are all top model in generating Caucasian, but not much so regarding Asian in general , the only problem is the face attributes diversity is flat

celest sigil
#

Cascade...