#✨|sdxl

1 messages · Page 158 of 1

high skiff
#

usually DDIM for speed tho

sharp robin
#

not too slow

steady grove
#

waifdoom

pure crystal
sharp robin
slender coral
#

Best prompt to get products to position directly in the camera?

high skiff
#

OMG

#

my training exploded on the last epoch lmfao

#

I have never seen anything like this

#

second to last epoch

#

last epoch lmfao

vale eagle
high skiff
#

what in the everloving fuck happened in the last epoch lmao

severe sapphire
#

I'm not sure if I can ask here questions about sdxl but I'll give it a try.

https://m.youtube.com/watch?v=zzA1iUgtiEs

Will this let me use sdxl more easily? I haven't seen the whole video yet due to university work and I won't be able to give it a look for a long while. Can anyone tell me if with this trick I'll be able to use my 2080S (8Gbs) with sdxl models? I know it's literally in the thumbnail but did someone try it?
Thanks :)

#stablediffusion #a1111 #nvidia #update #cuda #cudaerror #lowvram #kohyass #LoRA #dreambooth #tensorRT

(Update: while the update is able to solve CUDA memory errors, I have seen it to be very slow with SDXL... it is not very practical to use with low VRAM... works, but slow, hopefully in the next update, we get better performance than the curre...

▶ Play video
sweet abyss
#

Can you suggest me a good workflow to turn real life objects into anime style objects ? When using my usual workflow, with high denoising the details are wrong and with low denoising they does not look anime a lot.

noble shoal
sweet abyss
noble shoal
sweet abyss
#

results look a lot better with controlnet in comparison using img2img, thank you @noble shoal

upbeat summit
half cedar
#

In img2img workflows I've been taking the input image and sharpening them + adding a little noise and the resulting quality difference is apparent. Easy trick

crisp owl
crisp owl
hasty smelt
lusty wolf
#

Does anyone know if there is a SDXl Controlnet 'Tile' model arround?

lusty wolf
lusty wolf
noble shoal
#

Epic misfocus

agile quarry
#

I really like that sampling->custom_sampling nodes exists in ComfyUI but idk what I'm doing. could anyone direct me to papers on samplers, schedules or something that could help me understand the options available ?

crisp owl
#

Thor reluctantly agrees to wear a wedding dress to fool the Utgard giant king in an attempt to retrieve his stolen Mjolnir hammer

Took a bit to get this image lol

floral island
#

shameless model post ^^ https://civitai.com/models/148871 finally decided to release a version 1.2 of my model 😄 and it's pretty fucking good. I've not had many fail cases

slender coral
cyan crown
upbeat summit
cyan crown
vestal jay
cyan crown
floral island
cyan crown
steady grove
floral island
steady grove
#

i love it!

#

i feel like graveyard keeper inspired CA to make a spooky game that was more wholesome

cyan crown
lusty moss
#

Looks like she hates her side job lol.

cyan crown
slender coral
#

generally speaking, should I always be batching?

crisp owl
#

as in?

slender coral
crisp owl
#

Depends how confident you are you'll get the image you desire with your prompt or not

slender coral
#

It seems to add 15 seconds for 3 images compared to 1

crisp owl
#

oh in that regard. I dunno. I can only do 1 image at a time

crisp owl
#

If I could do batches, I would

cyan crown
high skiff
#

@visual glade Any plans to add LCM Scheduler to ComfyUI by chance? I have some ideas I would love to mess around with

cyan crown
slender coral
#

I have a upscale node that I want batches in, how would I be able to connect these OR is there a node that I should be using?

high skiff
#

My newest test realism LoRA absolutely fucken SLAPS

#

base vs with the LoRA

#

base vs with LoRA

fierce hollow
slender coral
#

So I'm wondering how I would add batches to my prompts.

fierce hollow
#

uhh do you mean like "Repeat Latent Batch" but for images?

#

I don't think there's a node for that by default actually

#

like so?

slender coral
#

Or is it batching the entire process into one prompt and outputting 5 images like your example?

fierce hollow
#

a batch like above would do something with all 5 images at once

#

Batch count in the menu would just queue up x prompts one after another

slender coral
fierce hollow
#

if you're trying to upscale the image x times then I guess so, but that node doesn't exist in base comfy afaik

#

there's probably something like that in one of those huge packs

#

or I can put it on gist or something

slender coral
#

I have this in another workflow,

#

But data

#

😄

slender coral
fierce hollow
#

yeah, not in any folder, just next to example_node.py.example

slender coral
fierce hollow
#

no problem (if you mean the archived repo then it's not really updated anymore but I guess if it still works that's good)

slender coral
high skiff
#

@bright valley I am now officially being sponsored by a company. They will be paying for all of my compute in the cloud, and we are discussing a monetary value for the work as a whole

#

As well as partnerships on many, much bigger things

#

So yes, you CAN be supported off of making LoRA's, granted their experience with my workflow gave me a big leg up, as well as my resources

bright valley
#

but I hope whatever it is works out great for you

agile quarry
#

Comfy? or anyone. I really like that sampling->custom_sampling nodes exists but idk what I'm doing. could you direct me on what I should be searching for to learn more about samplers and schedules and what each variable means

bright valley
hoary saddle
#

Anyone run into comfyroll nodes broken with latest comfy update?

high skiff
visual glade
#

it's a sampler not a scheduler

agile quarry
high skiff
#

Thank you, I will mess with it in a bit when I am able to

agile quarry
#

I think most of my answers might be in diffusers documentation

bright valley
#

dude that thing is nuts

#

SDXL image

#

2 steps

#

2

slender coral
#

is it weight or bias or something else that leans towards image / text prompt?

lusty moss
south horizon
#

just regular ol' sdxl

upbeat summit
bright valley
#

@lusty moss Doom is an activation word in my model

#

It's very weak in the training, so I don't list it on the model page, but it still works kinda!

lusty moss
bright valley
#

grockster's took heavy influence from it as well

lusty moss
steady grove
#

@bright valley cool lora!

hoary saddle
#

did someone have a workflow that would randomly choose an item from multiple text files to build strange prompts?

#

could really use one but struggling to figure out how to build it out

west breach
#

that will randomly select between the 3 files

half cedar
bright valley
hoary saddle
steady grove
#

it works really well paired with other loras

bright valley
#

Yes I agree, although I've only tried a couple personally, I see people make all kinds of wild stuff with them on discord/civit

steady grove
#

yeah it doesn't seem to overpower other networks so easily. off doing its own thing in the xl sized latent spaces

bright valley
#

oh this one was pretty cool too

agile quarry
#

LCM-LoRa + IP Adapter

half cedar
#

Nice

sharp vale
#

tried SDXL for the first time today

pure crystal
#

https://huggingface.co/blog/lcm_lora "Latent Consistency Models (LCM) are a way to decrease the number of steps required to generate an image with Stable Diffusion (or SDXL) by distilling the original model into another version that requires fewer steps (4 to 8 instead of the original 25 to 50)"

agile quarry
pure crystal
#

img2img fail, the face is obvious - look at the lion's front paws

floral island
#

fueled by coffee

vital ermine
lusty moss
noble shoal
lusty moss
lusty wolf
lusty wolf
lusty wolf
peak dove
#

Harrologos2 LoRA is a great improvement!

wet nacelle
lusty wolf
lusty wolf
cyan crown
#

oot : Dalle3 also now has IPAdapter

#

hot mods

#

in chatgpt

shy kelp
#

does making longer, more detailed descriptions also help in lora making like it does in model training

cyan crown
#

yes if it's a question

steady grove
lusty moss
vale eagle
lusty wolf
crisp owl
#

That's bad ass

vital ermine
loud glacier
#

Has anyone had success or know of LoRa's for generating DnD battlemaps ? I have tried prompting for things like: top down view, birds eye view, digital battlemap, etc. with not great success.

pure crystal
slender coral
#

At the moment is there anything that supports transparency? If I use an image with transparency there's no workflows that will infer that and use it?

slender coral
crisp owl
#

It's a video clip of realtime image generation based off basic shapes

pure crystal
mellow tendon
vital ermine
smoky thicket
agile quarry
bright valley
vital ermine
#

Aeroplanes are just as hard in XL as always.

#

An I said An Airplane

#

I am not even sure what those are anyway, lol

boreal bough
#

Original Juggernaut
young girl / young woman / woman / older woman

#

Juggernaut + My lora
young girl / young woman / woman / older woman

zinc cargo
kindred shoal
boreal bough
#

jk. works great when not mixed with general-purpose loras

zinc cargo
#

@boreal bough you early adaptor you 🙂

upbeat summit
boreal bough
boreal bough
vital ermine
zinc cargo
sharp vale
zinc cargo
sharp vale
zinc cargo
pure crystal
#

DMT elf, SDXL

vital ermine
native knot
pure crystal
#

retouched

vital ermine
kindred shoal
#

Reminds me of bioshock infinite

vital ermine
#

It should as it was part of this genre style.

pure crystal
vital ermine
#

I never played it only the first two

kindred shoal
#

I played through most of the first one, though that was on the switch years ago

pure crystal
#

infinite is more of a pure action game than a bioshock. fun and cool though

kindred shoal
#

infinite was also on the switch

vital ermine
#

well, that is probably why I never grabbed it

#

I loved Bioshock

#

fell right into this style which I have loved for eons

wet nacelle
vital ermine
#

That looks like one of the BeeGees

pure crystal
#

BeeJeesus

lusty moss
lusty moss
crisp owl
vital ermine
vital ermine
south horizon
crisp owl
sharp robin
#

Have u guys used FreeU?

crisp owl
#

yeah

#

for a day

sharp robin
#

Yay or nay?

crisp owl
#

in my opinion, it can be useful as a tweaking tool.
As in, you find an image you really like the general composition of, then you can fiddle around with the FreeU settings to keep the general feel and get variations of that (keeping the same seed and such)

#

But, that's not generally how I do my images, so I dumped it

sharp robin
#

So it serves the purpose of noise variation.

#

I tried it but way to much work. And makes images so bright. I thought i was doing something wrong

peak dove
sharp robin
crisp owl
#

There are some nodes which open more options for it, cannot remember what pack had it anymore

vital ermine
#

freebasing is bad. Oh, freeu I never tried.

peak dove
#

As soon as I know I'll get back to you

sharp robin
#

All the info im getting is it sacrifices details for coherence.

vital ermine
sharp robin
# vital ermine

Wait. I remember u. I think. When i was trying to do 2.0 embeddings

vital ermine
#

yes

sharp robin
#

What a waste of everyone’s life was 2.0. Tho guess positive side is it showed us what bad was so we don’t repeat it.

vital ermine
#

Well, kinda sorta. 2.1 was its own kind of training nightmare.

sharp robin
#

2.x I should say.

vital ermine
#

Never did manage to pop out a training on it from Jan until I went to XL

sharp robin
#

All 2.x 🚮

vital ermine
#

well, elongated necks in 2.1 and in XL alien, snake, or blind eyes

#

not sure how that is a thing in base XL

vital ermine
#

wonky eyes is a thing in XL

sharp robin
#

Oh. I i moved on from base. But i do know what u mean.

#

I can deal with that tho.

vital ermine
#

I never move off of base as I train on base only.

#

I could live with elongated necks in 2.1 too

sharp robin
vital ermine
#

I haven't done an embedding in almost 10 months now

#

Kohya isn't the same as a1111 way

sharp robin
sharp robin
vital ermine
#

loras and full models though I don't release those publicly just the loras.

#

my loras are up on civit under this name

sharp robin
#

Ok let me stalk you real quick and peep the loras. 😃

vital ermine
sharp robin
#

The apocalyptic one looks sick.

#

Will have to try tomorrow

vital ermine
#

it and steampunk and goth are fun af

#

This is one I am working on now

sharp robin
#

Looks promising

#

Goth one didn’t register. I thought it was like a roman one.

vital ermine
#

goth person. Even though person it works for everything

#

combined it gets sick

sharp robin
#

W this lora

vital ermine
sharp robin
#

Looks nice. But may need more lora strength.

vital ermine
sharp robin
#

Ok now we are getting there

vital ermine
#

This lora is a bit under trained as it was just a test one

sharp robin
sharp robin
#

Needs to train more

vital ermine
#

Yes

#

I am trying to see what it can't do

sharp robin
#

Tag me when it becomes buff

#

Can’t do captain America yet

vital ermine
#

I need an image of him to train on

#

The image I need would come from the first movie

#

before he froze

sharp robin
#

That’s another SDXL issue that white outline on pictures of people/characters

vital ermine
#

yep

#

can get bad too

#

now I gotta find CA 1940

sharp robin
#

Big wonky eyes work well when doing anime

#

At least SDXL has that going for it

vital ermine
#

hehehe

sharp robin
vital ermine
#

I don't have the movie to grab a frame from

#

I need this version of CA

sharp robin
#

Would that help

vital ermine
#

No

#

his costume is concealed

nimble current
vital ermine
heady vale
stone fossil
peak dove
rigid remnant
#

How to use stable diffusion

nimble current
#

wtf is this white sickness :X

peak dove
#

SD is sadly all too white! 😦

glad grove
#

is this one black enough for u

peak dove
#

"I can't see their eyebrows!!!" 😄

nimble current
#

still white halo lol

#

she has white halo like lighting coming from behind yet there is none

glad grove
nimble current
vital ermine
stone fossil
sweet bane
errant night
sweet bane
floral island
carmine gale
#

Anybody have a good auto cropping tool that sees subject to crop tighter in, while also using a given aspect ratio? With computer vision?

vale eagle
pure crystal
dapper dragon
pure crystal
#

I hope you aren't wearing suede boots in that gross water

dapper dragon
peak dove
#

Today we explore how to use the latent consistency LoRA in your workflow. This fantastic method can shorten your preliminary model inference to as little as 0.7 seconds and in only 4 steps using ComfyUI and SDXL. This will also make it a lot easier to run these models on older hardware and is just mind-blowing fast! Now, it isn't perfect, but...

▶ Play video
#

From Scott Dettweiler

dapper dragon
rustic garnet
fierce hollow
#

does anybody know of any optimized cli scripts for running sdxl with diffusers? the vae uses so much vram even with tiling/slicing/offloading enabled compared to comfy I'm losing my mind

#

idk if I'm missing something

nimble heart
#

batch size > 1 seems to murder it

#

with BS1 it's pretty close

#

you could also try changing the vae sample count, I believe that affects vram

fierce hollow
#

I'm doing batch 1 but seeing like 8gb spikes

#

haven't tried sample counts, thanks

nimble heart
#

else the unet won't get offloaded

fierce hollow
#

👍 guessing sample count is supposed to be vae.sample_size

nimble heart
#

and if you enable offloading don't manually call pipe.to("cuda")

nimble heart
fierce hollow
#

I saw that but pretty sure some of the optimization stuff is outdated

nimble heart
#

principles are the same

#

enable sequential offload, don't call to()

#

use fp16

clever verge
#

I'm pretty sure I've seen another article about it recently. Can't find it though.

nimble heart
#

that's about it

fierce hollow
#

yep thanks, will try in a second

fierce hollow
#

oh that seems promising

#

just had a thought offloading the text encoders might be useful too, idk if that happens automatically

nimble heart
nimble heart
#

when I use pixart it peaks at 12GB during text encoding and only 4.5 GB during sampling

#

I assume sequential offload also does too

fierce hollow
#

I have the gen at 4.0gb now but the vae still hurts, going to try the sample thing

nimble heart
#

you have Nvidia right

fierce hollow
#

yeah, 3060

nimble heart
#

hm

#

if you really wanted you could run each part of the pipe manually. like text_encoder.encode(), unet.sample() and shuffle the models around in vram as you like

#

instead of just calling the pipe which runs them all at once

#

if it's just for SDXL should be easy to do

#

could even call torch cache clears in between

fierce hollow
#

that's an option I guess, yeah

nimble heart
#

so you can see exactly what's using vram

#

but the vae specifically you can run alone by setting the pipeline output to latent

#

since that's the main issue

#

ohhhh

#

one thing

#

in SDXL diffusers forced the VAE to fp32 by default

#

where comfy I think uses bf16

fierce hollow
#

oh I think that might be the issue

nimble heart
#

yea. there's an issue on diffusers git to do that automatically but it hasn't been touched in a while

#

so I'd return latent from the pipeline then manually cast the latent and vae to bf16 and decode

#

so like 3 extra lines of code ig

stone fossil
fierce hollow
#

I tried torch.inference_mode and it got the total vram usage to around 7.5gb, still a bit too much

#

that's with decoding the latent separately too sadcat

nimble heart
#

bfloat16 and all?

fierce hollow
#

yeah, afaics it's already all correct dtypes, I wanted to try sequential offload but getting some error about it not being implemented

#

maybe because I'm using ssd-1b

fierce hollow
#

oh, I had custom_pipeline="lpw_stable_diffusion_xl" for longer prompts, sequential offload works without it

nimble heart
#

use Compel for longer prompts

#

it has weights and stuff too

fierce hollow
#

the only issue is that's twice as slow now, but the max vram usage is 2.3gb so that's certainly something

#

I think I tried compel but this one seemed easier, will take a look at it again

nimble heart
#

just copy paste their XL example and it seems to work

#

if your script runs on multiple models just check for the tokenizer and text encoder attribute on the pipe first

pure crystal
bright valley
native knot
lusty moss
native knot
crisp owl
#

Damn, just realizing the fp16 vae has the weird rainbow effect issue.

hoary saddle
#

playin in blender

open wagon
#

Hello guys, I'm going to do a full finetuning for SDXL to learn the aesthetic. I'm wondering is there a need to finetune the text coders of SDXL as well? Or I just need to finetune the UNet? Some people told me they always disable the training of text encoder of SDXL when doing a full finetune. Many thanks

vital ermine
vital ermine
high skiff
#

I like this a lot

peak dove
# crisp owl yeah

I want to introduce a brand new node that was just added by Comfy to his stable diffusion system this morning, it's called FreeU. The concept here is you are able to change some of the underlying contribution mechanisms of the u-net, and this is the core of stable diffusion. The results tend to be much better, and it doesn't slow us down or co...

▶ Play video
#

Adds extra detail in the high frequency range

vital ermine
vital ermine
zinc cargo
#

how can i create a loop that will take a generated image and put it in a load image node?

zinc cargo
#

it's for img2img style workflow..

#

so, i want the image to be loaded automaticly

vital ermine
#

oh, you want that node to go through images in a folder?

#

no such thing exists that I have seen

zinc cargo
#

nope

#

i want something that sends the outcome of a workflow to the begining.
i remember there was something like that with outpaint, but perhaps reading file names is the best option

vital ermine
west breach
peak dove
vital ermine
vital ermine
stone fossil
icy brook
#

Aether Pixel - making stuff fall apart into pixels.

peak dove
#

Bloody Nora!!!

lusty wolf
#

Not intended but not bad...

wet nacelle
kindred shoal
#

What's the min VRAM for sdxl to work? My buddy has 8gb, but a1111 crashed while trying to load the model in.

noble shoal
molten gull
#

i got trouble with comfyui + controlnet, anyone here who could maybe help me a bit?

icy brook
#

made with Juggernaut XL by @kandooai + Aether Pixel by me

steady grove
#

it's fake. i can tell because of the pixels

icy brook
#

💯

steady grove
icy brook
lusty wolf
#

This is fake... but a good one.

icy brook
steady grove
#

I'm very serious, and don't call me shirly

noble shoal
steady grove
#

I just want you to know, we're all counting on you

zealous tundra
#

hi guys ,
any1 got any idea if i want to create a design for a room but for the windows i only want to show a blue sky or something , any ideas how to controll the windows ?

icy brook
half cedar
kindred shoal
#

Bruh

noble shoal
#

Get ready for getting banned like never before.

kindred shoal
#

@uncut steeple

noble shoal
half cedar
#

Hahaha ^

noble shoal
half cedar
#

Close-up vertical industrial fan, lumpy mud turds, extreme hyper-realism, painterly shading, exaggerated mud splashes, rich colors, dramatic contrast lighting, texture highlight, 1986 novelty card illustration style, conspicuous shading

steady grove
molten gull
#

any of you got experience with comfyui + controlnet ?

steady grove
#

using other controlnet models with harrlogos or just controlnet on their own, doesn't get as good text effects

#

i think the new qrcode monster was trained with some text knowledges too so that might be where its coming in handy

#

was the source i used. put some 3d effect on it for cheese but it didn't do much

vale eagle
#

Do you have the link for the weight?

steady grove
vale eagle
#

Wasn't it a lora? or it is a control lora?

steady grove
#

ohhh you mean the monster qrcode thing

vale eagle
#

yeah

steady grove
#

looks like a ffull on controlnet. i didn't really consider that

vale eagle
#

thank you.

kindred shoal
#

Does anyone know where to find sdxl openpose and lineart controlnet models?

#

I found 4 but they didn't work at all

kindred shoal
#

I tried 4 from those and they don't work.

#

t2i makes everything green and ignores the controlnet, and kohya doesn't listen and ignores it entirely.

vital ermine
kindred shoal
#

..thanks.

steady grove
kindred shoal
#

im not using preprocessors, im just using the model. i have the outline/openpose thing of what i want but it's not listening

steady grove
#

before you rush to declare the models not working, maybe consider there is a layer 8 issue

#

for the open pose, i used sai's openpose control lora, and for the lineart i used the t2i-adapter for lineart

#

in my experience though, most models work to some extent.

kindred shoal
#

i dunno what layer 8 is.

#

i'm not that techy with sdxl.

steady grove
#

haha it's an old techy term. the roots are in the 7 layer OSI model which define a technical stack. good to know for troubleshooting. layer 8 is that layer where the end user exists. it was just made up by pricks like me to be like "maybe it's layer 8?"

kindred shoal
#

..oh.

steady grove
steady grove
#

ignore my weights i chahnge them all the time

#

yeah doing it without preprocessors works for me too

kindred shoal
#

well looks like sai's openpose is too heavy, just crashed out.

steady grove
#

thats actually the light weight one

kindred shoal
#

the size sure says otherwise, lol

#

my buddy's computer threw a fit over it so...

steady grove
#

wait, where idid i i get my open pose from? says sai in teh filename but they never released one

cyan crown
kindred shoal
#

oh youre right

steady grove
kindred shoal
#

wtf do i have then

cyan crown
#

can u see ?

steady grove
#

ohhh the file name doesn't say sai, it says controllora, which my brain just added sai to

kindred shoal
#

oh i have..thibaud.

#

thibaud openpose

steady grove
#

yeah it's only 750mb compared to other controlnets that's light

#

they've got the actual controlnet for openpose on that and its 5gb

kindred shoal
#

then why do some of them say 42mb on the collection you sent me?

steady grove
#

OH YEAH those are the kohyhya control lite ones. i haven't messed with them yet

kindred shoal
#

they don't work very well, they ignore everything.

steady grove
#

the extension support them?

kindred shoal
#

It reads them so i think so?

#

it shows up in the dropdown when you click on the box to select a thing

steady grove
#

it'll show any safetensor in that folder

kindred shoal
#

Oh

#

well

#

shit

#

they might not be supported then.

steady grove
#

always go to man pages

#

okay gonna try this

kindred shoal
#

if it works for you i'm gonna get a little upset lmao

steady grove
#

he's not shrugging but its' following it. i got a few of these

#

not bad for 45mb and its trained on anime too

#

works fine with shrek. vader just doesn't shrug it would be too powerful i guess

kindred shoal
#

what weights are you using?

steady grove
#

1

kindred shoal
#

hm..

#

i'll give it a shot later.

#

ty for the help and input!

steady grove
#

look at your logs when it doesn't work. might give you some indication that cnet failed

stone fossil
cyan crown
half cedar
cyan crown
mellow tendon
#

By combining the LCM Lora and TensorRT extension I made 320 images @ 512x512 in just over 1 min.

vital ermine
vital ermine
zinc totem
#

any suggestions on how what prompts would be good at making realistic, not deformed eyes?

zinc totem
#

thanks! 🙂

steady grove
vital ermine
zinc totem
whole oxide
#

Each time I select sdxl_vae it switches back automatic

vital ermine
whole oxide
#

newbie here;)

bright valley
vital ermine
slender coral
#

Anyone know why some images have issues with upscaling tiles? And if there's a solution?

glad grove
#

depends on the upscaler,the denoising level but its a common problem when using tiles

#

only way to get rid of them is by givin the finished img a pass on img2img with low denoise like 0.2

slender coral
uncut gull
#

Generated in 4 steps. (LCM lora + Euler A sampler in A1111.)

#

The lora works for any SDXL model (I've tried a few, and all worked perfectly). You can get much higher speeds and better quality in ComfyUI, since it supports the LCM sampler and handles image memory buffers better, but it still works in A1111 with Euler A.
Less than 2 seconds per image.

pure crystal
vital ermine
lusty moss
ripe bramble
#

is there an article out there about how the prompt is converted into.... I don't even know what comes after the text input!? like how it works step by step

lusty moss
#

Which interface are you using? You mean like how it gets turned into the image or a pipeline/workflow for say ComfyUI?

ripe bramble
#

code, turned into whatever the gpu can understand

#

like "a photo of", how does that help if everything is weighted on keywords?

lusty moss
#

I'm sure there are but I am unaware of all of that. Believe it gets tokenized then read by the GPU but unsure really.

#

I think the model is what dictates the "words" interpretation by tags applied during training.

ripe bramble
#

I see people using a very long "story" as a prompt input, resulting in a mess

#

I believe there is no LLM so everything must be weighted keywords

lusty moss
#

I haven't had much success with the long stories.. this is the prompt for the above image.

(Robot Biomechanical Lizard:1.2), (Gold gears:1.05), (Red Wires:1.2), (Steam powered:1.4), Medium Close-Up, Majestic Lighting, (CGI, Pixar:1.2), (8K, Realistic, Hyperrealism, Depth of Field, 85mm lens, F/4.6:1.15), (ultra detailed, ultra accurate detailed:1.1), (Bokeh, Bokeh Lighting:1.1), (surrealism:1.05), (Victorian:1.1)

wicked frigate
ripe bramble
wicked frigate
#

this is the worst prompt conversion ever, i need to slap my LLM, but a robotic biomechanical lizard, with gold gears and red wires visible on its body. The creature is steam-powered and has a majestic lighting effect around it, giving off an otherworldly vibe. It's rendered in CGI with Pixar-like quality and is shot using a 8K camera, capturing every detail of the creature. The image uses an ultra detailed approach with hyperrealism to create depth and realistic lighting effects using a 85mm lens at F/4.6, creating an immersive experience for the viewer. The image also features bokeh lighting and surrealism to add a dreamlike quality, while still maintaining its realistic nature. Lastly, the creature is set in a Victorian-style environment with intricate details and rich colors to complete this stunning image.
This giant story mess still produces very clean results, albeit it's a wall of nonsense text so not quite there

ripe bramble
#

good stuff thanks! I'll try to dig into reading at the code

wicked frigate
#

hand-rewriting it to a more sentence format: a medium closeup of a victorian steampunk robot biomechanical lizard with gold gears and red wires, Majestic Lighting, (CGI, Pixar:1.2), (8K, Realistic, Hyperrealism, Depth of Field, 85mm lens, F/4.6:1.15), (ultra detailed, ultra accurate detailed:1.1), (Bokeh, Bokeh Lighting:1.1), (surrealism:1.05) works great

lusty moss
#

Good info, you use a LLM to convert it from what i pasted?

wicked frigate
#

Generally you want text that vaguely resembles image descriptions on the internet, ie a short sentence describing it and a few keyword meta-tags on the end. There's a lot of photography on the internet in the format a picture of [object] at [place] during [thing], [camera type], [location], [date] more or less

wicked frigate
ripe bramble
#

oh so even the camera type is trained?

wicked frigate
#

(StableBeluga2-13B with a simple preprompt of a few prompt cleanups)

wicked frigate
ripe bramble
#

🤯

wicked frigate
#

in the prompt that was used above 85mm lens, F/4.6 is camera details

#

the model absolutely understands those (to a degree)

#

honestly even if you give incorrect or dumb camera details it still "helps quality" to a degree cause the subset of images on the internet that have camera details are statistically more likely to be very high quality professional photography

wicked frigate
ripe bramble
#

so let's say "a sun in the top left corner" is possible? like where to place object relative to the frame

west breach
#

I think the model understands general terms like 'slow shutterspeed' or 'high iso'

wicked frigate
#

directions, SD tends to be a lil iffy on

#

can sometimes get it but not always

west breach
#

I have seen the model place things if you mention left side or right side, or on one side, etc, but it doesn't put them where you want, so kind of like a child that doesn't know their left from right 😄

lusty moss
#

@wicked frigate TY for the info for sure!

pure crystal
#

take story prompts like that and paste them after "print a terse version of this prompt, keeping as much of the detail as possible but properly formatted for stable diffusion " e.g. Robotic biomechanical lizard with visible gold gears, red wires, steam-powered, majestic lighting, otherworldly vibe. CGI, Pixar-like quality, 8K camera, ultra-detailed, hyperrealistic, 85mm lens at F/4.6, immersive, bokeh lighting, surreal yet realistic. Set in a Victorian-style environment, intricate, rich colors

west breach
#

but can still be useful if you say 'on left side fire and the right side ice', even if it gets the sides wrong it still helps

pure crystal
lusty moss
pure crystal
wicked frigate
#

oh also! other prompting tip:

#

when grabbing prompts from other people

#

google image search the words they use

#

for example, go google image search Hyperrealism and ask yourself whether those images look like something you actually want here

#

(answer in this case: probably no)

#

"Hyperrealism" is a specific art style, and not just a vague "more realistic plis" request

pure crystal
#

my favorite magic word for quality in 1.5 was absurdres (for absurd resolution). supposedly this was how images from the early AI upscaler output (posted to anime forums) were indexed with. it really works, it ends up being a whole look

lusty moss
#

Anywhere that has good cliff notes on things like this?

pure crystal
whole oxide
vital ermine
lusty moss
whole oxide
#

Yess

lusty moss
#

If you type something like Dog, put your cursor in the text, say between D and o and press CTRL+UP/Down Arrow. It will "weight" the word.

vital ermine
pure crystal
vital ermine
pure crystal
sharp vale
vale eagle
trail shard
#

hand-rewriting it to a more sentence format: a medium closeup of a victorian steampunk robot biomechanical three - headed dragon with gold gears and red wires, Majestic Lighting, (CGI, Pixar:1.2), (8K, Realistic, Hyperrealism, Depth of Field, 85mm lens, F/4.6:1.15), (ultra detailed, ultra accurate detailed:1.1), (Bokeh, Bokeh Lighting:1.1), (surrealism:1.05) works great

wicked frigate
sharp robin
bright valley
peak dove
#

I'm stuck, how do I implement this stuff for CLIP-ViT-B-32-laion2B-s34B-b79K in Clip Vision (G)?

vale eagle
wicked frigate
#

that's an entirely different clip model

vital ermine
bronze creek
vital ermine
bronze creek
#

b company here

#

i love this one

peak dove
wicked frigate
#

what

vital ermine
peak dove
# wicked frigate what

I have d/loaded pytorch.model.bin and open_clip_pytorch_model.bin - but don't have a clue where to put them

vital ermine
vital ermine
wicked frigate
#

these are models that have nothing to do with anything SD related

peak dove
#

Its an IPAdapter setup by NerdyRodent

wicked frigate
#

IPAdapter does not use Vit-B

peak dove
#

Or even Scott Detweiler

wicked frigate
#

they have a ViT-H and a ViT-bigG

#

in either case, standard ip-adapter software autodownloads the clip model for you, you don't do it manually

#

(or where they require you download models, you download the IP-Adapter model, not the source clip)

peak dove
#

I know, and I am finding it difficult to locate and d/load CLIP-ViT-B-32-laion2B-s34B-b79K

wicked frigate
#

there is no vit-b to download that is not a relevant model

peak dove
#

OK, but Scott Detweiler uses it ...

wicked frigate
#

Scott Detweiler does not use any vit-b model

#

which notably are the standard (g) and the alternate (vit-h) models, absolutely no vit-b in sight

#

he also in that video for some reason separately loads a vit-h clipvision

peak dove
#

I know, and that ViT file I cannot find

wicked frigate
#

you don't need that and shouldn't use that idk why he does

#

either way it's H not B

#

(you should be using the bigG models not the H ones)

#

(if you use StableSwarmUI the bigG clipvision literally downloads itself for you)

peak dove
#

Yeah, he uses this vit h - so he says the laion version also needs to be vit-h

wicked frigate
#

you should just use the standard (non-h) version

peak dove
#

It worked pretty well. I had a brain-fart - I Symlinked my Models folder - forgetting that Symlinking at first empties your folder!!!

#

So I'm having to re download it all

vital ermine
peak dove
#

I'll maybe try StableSwarmUI

peak dove
vital ermine
peak dove
peak dove
#

OK, the zip file worked whereas the git clone version would not

knotty trellis
#

Has anyone tried training LoRA's for LCM?

wicked frigate
#

there's a oneclick installer

wicked frigate
vale eagle
peak dove
#

10 Steps, cfg 5

#

I want to change up to SDXL ...

#

Running an 8Gb VRAM RTX 2070

vale eagle
#

Just tried it. Amazing

peak dove
#

LCM LoRA - each image at 512x512 took 2.5 seconds/10 steps/cfg 5

#

I gotta admit, I generated 1500 pictures yesterday ... ! I randomized the Style Selector - some good - some weird!!! 🙂

glad grove
#

this is the best one

peak dove
#

I need to ramp it up to SDXL - I've seen reports that u can get 5 seconds/picture?

glad grove
#

that would be great havent tested it with sdxl so share results when u do🙏

peak dove
#

Each picture 2.5 seconds each!

#

I must admit, this LCM LoRA is Breast-Centred - almost 75% of the output is NSFW

#

🤩

west breach
#

SDXL with LCM lora, 10 steps

zinc cargo
knotty trellis
peak dove
upbeat summit
#

let me know if you any questions

noble shoal
#

2432x1664 in 21 Seconds. With a 4 Step LCM Upscale / Highres fix. (RTX 4070). Total sampling time: 7 Seconds. Total Steps: 10 (Base + Upscale).

peak dove
upbeat summit
peak dove
#

It's OK - I had sd_xl_base_1.0 and it was looking for sd_xl_1.0

#

I got it to work - 5 steps, cfg 1.8 - good quality SDXL in 7 seconds

#

Down from about 24

#

This is waaaay cool!

#

Upscale times are also reduced I see?

upbeat summit
#

LCM's performance is great, but the image quality loss is rough. I think it's great for real time applications, exploring animation and experiments. it's very impressive, just not something you would currently use to push image fidelity.

crisp owl
#

Yeah I was hoping quality was the same, but I can't use it for final images.
But can be used to check quickly if a prompt generally works

peak dove
#

The eyes are the only casualties of the speed ... much worse than "slow-SDXL"

#

Some cool examples, Melting Candles and Shaun Tan style LoRAs with Dynavision model

#

Kewl

ionic dragon
#

this is fr?

upbeat summit
peak dove
upbeat summit
peak dove
#

Even kewler

upbeat summit
#

too bad MP4s do not loop in discord. here's a low fps gif

pure crystal
peak dove
lilac wren
#

Guys, why inpainting is so slow with SDXL? I use Stable.Art and AutoPhotoshop-SD, same problem on both, inpainting is so slow, while with a 1.5 model, its really fast.
I can generate easyli txt2img with SDXL, but inpainting is slow as hell

pure crystal
pure crystal
boreal bough
#

(not because each image is fast, but you can batch generate 4 images in one go - takes like 10~25 seconds depending on your settings and step count)

kindred shoal
pure crystal
#

how did he get in there

boreal bough
#

apparently enough people had that question that an auto-redirect was set up

lilac wren
# boreal bough because sdxl is bigger than 1.5 if you have a 3090 or 4090 its fast

Of course, I know all about that.
~~But the areas I select are not large.

With Lasso/selection, I take a very small area (in pixels), a value that is much smaller than what I generate in txt2img.

It takes me 52 sec to generate a 1024x1024 txt2img image.
For inpainting, I don't get away with less than 15 minutes.~~

Don't listen to what I said. It's beyond comprehension. Before putting --no-half in arg, using inpainting with SDXL on Stable.Art caused an error and didn't work. With Auto-Photoshop-SD, it was extremely long and buggy.
I've just removed --no-half, and it works much faster than before...

rustic garnet
#

the area you select might be not important at all

#

depends on the chosen tool, but most tools will do an img2img on the complete image

boreal bough
#

oh yeah :/ no half go nom nom on your vram

rustic garnet
#

even if you inpaint a tiny region it will denoise the complete image

lilac wren
rustic garnet
#

as I said, it depends on the tool

lilac wren
#

Yeah, i'm talking about Inpaint ones

rustic garnet
#

some tools will do inpainting on a subimage

#

but even then the subimage will be "native" size

#

which is 512x512 in SD 1.5 but 1024x1024 in SDXL

#

(if the tool is good. Many tools are stupid and run SDXL with 512x512 which will heavily impact quality)

boreal bough
#

if he ran out of vram though, it may have defaulted to cpu ram x_x so... that's like 2 min to generate an image

rustic garnet
#

yes, it's probably a vram issue. I just say that inpainting in SDXL will always be slower, just because it will run on larger subimages

lilac wren
rustic garnet
#

I don't know any of these tools, so I cannot help 🤷‍♂️

boreal bough
#

still a single creator behind Auto-Photoshop-SD, who's probably doing it as a side project

#

photoshop generative fill is kinda unbeatable for me atm XD

#

so not like I'm in the market for plugins

lilac wren
rustic garnet
#

I don't use Photoshop xD

#

it's not working on linux 🤷‍♂️

boreal bough
# lilac wren You're missing the best part. Firefly can go back to bed with all the crazy mode...

depends on purpose. for real life applications, such as photo retouching, fixing areas in paintings, removing unwanted elements in artwork, the generative fill based on firefly is near instant has not failed me once yet. Its only when i try to misuse it as a stablediffusion alternative that it falls flat on its head - but that's also cause that wasn't the intention. The intention is photographers or people at corporations who need to fix small areas from renders, mockups, blueprints, etc...

#

if you want a stable diffusion alternative to compare at the enterprise level, then that's Dall-e 3

#

offtopic xD I made an old comic style lora

#

damn thing took 900 manually tagged images to complete

#

poor eyes & hands took forever to get right

lilac wren
boreal bough
#

advertising in multiple channels all at the same time

rustic garnet
fierce hollow
#

you selected the mouth but regenerated the whole face somehow 😅

#

guess it's not like content aware fill where it only needs the surrounding area

lilac wren
#

It's several layers and different generations, I simply deactivate all the layers to show the before/after.
Alt+click on a layer to show only it, in this case I do Alt+click on the base image, without modifications.

fierce hollow
#

well, be that as it may it's nice to see a photoshop plugin that doesn't connect to some saas

hoary saddle
hoary saddle
#

just says: qrcode sdxl 1c44f547

#

will try that one

crisp owl
#

I haven't looked at all into it, but I know qrmonster was working on their sdxl, so I just quickly checked that HF page and saw that it was updated 2 days ago, so I'm just assuming it's somewhere there

stone fossil
vital ermine
lilac wren
#

Bulma san

vital ermine
#

steampunk+decopunk

sweet bane
high skiff
#

@noble shoal Do you mind if I DM you a question about one of your LoRA's?

lusty moss
#

Does anyone know where I can find the ModelSamplingDiscrete node for ComfyUI? Been searching and can't find it.

upbeat summit
lusty moss
upbeat summit
carmine gale
#

Hey @lusty moss. do you have to be on dev channel of comfyui?

lusty moss
carmine gale
lusty moss
#

Default channel

upbeat summit
ivory blaze
#

Ironman is sad, his lasagna is ruined,

#

his son is a little.. slow. its okay , he works every day to teach him to walk. His mother drank a lot.

carmine gale
#

i have a symbolink for my output folder. wonder if this could affect something

upbeat summit
#

normally it should work, but maybe the update fails because of it

carmine gale
#

sounds like it

#

cmd git failed

#

that's what it updates with

#

ok

#

i ll kill the symlink

upbeat summit
carmine gale
#

great news though

#

it updated

#

got the node @lusty moss

#

need to try --output-directory

lusty moss
#

@upbeat summit Thank you!!

carmine gale
#

do you put it here? run_nvidia_gpu.bat

#

the --output-directory

upbeat summit
carmine gale
#

yea thanks both of you

lusty moss
carmine gale
carmine gale
upbeat summit
# carmine gale

not sure how whitespace characters are handled but it should work

carmine gale
#

annoyingly, my dropbox is on e: and comfy on d:

upbeat summit
#

that I don't know. I haven't used the option myself in a while

#

but with some image save nodes (WAS for example) you can specify a complete path in the node widget with a drive:\ and I know it worked - at least a couple of weeks ago.

upbeat summit
#

you could also try creating a symlink inside your ComfyUI\output folder. ComfyUI\output\dropbox and save everything there. all save image nodes should support subdirectories

mystic cipher
#

Any recommended checkpoints/LORAs or methods to get good line art? Tried with Corel Draw 2023's Auto-trace

nimble heart
#

high resolution found footage of a demon in a dark forest at midnight

#

high resolution dslr photograph of a white devil with glowing red eyes in a blizzard

vital ermine
#

I am just so sexy it hurts.

pure crystal
#

I dare anyone to run adetailer on this

crisp owl
#

I did one before that had like 60+ faces or something it fixed 😂

nimble heart
#

medieval german harley quinn wearing intricately detailed gothic plate armor, 4k UHD photograph fujifilm XT3

noble shoal
nimble heart
#

if this lil critter proposed would you say yes?

vale eagle
crisp owl
#

lol, inspiration from above 😆

nimble heart
#

your god no longer exists here

#

zombie miku

sharp robin
vale eagle
rustic shadow
nimble heart
#

when zero snr hits it sure hits

rustic shadow
#

it's kicking in

vale eagle
heady vale
vale eagle
vale eagle
molten gull
#

i got another controlnet/comfyui question: how can i change a person, but keep the face the same? like a closeup photography and give the person a hat?

and a second question: is it possible by now to do textual-embedding with sdxl and a1111 ? (i know that people use LORA, but i do different stuff and that's not working with LORA)

fierce hollow
#

uhhh wait there's probably no inpainting controlnet for sdxl //nvm, there seems to be one looking at hf

molten gull
#

can you give me a link of where to get what exactly maybe? @fierce hollow

fierce hollow
heady vale
#

there are some TE's for XL on civitai. not sure how they were made though

molten gull
#

how does that inpainting.safetensors would work in comfyui ? wouldnt you need some new nodes or something?

#

and i want to do my own TE's @heady vale , and with A1111, not comfy

fierce hollow
#

does anybody know of any good freeU settings for ssd-1b/finetunes? both the sd and sdxl defaults seem to bake the images way too much

ivory blaze
#

Mila Kunis, at an italian restaurant, eating lasagna and ziti

ivory blaze
# ivory blaze

she should have washed her hair before she went to eat though.

vale eagle
vale eagle
dry crypt
#

Hey Guys,

I am working on a project for creating different cartoon characters for children stories. There are total 3 characters that are consistent in every story, therefore I have fine tuned a dreambooth sdxl model using autotrain on each character. The results for each character (solo) is great, but I have to make a sdxl model that will generate images for all three characters. Can I do this by combining 3 dreambooth models, if yes, then how? Or do I have to create one model and train it on all three characters collectively.

I would really appreciate any help regarding this, thanks.

rustic garnet
rustic garnet
dry crypt
rustic garnet
#

just write your own captions, don't use the auto-generated captions of your tool

#

in kohya_ss for example you can add for each training image file a text file ending on .txt with same name where you write your caption. You have to set --caption_extension=".txt" and it will use your captions instead of the autogenerated files

crisp owl
lusty wolf
#

They call me Elton

stone fossil
lusty wolf
peak dove
#

TheLastBen Papercut SDXL LoRA

#

I recommend FreeU to add detail to low resolution pictures; works made with a low number of steps. It increases the high frequency detail. I am trying it alongside the astonishingly rapid LCM LoRA ...

peak dove
upbeat summit
steady grove
upbeat summit
steady grove
#

that'll be what gets AI banned from all videogames. When Ed boon gets his team to do AI fatalities

#

Jack Thompson will show up again

#

it'll be a mess

upbeat summit
#

hehe

steady grove
upbeat summit
upbeat summit
lusty moss
#

The ultimate wildcard is to type this into Chat GTP (or maybe other applicable LLMS..)
Create a two sentence story about a random subject and setting.