#✨|sdxl


hardy cipher
#

I've considered that myself actually

#

but not sure how it'd work within the graphical interface

ionic gulch
#

i have a variation of those in my pack:
difference is that off is left and on is right, which is more in line with many other on/off switches. comfyroll nodes do some things i don't like. for example the switches have a=1 and b=0 which always confused me.

hardy cipher
#

I just stuff 10 loras and switches into one node

#

I don't think I've ever used 10 loras in one workflow

wide crane
#

max ive ever used at once is 3

#

after 3 i usually get conflicts

ionic gulch
#

if i need more space in my menu section i would do that too. for now i like the flexibility of 4 separate nodes, even if they overall take a little bit more space. for example i can spread them horizontally.

wide crane
#

it gets less and less organized

#

but for now

#

its still good

hardy cipher
ionic gulch
#

pipe-like nodes are nice for menus. as they only have one "pipe" output and take less space in the menu section. other than that i don't see much use and using too many pipe-type nodes was part of the performance problem i had a week ago.
example - menu node + output node in a different part of the workflow:

#

with normal outputs the menu part would be twice as high - wasting menu space and looking bad

hardy cipher
#

I'd really like it if they offered the pipe option but didn't require its usage

ionic gulch
#

would require complicated "real-time" code, to hide the outputs you don't use. that is a thing that can also cause the lost wires problem i had. had to replace all the impact pack xin1 switches for that reason. made simple 5 in 1 switches for almost all types for that reason.

hardy cipher
#

well image loaders have mask outputs they don't usually use and they don't appear to be very complicated

thorny frost
#

hi! having a lot of ai generated png images it would be useful to be able to convert them to jpg to save space. Problem is losing the prompt. Is there some util to batch extract prompts from png images?

hardy cipher
#

gotta be, if not it wouldn't be that hard

uncut fiber
#

@thorny frost there is at least Save Prompt To File, which can save a prompt.

hardy cipher
#

extremely cursed

uncut fiber
#

yes not sure it can work with already generated images, but there is chance it somehow can.

thorny frost
floral island
#

uhhh...

hardy cipher
#

well it's all in the metadata

uncut fiber
#

i read it already 🙂 Converting images to jpg takes xnview only a few minutes. You can keep metadata as well, but i think the prompt is not among the fields it can keep.

uncut fiber
floral island
#

@uncut fiber well,tried absurdism, it's totally awesome 😮

shy kelp
#

formulaxl workflow grrr

uncut fiber
#

i love Gertrude Abercrombie's work, they can probably work together.

floral island
#

art by <random artist>, absurdism, cute

floral island
uncut fiber
#

🙂 we all have different tastes 🙂 don't be disappointed!

floral island
#

it for sure is something!

uncut fiber
#

But you can notice, it is always subscribed

floral island
supple knot
floral island
#

horror warning

zinc cargo
floral island
hardy cipher
floral island
uncut fiber
#

i got similar 🙂

#

not much, memory enhance past...

floral island
hardy cipher
uncut fiber
#

he looks like beholder is next in pot 🙂

floral island
uncut fiber
#

congrats!

hardy cipher
#

did you have gpt write that?

thorny frost
#

is there some way to paste code here?

#

so it reads nicely

hardy cipher
#

because if so, and you don't go over it, you're going to have a very bad time

#

in a block you mean?

like this?
thorny frost
#

yep

hardy cipher
#

the little hash thing to the left of 1 on the keyboard `

hardy cipher
#

repeat 3 times above and below

floral island
#

fat zombie hamster... very underwhelming

thorny frost
#
```python
import os

from PIL import Image

# Directory containing the PNG files
input_dir = "D:/SD-outputs_storage/convertojpg/aa"

# Iterate over every file in the directory
for filename in os.listdir(input_dir):
    if filename.endswith(".png"):
        file_path = os.path.join(input_dir, filename)
        try:
            # Open the image file; the context manager closes it afterwards
            with Image.open(file_path) as image:
                # Get the image metadata (for PNGs this includes the text
                # chunks where the generation parameters are stored)
                metadata = dict(image.info)

            # Create a text file with the same name as the image file
            txt_filename = os.path.splitext(filename)[0] + ".txt"
            txt_path = os.path.join(input_dir, txt_filename)

            # Write the metadata to the text file
            with open(txt_path, "w", encoding="utf-8") as txt_file:
                txt_file.write(f"File: {filename}\n\n")
                txt_file.write("Metadata:\n")
                for key, value in metadata.items():
                    txt_file.write(f"{key}: {value}\n")
                txt_file.write("---\n")

        except Exception as e:
            print(f"Error processing file: {filename}")
            print(e)
```
#

change input_dir to the folder where your png images are, obviously
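For anyone who would rather not install Pillow, here is a rough standard-library sketch of the same idea. It assumes the prompt sits in the PNG's `tEXt` or `iTXt` text chunks, which is where the metadata mentioned above usually ends up. `read_png_text_chunks` is a made-up helper name, and the sketch skips CRC checks and `zTXt` chunks.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return the key/value pairs stored in tEXt and iTXt chunks of a PNG.

    Hypothetical helper: no CRC validation, no zTXt support.
    """
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk is: 4-byte length, 4-byte type, body, 4-byte CRC
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        pos += 12 + length
        if ctype == b"tEXt":
            key, _, value = body.partition(b"\x00")
            texts[key.decode("latin-1")] = value.decode("latin-1")
        elif ctype == b"iTXt":
            key, _, rest = body.partition(b"\x00")
            compressed = rest[0] == 1
            # Skip compression flag and method, then language tag
            # and translated keyword, both NUL-terminated
            _lang, _, rest = rest[2:].partition(b"\x00")
            _translated, _, value = rest.partition(b"\x00")
            if compressed:
                value = zlib.decompress(value)
            texts[key.decode("latin-1")] = value.decode("utf-8")
        elif ctype == b"IEND":
            break
    return texts


# Usage (path is a placeholder):
# with open("some_image.png", "rb") as f:
#     print(read_png_text_chunks(f.read()))
```

Run this before converting to jpg and write the result next to the image, same as the script above does.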

hardy cipher
#

all the LLMs are psychopaths. I wanted gpt-4 to give me advice on some code and help me figure something out and it kept messing up my code

#

they'll do little subtle things you might not even realize until way later

thorny frost
#

you need to explain it very clearly

hardy cipher
#

well it kept messing up the math

#

I said

#

do not touch the math do not touch the math

uncut fiber
#

Yes, 59% of the code it writes is wrong according to some recent tests, but for ppl who know the language properly it can be very helpful

hardy cipher
#

over and over

#

and it repeatedly broke the math

thorny frost
#

it seldom does it on the first attempt

hardy cipher
#

it would apologize, say it understood now, and do it again

uncut fiber
#

Probably somebody already did it before?

hardy cipher
#

no no, they will never get it right because their entire existence is a calculated hallucination. they have no idea when they're lying or messing something up

#

I'm not saying that's a bad thing per se

#

just that I understand its limitations, at least for me

thorny frost
# uncut fiber Probably somebody already did it before?

in this case i used a code from reddit from a guy that also used chatgpt to write a code to extract all metadata in a single txt file. Copied it, pasted it in chatgpt and asked for a modified code, so that each metadata would go in its own txt file with the img name

#

so it was a AI-human1-AI-human2 interaction. 😄

uncut fiber
#

i just don't know. xnview shows metadata, but not the ones you are looking for.

floral island
#

redbull clown mecha... uhhh

thorny frost
#

Chatgpt is better at modifying code than making it from scratch i think

hardy cipher
#

well there are approaches that can work, but they require human interaction to a degree. run it by bing every so often. but don't bother with bard. he's a bit tarded these days

#

well, just lies about EVERYTHING

thorny frost
#

bard is pretty dumb

#

LOL

#

has not a clue about anything

hardy cipher
#

hey bard, can you help me with this thing?

sure I know about that thing and I'd be happy to help

cool, so here's my issue with the thing we just discussed

as an AI language model I'm unable to assist you with that

floral island
thorny frost
#

have you tried to run llama model locally?

#

there are some that code acceptably well, not so good as chatgpt though

floral island
hardy cipher
#

well if you want actual programming assistance just get something like github copilot. but then, that thing doesn't have any clue about anything. it knows code real good, but it definitely misses a lot of other things

#

people want these things to be binary or like switches. but that's not how it happens

thorny frost
#

for amateurs like myself chatgpt is just enough.

hardy cipher
#

if you say so. but as I said, if you don't go over everything it does you're going to have a bad time at some point. it'll do something so subtle you won't recognize it. use a deprecated library, forget a comma in a place that rarely comes up. who knows what sort of hidden frustration mines it will leave

#

maybe?

#

like, I'm in no way trying to discourage you from using it

#

just want to let you know what I've found its shortcomings to be. it's still a huge help to almost anyone

thorny frost
hardy cipher
#

are those dentures made of stacks of teeth?

thorny frost
#

i wonder what the prompt was for this. Sadly i converted it to jpg before having my brand new metadata extractor script.

hardy cipher
#

it's just a new kind of tool, like a hammer that thinks real good. but any tool has its limitations

#

is that a butt to butt shrimp centipede?

thorny frost
#

dont know. All i know shrimps are less appetizing now

hardy cipher
thorny frost
#

i use the one button prompt script to generate a bunch of images with random subjects, i only establish the style and loras to use

hardy cipher
#

man, I have something like that sort of

#

except instead of random prompts I use images for the prompts

thorny frost
#

it generates garbage but sometimes a pearl appears, so it is interesting to keep the prompts somewhere

thorny frost
hardy cipher
#

this one rarely makes garbage. just doesn't always turn out exactly the way expected

#

it's a combination of a few things

#

which ui are you on?

thorny frost
#

ah, i know what you mean

#

ipa

hardy cipher
#

well it's clip, blip, ipa, controlnet, etc

thorny frost
#

i use both a1111 and comfy

hardy cipher
#

all balanced delicately

#

and I don't think there's a one size fits all for the settings and parameters, but I have done some goofy things with conditioning data which seems to have made it less susceptible to the black boxes I was getting at first

thorny frost
#

have been disconnected for a couple of weeks and didn't remember IPA

hardy cipher
#

dude, this combo I have going never gets old. just need 3 images and it's on

thorny frost
#

is it embedded in the image

#

?

hardy cipher
#

yeah, but it's a bit sloppy as I've been moving things around

thorny frost
#

will try it anyway. :p

hardy cipher
#

love the space monsters chasing children images

#

the monsters always look so full of joy

#

I have some examples of what the same images look like in different orders too

#

each slot has a different role

#

first image is ipadapter with interrogator running to clip g

#

second is two different types of clipvisions plus clip l

peak dove
hardy cipher
#

third image is latent base and controlnet

#

might have to install some nodes if you want to use it properly

thorny frost
hardy cipher
#

ahh

#

give me a second. I know why

#

I was just copying and pasting rather than dragging them from the folder

#

I'll make a new one and send it your way

thorny frost
#

ok ty

hardy cipher
#

let's see what happens here

shy kelp
#

deconstructing pepedrawing

dapper dragon
hardy cipher
#

let me know if you see anything that's obviously not the way it should be

thorny frost
hardy cipher
#

excellent

thorny frost
#

had to update everything after two weeks of not using comfy zzz

hardy cipher
#

these things happen. and that ipadapter is so finicky and needs updating constantly

peak dove
#

AUTOMATIC1111 using Unstable Diffusers v6 model (Civit.ai) and SDXL Refiner

shy kelp
#

(haven't added the lora stack but it's working Okayge)

hardy cipher
#

which flow is this?

shy kelp
#

frankenstein'd the formulaxl workflow

#

what are these switches? Think if i had like, 3 loras, do i have to move the switch to on? hmm

#

or even 1 lora, does switch need to be on?

shy kelp
#

Hey can i use stable diffusion 1.0 using api

zinc cargo
shy kelp
shy kelp
zinc cargo
#

what goes on with the face?
is this a public workflow?

shy kelp
uncut gull
#

Can your prompt do that? 😁
Practical applications of the overlay prompts I've been working on. Her face is actually inside the helmet!

```
Steps: 50, Sampler: DPM++ 3M SDE Exponential, CFG scale: 7, Seed: 3784295722, Size: 1024x1024, Model hash: 0b76532e03, Model: SDXL-CrystalClear, VAE hash: 551eac7037, VAE: sdxl_vae.safetensors, Version: v1.6.0
```
shy kelp
#

nice.
i remember the other day, we were trying to get this woman's face behind some glass, pretty tough

uncut gull
shy kelp
#

NODDERS turned out great!

uncut gull
#

Veils too.

```
Steps: 50, Sampler: DPM++ 3M SDE Exponential, CFG scale: 7, Seed: 2151636245, Size: 1024x1024, Model hash: 0b76532e03, Model: SDXL-CrystalClear, VAE hash: 551eac7037, VAE: sdxl_vae.safetensors, Version: v1.6.0
```
uncut gull
#

And of course... the holy grail of fine-structure overlays... 🥁 🥁 🥁
Portrait rainfall coverage! 🥳 🎉 🌧️

...With a pretty convoluted prompt though...

```
Steps: 50, Sampler: DPM++ 3M SDE Exponential, CFG scale: 7, Seed: 1538189682, Size: 1024x1024, Model hash: 0b76532e03, Model: SDXL-CrystalClear, VAE hash: 551eac7037, VAE: sdxl_vae.safetensors, Version: v1.6.0
```
#

Basically, SDXL sets up an internal 3D scene at the very beginning of the render, in the first 20% of steps or so. Within that 3D scene, it has the concept of a background, a foreground subject, and a screen-level overlay. Using prompt-switching lets us prime that screen-level overlay with large-scale structures that resemble the final effect we want. Once the 3D scene is primed like this, we can switch to a description of the prompt we actually want, and SDXL will convert those large-scale structures into fine details.

The screen-level overlay in general is very fragile. SDXL will just discard the entire structure in most renders. For that reason, we have to prime a special kind of scene: One with a background and screen-level overlay, but no foreground subject. Then, only after 20% of steps (once the 3D scene is well-established), we insert a foreground subject. At this point, the screen-level overlay's structure is so solid that it will not be discarded in favor of the subject.
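A minimal sketch of how such a two-stage prompt could be assembled. It assumes the AUTOMATIC1111 prompt-editing syntax (`[from:to:when]` to swap text and `[to:when]` to add text after a fraction of the steps). The actual prompts were not posted, so `overlay_prompt`, its argument order, and the example strings are all hypothetical.

```python
def overlay_prompt(background, overlay_primer, overlay_final, subject, switch_at=0.2):
    """Build a prompt that primes an overlay first and adds the subject later.

    Assumes A1111 prompt editing:
      [a:b:0.2]  renders "a" for the first 20% of steps, then "b"
      [b:0.2]    adds "b" only after the first 20% of steps
    """
    if not 0 < switch_at < 1:
        raise ValueError("switch_at must be a fraction between 0 and 1")
    # Large-scale structure first, fine-detail description afterwards
    overlay = f"[{overlay_primer}:{overlay_final}:{switch_at}]"
    # No foreground subject until the scene is established
    late_subject = f"[{subject}:{switch_at}]"
    return f"{background}, {overlay}, {late_subject}"


print(overlay_prompt(
    "city street at night",
    "thick white diagonal streaks",
    "heavy rainfall",
    "portrait of a woman",
))
```

The 0.2 default just mirrors the "first 20% of steps" figure above. The right value probably depends on sampler and step count.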

zinc cargo
# shy kelp

how does it handle known faces? or a Lora

shy kelp
#

honestly, not entirely sure

#

im not sure with a lot of things lol

#

it's weird, with formulaxl, my gen was failing due to memory issues but then i set it up exactly how it was and no memory issues now peepoShrug

zinc cargo
#

did you share the formulaxl workflow?

#

i only find its model

#

ok, just got it that i can copy paste a workflow off civit.ai website

indigo carbon
uncut gull
uncut fiber
uncut gull
#

This is the Tooth Furry. He sneaks into your room while you're sleeping and takes the teeth of children who don't brush their teeth.

#

Really though, SDXL struggles so much with teeth...

uncut fiber
#

He will eat your fat daemon! 😄

noble shoal
uncut fiber
#

going to rest, will try open his mouth later 🙂

uncut fiber
shy kelp
#

idk what is going on

noble shoal
uncut fiber
#

Fatal battle of humankind vs evil beasts. Last part 🙂

shy kelp
#

it found a face in there monkaGIGA

stone fossil
noble shoal
#

I am starting a new series called "Sports done wrong"

stone fossil
#

Id like to request soccer icehockey.

stone fossil
#

I'd like to add deep sea tennis diving.

noble shoal
uncut gull
noble shoal
#

Heck yes! Monstertruck Polo

uncut gull
#

Prompt a hand with 6 fingers, six fingers, polydactyl. I guess you just can't have it both ways. 😮‍💨

#

Well, I mean, if SDXL knew how to COUNT, now that would be AMAZING! Seriously, Stability, text is awesome, but please teach it how to count next!

noble shoal
#

@uncut gull That was the request, right?

uncut gull
noble shoal
#

Female was part of the request. Where do i need to sign the cancelation form?

noble shoal
uncut gull
uncut gull
#

You can't visualize a "thousand" in your head, neither can it. Also, human counting doesn't generalize perfectly to all scenarios. Just because you can count change in a cash register doesn't mean you can count beats in a musical measure. The question is: Can it count at all, or does it only have some numbered-pictures memorized?
(But it'll take a while to work out how to set this up and then run it.)

noble shoal
rustic garnet
#

I guess so, too ^^°

uncut gull
noble shoal
#

Most Finger picture i can get so far

#

Most of them need a manicure, though

uncut gull
#

Can SDXL count?
To get a real answer, you would need to do latent saliency probing.
That would mean: SDXL would need to show an embedding that activates on the prompt "four". But, when you prompt for "cookies" and SDXL generates an image with four cookies, that "four"-embedding activates even though "four" wasn't in the prompt.

What I can do... Is at best, just check how it scores across a random sampling of subjects. I'll have Llama2 get me a huge list of countable things, then pair them with random numbers between 2 and 10, and measure the hit-rate... Oof. That's one long experiment.
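The sampling experiment described above could be sketched roughly like this. The subject list would come from wherever (Llama2 in this case), the counting of what actually shows up in each image still has to be done by hand or by some detector, and `build_trials` / `hit_rate` are hypothetical names rather than anything from an existing tool.

```python
import random


def build_trials(subjects, n_trials, low=2, high=10, seed=0):
    """Pair randomly chosen subjects with random counts in [low, high]."""
    rng = random.Random(seed)  # fixed seed so the run can be repeated
    trials = []
    for _ in range(n_trials):
        subject = rng.choice(subjects)
        count = rng.randint(low, high)  # inclusive on both ends
        trials.append({
            "subject": subject,
            "count": count,
            "prompt": f"{count} {subject}",
        })
    return trials


def hit_rate(expected, observed):
    """Fraction of trials where the observed count equals the requested one."""
    if len(expected) != len(observed):
        raise ValueError("expected and observed must have the same length")
    if not expected:
        return 0.0
    hits = sum(e == o for e, o in zip(expected, observed))
    return hits / len(expected)


trials = build_trials(["cookies", "birds", "chairs"], n_trials=5)
for trial in trials:
    print(trial["prompt"])
```

Comparing the hit rate against what random guessing would give is what separates "can count a little" from "cannot count at all".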

hardy cipher
#

gpt 4 can barely count

#

and sdxl doesn't understand human anatomy

uncut gull
#

The question is not "can it count well", but "can it count at all?"

noble shoal
hardy cipher
#

well I'm saying, their training didn't necessarily give them an understanding of the concepts we take for granted as simple

hardy cipher
uncut gull
hardy cipher
#

they learn them, but not flawlessly

noble shoal
hardy cipher
#

have you ever tried having chat gpt give you a list of x number of something?

#

gpt 3 will fall off the rails quickly

#

3.5, whatever

#

4 does better, but it starts breaking down soon as well

#

they can do insanely advanced things in an instant, but struggle with computational tasks that most 5 year olds could work out

uncut gull
# hardy cipher 4 does better, but it starts breaking down soon as well

I honestly only use Llama2 on my local PC. It can count lists fine as long as you don't make them too long. Same for humans. You can count chairs or coins or whatever fine if there's just five or six. You can group them into stacks if there's a few hundred, but tens of thousands? Forget it. Humans can't count that without messing up. (We use technology, as old as wax and styli, to make up for it. 🙂 )

But... Can SDXL count at all? I will have evidence one way or another soon!

rustic garnet
#

you can train a DNN to interpolate a sine function, but it will still struggle to extrapolate the very same function no matter how many neurons you use for training

#

same for large language models. There is a funny anecdote about large language models that couldn't be finetuned on larger context windows because the model was initially trained on a fixed-size context window. The problem was not that it could not generalize to the larger window; it could not learn the positional embeddings because it had never seen such large numbers.

#

same goes for SDXL. We know that it struggles if we generate images with unusual aspect ratios

#

the problem is not that the underlying unet wouldn't be able to generate these aspect ratios

#

it just doesn't understand the numbers

#

768x1344 is fine cause it was trained on these numbers. 1080x1480 is weird, cause it has never seen these numbers in its training data
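One way to act on this is to snap a requested size to the closest resolution the model has seen. In the sketch below only 768x1344 comes from this chat. The rest of the `SDXL_BUCKETS` list is a commonly cited set of SDXL resolutions that I'm assuming here, so check it against the model's own documentation. `nearest_bucket` is a hypothetical helper.

```python
# Assumed list of SDXL-friendly resolutions (width, height).
# Only (768, 1344) is confirmed in the conversation above.
SDXL_BUCKETS = [
    (1024, 1024),
    (1152, 896), (896, 1152),
    (1216, 832), (832, 1216),
    (1344, 768), (768, 1344),
    (1536, 640), (640, 1536),
]


def nearest_bucket(width, height, buckets=SDXL_BUCKETS):
    """Return the bucket whose aspect ratio is closest to width/height."""
    target = width / height
    return min(buckets, key=lambda wh: abs(wh[0] / wh[1] - target))


print(nearest_bucket(1080, 1480))
```

Generating at the bucket size and resizing or cropping afterwards avoids feeding the model size numbers it never saw.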

noble shoal
indigo carbon
uncut gull
indigo carbon
hardy cipher
#

it just replicates the patterns that it learns. and sometimes when denoising it'll see an opening for 7 or 8 fingers on the one hand. so that's what it'll give you.

#

I'd imagine a lot of its logic also comes from the clip models right?

uncut gull
indigo carbon
hardy cipher
#

an ensemble of experts, interesting

indigo carbon
#

it's just how to make all the weights work together in a single inference

indigo carbon
uncut gull
hardy cipher
#

ooh, an all in one LLM team

indigo carbon
# indigo carbon this

so idk if they really train anything. it seems they just glue together diffusion and LLM, etc..

hardy cipher
#

this tech is moving at warp speed

#

well that's what I think is going to keep happening

#

train specialized models that are really good at their one specific task or few tasks

#

then link them up with other models that are experts in other tasks

uncut gull
#

The 🔥 means these weights are live / updated parameters during training. The ❄️ icon means these weights are frozen and just used for forward propagation during training.

hardy cipher
indigo carbon
uncut gull
hardy cipher
#

well once you get the first then you move to the second

indigo carbon
uncut gull
#

And you might be right. I doubt it though. Even prompting for SDXL is not the same as prompting for SD 1.5.

hardy cipher
#

well yeah, but the framework is there

#

basic inputs and outputs are comparable. they might differ in some of the specifics but they are largely the same

uncut gull
#

It didn't mention that she's feeling "wet".

half ivy
#

Got no solid workflows for sdxl (control'd), Auto1111 cant produce (for me) = sad bois club, heart broke island, waterfall of tears, depression cave vibes

uncut gull
half ivy
half ivy
indigo carbon
uncut gull
#

😮‍💨 It really doesn't know what it's looking at. The LLM just has a very vague understanding of what's in the image, and zero understanding of the images it creates. (It never "sees" its own output.)

indigo carbon
uncut gull
#

LOL no wait. After you post the next message, it gets to see what image it outputted. Not super helpful. 🙂

#

We're not there yet though. Not with this exact setup, anyway.

indigo carbon
uncut gull
indigo carbon
#

I don't know what MJ does for blending, but it's not IPA for sure.. I think when it comes to pure txt2img SDXL destroys stuff like MJ but again, the lack of capabilities

peak dove
#

AUTOMATIC1111 DynaVision XL for a model

peak dove
zinc cargo
#

android to apple

peak dove
#

Chalk2Cheez

zinc cargo
#

rounded corners and censorship

#

only thing left is for SD to do pistachios and im good.

uncut gull
#

Bad news guys... SDXL can't count! 😭 At least that's what all my tests are showing so far.

hardy cipher
#

best image made by sdxl thus far?

uncut fiber
indigo carbon
uncut gull
zinc cargo
uncut gull
#

SDXL can't count at all. It doesn't even have a few numbered-pictures memorized. 😐
The prompt was {two|three|four|five|six} against a white background, and only the two ever succeeded.
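For reference, the `{a|b|c}` prompt notation is shorthand for a set of variants. Below is a small sketch that expands it into the individual prompts, assuming flat (non-nested) groups. `expand_variants` is a hypothetical helper and not the code of any particular extension.

```python
import itertools
import re

# Matches one flat {option|option|...} group
_VARIANT = re.compile(r"\{([^{}]*)\}")


def expand_variants(template):
    """Expand every {a|b|c} group into all concrete prompt combinations."""
    # re.split with one capture group alternates literal text (even indices)
    # and the captured option groups (odd indices)
    parts = _VARIANT.split(template)
    choices = [
        part.split("|") if index % 2 else [part]
        for index, part in enumerate(parts)
    ]
    return ["".join(combo) for combo in itertools.product(*choices)]


for prompt in expand_variants("{two|three|four|five|six} against a white background"):
    print(prompt)
```

Tagging each render with the variant that produced it is what makes the per-number accuracy easy to tally afterwards.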

#

(I'm testing now whether it understands the difference between 1 and 2.)

indigo carbon
uncut gull
indigo carbon
#

the blend thingy is impressive, but txt2img is not as good as SDXL

hardy cipher
peak dove
hardy cipher
#

well some opinions hold more weight than others

rustic garnet
#

dunno, I can't try MJ myself, but I sometimes see images other people generated with MJ that look extremely good. At least it seems that MJ is better at getting human anatomy right than SDXL. What's noticeable, though, is that all MJ images I saw so far look "the same", as if MJ can only do one style (or all people who use MJ and post images do the same style?)

peak dove
#

Objectivity is the democracy of subjectivities.

#

All subjective opinions are pooled and the objective emerges ...

hardy cipher
#

mj seems like it produces amazing results with minimal effort. which is great. but in that same vein it also feels like the user is in no way controlling the real outcome

#

it's like taking a train somewhere

uncut gull
#

Somebody's using SD for captchas.

hardy cipher
#

or having a driver

peak dove
lusty wolf
#

Anyone have a problem with the just updated comfyui? My nodes stick to the cursor and you can't put them down. Even when you click on the screen it starts dragging the screen and does not let go....agony

hardy cipher
#

stable diffusion gives you a powerful engine but you need to know what to do with it

zinc cargo
indigo carbon
peak dove
hardy cipher
rustic garnet
hardy cipher
#

I'm not putting it down. what I'm saying is that a lot of powerful tools really aren't plug and play

peak dove
rustic garnet
floral island
#

someone gimme dumb prompt quick, wanna test my new merge

peak dove
#

A camel eating a banana

floral island
indigo carbon
peak dove
#

A serpent in silk pyjamas

shy kelp
floral island
zinc cargo
peak dove
#

This is a good prompt - (To a girl: If You Were A Booger, I'd Pick You!)

zinc cargo
shy kelp
#

lol

floral island
vital ermine
indigo carbon
shy kelp
#

that's a nice looking camel
and spock

peak dove
#

A snake in silk pyjamas

zinc cargo
lusty wolf
zinc cargo
shy kelp
#

ngl, i picked up comfy a few days ago and love it so far

#

been on a1 for like 1+yr or something, too long

shy kelp
indigo carbon
peak dove
zinc cargo
#

had to put it in midj... it's not so bad 😛
before upscaling etc

floral island
#

OMFG

#

actual prompt: This is a good prompt - (To a girl: If You Were A Booger, I'd Pick You!)

uncut gull
#

Okay! In some circumstances, SDXL can tell the difference between one and two with a 100% hit rate. That counts as counting! (Because I said so.) However! It can probably only count animals and flowers???
Prompt {one|two} against a white background. 100% accuracy, and it is obviously not pasting in pictures that originally had two subjects. (Some of these are clearly not pasted pictures.)

indigo carbon
noble shoal
zinc cargo
#

@indigo carbon you're preaching to the choir, we are all team SD here...

rustic garnet
# indigo carbon not as good. plus, MJ isn't free like SDXL is

of course mj is not free. It also does not have this amazing ecosystem of tools, plugins, extensions, controlnets and workflows. There are so many reasons why SDXL is better than MJ. But if you just look at text2image I would say MJ is not worse than SDXL

zinc cargo
#

but, MJ has its strong points, it's a wonderful tool with plenty of stuff that doesn't require GPUs and serious knowledge.
it simplifies a lot of the stuff we actually enjoy - so like SD, it's a tool. not a way of life. you can still use both

peak dove
#

L8r

floral island
indigo carbon
#

idk what y'all want. even though MJ is paid, it's still not as good when it comes to txt2img. that's all I'll say.

floral island
#

MJ really making a case for the face

hardy cipher
floral island
indigo carbon
lusty wolf
indigo carbon
indigo carbon
#

I use commits from a week or so ago because it's faster on my 4070ti for some reason

lusty wolf
lusty wolf
indigo carbon
#
GitHub

This was orginally written by: https://github.com/hlky - GitHub - FizzleDorf/AIT at ca8f0627992649eb2211bbdc2f269484b054e62b

GitHub

A powerful and modular stable diffusion GUI with a graph/nodes interface. - GitHub - comfyanonymous/ComfyUI at bc76b3829f5fbba7c5a439c7833d313a3ca87398

lusty wolf
#

Been trying to use the AI templates for the last week, (that is why I updated) even on the old it gave me the same error. So I give up.

indigo carbon
#

this is what I use

lusty wolf
#

Can one see the commits in the manager, or where does one see that info?

shy kelp
indigo carbon
noble shoal
#

A tribute to Tdg8uU

vital ermine
indigo carbon
#

banana cubes

hardy cipher
#

that's pretty good tbh

uncut gull
#

Okay... SDXL can count "birds" specifically! Why birds? Why not.
It can count "one" and "two" birds with 100% accuracy, and "three" birds with 75% accuracy.
Prompt {one|two|three} birds against a white background

#

How far does its concept of "one" and "two" generalize? Stay tuned.

hardy cipher
#

well think of it like this, they can count, sorta kinda, but not in the same context that we can

#

they don't understand what it all means, probably don't recognize larger numbers, etc

#

they are simply trained to recognize patterns. and sometimes things have numbers attached to them

noble shoal
shy kelp
uncut fiber
#

how about tags 1girl and 1boy ??? @uncut gull

uncut gull
# hardy cipher they are simply trained to recognize patterns. and sometimes things have numbers...

Oy. I'm talking about real results here, you know? Does SDXL have a baseline concept of "one" and "two"?

But here's OthelloGPT. I point people to this when they say NNs just "recognize patterns". https://arxiv.org/abs/2210.13382
Trained only to predict the next move in an Othello game, a GPT learns to predict the next move in an Othello game. No surprises there.
How does it do this, under the hood? It develops the concept of an 8x8 board, and learns the rules of Othello. Why? Because that's how you predict the next move in an Othello game.

How do you count? By understanding numbers. And I, as a human, do not understand the number 1.23 * 10^60 in the same way that I understand the number 5.

uncut gull
vital ermine
hardy cipher
#

I don't know if these results are very scientific

uncut fiber
#

depends if 3 cats are where 3 cats really should be.

vital ermine
#

the main reason I jumped to 2.0 is that it could count up to 5

#

got it to count up to 5 knives and people were shocked. Never looked back

uncut gull
hardy cipher
#

well what I'm saying is that it's not a binary concept

uncut fiber
#

o.k.

hardy cipher
#

can it count, can't it count

#

it's not an on off yes no thing

uncut fiber
#

tried 355/113 today, it changed theme 😄

uncut gull
#

It cannot count cookies. Flat out.

indigo carbon
uncut gull
#

It can count Kermits as accurately as birds and cats! 🙂
(Each one of these is tagged "one" "two" or "three", and all of them are accurate.)

hardy cipher
indigo carbon
hardy cipher
#

it has no baseline of knowledge of human concepts

uncut fiber
#

with cookies it is probably difficult to know where one starts and ends, whether it is one or two cookies.

hardy cipher
#

yes

#

I don't even know sometimes

#

it's a fuzzy concept

uncut fiber
#

@hardy cipher i read it now, didn't notice before 🙂

indigo carbon
uncut gull
indigo carbon
uncut gull
#

It can count "two" and "three" Sailor Moons with 100% accuracy, but not "one"? Or is it because I prompted "one Sailor Moons" (plural)?

floral island
indigo carbon
uncut gull
#

Fixing the prompt for plural / non-plural, we get {one Sailor Moon|two Sailor Moons|three Sailor Moons} against a white background, and now it can count Sailor Moon with 100% accuracy. 🙂

indigo carbon
floral island
indigo carbon
floral island
#

all 3 prompts combined!

uncut gull
#

It is 100% accurate with "one Oreo" and "two Oreos", but definitely not "three Oreos". It is generalizing at least a little bit to things without faces.

shy kelp
#

a matte grey cube in rainy cyberpunk city

indigo carbon
floral island
shy kelp
#

1 cookie

indigo carbon
floral island
uncut gull
#

It is counting 1 and 2 Excaliburs, and 3 with better-than-chance.

floral island
#

@uncut gull it all depends on model, which models "learned to count"

#

usually 1, 2, 3, 5 and multiple are the things you can rely on

indigo carbon
floral island
#

for example elysium anime model learned what 'punks' are

uncut gull
floral island
indigo carbon
floral island
#

anything above will be 'guesstimate levels'

oblique tartan
#

anyone using rtx3060, how long does it take for you to render a 1024px image? kindly include sampler use and step count.

uncut gull
indigo carbon
floral island
#

sdxl doesn't have the concept

#

the checkpoint needs to associate the tag "one/two" with similar elements

#

that's a training thing

shy kelp
#

why its elevated, idk peepoShrug

floral island
# shy kelp

have you ever seen non-flying cubes in cyberpunk cities, duh?!

indigo carbon
#

mine aren't floating =\

floral island
uncut gull
floral island
#

yes, this is the 2 cubes concept

#

it TRIES to make 2 similar elements

lusty wolf
#

Depends on which side of the globe you are... floating here too.

floral island
#

but one image failed

#

three, will probably have even worse results

#

why?

#

because one and two are the most common numbers used in tagging images

#

three, lot less common

#

the rest, even less common

shy kelp
floral island
#

you'd have 'thousands' or 'a group of'

uncut gull
# floral island the rest, even less common

Yep. That's how number concepts work. That's why you can glance at three chairs and know it's three chairs. But glance at twenty, and you have no idea. Then you need an algorithm and a process.

floral island
#

and that's how the models work too

#

only they're dumber than us

indigo carbon
floral island
#

so after 2, they already start to fail

uncut gull
floral island
#

2 out of 3, three cubes. my model is pretty smart, but i like it that way

#

can it do. 5 cubes?!

hardy cipher
#

have you ever tried.... 5 cubes?

shy kelp
#

exactly 20 chairs lol

floral island
#

well, color me impressed, again 2/3

hardy cipher
#

this is what you guys need to be doing

floral island
floral island
shy kelp
#

here is a guard tower

floral island
#

this is glorious!

indigo carbon
hardy cipher
#

just gotta treat it right

floral island
#

@uncut gull this is 'millions of cubes' -> the model tries to resolve the request and 'decided' that sub-cubes would suffice

noble shoal
uncut gull
#

It can count Einsteins as well as Kermits and cats. It can consistently make "one" or "two" {things with faces}. "Three" is ~75% accurate across the board for things with faces.

uncut gull
shy kelp
#

i have not been able to recreate that skull in the sky

floral island
#

meat. cubes. (technically prompt was : cubes made of meat)

noble shoal
indigo carbon
# hardy cipher just gotta treat it right

was BLIP the solution? I don't see how we would achieve instructive image editing and blending without SDXL losing its qualities though. I'm thinking the ultimate solve might be to make SDXL work with something like LLaMA to give it image input abilities like MJ has

floral island
#

'a few meat cubes'

#

what's with the laser dome?!

hardy cipher
native knot
floral island
#

oh. i still have bismuth in my prompt

uncut gull
#
Steps: 50, Sampler: DPM++ 3M SDE Exponential, CFG scale: 7, Seed: 1920749068, Size: 1024x1024, Model hash: 0b76532e03, Model: SDXL-CrystalClear, VAE hash: 551eac7037, VAE: sdxl_vae.safetensors, Version: v1.6.0
floral island
#

meat cube, but it also has neon glass bismuth cube in the prompt. no wonder i get these packaged meats

hardy cipher
#

plus I kind of stopped trying to go pure ipadapter because that seemed to fizzle out a lot. but using it with accompanying tools seems to work alright

zinc cargo
shy kelp
native knot
#

The Impossible Burger supplier had a container fall off the ship, basically.

floral island
#

sigh. i hate it when prompts listen too close.

indigo carbon
floral island
#

steak block, now farmable in minecraft

hardy cipher
#

not sure what's going to happen, but I bet it will look delicious

#

oh my

#

good idea, eface

indigo carbon
#

when doing stuff like IPA it makes a whole lot of new issues like speed decrease and compatibility issues with optimizations, so multimodal might be a solution

shy kelp
hardy cipher
floral island
hardy cipher
floral island
#

lol "how do you want your steak done -> blocky"

shy kelp
indigo carbon
zinc cargo
indigo carbon
floral island
#

this looks as raw as possible xD

indigo carbon
native knot
indigo carbon
floral island
# indigo carbon

adding "minecraft block" helps make it a defined block, i just noticed

hardy cipher
floral island
#

it's awesome!

native knot
#

Every good meat cube needs a little bone.

floral island
#

reminds me of my last spareribs

#

you look at the ribs -> they just fall out

#

good spareribs ❤️

zinc cargo
native knot
#

Too small.

floral island
hardy cipher
#

think of the cow they'd have to be cut from

native knot
#

lol

floral island
#

1000 feet cow

floral island
#

333 non-freedom units

native knot
#

Imagine the size of Salt Bae you'd need.

zinc cargo
floral island
#

asked for electronic pork chops -> am disappointed in my model

noble shoal
hardy cipher
#

they've evolved

native knot
#

Too much minecraft influence there.

shy kelp
hardy cipher
#

that's the economy beef cube

noble shoal
#

This even has vents

floral island
#

here is where you get those 1m cubes of beef from

native knot
#

Massive unit, that one.

hardy cipher
#

the size of that lad. geez

floral island
#

that would feed a nation for months

#

but, i'm not done yet!

cyan crown
floral island
#

YEARS OF BEEF STEAK!

#

i mean, a single leg would keep a family fed for years

lusty wolf
hardy cipher
floral island
#

but. i'm sure i can make bigger cows!

#

still not big enough.

hardy cipher
#

full meat

noble shoal
floral island
noble shoal
floral island
#

oh he just donates 0.1% of his body mass for all the world so they can eat for a year

#

universe scaled cow, your move atheists

hardy cipher
#

that cow actually dictates the physical laws of the known universe.

floral island
#

meatball star lol xD

floral island
#

meatball star. capable of feeding a civilisation for a gazillion years

#

or 5 years, because nestle be like 'yeah it's gone to waste'

#

been there for 1000000000 years, but went bad in 3 months

zinc cargo
#

meat ball star gave me this lol

hardy cipher
#

tried to make a lovely steak island

#

almost

indigo carbon
floral island
#

meat gundam?

zinc cargo
indigo carbon
floral island
#

meat gundam.

hardy cipher
#

that comic blew my mind tbh

floral island
#

oh yeah!

cyan crown
hardy cipher
#

those are unironically good

floral island
#

more meat robots!

hardy cipher
#

going through things I made recently, lol. don't always look at all of them initially

floral island
#

comes with integrated salad. lol this is a walking meal xD

cyan crown
shy kelp
#

space cow

cyan crown
floral island
#

"i'm ur dinner"

cyan crown
hardy cipher
#

wet meat gang

shy kelp
#

finding out the hard way that using a photorealistic lora on a photorealistic model is just overkill

hardy cipher
shy kelp
#

meatpocalypse

hardy cipher
#

I love it and hate it

#

that base image is a keeper though

shy kelp
#

wardog

floral island
zinc cargo
#

look into my eyes

native knot
shy kelp
floral island
shy kelp
#

not res'd but i liked this gen

floral island
#

uhh.. whatever this is.

#

oooh... this mech might win against the dogs!

shy kelp
floral island
#

meat gundam. GO

#

uhh....

zinc cargo
#

wonderful!

native knot
#

Delicious

zinc cargo
#

and glad you didnt put sausage into the prompt

floral island
#

MEAT. GUNDAM.

#

imma keep going till i get the ultimate meal gundam

#

did i just invent the ultimate meal?

shy kelp
#

what is that green stuff?!

floral island
#

the stuff you don't eat.

shy kelp
floral island
shy kelp
floral island
shy kelp
#

i feel close

floral island
shy kelp
#

will always remind me of sea of thieves

vital ermine
indigo carbon
indigo carbon
floral island
#

the fabled eye of the storm

#

"so uhh... what are we going to do about that?"

#

"bullshit inspirational quote"

glad grove
#

she speakin fax tho

floral island
#

"space italian"

#

"british with a heart"

shy kelp
stone fossil
hardy cipher
crisp owl
hardy cipher
stone fossil
hardy cipher
stone fossil
hardy cipher
stone fossil
hardy cipher
stone fossil
uncut gull
stone fossil
native knot
#

Hey fellas...any of you done something with comfy where you point it at a directory full of images (or in my case, one with subdirectories full of images) and then have that run through the WD14 Tagger? Trying to get a bunch of images tagged and I figured comfy might actually be useful for that since I can otherwise build the workflow how I need. I just can't seem to find a way (yet) to load a directory of images in for processing.

stone fossil
#

You want to use WD14 to caption?

uncut gull
#

Card backs by faction.

stone fossil
#

Use Kohya.

native knot
# stone fossil You want to use WD14 to caption?

I'm using pythongosssss's WD14 tagger and it does a good job pulling them in. I'm dropping them off into a custom script that then adds the tags into my Lychee server when the images are scraped in.

stone fossil
#

I use both BLIP and WD14 for my stuff and then go over them by hand, with love. 🙂

native knot
#

I only need to do it once through for all the images that were generated prior to now. I've got new ones saving the tags off and being processed, but I've got thousands of images I want to pipe through the WD14 tagger and it'll be trivial once I can figure that part out.

stone fossil
#

Oh well this is for when I make a lora.

#

Both free caption methods give me a head start to edit each caption and merge them.

uncut gull
# shy kelp

I give up. It's not consistent, and it's always just somewhere between bone and cloud, never really a cloud shaped like a skull.

Steps: 100, Sampler: DPM++ 3M SDE Exponential, CFG scale: 7, Seed: 1629730332, Size: 1024x1024, Model hash: 0b76532e03, Model: SDXL-CrystalClear, Version: v1.6.0
native knot
#

🙂

uncut gull
hardy cipher
native knot
#

New from Burger King?

hardy cipher
#

Yes, with authentic blue cheese

native knot
#

Your way, right away, as you die, from Burger King now.

hardy cipher
stuck bobcat
hardy cipher
slender coral
#

Could someone recommend a comfy workflow for generating with upscale and use of loras?

hoary saddle
#

no addon made with oob nodes, very basic

stone fossil
slender coral
hoary saddle
hoary saddle
slender coral
# hoary saddle this one

Thank you, prompting now. Could I ask why there is no refiner in this workflow? Trying to get a better understanding of nodes.

slender coral
# hoary saddle

Am I to assume generated images contain meta for the nodes?

hoary saddle
#

just the default comfy workflow

#

some use refiner, some don't, was more needed with sdxl 0.9 i think

#

loras do a good job in my opinion

hoary saddle
#

if you save image it will retain it, if you right click copy image it won't
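A minimal sketch of how that embedded metadata could be read back out of a saved PNG, assuming it sits in plain `tEXt` chunks. The function name is made up for illustration, and the key names a given tool writes (ComfyUI is commonly reported to use `prompt` and `workflow`) are an assumption rather than something confirmed in this chat. It uses only the standard library and does not verify chunk CRCs.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict[str, str]:
    """Return the tEXt key/value pairs found in the bytes of a PNG file."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts: dict[str, str] = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Chunk layout: 4-byte big-endian length, 4-byte type, payload, 4-byte CRC.
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        payload = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            # tEXt payload is "keyword\0text", both Latin-1 encoded.
            key, _, value = payload.partition(b"\x00")
            texts[key.decode("latin-1")] = value.decode("latin-1")
        elif chunk_type == b"IEND":
            break
        pos += 8 + length + 4  # skip header, payload and CRC
    return texts
```

Something like `read_png_text_chunks(open("image.png", "rb").read())` would then return whatever text entries the saver wrote. An image copied via right click usually goes through the clipboard as raw pixels, which is why those chunks are gone.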

slender coral
slender coral
hoary saddle
#

dunno what that one is either, i use civitai website mostly for checkpoints and lora's (make sure to check sdxl 1.0 in the filter), download and copy to your comfyui/models/checkpoints and loras folders and refresh webui to load them in dropdown

hoary saddle
slender coral
#

perfect, yeah I was just wondering if there was a similar thing I may be missing too

hoary saddle
#

as you load in others workflows youll come across tons of errors as most are using custom nodes, that button will help you figure out which addons to install

low hill
vital ermine
#

Vulcan need no poles

hardy cipher
vital ermine
#

I give up on people as I can't train them

hardy cipher
#

Man, these are legit.

native knot
#

Got that tagging workflow figured out. Took a bit of ingenuity, but:

hardy cipher
native knot
#

Basically spit an output from find into a text file, used that with an incremental loader paired with load image from file path, then a bit of string manipulation to extract the file name so that I can re-concat that with where the tag file needed to be saved and boom. Tested with a small batch of .png files to start, then thousands of .webp files are next.
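The file-listing and file-name steps described above can be sketched outside of Comfy roughly like this. It is only an illustration of the idea, not the actual node setup: `list_images`, `tag_path_for`, and the suffix set are hypothetical names, and the tagger call itself is left out.

```python
from pathlib import Path

# Assumed set of extensions; adjust to whatever the image folders contain.
IMAGE_SUFFIXES = {".png", ".webp", ".jpg", ".jpeg"}


def list_images(root: Path) -> list[Path]:
    """Recursively collect image files, similar to dumping `find` output to a list."""
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )


def tag_path_for(image: Path, tag_dir: Path) -> Path:
    """Drop the directory and extension, then re-join the bare name with the tag folder."""
    return tag_dir / f"{image.stem}.txt"
```

One thing to watch with this naming scheme: two images with the same file name in different subdirectories would map to the same tag file, so the relative path may need to be folded into the name.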

native knot
#

4675 webp files to get tags for, then a quick script to inject those into the mysql database for lychee. I've already got most of that written in a different way for images moving forward. Once all these get injected, finding an image on my site will be so much easier.

hardy cipher
#

Look like mf doom album covers

shy kelp
#

ghostly hershey kisses

vital ermine
hardy cipher
vital ermine
hardy cipher
native knot
#

Nearly 2000 images have tag files created for future integration into the database already. This will do just fine.

hardy cipher
vital ermine
hardy cipher
vital ermine
hardy cipher
uncut lichen
#

if i combine a pre-existing sd 1.5 (i think) model with sdxl, will it gain anything from it?

#

or should i just not bother until someone trains an sdxl model that has roughly the same style that i want

hardy cipher
#

You mean into one checkpoint?

uncut lichen
#

yeah

hardy cipher
#

Don't think you can

uncut lichen
#

unfortunate

hardy cipher
#

They're trained on different sized images, have different parameters. Just wouldn't work.
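A toy sketch of why that merge fails, assuming the usual weighted-sum style of checkpoint merging where every parameter is interpolated with the parameter of the same name. The "checkpoints" here are plain dicts of float lists and `weighted_merge` is a made-up helper, not any real tool's API.

```python
def weighted_merge(
    a: dict[str, list[float]],
    b: dict[str, list[float]],
    alpha: float,
) -> dict[str, list[float]]:
    """Interpolate two toy checkpoints key by key: (1 - alpha) * a + alpha * b."""
    if a.keys() != b.keys():
        raise ValueError("checkpoints have different parameter names")
    merged: dict[str, list[float]] = {}
    for name in a:
        if len(a[name]) != len(b[name]):
            # Different architectures end up here: same name, different size.
            raise ValueError(f"shape mismatch for {name!r}")
        merged[name] = [(1 - alpha) * x + alpha * y for x, y in zip(a[name], b[name])]
    return merged
```

Two models of the same architecture pass both checks and blend fine. A 1.5 checkpoint and an SDXL checkpoint would trip one of the two errors, since there is nothing to line the weights up against.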

#

But believe me. SDXL models are quickly surpassing the peak of what could be done with 1.5

uncut lichen
#

i know, its just that there aren't any available in the style that i like

hardy cipher
#

What style is that?

supple knot
vital ermine
#

Big dudes are back

supple knot
vital ermine
hardy cipher
vital ermine
ionic gulch
vital ermine
#

XL is fighting me for training a person yet every vid makes it seem so easy

shy kelp
#

is the process still the same as with 1.5 but just a bigger resolution src image?

vital ermine
#

pretty much

#

for styles I have that down pat

#

funny, as styles is supposed to be harder, lol

#

2 days of about 30 trainings

shy kelp
#

with 1.5, i didn't even bother tagging and got pretty good results
wonder if i could do that with xl

vital ermine
#

various ways etc...

#

Tried that too and no

shy kelp
#

:(

vital ermine
#

they say yes but damn if I could pull it off

#

if you don't tag it should make a hard coded image cause tagging/captioning is just telling it what you want to be able to change.

#

brown hair then I can change the hair colour

hardy cipher
slender coral
#

Anyone have any suggestions on more realistic?

half ivy
glad grove
#

sensei 🙇‍♂️

vital ermine
willow bane
#

hlo anyone here

#

i want to create a clothing graphics for a website which software should i use

hardy cipher
#

this one kind of terrifies me

supple knot
hardy cipher
#

cursed

supple knot
hardy cipher
supple knot
#

they're all wearing one outfit

hardy cipher
#

they're cousins

supple knot
#

Must be fashion Week

hardy cipher
#

good call

supple knot
tender timber
crisp owl
floral island
hardy cipher
#

not sure who this is exactly

floral island
hardy cipher
floral island
glad grove
#

thats a lot of mikus 😔

floral island
hardy cipher
#

not sure wtf is going on here

ionic gulch
#

Version 12 of my workflow - now with real "on/off" switches for Base, Refiner, Base Upscaled and Refiner Upscaled:
https://i.imgur.com/h98OyZA.jpg
https://github.com/JPS-GER/JPS-ComfyUI-Workflows

Features:

  • Easy-to-use menu area that fits on a single screen (if your screen is big enough)
  • Generate images with 100% Base model and/or Base + Refiner (selectable percentage for base and refiner, default: 80%)
  • Generate upscaled versions (2x)
  • On/Off switches for enabling/disabling generated images and/or upscaled versions
  • Easy selection of resolutions recommended for SDXL (aspect ratio between square and up to 21:9 / 9:21)
  • Switch between your own resolution and the resolution of the input image
  • Automatically adjust input images to the nearest recommended SDXL resolution
  • 5 Generation Modes: TXT2IMG, IMG2IMG, ControlNet Canny, ControlNet Depth, Inpainting
  • Use the VAE included in your model or provide a separate VAE (switchable)
  • 4x Lora - use up to 4 Lora models, menu area offers an on/off switch and all necessary parameters for fine tuning
  • 2x IP Adapter - use up to two input images, menu area offers an on/off switch and all necessary parameters for fine tuning
  • 2x Revision- use up to two input images, menu area offers an on/off switch and all necessary parameters for fine tuning
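The "nearest recommended SDXL resolution" feature in the list above could work roughly as follows. This is a sketch under assumptions, not the workflow's actual code: the resolution table is the commonly cited set of roughly one-megapixel SDXL sizes (the workflow's own list may differ), and `nearest_sdxl_resolution` is a hypothetical name.

```python
import math

# Commonly cited SDXL sizes, from square up to roughly 21:9 and 9:21 (assumed list).
SDXL_RESOLUTIONS = [
    (1024, 1024), (1152, 896), (896, 1152), (1216, 832), (832, 1216),
    (1344, 768), (768, 1344), (1536, 640), (640, 1536),
]


def nearest_sdxl_resolution(width: int, height: int) -> tuple[int, int]:
    """Pick the listed resolution whose aspect ratio is closest to the input's."""
    # Comparing log ratios treats landscape and portrait symmetrically.
    target = math.log(width / height)
    return min(
        SDXL_RESOLUTIONS,
        key=lambda wh: abs(math.log(wh[0] / wh[1]) - target),
    )
```

A 1920x1080 input, for example, lands on 1344x768, and the input image would then be scaled and cropped to that size.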

supple knot
hardy cipher
hardy cipher
ionic gulch
#

usually each "real" update gets a new major version. only bugfixes get smaller steps :)

#

so next week i have the workflow with the highest version number available :)

#

finally having a solution for turning on/off parts of the workflow with real switches may result in some other updates in the next days

hardy cipher
#

do you have it on civitai? or just github?

ionic gulch
#

only github

hardy cipher
#

what do you mean by real switches though? I have switches on my things. too bad I goofed on some of them, lol. but they're fixed now

#

they looked like they worked, but when you don't use matching variables things tend to not work right

ionic gulch
#

before you had to cut wires. now you can choose the four output images with a switch (shown in the first screenshot) and really turn off that part of the workflow

hardy cipher
#

ahhh, an entire part of the workflow

#

gotcha

#

I've had to change four words total so far. well in the last 2 or 3 weeks. and those four words broke 2 nodes

#

I guess it wasn't that bad. but don't like being sloppy like that

hardy cipher
#

one of his grandparents was a khajiit

supple knot
#

@hardy cipher it was this

hardy cipher
#

lol

#

wow

#

brilliant

woeful patio
hardy cipher
# supple knot @hardy cipher it was this

I just keep throwing things back in the pot. it's kind of like the upton sinclair book "the jungle" where butchers used to throw the old sausages back in the mix when they would make new sausages

#

gets weird after a while

ionic gulch
# hardy cipher gotcha

the usual xin1 switches with for example 4 inputs, 1 output and a value to select the input will still run code for all 4 inputs. the new switch used for the 4 output images will set one node to mute which prevents unneeded code from being executed. you also couldn't turn off some things, because missing inputs throw errors that stop the workflow. so for that use case there was no solution other than cutting wires (that were rerouted to the menu section). now as it mutes the vae-decoder you get no error and everything "left" of the muted node is not run.
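The difference between the two kinds of switch can be illustrated with a small sketch. This is not ComfyUI's execution model or node API, just plain Python with hypothetical names (`make_branch`, `eager_switch`, `lazy_switch`) showing "run everything, then pick one" versus "only run what is selected".

```python
from typing import Callable

# Log of which branches actually ran.
calls: list[str] = []


def make_branch(name: str) -> Callable[[], str]:
    """Return a stand-in for an expensive workflow branch that logs when it runs."""
    def run() -> str:
        calls.append(name)
        return f"image from {name}"
    return run


def eager_switch(branches: list[Callable[[], str]], select: int) -> str:
    """Like an x-in-1 switch: every input is computed, then one is picked."""
    results = [branch() for branch in branches]
    return results[select]


def lazy_switch(branches: list[Callable[[], str]], select: int) -> str:
    """Like muting the unused outputs: only the selected branch runs at all."""
    return branches[select]()
```

With four branches, `eager_switch` appends four names to `calls` per run while `lazy_switch` appends one, which is the saving described above.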

hardy cipher
#

ugh, that missing inputs thing is annoying

woeful patio
#

is there a qr code control net for xl?

crisp owl
supple knot
#

@hardy cipher what's the root of the frog people?

hardy cipher
#

and it just went from there

supple knot
#

it looks like Wes Anderson, mixed with like Vogue Magazine

hardy cipher
#

the patriarch

supple knot
#

Gen 1 ha

#

you think the size of the photos matters for this process?

hardy cipher
hardy cipher
#

that's one of the reasons things get cut off with ipadapter I believe

#

I just try to make sure they have similar pixel counts though

supple knot
#

So I worked on Giant Spock for way longer than I wanted to, even editing in photoshop and rerunning 10 total maybe

hardy cipher
#

lol. well got em at least

supple knot
#

Every generation he would lose the Vulcan salute more

#

Now he's just saying Hi

vernal spire
noble shoal
vernal spire
#

honestly if 2005 fashion 80 foot tall spock told me to stop while i was on the freeway, i would

hardy cipher
#

what about these guys?

noble shoal
hardy cipher
#

wet meat gang

supple knot
#

Those are MeatBoys I think they fought the Warriors

hardy cipher
#

they look so comfortable in their own meat skin

#

no worries or strife

#

this guy is a different story

supple knot
hardy cipher
hardy cipher
#

this will create top tier fashion

vernal spire
hardy cipher
#

chipper tune

#

brought them from circa 1978 to circa 1984

supple knot
#

Woah

#

give them some toad

worthy orbit
supple knot
#

Ha, its to generate

hardy cipher
#

they're hiding, one moment

supple knot
#

As cast of SNL ? kinda

rigid bobcat
hardy cipher
#

alright, still trying to remember what folder they're in. in the meantime tried to bring some kool keith fashion into the mix

#

aged them a bit

vernal spire
hardy cipher
supple knot
hardy cipher
#

lol, this is genuinely perplexing. I tucked that whole folder of images into some weird corner somewhere and can't find it. I'll find it as soon as I stop looking. but that's unacceptable

supple knot
#

its all good

hardy cipher
#

but I must find them

#

to feel at peace

supple knot
#

its the folder labeled frogs

#

isn't it in your comfy outputs folder too anyhow

hardy cipher
#

nah, it was something smart like "new folder 3"

supple knot
#

what about looking at the thumb nails

#

Prompt : Gang in Christmas Sweaters

hardy cipher
#

Internet decided to go down right in the middle of my search

supple knot
hardy cipher
supple knot
hardy cipher
#

almost frog

#

if you squint your eyes it's frog

supple knot
#

FrogoWarriors

hardy cipher
supple knot
#

We're from France

hardy cipher
#

those are body paint vests

supple knot
#

Because you cant lose a body paint vest

hardy cipher
#

we've taken a wrong turn

#

turning back

supple knot
#

yeah Im done before I unearth a demon

hardy cipher
#

think I did already

#

I was just mixing them with spinoff rainbow goblins

#

their shoe game is on point though

supple knot
#

if they can get those goblin faces ,is that them too

hardy cipher
#

you want the goblin faces?

supple knot
#

Yes

hardy cipher
#

these are my og rainbow goblins

#

a bit of variation in them

supple knot
#

They're really good, all of them. A+

hardy cipher
#

that time they found the end of the rainbow

#

these ones are so grimy

#

little skittles colored street urchins

#

no demons here

supple knot
#

3rd maybe demon, ha

hardy cipher
#

might have to work this one in too

supple knot
#

Give them Pepsi