Using -I with a PNG always gives an essentially identical image | Invoke | Page 1

upper estuary Nov 5, 2022, 6:21 PM

#

I've tried many values for --strength and -C and these values have great effect when using JPG images, but no luck with PNG images. Most of the PNGs are exports from either Preview (macOS) or Pixelmator Pro. What could be going wrong?

latent pebble Nov 5, 2022, 7:08 PM

#

Can you share a command line that causes this problem?

#

And full output?

plucky coral Nov 5, 2022, 7:42 PM

#

This is really interesting. Can you upload some of the original images that are affected? I’d like to have a look.

#

I wonder if the color mode has something to do with it

#

We have had a few other reports of images not changing much if at all with img2img. Need to figure out why.

latent pebble Nov 5, 2022, 7:45 PM

#

It sounds like bit depth. I could try to reproduce here.

#

Nope, works fine for me with 16bpp.

loud kraken Nov 6, 2022, 3:39 PM

#

RGB vs RGBA maybe ?

latent pebble Nov 6, 2022, 4:07 PM

#

I'll test in a few.

#

It would be nice to have images and parameters from @upper estuary to test elsewhere.

#

@loud kraken That was a great idea. I used an image with transparency (254) and I got the same input as output!

#

Opaque regions don't seem to get drawn on. It looks like it might be a gradient. @plucky coral Is this the new inpainting at work?

latent pebble Nov 6, 2022, 4:43 PM

#

Here's my init image.

#

"a boat"

plucky coral Nov 6, 2022, 7:36 PM

#

loud kraken RGB vs RGBA maybe ?

Yes this is what I meant by color mode. There are handling changes if there is any transparency

plucky coral Nov 6, 2022, 7:36 PM

#

latent pebble "a boat"

Yep, thanks

#

@hot pelican we may need to rethink implicitly masking based on alpha channel…

#

@upper estuary Can you please share a couple of the initial images you used which had this issue?

#

And the terminal output, including the command

upper estuary Nov 7, 2022, 12:27 AM

#

invoke> high resolution digital photograph of a phone booth inside a subway station, -W 512 -H 512 --fit -I ./testimage.png -n 9 --grid --strength 0.99 -C 5

#

Sorry was afk for a while but back now... uploaded a sample input, sample output, and console output with prompt.

📎 output.txt

#

$ identify testimage.png
testimage.png PNG 512x512 512x512+0+0 8-bit sRGB 100757B 0.000u 0:00.001

#

is 8-bit ok? and sRGB?

plucky coral Nov 7, 2022, 12:37 AM

#

>> Initial image has transparent areas. Will inpaint in these regions.

#

are you comfortable doing some quick python REPL-ing?

upper estuary Nov 7, 2022, 12:37 AM

#

I'm a n00b with sd / invokeai but I think the issue is not just with inpainting, although not being able to do inpainting is a natural fallout from using jpg as a workaround.

#

sure

#

if you tell me what to do 😉

plucky coral Nov 7, 2022, 12:38 AM

#

ok just a sec

upper estuary Nov 7, 2022, 12:38 AM

#

not super up on python but we'll try

plucky coral Nov 7, 2022, 12:51 AM

#

no worries

#

📎 check_transparency.py

#

save this script

#

sorry - first - the goal here is to check the transparency of the image

#

bc if the image has transparency, it will not process corrrectly

#

this script just counts the pixels with transparency

#

so save it int he same folder as one of the init images

#

edit the 'test.png' to hav ethe name of the init image file

#

activate the conda environment then run python check_transparency.py - it will tell you the # of transparent pixels

upper estuary Nov 7, 2022, 12:54 AM

#

$ python3 check-image.py
12

plucky coral Nov 7, 2022, 12:54 AM

#

ok

upper estuary Nov 7, 2022, 12:54 AM

#

oh that was without conda environment, if it matters. I had to install pillow which I didn't have.

plucky coral Nov 7, 2022, 12:54 AM

#

all good

#

so whats happening is img2img is seeing that very small transparent area (12 pixels) and treating it as a mask

#

it is running img2img but ONLY on those 12 pixels

#

so of course there is no change in the result (technically there should be about 12 pixels different)

upper estuary Nov 7, 2022, 12:56 AM

#

hah

plucky coral Nov 7, 2022, 12:56 AM

#

so to clarify - the issue is that certain images don't change when you do img2img, right?

upper estuary Nov 7, 2022, 12:56 AM

#

yes

latent pebble Nov 7, 2022, 12:57 AM

#

I think this calls for a better error message rather than a change in InvokeAI behavior.

upper estuary Nov 7, 2022, 12:57 AM

#

but just to verify my assumption -- using the invokeai > prompt with -I means I am doing img2img corrrect?

plucky coral Nov 7, 2022, 12:57 AM

#

latent pebble I think this calls for a better error message rather than a change in InvokeAI b...

yes

plucky coral Nov 7, 2022, 12:57 AM

#

upper estuary but just to verify my assumption -- using the invokeai > prompt with -I means I ...

correct

#

built-in to img2img is inpainting over transparent areas.

latent pebble Nov 7, 2022, 12:57 AM

#

"Your image has transparent pixels, so inpainting will be used."

plucky coral Nov 7, 2022, 12:57 AM

#

>> Initial image has transparent areas. Will inpaint in these regions.

upper estuary Nov 7, 2022, 12:58 AM

#

I want to move to strictly command line (as in, unix shell command line) but not quite there yet.

latent pebble Nov 7, 2022, 12:58 AM

#

Yes... maybe something like inpainting only?

plucky coral Nov 7, 2022, 12:58 AM

#

this message is already in the output, but subtle

plucky coral Nov 7, 2022, 12:58 AM

#

latent pebble Yes... maybe something like `inpainting only`?

the code needs a refactor for this

#

unfortunately

latent pebble Nov 7, 2022, 12:58 AM

#

@upper estuary Is there an error message that would have made you immediately stop and understand what was going on?

upper estuary Nov 7, 2022, 12:59 AM

#

The messages mostly focus on inpainting, and doing give any hint (or I missed it) that the non-inpainting painting will also be, understatement, suboptimal 😄

#

WARNING: Colors underneath the transparent region seem to have been erased.
Inpainting will be suboptimal. Please preserve the colors when making
a transparency mask, or provide mask explicitly using --init_mask (-M).

latent pebble Nov 7, 2022, 1:00 AM

#

So in an ideal world, what's the error message that would have made you check your image for transparency first?

upper estuary Nov 7, 2022, 1:00 AM

#

btw while trying to make a simpler test image I found a way to make an image that crashes the process.

plucky coral Nov 7, 2022, 1:01 AM

#

I don't think there is an easy way around this, unless we drop support for inpainting with a single init image based on transparency and instead always require a mask image for inpainting.

latent pebble Nov 7, 2022, 1:02 AM

#

NOTE: Your initial image has transparency, and those transparent regions will be inpainted. If inpainting isn't your intent, please make sure you use an image without transparency.

#

Then the WARNING that's there now...?

upper estuary Nov 7, 2022, 1:04 AM

#

as a user (of image manipulation programs in general) I've always found masks super confusing fwiw. (probably because I haven't worked with them enough) But maybe that's just me. There's an inversion of perspective problem when communicating about them (what is the part you want to mask versus the part you want to mask out versus the part you want to mask in, to throw around some laymen's terms to characterize the confusion). Documentation often does not make clear what perspective it's taking. So one has to read the docs with a superposition of both meanings and try to glean from context what is meant. Although from my skimming of the invokeai documentation it looks like it was more clear than most.

#

I'll get to your question about messages, let me think about that.

plucky coral Nov 7, 2022, 1:05 AM

#

upper estuary as a user (of image manipulation programs in general) I've always found masks su...

this is great feedback, thank you

#

@brittle wyvern

upper estuary Nov 7, 2022, 1:08 AM

#

"Because your input image contains some transparent pixels, all non-transparent pixels will be passed through unchanged to your output images, remaining identical to the input image."

#

^^ that would make it clear

plucky coral Nov 7, 2022, 1:08 AM

#

summary:

Our img2img code checks for transparency in the init image and if it finds some, it does inpainting on those areas. In this situation, 12 transparent pixels (not detectable to the user when viewing the image) triggered inpaiting on those 12 pixels, but img2img was expected on the whole image. The result is an img2img result with no visible changes.
Our CLI warns the user but it's pretty subtle and easy to miss.
The user provides an init image and expects only img2img, there is no clear indication that inpainting is going to occur, because our code tries to infer what to do based on the init image.

How can we fix this UX? One option is to simply add an additional flag "--inpaint" which is needed to do any inpainting operation.

#

I've tagged hipsterusername for his UX perspective here

#

@glass dawn also could use your feedback - please read the above summary for the situation and advise how we can provide a better experience

upper estuary Nov 7, 2022, 1:10 AM

#

or... "Transparent pixels found in input. These will be inpainted and no other pixels will be changed."

plucky coral Nov 7, 2022, 1:11 AM

#

Side-note: On the UI, we will be changing the behaviour so that if you give it an initial image, unless you are in the inpainting tab, it never inpaints

latent pebble Nov 7, 2022, 1:11 AM

#

I like requiring --inpaint to force it.

plucky coral Nov 7, 2022, 1:12 AM

#

latent pebble I like requiring --inpaint to force it.

on the UI, this is implied by the tab you are on

latent pebble Nov 7, 2022, 1:12 AM

#

Yes.

upper estuary Nov 7, 2022, 1:12 AM

#

What I brought to this was the misimpression that painting and inpainting could be done at the same time. I admit a closer reading of the docs probably would have fixed this.

#

I've avoided the UI. I have a headless setup and no ability to look at the UI.

plucky coral Nov 7, 2022, 1:13 AM

#

It's web-based, you can access it from any machine with a web browser on your local network

#

But no worries if you wanna stick to CLI - we support that fully also

upper estuary Nov 7, 2022, 1:14 AM

#

yes ok I could do that.. I want to use CLI (eventually break out of the invokeai> and down to shell) so I can script things.

plucky coral Nov 7, 2022, 1:14 AM

#

yeah, you'll need to do so to do serious work

latent pebble Nov 7, 2022, 1:15 AM

#

That'll probably be much easier when the node-based stuff is done.

upper estuary Nov 7, 2022, 1:16 AM

#

I think adding --inpaint would not help break through my initial misimpression about the feature. I am guessing only from context here that existing -I does not have '--inpaint' as its long form equivalent.

plucky coral Nov 7, 2022, 1:16 AM

#

Yeah, once we have the node stuff in, you can write a graph in JSON/python dict and much of the complicated workflows are suddenly automate-able

upper estuary Nov 7, 2022, 1:16 AM

#

-I must be --image I guess

#

or --input

plucky coral Nov 7, 2022, 1:16 AM

#

-I is initial image

#

yep

upper estuary Nov 7, 2022, 1:17 AM

#

ok off topic but that crasher

#

$ convert xc:none -size 512x512 -geometry 512x512 -fill red -draw 'rectangle 128,128,384,384' +repage red-384-box-512-imagemagick-repage.png

plucky coral Nov 7, 2022, 1:17 AM

#

yes please haha

upper estuary Nov 7, 2022, 1:18 AM

#

that was my second attempt, first one did not have the +repage

#

convert xc:none -size 512x512 -geometry 512x512 -fill red -draw 'rectangle 128,128,384,384' red-384-box-512-imagemagick.png

#

let's just not mention +repage that was a random option I tried.

#

above command creates this image which causes a crash

latent pebble Nov 7, 2022, 1:20 AM

#

So what color mode is that?

upper estuary Nov 7, 2022, 1:20 AM

#

imagemagick identify says sRGB

#

$ identify red-384-box-512-imagemagick.png
red-384-box-512-imagemagick.png PNG 512x512 512x512+0+0 8-bit sRGB 527B 0.000u 0:00.001

#

I'm not a graphics expert

latent pebble Nov 7, 2022, 1:21 AM

#

That's indexed color.

#

Images have to be RGB or RGBA for InvokeAI.

#

That one is 3 colors - transparent, red, and white.

upper estuary Nov 7, 2022, 1:22 AM

#

oooh, ouch

#

good to know

plucky coral Nov 7, 2022, 1:23 AM

#

https://github.com/invoke-ai/InvokeAI/issues/1401 this is for your original issue

GitHub

[enhancement]: Make inpainting on the CLI explicit with a flag · Is...

Is there an existing issue for this? I have searched the existing issues Contact Details No response What should this feature add? Currently, inpainting is triggered when a check of the user&#3...

plucky coral Nov 7, 2022, 1:24 AM

#

latent pebble That's indexed color.

OK, so probably we need to convert images to RGB(A) before we do anything

#

We should be able to handle that image just fine.

#

Or at least not catastrophically fail

upper estuary Nov 7, 2022, 1:25 AM

#

So I think I know what I need to know now! Thank you!

#

looking at that page..

#

quick writeup!

#

terminal output is already pretty crowded and you may still miss it

#

this yes

#

but... given the results were not as expected, I did go back and scour the terminal output. So having a bit more info (or some info more targeted at this failure mode) would have helped despite the crowded terminal output.

plucky coral Nov 7, 2022, 1:28 AM

#

Still, I think it brings up a good issue and we cna do better

brittle wyvern Nov 7, 2022, 1:30 AM

#

I think the simplest is as has been suggested above

#

Make it explicit - -inpaint for inpainting

#

Rather than bundling functionality under the same arg

plucky coral Nov 7, 2022, 1:30 AM

#

Crash issue report: https://github.com/invoke-ai/InvokeAI/issues/1402

GitHub

[bug]: Non-RGB(A) init images crash InvokeAI · Issue #1402 · invoke...

Is there an existing issue for this? I have searched the existing issues OS macOS GPU mps VRAM No response What happened? Non-RGB(A) init images (e.g. indexed color images) crash InvokeAI. We shoul...

upper estuary Nov 7, 2022, 1:31 AM

#

Doing better, I don't know. You might give the world an overdose of awesome.

#

on second thought yeah if it was a clearly different mode, yes that would break through my initial confusion. I didn't know it was either-or. I thought you could enhance/alter the given starting content (resulting in changes to that content, the non-transparent pixels) while simultaneously inpainting in one operation.

plucky coral Nov 7, 2022, 1:33 AM

#

brittle wyvern Rather than bundling functionality under the same arg

Thanks, can you add any comments to the GH issue pls

#

understandable

#

in order to do that, the code would probably need to be a lot more complicated 🙂

upper estuary Nov 7, 2022, 1:35 AM

#

or have a hack that adds some random noise pattern replacing transparency

#

maybe... if that would work

#

maybe the existing algorithm will just have a free hand with random patterns, to act as in inpainting, while being relatively more restricted by the less-random pixel regions, resulting in a hybrid.

#

but I'm out of my depth

#

ok thanks again! I need to create a github account for this.

hot pelican Nov 7, 2022, 1:39 AM

#

upper estuary or... "Transparent pixels found in input. These will be inpainted and no other p...

I'll change the message in the CLI to alert the user that inpainting is occurring, and I'll give them an option to prevent inpainting from happening. I shouldn't change the UI more dramatically than that, such as requiring user to ask for inpainting when it's been happening implicitly all along.

upper estuary Nov 7, 2022, 1:41 AM

#

Thanks, I respect making the smallest change that could possibly work.

brittle wyvern Nov 7, 2022, 1:43 AM

#

hot pelican I'll change the message in the CLI to alert the user that inpainting is occurrin...

I think we ought to make the bigger change

#

My suggestion would be to have users receive a descriptive error in passing a transparent image to img2img, and point them to the new command. It may be friction for older users, but I imagine we will have far more new users than old.

plucky coral Nov 7, 2022, 1:43 AM

#

hot pelican I'll change the message in the CLI to alert the user that inpainting is occurrin...

I think we should think about it a bit before taking action... the only downside to making inpainting explicit is that it the users now need to explicitly request it. the upside is that what invokeai does is much clearer

#

There was another user who had the same issue but at the time I didn;t recognize it.

#

@inner veldt had this same problem on one of her images

#

we couldn't figure it out and chalked it up to SD being weird but I'm 99% sure this was the cause

#

(emma sorry to tag you - we are talking about when you had that image that didnt change whne you did img2img)

inner veldt Nov 7, 2022, 1:49 AM

#

plucky coral (emma sorry to tag you - we are talking about when you had that image that didnt...

no probs

glass dawn Nov 7, 2022, 4:36 AM

#

plucky coral summary: - Our img2img code checks for transparency in the init image and if it ...

Agree

#

And if you just fill the transparent area, if it is obviously small? Remove transparency. I had problems with img2img when the images had transparency by mistake and only one row of transparent pixels.

plucky coral Nov 7, 2022, 5:20 AM

#

glass dawn And if you just fill the transparent area, if it is obviously small? Remove tran...

i think we have to be careful bc making this decision for the user is a bit too strong

#

if we cater to enthusiasts, better to do exactly what they want

#

so explicitly requiring a flag to enable inpainting feels good for this

hot pelican Nov 8, 2022, 7:20 AM

#

My approach would be to print a warning message about inpainting occurring, and inform the user that they can ignore the transparent pixels using a --no-inpaint option. Otherwise we change well established behavior and documentation.

plucky coral Nov 8, 2022, 7:36 AM

#

hot pelican My approach would be to print a warning message about inpainting occurring, and ...

Specifying an init image clearly means we want img2img, but its not clear that we want inpaint. In fact we way not want inpaint, but we get it anyways, because there is a single transparent pixel in the init image that we didn't know about.

Changing the documentation and behaviour isn't a big deal if it leads to an improved user experience (we aren't trying to satisfy the documentation or necessarily retain a "tradition" of behaviour, we are trying to provide a useful tool). With the vast and complicated system we offer, I think it makes sense to require the user to explicitly request what the want done.

This is certainly the approach the UI will take; that is, init images will be stripped of the alpha channel before being sent to be img2img'd or inpainting, to ensure the user gets what they ask for.

hot pelican Nov 8, 2022, 7:55 AM

#

The node-based CLI is going to have a new command syntax which explicitly invokes txt2img, face restoration, etc. I will defer changing the existing CLI's behavior for now.

#Using -I with a PNG always gives an essentially identical image