#✨|sdxl

pallid sail
#

47 steps base, 3 steps refiner. Watermark is still visible. Will try and run the refiner on only 1 step before trying that base 1.0, vae 0.9 model

trim orbit
#

loading the lora just affects everything like i would expect now. very nice

midnight shuttle
thick goblet
#

Rtx 2060 6gb
32gb ram
i5 8400

When I switch to the refiner model it makes my system crash

I did use xformers and medvram

What else can be fixed?

trim orbit
turbid garnet
#

AMA - I developed an LLM, what questions should I ask to test it out?

boreal bough
midnight shuttle
# pallid sail

Wow, that's far from invisible. And it covers the whole image? They need a better watermarking method.

visual glade
#

yeah diffusers has that watermark enabled by default and I think it's not implemented correctly

#

it should not be that visible

dire rampart
#

yes I would also use the automated feature available in auto1111, kosmos2 offers solutions for those elements not typically found in the other datasets simply because the original human organisers didn't include them, yet basically with the built in feature we will easily find the poses, genders, clothing etc to make the images appear as we deem/

thick goblet
deft coral
#

i give up on SDXL im going back to 1.5

thick goblet
#

Or I run 0.9?

visual glade
#

the most efficient way to run SDXL right now is ComfyUI

thick goblet
trim orbit
dire rampart
boreal bough
boreal bough
thick goblet
boreal bough
#

free

#

paid gives a lot more

visual glade
#

comfyui can do SDXL on free colab

native moon
#

hey does someone know how to change the sampler in comfy ui?

visual glade
#

on the ksampler node

native moon
midnight shuttle
#

I understand the need to watermark to prevent AI training on itself and maybe even the need to embed information about who generated the image in case it needs to be traced but it needs to be less visible.

thick goblet
#

How long would it run before it disconnects on console? On free

#

Free one

native moon
visual glade
#

you can download samplers?

trim orbit
#

i really like how this lora is working like i've loaded a full model now

native moon
#

yes

visual glade
#

how do you download a sampler?

pallid sail
heady vale
thick goblet
#

Also waiting for inpainting model

civic forge
#

Yes I did, the answer is YES, but refiner model needs loras specifically trained to work with It so if you want to use loras with SDXL and the refiner model you'll need a pair of them, one lora trained for the base model and the same lora trained for the refiner model, and load both (one in the base and other in the refiner) in order to get refined images that keep the lora. Not very practical yet.

boreal bough
visual glade
spark pewter
#

is there code for training refiner lora? Kohya doesn't seem to support it

native moon
boreal bough
pallid sail
visual glade
dire rampart
visual glade
#

you can use it with all samplers

trim orbit
boreal bough
#

comfy has the patience of a monk XD

turbid garnet
#

AMA - I developed an LLM, what questions should I ask to test it out?

turbid garnet
# dire rampart if I have two cups of water, drink half a cup, pour a new glass, drink another h...

If you start with two cups of water, each containing 100ml, you have a total of 200ml of water.

After drinking half a cup (50ml), you are left with 150ml of water.

When you pour a new glass, you add another 100ml of water, making the total amount of water 250ml.

After drinking another half cup (50ml), you are left with 200ml of water.

Therefore, you have 200ml of water remaining, which is equivalent to 2 cups.

boreal bough
turbid garnet
#

The ANSWER it gave back

native moon
dire rampart
boreal bough
#

if it gets the HP right, then it passes my LLM test XD

visual glade
boreal bough
#

so far only gpt-4 succeeds

trim orbit
#

LLM spam is like meme spam. go to another channel with it

heady vale
native moon
turbid garnet
#

anyone else wants to ask a question? me asking would be a bit too biased

wet rock
#

why are my images like this

#

😭

dire rampart
#

is your resolution low?

midnight shuttle
alpine pine
dire rampart
#

its supposed to be 1024 output

wet rock
#

1024x1024

#

are you guys using comfyUI?

dire rampart
#

for sure

cold mica
#

SDXL 1.0 Super Stage VOD is up!

https://www.youtube.com/watch?v=MiH1yBrwqlc

If you missed our live Discord stage, fear not! Here is all the scoop from our SDXL 1.0 stage with our @Stability_AI Applied ML team, Emad (GPU Emperor) and our host with the most Amli! A wild "cam on" appears 👀

Please enjoy.

🌟 Stage Summary by GPT🌟

🎛️ Trained on newer architecture for better control
🔍 Dual CLIP encoders for improved text...

boreal bough
#

I see those colored lines

wet rock
#

downloading comfyui rn

dire rampart
#

if you are using comfyui try their discord and drag some 1.0 images over they are likely sharing nodemaps

wet rock
#

i was using that automatic1111 i believe

dire rampart
#

you need the latest auto1111, right click on the area next to the files and open powershell then type git pull if you have git installed

wet rock
#

i pulled. im on 1.5.1

#

xformers running as well

dire rampart
#

there is also SDnext, they have the same extensions as Auto

#

but it runs 1.0 and everything

wet rock
#

what's the best UI option?

dire rampart
#

command line lol

shy kelp
#

How do I view the parameters people used to make the images they upload here? Does discord strip the exif data?

wet rock
dire rampart
#

auto has most extension support and broad userbase thusly more community networking but comfy is basically set to jet and it even includes all these special features like adding noise

dire rampart
wet rock
#

nice breakdown

#

ill try comfyui and play around. thank you!

boreal bough
native moon
#

what am i doing wrong why do my images look like this ?

boreal bough
native moon
heady vale
native moon
#

thanks

tender timber
#

Yes, you're correct. My hope is that ROCm (whatever the new version is supposed to be) closes the gap and I hope Intel ARC makes moves too. With AI being the shiny new thing, all parties are vested in making their cards better, or so one would hope. I'm not interested in other AI tasks nor am Interested in video games so I'll wait a few months, see how the software goes and then choose.

native moon
boreal bough
# native moon why did u remove yours

there were a few confusing dimensions included, that aren't beginner friendly and may cause more confusion than solutions. pure fires answer is simple and correct

hard fractal
#

Okay, I've checked.

We have indeed included the unet with SDXL.

Maybe we should @everyone

ionic dragon
#

Can we inpaint in 1.0

dire rampart
#

Someone likely has an inpaint model on civitai, check the huggingface modelcard to see if they mention inpainting

#

they do in the other releases there is some alterations to the modelling method

#

and if you can't try matching a 2.1 model for small details like eyes they are still contextually aware of awesome imagery

grizzled parcel
#

just tried SDXL base with transformers on my pc. the vram usage when inferencing is 9GB without xformers. however when it is decoding, the vram increased to 14GB. i tried base+refiner and the result is still the same (base 8GB, use refiner 10GB, but then when decoding suddenly 14GB). does anybody know why? thanks!
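One possible explanation for the jump at decode time, offered as a rough estimate rather than a measurement: the VAE decoder works on full-resolution feature maps, which are far larger than the latents the sampler works on. The sketch below assumes an 8x-downsampling VAE with 4 latent channels, a hypothetical 128-channel feature map at full resolution, and fp32 values in the decoder. None of these figures come from the post above.

```python
def activation_bytes(height: int, width: int, channels: int, bytes_per_value: int) -> int:
    """Size in bytes of one dense activation tensor (batch size 1)."""
    return height * width * channels * bytes_per_value


def to_mib(num_bytes: int) -> float:
    """Convert bytes to mebibytes."""
    return num_bytes / (1024 * 1024)


# Latent for a 1024x1024 image, assuming 8x downsampling, 4 channels, fp16.
latent_bytes = activation_bytes(128, 128, 4, 2)

# One full-resolution decoder feature map, assuming 128 channels in fp32.
# The channel count is an illustrative assumption, not a measured value.
feature_map_bytes = activation_bytes(1024, 1024, 128, 4)

latent_mib = to_mib(latent_bytes)            # 0.125 MiB
feature_map_mib = to_mib(feature_map_bytes)  # 512.0 MiB
```

A decoder keeps several such tensors alive at once, so a multi-gigabyte spike during decoding would be plausible under these assumptions. Tiled or sliced VAE decoding trades speed for a smaller peak.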

boreal bough
nimble heart
dire rampart
#

are you sure it's the vram you are checking on? much smaller cards can run 1.0

boreal bough
native moon
dire rampart
#

They said 16gb of system RAM was sufficient they might limit requirements in those parameters

dire rampart
#

though how they do that with a 12 GB model is another question, they reduced size to 6...

tribal magnet
#

the ETA in A1111 is so janky. ;_;

boreal bough
dire rampart
#

my refiner seems to quadruple generation time

boreal bough
#

the others had nothing to do with generation, they were for target settings

dire rampart
#

but like they say it isn't required

tender timber
trim orbit
#

its like i'm really there

#

at the superstage

nimble heart
dire rampart
nimble heart
#

but yea was hoping to see 2k inference times magically like 30% faster

boreal bough
boreal bough
# native moon 😦
| 1:1           | 2:3           | 3:2           | 128:85       | 16:9          | 148:83       | 156:71       |
|---------------|---------------|---------------|--------------|---------------|--------------|--------------|
| 512x512       | 896x1344      | 1152x768      | 1024x680     | 1280x720      | 1184x664     | 1248x568     |
| 768x768       | 1024x1536     | 1536x1024     |              | 1920x1080     |              |              |
| **1024x1024** | 2048x3072     | 3072x2048     |              | 3840x2160     |              |              |
| 1280x1280     |               |               |              |               |              |              |
| 2048x2048     |               |               |              |               |              |              |

if you want pretty graph XD
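The column headers in the table above are just each resolution reduced to lowest terms. A small sketch of how to check that for any width and height. The function names are made up for illustration.

```python
from math import gcd


def aspect_ratio(width: int, height: int) -> tuple[int, int]:
    """Reduce width:height to lowest terms."""
    divisor = gcd(width, height)
    return (width // divisor, height // divisor)


def megapixels(width: int, height: int) -> float:
    """Total pixel count in millions, handy for comparing against 1024x1024."""
    return width * height / 1_000_000


# A few entries from the table above.
ratios = {
    (1024, 680): aspect_ratio(1024, 680),
    (1184, 664): aspect_ratio(1184, 664),
    (1248, 568): aspect_ratio(1248, 568),
    (896, 1344): aspect_ratio(896, 1344),
}
```

The odd-looking ratios such as 128:85 come from rounding each side to a multiple of 8 pixels, which is why they do not reduce to a familiar ratio.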

nimble heart
midnight shuttle
#

I hope Intel and AMD support AI better on consumer cards so Nvidia has competition. But if they really want to make money they will restrict it to datacenter cards.

grizzled parcel
nimble heart
#

I did see some people running one of those massive like 60B language models using a custom patched 4 bit mode and some other shit they hacked together on a 7900 XTX but I don't have the brain power for that

tender timber
nimble heart
#

yea the house is more important than a toy made of electric sand metal any day

glad flint
#

Update for any it/s minmaxxers that are curious:

just hit a new record of 1.5it/s and 29 second generation times for 1024x1024 on my RTX 3060 (12gb vram) and 16gb system ram

I was getting 1.3 it/s before

Updating Nvidia drivers gave me a small boost
Using the nightly PyTorch version of ComfyUI gave me another small boost, my favorite thing about it is that the ram total lockup periods are shorter and it seems to hover around max ram use instead of hitting it

vital wolf
#

Do you just select the value in lora loader or have to put the lora in the prompt as well for it to work?

glad flint
#

I'm doing base 20 steps euler, refiner 10 steps euler, no tiling vae
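As a back-of-the-envelope check on the numbers reported above (1.3 it/s before, 1.5 it/s now, 29 seconds total, 20 base steps plus 10 refiner steps), here is a short sketch. It assumes the base and refiner run at the same it/s, which the posts do not state.

```python
def sampling_seconds(steps: int, iterations_per_second: float) -> float:
    """Time spent in the sampler alone, ignoring model loading and VAE decode."""
    return steps / iterations_per_second


total_steps = 20 + 10  # base steps + refiner steps, as reported above

before = sampling_seconds(total_steps, 1.3)
after = sampling_seconds(total_steps, 1.5)

# Whatever is left of the reported 29 s would be non-sampling overhead
# (model swaps, VAE decode, saving), under the same-speed assumption.
overhead = 29 - after
```

Under that assumption, sampling takes about 20 seconds at 1.5 it/s, so roughly 9 of the 29 seconds would be spent outside the sampler.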

tender timber
nimble heart
#

wonder what Titans go for on ebay now. Might be a really cheap 24 gigs of vram

thick goblet
#

so comfyui says it will work on rtx 2060 6gb version with xl 1.0 ? with this

nimble heart
#

damn

#

stupid college researchers and their 64 bit compute

#

buying up all the titans

thick goblet
nimble heart
#

Try it and find out

thick goblet
#

not sure

#

it is glitching with black screen then coming back

nimble heart
#

if you're on windows open task manager and watch your vram graph

wet rock
native moon
#

what resolutions work with 1.5 models?

midnight shuttle
#

512x512

#

SDXL is the only one trained on multiple resolutions.

heady vale
midnight shuttle
#

Newer drivers are usually better. The rumors about some drivers being "slow" were due to not understanding new memory management features.

heady vale
#

err...0.05

native moon
shrewd barn
#

1.5 models do work at non-512, it's just worse the farther you get from that size

midnight shuttle
wraith apex
shrewd barn
#

some of the community models are better at higher resolutions than the base 1.5 model

native moon
#

ok thank u

glad flint
#

I'm getting far fewer 170 second generations where the computer gets locked up and getting around 30 seconds pretty consistently now, which makes this actually fun

boreal bough
glad flint
glad flint
native moon
#

what is clip skip?

heady vale
native moon
#

how can i do that in comfy ui and why would i do that?

heady vale
#

I only started using comfy yesterday so its as new to me also

#

some people say the results are better

vast galleon
vast galleon
heady vale
native moon
vast galleon
#

is it possible to hide the noodles?

thorny frost
heady vale
vast galleon
#

kinda like where every input has an address and then you can link inputs from different nodes into one and make a custom control box?

thick goblet
#

Launching Web UI with arguments: --xformers --lowvram --no-half --disable-nan-check

i launched with this should i be good to go with 1024 x 1024 ?
on rtx 2060 6gb

midnight shuttle
#

Remove --disable-nan-check. NaN is an error and you want to know about it. Add --no-half-vae because SDXL VAE doesn't work with fp16.

#

Expect it to be very slow.
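The flag advice above can be sketched as a small helper that rewrites an argument list. The function name is hypothetical. It only manipulates strings, so it shows which flags end up on the command line and does not launch anything. Where the resulting string goes (for example a `COMMANDLINE_ARGS` line in a launcher script) depends on your install.

```python
def apply_suggested_flags(args: list[str]) -> list[str]:
    """Drop --disable-nan-check and make sure --no-half-vae is present."""
    # Removing --disable-nan-check lets NaN errors surface instead of
    # silently producing black images.
    result = [arg for arg in args if arg != "--disable-nan-check"]
    # Keep the VAE in full precision; added only if not already there.
    if "--no-half-vae" not in result:
        result.append("--no-half-vae")
    return result


original = ["--xformers", "--lowvram", "--no-half", "--disable-nan-check"]
suggested = apply_suggested_flags(original)
command_line = " ".join(suggested)
```

Since `--no-half` is already in the list, the extra `--no-half-vae` may be redundant. It is kept here so the result stays correct if `--no-half` is later removed.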

thick goblet
midnight shuttle
#

Actually --no-half may also apply to VAE already.

midnight shuttle
heady vale
midnight shuttle
#

If it works it will be very slow. But you can try it.

thick goblet
midnight shuttle
#

If it gives you black screen image that was because of --disable-nan-check. So remove it and then read the error message.

#

Why not just try it? Takes 1 minute to test.

thick goblet
#

what u prefer

#

and medvram or lowvram

midnight shuttle
thick goblet
#

i see

midnight shuttle
#

I don't know. Takes 1 minute to try it or 1 hour to ask every question.

thick goblet
#

ok launching ill see if it goes to 1024

midnight shuttle
#

If you get error show the exact error.

vast galleon
#

Interesting fail. The last thing you want to see walking home at night is a faun looking at you with his hand down his pants.

pallid sail
#

Managed to get rid of the watermark speckling in diffusers but I think I've run into the interlacing effect that I think I saw mentioned earlier

shrewd barn
pallid sail
#

That is with that model. This is with the new base 1.0, 0.9 vae model in the repo?

glad flint
#

I'm having the same problem but haven't bothered fixing yet

thick goblet
shrewd barn
#

correct, that model should fix it. I'm using comfy but I ran tests of the two models and the only difference is those chromatic lines in the vae decoded result

midnight shuttle
#

2 minutes is not bad for that card.

thick goblet
#

ye

midnight shuttle
#

Need 16 GB VRAM to run full speed SDXL in A1111.

pallid sail
thick goblet
#

now i guess vram is the way

midnight shuttle
#

VRAM is king for AI. King above all.

#

And right now you need Nvidia. Nvidia is king for AI. No AMD or Intel.

thick goblet
#

but then i have to upgrade power supply too sadcat

midnight shuttle
#

Someday AMD and Intel might be better.

thick goblet
#

while using refiner model is it compulsory for restore faces check ?

shrewd barn
#

this is the difference between the two VAEs, just subtracting one result from the other on a direct image > encode > decode with each VAE, no sampling
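The comparison described above boils down to a per-pixel subtraction of two decoded images. A minimal sketch with plain nested lists standing in for images. The actual encode and decode steps are left out, and the tiny sample images are made up for illustration.

```python
Pixel = tuple[int, int, int]
Image = list[list[Pixel]]


def abs_diff(image_a: Image, image_b: Image) -> Image:
    """Per-channel absolute difference of two same-sized RGB images."""
    if len(image_a) != len(image_b) or any(
        len(row_a) != len(row_b) for row_a, row_b in zip(image_a, image_b)
    ):
        raise ValueError("images must have the same dimensions")
    return [
        [
            tuple(abs(ca - cb) for ca, cb in zip(pixel_a, pixel_b))
            for pixel_a, pixel_b in zip(row_a, row_b)
        ]
        for row_a, row_b in zip(image_a, image_b)
    ]


def max_channel_diff(diff: Image) -> int:
    """Largest single-channel difference anywhere in the image."""
    return max(channel for row in diff for pixel in row for channel in pixel)


# Two made-up 1x2 images standing in for the outputs of the two VAEs.
decoded_a: Image = [[(10, 20, 30), (0, 0, 0)]]
decoded_b: Image = [[(12, 20, 25), (0, 0, 0)]]
difference = abs_diff(decoded_a, decoded_b)
```

Because no sampling is involved, any non-zero values in the difference come from the VAEs themselves.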

midnight shuttle
thick goblet
#

alright

pallid sail
shrewd barn
# thick goblet alright

the restore faces option is a post-process done by a GAN, not part of stable diffusion. You can use it or not, up to you. Sometimes it helps, so try it and see what happens 😉

nimble heart
#

anyone know what causes comfyui to tile the VAE? I've seen "trying again with tiled vae" in stdout once or twice but it usually just slows TF down when I exceed memory instead of tiling unless I go out of my way to use the tiled vae node

vast galleon
midnight shuttle
#

WTF?

thick goblet
#

i love it @midnight shuttle before after . amazed to see refiner model is fast af then base model takes 2 min

midnight shuttle
viscid pelican
#

For posterity, I am a moron. I thought the latest sd_xl_base_1.0_0.9vae.safetensors file was the VAE and was trying to load it along side the baseline 1.0 checkpoint. Hours of my life I'm not getting back.

thick goblet
#

and 19 steps

midnight shuttle
shrewd barn
midnight shuttle
thick goblet
#

pretty amazing yh

#

now hoping inpaint model soon in this week or month

#

and what if u generate images with refiner model ?

midnight shuttle
midnight shuttle
thick goblet
#

alright alright

viscid pelican
#

In my defense, the naming convention doesn't make it clear to a novice.

shrewd barn
#

yeah the file name isn't great, not obvious

thick goblet
midnight shuttle
shrewd barn
#

task manager doesn't do a good job of reporting ML workloads
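On Nvidia cards, one alternative to Task Manager is to ask the driver directly through `nvidia-smi`. A minimal sketch, assuming `nvidia-smi` is on the PATH. The helper names and the sample output string are illustrative, not taken from anyone's machine in this chat.

```python
import subprocess


def parse_memory_csv(output: str) -> list[tuple[int, int]]:
    """Parse 'used, total' lines (in MiB) into one tuple per GPU."""
    rows = []
    for line in output.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        rows.append((used, total))
    return rows


def query_gpu_memory() -> list[tuple[int, int]]:
    """Run nvidia-smi and return (used MiB, total MiB) for each GPU."""
    completed = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_memory_csv(completed.stdout)


# Made-up sample of what the command prints for a single 6 GB card.
sample = parse_memory_csv("5321, 6144\n")
```

Calling `query_gpu_memory()` in a loop while generating gives a rough view of where the peak occurs, for example during VAE decode.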

viscid pelican
#

Again, novice. I had no prior knowledge of what an expected filesize should be. It says XL in the name, could have meant file sizes!

thick goblet
midnight shuttle
#

Often results are good enough without using refiner.

heady vale
#

Likeness isnt quite there but its still pretty damn good for a base model

shrewd barn
#

I don't know what magic comfyui uses, but it somehow keeps both models cached even in 8GB of VRAM

#

A1111 has a long way to go for SDXL support still

midnight shuttle
#

It's some amazing magic since each model is over 6 GB. Maybe one-time swap to system RAM isn't as slow as constant unified memory swaps?

shrewd barn
#

yeah it has to be that

#

but also A1111 is just terrible at memory management in general

thick goblet
#

whats the lowest vram 1.0 xl is supported on?

trim orbit
#

me personally? 16

midnight shuttle
shrewd barn
#

8GB definitely, I think 6GB is possible?

shrewd barn
#

might need lowvram option for 6GB though

thick goblet
shrewd barn
#

what image size?

thick goblet
#

1024 x 1024

shrewd barn
#

nice

thick goblet
#

refiner model working nice too

midnight shuttle
#

If I have time this weekend I will look at A1111 code to try to understand memory usage.

shrewd barn
#

I was testing it on a A4500 earlier in A1111, 20GB of vram and it was still erroring at 1536x1024

#

it would get 1 or 2 gens out, then after that each time it would run out of memory when trying to decode, and just quit

#

that was through wsl and docker though, so I think it couldn't use shared RAM

thick goblet
#

i want to upscale or refine real world images would that work ? without prompts

#

with refiner model tho

midnight shuttle
#

Upscale just use the Extras tab in A1111. To refine a real world image you must give a prompt so it knows what the image is.

shrewd barn
#

you could use clip interrogate to get a prompt from the image

thick goblet
#

oh smart ye clip interrogate sure

heady vale
exotic lichen
heady vale
#

Salad fingers in this one

trim orbit
#

gigapixel vs 1.5 is a questionable option. vs xl it's a no brainer. set up an xl upscale workflow and blow gigapixel away

trim orbit
#

yup. gigapixel is over 5 years old. its like 3 generation of AI behind. old news really. and it's still paid software.

molten gull
half ivy
#

i am here
what did i miss

trim orbit
#

ControlNets came out!! ||naw jk||

half ivy
#

bruh

#

you mean the subtle flirt by Dukes o0

vocal stream
trim orbit
#

chaotic neutral

half ivy
vocal stream
half ivy
#

honestly, you filled my heart with joy for a brief second so worth it

trim orbit
#

💩

heady vale
#

I cant imagine how messy comfy workflow will look like with multiple controlnets working together with upscale etc

trim orbit
#

its gonna get fun

sour obsidian
shrewd barn
half ivy
#

wish i could code so i could help, but i'll be moral support instead, LG TEAM!!!

soft bone
#

yo did you ever figure this out? its clearly using latent fill rather than original fill but how to change🤔 or it could just be using grey pixels

shrewd barn
trim orbit
soft bone
south igloo
trim orbit
#

when he released that i figured he was some phd guy

soft bone
#

no i helped him learn DB when he made the double exposure model

shrewd barn
#

SAI have mentioned they have some internally already, but they haven't released because they're trying to make a lighter weight solution

vocal stream
#

they are changing the Controlnet architecture for sdxl, theirs is not really going to be quite the same as training an old-style one again

soft bone
#

well it was a community effort

trim orbit
#

we just need to train models

soft bone
vocal stream
#

ugh I hate that there's still no way to hide the '1 blocked message' thing in discord

half ivy
trim orbit
#

new research since then has opened up an opportunity to do it a better way

soft bone
#

ik im just impatient, like how we all trained 0.9 loras

sour obsidian
dapper current
#

The problem is ControlNet in its current state makes a complete copy of the UNet which becomes cumbersome for SDXL - you'd have another billion parameter model on your hands and it may not actually be necessary

#

@sour obsidian and @visual glade have a much better idea

trim orbit
#

i'll take that as a promise!

sour obsidian
dapper current
#

This isn't a direct reason for it at all, but something I personally think about is the energy cost of inferencing generative models and the impact on the environment. If we can standardize a much smaller model with the same results it's a win win for everyone and everything.

shrewd barn
#

I'm excited for the future potential. Controlnet made 1.5 SO much more powerful and useful for img2img workflows.

trim orbit
#

ohhh is that the same seed without a lora? geeze it leaned into my prompt good

prime barn
slow marsh
#

Hi, does anyone know where I can test run the SDXL 0.9 refiner model online? My computer has no GPU, so I'm looking for an online trial

sour obsidian
dapper current
#

Do it again @sour obsidian do it again

sour obsidian
trim orbit
dapper current
#

I'm in bed on my phone hahaa I just can't keep away 😉

trim orbit
#

you scallywag

vocal stream
#

any plans for a TemporalNet or other vid-aimed controlnet releases for sdxl?

sour obsidian
vocal stream
#

nice

dapper current
#

Thinking is putting it lightly

#

Those words are uttered every day I think

sour obsidian
#

hahahah

#

I just want pretty pictures, more pictures, more pretty

#

smooth pretty?

#

idk its been a long week

#

haha

dapper current
#

Haha

#

If you've seen a paper or heard of it, we've read it and thought about it basically

sour obsidian
dapper current
slow marsh
#

Hi, does anyone know where I can test run the SDXL 0.9 refiner model online? My computer has no GPU, so I'm looking for an online trial

soft bone
#

I have an idea. Joe strongly recommends using celebrity names for face-lora training. What would be the best tool to find the closest "celebrity lookalike" given an input image of someone unknown? tryna scale this

dapper current
dapper current
#

... they're all kinda... Not great 😅

#

Lots of gimmicks

vast galleon
#

Something has gone terribly wrong ...

trim orbit
#

relevant username

dapper current
#

Name checks out

soft bone
heady vale
vast galleon
#

LOL

slow marsh
high skiff
soft bone
half ivy
heady vale
#

XL spawned new life, but not as we know it

high skiff
trim orbit
#

i love how easy it is to spank out new desktop wallpapers now

high skiff
#

Even on ultrawides

trim orbit
#

same. i'm only operating at 1080p for speed

soft bone
#

but i still have to go up to 0.8 denoise and the edges are solid, no mask blur even though there is blur set

native moon
#

is there a high res fix in comfy ui so i can generate high res images (1.5)

half ivy
vast galleon
#

Trying that setup with a refiner included is giving odd results

native moon
# half ivy

is the high res in automatic1111 also just an upscaler? i thought it gets generated in high res directly

heady vale
half ivy
#

yea i think so 🤷‍♂️ 😂

native moon
#

my main problem in 1.5 is this behavior

soft bone
half ivy
#

was gonna say resolution but this is 386 x 579🤔

vast galleon
tender timber
heady vale
nimble heart
heady vale
native moon
# half ivy

i have tried that but it seems like the image renders in high res but looks low res and bad quality

thick goblet
#

this dog is sad because it didnt even try to spell the word

ionic dragon
thick goblet
heady vale
royal fern
#

with the standard workflow?

#

I have a 3060 and do 1.4

heady vale
#

hard to compare unless you use the exact sampler and settings etc

native moon
#

is this ok?

high skiff
#

I'm sure with enough tweaking I could get 4K working with my workflow, but it would just be a lot of additional nodes and variables to ship to the end user

high skiff
#

I do not, but I'm pretty sure it runs out of memory and picks it up from there

half ivy
nimble heart
#

maybe that's only the dedicated tiled node though

shy kelp
#

So far, 1.5 seems light years ahead in terms of output. Sdxl has had a few amazing wins on some prompts but in general I think it probably needs a lot of lora and model support before it's really good.

high skiff
#

@shy kelp I think you should remember and take into consideration just how dog water 1.5 was on release, and how SDXL can keep up with it considerably, or even surpass it in certain things such as realism

It is an incredibly powerful next generation tool that is light years ahead of its predecessor when not fine-tuned, so I have huge hope for the success of it over time

jolly creek
dry anchor
#

training an SDXL1.0 lora and only got 12s/it by my 3080. Is this normal? using LoRA_Easy_Training_Scripts

nimble heart
#

yea people 100% are over-remembering 1.5 after being so used to tunes and tooling.
compare 1.5 base without embeds/loras/cnet to xl and there's no competition

calm light
#

my first generated

royal fern
native moon
half ivy
#

changed the settings a bit

tribal magnet
#

interesting, i'm generating a batch, watching a stream, AND playing a game. And nothing is hitching. I wonder what it was that got better. it's always been if i tried playing anything while genning, either A1111 would slow to infinite ETA, the game would stutter, or both. This is really nice.

nimble heart
#

going all the way up to 4k wallpaper sizes is kinda throwing me.

base image
2 pass pixel upscale - fast and coherent-ish but lame
2 pass latent upscale - schizo
3 pass latent upscale - schizo with butterflies

Maybe the mythical 4 pass will save me.
....or 5?

heady vale
candid walrus
#

yeah 1.5 couldn't do this ^

crystal forge
#

ComfyUI question, advice for sharing the same primitive node for the "steps" and "start_at_step" (i.e. get rid of the "Prenoise steps Duplicate" in the image below), it won't allow me to drag the same INT output to both, so I am guessing these are registered as different datatypes

pallid sail
#

Ohhh, I know why my generations still have lines in them even though I specified the new model that has the 0.9 vae baked in, I only downloaded the base one and not the refiner one

#

I feel stupid

nimble heart
heady vale
proper goblet
gleaming gate
molten gull
pine turtle
#

custom LORA sdxl 1.0

Raw output ...... no upscale

molten gull
#

so what's the best way to upscale things ?

heady vale
molten gull
#

is that a a1111 thing only ?

#

how can i use it with comfyUI ?

heady vale
lilac wren
#

0.9 vs 1.0 (same seed and workflow)

heady vale
vast galleon
#

I think I got Hires Fix working. But it doesn't like anything outside of 1024x1024.

dry anchor
#

just found out the refiner has its own vae sadcat

pallid sail
#

Hmm, the 1.0 base, 0.9 vae refiner model has taken over 20 minutes to run 8 iterations. The normal 1.0 refiner finishes in about 10 seconds

lilac wren
#

How much Vram do you have?

short marsh
#

i used a a100 via colab pro

#

used just over 24 gig ://

vast galleon
half ivy
vast galleon
#

I know right?!

dry anchor
#

sd_xl_base_1.0_0.9vae.safetensors this one I guess?

half ivy
#

yo mangler
how would you improve this work flow, its repeated 3 times for different prompts, but man, your detail kills

dry anchor
#

I'm going back to the guide

vast galleon
half ivy
vast galleon
#

seems to work better with photorealism than 2D

dry anchor
#

it says these two

#

and this one

#

so what is sd_xl_base_1.0_0.9vae.safetensors exactly? a vae or a newer version of sdxl1.0?

vast galleon
nimble heart
lilac wren
# vast galleon

I don't understand how you had good results with this. Every gen i tried with your Workflow is completely broken

slow sun
#

only SDXL trained loras work so search

pallid sail
#

Sharing my findings with the watermark + 0.9 vae here. First cat has the watermark enabled (bugged so the red and green pixels appear) and is using the 1.0 VAE. Second cat has watermark disabled and is using base+refiner 1.0 both with the 0.9 VAE

slow sun
viral narwhal
#

if you drag a RAW image created with comfyui onto the ui it automatically changes the workflow, it blows my mind
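This works because the image file itself carries the node graph in its PNG text metadata. A minimal sketch of reading PNG `tEXt` chunks with only the standard library. The `workflow` key name and the demo PNG builder are assumptions for illustration, and compressed or international text chunks (`zTXt`, `iTXt`) are not handled.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict[str, str]:
    """Return the keyword -> text pairs stored in a PNG's tEXt chunks."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC.
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            texts[keyword.decode("latin-1")] = text.decode("latin-1")
        if chunk_type == b"IEND":
            break
        pos += 12 + length
    return texts


def _chunk(chunk_type: bytes, body: bytes) -> bytes:
    """Serialize one PNG chunk, including its CRC."""
    crc = zlib.crc32(chunk_type + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + chunk_type + body + struct.pack(">I", crc)


def make_demo_png(keyword: str, text: str) -> bytes:
    """Build a 1x1 grayscale PNG carrying a single tEXt chunk (demo only)."""
    header = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)
    pixels = zlib.compress(b"\x00\x00")  # filter byte + one black pixel
    text_body = keyword.encode("latin-1") + b"\x00" + text.encode("latin-1")
    return (
        PNG_SIGNATURE
        + _chunk(b"IHDR", header)
        + _chunk(b"tEXt", text_body)
        + _chunk(b"IDAT", pixels)
        + _chunk(b"IEND", b"")
    )


demo = make_demo_png("workflow", '{"nodes": []}')
metadata = read_png_text_chunks(demo)
```

To inspect a real file, pass `open(path, "rb").read()` to `read_png_text_chunks`. If the image was re-encoded or converted along the way, the chunks may be gone.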

half ivy
#

cringe

pallid sail
# half ivy cringe

Yeah, that's the diffusers watermark that's supposed to be invisible. If I stub out the watermark function in the pipeline then it goes away

#

Here's the image with no watermark and the 1.0 vae. You can see the colour banding at the bottom of the eye when you zoom in.
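Instead of stubbing out the watermark function by hand, recent diffusers releases accept an `add_watermarker` argument on the SDXL pipeline. This is a different route from the one described above, and whether the diffusers version used in this chat already supported it is an assumption. The helper names below are hypothetical, and the import is deferred so the snippet can be read without diffusers installed.

```python
def sdxl_pipeline_kwargs(watermark: bool = False) -> dict:
    """Keyword arguments that switch the invisible watermark on or off."""
    return {"add_watermarker": watermark}


def load_sdxl_base(
    model_id: str = "stabilityai/stable-diffusion-xl-base-1.0",
    watermark: bool = False,
):
    """Load the SDXL base pipeline, by default without the watermarker."""
    # Imported lazily: requires the diffusers package and downloads weights.
    from diffusers import StableDiffusionXLPipeline

    return StableDiffusionXLPipeline.from_pretrained(
        model_id, **sdxl_pipeline_kwargs(watermark)
    )


default_kwargs = sdxl_pipeline_kwargs()
```

This only affects the watermark speckling. The colour banding discussed above is a separate VAE issue and is not changed by this setting.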

sand idol
#

Do you recommend the 1.0 vae or the 0.9 vae?

somber hill
#

Hello @boreal bough

lilac wren
somber hill
#

Can you tell me your settings please I would like to compare

woeful patio
somber hill
#

Thank you so much for linking my video. @boreal bough said there are some flaws. I would like to learn his settings and make a comparison video. But network rank is totally related to user choice. More keeps more info and thus more size. If we train only unet we will get subpar results. New tokens we have no info related to that. Captions. If you want Realism captions severely reduces. Also in my previous attempts when training with captions I never found good results.

south frigate
#

How should I prompt in SDXL? The same like in 1.5?

pine turtle
#

@somber hill is there a way of compressing my LORA output ........

also thanks for putting up your video

#

Im getting output sizes over a Gigabyte

somber hill
pine turtle
#

hmmmm ok

somber hill
#

Network rank is simply how many Neural network layers it is keeping. So if it is high it train more layers thus have more info. When we do dreambooth instead of dreambooth lora we train all layers. Thus it is better than lora but requires more hardware

#

Lora is an optimization technique. Originally found for LLMs

pine turtle
#

Ok ..... what are your standard lora sizes ?

#

is it large like a gigabyte ?

somber hill
#

I think it depends on purpose

#

Ye 256 rank is gb
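For reference, in the original LoRA formulation the rank is the inner dimension of the two low-rank matrices added to each adapted weight, rather than a count of layers. That is why file size grows roughly linearly with rank. A small sketch of the arithmetic. The 1280x1280 layer shape is a hypothetical example, not a figure from SDXL or from the trainer discussed here.

```python
def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Extra parameters LoRA adds to one d_out x d_in weight matrix.

    The update is B @ A with A of shape (rank, d_in) and B of shape
    (d_out, rank), so the count is rank * (d_in + d_out).
    """
    return rank * (d_in + d_out)


def size_mib(num_params: int, bytes_per_param: int = 2) -> float:
    """Storage size in MiB, assuming fp16 (2 bytes) unless told otherwise."""
    return num_params * bytes_per_param / (1024 * 1024)


# Hypothetical 1280x1280 projection layer at two different ranks.
high_rank = lora_params(1280, 1280, 256)
low_rank = lora_params(1280, 1280, 8)
ratio = high_rank // low_rank
```

Under these assumptions, going from rank 8 to rank 256 makes every adapted layer 32 times larger, so lowering the rank is the most direct way to shrink the output file.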

polar aurora
#

what am i lacking for sdxl?

somber hill
#

Select auto or sdxl vae

woeful patio
pine turtle
#

Ok thanks ..........ill keep going with it then because my outputs have been great

half ivy
noble shoal
woeful patio
#

what does the empty latent image do in comfy?

crystal forge
late marsh
proud fossil
#

not that bad for base + refiner models without fine tuning

edgy otter
noble shoal
strange thorn
edgy otter
proud fossil
# edgy otter What was your prompt? I want to see how my refiner only setup does against base+...

I generated the prompt using a fine tuned LLama2-13B model I am making for making SD prompts
"Honda Civic parked on Highway, mountains in distance, dynamic composition, aerial perspective, black and white output, original vignette:1.27 during the blue hour, from a drone view [input], angular style, Plexure, toneset, hdr tone mapping, cinematic lighting, rim lighting, photolypze, pldirectrad, physically-based rendering, highly detailed , dvo stabilizer, rpyta aspoh otjecshron abjecshroomAIart"

#

thats why its not normal

proud fossil
#

SDXL 1.0 or 0.9?

edgy otter
#

I am not using it as intended tho. You are not supposed to use refiner on its own but I am, as an experiment.

proud fossil
#

I am really surprised on how fast and easy the LLama2-13B model understood the SD prompts, it still needs filters and a little more fine tuning to make it work

proud fossil
#

"car on mountain"

edgy otter
noble shoal
#

What env do you run LLama on?

proud fossil
molten gull
#

is it possible yet to train embeddings with sdxl and a1111 ?

edgy otter
molten gull
#

not loras, but textual-embeddings?

icy brook
#

Can’t even use it properly. Crashes my computer on both auto11 and comfy

#

I have a 2070 super with 8Gb vram

proud fossil
icy brook
edgy otter
heady vale
vale eagle
#

I have an issue with a lora. I trained a lora which is able to reproduce the original image features, but when I use the base+Lora->refiner workflow, the refiner makes the image less similar to the original image. How to solve this?

heady vale
#

yep thats the main reason with 8gb vram

proud fossil
icy brook
heady vale
#

ideally you want 32gb RAM on system and 8gb vram just passes

noble shoal
proud fossil
polar aurora
icy brook
heady vale
#

Im just observing what it uses on my system

noble shoal
proud fossil
peak dove
heady vale
#

in A1111 it was using 20-30GB RAM and all 10gb on the gpu. With comfy it's about 6-8gb vram and 20GB RAM for system

peak dove
#

I hope to upgrade to 24Gb VRAM

icy brook
#

I could run my own 2.1 based 1024px model on 8Gb vram and 15 gb ram with no issues

polar aurora
woeful patio
#

how can i get restore faces for comfy?

heady vale
#

XL is a different beast

proud fossil
polar aurora
proud fossil
#

its sad but we all need to upgrade and spend a bunch of money if we all want to run the latest stuff

icy brook
#

Many people in poor countries will have to resort to online

proud fossil
woeful patio
#

3070 currently takes about 30 seconds for a 1024 image

proud fossil
#

is civitai down right now?

edgy otter
dry anchor
#

no

noble shoal
woeful patio
crude sparrow
#

any good finetuned models i can get?

proud fossil
crude sparrow
ionic dragon
#

What does SAI mean that they have made finetuning easy on sdxl?

thick goblet
#

Idk why people want a high-end GPU when I am here running xl 1.0 on a 6gb vram 2060 at 1024 with just slight notepad command line changes

#

30 second wait time is good enough tho for me

rotund cliff
#

why is there no option to load the sdxl vae in the comfy sdxl workflow examples?

#

is it built in?

proud fossil
#

it has a little bit of everything imo

#

I should note that it is a SD 1.5 model

strange thorn
crude sparrow
woeful patio
#

the models right now are pretty mediocre, it probably takes a few months for the really good models to come out, stuff just takes time

proud fossil
split smelt
#

Any possibility to train embeddings on SDXL?

crystal forge
#

I seem to be getting better results with dreamshaperXL10_alpha2Xl10.safetensors as a refiner step than with sd_xl_refiner_1.0.safetensors, don't know if that is expected or if I am just using the refiner incorrectly? Showing zoomed image of the face, workflow attached:

woeful patio
#

is there a way to set the restore faces strength? because right now it comes out way too strong

noble shoal
thick goblet
azure oxide
thick goblet
proud fossil
clever verge
noble shoal
crystal forge
royal fern
proper goblet
#

Can anyone please tell me how can I inpaint in comfyui with SDXL?

woeful patio
noble shoal
crystal forge
woeful patio
#

is there a highresfix for comfy?

royal fern
#

these lines are a symptom of the wrong vae

proper goblet
royal fern
#

downloading the latest official vae

noble shoal
royal fern
#

and using that instead of the embedded

woeful patio
royal fern
#

if you downloaded it at day-1, get the new vae, they updated it

zealous horizon
#

Just finished my first style LoRA on SDXL 1.0
Children's book style illustrations, trained on ~500 images generated by a personal 1.5 merge model
(each image is from a different epoch)

noble shoal
thick goblet
#

whats the difference here ?

thick goblet
#

on this model

crystal forge
proper goblet
woeful patio
thick goblet
shy kelp
woeful patio
thick goblet
noble shoal
thick goblet
#

btw is there a better way to switch to refiner model fast ? or just a click

heady vale
woeful patio
thick goblet
fierce hollow
heady vale
#

Comfy is heaps faster, and once you learn to use it, it is technically better for being creative

strange thorn
#

Anyone have a good image2image workflow for sdxl1.0 in comfyui?

pallid sail
#

What's a leg and what's a tail? 😆

quartz yew
woeful patio
elfin cobalt
#

Did anyone write a history browser for ComfyUI yet?

#

(It looks fairly straightforward... but I thought I should check.)

stark finch
#

Is there a discord server specifically for questions / help around ComfyUI ?

somber hill
#

Check out the github file of this tutorial

#

Updated for sdxl 1.0 too

strange thorn
#

Will check that thanks

woeful patio
#

is this a sign that i'm using the wrong vae?

molten gull
#

its a sign that you might want to see a dentist 🙂

heady vale
woeful patio
#

how do i separately load the vae in comfy?

elfin cobalt
#

There's a 'Load VAE' node in the loader section.

#

It'll be offered as a suggestion if you drag a line from the VAE input to empty space.

woeful patio
stark finch
#

I thought it was possible to drag and drop an image posted here into comfy to get the node setup used to generate it?

woeful patio
#

also it has to be an image that was uploaded straight from comfy

late marsh
viral narwhal
#

pixel art without any lora

somber hill
viral narwhal
#

pos (masterpiece, best quality), mysterious,16 bit pixel art, epic composition of a explorer in a boat reaching an island in the clouds, by studio ghibli, cinematic still, hd

#

neg (worst quality, low quality:1.3), (greyscale, monochrome:1.1), cropped, lowres, text, jpeg artifacts, signature, watermark, username, blurry, artist name, trademark, watermark, title, multiple view, extra hand, mask, (animal ear:1.4), blur

woeful patio
#

have you tried turning it off and on again?

zealous horizon
#

❤️

pallid path
#

looks so squeaky clean

#

obv its distorted in areas but this looks too good to be just AI

zealous horizon
woeful patio
#

anyone have an image with the img2img workflow in it?

pallid path
#

oh a Lora?

#

from civitai?

zealous horizon
zealous horizon
pallid path
#

cool

indigo carbon
woeful patio
late marsh
slender raft
#

I'm having trouble with the WAS node add-ons for ComfyUI. When I try to connect to the new nodes, the connecting wire just seems to pick up the WAS node I'm trying to connect to and it gets stuck to my mouse. Base ComfyUI is fine. Any ideas?

azure oxide
#

I'd post it as an issue on the wasnode github

slender raft
#

Good idea, will do, but also wanted to see if anyone had seen similar.

#

The output of the new nodes works fine, but not the inputs.

#

I'm connecting from STRING to TEXT. I can start the line from the new nodes or the old ones, but it messes up when I try to connect them.

molten gull
azure oxide
#

Unfamiliar with it on comfy, but I know in unreal engine we'd sometimes need to convert the type first. It could be unrelated, but I'd look to see if you're able to convert string to text

crude sparrow
#

any idea on when controlnet releases for sdxl?

zealous horizon
#

mixing two of my LoRAs (style + subject) JuN10R + gr3g0r

viral narwhal
molten gull
#

i really like both 🙂 can you send me ? i would like to experiment some

slender raft
winged tree
zealous horizon
molten gull
#

yep 🙂

zealous horizon
dense chasm
dense chasm
#

i can't figure it out: after git clone diffusers, i get cannot import name 'DiffusionPipeline' from 'diffusers', but after pip install diffusers it works well
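
One possible cause, and only a guess from the symptom: after a git clone there is a folder named `diffusers` in the working directory, and Python may pick that bare folder up as an empty namespace package instead of the real library, so the import of the package succeeds but `DiffusionPipeline` is not in it. A minimal sketch to check where the name resolves from (the helper `where_is` is made up for this example, it is not part of diffusers):

```python
import importlib.util
import os


def where_is(module_name: str) -> str:
    """Report where Python would load a top-level module from (hypothetical helper)."""
    spec = importlib.util.find_spec(module_name)
    if spec is None:
        return "not importable"
    if spec.origin in (None, "namespace"):
        # A folder without __init__.py is treated as a namespace package:
        # "import x" works, but the folder exports no names of its own.
        return "namespace package at " + ", ".join(spec.submodule_search_locations)
    return spec.origin


print("cwd:", os.getcwd())
print("diffusers ->", where_is("diffusers"))
```

If this reports a namespace package pointing at the cloned folder, running from a different directory, or installing the clone with pip, would be the things to try.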

molten gull
#

@zealous horizon maybe i could help to test ?

#

just a friendly offer though, i can wait for release, too 🙂

dense chasm
keen moon
#

how much vram to train sdxl ?

pallid sail
keen moon
#

I have 3090

thorny frost
#

lol, the secret was not to ask for a "furry octopus" but for "a furry creature with the shape of a octopus"

dense chasm
woeful patio
#

is there an easy to follow guide for making loras in sdxl?

grizzled root
#

hi, i'm testing sdxl with comfy ui, and i would like to know how to write a correct prompt for an image with text inside. possible or not?

zealous horizon
paper phoenix
#

sorry if this is an inane question, but im still getting the knack of SDXL and my computer is slow so cant just hammer at keys and learn that way. see where it says Style: X? how am i entering or injecting that into a prompt in comfy then?

#

is that just the bot parsing it as a variable so other people can mix it?

silver dock
# late marsh

hey kiksu, if you don't mind, can i know what style or prompt you used for the image "man at the mountain"? but if you don't want to share it's okay, i understand

thanks in advance

pure hazel
#

Anyone try training yet? Hearing 1.0 is harder to train than 0.9

late marsh
slender timber
#

potentially dumb question. What is the significance of text_g vs text_l?

#

Are these just arbitrary var names?

pure hazel
brazen patrol
zealous horizon
timid sonnet
pure hazel
timid sonnet
heady vale
woeful patio
thorny frost
#

hands and feet look worse than ever

timid sonnet
woeful patio
lost vortex
#

help me make more sense of the technology

timid sonnet
#

@woeful patio I made it to simulate my DMT trips LOL

lost vortex
strange thorn
#

@thick goblet I also have the RTX 2060 but for me a generation in sdxl 1 takes like 2 minutes. What are you doing different?

#

In automatic1111

brazen patrol
supple bronze
strange thorn
#

@brazen patrol I did but I dont like the workflows. I like to do image2image too and I cant find a good way for that in comfyui

late marsh
timid sonnet
pure hazel
lost vortex
#

i wanna use it in my video cover help please

brazen patrol
#

Nevertheless, that is the secret to speed and performance. Comfy says you can do img2img and inpainting.

thorny frost
#

it is a total mess

woeful patio
late marsh
late marsh
lost vortex
late marsh
timid sonnet
shy kelp
# high skiff <@456226577798135808> I think you should remember and take into consideration ju...

Yes, I would completely agree with this. I get that I'm making a comparison of a mostly virgin release compared to a release that has months of fine tuned user models.

Maybe my wording didn't come out right, but what you said is essentially what I'm hoping for. That with some time, and some user trained models, it will be far superior.

At the moment, I have a hard time getting superior results.

thorny frost
#

not so messy but still the hands are a bunch of meat

timid sonnet
#

This could be a strange fetish

woeful patio
#

I've had some pretty good luck with hands lately

brazen patrol
#

On the flip side there are things it does and understands that 1.x could never do no matter how much I tried. Chess imagery was a disaster through and through, as was trying to get it to make a logo. SDXL is in a whole different league.

thorny frost
woeful patio
#

that one was done with a1111

#

it does seem to be worse with comfy

thorny frost
#

this one looks better, maybe because the feet are mostly hidden

supple bronze
thorny frost
#

otherwise images are beautiful

brazen patrol
#

Does it natively allow you to use the refiner and Clip?

woeful patio
winged tree
#

hands have been difficult in comfyui

thick goblet
#

Then upscale

#

Or use refine model then upscale

thorny frost
timid sonnet
soft zealot
timid sonnet
proud fossil
#

is there a way to generate videos in ComfyUI yet?

tender birch
#

does any1 have a tip for fast generation? What sampler at how many steps do you use? Right now i got 23 Steps with dpm++_2m_sde Karras for the base, and Refiner starts at Step 23 and ends at 30 with dpm++_2m Karras...
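
For anyone comparing setups like this, a small sketch of the arithmetic behind a base -> refiner handoff: how many steps each model actually runs, and what fraction of the schedule the base covers. The function name `split_steps` is made up for illustration, and the numbers are just the ones from the message above.

```python
def split_steps(total_steps: int, switch_step: int) -> tuple[int, int, float]:
    """Return (base_steps, refiner_steps, base_fraction) for a base -> refiner handoff."""
    if not 0 < switch_step <= total_steps:
        raise ValueError("switch_step must be between 1 and total_steps")
    base_steps = switch_step                    # base runs steps 0..switch_step
    refiner_steps = total_steps - switch_step   # refiner finishes the rest
    return base_steps, refiner_steps, switch_step / total_steps


base, refiner, fraction = split_steps(total_steps=30, switch_step=23)
print(f"base: {base} steps, refiner: {refiner} steps, handoff at {fraction:.0%} of the schedule")
```

Lowering the total while keeping roughly the same fraction is one way to trade quality for speed, though which split looks best is something to test per prompt.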

timid sonnet
soft zealot
timid sonnet
soft zealot
tender birch
soft zealot
timid sonnet
zealous horizon
# pure hazel Looks like 2 diff styles to me. More consistent than what I was able to achieve ...

this was testing material with different weights and epochs of that LoRA. It's a process of iterating over the model and generating new training material.
But those images were actually from the first version, therefore I'm pretty happy.
It also helps if the character-sheet is using the style you're intending as a final product.
Mine looks like this but needs improvement. but it's super highres (11520x5760)

cyan crown
#

For speed use comfyui

tender birch
timid sonnet
late marsh
cyan crown
#

Is there any way to train Lora in reasonable time?

tender birch
soft zealot
woeful patio
timid sonnet
late marsh
exotic path
soft zealot
tender birch
heady vale
winged tree
clever verge
# heady vale

That is an incredible image. Three hands and two eyes from the side. Should not be possible. 😄

soft zealot
#

"a pretty young woman wearing a diaphonous summer dress running trhough a meadow full of flowers on a sunny summers day"

Anyone got a set of binoculars for me please lol

clever verge
heady vale
#

the bike didn't fare too well though

clever verge
#

No, that's broken unfortunately.

timid sonnet
heady vale
#

Does anyone know of a node in comfy that can allow you to manually toggle something on/off without needing to break a connection like I am doing currently?

edgy otter
trim orbit
soft zealot
clever verge
#

Toggle would be nice though, it's been requested before.

trim orbit
#

reminds me of a photo i once took over a field

heady vale
trim orbit
#

muting nodes just breaks the entire layout usually

soft zealot
timid sonnet
crude sparrow
#

sdxl inpainting in the works or not?

heady vale
#

I have one connection going to Upscale, but I dont always want it active, so a toggle would be nice

trim orbit
crude sparrow
timid sonnet
trim orbit
#

the devs were teasing me with their controlnets last night. rapscallions!

crude sparrow
#

heard it was still in beta

trim orbit
timid sonnet
crude sparrow
#

"my world feels mushy without controlnet" sadcat

#

deep

trim orbit
# sour obsidian

dukes remixed my photo on me last night. i couldn't even tell at first.

timid sonnet
edgy otter
#

is there an extension that has preset prompts for setting parameters like lighting, angle, composition etc. for generating photographs?

#

If there isn't, I would like to create one 👀

soft zealot
crude sparrow
#

where can i find a good controlnet workflow

soft zealot
crude sparrow
#

yes pls

#

i spent too long figuring out how to inpaint

edgy otter
#

either for automatic or comfyui?

timid sonnet
pallid lintel
#

Hi, is there a guide on training controlnet for SDXL? I would like to train the openpose one

soft zealot
crystal forge
upbeat tundra
edgy otter
crude sparrow
#

can anybody explain how to use this (im clueless)

crude sparrow
#

asfd

soft zealot
crude sparrow
#

diy time

pallid lintel
#

Is there a guide on training controlnet for SDXL? I would like to train the openpose one

vast galleon
timid sonnet
fast vector
fast vector
#

just give me a few minutes. Okay, you basically convert clip g and clip l to inputs and connect them to a primitive node

pallid lintel
timid sonnet
vast galleon
pallid lintel
heady vale
timid sonnet
crude sparrow
#

imma try this workflow from civit

trim totem
#

interesting, link?

timid sonnet
vital wolf
trim totem
crude sparrow
#

uses 3 prompts instead of 1

#

thats 3x the fun!

pallid lintel
#

Introducing SuperStability 😅
SDXL -> Refiner -> Juggernaut Final + UltimateSDUpscale

soft zealot
midnight oriole
#

AI doesn't know the gigachad. Would you guys add him to its mind?

crude sparrow
#

how does sdupscaler compare with realesrgan and 3x ultrasharp?

soft zealot
vital wolf
pallid lintel
trim orbit
pallid lintel
pallid lintel
normal bridge
#

does anybody have a good prompt for getting helmet mounted camera like pictures?

crude sparrow
pallid lintel
potent summit
trim orbit
#

1.5 upscaling already destroys Topaz's software for quality. Topaz is just a one button operation instead of needing a little bit of a skilled hand. It does still work, it's just that the topaz signature artifacting is always there.

trim orbit
pallid lintel
normal bridge
indigo carbon
trim orbit
#

questions like, do i use 512 pixel images or 1024? i dont know

trim orbit
# indigo carbon

How do you make a cat sound like a dog? || Douse it in gasoline and throw a match at it||

pallid lintel
trim orbit
crude sparrow
trim orbit
edgy otter
#

Euler-a vs Heun Sampler. Same seed. 32 steps. 1024px. Base-SDXL.

I was noticing a lot of bad textures and repeated patterns for concepts mostly related to nature or concepts that might have very few training images e.g. "indian village". It seems to be a sampler problem.

trim orbit
#

try different cfgs too. i've seen good results all the way to 12

pure hazel
indigo carbon
#

I feel like the best CFG for the most part on SDXL is 4.2, I tried many different CFGs

trim orbit
#

but, also, 6.9

indigo carbon
normal bridge
#

reminds me of the impossible umbrella i got yesterday wait

trim orbit
#

VR hands

heady vale
trim orbit
dusky flax
heady vale
#

powahhhhh

pallid lintel
trim orbit
#

did you see when he couldn't race in the newest grand tour because he was too fat?

#

best part

fresh path
pure hazel
#

Actually I've been having a lot of trouble with viewpoints and perspective, even (low angle) wasn't working for me.

trim orbit
normal bridge
#

getting away from that seems very hard

trim orbit
#

describing with natural language is useful for the G clip, "looking up at subject from the ground" or "viewing from above"

#

by default it goes towards perfect composited portraits, but with strong prompts you can push it elsewhere into the latent spaces

ionic dragon
#

Are any unofficial controlnets out?

trim orbit
#

yes! || jk no ||

pure hazel
#

Okay im very confused, all of a sudden all my gens are appearing like this in all comfyUI workflows.

pallid lintel
#

Samuel L Jackson as superman

#

(kind of 😅 )

potent summit
#

Is there an img2img workflow in ComfyUI?

crude sparrow
#

best samplers for refiner to add detail??

trim orbit
#

there are tools to make them in comfyui. you can find predefined layouts other people have made

pallid lintel
ember cargo
#

I trained an SDXL lora using Blame! manga art (faces only though) and it's not bad at all 🤔
roughly 1.5k steps, 90 images in half an hour. Maybe it needs more steps though

potent summit
crude sparrow
#

should i keep these refiner settings or not?

pallid lintel
fresh path
ember cargo
crude sparrow
fresh path
trim orbit
#

should note that the training data for this lora is just in game units and portraits from starcraft 2 terran race. some screenshots and concept art too. realizes the aesthetic so well

pure hazel
#

Is there a good repo for SDXL comfy UI workflows?

pallid lintel
pallid lintel
trim orbit
#

mostly transparent pngs in the training data too

lyric grove
ember cargo
pure hazel
fresh path
#

ok cool

pallid lintel
pallid lintel
lyric grove
#

just more thorough

pallid lintel
#

A bit better

meager canopy
timid sonnet
still dove
# soft zealot

hey! I loaded this workflow in comfyUI and installed missing nodes but seed as text is still missing, any idea for that?

pallid lintel
# meager canopy

I just discovered your GTM "ForYou-Photo"! It works really really well for the 2nd refiner pass

pallid lintel
trim orbit
soft zealot
normal bridge
#

Is there a reason why so many use comfy over a1111 with sdxl apart from the two step thing going straight from base latent into the refiner?

trim orbit
pallid lintel
#

reusability, reproduction, tinkering

normal bridge
pallid lintel
#

I still fire up A1111 from time to time, just to try out new extensions or model integrations

trim orbit
#

eat a pile of vegetables today so your thinking cap is fired up later

soft zealot
timid sonnet