#✨|sdxl

fast vector
#

so, does anyone have afterdetailer workflow for comfy?

boreal bough
#

yeah, many words arent weighted equally. brown hair also dominates most prompts

#

but I'm getting there

slender hinge
#

the refiner does eliminate most of the texture issues, but i still don't consider it an amazing result. tbh i'm a little let down by the quality so far. I hope it will improve in the future with community efforts

#

also the double step process is annoying but 🤷‍♂️

slender hinge
static prawn
#

idk why but the refiner is deleting details for me

#

without refiner

#

with refiner

waxen berry
azure oxide
static prawn
#

yes

azure oxide
#

ya i dont think it's using the refiner in the same way that it does in comfy

static prawn
#

oh ok , i just lose a lot of detail here

#

i think u can clearly see in the image

boreal bough
#

k. fixed the blur, now I just need to set the style

slender hinge
boreal bough
#

what kinda image are you going for? anything specific

sweet bane
boreal bough
#

selfie of a cute young woman with pastel hair with freckles
(blur, bokeh:1.4), + whatever other negatives you use
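
For anyone unsure what the `(blur, bokeh:1.4)` part does: in A1111/ComfyUI-style prompts, `(text:weight)` scales the emphasis of everything inside the parentheses, and unmarked text has weight 1.0. A rough sketch of the idea is below. `parse_weighted` is a hypothetical helper for illustration, not either UI's real parser, and it ignores nesting and escaped parentheses.

```python
import re

# Matches "(some text:1.4)"; anything outside parentheses gets weight 1.0.
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_weighted(prompt: str) -> list:
    """Split a prompt into (text, weight) chunks. Hypothetical helper."""
    chunks = []
    pos = 0
    for m in _WEIGHTED.finditer(prompt):
        # Plain text between the previous match and this one.
        plain = prompt[pos:m.start()].strip(" ,")
        if plain:
            chunks.append((plain, 1.0))
        chunks.append((m.group(1).strip(), float(m.group(2))))
        pos = m.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        chunks.append((tail, 1.0))
    return chunks

print(parse_weighted("(blur, bokeh:1.4), deformed, ugly"))
```

So in a negative prompt, blur and bokeh are pushed away about 40% harder than the unweighted terms.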

slender hinge
#

i was actually trying to make sci-fi movie scenes, but was having issues so i just wanted to try something simple with XL first to figure it out 😂

#

didnt expect everyone to start making pastel ladies

slender hinge
boreal bough
#

Positive: cat summoning a gigantic tomato using the power of but a single infinity stone, in the style of marvel movie
Style: (horror:1.2) still, grunge, vignette, chromatic aberration, dark, lovecraftian
Negative: deformed, bad anatomy, ugly

slender hinge
boreal bough
#

🤣

#

dont question my 'story style' prompting!

#

it gave us masterpieces like these

static prawn
#

does anyone have an idea how to use the refiner the right way in auto?

boreal bough
eternal crystal
#

using the tiled vae got me down to a minute thirty, incredible. Thank you 😄

boreal bough
#

sai even confirmed that a1111 does not have full support yet, and will prob take a while

boreal bough
void skiff
#

In Automatic1111 I run this prompt "lora:offset_0.2:0.2 plastic doll Dora the Explorer wearing He-Man outfit" with Seed: 169085828 and Sampling method: DPM++ 2M SDE.
It looks okay until the end, when it glitches out and gives me a grainy image. Can someone test whether this is the same for others or if it only happens for me?

void skiff
slender hinge
#

maybe you still have an old VAE set in the settings?

#

for an older version of SD, make sure you are using the correct VAE

#

or automatic

indigo carbon
void skiff
#

I run it at 20 steps and it looks good until the last frame, and it still does great with the old Euler.

slender hinge
bold osprey
void skiff
#

I have the VAE set to Automatic, and I get the same when loading sdxl_vae.safetensors

bold osprey
#

use the separate vae just in case

#

what is your workflow

void skiff
#

It only happens with DPM++ 2M SDE, the DPM++ SDE looks okay

candid walrus
#

i just read some of the comments about the 0.9 vae, does anyone know the story on why, and the benefits?

nimble heart
#

They left it in the oven too long

azure oxide
#

i dont think the devs had said anything about it

candid walrus
# void skiff

yeah i got this when i tried different samplers, can't remember exactly which ones at present

#

yeah interesting to see it just appear in huggingface

wary bay
candid walrus
#

is it like a night and day improvement, or more like some prefer A and some prefer B?

nimble heart
#

anyone know a way to change the default workflow
or better yet load an arbitrary one on launch?

boreal bough
# void skiff

that one suffers from not enough steps on base model, too much noise AND not enough steps on refiner to fix it

vast narwhal
#

Is there a token limit in comfyui, or does it work like a1111?

void skiff
#

What does the bot here use to generate images? Which sampling method?

wary bay
left coral
#

I'm training a lora for SDXL. Getting this error trying to generate with it

17:58:40-904771 ERROR    Diffusers LoRA loading failed: crown_vic-000006 'UNet2DConditionModel' object has no attribute
                         ''
#

(am using kohya)

void skiff
#

Well, for the most part SDXL gives me great images so I am still glad.

candid walrus
#

i know i tried 30 steps on the _2m_sde_gpu and got the artifacts that kennylex posted, and thought, if i have to go more than 30 steps, so far... not the sampler for me

static prawn
#

finally a result i like 😄

candid walrus
#

but it begs the question if i do go higher yet on steps will i like the images enough to warrant the time

boreal bough
void skiff
#

What samplers do folks like to use for SDXL? Myself, I just try to avoid Euler and the ancestral samplers.

nimble heart
#

Euler.

static prawn
#

just used highresfix with the base, seems to work out pretty well for me

boreal bough
azure oxide
#

was a few months ago tho so foggy on the details

boreal bough
azure oxide
#

ah yeah convergence and all that

copper pollen
#

are there supposed to be watermarks in SDXL base model?

fleet harness
heady vale
cold mica
heady vale
boreal bough
heady vale
copper pollen
#

This one has some small scribbles in the bottom.
That's base model, txt2img.

left coral
#

Did you ever get an answer for this question? I'd like to know too.

copper pollen
#

@boreal bough - With refiner in img2img, I get another one in the bottom left, + one diagonal across the background, like stock photo watermark google result

lost blaze
#

Mr Bean Ascending (missing the spotlight though)

sweet bane
fair tartan
boreal bough
heady vale
slender hinge
#

Slightly better results in comfy vs auto 1111

copper pollen
zealous viper
eternal fog
boreal bough
zealous viper
viral narwhal
zealous viper
boreal bough
fast vector
fresh path
#

could anyone explain what It/s means? It says like 17 it per second but still takes a few minutes to generate

midnight shuttle
#

It's probably showing s/it, as in seconds per iteration.
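
The two units are easy to mix up because tqdm-style progress bars switch from `it/s` to `s/it` once the rate drops below one iteration per second. A quick sketch of the arithmetic, assuming a 20-step run (the step count is my assumption, it isn't stated above):

```python
def seconds_for_run(steps: int, rate: float, unit: str) -> float:
    """Estimated sampling time in seconds for `steps` iterations."""
    if unit == "it/s":   # iterations per second: higher is faster
        return steps / rate
    if unit == "s/it":   # seconds per iteration: lower is faster
        return steps * rate
    raise ValueError(f"unknown unit: {unit}")

print(seconds_for_run(20, 17.0, "it/s"))  # a bit over one second
print(seconds_for_run(20, 17.0, "s/it"))  # 340 seconds, i.e. several minutes
```

A readout of 17 together with a multi-minute wait only makes sense as 17 s/it.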

fresh path
#

oh haha

#

why is it so slow :(

midnight shuttle
fresh path
#

6900xt sdxl1.0-0.9vae 1024x

#

on sd1.5 I get 20x faster

#

is it expected performance?

midnight shuttle
#

So AMD card on Linux? Are you sure it's using the GPU? I don't know much about AMD.

edgy otter
#

I cannot recommend an AMD card for AI related tasks. All the software prioritizes NVIDIA because of its huge market share and CUDA, so you may constantly run into problems while setting up libraries etc.

fresh path
#

it's on windows using the --directml argument for comfyui

#

and yes gpu is 100%

midnight shuttle
#

Everything I've heard is that DirectML is slow and trash. Not many people use it so you will need to ask around.

edgy otter
fresh path
#

oh cool

#

thats a shame

polar epoch
#

Automatic1111 isn't really in my favor atm lol

fresh path
midnight shuttle
edgy otter
vast narwhal
polar epoch
midnight shuttle
polar epoch
#

this was an earlier run with basic prompts testing refiner in img2img

polar epoch
# midnight shuttle What was the prompt and CFG?

7 and "beautiful young woman, short black messy hair, white, beautiful eyes, sitting in the park watching birds with a beautiful garden behind her
Negative prompt: fat, , , bad hands, mutated heads, multiple heads"

midnight shuttle
#

It's almost like your negative prompt got in the positive somehow.

fresh path
polar epoch
#

Indeed, automatic has done that lately lol. The more i imply negative, the more it adds it kek

#

@midnight shuttle Me and my non-existent short-term memory

midnight shuttle
#

Yep. Wrong model.

fresh path
#

bro refining noise

polar epoch
edgy otter
# fresh path did you find any way to speed it up?

No. If xformers supported AMD we could have gotten better performance but it doesn't. Someone created an alternative to xformers called "flash-attention" and folks at AMD are trying to modify it to work on AMD. Fingers crossed.

The last update was yesterday.
https://github.com/ROCmSoftwarePlatform/flash-attention

polar epoch
#

Oh boi.. This will kill my gpu lol

#

Also, I'm noticing that generation in automatic is quite a bit slower than in comfy

fresh path
midnight shuttle
polar epoch
#

Just about made it kek

midnight shuttle
#

Filling up 24 GB? Wow.

polar epoch
#

Nibbled at my ram, then it went down again :P

fresh path
#

it burped

midnight shuttle
#

VAE seems to use a lot of VRAM in SDXL/A1111. 9.3 GB used then up to 13 or 14 when VAE runs.

polar epoch
edgy otter
polar epoch
midnight shuttle
#

There is definitely something strange with A1111 and SDXL memory usage. It should not be so inefficient.

polar epoch
#

Nope!

fossil thorn
#

In comfyUI, is there any way to pick a single image from a batch and only regenerate that one? When I drag the image back in, I just get the same batch that I originally generated.

calm shell
#

here is the prompt: With Cat-Like Tread
Upon our prey we steal!
In silence dread,
Our cautious way we feel!
No sound at all
We never speak a word,
A fly's foot-fall
Would be distinctly heard!

polar epoch
#

Now to refine this one that already looks great at 80 steps :P

fresh path
calm shell
#

artroomAI with sdxl 1.0

edgy otter
#

pytorch and rocm's last update boosted my iterations by 3-4 on SD 1.5 512px

boreal bough
fresh path
edgy otter
fresh path
#

with 6900xt? wow, I only get 1

#

on 1.5 512px

edgy otter
slender hinge
#

It doesnt seem to be very good at faces that are not close up, and even close up ones seem iffy

zealous viper
polar epoch
#

Not too bad, teeth is a bit off though

midnight shuttle
indigo carbon
#

I want to punch this shit eater

#

I hate this duck cube so much

slender hinge
#

some ok results though

polar epoch
#

Is there a "highres fix" / "upscale by" feature in comfy?

polar epoch
zealous viper
alpine pine
edgy otter
midnight shuttle
indigo carbon
slender hinge
edgy otter
polar epoch
zealous viper
#

just gonna ignore the fact the tree/rock/explosion thing in the background is a low poly mesh but

fleet harness
midnight shuttle
#

Midjourney has the advantage of being a closed model so they can train it on more things and then filter the results. Open source models need to take into account possible misuse when choosing what to train. Have to be careful when training images of people.

boreal bough
zealous viper
static prawn
#

think im getting more into it now

fossil thorn
#

Going to ask again because I think it was missed. Is there a way in ComfyUI to pull a single image from a batch and just generate that one image again.
When I drag the image back in, it just gives me the workflow for the whole batch. I have tried incrementing and decrementing the seed, but I can't find it.

static prawn
#

just need an upscaler

boreal bough
fresh path
zealous viper
fossil thorn
# alpine pine I do not believe so

Is there any way to interrogate the image to get its generation parameters? Or are the later outputs in a batch dependent on the earlier outputs.

south igloo
#

Where is the best place to find well made comfy UI work flows? The ones I find are very basic I want one that can do high-res fix and face restore etc

edgy otter
boreal bough
eternal fog
noble shoal
#

My eyes hurt just a little bit

indigo carbon
fresh path
zealous viper
boreal bough
#

interrogator + Vit-H, if its just an image

edgy otter
fresh path
#

oh then thats why. I'm on windows, if I remember correctly rocm didnt work for me

#

its crazy linux has 10x performance for you lol

indigo carbon
fresh path
#

1 it/s on a $1000 card is amazing

fossil thorn
#

I don't want to do the batch each time because that is just wasting GPU time.

nimble heart
fresh path
#

what do you mean?

#

oh is rocm the only official amd platform?

nimble heart
#

yes

fresh path
#

fun

nimble heart
#

idk what card you have but if it's an AMD one ROCm is their version of CUDA and it only works on Linux right now. It's coming to windows Soon™️

#

rocm started as a server thing. didn't even work on consumer GPUs for a while.

fresh path
#

ah I was just talking to s0md3v because they have the same card as me and is getting 10x the speed

nimble heart
#

yea

sweet bane
nimble heart
#

I have a 7900 XTX and it runs pretty fast on linux.

edgy otter
nimble heart
#

sdxl 1024 is like 3.25 it/s or something

edgy otter
fresh path
#

wait I can just install linux

nimble heart
fresh path
#

yeah 7900xtx is pretty good

nimble heart
#

I have both windows and linux

edgy otter
fresh path
#

idk if windows shitty partitions will allow it

nimble heart
#

windows for my Rift linux for everything else

nimble heart
indigo carbon
nimble heart
#

yea

indigo carbon
nimble heart
#

S

#

the cheap one

edgy otter
nimble heart
#

worked fine for me

#

back up your stuff first though if you've never messed with that stuff before

indigo carbon
#

damn, I'm a huge fan of the original CV1, I had S, CV1 is soooo good

boreal bough
#

then it loads all the params

shy kelp
#

Should I buy a100

nimble heart
boreal bough
alpine pine
nimble heart
#

if you're 100% certain you really wanna do literally all the AI stuff sure. If AI is an occasional hobby I'd snag a 3090 or something instead

shy kelp
#

A100 is like 2.3 times faster than 4090

boreal bough
fresh path
boreal bough
#

3090 is the best bang-for-buck option you have right now

noble shoal
shy kelp
#

Not a professional thoo

fossil thorn
nimble heart
#

long as you don't mind the ROCm pains

dusty turtle
#

lol just get an a6000 if you really really need the vram on 1gpu

alpine pine
agile quarry
torn canyon
#

It's running well on my 7900XT, getting ~2.9it/sec for 1024x1024 SDXL

outer oxide
nimble heart
#

I don't have a 24 gig nvidia card to compare against

boreal bough
shy kelp
#

I use an RTX 3060, should I upgrade to a 4080

nimble heart
#

I do know that the pytorch 2.0 attention is broken on AMD and it causes these VRAM spikes like 2-3x what you'd normally see

boreal bough
#

anything but that card

nimble heart
#

so you have to use sub quad or doggettx attention

boreal bough
#

ah, yeah XD 4080 is ok

dusty turtle
#

12g-16gb memory? not really an upgrade

shy kelp
boreal bough
#

not necessarily what I'd recommend for ai, but its good for gaming

shy kelp
#

Which one is the 24gb one

fossil thorn
shy kelp
#

Ohh 4090

#

I want to upgrade to a 4090

boreal bough
#

3090 (around 700$ used)
4090 (around 1800$)
have 24gb vram - and are the best for AI

shy kelp
#

Ohhhh thats nice

boreal bough
#

4090 is not worth it for speed. while it is a bit faster, it's not nearly enough to justify more than double the price

shy kelp
#

3060 still works but I want to learn training and fine-tuning

nimble heart
boreal bough
#

I only got the 4090 since I do gaming and other things, and had the extra money

dusty turtle
nimble heart
nimble heart
#

bitsandbytes isn't working on my card yet

#

only RDNA 2 cards

south igloo
#

Win 11 has faster performance too

nimble heart
#

back when I still had my 1070 4/8 bit didn't even exist so I'd say it's a fair comparison

shy kelp
#

So this XL doesn't work well with 512x512, does it

nimble heart
#

yea no

#

use a 1.5 tune for 512

#

50% faster half the memory better results

shy kelp
#

Hmm

dusty turtle
shy kelp
#

So what is this xl for

exotic path
#

Finally got the right to use A100 gpu on google cloud

south igloo
#

Xl sized images

alpine pine
shy kelp
#

1.5 can do 1024 too 😭

nimble heart
# dusty turtle man that must suck

I mean there's a ROCm version someone made that seems to work well for 6900 XT's and similar. I ported some parts of the code to target my 7900 XTX but it miscompiles with some clang problem and I'm too lazy to fix it

midnight shuttle
# shy kelp 1.5 can do 1024 too 😭

But not well. If you want 512 images use 1.5 or 2.1. If you want larger images such as 1024 or other supported resolution use SDXL. Very simple.

hearty ginkgo
shy kelp
#

But 1024 is way too slow

exotic path
midnight shuttle
shy kelp
#

It will still be way slower even on a better gpu 😭

nimble heart
alpine pine
vital wolf
#

How to enable embeddings in comfyui? Like it's no half vae for a1111

alpine pine
shy kelp
alpine pine
shy kelp
#

Yea 😭😭

alpine pine
#

especially when you account for success rate

nimble heart
#

SDXL is faster on 768+

calm shell
#

Bonfire of the Vanities (that's the prompt) ArtroomAI sdxl 1.0 dpm++ SDE

vital wolf
#

How to use textual inversions in comfyui?

dusty turtle
# nimble heart :(

I should probably move to linux whenever my upgrades arrive, dealing with all this shit on windows and amd is just pain

#

its all hacky kinda works *** bs

calm shell
torn canyon
#

Yeah linux works so much better for AMD cards, it's well worth it

shy kelp
#

What causes this unrealistic look in stable diffusion

nimble heart
vital wolf
dusty turtle
nimble heart
nimble heart
#

major oof

#

And here I felt kinda bad paying $1050 for my OC'd XTX

calm shell
#

price/performance not worth that I think

nimble heart
#

XTX or 4090

boreal bough
# eternal fog 😢

3090 will always be the better deal, unless you're determined to play cyberpunk without dlss enabled

calm shell
#

my 1660 S does quite well.

#

but I think I need more vram than 6gb

nimble heart
# calm shell my 1660 S does quite well.

man once you get a card that has ai accelerators it's a whole new world. Instead of very carefully generating images hoping they turn out OK you just generate like 10 of them at once and pick the best one

#

in 1.5 I had my default batch size to 16

boreal bough
# eternal fog 😢

is what I had to get to make full use of games along with rtx4090
you end up with weird bottlenecks if you go highest end

#

definitely wouldn't recommend XD

dusty turtle
nimble heart
#

idk anything that can use gen 5 speeds yet

boreal bough
calm shell
#

Whats the downside of a 12gb 4060?

nimble heart
#

It's really really hard to be bottlenecked on a gen 4

shy kelp
#

What model is best for photorealistic

nimble heart
#

unless you literally just copy around 100 gig models

boreal bough
#

ah, yeah. windows boot drive.
games - ratchet and clank: rift apart!

#

no loading while jumping through areas XD

nimble heart
#

linux gets my gen 4

boreal bough
#

I just want my adobe stuff to load faster tbh
windows is just an additional bonus

nimble heart
static prawn
dusty turtle
calm shell
#

I dumped adobe when they went SaaS

#

But intel dumped Optane

nimble heart
dusty turtle
fast vector
nimble heart
#

it's not optimizing anything if the whole drive is made of the same chiplets

dusty turtle
# nimble heart it's not optimizing anything if the whole drive is made of the same chiplets

optimizing? it's a completely different tech to nand flash, I use it because it has a hilariously high write endurance/sustained write speed vs most junk nand drives
https://www.intel.com/content/www/us/en/products/sku/211867/intel-optane-ssd-p1600x-series-118gb-m-2-80mm-pcie-3-0-x4-3d-xpoint/specifications.html

heady vale
#

PCWorld did a video on optane drives. For most tasks regular nvme is better. For some tasks optane flies past everything

pliant acorn
#

ROCM is now out for windows. Any ideas how to use it with comfy ui?

alpine pine
nimble heart
heady vale
pliant acorn
#

oh

#

😦

nimble heart
#

yea

alpine pine
slender raft
#

Anyone tried latent upscale? I'm getting artifacts when I try.

nimble heart
#

think endurance is oversold nowadays. had this drive for like 4 years across two computers, having done a lot of wipes + clean installs and in/uninstalling dozens of fat bloated modern games.
total expenditure, 8% lol.

dusty turtle
#

it loads ~25gb of write to the drive every gen

nimble heart
#

Fucking how. I've been using SD since 1.4 came out

#

mostly on that drive until recently

#

What's writing 25 gigs?

dusty turtle
#

Lmao. I really want the ram to arrive so this sort of nonsense can stop occurring (this is from just this session)

nimble heart
#

Ah

#

I've never used pagefiles

#

ever

#

swap on windows and page on linux are both 0

#

wait

#

swap on linux and page on windows

dusty turtle
#

I tried to get away with not getting more ram but llms are 100% bandwidth starved so upgrade it is lol

nimble heart
#

ram's so cheap

indigo carbon
#

also, update on the whole TensorRT situation, NVIDIA stated that on the next TensorRT release they will release a script that converts almost any safetensors(including SDXL) into TRT

nimble heart
#

last thing you should cut corners on

dusty turtle
#

this was a while ago

nimble heart
#

the difference between 16 and 32 gigs on my 2017 machine was like $30

#

and until very recently 32 was enough to do everything under the sun without swapping

native moon
#

hey does someone know how to use comfy ui? because i have some problems with loras and embeddings. i don't know how to activate them. i added them to the node tree but idk what to type in the clip text box to control them

nimble heart
#

they're just active

#

it's not like auto where you need to type in a specific token to activate them

#

just go base model node -> lora node -> everything else

#

set the strength to 0 to turn it off temporarily
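
To make the "base model node -> lora node -> everything else" wiring concrete, here is a partial graph in ComfyUI's API (JSON) workflow format, written as a Python dict. The node class names and input names are the stock ComfyUI ones as far as I know; the file names and the `set_lora_strength` helper are hypothetical, and a real graph still needs a sampler, latent and VAE decode after this.

```python
import json

# Partial graph only. Links are [source_node_id, output_index].
# CheckpointLoaderSimple outputs: 0 = MODEL, 1 = CLIP, 2 = VAE.
# LoraLoader outputs: 0 = MODEL, 1 = CLIP.
workflow = {
    "1": {
        "class_type": "CheckpointLoaderSimple",
        "inputs": {"ckpt_name": "sd_xl_base_1.0.safetensors"},  # hypothetical file
    },
    "2": {
        "class_type": "LoraLoader",
        "inputs": {
            "model": ["1", 0],
            "clip": ["1", 1],
            "lora_name": "my_lora.safetensors",  # hypothetical file
            "strength_model": 1.0,
            "strength_clip": 1.0,
        },
    },
    "3": {
        # Everything downstream takes MODEL/CLIP from the LoRA node, not the loader.
        "class_type": "CLIPTextEncode",
        "inputs": {"text": "selfie of a cute young woman", "clip": ["2", 1]},
    },
}

def set_lora_strength(wf: dict, node_id: str, strength: float) -> None:
    """Set both LoRA strengths; 0.0 makes the LoRA a no-op without rewiring."""
    inputs = wf[node_id]["inputs"]
    inputs["strength_model"] = strength
    inputs["strength_clip"] = strength

set_lora_strength(workflow, "2", 0.0)  # temporarily off
print(json.dumps(workflow["2"], indent=2))
```

No trigger token is needed in the prompt text: the LoRA applies because the model and clip connections pass through it.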

orchid oar
#

Does anyone of you have a good Img2Img workflow for SDXL 1?

fast vector
native moon
#

and with the embeddings?

native moon
nimble heart
nimble heart
fast vector
nimble heart
#

if XL works 1.5 should 100% work

white iris
#

controlnet with xl works ??

nimble heart
#

No

fast vector
nimble heart
#

the tooling does but it needs to wait for new models to be trained iirc

native moon
white iris
#

right, saw the github discussion, waiting til its out then ill switch over

vital wolf
#

I'm getting the error "shape mismatch when trying to apply embedding, embedding will be ignored 768 1280" when I try to use embeddings in comfy

nimble heart
#

I thought you meant 1.5 on its own

soft bone
#

I'm trying to build automatic face-inpainting in comfy. But with CLIPseg it makes a solid mask that cant be denoised from the original image ("original fill" mode in auto1111)

Ideas?

native moon
vital wolf
#

I'm getting the error "shape mismatch when trying to apply embedding, embedding will be ignored 768 1280" when I try to use embeddings in comfy. Can anyone help?

dusty turtle
#

are you using 1.5 embeddings for xl?

vital wolf
#

Using it for dreamshaper

#

Sdxl has different negative embeddings like BadDream or UnrealisticDream

#

?

static prawn
nimble heart
#

fucking resetting GPU motherfucker fix your shit AMD fuck

dusty turtle
native moon
nimble heart
#

also have my SD output folder on it

native moon
nimble heart
#

recommend 64 just for that

nimble heart
static prawn
sweet bane
native moon
slender timber
#

So, I left a SDXL lora on back on a 1.5 generation. It is not terrible..

#

Actually seems to handle upscaling pretty well

fast vector
nimble heart
#

very carefully

azure oxide
#

comfyui can do that

slender timber
#

comfy

azure oxide
#

actually it should be pretty easy to (accidentally) do on a1111 too lol

nimble heart
#

lets just answer anyone's questions they'll ever have from now on with comfy

fast vector
#

so, have you tried the other way around? 1.5 lora into SDXL model?

dusty turtle
#

spooky eyes

nimble heart
#

don't think that'd work. won't be big enough

#

or something

slender timber
#

I've not seen great results with that

#

re: 1.5 -> SDXL

nimble heart
fast vector
#

XL Loras are like insane size. Some on Civitai are the size of pruned FP16 models

dusty turtle
slender timber
alpine pine
dusty turtle
nimble heart
#

nah fuck 1.5 it runs like ass at high res

orchid oar
#

How do I get the denoising option for img2img?

nimble heart
#

2 megapixels it's like half speed or something

nimble heart
dusty turtle
#

lmao (agony)

fast vector
nimble heart
#

if total is 20 and it starts at 10 that's 50% denoise
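
Written out, the start-step rule is just a ratio. A small sketch, with function names of my own choosing rather than anything from ComfyUI:

```python
def denoise_fraction(total_steps: int, start_at_step: int) -> float:
    """Fraction of the schedule that actually runs when sampling starts late."""
    if total_steps <= 0 or not 0 <= start_at_step <= total_steps:
        raise ValueError("need 0 <= start_at_step <= total_steps and total_steps > 0")
    return (total_steps - start_at_step) / total_steps

def start_step_for(total_steps: int, denoise: float) -> int:
    """Inverse: which step to start at for a wanted denoise strength (0..1)."""
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    return round(total_steps * (1.0 - denoise))

print(denoise_fraction(20, 10))  # 0.5
print(start_step_for(20, 0.25))  # 15
```

Starting at step 0 is a full txt2img-style run; starting near the last step barely changes the input image.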

glad fulcrum
#

sometimes I get this,.

#

I queue a prompt and it does nothing

orchid oar
azure oxide
orchid oar
fast vector
alpine pine
#

and in sd1.5 you used higher ones

dusty turtle
nocturne dove
#

There are a bunch of different lora types and the appropriate size for each type varies. Then for each type, there are size settings that change the size. You can't just say "lora is big therefore settings are wrong".
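
As a rough illustration of why the size setting matters: a plain LoRA on a linear layer stores two low-rank matrices, so it adds `rank * (d_in + d_out)` parameters per adapted layer. The layer list below is made up for the example and is not SDXL's real layer inventory; other LoRA types (conv layers, LoCon and so on) count differently.

```python
def lora_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters added by a plain LoRA on one linear layer (down + up matrices)."""
    return rank * d_in + d_out * rank

def lora_megabytes(layers, rank: int, bytes_per_param: int = 2) -> float:
    """Approximate file size in MB; 2 bytes per parameter corresponds to fp16."""
    total = sum(lora_params(d_in, d_out, rank) for d_in, d_out in layers)
    return total * bytes_per_param / 1e6

# Hypothetical stand-in: 100 adapted layers, each 1280 -> 1280.
layers = [(1280, 1280)] * 100

for rank in (8, 32, 128):
    print(rank, round(lora_megabytes(layers, rank), 2), "MB")
```

Size grows linearly with the rank (dim) setting and with the number and width of adapted layers, so the same dataset can produce a file of a few MB or well over a GB depending on settings.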

heady vale
dusty turtle
#

surely you dont need 1.5gb of weights for your like 100 image lora

nocturne dove
#

That's true for sure.

azure oxide
alpine pine
fast vector
#

since resolution is 4x, so I would expect 4x size

heady vale
fast vector
edgy otter
#

did someone say street photography?

heady vale
queen ether
#

76 2k images done in 30 minutes. Woohoo Lora training

echo stratus
#

Waiting for controlnet.

static prawn
queen ether
glad fulcrum
fast vector
civic sigil
queen ether
wet nacelle
glad fulcrum
#

I need to learn how to do it then. I have 12 GB VRAM. Do you have a tutorial for sdxl training lora

queen ether
glad fulcrum
#

do you have the link?

queen ether
#

He's a YouTuber

#

@fast vector 90 images in 30 minutes for a 3060 12 GB? I can't believe how much power sdxl demands :((

fast vector
#

"Oh it did. I think it took about 3.5 hours to train this for 2 epochs on my 3090. I've trained my last few 1.5 LoRAs at 1024 this took substantially longer."

queen ether
fast vector
#

Yeah 3.5 hours is too long to be honest

alpine pine
fast vector
#

wait you're mooncryptowow? Or is it just coincidence?

heady vale
#

Stack Overflow are releasing their own Overflow AI, specific for AI/ML. interesting

queen ether
alpine pine
fast vector
#

man, I actually wish comfy had something like autocompletion. It would be great for embeddings, etc

raw cedar
#

does old textual inversion work on sdxl ?

nocturne dove
#

Nothing old works with sdxl. You have to train new textual inversions and loras afaik.

deft coral
#

Can SDXL 1.0 even be trained on

#

i can't seem to train on it

glad fulcrum
#

I think textual inversion will not be able to work on sdxl. At least that's what I remember reading

deft coral
#

i get an error every time

fast vector
#

I mean there are already loras and even checkpoints

deft coral
#

when i try to train 1.0 i keep getting mussing unet

#

missing

nocturne dove
#

I haven't tried to train the actual model. Loras and so on work fine. I use a branch of the kohya scripts.

alpine pine
alpine pine
deft coral
#

such as?

alpine pine
#

this has a summary of some

nocturne dove
#

I just grabbed updated scripts, and trained my loras like normal. I only needed to set the res to 1024 and check the box for an sdxl model.

deft coral
#

ya i dont want lora

alpine pine
deft coral
#

i wanted to train a checkpoint

alpine pine
nocturne dove
#

Yeah I have not done checkpoints either for sdxl.

deft coral
#

i just get unet issues

nocturne dove
deft coral
#

they added 0.9 vae to 1.0

#

im downloading it

fresh path
#

does comfyui not install on python 3.10

deft coral
#

no

#

you need to install that manually

fresh path
#

No I mean

deft coral
#

i got comfyui on 3.10.9

fresh path
#

I tried installing the pip dependencies and some packages aren’t available

deft coral
#

odd

#

it works for me

#

on 3.10.9

visual glade
#

yeah it should work

boreal bough
fresh path
#

specifically torchdiffeq

deft coral
#

i use torch 2

visual glade
#

I use python 3.11 and the standalone ships with python 3.10

#

torchdiffeq is pure python so it should have no trouble installing

deft coral
#

3.11 i dont use

#

since it doesnt support pytorch

queen ether
#

One hour and a half in and I just hit my first epoch out of 20 for SDXL Lora training

visual glade
#

well you made me notice torchdiffeq wasn't actually used for anything important

#

so I'll remove that dependency

queen ether
boreal bough
deft coral
#

bro

#

i can get 1800 done in 10 minutes

boreal bough
#

you going over your vram by any chance?

native moon
#

hey how do embeddings in comfy ui work? when i am using them do i need to put the file name or the embedding name into the clip text box?

deft coral
queen ether
#

@deft coral what graphics card do you have

deft coral
#

RTX 4090

visual glade
queen ether
#

@boreal bough I have 12 and I believe it was asking for 17 I got it to work by enabling the gradient something or other

#

@deft coral 3060 12 gb poverty gang over here

boreal bough
native moon
nocturne dove
# alpine pine do you have an example of your settings / output?

Here's the results of a "spider-gwen" lora for sdxl 0.9. I haven't re-trained it for 1.0 yet because I need to change the dataset some and I haven't gotten around to it yet. In this attempt, the fact that I mixed costume, no costume, mask, no mask, made the lora somewhat ineffective.

native moon
#

in that case the file has the same name as the embedding

queen ether
#

@boreal bough I don't know what that means 🙂

visual glade
#

embedding:filename
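
In other words, the token in the CLIP text box is built from the file name of the embedding in ComfyUI's embeddings folder, not from any name stored inside the file. A tiny sketch of that convention follows; `embedding_token` is a hypothetical helper, and dropping the extension and the weighted `(embedding:name:1.2)` form follow the ComfyUI examples as I remember them, so double-check against your install.

```python
from pathlib import Path
from typing import Optional

def embedding_token(file_name: str, weight: Optional[float] = None) -> str:
    """Build the prompt token for an embedding file. Hypothetical helper."""
    stem = Path(file_name).stem  # "my_style.pt" -> "my_style"
    token = f"embedding:{stem}"
    if weight is not None:
        # Weighted form uses the usual (text:weight) prompt syntax.
        return f"({token}:{weight})"
    return token

# "my_style.pt" is a made-up file name for illustration.
negative_prompt = ", ".join(["blurry", embedding_token("my_style.pt")])
print(negative_prompt)
print(embedding_token("my_style.pt", 1.2))
```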

deft coral
#

@queen ether

boreal bough
native moon
visual glade
#

can you show me one?

queen ether
#

@deft coral my GPU is held up by a paint brush

boreal bough
#

you're missing some optimization settings, since multiple here run it on 12gb vram

alpine pine
queen ether
#

@boreal bough is that a command argument ?

native moon
boreal bough
queen ether
#

@boreal bough UI :0

native moon
boreal bough
#

then xformers, gradient checkpointing, 1e-3 for unet lr & learning rate, dim/alpha = 8/1, and the command argument for unet only

deft coral
#

does 1.0 even have unet

#

monkaGIGA #

boreal bough
#

--network_train_unet_only
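
For people running the kohya scripts from the command line instead of the GUI, the settings mentioned above (xformers, gradient checkpointing, 1e-3 learning rates, dim/alpha 8/1, unet-only) would translate roughly as below. This is a sketch under assumptions: the script name and flag names are as I recall them from kohya's sd-scripts SDXL branch, the paths are placeholders, and dataset or caption options are left out. Check the script's `--help` before relying on it.

```python
import shlex

def build_lora_command(model_path: str, data_dir: str, out_dir: str) -> list:
    """Assemble a low-VRAM SDXL LoRA training command (not executed here)."""
    return [
        "accelerate", "launch", "sdxl_train_network.py",
        "--pretrained_model_name_or_path", model_path,
        "--train_data_dir", data_dir,
        "--output_dir", out_dir,
        "--network_module", "networks.lora",
        "--network_dim", "8",        # dim/alpha = 8/1
        "--network_alpha", "1",
        "--unet_lr", "1e-3",
        "--learning_rate", "1e-3",
        "--resolution", "1024,1024",
        "--optimizer_type", "AdamW8bit",
        "--xformers",                # memory-efficient attention
        "--gradient_checkpointing",  # trades speed for lower VRAM use
        "--network_train_unet_only", # skip training the text encoders
    ]

# Placeholder paths, purely for illustration.
cmd = build_lora_command("sd_xl_base_1.0.safetensors", "./train_data", "./output")
print(shlex.join(cmd))
```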

fast vector
visual glade
#

that one should work fine on comfyui

sour obsidian
deft coral
#

atm it doesn't

#

everytime i tried to train its missing unet

native moon
fast vector
#

no, there are two files

queen ether
#

@boreal bough I was reading this on the GitHub but I don't know where to put it. I know where to put it in stable diffusion, but not in kohya_ss. I have something filled in under the optimizer extra arguments that starts with scale parameter, is that it?

nocturne dove
sour obsidian
native moon
boreal bough
fast vector
#

just click on other one

native moon
deft coral
#

my system just locked up

fast vector
wet rock
#

hi friends, whats the recommended vram for running SDXL?

deft coral
#

i just transferred a 12GB file in 2 seconds

visual glade
# native moon its just 1 file

{'string_to_token': {'': 265}, 'string_to_param': {'': tensor([[-0.0034, 0.0213, 0.0007, ..., 0.0280, 0.0048, 0.0047],
[ 0.0179, 0.0198, 0.0057, ..., 0.0105, 0.0117, -0.0043],
[ 0.0095, 0.0053, -0.0120, ..., -0.0096, 0.0066, -0.0025],
...,
[ 0.0037, -0.0079, -0.0430, ..., -0.0173, 0.0184, -0.0008],
[-0.0271, -0.0072, 0.0129, ..., -0.0072, -0.0059, 0.0009],
[-0.0165, -0.0159, 0.0144, ..., 0.0192, 0.0016, 0.0220]])}, 'name': '_EmbeddingMerge_temp', 'step': 0, 'sd_checkpoint': None, 'sd_checkpoint_name': None}

#

there's only one

native moon
deft coral
wet rock
#

i have 10gb VRAM but when i select the SDXL model, my GPU is stuck at 9.7/10gb 😭

native moon
queen ether
#

@boreal bough I could keep asking questions forever 😓

boreal bough
native moon
deft coral
visual glade
#

are you on the latest? try update/update_comfyui.bat

native moon
#

at first i thought comfy ui looked harder than a1111 but it's the opposite.

deft coral
#

wtf

visual glade
#

which lora and can you give me a screenshot?

static prawn
#

is there any sheet for all the styles for sdxl? so we know how we can get a specific style?

fast vector
#

yeah, there is.

deft coral
#

so i add this to mine right

#

--network_train_unet_only

boreal bough
#

assuming you're doing normal training, with 1 trigger word at the start

#

then these are your advanced settings - where xformers and such is included

#

only the --network_train_unet_only flag needs to be in that box (if you're using adamw8bit). if you're using dadapt or another cool optimizer like that, then yeah, the remaining special settings also go into that box

visual glade
#

that's not the proper lora format though

boreal bough
#

@visual glade where do I voice my complaint about the civitai lora promotion? :/
terrible loras are mass produced by the minute right now

wicked frigate
sour obsidian
wicked frigate
boreal bough
#

T.T

visual glade
#

I'm pretty sure a1111 doesn't support that format either

#

is this the diffusers lora format?

native moon
deft coral
#

you can't train in A1111

#

don't even try

boreal bough
# native moon the bad ones will get downvoted

its just a matter of considering 2 basic things
A.) this is not 1.5 - stop reusing the exact same captions - especially when they don't work in standard generation to begin with
B.) use dim/alpha of 8/1 unless you know what the difference is of not using this preset
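
A back-of-the-envelope sketch of what dim and alpha control, assuming the usual LoRA formulation in which the low-rank update is multiplied by alpha / dim. The 1280 x 1280 layer size is only an illustrative assumption, not a measurement of SDXL, and both function names are invented for this example.

```python
def lora_scale(dim, alpha):
    """Multiplier applied to the low-rank update (alpha / dim)."""
    return alpha / dim


def lora_params(in_features, out_features, dim):
    """Trainable parameters a rank-`dim` LoRA adds to one linear layer.

    The update is the product of a (dim x in) and an (out x dim) matrix.
    """
    return dim * (in_features + out_features)


# Compare the 8/1 preset with a much larger setting on one example layer.
for dim, alpha in [(8, 1), (256, 256)]:
    print(dim, alpha, lora_scale(dim, alpha), lora_params(1280, 1280, dim))
```

Parameter count grows linearly with dim, so for the same layers a dim 256 LoRA carries 32 times the weights of a dim 8 one.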

queen ether
boreal bough
#

all very successful

boreal bough
visual glade
#

is there an example diffusers format lora somewhere?

#

it's like 2 lines of code to add support so I can add it as long as I can actually make sure it's an actual format and not an output of buggy software

heady vale
hard fractal
#

ooo

#

Fancy

queen ether
boreal bough
#

"easiest way to train stable diffusion xl"
couldnt be more wrong, dear god
almost everything said in this video is not true T.T

visual glade
#

so there's no diffusers lora files anywhere?

hard fractal
#

🤷‍♂️ looks like kohya it is.

hard fractal
vital wolf
#

I'm getting the error "shape mismatch when trying to apply embedding, embedding will be ignored 768 1280" when I try to use embeddings in comfy. I saw the dreamshaper sdxl alpha 2 model using the BadDream negative embedding, can you help? @visual glade

hard fractal
#

don't tag comfy

#

for tech support pls

vital wolf
hard fractal
#

here may be good 🙂

heady vale
hard fractal
#

just don't ping the devs pls

vital wolf
azure oxide
#

baddream is a 1.5 TI

hard fractal
#

your embedding is prob-- yeah that

visual glade
#

so it will actually work a bit on SDXL but will only apply to the clip-l text encoder

#

hence the warning
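
To make the warning concrete, here is a small sketch of the width check. It assumes the two SDXL text encoders use token widths of 768 (clip-l) and 1280 (clip-g), which matches the two numbers in the error message. The function name and the dictionary are invented for illustration and are not ComfyUI's actual code.

```python
# Token widths of the two SDXL text encoders (assumed from the error message).
SDXL_ENCODER_WIDTHS = {"clip_l": 768, "clip_g": 1280}


def encoders_accepting(embedding_width, encoder_widths=SDXL_ENCODER_WIDTHS):
    """Return the names of encoders whose token width matches the embedding."""
    return [name for name, width in encoder_widths.items()
            if width == embedding_width]


# An SD 1.5 textual inversion stores 768-wide vectors.
print(encoders_accepting(768))
# A vector of any other width fits neither encoder.
print(encoders_accepting(1024))
```

A 768-wide embedding only has somewhere to go on the clip-l side, so the clip-g half of the conditioning never sees it.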

vital wolf
# azure oxide

That I know, but I saw the sdxl model dreamshaper using it, saw it in the metadata on civit

deft coral
#

hmm

hard fractal
#

it'll half-work.

and by half, less-than-half... since L is weaker than G

deft coral
#

Joe

visual glade
#

it will do something but it might not be the thing it's supposed to do

deft coral
#

@visual glade can i ask you something

wicked frigate
#

should use kohya gui

visual glade
#

sure

west breach
#

I made a node to apply a HaldCLUT to an image. Would it be possible to apply it to a latent? Not sure how the numbers represent RGB values.

deft coral
#

does SDXL have unet?

visual glade
#

yeah it has a unet

deft coral
#

because i can't train it for some reason

#

i keep getting missing unet

hard fractal
#

comfy, can you check to see if we included the unet with SDXL

visual glade
#

that's probably a problem with your software then or your settings

#

I don't think people would be complaining about memory usage if it was missing

wicked frigate
#

lmfao

spark bear
#

just stare at the noise like clouds and draw what you see, no need for unet

midnight shuttle
#

Boring old SDXL still requires a unet???

azure oxide
#

guys, theyre NEW lmao

deft coral
#

yes

#

SDXL still needs unet to train

wicked frigate
#

we might've replaced the unet with a tiny guy that sits inside your pc and draws the pictures for you, not sure

midnight shuttle
#

😕

boreal bough
deft coral
#

so i've tried kohya ss

#

and no luck

#

im running out of ideas

hard fractal
#

Try a LoRA first

wicked frigate
#

are you sure you have the correct SD model file? The official standard one not a diffusers file

deft coral
#

i can't even get the lora to work since it needs the main model

visual glade
#

maybe it's actually a wnet from sticking two of them together

spark bear
#

Freon needs to get Everydream working on 1.0 so we can see how screwed we are trying to do full finetunes on 24 gigs of ram

hard fractal
#

wnet.

#

I'm in

deft coral
#

joe

hard fractal
#

ryzen

boreal bough
deft coral
hard fractal
#

okay your issue is trying to do this in automatic1111

wicked frigate
#

i think that's kohya gui, which is just a gradio shell around kohya-ss scripts

boreal bough
#

"no, ignore, bad"

#

love it XD

deft coral
hard fractal
#

Ah, got it.

hard fractal
#

Try that tutorial step by step

autumn forum
hard fractal
spark bear
hard fractal
#

Just renamed the server

#

Since it's not only dreambooth

midnight shuttle
#

Any insight on why previous models' VAE worked with fp16 but SDXL VAE requires fp32?

hard fractal
#

Now I'm concerned we didn't ship the right file tho.

#

@sour obsidian - can you check?

#

And make sure we included the unet?

wicked frigate
#

jonatas

hard fractal
#

alexander

sour obsidian
deft coral
#

i don't like lora training

#

i train full checkpoints

visual glade
#

at least one of them did

deft coral
wicked frigate
hard fractal
#

fp8 wen

midnight shuttle
deft coral
#

kohya has support for fp8

visual glade
#

there's enough issues that I made comfyui always run it in fp32
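
One commonly cited explanation, not confirmed anywhere in this chat, is that some intermediate values inside the SDXL VAE are larger than half precision can represent. The sketch below only demonstrates the fp16 range limit using Python's standard `struct` module. It does not run a VAE, and the sample values are made up.

```python
import struct

FP16_MAX = 65504.0  # largest finite value in IEEE 754 half precision


def fits_in_fp16(value):
    """Return True if `value` can be packed as a finite fp16 number."""
    try:
        struct.pack("<e", value)  # "e" is the half-precision format code
        return True
    except OverflowError:
        return False


# Small values and the fp16 maximum fit; a larger value does not.
for sample in (1.5, FP16_MAX, 70000.0):
    print(sample, fits_in_fp16(sample))
```

Running the VAE in fp32 sidesteps this kind of overflow at the cost of more memory.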

midnight shuttle
wicked frigate
deft coral
#

i use no-half-vae

wicked frigate
#

we need at least 50 dedicated to 24/7 cat pic generation before we can get around to productive usages like fp8 testing

deft coral
#

wait

#

im stupid

#

i think thats whats causing it

#

i used --no-half-vae

hard fractal
#

to... train?

vital wolf
#

Is there any equivalent of no half vae for comfy?

deft coral
#

no

turbid gulch
#

Playgroundai.com just launched the first batch of SDXL 1.0 filters. They're super cool.

deft coral
#

in the command line

trim orbit
#

i think stability should run a contest where i win and get a h100

deft coral
midnight shuttle
deft coral
#

because no half wont train fp16

#

only fp32

#

and if SDXL used fp16 it wont work

hard fractal
#

well, we can neither confirm nor deny the inclusion of the unet.

trim orbit
# deft coral i train full checkpoints

might not be very beneficial with sdxl. the base is so good that none of the refined models that have come out have really impressed me much. they all seem to ruin the general use capabilities

deft coral
#

i know what your saying

midnight shuttle
#

I'm hoping for SDXL hypernet training someday and also inpainting model.

deft coral
#

but i have my own dataset

molten breach
boreal bough
# hard fractal https://youtu.be/AY6DMBCIZ3A

joe. I just checked that video and it has a few critical things wrong.
it's missing the unet only flag, its dimension is set to 256... and a lot more

essentially any LoRA trained using this video will
• significantly harm the clip model. longer prompts won't work correctly anymore - which is what more and more people who rely on this tutorial keep mentioning in this very chat
• have a huge filesize, when 43mb is all that is needed. especially for sub 500 image datasets
the video also
• doesn't mention how incredibly important captioning is, even more so now on sdxl
• uses the ohwx token, which was a 1.5 thing, as we have new token weights now

and more but I dont wanna spam you.

turbid gulch
trim orbit
#

most refiners do too. lots have come out and none have really improved things yet

turbid gulch
#

Nothing local tho

midnight shuttle
potent shard
#

I guess sdxl has little to no knowledge of slime rancher...

turbid gulch
midnight shuttle
molten breach
deft coral
#

@boreal bough was i right then?

trim orbit
#

loras can train on large datasets too. i think the community is just going to move towards loras. they're so much more efficient and when they're performing better than all the base models that are available, people will start making lora merges and all the scene will happen there

molten breach
midnight shuttle
turbid gulch
#

And free. They give you 1k images a day for free. Their sub model is like $15 bucks a month.

boreal bough
trim orbit
#

inpainting with comfy is possible it's just a pita

molten breach
deft coral
#

@boreal bough i never watched the video

#

thats why i kept asking

midnight shuttle
turbid gulch
trim orbit
boreal bough
#

ah yeah, but the training settings in it basically mirrored the ones you had
guessing it's a game of broken telephone in regards to lora settings.

the settings I linked you, I can guarantee, will result in a good lora

molten breach
turbid gulch
#

With Canvas and Davinci Resolve you can do everything you do at Adobe.

#

Free

midnight shuttle
#

But it sounds like some site has access to the pre-release inpainting SDXL model.

midnight shuttle
boreal bough
turbid gulch
#

This is what their Canvas looks like. Inpainting is at the top.

#

4k

deft coral
#

@boreal bough

midnight shuttle
#

I hope A1111 can get SDXL working fully and with better VRAM usage (needs 16 GB right now - 14.5 used max for 1024). Or else maybe SD.next will be an alternative. I still need to look at that one.

deft coral
#

i can't even get it to work

#

im just gonna wait for A1111 to fix SDXL

midnight shuttle
limpid rose
boreal bough
deft coral
#

kohya doesn't work for me either

deft coral
#

i get the same unet issue

boreal bough
#

use that. its foolproof, and uses kohya in background

turbid gulch
#

Inpainting with prompt + controlnet

midnight shuttle
queen ether
#

after like 2 hours of training only 3/20 epochs completed kohya finally crashed ❤️ i love god

turbid gulch
#

Yes

midnight shuttle
#

OK that's good news. Thank you. It means the unreleased inpainting model will hopefully be available someday.

turbid gulch
#

It doesn't work on 4k - 32k images. You have to shrink them, then it works.

#

Their max upscale is 4k

#

Davinci Resolve will upscale to 32k

deft coral
#

@boreal bough how can i nuke my ven

boreal bough
#

@golden quarry would it be possible to get an update for kohya?
gonna try and get the lora training tutorial out on sunday, and wanna link your distro for it.

boreal bough
turbid gulch
boreal bough
#

venv folder*

deft coral
#

so fresh install

boreal bough
golden quarry
#

I was pretty busy overall

dry anchor
#

Hello guys, is there any lora training guides for sdxl1.0?

west breach
#

so how do you train a lora without trigger words?

boreal bough
trim orbit
#

both kohya and automatic1111 with the db extension load up with xl for me. i don't think the fault is with their commits

#

i think its a local issue

queen ether
#

--network_train_unet_only is not an option in my kohya and i have no idea where to put it ❤️

west breach
queen ether
#

kill my self

trim orbit
midnight shuttle
#

Is there a good way to get A1111 to detect a .safetensors VAE that is named the same as the model? It only seems to want to use .pt VAE files that way.

cunning geyser
#

So i want to train an anime character on sdxl, can anyone teach/tell me how to do so? REM_think

#

I kind of dont know what settings to sue

#

use*

boreal bough
# west breach I was thinking of training an art style

I would still recommend a trigger word, as it will make your life easier.
but if you wanna do it without anyway:
-> Use Interrogator with Vit-H to auto caption everything. Accuracy doesn't matter too much. it's only important you have enough tags.
-> turn shuffle on
-> have a dataset that is big & varied enough that enough concepts show up (car, house, person, toaster, whatever - just make sure it's not all of a person, otherwise ALL captions will learn to create people xD)
I'd recommend a 100~500 image dataset if you wanna do it without a trigger word
50~100 images to get a good varied style with a trigger word
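
As a rough illustration of what "shuffle on" with a trigger word does to a caption: the tags get reordered each time the image is seen, while the first tag stays put. This is a standalone sketch, not the trainer's own code, and `shuffle_caption` and `keep_first` are names chosen for this example. The caption text is made up.

```python
import random


def shuffle_caption(caption, keep_first=1, rng=None):
    """Shuffle comma-separated tags, leaving the first `keep_first` in place."""
    rng = rng or random.Random()
    tags = [tag.strip() for tag in caption.split(",") if tag.strip()]
    fixed, rest = tags[:keep_first], tags[keep_first:]
    rng.shuffle(rest)  # only the tags after the fixed prefix are reordered
    return ", ".join(fixed + rest)


caption = "grumpy style, portrait, old man, grey hair, studio lighting"
print(shuffle_caption(caption, keep_first=1, rng=random.Random(0)))
```

For the no-trigger-word case, `keep_first=0` would shuffle every tag.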

trim orbit
cunning geyser
#

Oh thats easy

#

I just don't know what settings work

west breach
trim orbit
#

with dadaptation trainers, settings are 1 and 1

cunning geyser
#

huh

trim orbit
west breach
#

I trained a 'grumpy style' lora which makes any face grumpy looking. But you need to add 'grump style' to the prompt

boreal bough
trim orbit
boreal bough
heady vale
#

some funky eye lashes going on here

midnight shuttle
#

Looks like neurons.

queen ether
nocturne dove
pallid sail
heady vale
#

fractal lashes

trim orbit
deft coral
#

bro

#

this is so good

#

imagine if i can make my own checkpoints

#

@boreal bough

#

this has so much potential

boreal bough
#

👋

#

my biggest achievement on sdxl so far is getting finetune level of improvement by training on a 5k image dataset, resulting in a mere 43mb LoRA file 😄

#

got over 100 learned concepts working

trim orbit
#

hmm. i don't know how to make comfy ui stop fudging my symlinks and rewriting new folders over them

#

one second, going through all my loras, next second, "undefined" cause it decided to make a folder there

autumn forum
trim orbit
#

all the checkpoint refinements i've seen so far have underwhelmed. while loras push the base in new ways.

boreal bough
trim orbit
#

Hm. Civit is throwing a contest. But it's also kind of "A model that is in dire need of some tweaking." ish. Which means the contest is for a porn model. If i enter the judges won't pick mine.

boreal bough
#

rules! XD always read them

trim orbit
autumn forum
#

oh i forgot he has me blocked lmao

trim orbit
#

they're very skirty like that. when they blocked NSFW submissions, all they did was make pornographic merges show up in general results, but with their porn images not showing.

#

the 3 winners will be boobs, bazongas, and butts

deft coral
#

@autumn forum i would if i could fix the unet issue

#

so i can't

trim orbit
#

when their call to training says that sdxl is in dire need of training, we know what that means

vale eagle
boreal bough
#

either way, lykon gonna get one of those 4090s for the dreamshaperXL base - since by contest rules its gonna beat everything by virtue of being used as a base by many others
and for the other two :/
character is hard as hell to get right, as you'll get to experience all the downsides of the refiner - meaning that lora will kinda cheat by being an existing concept, that is only reinforced well
style is the only true competition, as its also the only one that can be trained on all machines, and is by far the easiest to train

trim orbit
trim orbit
vale eagle
boreal bough
# vale eagle How do you caption with this size dataset?

yep. what flowwolf said is what I did.
Interrogator running Vit-H

but I really wouldn't recommend it as a starting thing for sdxl, as big datasets are the hardest to get right - and each failure punishes you with 20~40hours of training time (on an rtx4090)

midnight shuttle
#

Curating and captioning data sets is the hardest part of any training. The actual training process is easy.

trim orbit
#

my last lora with just 200 images cooked for 19 hours over 2 runs

deft coral
#

im still salty

pallid sail
#

I've noticed these weird pixel patterns showing up in my SDXL images. Is it to do with the scheduler? Attached image is via Diffusers with SDXL + refiner

boreal bough
vale eagle
#

I tested a training run with 15 images with self-made captions. It's already able to reproduce decent features of the original images.

deft coral
#

is this a joke

trim orbit
deft coral
#

24GB for 1 image

trim orbit
#

i think

deft coral
boreal bough
#

you got a setting error somewhere

trim orbit
boreal bough
#

I run a 4090 with 24gbvram and can do batch 8

deft coral
#

how

deft coral
trim orbit
#

i think i need to tell windows to put my page file on the new gen 4 ssd

boreal bough
trim orbit
boreal bough
# deft coral

ooohhhhhh. yeah I was talking about training a LoRA

trim orbit
#

thanks for the advice. i'll try it on my next training run on that same dataset

boreal bough
#

batch 3 is maximum on 24gb vram

pallid sail
deft coral
#

i used to do batch 10

#

on 24gb

boreal bough
#

and that's using comfy, as A1111 has a lot of unnecessary overhead

trim orbit
#

encoding the watermark as pixels prevents the embedded data from being lost in transit

#

automatic probably won't have it enabled by default

pallid sail
dire rampart
trim orbit
nocturne dove
#

How do we know the pixel pattern is a watermark? Where did this info come from?

turbid gulch
#

Emad on Twitter.

pallid sail
nocturne dove
midnight shuttle
pallid sail
trim orbit
midnight shuttle
pallid sail
trim orbit
#

the vae issue produces interlacing lines, it might even be intentional to give a chromatic aberration effect

#

it doesn't happen on my hardware though so i don't really know

nocturne dove
trim orbit
heady vale
trim orbit
#

i've got stable studio loaded but don't want to use a separate install. so i'm trying to point it at my existing comfy and models but it all hates symlinks so much

dire rampart
deft coral
#

so i was right with the unet missing

#

ill just wait for it to get added to the main checkpoint

west breach
#

@Caith do you have a favourite tool for tagging by hand?

boreal bough
boreal bough
deft coral
#

if thats the case

boreal bough
#

best for manual tagging. also scales well for bigger projects

deft coral
#

why the hell doesn't it work for me

boreal bough
#

takes a while to learn it once - but absolutely worth it

pallid sail
west breach
midnight shuttle
deft coral
#

and that's what im trying to pin point

boreal bough
#

for automated tagging, use A1111 -> extension "Interrogator", load the Vit-H model, and have that batch generate your captions. best (quick and easy) automated workflow for now

#

@west breach ^

midnight shuttle
deft coral
#

if that's the case then why is kohya ss telling me different

midnight shuttle
west breach
#

I used BLIP originally. But the captions were very short and non descriptive

deft coral
#

what other options can i investigate

#

i've done a clean install

native moon
midnight shuttle
# deft coral what other options can i investigate

I don't know. Start by reading all available documentation about how the system actually works. Not about how to use it but how it works. Then use those learnings to identify the problem. No one seems to know the answer. So you must do this.

deft coral
midnight shuttle
#

It is new technology. If you wish to use it you must be willing to learn.

trim orbit
#

oooo my terran lora works nicely when i don't prompt for what i was captioning and just do natural prompts like i would on the base model

dire rampart
#

has anyone tried Kosmos-2 from Microsoft? It would do a visual scan and return tags

boreal bough