#✨|sdxl
1 messages · Page 32 of 1
Family Photo 🙂
Thank you
SDXL seems to do bad in anime character or scene generation, I hope it can have a special anime version with surprise performence like niji jour
ney。
Sdxl today?
2moro 26th
Wednesday is 26th mate.
Ideologically, I disagree
When's day 26?
same. I think days, rather being tied to factual labels and conventions should be allowed to speak their own truth. Tomorrow can be wednesday if it wants to be.
What is closest Model I can use to get the MJ Niji look?
I want to get this look (this is an MJ Niji Creation)
Exactly, forcing any specific day to use a single label no matter what is pretty fascist imo
ComfyUI Prompt = beauty in the style of basquiat, rauschenberg, warhol, hambleton, haring, jessie makinson, richard burlet, inuyasha, john caple, victo ngai
kinda looks a bit jamie hewlett
n Ice
also this is still without the refiner
its that thing from fortnite
Niji -ish
time machine
some questions:
- what version is running on this discord? 0.9 that will be trained to become 1.0?
- what are the settings that are used by the bot? steps, cfg... are them listed somewhere?
- will 1.0 be released in few days or will be released 0.9?
thanks!!
middle of nowhere
- they are running the latest version that they are still working on and isnt released yet
- undisclosed as they are using the bot to test these parameters, each generation uses different settings probably
- 1.0 will probably be released on the 26th
Hope this'll do something like Niji in SDXL?
I love all of these
Getting Niji-er and Niji-er (is that a word?!) 😄
I kinda love this prompt in SDXL
(Forced Perspective:1.2) Unveil a microscopic society thriving within the creases of a crumpled piece of paper. The inhabitants, agile and minuscule, use the folds as paths, navigating the mountainous terrain with their ink-drawn vehicles. Their world may be minuscule, but it pulses with stories writ large
hi can anyone help me to install art diffusion xl 0.9 on google colab?
Hello👋 Do you think that sdxl 1.0 would work with controlnet and deforum at the same time ?
Best to head over to #🤝|tech-support for help with installing/running models!
Lots of incredible people over there focused on exactly those questions to help you out ❤️
Will there be a separate inpainting model of sdxl when it releases?
i made one in kohya's newest version and tested in in comfy. it works there. automatic tries to load it but fails.
deforum is just a script for setting up the next frame's generation so i see no reason why it shouldn't already work. People who are creating may just be holding off on releases because of the research license with 0.9.
controlnet will need new models trained
Cute thing to note about this project is they got permission to release it under the research license by Stability.
People who think stablity are trying to censor the model in order to oppress humanities and control populations, will conveniently ignore this massive gaping hole in the side of their argument. Or they'll see it and call it fine because it's gaping and they can't understand context.
any chance you could remake that in 1280x512 version could ya? 😉
was just trying to get something simliar on a1111 for the background of a blender scene i'm working on 🙂
rendering at 1280x512 wouldn't get the best results as it wasnt trained at that res, please choose any on these
cant even get an astronaut on deliberate lol
1344x768 would probably work fine
wanna stick it here
@hoary saddle
those are perfect! if you wouldn't mind me using one, i'll tag your social for credit and put it on the art itself friend
gorgeous
yeah you can use, no need to tag me(i am uncomfortable sharing my socials)
appreciate you kindly! came out great
that's a lora of a dog? or just shiba inu prompt?
prompt
no funny embeddings. SDXL is by far the best model even without silly embeddings.
If you want help you'll need to give a bit more information than that. We don't even know what your talking about.
I've noticed that sdxl 1.0 seems to have gained very significant progress in the last couple days, no? Or is it just me? I have been judging many of the generated images every day and hopefully this will play a role in the improvement of the model. I'm sure sdxl1.0 will get better as we work together.
sdxl1.0 is better than sdxl 0.9 for sure. But I don't think they would continuously update the models during the past few days.
U see what u want to see. There was 3 models to choose from. Maybe there’s only 2 now.
And those two may be better than 3
Or they are tricking you and using SD 1.5 finetunes XD
But iirc they haven’t altered the models
Ez to test. Just tell it to give you text.

Would be hilarious tho
If they used 1.5 as a control of sorts
as of 4 days ago they havent changed anything, i dont think theyd change anything still
re:
I don't think there has 1.5 behind the bot anymore.
I was joking
But yeah people thinking stuff has changed just shows how bias works
They get a few images a certain way that they like
Then they get a different type of image
"It's horrible"
may be
You can go from a decent to horrible image quite easily in the same seed, just by changing things like CFG, Steps and resolution
It is quite hard to compare because you don't know what going on behind the bot. Even using the same prompt, you could get a better then bot image and you also could get a worse than bot image using sdxl0.9.
we're honing in some parameters.
removing ones that very clearly would produce bad images.
Is that parameters like the samplers, steps and cfg, or parameters in the models themselves?
but but but thats censorship! ||/s||
WIP
Those people moaning about that sort of stuff i swear either haven't used the model. Or they are expecting some sort of hardcore porn generator for some reason.
they want the holosuite at quarks if you know what i mean
Candy-style!
I'm wondering if it's possible to write back to a field in a custom node, anyone know?
Selection here should ideally update height and width.
For flexibility reasons.
man wouldn't it be nice to have a really convenient UI that uses comfy's workflows & engine but gave you any easy native link between aspect ratio/resolution with easy dropdowns n stuff, like this,
maybe with like sliders if you selected 'custom', yknow like the good ol
shame that doesn't exist and i just made these screenshots up, oh well
both! model variant is one of the parameters that gets randomized
and workflow and etc
I really like those!
is there a new release date for SDXL yet?
There is a stage event the 26th, if it doesn't get released then we'll probably get a release date instead.
I hope the wait is worth it and the changes from 0.9 is actually significant.
be cool to automatically calculate the high res fix sizes by applying the aspect to the same megapixel count as 1024 ** 2. i made a python/gtk app to do that and it seems to do well.
guess I could probably just turn it into a node with 4 int outputs
I've made a multiplier for an aspect ratio node that was posted in here to do that.
WIP
amazing difference!
too late; already made my own
why have one way of doing things when you can have 17 incompatible competing standards
lol, mine was just a quick thing because I got sick of having to work out the multipliers
real baller move would be the ability to bake a custom node into a workflow so you can redistribute your workflow without them getting the big red
That's a little sketchy though
Embedding a python script into an image file
No thanks
it'd only be eval'd in comfyui
could also convert it to bytecode so it can't be accidentally interpreted
It would still make it more difficult for you to see what it did before running it, although it's not like most people actually look.
oooo
?
git pull time
I'm guessing this is so we can do something similar to the [Thing1:thing2:0.5] that Auto1111 has
yea I was just asking comfy about that yesterday
or the day before
something
time
do you know if nodes can reference other nodes?
So you can for example make a node that wraps two KSamplers together
I think there's some custom nodes that lets you do things like that
I imagine you could shadow the input types and feed it through eval(f"NodeClassName{NodeClassName.FUNCTION}(...)")
long as you can import the class directly from somewhere
I'm running SDXL 0.9 ON Vlad 1111 and after 47min it produced black square 512x512 🤔👎 Now I'm trying 1024x1024 it says 1085min 😱 Anyone had same problem and how have you fixed it??? Btw I run it with different model and it made 1 image in about 40min 👍
What are you trying to run it on because that's obviously not working properly, unless maybe you are trying to run it off CPU?
Yep on cpu
Yeah, you aren't going to have much fun with that
any news on if any work on SDXXL has started yet ?👼
ooof
I'm not concerned too much with time, more confused why it produced black square??
precision problems. try using xformers
does xformers actually change anything on cpu
i expect the only precision available would be float32
or theyve got that plugin that turns NSFW into lack squares ;o)
All I tried to test on was sticker, vector puppy🐶
Is it possible problem is with SDXL not running well on vlad or a1111?
Well it seems to work, woman crossed with frog/lizard man
I didn't see the actual parser for [token:token:time] in that commit
unless its earlier
It's a node
Ohhh
Two Text Nodes, 1 into it's own ConditioningSetTimeStepRange and then both those into a Conditioning Combine
However @visual glade I would say, these new ConditioningSetTimestepRange nodes. The values on them seem backwards.
In my head I'd have it so 0 is the start of the step and 1 is the end.
So for a full generation you'd have start on 0 and end on 1. Or if you wanted to stop half way through you'd have start on 0 and end on 0.5, but the way you've implemented it, it's backwards, where Start 1 means it will run all the way through all the steps and end 0 means it won't do any.
Might just be the way I think about it though.
oh, yeah it's because the first timestep is 999 and the last one is 0
so that's why I see it that way
It's super confusing to me lol
Works pretty well though, even if I can't get into my head exactly where it's cutting in
I guess I'll swap them then
Can I successfully generate images 256x256 on sdxl? When I run other models with 256x256 it takes less time 15-20min and than upscale with topaz
I mean it's up to you, I'm only 1 person, others might think the same way as you.
But having it the other way seems logical to me.
no SDXL only generates images close to 1024x1024
it works with other res but 1MP is the optimal amount of pixels
I'm sort of thinking that you'd be like ok I want this conditioning to start at the beginning so 0 and i want it to stop at half way, so 0.5 on end
Which package will work better on CPU and with what settings? I have Invoke 3.0, vlad, a1111 and ComfyUI
dont you dare 
I am the other person that likes it going from high to low haha
Lol that just seems completely backwards to me
I guess you could add a toggle, but I don't know if that would confuse people even more
node could have a tooltip explaining why it's backwards
the debate between "noising" timesteps vs "denoising" timesteps
It's not the reasoning that's the problem for me. It just doesn't fit conceptually in my head when the numbers are backwards, so I find it really hard to work out where exactly in the process things are changing over.
create two primitives and title them the opposite of what they feed into on the timestep node
didnt we end up flipping clip skip notation around because the first way made sense behind the scenes but the current way makes more sense
could be a similar case
it makes more sense from how it works to have 1.0 as the start but I guess people are more used to 0.0 becing the start
ya i remember this was a good explanation of how clipskip worked
It just completely screws with my head the way it is now lol
I set the number for "clip skip" negative in comfyui because that's the only way it made sense to me
That makes sense I think because your using less clip layers.
The only reason this is an issue is when I'm quickly trying to figure out where to make something cut in and for whatever reason my brain won't do it when it's backwards
it's because in python and other languages -1 is the last element in an array, -2 the before last element, etc...
I think if your thinking about it from a purely mechanical stand point your numbering makes sense.
But I think if your just looking at the nodes and wanting to make sense of it procedurally, it's confusing.
I think it's either that, or it's the "Start" and "End" words that don't make sense
Because in the code you have "start_percent" and "end_percent"
ok, I see the controlnet extension has it 0 to 1.0 so I guess I have to change it to match since that's the standard
if you update now I swapped it
having random thoughts/ If there are 11 distinct steps from 0 to 10 and there are 10 distinct steps from 1 to 10 why is the halfway for both 5 ?
Cool
is clipdrop using 1.0?
do we have a release date yet ?
i'm with you here, everything else in the UIs out there count up from zero
we're waiting for Emad to finish Tears of the Kingdom, so about 2 days or so
He's changed it so it matches, so all good.
just saw that
Going to need some playing with but you can get some cool stuff with this
rule 63 saiyan leonidas
noooooooo lmao.
I mixed the dog and the woman and it turned him into fur trim on her top 😢
That new node is awesome, I can finally get weird coloured skin to work
sd xl release on 26th event?
the last announcement said "in a week or so", without a specific date
okay so where are we with the release candidates?
apperently we are close becuase the bots got updated and we were told to make sure we pick which we like better becuase the outputs will be almost identical to each other
the bots.. lol
in the bot channels like #1100170312106127410
voted for a few but the images look close to each other.
i literally said it would be, pick the best in your opinion. we are deciding on which sdxl1.0 model we get
it does, just gotta generate/vote on more than a few
A node allowed this? What node?
ConditioningSetTimestepRange, Comfy added it today
yeah
Thank you for the clean-up 
comfybox i finally got around to making it work. cool.
this can be used to do prompt editing?
It lets you merge or change prompt mid sampling
So if you've used auto, similar to the [thing1:thing2:0.5]
oo, what's this new ConditioningSetTimestepRange do?
yep prompt editing ; ) nice
oh that!
It's a lot more customisable than A1111 too
nice!
I'm so done with A1111
one of the features I really like. did some crazy stuff with prompt editing TI embeds in SD 2.1. a must-have feature for me 😊
The nodes in comfy can be a bit cumbersome at times, especially when you start doing more complex stuff. But there's comfybox that can get around some of that. And from what mcmonkey has been saying I think he has something cooking in that regard.
i agree! i was a die hard a1111 fan with 1.5 but sdxl made me change and im glad i did!
I just looked through the github commit. have you tried it yet? can you do token weighting within a TimestepRange?
like [(foo:1.2):(bar:0.75)]
No it's a node. So you set up two text conditioning nodes, put them through the time step nodes and then combine them.
I see. so it's not working like {wildcards} in a prompt field. you need to build that for every range you want to add.
much more flexible but could get wild if you prompt edit the whole prompt I guess 😉
Yeah
maybe a custom_node might be able to integrate it in the same field
if one would want that
It's a mess on my workflow, but like this
I see. need to do some conditioning experiments I guess 🙂
is SDXL for auto 1111 out yet? like, not via web tokens, but for me to use my own GPU with"?
afaik only in the dev branch. so not officially
that only automatic1111 knows

Maybe the new node is useful, but way too tedious to actually use, while I'm glad to see ComfyUI getting more features, the nodes is holding it back instead of allowing for more complex usage.
I'm sure somebody will build a prompt editing node combining some of the features. I would if I would knew python better. A couple of experiments are already out there. a node interface is hard to beat if you want to be able to do custom workflows without writing code.
The problem is that you kind of need to write custom nodes to make it work well and now you're writing code.
yeah and I think that's okay. you can build a really simple workflow but you can also build more complex ones and try out so many interesting ideas that you won't able to do in tool with a static workflow.
I mean it's great there are many different tools. I started with ComfyUI a couple of days ago and I really like that I can build a workflow to my liking
The issue is that it is horrible to use for any complex workflow, I've tried. It becomes rigid and failure prone. If you want to develop a custom workflow in a good way your only option really is code.
sure you can always go more hands-on with the tech. but from the current time I've spend with comfyui I'm thinking differently about a lot of the stuff I've been doing before. I mean it's great that there are other tools so you can pick and choose what is important to your own workflow.
Alright, I think I might be stopping my SDXL research indefinitely
nothing is perfect and I can see limitations in every workflow
GPU failture Sytan?
why?
Yeah, it's back to failures
It was working fine, one of my temps was hot, I meticulously took it all apart and reapplied everything, even optimized the mounting pressure, now the card is just as useless as the day I got it
Rebuilt it 4 different times
Sick and fucking tired of this card
is it your new or old one?
New GPU
hmmm... damn
I really wish I could just sell this card off to somebody else and have it be their problem, but that would be shitty of me
Sure there are, but I cannot replicate any workflow in ComfyUI I've used in Auto1111 with it's basic nodes.
But like u should
So I should just scam somebody else by selling them a GPU I know is faulty? I couldn't be that shitty of a person
Return it to seller and get your money back
Can't
paypal?
Yeah, but I won't win
may as well try
180 days / as long as you have documented proof
Already am, but it's just a waste of time
I think you wrote him about the issues
I actually might be able to fight him now that he admitted negligence on his end
if you bought it from a commercial seller the buyer has a good chance to get the money back in a paypal dispute. if you have send the money to a person it would be different
Did you try a clean windows boot?
ya paypal usually sides with the buyer from what ive read and in my experience (twice) its true
is there a workflow for getting past the 77 token limit for CLIP?
idk if comfyui does it automatically but you can use the combine or concatenate nodes in there to achieve that
I am using automatic1111
it should be automatic in that case
that's in launch.py?
oh its for sdxl and a1111 huh
ya thats a tricky one
ohh were you talking about 1.5 etc
nah the nodes im talking about are in the gui itself, you can create them
no just comfyui in general
gotcha
been using automatic1111 for about a year, but I've been away for a while...are otehrs catching up?
comfyui has always been toptier imo but from the looks of it, its coming out on top with how sdxl is turning out
orly I thought automatic has always been top dog
ya it was always the most popular
but again I've been out of SD practice for several months
it's not even really implemented
ya
we dunno how its gonna work* with the refiner model too so its gonna be weird
I mean I don't a1111 for teh elegance...I use it because it has so many things
wait is the "refine" process not actually a refiner model?
that guy
oh looks like he did find a way to implement it
from what i read in its early "sdxl compatible" debut, the refiner wasnt working
yeah. super annoying tho, you have to find the file and bring it back in by hand
like idk why you can't just copy it from the other side of the ui
with comfyui there's no 77 token limit for sdxl?
I really like ComfyUI a lot... I have been getting amazing results... It seems way more customizable and I think its the way to go
do you know if it has a 77 token limit for sdxl?
it does
thanks
here is more info by the developer of ComfyUI #✨|sdxl message
So actually it can go above 75 tokens. It puts the info into blocks of 75 tokens and appends them..
It doesnt even average them either which is good because you dont see a drop in impact
I am very confused by that thread. First they say that it uses more tokens, then they say that SD can't handle more tokens, then someone asks about that, and there is no answer
I am talking about SDXL. @upbeat summit just said there's a token limit for it in comfyui
the model has a 77 token limit. as other SD models before it. comfyui does split your tokens into blocks of 75 tokens and appends them
appends them how? if there's a 77 limit how can more be appended?
seems like it's not a limit if more can be added?
like, why would a user care about the difference between these things?
there is some math to it. a1111 averaged your prompt out when you went over 75 tokens. comfyui does it differently
so this is implemented in sdxl as well?
most end users really wouldnt care or think about it... it just allows you to use longer prompts
it works the same with sdxl in comfyui. I don't know what a1111 does with sdxl. but I would think they handle it the same as with SD1 and SD2
most users won't care. most users even use more than 75 tokens per prompt
yeah that's what I am trying to figure out here. I just want to know if I can use longer prompts, if there's an easy way to add this to a1111's implementation of sdxl
they do not, it's limited to 77 rn
(only for sdxl)
well they will probably implement the same functionality they're using for the other model versions
off to dig thru the issues list ig
afaik there is no problem using SDXL in ComfyUI right now. at least I'm using more then 75 tokens a lot 😉
I have some pretty long prompts in ComfyUI... All of the text from those 3 boxes count as 1 prompt
and maybe install comfyui...I assume there's no overlap between the massive amount of crap that needs installed
I actually think comfyui is easier to install and get working
oh and one other thing...there's a facility in comfyui to install it on a host and use any client right? just checking before I start to dig. my art slave lives in a closet.
ah yeah, I see, it's browser based, so this should be the case
you would have to set the command line arguement to --listen 192.168.1.43 for example so it doesnt use localhost 127.0.0.1 and you can point it at your monster in the closet
depends on the your specs tbh
3080
then you should be good
Will sdxl explode enough to make people shift from 1.5 to sdxl?
most of the annoyance of installing a1111 is just waiting for crap to download
I think so especially with custom models based on it. sure
tbh I never liked 1.5 better than 1.4
Damn.
well it depends how experimental you are and if you want to try building your own workflows, but you can build some really interesting processes in comfyui that help you build very weird looking and cool shit 🙂
nice, I am keen
So comfyui is better I'm told?

I have been using a1111 for too long too long to know
it depends on what you are wanting, comfyui is way more flexible, a1111 is just point and click
and yeah I like to get experimental. here's a barbie inspired one
its like a dslr vs a phone camera
https://imgur.com/a/0m56ABF comfyui tends to start looking like this
im not a stranger to node graphs that look like this...making me wonder why comfyui isn't just a part of nuke lol
give it a try. I've been an a1111 power user for the last year. there are some QoL features and workflows you may have to rethink - some stuff is certainly different or not available yet, but I really enjoy building my own workflows. but that is just how I use it and work with - mostly txt2img.
Is rtx 3060ti compatible for sdxl??
nice. yeah I've always gone for bulk which is why I am a bit hesitant. What I like about diffusion most is the ability to rapidly give me many options
but I will try
for me i mainly use a1111 for my goofing off, training prep, etc.... comfyui i tend to use if i have a few images I need to need to iterate on and get super high quality proofs
yeah. I've always wanted a node based tool for AI images after using nuke, blender etc
substance
bet adobe's working on a diffusion implementation in substance
but it will be crippled like all commercail models
goofing off you can do 😉 you can still do the training in a1111 or kohya_ss. also you can always use both tools
make it part of the adobe cloud subscription damn it
comfyui is a bit combersome to use when im goofing off... sometimes i just want it to work and a1111 does a good job of that
I mean, it's kinda pointless to use their diffusion stuff. it will cockblock you constantly
this will help justify having two different ~50gb installs of SD i guess
i just symlink the model directories between the two
photostop's AI implementation is great for fixing things, in- and outpainting, but for creating images it's not so much fun.
venv alone was like 25g last I checked
noooooo
just build yourself a goofing-off workflow 😉
goofing off mode... dont want to dig through folders looking for the workflow json... i just want to load it up and click go
yeah. but if I want to build on something like this it's a nonstarter
photoshop is great for real life work situations - since that's what its supposed to do
👀
not if you're working on something even slightly pg13 for a movie
if comfyui ever gets a collection of preset workflows similar to A1111 scripts like prompts from text box, outpainting, tiling, matrix's, simple render, i could switch over
ph'nglui mglw'nafh ❤️
sure that's why it's great we have different tools to use. but drag and dropping a json or image from your favorites folder doesn't take too long? but I get what you mean.

actually I never make lovecraft references. didn't find it was useful, goes into hokey territory all the time
even though it's the feel I want often
yeah. 'lovecraftian' in the style prompt is the most I go for
Who does loras for sdxl?
As of right now?
i find mashing together a bunch of weirdos works better than adjectives. or in conjunction anyway
Just want to ask a time estimate for making a lora if you got 12 gb vram. 
12 works on some gpus & environment setups, but not all. 16gb vram works on all pcs
that is dope as shit dude, you got an ig or something like that?
12 gb vram nvidia
love me some pop surreal feels
https://twitter.com/masslevel (or X? whatever it's called now) but you can find most of my images on this discord right now. just search for:
from:masslevel has:file
So if i had a 12gb vram nvidia it'd take 30 mins right to make a decent lora?

the word is that SDXL 1.0 will be much easier to fine-tune compared to 0.9
assuming you have bf16 support (depends on your specific nvidia card - google is your friend), if all works out, then yeah. about 15~20min to train a good lora
^use these settings to get it to work
offset noise = 0 (cause then it auto sets it to the correct value)
I've managed to train LoRA with fp16 too.
its working now? 😄
Last time I tried yeah.
noice
Though my settings gives me 4 s/it because I run out of VRAM.
though I'm curious to check kohya changelog if it just ignores the flag XD since the recent code updates have been a bit hacky
with fp16 working, it should work on all 12gb vram cards though, so thats good
tyvm
So this is how I make a lora in generally 15-30 minutes?

What gpu u have?
RTX 3060
• good captioning is still relevant
• Dataset size of 30~60 to avoid typical issues. 10 or more will work fine if you know what you're doing. under 10 works if you really know what you're doing.
• set repeat to 1, do 40 epochs. 20 epochs will prob be your 'ideal' lora. so save every few epochs
Lucky u got 12gb I got 8 and got the 3070 😭😭😭😭
wait what
Can you explain that last one

dafuq is setting to 1 and 40 epochs do
Dont we want an image to repeat usually around 20 times?
does anybody know a comfyui node that has a text input and saves the field in the metadata? I want to save a processed wildcard prompt in the file.
I build a metadata .txt generator to save the processed prompt in an image companion file, but I would like to have it in the image's metadata as well.
right now only the unprocessed prompt is saved and the processed prompt is lost.
ultimately, total step count is what matters. so 10 repeats with 1 epoch = 1 repeat with 10 epochs
just much easier to tweak settings if you keep repeat at 1.
..
So wait i just want 20 steps?

I do not think I follow at all with this logic, since this seems like so little steps
image count x repeats x epochs = steps
it depends on what you are doing, some lora's i had to bake for 200 epochs, some needed 20
keeping repeats low, only helps you save the model more often. so you can save it like every 2 epochs, to later compare results
So basically I don't need to overdo it with steps like 1.5?
I can just do 20-200 steps and im fine with lower end based on complexity of design of character/idea?
So 20 for a simple char -> 200 for a stupidly high accessory char or style based gen
the 200's tended to be multiple object lora's
Yeah like asset based world morphs is what i mean
or style based on artstyles in case

Gotta make sure it do be accurate or like it
Tyvm for the help on that.
Kind of have to fall behind since I have to work on the lora database/spreadsheet of all 1.5 loras
So i'm trying to ge tin as much sdxl prep i can while listing
as long as you stick with the settings I gave you, same steps regardless of complexity. steps only need to go up if you use significantly bigger datasets - which can be used for training multiple concepts into 1 lora
How fast do you generate images like not create lorasa but comfy ui images
Just call twice the common_ksampler in the custom node. I was made a custom_node like that. https://civitai.com/models/108594/sdxlmixsampler-comfyui-jnode-workflow-included
To support my work, you could buy me a coffee https://www.buymeacoffee.com/JasonAICreator SDXLMixSampler The concept is to make one node to use bot...
Things may have potentially just changed, I have a possible opportunity to get a different 3090 relatively soon depending on my luck in sourcing sellers
Wait now i got a question
Fix your sentence please, but about 20 seconds for 20 steps for a 1024x1024 image. 2-3 seconds more if I also use the refiner.
Wait question
Though this is moreso for lora training
How do people do a certain tag for the dress of a character vs a tag for the character itself

Cause its weird how some people can 'tear' them apart and make one usable on another character.
have a dataset of multiple people wearing the dress, then do a trigger word for the dress, shuffle the rest. Make sure the 'person' is mentioned in the shuffled tags
if you want to train just the dress from one person, then do the trigger word for it, do not mention the person, nor any body parts, then ideally you'll get just the dress in the lora is sdxl
if it works on the first try, good. else tag the clothing pieces that didn't get absorbed into the trigger word. (like specific boots)
fyi, if you tag the boots from the start,then it will almost never be absorbed into the trigger word
As I know, the trainning dataset separated into multiple subset. 1_dress, 1_swimming_suit, 1_fullbody something like this.
it can be done with just one folder though without issues
just helps organize, or have irregular repeats
my biggest sdxl test, was 5k images trained into a 41mb lora.
it successfully learned over 100 concepts - including multiple dresses
would not recommend though, as it took my rtx4090 a total of 18 hours, and was still a bit undertrained. will rerun it on 1.0 release for a full 40 hours
It is amazing.
Nice
Do you know how much of it was the LoRA and how much of it was helping SDXL doing things it already knew?
its going to be interesting tossing my 1million image dataset into a lora, dunno how well thats going to work tho, might have to do what dreamshaper did and just release a custom refiner model
I used Vit-H + the typical (but not ideal) preprovided flavor chain
due to that, all the concepts trained either already existed, or were 'retrained' from similar enough concepts that it got all the details right.
I know that you can bend it to your will with enough steps, to adapt to any wording you want, but for now I'm willing to use the existing clip knowledge since it makes things infinitely easier
At 1 million images it sounds like you just want to do a full fine-tune and not a LoRA.
lol, yeh
I'll also need a custom refiner model, and I hate it
I'm genuinely dreading that the refiner retains it flaws.
at least with 0.9 most of the concepts in my dataset are grossly undertrained or just absent so still want to do it... originally built the dataset for 2.1 and had a model mostly ready to release but decided to wait with this new shiny around the corner
refienr is extremely biased when it comes to faces, eyes, and face accessories - which makes cosplay loras especially hard, when they have eyewear like 2B from Nier Automata
hopefully we'll get refiner lora support - as that would solve the issue as well
are you training at 1e-3?
oh, or you mean finetune
2.1 was a full finetune and pretty low lr (1.97e-6 mostly then 3 epochs of polynominal descent to -9)
lora's i usually do at 1e-5 tho
sdxl lora doesnt need to be that low anymore. would highly recommend to try your datasets first at 1e-3 - since not one of my attempts at that rate has failed so far
also unet only
Why not use prodigy?
my original argument was, that I'd try other methods once one of my datasets failed
since a lora now takes me around 10 min to train on rtx4090, and works flawlessly, I've yet to try other methods XD
If I use the search bar and your name, will I find that you have mentioned the method you're using?
• good captioning is still relevant
• Dataset size of 30~60 to avoid typical issues. 10 or more will work fine if you know what you're doing. under 10 works if you really know what you're doing.
• set repeat to 1, do 40 epochs. 20 epochs will prob be your 'ideal' lora. so save every few epochs
You got gud at dataset prep. I'm going to assume that's why you can run with a static LR. Is my assumption accurate? Consequential assumption would be that shitty datasets require LR adjustments but there should be an optimal LR if your dataset is of a minimum quality. How to quantify dataset quality is a different question but you musta got gud at qualifying the quality. Assumptions correct?
And no need to perform the labour for me like that in the future. Otherwise, other people will get the idea that they don't have to put any effort into learning, thereby abusing your efforts and contributions.
made a python script that centers ComfyUI workflows on the canvas if anyone might find it useful https://github.com/Beinsezii/bsz-cui-extras/blob/master/workflows/centerizer.py
had a use for this a couple of times already. thanks for sharing it!
Having everyone reinvent the wheel is a waste for everyone. But it is also good to challenge assumptions and see if what people are doing is actually the best or if it only works the best in their usecase.
For instance, if you find settings that is very good at replicating your face that doesn't mean it's good for general concept training.
If you want a model to learn from drawings without turning everything into a drawing you might need different settings. Though the settings Caith posted looks pretty much like the ones I've used for 1.5 with the exception of dim.
ironically enough, sdxl is much more forgiving, even when using bad settings, or outright wrong practices.
You know you got your settings/practices right, once you can use your lora at weight 1, and 4/5 generated images are of 'good enough to post' quality <- mostly just a matter of getting your captions right
If you wanna do good manual tagging, I can highly recommend hydrus network. It's high effort to learn the first time, but it scales well. I do manual tagging for around 30~500 image datasets using that app.
above that size I switch to automated workflows.
below that size I just use vit-h to write me captions only, and then add the remaining tags manually in a text editor (Sublime 3) like a barbarian
AUTOMATIC1111 WebUI now supports SD XL
7 mins XD you're fast
Does it run as well as in Comfy?
oh yeah. just updated and 1.5 is released
I am both impressed and equally disturbed XD
I was searching on now how to install SDXL in Automatic 1111 and just saw an update. It's funny and so random, lol
.

MY SHEET, IT HAS BECOME COMPLETE
now it doesnt matter wtf they are everything in the sheet works in a1111 now

gee gee
I've read your words 3 times and I still can't understand them
...so does A1111 not use the refiner, or am I too oblivious to see it
lemme check
It should have it
I mean it works if you manually switch model, then img2img it, but at that point... yeah XD
whelp. its definitely not a good sdxl experience
off to a good start
im gunna try the new update of a1111. if it doesnt work first try. then yeah i can see that being true lmao
hopefully sdxl gets full released with their internal inference tool / or a very big shoutout to comfy
cause A1111 is gonna make a lot of people sad
I did a fresh install for the new version, since a lot changed
a1111 is also super slow for sdxl, if compared with comfy
ripgrep for 'refiner' only showed that the UI has the configs to load the model. didn't see anything about 2-stage processing
it works if you do the whole process manually
kind of. is there a way to end early?
so you can do base 20/30
gen image, take half finished image into img2img, then change to refiner model, wait an eternity, then do the refining part
but t2i can't return noise
10/10 user experience 🤣
yep. img2img workflow only
but that was already known
and no one knew wtf they were doing
but that it would be 'released' without 2 stage process is... well...
guess he just wanted it 'finished' in time for
wonder if vlad implemented noise return yet
he uses Diffusers or something doesnt he. the shrek guy and sytan made a PR to get it implemented there
technically seen, its in diffusers now thanks to pseudo. so vlad should automatically get it due to using diffusers?
is it an automatic thing or does he still have to manually use noise return
from the screenshot of the PR psuedo sent it almost looked like he just added the ability to noise return
the function is there. but if you call it, or give ui access to enable the function is still his thing
whats noise return?
doing this. Makes the refiner work better
i see
rather than fully making 2 images, you send the half completed image into the refiner model directly. Has the advantage of massive speed increase, and due to the noise, you add just the right amount of chaos to literally add details to the final image
denoise half in base, return early with leftover noise , refine the last half
or really it should be like 2/3 1/3
@spark pewter I've been trying to get the 8gb training to work. My command is:
accelerate launch --num_cpu_threads_per_process=2 "./sdxl_train_network.py" \
--enable_bucket \
--pretrained_model_name_or_path="<path to sd_xl_base_0.9.safetensors>" \
--train_data_dir="<stuff>\img" \
--resolution="1024,1024" \
--output_dir="<stuff>\model" \
--logging_dir="<stuff>\log" \
--network_alpha="0.5" \
--save_model_as=safetensors \
--network_module=networks.lora \
--unet_lr=0.0001 \
--network_train_unet_only \
--network_dim=8 \
--output_name="last" \
--lr_scheduler_num_cycles="10" \
--cache_text_encoder_outputs \
--learning_rate="0.0001" \
--lr_scheduler="cosine" \
--lr_warmup_steps="260" \
--train_batch_size="1" \
--max_train_steps="2600" \
--save_every_n_epochs="1" \
--mixed_precision="fp16" \
--save_precision="fp16" \
--cache_latents \
--optimizer_type="AdamW" \
--max_data_loader_n_workers="0" \
--vae_batch_size="2" \
--bucket_reso_steps=32 \
--min_snr_gamma=5 \
--gradient_checkpointing \
--xformers \
--bucket_no_upscale \
--vae="<path to sdxl_vae.safetensors>"
I got the train command to be as similar to yours as possible.
50/50 is way too much refine
also pointing out the noise return only seems to work properly on certain samplers
like euler and ddim
without karras
which is extra bonkers that ddim errors out in auto
and where's my 2nd clip model! XD
tbf I still just use one text input fed to both clips
hard enough prompting for one as it is
confused why a1111 sdxl says sdxl doesnt support ddim but comfy it works very well..
someone else said that ddim isn't implemented properly in a1111
supposedly multiple samplers are semi-broken
its soooo good for cinematic photos though. just throwing all the still and style references into the 2nd clip only
ah
guess I'll have to play with it once emad finishes Tears of the Kingdom and drops 1.0
'2 stage process patchwork code of sdxl'
i mean only one and a half day left
it's gonna release, while I'll be playing DND 🥲
flowcharts already make my brain turn off, flowcharts in Simlish is just mean
procedural dnd npcs on a small screen
is this what stuff looks like to people that are illiterate?
train anime lora for sdxl should include clip model?
now it seems that sdxl not have too much train on anime character, so if we want to use it train anime character model , it means we should train also clip model because of danbooru tag is not in sdxl?
UNET only for default sdxl0.9 model
if you're using the waifu xl model, you'd have to try if the text encoder messes things up or not
someone should train an LLM to convert booru tags into actual prompts
train two clip model is enough i think
Waifu XL is already being trained. use that as a base, then you won't have to touch the clip model.
really? I just know stability AI release a common sdxl model, also a waifu XL model to release?
I see, waifu XL is now training based on sdxl 0.9, maybe a further work to train after sdxl 1.0 release. Because 1.0 may have different structer
https://huggingface.co/hakurei/waifu-diffusion-xl
here is their first test for 0.9.
1.0 will also have a full waifu diffusion xl release - but it will take a bit of time to train
yeah, It looks so
disable the extension
its cause A1111 added proper metadata support - but that also means that all/most metadata extensions need to be updated to work with the new version of A1111
how do i tell in A1111 to first use the base model and then the refiner ?
Seems like you need to manually select the SDXL base model, generate it like 50% and transfer it to img2img. Now use SDXL refiner model to further improve the details
automatically? you don't.
manually? Generate image, then send to img2img, then switch model to refiner, then generate it again in the img2img tab
the 1.0 model gonna be this tedious too ?
hmm, that's a little bit clunky, isnt it ?
if you want a good sdxl experience, use comfyui
Yep, it is
A1111 full release of sdxl, really isnt a full release
I am trying to produce the same result using comfyui and a1111 with same parameters. But they are just different
using kohya? yes
but text encoder is complicated.
If you're fine with training a lora, those are super easy to train in sdxl, as long as you have enough vram
i dont really want a lora, i want an embedding, and i want to save an image every 5 steps
then no
well, i would be fine if i could train a lora and save an image every 5 steps, but that somehow doesnt produce images in the same way as embeddings would back then
keep in mind, sdxl is not SD1.5
so many experiences made using one will not transfer over to the other
true
SD1.5 was using 1 clip model, which was easy to train
SDXL uses two separate clip model - which makes everything much more complicated if you want to retrain them
i m not even really sure about the theory of how to do an embedding here
do you have to train a lora both on the base and refiner, too, by the way ?
for now, we can only train lora for base
is that the reason why the end images are always a bit blurry/unfinished ?
they shouldn't be blurry
A1111 does that, but that's because its not fully implemented
nothing to do with SDXL
what would be a good set of training images ? all of a very similar appearance ? and how many pictures ?
Isn't SAI working together with a1111?
you should probably get sdxl working properly in comfyui, before trying training.
but once you're ready
-> #✨|sdxl message
i do have it working in comfyUI
the images really shouldn't be blurry though
A1111 with hires fix. Not bad.
with comfyUI from yesterday
not from a lora training though
my question wasnt really regarding settings though, but more like "what kind of images should i take to create a good lora"
like, if you want a photorealistic portrait, is it better to have people from all angles, are all the same angle, with different poses or rather same, rather 6 pictures or 100 ...
@molten gull the only blur here is bokeh in the cat & burning man image, which I specifically prompted for
i quite probably made something wrong with creating the lora itself, yes
the 2B image, is from a lora I made
Same prompt same setting in comfy
but what kind of images would be good to use for a training @caith ?
seeds aren't cross compatible. but are the images 'generally' the same?
if your lora introduces blur/background blur, then odds are high you forgot to caption the background in every image - which results in the other words learning to recreate the background
if you want a photorealistic portrait, is it better to have people from all angles, are all the same angle, with different poses or rather same, rather 6 pictures or 100 ...
No.
they have similarity
"photorealistic" should also never be used - not for training nor prompting. since a photo is never tagged with 'photorealistic' on any website. only drawings are marked as photorealistic, when they try to look real. so that prompt alone would cause blur, since its a 'drawing' prompt
I kindly think a1111 more emphasis style
more angles = good
dataset size =
1~9 <- hard
10~29 <- good enough, if you tag properly
30~60 <- ideal. Will help correct some mistakes you could make while tagging.
60+ <- also good, but training takes longer. good if you want to teach multiple different things at the same time.
Tagging:
• always caption the background!
• have a trigger word, shuffle the rest
• **dont **tag the parts that you want to show up when you use your trigger word
• **do **tag everything else, especially background
prompt: a lineart painting of a banana. No refiner using comfy. Steps: 20. Sampler: Euler. CFG: 6.5. Seed:1
That's way too much of a difference. Using 1.5 models I can 1:1 reproduce images in both a1111 and ComfyUI
Something's different
left is a1111 right is comfy
maybe A1111 uses style prompt with a different weighting?
also, sample size of 1 is too small. at least 9 grid to make it a fair comparison
Opening both ate up my memory. LUL
I'm training with attention masking on human faces now which works well. The annoying thing is that 1.5 LoRAs (and 2.1) extrapolated the person's body really well but SDXL LoRAs get your body wrong sometimes or lose you entirely if the prompt is complex.
anatomy words are basically off limits. then everything is fine.
(during training)
how do you mean?
oof
feet/neck/torso/ears/hands/legs <- words such as these shouldn't be used during training. If you do, you mess up the anatomy, and bodies quickly turn into lovecraftian horrors. You can train anatomy, but it's done differently from normal training.
at least, dont casually try to train anatomy while doing anything else, like training clothing, faces or artstyles
I don't do that. I just label [trigger] fe/male person
I built custom attention masking on the kohya script to disregard the background so no need to caption that
and it still works in sdxl?
attention masking? yeah. I made it for sdxl
@vale eagle if you're comparing images seed by seed in a1111 and comfyui make sure you change a1111 to use cpu for seeds
the loras look good too but they don't infer your body shape which I find annoying (before I could use the same lora for portraits and random shit)
oh, where could I set it
settings -> stable diffusion
I think it might be because I also trained TI in 1.5
ah, that makes sense
But I run my comfy with --gpu-only
left comfyui, right auto
Should it be the same?
I'm still hoping that the weights in the final 1.0 are more balanced - since that would solve your issue, and mine as well
hell yeah, man. I sure hope so
I hope the official 1.0 vae is fixed for fp16
but I doubt it lol
prob not XD
I think we just haven't figured out how to do LoRAs right yet
but it will make lora training a real pain. since faces dont train at the same rate as literally everything else
make a separate face lora and run them in series
blame your brain for being so good at noticing imperfections in faces
But SDXL gets similarity insanely well.
adherence is the pain
I've got it working pretty well. but it takes around 600% of standard lora training time :/ so I'm not amused
face/body shape/and clothing were trained in this lora. I can call each of them separately. I even got the eyepatch working, which was my greatest pain so far
Thanks for help. I got the same image.
yeah at some point it's just not fun to wait around 5 hours to train
left is a1111. right is comfy
congrats.
Harmony achieved
small differences still, probably just Xformers or autocasting differences
blob:https://discord.com/e85d1cbb-3355-4c8f-aa5c-af6795f930a2 Add a suitable background to this table
#1072220168534642768 Add a suitable background to this table
It confirmed the refiner doesn't have involved in t2i using a1111
and i2i is the "wrong" way to use it lol
any1 got an sdxl img2img workflow?
Like from a .png type image? or a hiresfix?
didnt read properly. img2img
controlnet no go
a1111 vs comfyui base vs comfyui base+refiner. prompt: a banana in an anime style
keep getting weird crashes
yea controlnet doesnt work on sdxl yet
lol wasted the whole morning then 😂
inpaint should work fine in a1111 or comfyui
im talking strictly sdxl
only txt2img is sdxl
No idea what you're trying to convey with that screenshot
have a controlnet setup for comfy (not sdxl)
same for highresfix
same for img2img
same for infinite zoom
same for inpaint
I have sdxl fo txt2img
wondering if theres any sdxl workflows for these that are out there
or at least wanting to know if its possible for these to be done with sdxl, controlnet we know not..
don't know what infinitezoom is but the rest are possible 100%. just gotta add the nodes for them
I have a hiresfix workflow too
https://github.com/Beinsezii/bsz-cui-extras/blob/master/workflows/sdxl-upscale.json
those are the only ones I can link to premade stuff. rest, while possible, we have no link for. so either search, or needs to be selfmade
happy to try make them, super fun process, just dont wanna waste a morning like i did today on control net 😂
they work fine
k cool thanks for bumping the links, will check them out now
controlnet uses models trained specifically for 1.5
so the models need to be retrained for sdxl
same for loras
and inversions
SAI seems to try to improve controlnet for SDXL (reduce the size of it). Hear from reddit
got some ideas in testing for it 
working on the old bulkier types too in case the new variant has some issues
deforum type affect
idk what deforum is either
there plans to be a dedicated inpainting model again?
no special inpaint model at launch, model does a pretty decent job on its own though it may not mean there wont be something to improve it further in the near future
the refiner, is that gonna be separate ?
if so, doesn't the refiner count as an inpaint model?
looks like masked inpainting kinda kerfuffles the refiner in comfyui
are you meant to inpaint with the base or refiner model?
yea still a second one, but as Beinsezii said the base is likely going to get you better inpaint results as inpainting is typically a full generation task and the base is a much smarter model. That said, go ahead and try all the things 🙂
are we still going to be able to get fine details in skin with the base alone? are we getting the same base that SAI was previously going to release by itself? Cuz I want to train face loras and 0.9 base cant seem to do photorealism alone
refiner sure has a field day with tattoos and jewelry
you should be able to get sharper detail with the base alone with 1.0 compared to 0.9, its still not quite up to spec with the dual combo for now but alone it should be much easier to get finer details like that with the base than on 0.9. Ill be able to dive in more on the 26th
excellent thank you
I wrote my own gradio app for tag/caption editing and I have an automated workflow that does 85% of the work. I could get it to do 100% of the work at an acceptable quality by using papers/tech that stands on the shoulders of BLIP/CLIP but I need a decently sized high quality high variety training dataset with near perfect captions/tags. It may be stubbornness but, because I'm not using (I think they're generally called) booru tags, I typically classify anything related as unrelated to my purposes.
I could see a benefit to using a combination of single word tags and phrase based captions together but, since I am only observing a divide in protocol and not directing it, I have to choose a side.
Like, I've read a lot of tutorials (for v1.5/v2.1) and the advice for training a drawn person and for training a flesh person is similar but different. Slightly different optimal hyperparameters, from step count to batch size to data augmentation options. The adaptive optimizers are gonna hand wave a lot of that complexity away but it was quite annoying for someone to assert with such confidence that their way is the way when it couldn't apply to......a lot of diverse situations.
Example: there was a Forrest Whittaker LORA that someone trained with flip augmentation on. 🙃
The thing I'm struggling with would be inventing a standardized flexibility/complexity test to determine if the LORA subject can do everything from stand and look at camera to...uhh...a yoga pose while riding a bike with skis down a muddy mountain side being chased by a bear(the bike has skis. Not the subject or the bear. Let's give the bear a Sherlock Holmes hat though) in the style of some obscure not well known artist that is within the base model /knowledge/
I was just training a spider-gwen lora, and I think I need a few more screenshots from spiderverse (I have a few movie stills, a few comic art images, a few cosplay, and a few of emma stone because why not?). In this first attempt, sdxl is like "you still want a little spider-man though, right?" Silly images 🙂
Oh well, live and learn. Don't caption anything as "spider-man pose" that's not explicitly spider-man unless you want to train it a lot longer.
it's not an AI, it's priditctive algorithm, so ofc if you type "spider man" in there you're gonna get something related to spider man in return, and i bet you most of the images in the data set contains a spider man suit.
Sure, I was hoping to get this kind of pose trained as "spider-man pose" and I didn't think about how strong "spider-man" waould be in the model. It's an easy fix to the training data.
you're gonna want that pose without the suit to train on 😛
Yeah I can fix the issues. just thought it was funny to see 50% spider-man 50% spider-gwen combos the model came up with.
I guess base sdxl knows what "spider-woman" looks like pretty well too. Since, without prompting for "spider-woman" or training the lora with similar images, sdxl comes up with the mask that that character wears.
How well does SDXL handle hands compared to previous stable diffusion?
Somewhat better but not what you'd like it to do.
For my setup it's pretty much a hit'n'miss still. Most of them don't look right to me so far.
Better than ever, but not perfect 
noice!
running your own flavour chain?
this should solve itself once mcmonkey gets to show off his new infinity grid generator
I really really hope its a parallel release with sdxl 1.0
run this image through Vit-H, see what it spits out.
Whatever pose word you get, is your path of least resistence when it comes to training that pose specifically.
You can retrain other words, but that's just painful, and can often cause unexpected issues
i see thats rough. Also isnt They gonna do a presentation for SD XL. Hope they will touch upon that and whats their plan to improve it.
What's a good amount of loss to stop training a lora?
I just started training something today to learn how it all works. Spent ages captioning images. My hands are sore from typing all the captions 😆
Rip ❤️
I know the feeling xD
Just delete VENV and start again
results so far from my 'grumpy style' 😆
training a lora based on images i created with a 1.5 model
ah, I see. looking good so far 🙂
I won the pantheon what does this mean? Also what is this even for? Picking the best fine tune of SDXL?
congrats!!! 🥳 you've made it, yayy 🥳
well, that's all the fanfare you get, I'm afraid
@sour obsidian how well the does the new controlnet work for qr codes?
lol
I used the same exact prompt that used to get front page in 2022 with Midjourney V3
Pretty sure it’s an overfit image or concept
As V3 and SDXL makes roughly the same image
Open source finally reached midjourney v3-4 levels
that's not nice 😒
that's kinda cheating, I'm telling the mods on you 👀

Going to jail for Stealing prompts
can you share outputs from both sdxl and midjourney in here? 🙂
I haven't used mj in months, would love to see how similar they are
Both V3
#🏅|pantheon is my SDXL
So SDXL does beat it in terms of everything except “beauty” in my opinion
Like I think midjourney V3 has some secret sauce that makes all of its images beautiful
does anyone know how to use the refiner model for sdxl with a1111
I really wish they shared their secret sauce for V3 because V4 is just generic photographic model and V5 is something else entirely
But oh no our profits
I think SDXL beats V4 and is close to V5
It doesn’t beat V3 aesthetic though. Neither does V4 to be fair
sdxl has a problem of stuff bleeding into other aspects of an image
so in mj, the universe is wholly contained within the bottle. in sdxl it has bled to the background 🤔
I deliberately picked the option that bled in. I thought it made it look better
There was another just like the MJ one
Wait new winner already?!
What is this 5 minute pantheon??
oh? ok. but in general sdxl tends to do that.
well, you had your 5 mins of fame. let someone else shine now 🤩
I kinda wish there was a platform like Midjourney for stable diffusion where people actively made Upscaling and voting decisions which can be used for finetuning
Oh also SDXL beats V5 for text but I bet V6 will also have text
bad planing. would have waaaay more ratings than they get now
same prompt, same seed, same sampler & scheduler settings etc the nonly difference is the number of steps.
On the left was 25 total , the right was 150
if you are using an ancestral sampler it will always do that in any SD version
I am
sometimes you can use 30 steps then 31 steps and bam, different image
nice to know (and I wasnt complaining lol)
mind you it is interesting what remains the same
and what chnages
yep thats one reason many like it
although I only had the ancestral on the Refiner step
SDXL is miles better than MJ, MJ lost this battle. I generate images that MJ would even dream to be able to do as good as SDXL
but MJ supports many styles
and its way far away in terms of concept design
so does XL..
this is not concept
just add ((whatever style))
know what? give me any style to do this and I'll prove you wrong
i amnot in a mood to think
just got back from work
im tired
SDXL doesn't create good abstract images too
but its way better than 1.5 or 2.1
bruh, yes it does, hold up I'll show you
we never know the situation after finetuning
Whats your definition of abstract?
Stuff like this?
I see
..
i mean, something like waves, lines, geometrical figures, or miniature things
define abstract?
sure, hold up
Waves 
Repeating patterns
yeah, SDXL destroys MJ
This isnt a competition between those or any other, its more like a mutual evolution and each can learn from the other
i dont know how to describe but like this
the entire image should be the same
@ionic dragon
i am not saying SDXL is bad, im just saying that it lacks in few area, im not saying that MJ is good at abstract either,
i am just saying that SDXL has yet to improve along with MJ in temrs of abstract
Also youre rating an unfinished product atm, thats what 0.9 really is, a preview
like for example,
if I say someone is bad, it doesn't mean that the other person is good
i am just saying that the 1st person is bad, irrespective of 2nd person
this is abstract
yup
even though SDXL can still do best =\
i have many times said that SDXL is very good than MJ bcuz of the flexibility
idk man, when I use SDXL on A1111 it reaches a level of detail I never saw other AIs get to
I dont like it in A1111 atm, I prefer comfy since the workflow is more straight forward
use comfy
sdxl is best in comfy
its shit in a1111
this is shit?
Thats more what you had in mind?
it's better for me on A1111 than on comfy because I mastered my A1111 inference, so this isn't a fair comparison
I still dont really know what you mean by that
idk man. It is in fact faster for me on A1111 than on comfy. it's also way easier to prompt
I'm probably not the right person to ask because I'm very experienced with A1111
Feel free to try the workflow im using
So am I lol
try comfy
you'll start to hate a1111
its pretty frustrating to use it at the start
Nah A1111 still has a lot of uses
I still used A1111 for inpainting
but trust me, after using it for a week, you'll love it
yeah, controlnet, inpainiting are better on a1111
oh, I did. I also have a lot of experience with nodes and I still like A1111 better
Please dont ghost ping staff
oh wait
i am trying to send something but i sent by mistake
i am trying to ask comfy
It could be worse, it could be like over on the civitai discord where everyone was given the permission to use "at everyone"
@visual glade can we expect user friendly controlnet and inpainting in comfyui after the release of 1.0?
surely thats more generic than being SDXL related as controlnet is a much wider usage
also I think given that ComfyUI is only 4 or 5 months old that its quite amazing already
yeah, I bet when optimized and setup correctly you can get the same results like I get with A1111. Its just that I really like an optimized interface
also I bet that you can use SDXL's clip to interrogate photos in comfy
what you mean is that A1111 has a more use friendly default GUI than COmfy ,
yeah, plus, it's capable of faster it/s when using 40xx cards optimization
both are amazing UIs, but it's just that one suits me better
zzzzzzzzzzzzzzzzzzzzzzzzzzzzz oh sorry it one of those mysetup is fater than yuors moments.
Its a bit like a tyre thread or an oil thread in a mortoring group ;o)
did I just have a stroke
oh, I understand now. I didn't try to flex that my setup is faster. I meant that A1111UI has optimizations for 40 series cards
probably can , I do know I can use it to interrogate an image to get the colour gradients used
yeah, can we just get along? we both use SDXL here. we are just doing it in different ways. the last thing I want to do is to accidentally insult anyone.
using a lora makes things a lot slower, and extra 12 seconds for each gen
Love these; what are you using for your prompt, btw?
theres gotta be a way to remove the fog... idk why it denoises like that
lora is an additional certain amount of dense layers right?
not sure, I thought it was like a multiplication on the original weights
"candy land"
i think it must add at least a few layers, just because you add a new word to the collection, and it also might affect the weights during the process as to override them
I figured it was something like that.
oh, what I'm using for my prompts? I actually made a template that work surprisingly well with SDXL
so yeah, additional time makes a lot of sense because you use the same model with the new weights(adding them on top of the current ones) or something
Yeah
does anyone know if AMD gpu's are supported at launch? or do we have to wait for support?
tho i have an rx 580 so idk if it'll even work
if you can run 0.9sdxl, then you should be fine
@delicate grotto i haven't tried yet, i've been holding out
im almost sure that you can run it on your cpu then
amd are quite slow with SD
yeah it's quite painful lol, if i use controlnet it takes ~3 mins to generate. but i dont mind long waits if it means running SDXL locally
this is the difference with and without the template
should be ~1m for a 1024x1024 image, and you can pay ~12$ to get 100CU(im not sure how much is it in pictures)
I love water in SDXL, or water in general
that's a style or a lora?
Very cool! 👍
A1111 style. A1111 styles are templates for prompts
mind sharing the prompt?
this is the styles file that has it. idk how to use that on comfy though
its just a prompt, shouldn't be a problem
A1111 weights prompts differently from how it does styles. it might cause problems
not really, you can press the paste button and it will paste your style
it usually places that after your original prompt
lets say you had
pos:
prompt1
neg:
prompt2
the style is prompt3
the output is:
pos:
prompt1,pos prompt3
neg:
prompt2,neg prompt3
that's not how A1111 prompts work.
but sure, feel free to try it =]
