#✨|sdxl
isn't that lovely? I am kind of in the same boat 😅
rip ❤️
hehehehehehehe
confused man is confused by how well the upscale works
btw any one doing finetunes for sdxl of decent size should probably target 1280x1280 and not 1024x1024
mad man did all my testing for me while I was refining things haha
glad to be of assistance 🙂
Roman statues of beautiful Irish woman with a hint of egyptian culture
@soft zealot got any suggestions here? 🙂
i see, because the KohyaSS guide tab actually recommends using 4e-7 because it's the standard LR, but so far i've only used 4e-4. perhaps somebody who trained using 4e-7 can share their findings too
@high skiff where do i get the upscaler from?
trial & error lol
all i can say is look at the prompts I've used, I'm not specifying anything special, although I am using Preconditioning and the offset LORA plus various "styles"; analog photo is quite a good one for getting a good result IMHO.
Anyways going quiet as have to go out for a hospital appointment shortly
This is the recommended upscaler
if you don't have it, 4x ultrasharp is almost as good
Personally I like
ok. thanks for helping 🙂 noticed that it looks better with older people bc of wrinkles
different models for different things
The pixel upscaler is not as important now as it was before
lmao watda
damn, I wish I could share with people all the hell and 1000's of tests I went through lmao
exactly, as I've said before, at the end of the day do/use whatever makes you happy 🙂
I tested like 20 different pixel upscalers, and at least 5 major upscale revisions, with tons of different settings and stuff
over 3k image samples and tests
I won't have time to look over this until after work. But I see in the JSON a contrast fix. Nice, I was bitching about it messing with the contrast on img2img, but I'd not thought of doing this.
Yup, I added it cause I saw it as well
the default setting should be good for most images
I'm interested to see what you've done with it.
I mean, I can share now
did they ever say why they changed the encoder on the vae between SD 1/2 and SDxl
@high skiff Did you have your secret upscaling method shared somewhere?
take final image, upscale 4x, downsample back down using a decimal multiplication, encode that to a latent, put that into an advanced K sampler with add noise turned off, starting at 50% or later in the diffusion pipeline (this is the magic sauce, it's what keeps it faithful, but adds noise to greatly improve deformities)
Then that gets exported, and overlaid on itself to fix the contrast issues
its officially out
its kinda annoying since i can't pre-init alot of the layers from my last fork
Pretty sure it's a requirement because of the model change.
Thanks! ❤️
nah, you can use the sdxl vae on sd 1/2 if you retrain
of course, I am so pooped lmao
Is it not recommended to upscale to 4096x4096?
you can, tho you need at least 20GB VRAM to do that in a decent amount of time
I have done it
# SDXL original learning rate <- was meant as in this is what SAI used to train the whole SDXL model, with a batch size of 2048. While it's a neat fact, it doesn't really apply to any training nor finetuning, unless you plan on training significantly above 1 million images, and have around 64 A100s to pull off that training.
for LoRA, typically you'd train around 50~100 images, and that can be done at 1e-3, since the sdxl model is a lot more robust than SD1.5 used to be. You can achieve better quality with lower settings, but by lower I refer to 5e-4, which is as low as you'd ever need to go, for like 5k image datasets. Rather than focusing on training speed though, I can promise that every improvement you can make in your captioning, will show significantly more improvement in the final LoRA, than settings will
took 7 minutes on a 3080 lol
This sounds pretty much like what I was doing, but I couldn't get it quite right. I'll compare it with what I have tonight.
makes more sense if you pull things out in your workflow 🙂
I know you couldn't use it because it had to be stock nodes, but give it a go with the tiled ksampler
with incredible fidelity and detail
I'll put my 4090 to work then... I have to say so far this is looking amazing... Thank you for your hard work.. Great job
Lmao it's annoying me now. I really want to see what you did differently. But work :(.
I found the refiner to be really good for eyes and noses. But it wrecks everything else
base 100%
its way more consistent, more faithful, scrubs less detail, fixes deformities more, and also has higher pixel level detail cause its not overfitted
out of morbid curiosity are you using the 0.9 vae or the 1.0
😳
0.9 of course lol
No horizontal bands so assuming 0.9?
What values do I need to change to upscale to 4096x4096
I just didn't ship it with that, cause I am not sure still why SAI is being all silent about it
yeh thought so, didn't see any chromatic aberrations
downscale to 1
i personally prefer 1.0 except for the ca's... 0.9 is too sharp for me
@high skiff One little "under the hood" tweak I made was to change the final VAE Decode to a Tiled VAE Decode, as I noticed it was always (in the console) generating an Out Of Memory error and falling back to Tiled VAE (which is nicer from @visual glade than just crashing). Saves about 20 seconds (on my 1080ti, YMMV with newer cards) waiting for it to fail over
0.375 is 1.5x upscale, 0.5 is 2x upscale, 0.75 is 3x upscale, and 1.0 is 4x upscale
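A quick way to sanity-check those numbers, assuming the workflow always runs a 4x pixel upscaler first and then multiplies by the downscale value. The function and parameter names below are made up for illustration and are not taken from the workflow JSON:

```python
def effective_upscale(downscale_factor, upscaler_scale=4.0):
    """Net scale after a fixed pixel upscale followed by a downscale multiplier."""
    return upscaler_scale * downscale_factor


def final_size(width, height, downscale_factor, upscaler_scale=4.0):
    """Final image size in pixels for a given starting size (assumed helper)."""
    scale = effective_upscale(downscale_factor, upscaler_scale)
    return round(width * scale), round(height * scale)


# Print the net scale and resulting size for a 1024x1024 starting image.
for factor in (0.375, 0.5, 0.75, 1.0):
    print(factor, effective_upscale(factor), final_size(1024, 1024, factor))
```

So for a 4096x4096 result from a 1024x1024 gen, the downscale value would be 1.0 under this assumption.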
Does this need to change at all?
Or should i keep that at 2048?
you can do whatever you want with it, its not very crucial
That is the only value I have not found a real purpose or use for in SDXL
its never better or worse, always just different
Replaced my first stage base img2img with a prompt only pipeline and went from the left image to the right image. Odd.
whut
he's been awesome!
yessir
a little kinder please ❤️
He just means for the upscale pass I believe
hell, I am very close to removing the whole refiner from the workflow after some recent finds
And it does really struggle with img2img, but don't think it's really designed for that.
need more testing tho
we've already removed it.
I'm loving the artistic value of the horizontal bands 🙂
Where's a good place to find upscale models? I've been using RealESRGAN4x but I'm keen to try alternatives
i think both the refiner and base model need some work but i am super happy they were released in the state they were... arch is probably pretty stable, just some more steps and RLHF inputs
that's base only
Ah I'm currently training at a 100, I guess that won't come out right then?
you guys have good quality upscales yet?
Cause I cracked the code on my end
@hard fractal but while you're here. could I ask for this message to be pinned?
it will make redirecting people a lot easier than constantly posting the link
#🔧|finetune message
Imagine AI art archaeology decades from now. Those bands will be like sediment layers in rock
amazing work @high skiff !!
thank youuu
Pou coming to Fornite??!!!!
wouldn't be possible without the support of this community
i tried to generate at 4k on my 3090 and it takes ages though, 2k is fast
My 4090 went OOM with 4x upscale 😦
Some more stunning beauty from the Triple-Process R-B-R
I did 4096x on my 3080
@high skiff So how do you end up using the base model as an upscaler anyway? I thought it would only work up to 1024x1024
will have my team look it over!
When you say decades from now, you mean like next week right?😁
what was the crack?
here is topping at 23,6gb when VAE decoding at 4k
so the way I use it is to limit its detail focus
Its refining the small details, not ruining the big details
I mean if you save every 5 epochs, it doesn't matter - that's why we use repeat = 1, so we can save the model often enough
oh, you're actually changing the size of the image to 4096x4096?
Dog years versus internet years versus AI years in ascending order of vroooom
that's cfg 7.5?
more of my mixed diffusion, but this time img2img with upscaling
or 7
thank you ❤️
yeah
would be cool to get the prompts for it
yes, I am getting clean and dramatically better looking 2048x and 4096x gens compared to base SDXL
tom cruise looking serious at the camera Style: Cinematic
yeah, ditto!
we wanna add upscaling to the bot
why is style separate? is that some comfyui thing?
images are ridiculously better
After seeing how much it fixes all of SDXL's glaring issues, I can see why
Am I supposed to switch the refiner model to just base?
but would likely be a subscription or something
SYTAN
KINDER
PLEASE
I have the repeat set to 1, No idea what that means but I assume it'll be fine then 🙂
Oh I see, Is that the area value in the downscale node then? Do you manually move it around to specify areas to refine?
That's warning number 2.
ok, so I just can't say literally anything thats not just 100% blindly praising SDXL?
it has glaring issues, its a base model
its not perfect man
what's your prompt?
anyways
didn't say it was perfect.
its the best base model we have ever seen
your workflow of sdxl 1.0 is awesome
for sure 100%
glad you like it
i agree!
its been a shit ton of work
the eyes are better in the first image?
4k looks amazing
@hard fractal
i use ultrasharp 4X model instead
left is base SDXL, right is after my high res fix
give 8K a try 🙂
that one works nearly as good as the one I settled on, likely not enough to care much
those were fun tests!
no need to flex that you have better hardware, we all know lol
@high skiff So when I do x4 upscale it goes OOM then it says its trying a tiled approach and then it takes a while but still works... Should I use --lowvram or something?
with comfy
me just chilling with 8gb 😦
try swapping out the last VAE decode for a tiled VAE decode
thats weird that 24GB VRAM would be having that issue
oh wait
are you running --gpu-only?
yessir
must be something else you're doing here in comfy. can't get that sort of result on 7.5 elsewhere
it switches to tiled vae, since otherwise it would die at that resolution - but it doesn't do this for all users. that's why for some people they just go way OOM
manually putting in the tiled vae solves that instantly though
left 1024 res refine model, right 4X ultrasharp with mix diff and downscale 0.5
it is
when i was toying around with the idea of trying to merge weights from sd 2.1 partially into sdxl i noticed most of the parm counts per layer were 2x'd... thats a pretty large size increase... i imagine we are still really really far away from overtraining either model based.. that or we need to start doing layer-wise learning rates
we're looking to see if it holds, and if it does, we'll release
there's a glaring issue with your workflow, Sytan.
let's see how long it takes for you to find it 👀
and that would be?
can't use comfy very much locally. 8 gb vram and 16 gb regular ram seems too little
yeah, at least 24GB recommended
for which one type of ram?
My poor 3080
system
I can see that Sytan uses a robust turn-of-phrase - but its grazing the deepest sensibilities of a lot of people who've sweated blood for SAI ... 🙂
oh, I'm not the one to ask.
24GB recommended for system
that's a comfy question.
And here I am trying to run SDXL and Unreal at the same time lol
lol
I have talked with comfy, 24 is the recommended minimum, but 16 works
16gb system ram works fine with comfy, but it gets a bit laggy sometimes.
hehe. ok. a few people will have to resort to online solutions then 😄
32 is the actual recommended
by whom?
comfy
(Am I the only one dedicated to using Scott Detweiler's R-B-R Triple-Process for ComfyUI?) 🙂
he and I talked about this during the research
i know.
back when I first got my weights
Increase your page file size if you are having issues with 16gb RAM
you had a whole reddit post about how beautiful it was.
man, was that already a month ago? jesus
have they announced what controlnets they are going to release for sdxl?
what's that?
Virtual Memory
@high skiff OOM
it'll do it.
are you using --GPU-only in your args?
same
I've been playing around with something similar in Diffusers. Trying to get some good A-B comparisons but I'm sure I'm screwing something up since I'm getting wildly different outputs
same for you, are you using --gpu-only?
yes... Should I take that out?
yeah, thats whats doing it
@high skiff where's your newest at?
Basically if you run out of RAM it puts it in a file on your hard disk. It's extremely slow, but it stops OOM or crashing in edge cases
I'm not using any args iirc all default
diffusers sdxl lora training can't run on a GPU with 16GB of memory, it requires more
that is very strange
(honestly if i was stability ai, i would charge $5 or $100 or something for the controlnets, i would buy it just to feel like I contributed something to the cause and get something I would use)
This is normal it will switch to tiled.
even a 4090 will complain if you try to play a game + run kohya training + use comfy to generate images all at the same time 🤣
I would know after today
Thanks!
then it uses tiled VAE decode and takes a long time, VRAM almost filled
Give me a few days and i will be able to fix it on my end
my 3090 gets here on wednesday I believe
How much VRAM?
I've been able to do 2048*2048 on a 3080
once that is here, I can run the 24GB VRAM tests
generating 4k it almost tops, on 2k i didn't check
pretty confident that comfy doesn't account for overhead. but just manually changing the decode vae for tiled decode should solve that
DM me tomorrow
I'll get you some compute
(since in the end it defaults to tiled vae anyway)
I have done 2552^2 on my 3080 without OOM, used 9.8GB VRAM
Yeah 4k won't work though
oh? that could be amazing
Unless you use a tiled ksampler
you are giving away compute? ears perk up
I can try.
if I generate 2k, as it's the workflow default, I use almost 10gb of VRAM
That would be nice... I've been working on my cloud architecture cert lately
That's about right
I'll ask the SAI computemaster.
I mean, we're hiring.
I'm looking for 2-3 community finetuners.
interesting

lol
hi Caith!
my day job is reliability engineering, I just do this fine tuning stuff cause its fun
gotta love caith
That would be an awesome job 🙂
I'd offer up Caith, hes goated lol
DM me some links!
Same. Different day job. But I just do it to see what happens XD
which VAE model u recommend? i choose https://huggingface.co/stabilityai/sdxl-vae/resolve/main/sdxl_vae.safetensors currently
I have no recommendations for VAE's at the moment
I'd use the 0.9 one. They re-uploaded it I believe.
I think that's the one you linked
Which I believe was reverted back to the 0.9 one
Lora train on cloud is a pain in the neck, some options on github run locally
i prefer diffusers
@hard fractal I send you a DM
I use it
here
use that
How does it compare to @Sytan setup?
how u run kohya like on colab?
Dug up one of my first test scenes and ran it through SDXL. Such a massive improvement over when I was using SD1.4
I have d/loaded Sytan's but yet to implement
Just run the scripts locally
Your layout is tidier than mine 😉
@dense chasm mcmonkey posted some settings to use
https://gist.github.com/mcmonkey4eva/0f0bd074c17802213817a9a5a50098df
Try it out with them
I presume that was directed at me.
My entire workflow is based on @high skiff 's work and then tweaked
geez you built a whole front end in there.
i know a bunch of apps run locally, but i have to run on cloud, cuz i have not yet bought any GPU for my computer
I think @boreal bough has posted some settings too.
@high skiff do you have any commands for vram on startup? Mine blows past 10gb on the 2k upscale
We tested it on the bot.
So you have Sytan/R-B-R?
you could say that lol
I think there's some collabs for it. But not something I've looked into.
RBR was slightly under-preferred over BR.
How does R-B-R compare to @Sytan at all?
@peak dove Winston has been working with me in DM's for a while now, he uses the core stuff from my workflow
I have Sytan's flow with a Pre-Condition step added before the normal 2-step KSampler process, yes
I have absolutely 0 clue, it runs just fine for me
I'll have to sleep on it
I am so burned out after this 72 hour crunch I have been doing to get all this out
Also, fixed conditioning (including 4096x4096) was underpreferred.
--lowvram
no probs. it can wait and I will do some reading
that's your glaring area of improvement there, sytan
RBR being Refiner, Base, Refiner?
That acronym is Red Bull Racing to me lmao.
when did that change 💀
mm?
Try R-B-R Triple-Process ? 🙂
this is the "helicopter" view of it 🙂
I've been setting the conditioning to match the aspect ratio. That seems to work well.
how are your noodles so straight?!
you told me when I dropped my first workflow that SAI concluded that 4096x4096 is the best Latent size
And then I tested it myself and found no evidence supporting that
I looked at Winston's workflow and promptly went out to buy alcohol
????
why would that be best?
I have no idea
My Name Is Max and I am Inevitable
I don't think latent size is the correct word there.
No 😦
@hard fractal
yeah
is this not latent size instead of res?
oh and @peak dove I also use the offset example LORA
ah man, my body hurts
That's got nothing to do with the latent does it?
latent size is 128x128x4
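For anyone wondering where 128x128x4 comes from, here is a rough sketch. It assumes the VAE's usual 8x spatial downscale and 4 latent channels at the 1024x1024 base resolution; the helper name is made up for illustration:

```python
def latent_shape(width, height, vae_downscale=8, latent_channels=4):
    """Latent (width, height, channels) for a given pixel resolution.

    Assumes the VAE shrinks each spatial dimension by `vae_downscale`
    and encodes into `latent_channels` channels.
    """
    if width % vae_downscale or height % vae_downscale:
        raise ValueError("resolution should be a multiple of the VAE downscale factor")
    return (width // vae_downscale, height // vae_downscale, latent_channels)


print(latent_shape(1024, 1024))
```

The 4096x4096 figure being argued about is a conditioning value passed alongside the prompt, so it does not change this shape.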
Checo should be embarrassed being so far behind him in the same car
I know. Marko is going to make him vanish.
alright, I will have to catch up with this info tomorrow, I am too trashed from all this work for tonight
Swim with the fishes lol
(had to redo my launch post to reddit 3 times, cause reddit wasn't saving my drafts properly)
grab a few hours shut eye after pouring a stiff one
Orange
what are the realest looking images y'all have seen from xl?
Live in a country where you can drink at 18 😉
hmmm, thats a hard one
The Future is bright, The Future is Orange (as the old ad tag line went)
cause some images have an insane level of realistic detail, but they don't look like a "real" photo, if that makes sense
wow, this is the longest I have been orng
@hard fractal Did you say you had some links for a Position at SAI?
I think this one looks pretty good
have my tiger
My pride and joy
depends on your definition of "real". I'm rather fond of this one
ok, I have to go, I am starting to feel sick again. This happened after I stayed up too late yesterday
that is a good looking tiger right there, the only thing giving it away on that one is that he is neck deep in leaves
I've been sending that to my friends as one of the best examples of how amazing these pics are getting
Well done
just my DMs
get some sleep!
Get some rest... Great job on the release... Its awesome
Hey all, Searge's nodes are not working here.
I'm running comfyui in colab and the default nodes are working, other custom nodes like from Sytan are working as well. Unfortunately, Searges nodes are neither loaded in comfyui (only the default) nor can I open the nodes from the file when I load it in comfy directly. I followed the workflow https://github.com/SeargeDP/SeargeSDXL/blob/main/README.md and my colab script is this https://github.com/comfyanonymous/ComfyUI/blob/master/notebooks/comfyui_colab.ipynb
Has anyone else here had this issue? Thanks!
the ones we made as a joke while trying to create a whole storyboard about & with chaz
context and situational timing adds a lot to images, I feel.
what XD i swear I replied to the other message
You mean only people you messaged?
also like this one
discord gaslighting
another contender?
What's the deal with this new VAE, sd_xl_base_1.0_0.9vae.safetensors
Is that some improvement over the 1.0 one?
fixes issues. use the new 0.9 one
You know there are ways to handle your problem. The first is no blue light and therefore no device such as phone or computer. I don't know if you read but I cannot live without my e-reader. I have an 8 inch one which I love to death and is the last thing I'm using with my lights off set at 2%. Some people recommend melatonin supplements but I find they leave me a little doped so instead I found a great alternative with L-tryptophan. Which I highly recommend. Just friendly advice my friend
ok a cute kitten before I leave (standard res only as I never did an upscale on this one)
another dumb question, why do some workflows use the VAE, and some not at all? for instance this latest one from Sytan and friends does not have it?
I admit I haven't really figured out the upside of these upscalers. It's not that the results are bad or anything. It's just that the time invested, and realize that I'm doing this on a souped-up laptop, is just absurd compared to the result. So instead what I opt for because I do want a bigger and higher resolution image is Topaz Gigapixel which I upscale to four times and then downsize to whatever size I'm really looking for. It's been very satisfactory overall and is insanely fast.
is it the same as the OG 0.9 VAE (which Ive been using with SDXL1) or subtly different does anyone know ?
@hard fractal maybe?
@hard fractal hi, short question: what is the noise offset used for SDXL training. Is it still 0.0375 as for the SDXL 0.9 version?
do u know some datasets i can use with kohya lora testing?
I use a seperate VAE
I like the ability to switch
Just realised I have something to tweak there
As many a glitch as VAE 1.0 is giving, it adds artistic value 😄
oh I'm not sure about diffusers weights, that's been patrick
i uploaded the 1.0 base weights + 0.9 vae
And we very much appreciate it!
Overall, SDXL 0.9 is better artistically speaking; 1.0 looks a tad pastel/washed out - but that's just me 🙂
exact same
I found that 0.9 looks better but 1.0 has more details and realism. But maybe its just the prompting behaviour is slightly different
emphasis on vae though - not model
I'd say 1.0 was a photographer's dream, and 0.9 is fine art
There was a comment in the Reddit by one of the Developers saying that if you use 0.9 you should raise the CFG by 1.00 to get more equivalent results.
@hard fractal but really: what are the correct settings for training SDXL. Its really confusing that in your paper you write you used a noise offset of 0.05 while the kohya script is using 0.0375 as default. Also, the technical paper was never updated since 1.0, but I assume that you changed something between 0.9 and 1.0. Is there any report of what is different in 1.0. And again: what is the noise offset setting?
These are looking pretty awesome 🙂
My new Prompt With Style V3 node now supports adding multiple styles and multiple loras using the <style:style_name> <lora:lora_name> syntax. Also supports __wildcards__ with the syntax for adding multiple lines from the same wildcard file https://github.com/bash-j/mikey_nodes
also has ratio selection, option for custom size (including fit to 1024 res) and the clip conditioners are built in
What node package gives me tidier (and square) noodles/node connexions?
Awesome
where's the police car with the military Humvee turret from yesterday, from the guy who kept posting that sea monster lol
wasn't that @late marsh ?
Ummm I posted a couple Cthulhus but the other dude got an inspiration and kept doing them a lot 😄
@urban breach I guess it was @timid sonnet
Ph'nglui mglw'nafh Cthulhu R'lyeh wgah'nagl fhtagn
SDXL understood the assignment 🤣
Haha, nice 😄
Oh god that red one is terrifying 😄
the red one is actually lore accurate. well done
Mine is just faded black inked to my shin 😛
Cthulhu heard the call
Oh man I love SDXL ❤️
for sytans new workflow, I can recommend this style tag centered, dreamworld, sunset, bokeh, f1.8
adds nice colors and patterns to every image
weeeeeee ❤️
I just run One Button Prompt randomness, it's fun 😄
Same lol. This was one of the last ultrawide wallpapers i generated before i hit the sack lol
Need a few eye doctors, but once those kinks are done with
I love your pfp, btw
I love this style, very striking
Here's your prompt generated using my workflow
Those eyes! P e r f e c t i o n
Gothic, but high cfg 
Very intense look
Yeah, high contrast lighting...might look interesting with some intense shadows or fog
I've only got my own datasets, don't know about any public ones.
<------- just trying-out Sytan's Implementation ...
👍 good tip
U can take DulcoLax for that! 😄
Error occured while processing model:Input type (c10::Half) and bias type (float) should be the same
Getting this error while trying to run SDXL
jonah about to be swalled by the shoggoth kraken, Ph'nglui mglw'nafh Cthulhu R'lyeh wgah'nagl fhtagn
First Sytan example - clear and sharp!
Oooh, nice composition ❤️
But no real advance/disadvantage to the Triple-Process R-B-R imho
Need to add a Preview Image to Sytan's w/flow
Moby Dick!!!
Oh....you know. I totally forgot all about....

That phrase should go in the Supporting Terms box? 🙂
Haha, the randomness ❤️
yep
Thank you
Melon cat
Seems like the new Sytan Workflow is adding some nice details and fixing faces
Share the workflow as well for noobs like me?
I love melon cat
trying to make a xena fighting a named devil for you... but all I get is them weirdly dating instead 🤣
Just fixing an annoying bug with the regex pattern. I'll let you know once that's done 🙂
Well, you see, go look at my prompt lol
The full is far too long, but you can see what I posted haha--it's close enough
Love the warmth behind
I am generating images using Sytan's w/flow - how do I get it to Upscale at all?
do you have the one he released like an hour ago?
Yes
No, then mebbe I'd better re d/load 🙂
should look sorta like this
Yes I have that look ...
Also you have to download the upscale model he linked, if you don't have that
@eternal fog why do both of these image nodes have the same input?
whats the point of having 2 image nodes with the same input?
@ionic dragon one is preview one is save
You disable Save and enable preview
if you dont want the files saving
This one
Huh, do share the workflow! And what addons would be needed?
whats the point?
anyways its getting saved
why would someone want to preview it
any idea why kohya is making multiple files?
Sure, I did change the VAE Decode to Tiled because he recommended that, something about memory. Here's the PNG. This was before upscaling, just for workflow.
yep - cause the script says 'save every 5 epochs'. that way you can compare them to each other
My workflow is a mess I use for testing. I move the nodes on or off depending on if I'm testing or saving.
I usually end up using 20/25/30 - but its good to try with all of them, and see & learn from the results
oh ok
Aye. Tiled means it will use less memory: instead of generating one large image, it makes puzzle pieces of smaller gens and puts them together
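To picture the puzzle pieces, here is a minimal sketch of splitting an image into overlapping tiles. It only illustrates the general idea and is not how ComfyUI's tiled VAE decode is actually implemented; the tile size and overlap values are arbitrary assumptions:

```python
def tile_boxes(width, height, tile=512, overlap=64):
    """Return (left, top, right, bottom) boxes that cover the image.

    Neighbouring tiles share `overlap` pixels so seams can be blended;
    tiles at the right and bottom edges are clipped to the image size.
    """
    if overlap >= tile:
        raise ValueError("overlap must be smaller than the tile size")
    step = tile - overlap
    boxes = []
    for top in range(0, max(height - overlap, 1), step):
        for left in range(0, max(width - overlap, 1), step):
            boxes.append((left, top, min(left + tile, width), min(top + tile, height)))
    return boxes


# Each tile is processed on its own, so peak memory depends on the tile, not the full image.
print(len(tile_boxes(2048, 2048)), "tiles for a 2048x2048 image")
```

The overlap is there because independently processed tiles may not match exactly at the borders, which is where seam lines can show up.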
so do they all have different results when using and you just pick the best one?
Also here is where he uploaded it - https://github.com/SytanSD/Sytan-SDXL-ComfyUI
Thanks!
There's your melancholy 
exactly. the bigger the number - the more it has been trained. But at some point, more training starts becoming a bad thing. this way you can find just the right sweet spot
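As a small illustration of which files to expect, here is a sketch assuming the trainer saves one checkpoint at every multiple of the save interval. The helper name is hypothetical, and whether a final non-multiple epoch also gets saved depends on the training script:

```python
def checkpoint_epochs(total_epochs, save_every=5):
    """Epoch numbers at which a checkpoint file would be written (assumed behaviour)."""
    if save_every < 1:
        raise ValueError("save_every must be at least 1")
    return list(range(save_every, total_epochs + 1, save_every))


# With 30 epochs and a save every 5, these are the candidates to compare side by side.
print(checkpoint_epochs(30))
```

Generating the same prompt and seed with each saved file is one simple way to spot where more training stops helping.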
ah that makes sense thanks
Werf! 🙂
@eternal fog why does it have an extra sampler?
what does it do and it starts at 18 and ends at 22?
Use sytans workflow. I've not seen his new one yet. But I've had a quick look at the JSON and he's doing similar to what I have now, but better.
OK, I got the freshest release at Sytan that I could, will re-open the json file ...
Not at my pc so I can't look at it. It's probably old as well.
ok
Sytan's implementation gives a very crisp and detailed finish - perhaps a tad too sharp for my more artistic tastes - but that's just me 🙂
Prompt = a vivid watercolor depiction of diverse Rococopunk Afrofuturist women with beautiful and bold head wraps walking toward a huge moon in the background, holding hands, walking away from the camera, collaborating in a creative and productive environment, women empowerment poster, photography, inspired by the styles of victo ngai and vladimir kush
sd_xl_base_1.0_0.9vae.safetensors - is this used as VAE or Checkpoint?
Nano robots
Vae
Well both
It's the SDXL 1.0 checkpoint with the 0.9 vae
I thought vae and ckpt were separate?
Because people didn't like the 1.0 vae
No you can embed them with the model as well.
The 1.0 VAE opens up so many distinctive artistic possibilities over here at Torcello Towers ... 🙂
It's why the comfy model loader has a VAE output
The 1.0 vae adds weird lines and colours for me.
But both are there to choose from.
<--- gone to Tescos - L8r
Can someone share a ss of sytans workflow, not at my computer
Its not sdxl but it seems good enough for a 2.1 model to go through the effort of releasing probably?
sorry for the questions, do you have any idea what the 20 to 40 number changes?
SDXL Now available on Mage.Space. Not a shill, just a fan.
it's like a video game, how do i disable depth of field, bokeh in all these pics 😄 just put that in the negative prompt and hope for the right seed.
could probably do a no bokeh lora pretty easily
too sharp because upscaler model i think, for base without upscaler pretty nice, i try your prompt, it's beautiful
i think somebody was working on one yesterday
Is there a way to reduce the size of a lora ?
Yes, use a lower dimension for the network and conv.
would it be better to do x2 + x2 or directly x4 with your upscaling method?
@soft zealot are you still online?
Out away from desk for 2-4 hours
so the answer is yes 🙂
Would it be ok if I chat with you privately when you get back?
Drop me a question and I’ll look later
2x + 2x would just make it take longer.
However it's possible it might prevent possible artifacts.
@vital wolf you will need to git pull to update the code for my nodes to get V3. Here is an example workflow that uses the new node.
How do I get photorealistic hands?
click a picture of your hands
in sdxl all the hands are fucked up compared to Realistic Vision 5.0 for example?
left sdxl searge nodes, right Realistic Vision 5.0
wont solve it, ironically enough. it's that many words have bokeh & blur baked in. the trick is to avoid those words. especially cause when you accidentally stack them, the blur starts overtaking all other aspects (4k, 8k, 16k <- are top offenders)
hmmm very good point
but but 4k, 8k, trending on artstation are my most important tokens 
thank god 'artstation' wasnt affected. I actually need that one for concept art styles, until I get a better bigger prompt
oh yeah. cinematic lenses have amongst the biggest DOF, so insane blur
it works much better again in XL compared to SD 2.1
what keyword would you recommend for the style to get a realistic photo
@high skiff whats your opinion on DreamShaper XL1.0?
without the blur i mean
all hail sytan
id just keep shallow depth of field, blur, bokeh in negatives
works fine for me
tbh idek if shallow dof works
❤️
but aren't you supposed to kind of specify a style at the beginning of the prompt to avoid cartoons, anime, drawings, etc. something that is "photorealistic" but without the blur
ohhh right this reminds me to test out how well sdxl can draw supercell thunderstorms
I get crisp and sharp images, maybe you should up your prompt game 😛
It does great job with this
Use “hands in pockets” in all your prompts………….
In sdxl I found that it’s better to not use “photorealistic” and the like since that implies a style
wow that is ridiculous
I find hands to usually be good, just make sure to not draw too close attention to them.
Oh.. let's see
hands are always easy, just edit them in photoshop afterwards 🙂
perfect hands everytime
personally i am so overwhelmed by the style of pictures my brain just filters out bad details. it's not until i send it to somebody and they are like, "what's with the extra 2 fingers" or "legs don't bend that way" that i'm like... oh yeah lol
why do the faces of my lora come out looking like they've been in a chainsaw accident?
Great Tater
My first gen with SDXL
can you explain in more detail? i already thought that 4096x4096 couldn't be best for all aspect ratios. that's why i used some math nodes to do width * 4 and height * 4 instead - from limited tests that reduces problems with stretched content. but maybe there is a better calculation for those values?
My latest gen using Sytan's w/flow
oh, i had the same idea. how do you calculate the values? i was just using some math nodes and use height x 4 and width x 4 to get close to the recommended 4096x4096, but consider the selected aspect ratio.
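A rough sketch of the "width x 4 and height x 4" idea from the messages above, so the upscale target keeps the selected aspect ratio instead of forcing 4096x4096. The function name and the snap-to-a-multiple-of-8 step are assumptions added for illustration (SD latents are 1/8 of the pixel size), not something taken from any workflow in this chat.

```python
def upscale_size(width, height, factor=4, multiple=8):
    """Scale both sides by the same factor, then snap to a multiple of 8.

    Hypothetical helper: mirrors what two math nodes (width * 4, height * 4)
    would compute, so the aspect ratio of the base image is preserved.
    """
    def snap(value):
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(width * factor), snap(height * factor)


print(upscale_size(1024, 1024))  # (4096, 4096)
print(upscale_size(1216, 832))   # (4864, 3328)
```

Only the square case lands on 4096x4096. Other aspect ratios get a target with the same proportions as the source, which is likely why stretching goes away.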
What's with the spaghetti thing I keep seeing from time to time? 😂
if end_at_step is higher than your steps, it just ends at your last step (30 in that case). just avoids that it gets stopped too early if you have some calculation or rounding errors in your workflow.
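The behaviour described above amounts to a clamp. This is a minimal sketch of that description only, with a made-up function name. It is not copied from the ComfyUI source.

```python
def effective_end_step(total_steps, end_at_step):
    """Return the step where sampling really stops.

    Assumption: an end_at_step larger than the step count is simply
    treated as "run to the last step", as described in the chat.
    """
    return min(end_at_step, total_steps)


print(effective_end_step(30, 10000))  # 30
print(effective_end_step(30, 24))     # 24
```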
No shit Sherlock, if you over-prompt like that it's perfectly normal that the end result will follow your direction. That wasn't my point, try "a close-up photo of a beautiful woman" in SD 1.5 and then compare it to SDXL, let me know
I have the upscaler in my setup, but my images are still a tad too sharp but not upscaled. I will prefer Sytan's w/flow for a more photographic result, and use R-B-R (or even SDXL0.9) for art 🙂
oh ok
also downloaded the sytan workflow but not sure how i can edit the steps in general
You have a sublime touch to your art!
Oppenspaghetti?
thx ^^ just re-tracing my steps and see what made it click
Only use Tiled VAE if your VRAM is too small. It will/may introduce small errors (color differences/lines between those tiles) and shouldn't be used if you have enough VRAM.
Titled vae?
it's a different node that some people use instead of the regular vae node.
which one do you prefer?
The sharper on right side
left has less broken finger
Second one looks better but just slightly
right has more details, fingers can be fixed with editing
Exactly
where can we download the 2 nodes in sytan's workflow?
did git pull on comfy already
Both images have broken fingers to be honest
Left one has less
which one from these?
yeah so if youre already going to edit either one, you may as well choose the one that has more details
Still learning how to tame this powerful beast
I mean, it's harder to see the broken one in the left img than in the right one
why my upscales always use 11 steps?
Left one
The background is more detailed
But honestly, both are cool
I am kinda new with stable diffusion, what should be steps for base and refiner? Refiner seems to break faces especially eyes
Too cool
Did u use an upscaler?
@azure oxide @peak dove your opinion on these
1st one
80% base + 20% refiner
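For anyone wondering what "80% base + 20% refiner" means in step counts, here is a small sketch. The helper name and the rounding choice are assumptions. The ratio itself is just the suggestion from the message above, not an official recommendation.

```python
def split_steps(total_steps, base_fraction=0.8):
    """Split a total step count between base and refiner.

    Hypothetical helper: the base model gets base_fraction of the steps
    (rounded), the refiner gets whatever is left.
    """
    base_steps = round(total_steps * base_fraction)
    return base_steps, total_steps - base_steps


print(split_steps(30))  # (24, 6)
print(split_steps(25))  # (20, 5)
```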
Nope, just native 1024 x 1024
Anyone try pruning the SDXL base model? Is it even possible?
I see, pretty cool tbh. But the eyes in some imgs are... Awkward XD, if u use a good upscaler it would be fixed
Tanks!
anyone got it working in vlad? i just haven't been able to get decent output from sdxl in vlad
its already pruned to 6.5gb. it's got a larger size for fp16 because of it using two clip models
ThANKS!
Ah, I figured.
Vlad explodes my 8GB VRAM RTX2070 - pushes it into error after error
hehe let's prune it to fp1 and see what happens 🤣
Whoa that's nice
Did you do that with ComfyUI SDXL???? Those actually look like real tanks.... Awesome job with those.
This is made in SD.Next. Thanks!
Do you have a recommendation for an upscaler?
Excellent job with those.
Thanks.. good 50% random prompts 😄
I got Vlad to work Base only (using Diffusers) - so so results, softish finish
new minecraft update hits different
yeah, this is the issue i have too, either base only, or it just disintegrates
Minister Gordon Ramsey went public in stating: "Officially a state of war does exist with Italy"
so the 1st post 1st image & 2nd post 2nd image had Sytan's default samplers and schedulers.
1st post 2nd image & 2nd post 1st image had dpmpp_2m and karras for all ksamp.
but we cant always say that its better, have to try for other styles too
but ddim produces better fingers
That's what I'm using
Pink Slime! I need that to make my mining laser lens...
FTB?!?!?!
Seeing that Comfy actually works at SAI - I'd say that ComfyUI is the present-time leader in the field of SDXL ... !
Ah yes, good old days
man, I feel old now
Both the ones I chose I liked because crisp and sharp
yeah, the texture was better
This kind of romantic art needs Base only ...
... it lives to be soft focus
So how do you use the refiner? just chuck the image I got from t2i to i2i?
actually, i do base+refine, upscale and do a base/base+refine, and then get the best upscaled image
If you're on A1111 there's an extension for it
sometimes base gives better output, sometimes base+refine gives better output
OK - I've yet to implement Upscale - seeing as I use GigaPixel 🙂
i think for all pastels we should only use base
SDXL1 has that kind of pastel finish compared to the more saturated 0.9 look
Where do I find that?
The best I have seen is the @high skiff one. Literally fixes any output. But idk if it's already out, he said yesterday that he is gonna make it out today but idk if it's already.
Oh wow thanks, I will change that back cause I wasn't having issues before hand
Sytan released his latest w/flow already
Yayy
Do u have the link?
I don't like using SDXL on Comfy, it wants 2 positive prompts
From final VAE Decode set a Save node
they go in ComfyUI\output
Comfy is just annoying to use in many ways.
Woah pretty cool
I only ever use 1 positive prompt, and almost always leave the negative prompt empty
however if ComfyUI will have the AIT optimization, I'll switch to it in a heartbeat and make a workflow that I like
@peak dove do you use Sytan's workflow?
Sytan's w/flow uses 2 positive prompts to good effect
Yes, but it gets over-sharpened imho
would like to adjust the steps in sytans workflow, but dunno how
Oh it does this anyway, I guess it makes no difference for me? Warning: Ran out of memory when regular VAE decoding, retrying with tiled VAE decoding.
it's kind of annoying how SDXL doesn't seem to handle genning at 512*512 at all
as in it produces nonsense
usually
As you go from one sampler to the next, 1st sampler say 0 to 3 steps; 2nd sampler 3 to 12 steps; 3rd sampler 12 to 20 steps ... rinse and repeat!
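The chaining described above (0 to 3, 3 to 12, 12 to 20) can be written down as a list of boundaries, where each sampler starts at the step the previous one ended on. A minimal sketch: the function name is made up, and `start_at_step` is assumed to be the counterpart of the `end_at_step` setting mentioned earlier in the chat.

```python
def step_ranges(boundaries):
    """Turn [0, 3, 12, 20] into per-sampler start/end settings.

    Each sampler picks up exactly where the previous one stopped,
    so no step is skipped or sampled twice.
    """
    return [
        {"start_at_step": start, "end_at_step": end}
        for start, end in zip(boundaries[:-1], boundaries[1:])
    ]


for i, r in enumerate(step_ranges([0, 3, 12, 20]), start=1):
    print(f"sampler {i}: {r['start_at_step']} -> {r['end_at_step']}")
```

To adjust the steps in a chained workflow, the boundary has to be changed in both neighbouring samplers, otherwise the ranges overlap or leave a gap.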
i think this is actually the most significant, and most important benefit over sd 1.5 lol
still, I hate the fact that it needs 2 positives. If ComfyUI will have AIT optimization I'll make a workflow similar to my A1111 one
You don't NEED 2 positives. But it can help.
i want to understand the workflow can you explain it, the ksamps?
It doesn't need two positives. That might just be the one by Sytan. Mine doesn't have two positives for example
@floral island i love these styles
you have your answer, comfy is very flexible
double exposure?
yes 😄 trying to figure out how to get it to work with other cool stuff
base ksamp produces a small image from text - the second ksamp take img2img from base ksamp into refiner ksamp - or so I believe?
Generation data stripped, had to fix it a tiny bit
idk man, if comfy UI will have that AIT optimization I'll make an opinionated workflow
AIT?
WE COME IN PEACE
Turn our swords into ploughshares
tensorrt's big mama
where do we even start to address the ... things in this image?
The heck hahahha
Thomas The Telescope?!?!?! LOL
It's smoking lol
What is that Lora if you don't mind me asking?
is this new dprk propaganda?
Auto1111 seems to be struggling with SDXL img2img
No Lora
capable of insane performance with no degradation
it says offset 0.2 but mine is throwing errors
like it dropped to s/it
use --medvram
Comfy said they are currently working on implementing AIT to ComfyUI
Send the prompt 🥵
"a battle tank with a main cannon with the face of Thomas the Tank Engine"
I tried the inverse "Thomas the Tank Engine as a battle tank with a main cannon" and it wasn't as cohesive
That's disgustingly amazing
That is an awesome minecraft blob
this is what the Lora comes up as, but the one they released with SDXL is saying Error occurred when executing LoraLoader.. HeaderTooLarge
there's a AIT VAE decode node but when I tried it, resulting image was all black :/
Wow it's rly cool tbh
it's not released yet, whatever you're using isn't the official one
You would still use 2 positives to get the most out of it. The model has 2 text encoders. It's your choice if you use them both or not.
On automatic it will just use the one. Or just use the same in both and mask it from the user.
I want to punch it
You can do whatever you like I'm just saying that there's no two positives that you keep complaining about

Neon LoFi Girl in the city
probably using it wrong, so idk
Sytan's "two positives" seem to be 1) a prompt and 2) a style modifier (rather than a full-blown prompt). It matters not whether you fill-out one or both!
Well I've seen workflows with two actual positives. But it's a personal workflow thing and nothing to do with ComfyUI
Somebody try this in Sytan's two positive boxes - BLACK - WHITE - the result should be GREY?! 😄
Wow it's really accurate
Just tiny issues
For example I found a great workflow for Loras, but it imposed some upscaler which I absolutely did not want. Outside from that it was a great workflow. So I deleted the nodes that I didn't want and left the rest. No big deal
It has a soft watercolor finish ...
What does this mean? Renders still work
missing {'cond_stage_model.clip_g.transformer.text_model.embeddings.position_ids'}
Thank you... Its a work in progress
idk im still not a fan of the refiner, always destroys most of my images
"you might wonder how i ended up here..."
sorry guys does anyone know why it errors out with any value other than 0 for both these strengths?
All my images are with refiner.. works nicely
What does it do if you use less prompts?
same thing
Yes I have had that issue, not sure how to fix , other than have it higher.
Can you show the stuff you have plugged into the LoRA loader?
She's cool, but she'd slowly rust!!!!! 😦
Alright I redownloaded it from here, I think the file was corrupt 🤷♂️
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/blob/main/sd_xl_offset_example-lora_1.0.safetensors
Now it's working 🙂
Ok cool
thanks Torcello

Luke Skywalker
Wutno.
Gandalf?
Beksinski.
OK, the artist
does the refiner fuck up loras? because it looks fine until the refiner comes in
Hi, any idea if () still emphasizes some keywords or not? Using comfyui rn
I feel the same but I kinda changed my mind
do you have a github to grab those custom nodes from?
I'm starting to warm up to SDXL.
You could just try it.
that's a dumb ass question
How much faster can SDXL get with optimizations? 2x? 10x?
What would "optimizations" be referring to? I would also like to know now.
I can try, but how do I measure for sure that "clown" is emphasized in my prompt "image of a ((clown)) eating pizzas"?
(clown:1.3)
Basically the Unet is the slowest part, so going through those layers and figuring out what is taking the most time, and pruning from there
From what I understand that would be the way.
I never understood it fully but instead used the (word:1.0) method.
try it - Stable Diffusion = A1111, Comfy, NMKD etc etc
Ah interesting.
(a:1.1) (brown:1.2) (cow:1.3) (jumped:1.4) (over:1.5) (the:1.6) (moon:1.7)
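To check which words in a prompt actually carry an explicit weight, the `(word:1.3)` form from the messages above can be pulled out with a regex. This is only an illustrative parser for that one syntax, written as an assumption about how the text looks. It is not the parser used by ComfyUI or A1111, and it ignores nested forms like `((clown))`.

```python
import re

# Matches "(some words:1.3)" with no nested parentheses inside.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weights(prompt):
    """Return (text, weight) pairs for every explicit (text:weight) span."""
    return [(m.group(1).strip(), float(m.group(2)))
            for m in WEIGHTED.finditer(prompt)]


print(parse_weights("image of a (clown:1.3) eating pizzas"))
# [('clown', 1.3)]
```

A practical way to see the effect is to fix the seed and compare `clown` against `(clown:1.3)` side by side.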
People have done optimized attention implementations for previous SD models, seems like the same concepts should work
More Street Art
with upscale
Using R-B-R w/flow
maybe give my setup a try, its embedded in this pic
allright, ill try soon 😄
I can see little difference upscaled/non-upscaled
no, it's a big difference, the upscale erases a lot of detail
look especially at the cats nose
Nasal upscales are tricky 🙂
Cool!
read the UMD notes. you need to change the steps to tune it
try 29 or 30
This one is really good, one of the best I've tried so far. It creates some awesome detail, and these eye pics are crazy 😄 Nice work
Thanks man, appreciate it 
Wonderful love the wide-angle and dof
Hey, a bit new to this. Noticing my generations lose all of my lora-ness when they pass through refiner. Any ideas of what I could try?
default short prompt images look way better in SDXL than 1.5, I don't know what you're trying to say
Hi-res workflow of both Sytan and Searge work fine
missing nodes for ur workflow
Yeah I have a few custom nodes installed, with the comfy manager it should tell you what you need
or maybe this helps
last time i tried to install ultimate sd upscale everything crashed
and i had to reinstall my whole comfy
xD
Birb! (it might have a double face.. kinda. You decide!) 😄
Art At Home
anyone have a good image2image workflow to share?
Are people getting better results putting the same prompts into both text_g and text_l encoders, or having different prompts?
I have had some really good results with just leaving the text_l encoder blank.
This seems to make more difference to the output image than what number of steps or sampler is used, but I haven't seen many people talking about it!???
Supporting terms to L if you're gonna split
definitely something that should be investigated.
I found both happening. Some prompts get better, others get worse
Yeah but what does that even really mean!??, I guess I need to look at some more good examples, it is like learning to prompt in different way/new language.
I made a moonlight picture in Sytan's w/flow - the second prompt specified sunlight - so the result was a very sunny moon (if that makes any sense?)
so far I prefer to just use same prompt for both
using different prompts for both is very awkward and strange. It seems to work, but it's definitely a hack
G and L use different versions of Clip, which you can go down that rabbit hole as deep as you'd like
Are these just prompting or some other wizardry? 😄
I suspect what it does is it places emphasis on the words used early on in each prompt - SD takes heed of the earliest words in a prompt, de-emphasising the later words - so a 2nd prompt is a form of re-emphasising (that's my surmise at least...)
the prompt literally had 4 words in it, so other wizardry
the problem is that prompts are aligned, i.e. the k-th token in clip-l is concatenated with the k-th token in clip-g. Using different prompts totally destroys this alignment and might end up with strange results
.. as in a "sunny moon"
That's what I thought 👍🏼
Everything with SD is a "your mileage may vary" situation lol
also, that's on A1111. the main reason I didn't move to Comfy yet is because it follows prompts in a different way
if your prompts are "moon" and "sun" then this makes sense, as you end up with a token that is associated with both, moon and sun.
But if you use "moon" and "by greg rutkowski" then the word "moon" is associated with the word "by" which does not make sense. Of course the tokens are somewhat contextualised, but still it is strange
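A toy illustration of the alignment point above: the k-th token vector from clip-l gets joined with the k-th token vector from clip-g along the feature axis. The sizes used here (768 for CLIP-L, 1280 for the bigger OpenCLIP model, 77 token slots) follow the publicly described SDXL setup. The values are placeholders and the function is a sketch, not the real encoder code.

```python
CLIP_L_DIM = 768    # feature size of the smaller text encoder
CLIP_G_DIM = 1280   # feature size of the larger text encoder
TOKENS = 77         # token slots per prompt chunk


def concat_per_token(emb_l, emb_g):
    """Join the k-th clip-l vector with the k-th clip-g vector."""
    if len(emb_l) != len(emb_g):
        raise ValueError("both encoders must produce the same number of tokens")
    return [vec_l + vec_g for vec_l, vec_g in zip(emb_l, emb_g)]


# Placeholder embeddings: zeros for clip-l, ones for clip-g.
emb_l = [[0.0] * CLIP_L_DIM for _ in range(TOKENS)]
emb_g = [[1.0] * CLIP_G_DIM for _ in range(TOKENS)]

joined = concat_per_token(emb_l, emb_g)
print(len(joined), len(joined[0]))  # 77 2048
```

With two different prompts, slot k on the clip-l side and slot k on the clip-g side can belong to unrelated words, which is the "moon" next to "by" situation described above.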
i spotted a wild vae today. supposedly is a lot more memory efficient. https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/discussions/7
it is identical in a1111 and comfy, like 98% of the image.
weird, for me it's not like that. can you replicate this in Comfy?
The only thing that makes the image different is the seed noise, which is calculated on the GPU or the CPU
If you make both using the same way. It would be identical
It is in the setting
Testing Cyborg Style lora ❤️ | https://civitai.com/models/119405/cyborg-style-sdxl-or-goofy-ai
I tried copying the workflow and params of the images I make using A1111 to Comfy, it's not the same.
I think my best results come when i give different prompts to the G and L, where i treat L like a support. Both clips are going to go out and invoke the latent space. I think if you prompt them separately, it can give a higher quality focus across a wider attention. If that makes sense. Prompting is all about the attention you're drawing to various parts of the latent space. using 2 prompts is like, 2 wheel drive vs 1 wheel drive
If it is the same I would love to see someone prove it
ohh, is this possible to change also on ComfyUI?
are any 3rd party control nets out?
someone has to have a dope img2img workflow to share 🙂
thanks Grougarawr, will snag the upscaler and try it out
finally starting to get rid of the bokeh look, having a subject and detail in the background is awesome. just don't look at her fingers lol
i pretty much got rid of every photograph "high quality" related term before it stopped the depth of field thing with a photographic subject
Does anyone know how to add a toggle button to a custom comfyui node?
ctrl + m
Could you give me the prompt and let me try?
Thank you. I'm looking for something more intuitive though
all in the metadata
The way Comfy is currently written it does not allow for a toggle to be added into a node because it expects a connected line to have an output.
You can write a custom receiving node and custom output node to allow for that, but you'd have to do the same for every single node then.
Alternatively, yeah just mute the node, or drag the output to a reroute, following another reroute, and use that as the toggle
It would be nice to have a switch junction. A reroute with two inputs, a toggle, and one output
Could it be added with code? Looking into litegraph and they have it on this project. https://github.com/jagenjo/litegraph.js?files=1
hmm this nose is suspicious
can anyone give me a good prompt to test if my sdxl works on auto1111 ? 🙂
In a way.
I fiddled around with trying to code something for a while, ended up abandoning it.
My process was to have an image output (like vae decode) go into an output switch, so one image input with two image outputs, controlled by a boolean. That way I could have a "switches" row of just booleans changing between 0 or 1, which would control which image output I wanted.
So I could have one for saving only, or preview and save.
I got them working by making a custom save image node. Normally it would yell at you and redwall you when there is a line connection but no image to be received, but I made it so that it would ignore an empty image.
This worked, except ONLY for that. If I wanted to do the image output to any other node, I'd also have to make that a custom node to ignore the empty input.
So basically the NoneType would have to be fundamentally rewritten by Comfy to allow empty inputs. That's above my python coding, no expert here
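A bare-bones sketch of the output-switch idea described above: one image in, two image outputs, and a 0/1 value deciding which output carries the image. It follows the usual custom-node layout (`INPUT_TYPES`, `RETURN_TYPES`, `FUNCTION`), but the class name, category and field names are invented here and it has not been tested inside ComfyUI. As noted in the messages above, whatever node is wired to the unused output would still need to tolerate an empty input.

```python
class ImageOutputSwitch:
    """Hypothetical switch node: routes one image to output A or output B."""

    @classmethod
    def INPUT_TYPES(cls):
        return {
            "required": {
                "image": ("IMAGE",),
                # 0 sends the image to output A, 1 sends it to output B
                "select": ("INT", {"default": 0, "min": 0, "max": 1}),
            }
        }

    RETURN_TYPES = ("IMAGE", "IMAGE")
    RETURN_NAMES = ("output_a", "output_b")
    FUNCTION = "route"
    CATEGORY = "utils"

    def route(self, image, select):
        # The output that was not selected is left empty (None).
        if select:
            return (None, image)
        return (image, None)


# Registration mapping in the style custom node packs commonly use.
NODE_CLASS_MAPPINGS = {"ImageOutputSwitch": ImageOutputSwitch}
```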
did you figure out how to reproduce A1111 images in Comfy? I have been trying and I couldn't recreate the gens I make with A1111 with Comfy.
Thanks. Me neither but I'll mess around with it later tonight and see what I can do.
trying, have some interesting results.
This was essentially it that I did
I can't completely show it because currently I'm testing some other custom node ideas, but yeah this is basically it. Could have also added an image input switch to the Preview image so you'd always get the preview.
I'm really liking @high skiff latest workflow 👍🏻
can comfy default, add a transparent png overlay onto a final output image?
like for adding a company logo in the bottom corner
image/postprocessing/ImageBlend set blend mode to overlay and load a transparent image. might not always be in the exact corner if you change resolutions - haven't tried it
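On the "might not always be in the exact corner if you change resolutions" caveat: if the logo is placed by a pixel offset, that offset can be recomputed from the output size each time. A small sketch of the arithmetic only. The function name and the 16 px margin are assumptions, and it does not touch the ImageBlend node itself.

```python
def bottom_right_offset(canvas_size, logo_size, margin=16):
    """Top-left (x, y) where a logo should go to sit in the bottom-right corner."""
    canvas_w, canvas_h = canvas_size
    logo_w, logo_h = logo_size
    x = max(0, canvas_w - logo_w - margin)
    y = max(0, canvas_h - logo_h - margin)
    return x, y


print(bottom_right_offset((1024, 1024), (128, 64)))  # (880, 944)
print(bottom_right_offset((1216, 832), (128, 64)))   # (1072, 752)
```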
will try it, image node (transparentpng) --> imageblend node --> save image node?
interesting result
i was just curious cause i lowkey love stats...whats the difference between discord bot users now that 1.0 is out and when it wasnt out yet.

