#✨|sdxl
1 messages · Page 64 of 1
yeah frontend needs a bunch of work like a rewrite of the primitive nodes
this is how I imagine the scene
so we got the team ready to support comfy!
hacker guy is def a good prompt
comfy, could we get the history in descending order?
Let's chat next week about your wishlist!
time to work it seems
you got the whole company ready to support comfy 🤗
I can only plan features and do some UI/UX design but I'm not good at coding 😄
My favorite little trick:
Right click a node, turn it into an input
Sure
We know that
But then, you can just double click the little green dot
And it'll make whatever node is best

Also, using shift to move multiple nodes.
yeah lets chat about that and more
found that one early from having used Adobe products for so long
command-enter to make a prompt go
lol
a Crtl+Z undo would be nice
right click the load image node -> open in mask editor
One more colour down, 16 777 211 left to go. Even without the LoRA it seems like SDLX is a lot better at red backgrounds than blue or green and this was trained a bit differently.
some people gave a shot at implementing this but apparently litegraph needs to be improved a bit for it to work properly
who designed litegraph?
doesnt ctrl-z somewhat work tho?
not when you delete nodes
ahh
I've accidentally deleted nodes before when I tapped the backspace key after I thought I clicked into a text field. So maybe only having the delete key mapped to delete a node?
@hard fractal Do these 3rd party controlnets work with sdxl?
https://huggingface.co/TencentARC
No
The best would be to replace litegraph with something else.
holding shift to move multiple nodes took some time to getting used to. because I would have expected to select multiple things with shift (like in file managers, photoshop etc) and than the selected objects can be moved with the left mouse button.
I understand it makes sense to have a precaution to not accidentally move stuff, but it is different to the "standard" behavior that is common in most applications.
I picked litegraph because it was the simplest way to get a UI on comfy
Since we're on the topic of requests, could there be a rewrite which allows connected flows to not throw redwall errors if nothing is output?
So that way you could then have logic gates switch between one or the other based off a boolean, or some other future method?
Image for example of an image output switch node I wrote, but then realized the fundamental issue of NoneType errors
but it's not perfect
I'll reach out. Maybe we can get him to do some things for us
I learnt yesterday you can shift click multiple nodes and copy/paste into another workflow
decent
@comfy I took that link and downloaded the picture to have its workspace but I got this when I generated.
That's fine
ctrl + shift + v also pastes the node with the same links as the one you copied
sweet
Oh yeah that's the biggest one
yeah I leave some debug info for myself, you can ignore it most of the time
alrighty
snap!
Unless someone mentioned it already, you can control drag to select more than one node.
It made this
too fast for me! 😄
I just want to know how to do 90 degree angles and straight lines
Use SwarmUI and have all the nodes hidden from view or use Auto1111 and don't worry about nodes.
I need to try that. I keep meaning to
I can also really recommend the ComfyUI Quality-of-Life extension package by @zealous horizon. Great helpers!
https://github.com/failfa-st/failfast-comfyui-extensions
feels like they should be native 😉
I might have to nuke my install and start over. so many errors. it's my fault for not being mindful of what I was doing. but might be easier to just start over at this point. still works, but random errors
I am getting overwhelmed now with all of these extensions as I am not the type to have a lot of extensions only the ones I need. @vast narwhal
just work with what comfyui comes with for a couple of days
any stable prompt ideas for letters/words?
than you can add features if you want or need them
yeah, just wait a while
add cheese after you know how much cheese is on it
I need that mouse inverter 100%
those pixel graphics look nice 👌
need a way to do a prompt where I just <lora> so I can have as many as I want not something that needs an ultrawidescreen 8k monitor that is 70 inches.
That is how I feel
I made a node that supports lora:lora_name, but it's made for SDXL support only due to the conditioners
I mean, you don't have to put them all horizontal. you can loop back, you can stack the loras vertically
you can you use that efficiency lora loader
you can minimize them
yeh, its modular and you can draw a face with them if you like
you can put everything tightly together but navigation isn't really a problem with the canvas of comfyui. it's quite fast
Yeah, I will try that for sure and see, but what humble is talking about would be fantastic for all versions of SD.
I made my pixel art workflow arranged sort of like an old tv lol
Auto is practically dead now with XL and mem leaks that require over 64 gigs of ram and still wants more
nice
pics or it didn't happen
whoa
i mean. you know
that's nice
Wouldn't it be cool if nodes autosized?
excellent
is there a shortcut to convert input to widget and vice versa?
expressive - cheese from space
the moon.... ITS CHEESE
In space noone can have your bree.
van life
Trust me, I used to hate comfy ui, no my work flow looks like this
that van is legit
I hated it for like 5 mintutes lol
I never really did. just didn't know what I was doing at first
The Daflon Machine
still don't know tbh
Dude, I am not a Borg. 😦
don't get sucked into those LHC machines. start simple and build from there - than it's way less overwhelming.
I didn't realize for a couple days that I could load the images into comfy and get the workflow
so I was really perplexed by the tutorials
just pictures and vague descriptions of things
my main looks like a mess right now lol. I dont have fancy line straighteners yet.
I have seen stuff that looks like schematics. I mean I can't even follow all the routes.
right? like a CPU board or something
I use way too many image filters. I've spent so much time doing random things with photoshop
Exactly
I have to really hold back, lol
really its just time. after a while you realize you can merge projects etc
yes, they look like schematics.
neither can I. you just take it apart if you want a feature. build your own. take some pointers from Sytan's workflow. it's well organized.
or become a lizard wizard... maybe join a band...
your settings are whack
Not sure why I get that as I am using exponential
don't use exponential?

I mean it could be an art form. it's almost like stained glass
I'll see if I can load it. I'm curious about what's going on
That's generally just not enough steps with the sampler and/or bad sampler choice. I have only used karras and normal.
exponential does strange things sometimes
project, recue the prompt
Tried karras too and it is hit or miss
make sure your base and refiner use the same scheduler
awesome Pencil Art B&W Lora
karras
karrap
what are you total steps?
Oh? I am using that image workflow you linked me to earlier
there's a wrong setting
somethings afoot
I suspect your refiner start steps are set incorrectly
I have no idea I am using one comfy sent me to
pixel cheez robots? count me in
one has noise added the refiner doesn't and base is euler refiner is dpm
I mean, why not?! best cheese robot wins
can you screenshot your settings? steps, cfg, both samplers
I bet this fixes it from what comfy said
Yeah it could be start/end steps are wrong, different samplers chosen for refiner/base. Different schedulers on base/refiner can do that too but usually not so extreme. Most of the settings can cause similar looking problems if they are set badly.
this is so big my answer is not easily
are you stripping the metadata from the images, general? I can't seem to load them for some reason. then looked and didn't see anything in the one I downloaded
just screenshot your whole workflow so it's readable 😄 we figure it out
you'll get it. you could just snag anyones png and load up a new workflow and see how its set up etc.
ahaa
imma save this so as not to forget
where is the seed at or is the seed a noise seed and refiner has no seed?
The refiner has a seed but I've seen some workflows that set it to a constant since it's not changing the look to much anyway.
both need a noise seed. they probably use the same. which is fine
first is base second is refiner
In that screenshot the refiner is set to a fixed seed and the base is set to randomize it.
But I guess if you don't add noise does it do anythign?
How since I do not see a seed node?
It says "noise seed" and has a number. Under that you can set it to "fixed", "randomize", "increment", etc.
In the ksampler node.
I meant it isn't connected so how is refiner using the same one?
It isn't. But I don't see how it would matter if it was or not.
I thought someone said they need to be the same seed?
I don't think that is true.
what is XL version ?
I keep my seed number constant when I want to refine the same image.
I see the refiner starts at step 20 and base is still going so both go to 25?
YO..your chEEse Sir..rr. r
and I also just run the same number for all the samplers now
well unless I want different numbers for some reason
In fact, with add_noise off, is the noise seed actually doing anything on thge refiner at all?
¯_(ツ)_/¯
yeah, I have no idea, lol
the seed only does something when add noise is on
what's the default if the sampler doesn't have the add noise optoin?
you can double click the workspace and type Seed to search. then connect the seed into the noise seed inputs, if you want it the same
I did not know refiner and base are running at the same time. base to 25 steps and refiner kicks in at 20 or am I reading this wrong?
@visual glade is something like this possible to implement? Or would it require too much to fundamentally change in the way ComfyUI is written?
where to ask noob questions ?
that would only need a change in the frontend
general, what are you trying to do?
everything but seed was in there
figure all this out
yours doesnt have it hmm. not sure maybe its a custom node?
iif you want to run all the samplers off the same seed you should be able to right clcik and change seed to input
then drag the node out from the seed input and drop it into a random empty area
and it'll show you all your input node options
Do you think this is possible, comfy? https://github.com/comfyanonymous/ComfyUI/issues/1051
That's essentially what I tried to do
I think these node boxes do not allow for that as these are KSampler (Advanced)
but with available nodes and a bit of a custom one I did
how to install 4x upscaler for AUTOMATIC1111/stable-diffusion-webui
yes it shouldn't be difficult to implement in the frontend if someone decides to do it
it has to be in the frontend, it's not something that can be implemented in the backend only
I guess you didn't see what I just said, general. not sure what else to tell you
save the upscaler PTH file into
stable-diffusion-webui\models\ESRGAN
did still does not appear
restarted too
restart A1111
it should be in img2img page
or extras
looks like classic cinema
I feel naked as I did in 1.5 when I retried it as I rely upon all my models and loras, embeddings, and hypernetworks.
which upscaler did you save?
Ty
4x upltrasharp upscaler
this satisfies me
works on mine
thanks
Would you know about which files would contain that portion of logic? I'd be interested at taking a look
dogs are omitted from being considered spamming
That's one healthy looking doggo
Got 3 best puppers myself
He's got a sis to grow up with, and they've got an ornery old bitch on her last legs to tiptoe around
new 3090 gets here tomorrow
Excited to get back into LoRA's again
My best bud is tipping almost 11 now, sad to see him get older, but he's still bounding around. Just never think they will get there at that age until they do
didn't think it would be possible
a 6x prompt LoRA sampler I just built, for when you do get back in
Our old dog we had when I was younger made it to 17.5, and he was super happy and active until the very near end
How do I lock a seed in comfy because each gen is drastically changing?
Oldest girl's about 15 now
https://github.com/comfyanonymous/ComfyUI/blob/master/web/extensions/core/rerouteNode.js
this is the code for the reroute node
Does that have the whole workflow or is that just a peice of it?
I already told you how it can be done, lol
That's awesome, I would love for him to get there, so long as he stays healthy. He's had joint cancer once removed from one of his hind legs, so hopefully it doesn't come back, then he should be good
As I showed I never could find it.
because you didn't do what I said
CHeers, I'll take a look through, thanks
Pick your model, your lora, and click queue
<---- looking at an nVidia 4090 ... 🙂
Where do you get loras for sdxl 1.0? are any out yet?
I made one
lots of people are making their own, myself included
Kohya_SS or Auto1111 are the two primary UI's to do it with I think
your making me wanna do ones for my cats 
Can you send me the link for kohya_ss... it looks like a lot of people have their own version of this
now if i could just keep my cat still enough to take decent pictures lmao
https://www.youtube.com/watch?v=AY6DMBCIZ3A the tutorial I followed
Updated for SDXL 1.0. How to install #Kohya SS GUI trainer and do #LoRA training with Stable Diffusion XL (#SDXL) this is the video you are looking for. I have shown how to install Kohya from scratch. The best parameters to do LoRA training with SDXL. How to use Kohya SDXL LoRAs with ComfyUI. How to do checkpoint comparison with SDXL LoRAs and m...
Your first model won't be perfect. Searching this server for keywords can quickly find answers to questions that may have been asked at least once before.
#🔧|finetune message
with this is it should be easier - but captioning is still on you
secourses 
Where? Not there, and no right click to add it either.
literally says conver noise seed to input
Wow just watched part of the video... It looks like training on a 4090 with a lot of images could take hours... possibly days to train one LoRA.. Is that correct?
*convert
I don't even know wth a noise seed is
that's what a seed is
the seed of the noise
remember we don't have that in auto
I only had the seed
seed = noise
thats what a seed is...
its the same thing
auto just doesn't lable it right
it is infact, the seed for the noise
I effing asked that way earlier and itr fell on deaf ears. I thought that is what it was but helkl
@spring fulcrum If you don't have a lora, you could just delete that node and run something like Dreamshaper XL (since it doesn't need a refiner) alone with the workflow
I tried to help but your version doesnt have a seed node for some reason
Yeah, this is weird like that
since when was there a seed node?
Must be why I am getting different 100% each time
just set the seed to be static
maybe its a custome node. not sure
hmm
but it does have a seed node. it's call noise seed as it produces random noise
change control_after_generate to fixed
You can make that with a primitive
change it to input and there will be an input for seed, or noise seed
@boreal bough i'm prepping to train a Lora for a style (on camera flash photography) i misplaced the command line you place in here a day ago and can't seem to go back and find it, do you still have that handy?
then drag from that into empty space and you will see input options
for captioning, I tried this project https://github.com/jbmiller10/CaptionFusionator which used blip2, wd14, flamigo to caption image and summarize with llm. The result is pretty promising.
anyone know how to optimize sd?? i just want my images to generate faster. im running a gtx 1660 super with an overclock and my current start setting are --xformers --no-half --medvram. im just generating a single picture at a time but it takes upwards of 2 and a half minutes for a single picture to be done with these settings:
That's just a primitive node connected into the seed
Heun? whoa
w1 = torch.bmm(q[:, i:end], k) # b,hw,hw w[b,i,j]=sum_c q[b,i,c]k[b,c,j]
torch.cuda.OutOfMemoryError: CUDA out of memory.
fix ?
don't use Heun, that is one of the worst samplers for SDXL
its slow, and inefficient. Almost any other sampler would be much better
For SDXL use 1024 native, not upscale. And use --lowvram
yeah was just trying to work out why I can search for that by typing Seed, yet General didnt have it
what is sdxl?
The model that is the topic of this channel.
you might try Euler a and do like 20 steps
alright, I'm not going to try explaining anything else, lol
The latest stable diffusion model
alright ill try this
2 mins for a 1024 img on 1.5 ?
Does this help ya?
i've loaded that, but i don't see where the 8/1 settings are, and i don't know if you're just using the dreambooth tab and then extracting lora with the 'tools
i assume the Lora tab would be were i'd kick of the training
euler a cut my time in like half but it looks different then Heun, i was only using it because it seemed to give the best results
anything with a (ancestral) will continunally give you different results the more steps you have
yeah they are going to look different
oh derp i loaded the json into dreambooth tab hence being confused
is there any sampler that will look like Heun but perform like euler or other good samplers
relative rendering time compared to each other
would you use class images on a style lora then like dreambooth character
it broke when I did that but no big deal
broke as in?
Now I made it back to a widget broke still, so reload time
It does make some nice images in 1.5 at least though lol
i've got 2 aspect ratios 832/1216 and 1216/832, i'm assuming that i just put them into the same folder 1_triggerword
SOmething else is breaking then, not th eprimitive
sketch lora grid. last column is 'intended use'. every odd column is without lora.
it was trained to be mostly used on humans - but for the fun of it I added pretty random scenarios XD
Yep, and I save the workspace only now I can''t find where it saved it
What is the best way in Comfy to pass the same width and height to the SDXL Clip encoders for both main model and refiner, and also empty latent image node. Seems it needs the same values in 10-12 places.
it did not save it in the comfyui folder
'intended use'
in your usual download folder for your browser
this is one way to do it
I am using the nodes that have 2 prompts each. Each main model node needs width and height twice, and 2 nodes for positive and negative. Seems to run out of connections from the primitive.
And also need to pass the same value to the empty latent image. But primitives seem to only connect to 8 items max.
:? I never would have thought that as most save it into their own folder.
connections shouldn't be an issue
OK then it is a problem on my system. But that shows me the answer. Thank you.
sorcery
By accident I just found that and thank you. 1 down a few more issues to go until I feel comfy in this.
I was only able to get it to connect to 8 locations and then no more connections would be created.
Add a reroute then inbetween one of them and just push off that instead
oh nvm doesn't like reroute
workspace loaded and works again
can I not use the same primitive between the two ksamplers?
I don't have it up right now but I'll try just rebuilding the workflow from scratch now that it is confirmed there is no limit to the outbound connections from a primitive.
sdxl is so good that half the loras i think of making sdxl can already do it quite well lmao
Same
You can
Working now so before I dunno what broke
I think SDXL will require a lot less finetuning for many things.
happens lol
Oh, I already see a bunch of new things it needs help with
Over baked Asians is one
absolutely. while there are issues, they are few inbetween. most things can be prompted for - albeit weirdly prompt gated
turn your cfg down
like anime images are totally doable right now. but oh god the prompting gets weird 🤣
I am not sure a full on FT would be needed but a DB I can see
cfg 1
amazing. is this with base sdxl?
are you skipping steps?
well you can still have it skip the last fwe steps of the render if you'd like it helps with that sometimes. and if you'd like to know what connects to what, like I said a few times before, just drag from one of the inputs or outputs with your mouse down, then let go in empty space, and a little menu will pop up
then click search and you will see all possible nodes that will connect with that particular input or output
Yep, that is how I added the primitive
if you'd like to know things like that
I actually don't exactly know what a primitive is. I just experimented until things worked the way I wanted
well some things
Seems to be just an integer
Primitive is basically a variable.
Will take on whatever type you connect it to. Integer, string, pick-list.
gotcha. that makes sense
Now I have a static image I can work with
once you get over the hump it shouldn't be bad
but I still miss some things from a111 like cfg scheduling
and noise offset
might be a way to do those things in comfy, but haven't figured that out yet
I guess with primitives I could schedule cfg
sometimes it runs only when i hit enter in the cmd console
otherwise its stuck
gai, I had that same issue with a1111. don't have an explanation as to why
yeah, this hump isn't that bad but missing QoL stuff though these extension I will look at that might help that
if you click and drag your mouse in the cmd window it will pause it. enter unpauses it
,also, there's no reason you can't use both. but for me personally my resources are kind of on the low end, at least my video card. so isn't really worthwhile to run both at the same time
and these sdxl models take about 5 minutes to load, lol
gotta load fast
No, I am over, BACK, in ComfyUI because A1111 is totally screwed now when XL hit
it was kind of a mess already. no hate on the people working on it. but just got bloated and disjointed
I mean the mem leak from switching from base to refiner and back I had a 100gig page file and I have 48gigs of ram
and didn't really progress as it should
I appreciate it for all it taught me though. I didn't even know what stable diffusion was until around november of last year I believe
Alright I like the non gpu sde karras
so it was good for beginner to low/medium level stuff
It has become so broke now I am not even sure it qualifies for that.
what I like about comfy is it makes the process much more visible and intuitive, even if it's more complex and tedious at time
I just updated to the latest as I was using feb release still and omg, it became so sluggish and it was horridly slow.
and I like to get weird with workflows, so it's perfect for me
night and day as if I was back on a core2duo
I had to nuke it and reinstall about ever 2-3 months
Do you mind sharing the json file that you used for training?
It's forced me to learn more about what's actually happening, instead of just hitting big orange button to give me a picture
I am wondering if I can train XL on colab? I do 2.1 but if I do I can't use comfy with it, not yet at least.
yes, so many of the concepts were just vague ideas to me before recently
I always loved nodes as they are pure power but I rather have the DaVinci Resolve interface/UI than a Blender type.
I haven't used either, but it's ideal for my brain
Resolve is damn cool how they set nodes up. Top notch
You have a simple interface and can go deeper if you wish and most things are drag and drop connections so way more automatic.
can do manual if you wish just the grunt work is almost gone
ahh,that's right up my alley. I've spent many hours running filter after filter,blending layers, etc, in photoshop
to be able to quantify and save the flow is quite nice
layers is akin to to Adobe AE and PS Nodes is Resolve and now comfy
The main thing I don't like about Comfy is that it's hard to iterate on something already produced. Yes I know it can be done but so can using SD fully from command line.
InvokeAI if they ever fix their current issues
what do you mean?
Generate a batch, click one, send to inpaint. Much easier in A1111. Can be done in Comfy. Just smoother in A1111.
Oh, yes
I just miss using absurdly high cfg numbers. with scheduling, throttling, mimic cfg, etc, I could use mid 20s cfg values and it'd come out fine
Same with settings like "inpaint only masked". Can be done in Comfy. Easier in A1111.
Can anyone answer me why clipdrop's sdxl is far superior than anything Ive been able to produce. Ive tried sdxl in tensor art with same prompts but it comes nowhere as close as the ones in sdxl. Has anyone been able to match the same results as clipdrop's if you have tried? Im always addicted to clipdrop's superior quality, sure the anatomy is not the best but realism is unmatched.
I don't batch much because my 1060 sucks now, and is dying, but I know what you mean precisely
do you think you said clipdrop enough?
no
heheheh, I smell advert
are you using styles?
no, i found no style gives excellent results, prompts need to be detailed tho for that realism look
I still haven't figured out how to do styles in this
don't even worry about it yet
I barely understand it yet also, but I think its part of conditioning for G and L prompts etc, using separate prompt boxes
advanced stuff for me so far
just look at the quality, in secs with no hassle in clipdrop.
2070S a1111 most recent update is taking over 10 mins per image all of a sudden
So it isn't done via the refiner?
you can easily match and suprass that level of quality in comfy in also just a little bit of time
how am I getting 21it/s but it takes 10 mins for an image... before I avgd 3it/s and it took 1-2mins
right? you just have to know how to put a prompt together lol
I can put prompts together, bud
not saying they're good per se, but I can make them
Look at the unit more closely. It might not be "it/s".
Either using CPU or using system RAM due to VRAM overflow. What model and resolution? What interface?
wow, 22.9 seconds, lolol
a1111, sdxl, 1024x1024
i was using this model the day it dropped and it only took me about 1-2 minutes per generation with these set
idid update a1111 today but idk how it could be such a drastic change
Very strange. What GPU?
rendering 8192x9192
2070S 8b
Try setting --medvram or --lowvram
set COMMANDLINE_ARGS= --xformers --autolaunch --upcast-sampling --disable-nan-check --no-half-vae --medvram
these are what i have set already
lol
dont use auto1111 for sdxl,its too buggy right now
quality generations on clipdrop 😎
Remove --disable-nan-check because it will make fixing other problems harder later. Why use --upcast-sampling? That sounds like it will increase VRAM usage.
have you tried --opt-sdp-attention instead of xformers?
does sd next (vlad) work? cant stand the nodes in the other one
I will try your guys' suggestions rn
I am still not grasping refiner as I tell it moon, outside, etc... and it doesn't do it
isn't the refiner more about details than objects?
details as in style
tbh I haven't really fully figured it out myself
the prompts I am seeing done no but you would think so
set COMMANDLINE_ARGS= --opt-sdp-attention --autolaunch --no-half-vae --medvram
trying this rn
there is comfybox. basically similar to comfyui but with the same A1111 UI menu style
I don't even believe most of the stuff I read about xl
it's just people making things up
I could not get that to work as he says to run run.bat and nope.
two different articles or posts will say diametrically opposed things
no run.bat in his repo AND the command he says to start comfy says it was not compiled for cuda
Here's the papers
https://arxiv.org/abs/2307.01952
We present SDXL, a latent diffusion model for text-to-image synthesis.
Compared to previous versions of Stable Diffusion, SDXL leverages a three times
larger UNet backbone: The increase of model parameters is mainly due to more
attention blocks and a larger cross-attention context as SDXL uses a second
text encoder. We design multiple novel cond...
One person makes a video full of broscience and 1000 copycats repeat it as truth.
appreciate it. but my issue is most of what I find is either clickbait nonsense or is beyond my current level of understanding
wouldn't hurt to look over the paper though
It gets pretty weedy reading through the papers, but it's the actual project info, so it's the truth
most articles out there help with basic understandings, but that's about it
is there any settings i can change to make rendering faster
indeed. worth looking at. take away some of the ambiguity. and I guess I know more than I did when I first started trying to read htose things
I just want to know why sdxl decided to superimpose images into this? lol
Rendering time will be directly related to several things. Your processing power, settings such as utilizing RAM or CPU as well as VRAM, image size, extensions adding onto the process, sampler used, etc
Not really just an easy answer
or just paste like it's in paint or something. very strange
I broke my normal prompt into subject for base the rest of the descriptors in refiner
yeah, I sort of got the hang of that. but then I still don't know how to use the 3 prompts for positive and negative
how dou you even split the negative into 3 distinct group?
3 prompts?
yes, pos_g, pos_l, pos_r
just negative to everything
I have not even experienced g/l yet
it's in some of the node packs
I don't think it's necessary. but who knows. I haven't gotten great results with it
I'll really get into this when my 7900XTX is purchased at the end of the month
I've been experimenting with different combinations
Still extensively testing lol
60sec/it
does anyone here have a link to @high skiff 's workflow?
60 seconds per it?
yes
wth?
I was about to link it lol
beat me at my own game haha
thanks babe 💋
I know I know the answer to that
MmMmMmMmMm
you are paging out and it will get worse and worse
has anyone combined an unclip model with an interrogator? I'm curious how that would go
what is it?
Hey General, yes, i think too
My new 3090
Ah neat
I know so because I have a 1060 and got that all the time until I had extra pc ram as it was paging to the pagefile. god that was bad
these images keep getting more and more deepfried and it's really grinding my gears
this is why 7900XTX with 24 gigs will be nice
RocM 🤮
oh, no rocm is on windows now with hip support
Yeah, and with every new part of code you will wait 2 monthes
what ticks me off is not the rocm rather pytorch and tensorflow seemingly paid off to drag their heels on full spport
they are a hassle
I am happy to wait a lifetime over nvidia ever again.
done with Jensen, now if Jensen died and a new ceo arrives I would give Nvidia a go again, or if they return to sanity
They play their malovelent game I go somewhere else with glee. EVEN if a tad slower although it really isn't.
we happy to announce our new CEO : Jensen Jr.
abort
post birth abortion. Hey, I am 35. So? poof
Anyway I need a card as mine is dying on me and no way will I buy used and end up like Sytan or pay current 4k prices
Ive seen people doing gens with AMD cards, what about training, like you want to do?
yes, it trains but iffy because of pytorch and tensorflow although with hips it just uses cuda
buying used fragile cards like a 3090 is not a good idea yes
I have not had hips tested as it is far too new
I know this the 4090 is bottlenecked even with SD. Amazing
Elenore
I look at it like this. I need a card and I expect poster gamercon the MSRP to drop 10% on the 7900XTX, maybe 15%. ANd it trains faster than Colab and gens 35-40it/s on 2.1 on SDXL no idea
Love it 
cost aside, (for sell value), I'd still take the stang over any "supercar"
colab is horrid, btw, as the T4 is 1.0x s/it at BS1 for most training. That is about the speed my 1060 would do if it had 16GB of ram
@high skiff is there a reason why your template is only set at 25 steps?
I think the whole hips thing is something we download as I saw it was around 2gb
Thats the optimal threshold where additional steps don't seem to benefit quality much. In some instances, it can degrade quality
unless you're making hands/feet, it's the ideal performance/quality
ah got it , Is that in all of sdxl? i always used around 45 steps in 1.5
purposely reduced for all of sdxl. even if you do base only, no refiner, you'd still do 25 steps for most situations
thats my findings for the sampler/scheduler I use, which has yielded the best results for my workflow
Sytan has always been a 25 or less
Yeah steps depend on the sampler as well
Ah that is really good to know, probably saves on time too
for 1.5, I used 15
Anything higher was kinda pointless for DDIM
but SDXL does need 25, any less and it can get kinda wonky for DDIM
50 is the theoretical 'ideal' - but quality improvement is very minor, for double the generation time
Auto1111 no workie for ddim in XL
not surprising, Auto is even more of a shit show for SDXL haha
it is, I know
only thing I have it for is inpainting, as inpainting is very convoluted in Comfy
@high skiff Anyway, truly an amazing workflow, it looks so much better then the other templates i've tried
Glad you like it. Dozens of hours of testing adn 1000's of images went into getting it how it ended up.
I still have some improvements I will put out on a 1.1 update when I recover from the toll that the 1.0 crunch took on me 😅
i hope your testing becomes faster when you get your 3090
tho the benefit of the 3090 is for training LoRA's
lmfao
@fleet harnessHow did you even get that on your profile? lol
straight people
i think you can use dual gpu in stableswarm
I have no interest in using stable swarm, and I don't have case room for 2 GPU's
upgraded to a 8gb 3070 thinking it would be great, really regretting that about now lol
what did you come from?
1060 so it was a real upgrade
oh for sure
for how much didd you buy it for?
3090 is only so much faster than the 3080 cause it can run --gpu-only
Your work is truly appreciated, love using your workflow, thanks! 
Finally decided to try comfyui over A1111 and love it
350 euros so not to bad
That is the best thing ever to hear
So glad to get people into proper SDXL control and away from that dumpsterfire 1111 lol
Thats less than what I paid for a 3050 almost 2 years ago 
It was second hand tho, but still as good as new
That reminds me I still need to sell of my 3050 and the two old screens 
The one thing that came with this gen of AMD cards that I don't think anyone is taking advantage of is the AI accelerators. Only 7k cards have them.
RocM uses the Ai accelerator cores
rob NVIDIA
you know nvidia is putting low amount of vram in there on purpose so you have to upgrade again later 
for sure
very good as the 7900XTX has 192 of them I think it was
rocm is on windows now as is hips
Thats why I got a 4090 now, to be settled for a while 
only thing lacking is pytorch and tensorflow support for 7k cards
for a while = 6months
thats a solid while
I have 2, so maybe 12 months 
@high skiff Was trying ur workflow yesterday as well, ty for ur work. So far i did understand everything in there except one thing. You are feeding the clip text encoders with 2048 for width and height while the empty latent image is 1024x1024. I dont really know how the text encoders work, so thats a bit confusing for me. So, can u tell why u are using 2048 for the text encs?
A larger text encoder size can potentially improve image quality as it searches for higher resolution CLIP images. But if you scroll up, there was this whole discussion on what is a good text enc width/height. Nobody seems to have a definitive answer, so it is worth experimenting on.
hopefully if you use stablwswarm
i've got the thirst already. zombie for that silicon
If I remember you have a 4080. hows that going with XL?
i see, ty. I think i have to do a bit more research on how the text enc work, because i always thought the width/height has to match with latent image size to give proper results. my bad
very well!
can you use --gpu-only?
probably not
--gpu-only will max out a 4080 pretty hard
when I had my 3090, GPU only used about 16.8GB VRAM when idle
A solid bet is your image res, but experimenting can lead to maybe better or maybe worse results
generating a 2048x2560 image such as this on a 1080ti takes approx 250 seconds,
Sure its slower than a newer card but its hardly an eternity (and used 1080tis are cheap)
since it keeps the base, refiner, and TE's all in VRAM
you can even use seperate positive and negative encoder resolutions
ah ok for 4090 is basically it then
that is 2 extensions down
grabbed the manager first
I really like this manager
I just grabbed this but why, as I forgot now? https://github.com/LucianoCirino/efficiency-nodes-comfyui
One is at home and one in my personal rig so... yeah 
you have to disable the refiner if you're gonna use loras right?
Not in my workflow 
where workflow
who workflow?
tyou are using one as a door stopper?
havent released the version with lora yet
No im doing SD in the office and gaming at home 
just put a LoAD LORA node in between the Base Model & the Base Conditioners & First PassSampler
Dont put one intot the refiner step
SD in office?
lol
you work full time working with comfyui
my workflow has no conditioners it is just load model directly into the sampler
Not full time and not only with comfy but im doing AI stuff as well yes
cool
hey guys, anyone here interested in a Product Photography App using SDXL?? Check out out upcoming project: https://creativio.io
I also do 3D, video and photo editing as well as layouts for advertisments, catalouges and stuff
so your model doesnt have these sort of step in then to condition the plain text input using CLIP??
Thats why AI is very handy
fairy snuff 🙂
so on yours you would add a LORA Loader saround here , anything the "Model" goes to connect to the Modesl out of the LORA and same for the CLIP
Then connect Model Loader to the LORA LOader
Yeah. I am still trying to grasp refiner
what is the difference between strecgth model and strength clip?
When I changed strength clip, it did nothing at all. Just the strength model did. So I just leave cliip at 1 and bother with the model number
I'm looking at upgrading my RTX2070 to a 4090
Overtrained lora?
get 2x3090, thats better
OK tell me why? 🙂
if you use stableswarm its better, you can use multiple gpus
24gb+24gb vram
has anyone succeeded in training loras on 8gb vram?
100% XL base
So it'll need a completely rebuilt PC?
you can just get a gpu case
and a motherboard, if your existing doesnt have 6 pin pcie slots
Eau keigh
Sounds like a decent upgrade to me
pencil sketch is interesting ^_^
L8r
how strong the weights from text encoder are scaled (clip) and how strong the weights of the unet are scaled (model)
clip is textencoder, model is unet. A lot of loras train only the unet, so the clip part does nothing for those
models train the text-encoder too?
Spicy italian meatball
they can yes, they don't have to
in SDv1 it was pretty commonly found to be helpful to train it a bit. In SDXL the textenc is way more powerful so it's less needed
so if we want to add specific keywords into the model, we should mess up with the text encoder?
adding new keywords is a lil weird to do, generally Textual Inversion is the preferred approach on that
training the text encoder is more to affect how it reads inputs
if you're eg training an anime model over the base, you might want to train the textenc so it learns to interpret the anime-style tag-list input better
Does anyone have a colab script for running sdxl in automatic 1111? I'm a beginner and not very good in coding.
He’s really jazzed just to be there
what's the best practice for comfiyui to make a part of the workflow optional? is the only way to use two redirect nodes and cut the line between those two nodes?
what is a good prompt to make a dark photo? I have the feeling SDXL still has it's brightness issue. Cannot make a photo at night without lights
didn't they add the lora for that?
in general SDXL 1.0 makes images much brighter than SDXL 0.9 :/
hands are still the stuff of nightmares!
sdxl myself :d lora
lots of fun
Updated Workflow but still WIP, now with a switch to enable LORA loading, wildcard support with Impact node and a unique output name for images using the date and time together 
It created some strange decal on the car, but...a Lamborghini Huracan in Italy, came out okay. lol
did anyone try the SergeZT's controlnets for sdxl?
Is there any list of plugins that work with SDXL?
If you install the manager that has a list you can browse https://github.com/ltdrdata/ComfyUI-Manager
He is definitely entertained
Does anyone have a good colab script for running sdxl in automatic 1111? I'm a beginner and not very good in coding.
We back to this? 
well, can you argue with the results?

Very nice
Can you point me in the direction of those nodes?
I'm using wildcard files that have loras in them, so using the normal lora loaders is a no-go.
I hope the stability team @'s the entire server when ControlNet drops
hi
I've gotten my workflow into a that I feel good with, now all I need is just some dynamic poses and we're golden
do more of you guys use the LoRA_Easy_Training_Scripts ?
I use the difficult training scripts, it builds character 
sdxl LoRA training result is not bad
giant dwarf
Yeah, I was trying to train a LoRA on horror stuff but it ended up learning a high contrast style that does dark pretty well. Not mad about it, it was a happy little accident
really amazing stuff
With just a little inpainting I might use this a desktop wallpaper for a bit
is there any news on controlnet?
This one might be ready for an initial release here soon
this is the latest: #✨|sdxl message
cool, guess I'll have to wait a little bit
Might be a little gory for some, so marked it as a Spoiler, just a little blood and skulls
I'm not an artist, but this has gotta be some kind of contender for something imo 😍
why do i get something like this when use the prompt painting of?
hey it's you 🙂
great job man, Your loras look a lot better then mine for some reason. Thanks for all the tutorials
yep
thanks a lot
i am making a new tutorial for 12 gb vram
nice 🙂 thanks for all your work!
firehound!
Have you trained any style LoRA's or just LoRA's on your face? I've found they require quite different settings, though for likeness have only used my dogs to test
i was on vacation
so only face right now
next i am planning dreambooth sdxl hopefully
Looking forward to that, haven't been able to get dreambooth working properly yet, but have had little time to do so
@somber hill I'm just watching your video on Lora Training. Do you have by any chance a shared colab script to do it?
is it me or does sdxl not follow the prompts very well, like you say one thing and sd just does something else without elaborating
for example the prompt here was: labradoodle dog, doomsday, nuke, explosion | centered| key visual| intricate| highly detailed| breathtaking beauty| precise lineart| vibrant| comprehensive cinematic| Carne Griffiths| Conrad Roset
that does not look like a dog to me but idk
Lol yeah I see no labra doodle dog there
some words just overpower everything
yeah precise lineart and the artists are probably very strong tokens here
there should be a way of putting emphasis on certain words,
you can with (weighting:1.0). in SDXL it makes especially sense to de-emphasize tokens that you want less of
Select the word, hold control and use up or down arrrow keys to add or remove weight
see this for some examples how weighting works in ComfyUI:
https://blenderneko.github.io/ComfyUI-docs/Interface/Textprompts/?h=weighting#up-and-down-weighting
holy shit this changes everything
also remember that a prompt is being processed from the beginning to the end. so a token at the beginning has more strength than a token that comes after it. with weighting you can change the balance, but it's also a good thing to remember.
that's only true for one of the text encoders while the weighing affects both I believe (though not completely sure)
My Dog LoRA is up!
https://civitai.com/models/121064?modelVersionId=131716
https://www.reddit.com/r/StableDiffusion/comments/15g7obv/bestboynido_lora_now_available_under_19mb/
Introducing the best boy in the world! Nido! I immortalized one of my dogs' likeness with this LoRA! Nido is an absolute gentleman. When his sister...
0 votes and 1 comment so far on Reddit
whats the best way to run sdxl locally? iv heard sd webui is not recommended/buggy?
congrats to the release! now Nido is eternalized in the latent space 🙂
Hey I made a LoRA with your video as reference! Cheers man, I had great results!
Same! That LoRA I just posted in fact lol
Any info on open sourcing the discord dream bot like emad briefly mentioned?
What does the | (vertical bar) means in the prompts?
Lora's exists for a reason.. lol. I can't with these attempts of luffy lol
it's a generic thing @polar epoch
people who do realistic photographs dont tag their photos with "realistic photograph of a duck"
Damnit SD, monkey is his name, not his species 
so the ai doesnt really associate "realistic photograph" with a realistic photograph
there we go lol. Luffy D, exclude species to not screw up poor strawhat lol
'Realistic' is more a late 19th century painting style
"realistic" defines an artstyle. so ironically enough, 'realistic' pushes it in the direction of artwork 🤣
Hey, i want my 3d luffy realistic, but unreal ok? 
Now, lets take naruto luffy here and give him a actual photograph look :P
有人知道如何实现图生图吗?
with refine model,it's actually img2img
Heyo the server language is English, if I understand your question correctly img2img might be your solution
I want to use a photograph as a model, but how can the original photo be turned into a model?
How to implement img2img, if the first img is an old high school photo?
any advice for getting good text out of SDXL? My results are so so
A professional photograph of "Ken's Pizza" restaurant at early evening. Show the exterior with large windows, and feature a neon sign spelling "Ken's Pizza." Include some urban street elements for a lively scene.
well that's as close to a progress bar as I'll get peobably (just under the preview images)
*
NB save anyone asking workflow embedded in this image, dragndrop into COmfyUI*
does anyone know what the difference is here?
The embedded VAE version in each
ones got the sdxl1 vae baked in the other has the 0.9 vae baked in which fi8xes issues apparently for some occasions
So Comfy UI from Searge I don't need the embedded vae version?
personally I use the 0.9 vae sperately which is another option
or I might switch
depends on how bad the sunspot activity is
or the rain
pharaoh poker face
yes
It is recommended to use comfyui or webui to deploy SDXL. Which of these two is better?
I recommend Auto1111, others recommend ComfyUI, so use what you prefer.
there are multiple ways of using the SDXL model, ComfyUI,Automatic 1111, InvokeAI etc etc
Pick one any one, there is no right or worng answer
Its personal preference
That said I'm now a COmfyUI convert, its worth the learning curve
I'd recommend SD.next over A1111 though 😄
technically COmfyUI is a web UI
So is Autmatic 1111, so is Invoke, So is Vladmatic.....................
its like not all vacuum cleaners are hoovers or not all ball pont pens are Biros
Autmatic 1111 are running into issues
i cant really find a lot good use in the refiner at any place
I think it's good for photographic images.
not if you throw in any LORA
haven't tried any LORA with SDXL yet 😄
you should, it's awesome
and easy
installs like a piece of cake, too
I would. once I get a hang of the comfyui workflows 😅 won't be training any though.
so far, I've been stealing ppl's workflows and trying them out
Excepted if you don't understand half of settings
yeah dont "steal" mine its setup as a daily driver layout with all the workings out buried away#✨|sdxl message
Please, why not, I'm at the swimming pool with my children, they are having fun and I'm pissing myself off.
What did you use to embed the flow into the screenshot?
black magic lol
Thansk 🙂
Does anyone know if stable diffusion xl uses v-prediction or epsilon-prediction?
Help!
both |I think#
I seem to recall that the base uses one and the refiner the other but I could be wrong
or just plain confused lol
Where did you see that? I want to know what method base use. Thanks!
some YT video hidden away in my history lol
I think its in this one
In this video, I will compare the newly released SDXL 1.0 checkpoint to both base Stable Diffusion 1.5 and top tier Stable Diffusion 1.5 checkpoints to see how they compare.
I start out by discussing the architecture of SDXL compared with SD 1.5 to see how they compare on paper.
Afterwards, I show the results of extensive testing in Automatic1...
Wow, thank you so much!
hi, i have question - thats normal I have only 2-3 it/s when rendering with SDXL? its SD 1.5 I have like 9-15 it/s... GF 4080.
oh that's just the vae
it's a smaller model that you should use as a "VAE" in the options if you're in A1111
or load and plug in the VAE in Comfyui
0.9 Vae is baked in
that one is required if you don't want weird colouring around edges close up
oh now it's baked in?
nicee
so that's just an improved model, not a separate vae
get that one
The size of the model ending in vae is the same as that of the model without vae
I still don't understand the difference between the two.
@wraith tide
trying to squeeze as many details as possible out of SDXL... a little creepy but the details are impressive
What kind of images do you guys use for LoRa training on a concept?
i have only done a celebrity but i want to try a concept now
I need more vram 😦
they have different VAEs baked in. Just after launch of SDXL1 some people were rpeorting odd artifcats so initial advice was to DL & use the 0/9 VAE, thats now available as a baked in option
