#✨|sdxl
1 messages · Page 83 of 1
especially with a prompt as painfully simple as "medium shot of a handsome Indian man shopping at a busy street market in Bangladesh "
what is there to not get?
yeah that's a good one, i mean they all are looking away, good job on the photographers direction 😛
in the background
the other one it's like randomly place (more real life) but a good photographer is going to have that nice touch like the angled heads in yours
Why did one fail that?
what do you mean?
Both look like that prompt
yes, but look at the mangled people in the background of the first
specifically this lol
Where
Just a fat man in a robe
also, comjoined hands with a third arm
that a lora?
that's reality haha 😉
Both images have mangled hands and the basket is 4D
He got that Two 1st Graders Trying To Look Taller look.
i do think in that one i like your lora better
here we go, I have a new example
naw
Don't get it. Not arguing.. I just don't understand
less to fix and more pleasing on a 'golden ratio' perspective
(exterior cute bedroom:1.2) fisheye@alpine pine
I think man with grey beard is more handsome
maybe this one shows the coherency improvement a little better lol
Nohomo
medium shot of a pretty Nigerian woman sitting next to a fire pit
how are you training it to do that tho
Share your Lora!
something about that gen tho feels a bit 'pregnant' tho tbh, can't put my finger on it
Same seed?
if that doesn't show what the benefits are, I don't know what will
is it a negative lora ?
same everything
its a positive LoRA
and its nowhere near done, just early tests
Do you never get extra limbs with your Lora is the claim?
Both look like good images but extra limbs is an architectural issue
Time to go to bed and to the sea for 2 weeks. Se you soon!!
Fixing that means over fitting
I don't think anyone could claim to never get extra limbs
it's not about them being good images
base vs my LoRA
Close up photograph of an old man whos balding with a slight smile in black and white
how it feels using comfyUI
it's about the one being naturally lit
that and much more
Seems bodacious
in that last one, you're also pushing to a real photo and not hyperreal art
Training is fun and you can tune towards captions very well, but to insist you've improved the bases architectural shortcomings so early....
I have about 90 images right now, I wanna get to at least 200, preferably 400, and they are tagged by hand, so a decent while
i don't think he's saying that
just this 90 image one took about 3 hours to train
i think he's saying he's getting more out of his promt in a way that he can classify (that he is wanting)
I get ya
Its a LoRA, not a finetune
aren't you technically conditioning the clip to favor your way of thinking what a prompt should be
So a month and or slightly longer?
the whole point is better realism with less fuckinga round to find magic words to get SDXL base to look good
oh no, likely much less than that haha
Ah okay. rofl in the copter
I won't say any specific timeline, as I have bad health issues that have been flaring up
but likely before a month for sure
here is one from my old 60 image dataset that was undertrained
looks decently like a real person, rather than glossy plastic
One would think if all quality takes is a prompt, just save those magic prompts and add them as a style
Health issues can be the worst. especially when you know you might need to put into surgery knowing that it's for the best.
I think i could get it better, but I am happy with that example
Staff at sai keep saying prompting is king
its a hell of a lot more than just "a few words"
It changes massively from image to image
Staff at SAI say a lot of things.... lol
Big bit about that in the super stage event. They really went into detail about the breadth of prompts available to save as styles.
styles work when its a general one size fits all glove, but its not for SDXL realism
and I should know, I have gotten some incredible realism out of base SDXL, at my own peril of prompting
take away even a few words and it reverts back to plastic city
i think they purposefully left the base unfinished to help others train/lora it into what they want, hence the refiner, and them saying prompt is king, and to save them for being blamed for it being too good at bad things, and to save money
vs my LoRA that listens better to the prompt in under a quarter the prompt
"a medium close up shot of a pretty 20 year old woman with blonde hair and red lips being lit from the side by dappled sunlight through blinds"
I can agree with that, it seems reasonable
yeah and i don't mean unfinished with any bad connotation at all
in fact it does serve the public well
"A tiger in a forest"
SDXL base vs my LoRA
SDXL base does tend to favor matte paintings vs. photos.
@high skiffHave you tried any word type prompts?
X holding a sign that says "enter for X"?
also how long does it take for you guys to make a 2k image
2048x2048
takes like 8 minutes for me 😭
you mean pre-upscaling ?
Do you have an example of what you mean? Like can you provide a prompt example?
You asking because you want someone to give you a reason to upgrade?
no just curious how shit my pc is
bro i'll never be able to afford upgrading
i'm living off disability
neetbux
uhhh... with my workflow... about 30 seconds on a 3090
What GPU do you have?
imo the lora is a little 'cold' in photo temp
20-30 seconds for me for 4k
I kinda averaged it to neutral white, which I have not messed with prompting as of now
if it could know about that as well it'd be a plus, it's nothing photoshop can't fix but nice to have
Using AIT
let me see how well I can prompt for color temp
I’ve pushed to 8k in under a minute
💀
Not the greatest but should be able to handle SD 1.5 512x512 and maybe even SDXL 1024 with optimizations and patience.
im running sdxl rn
in comfy
normal image takes a minute
interesting, I can't seem to change the color balance
let me see if I can find a proper termp to help
its funny tho some dslr will sort of undershoot on auto color temp, so if we call it an error it errors on the right side
Sounds about right. I can get 20 sec/img on a 3060 but I have to use the FP16 VAE.
i got it a little more warm
my dataset is very neutrally color balanced, so I am not sure how to combat that
I am sure I could add in color temp tags
add images with color temps and label them
just do 6000k, 7000k, etc
sort of like i said before cameras often favor the cooler side, so not sure there
i was just going to say that ^
my guy is missing a leg 💀
it'd be interesting if there was a tool that took the average colors to a group of images and see if they are favoring any way
he's half stepped in and half stepped out bruh. like on the edge of the liminal
shift is almost over
we did it
yooo cant wait for this one to upscale
it really catches alex's style
shmoies magazine do be gettin crazy tbh
its a lora
do you guys ever click to drag comfy ui, but acidentally click a group title and drag it, then you move it back and it takes a bunch of other nodes with it?
i gotta use space more
i just realized my ability to read is as good as SDXLs ability to write
😂
yeah, i tend to pin things now
I would contend that the image on the far right is probably more realistic than most women post on IG/FB/X etc
this is why I like https://github.com/failfa-st/failfast-comfyui-extensions, you can pin all & freeze all from a TH click context menu
🤔
you're actually using an alex lora! nice. is there one?
i can dl
If you like what I do please consider supporting me on Patreon and contributing your ideas to my future projects! Alex Grey art style lora for SDXL...
Is SDXL Lora training possible with gtx 1080 (8GB VRAM) ?
I know I can do it in 12 but my friends asks
is that stan lee? lmao
the man in the back lol
I have a 1080ti with 11Gb vram and whilst I happily use it to generate images the thought of trying to train a LOTA on it has mostly passed me by.
he's got the biggest bills of them all
he didn't pay taxes
decided to give a little water gun gen a try with my realism LoRA lol
lol
machine pistol 5
not as wet as I'd like
nice
Oh, I bet this thing does fire guns
its just a little moist
this is how my workflow does waterguns
it does
I've gotta admit... thats a pretty damn good glock lmao
impressive
dang even went for the logos
im waiting for your DROP
its not gonna be any time soon
its not well trained
I need a much bigger dataset and more time
Yeah my favorite thing about AIT is the reduced temps and power usage
but I mean
dang, thats really good
trying others
oh what movie is that from?
the corporate catalogue look ;o)
yeah, it's a breakthrough. I'm generating stuff like this in a few seconds
I kinda like how my stuff follows that prompt better, but nice!
Well, we generate LoRAs from a single image and OH EM GHEEEE
I really put 0 effort into the prompts on these
(LAST image in the field is the original)
SDXL is crazy
more than one to skin a cat
same
what is AIT?
how you getting that beauty of a txt file winston/
like, aesthetics are completely opinionable, but speed is factual. my AIT workflow allows me to generate this in a few seconds
what is AIT 
tensorRT's big mama
This was orginally written by: https://github.com/hlky - GitHub - FizzleDorf/AIT at sdxl
All this down in the basement of my workflow
I will have to try it sometime
https://github.com/facebookincubator/AITemplate
bit words on here that says things i dont understand
I actually built a workflow designed specifically for AIT. this image took my 4070ti 12-ish seconds to make.
very interesting
AITemplate is a game changer
20 seconds to MUCHO HIGH RESOLUTION with 3090
yeah, it's actually insane
i'm going to have to throw all i know about steps out the window i think
since everytime i try to not use your workflow, my images blow
lol
you can change as much stuff as you want with the workflow where it comes to prompts, ARs, etc.. but the sampling settings are crucial
oh (((^* , just realised its not putting all of it in there, thats a later today job, gone midnight off too bed
baulders gate 3 is out now an has some great visuals. been thinking bout that lately in the context of training xD
but then I realise, all it is is cos these 2 arent joined
thats better, all present and correct
Good Lord man. That's a lot of prompt beef. 😂
I only typed in this bit lol
I'm using @high skiff s terminology ;o)
cant wait for SDXXXL to have base prompt L,base prompt G,Base prompt P ,Base prompt F,Negative support,Negative base,negative global,positive global,positive linguistic,positive L,positive S,positive plus,
You forgot the pre conditioning options
Reading through what was discussed just earlier on AIT, I also did my own tests and found there is a slight degradation in coherency.
The top row is my normal workflow.
The bottom is the same setting and seeds but with AIT
would that be the CPU/GPU seed generation thing?
Did we ever get any confirmed proof from the code that CLIP-G is "linguistic prompt" and CLIP-L is "supporting terms" or was that determined by guesswork?
first of all thanks for doing this test! I'm working on it and will share as soon as I have some results.
would you be able to share some specs about your test? what gpu, os and what sampler settings?
I was wondering the same if there are any trade-offs.
WIn 10, 3080, dpmpp_sde_gpu
I tried various steps and found the same result
so not sure if its maybe my setup or something else. Was wondering if anyone had done something similar
The other thing I found is that AIT doesnt like a fixed seed for Base and Refiner. I had to use random seed for refiner or it gets all messed up
there were problems with xformers too but a lot of stuff was optimized
very interesting
very likely. I built the AIT workflow with GPU ARG in mind.
where can I change that?
so my go-to sampler is dpmpp_sde_gpu karras and I noticed at least a 80%-100% speed increase. but dpmpp_2m karras is even faster and @indigo carbon thankfully shared the AIT optimized workflow and I have since then worked with dpmpp_2m and I think the results are great.
but I need to do A/B testing shortly
cool. will be interesting to see what you get
ComfyUI\nodes.py noise = torch.randn(latent_image.size(), dtype=latent_image.dtype, layout=latent_image.layout, generator=torch.manual_seed(seed), device="cpu") change this line to this: noise = torch.randn(latent_image.size(), dtype=latent_image.dtype, layout=latent_image.layout, generator=torch.cuda.manual_seed(seed), device=device)
I don't have any clue how this technically works so I can only speculate if there are side effects we don't know yet - and maybe only with certain setups and settings
this line tells ComfyUI how to generate an empty latent image from seed. default is CPU, but with GPU quality is somewhat higher IMO.
guys, I put a video to generate in the deforum, the video has 13 seconds, and it has been generating for two hours, a 920x500 video, is the delay time normal?
I will try some sample test with that later today
@indigo carbon Can you send me the AIT optimized workflow or an image that uses it?
dpmpp_2m with Karras enabled is very good with SDXL, especially with AIT
that's what's used in my workflow
2m is quite good and much faster but I generally see better results with sde_gpu
the trade off is its slower
yeah I'm glad I'm given it a chance again 🙂 thank you!
my pleasure. I'm glad you guys enjoy the workflow I made =]
Lots of experimenting left to do
getting this error lol "Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor"
Been finding best consistency with dpmpp 2p with Kara’s
screenshot? this might be the preview error
maybe let me see
it was lmao my bad
holy shit this is fast
Lol
i blinked and it was done
if you want live preview use ComfyUI_manager's live preview, only one that works with AIT
That’s what we’ve been saying
Ok so I'm using the workflow but getting the same speeds.. Is there anything that I need to download or do to get AIT to work? I haven't been in the discord server in about a week.
https://github.com/facebookincubator/AITemplate
bit words on here that says things i dont understand
that's odd.. does the console say something like: "module not found"?
no... everything is there. I just don't know what AIT is... Is it part of the workflow itself?
the first generation with AIT is always slower. do you have the AIT nodes?
Not sure.... These both say disable?
that's how it should be. I don't understand why doesn't it do that boost for you... what GPU are you using?
Got clone the link into custom folder
4090
Nvm looks like you got past that step
nah, you should be getting atleast a 2x boost. this is strange
are you on linux maybe?
that doesn't make sense.. it should be either 11.x or 12.x
11.8 is great
no issues with that
does your Comfy setup have any tweaks?
Are you running with command line args?
@strong field you have a 3090 right? @spring fulcrum should get at the very least the same boost as you get
Yes 3090 here
that's docker?
I searched in nodes.py but cant see that exact line to change. I only see this that is similar
device = comfy.model_management.get_torch_device()
latent_image = latent["samples"]
if disable_noise:
noise = torch.zeros(latent_image.size(), dtype=latent_image.dtype, layout=latent_image.layout, device="cpu")```
What’s the standalone build commands for?
It was in there when I installed it... idk
oh, right, bat file.. your flags are weird. I don't use any flags, it might interfere with AIT
I do not use any modifiers either
what speeds do you get with this build? what it/s?
I have no other explanation why it doesn't give you that boost if that's not it.. maybe your behind a few ComfyUI commits? @spring fulcrum
I have a 4070TI, and it's at least 9.5it/s, sometimes over 10. overall generation time is 12-16 seconds
its all good
its not like its slow... I was just thinking it would be something crazy like 30 it/s
8 is still fine
it should be WAY faster than 8 if your on a 4090.. @upbeat summit uses a 4090 and it's way faster than that
maybe I should just do a completely new install of ComfyUI
try to do a few gens, the first one is always slower @spring fulcrum
I did like 10
odd..
shouldn't be like that
likely a problem on your end. most likely due to Comfy flags or even you might be a few commits behind.
have you tried
pip install -r requirements.txt in the comfy folder?
also this. a 4090 should be way faster than a 4070ti with AIT, I don't understand if the problem isn't an inference issue
also yes, this line
Did you guys use the portable comfyui with nightly?
Is there something to see the image metadata more consistently? For now, I'm just using strings
no
are you on pytorch nightly @spring fulcrum ?
no but I was wondering if that might work better?
no, it won't work at all
it might actually be you pytorch version, you need pytorch2.0.1+cu118
if that's not it it's 100% the flags you are using, and maybe even a few commits behind
this is what I get on my 3080
that's way faster than without AIT on your 3080, right?
how do i check my pytorch and cu version?
miles faster... its crazy fast
i figured
@spring fulcrum go to the main directory of ComfyUI and run pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
if this is not the torch version you are using that might be the issue
it says already satisfied
I don't understand what tf is wrong then. maybe things are running in the background?
do pip list torch
its shows everything
thats a long list are you looking for anything specific?
This SDXL Monster Lora is cool: https://civitai.com/models/86377
Makes some freaky S*** thought you would like it 🙂
what is that command line to update that
for reference Im also using nightly versions of torch and xformers
this should also work
to update torch run pip3 install torch --index-url https://download.pytorch.org/whl/cu118
force reinstall then, I forgot what was the force reinstall flag, let me check
pip uninstall torch
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
has anyone tried these? it seems they are controlNet models for SDXL? https://huggingface.co/SargeZT/t2i-adapter-sdxl-multi/tree/main
2.0.1 is what you need, on the list you sent a few moments ago it said 2.0
see the last line in the ss
you need VENV for this I'm afraid
make a venv in ComfyUI dir and run comfy with the venv, it should resolve all this on it's own
idk how to do that. I never use conda
not conda, python
go to ComfyUI main directory, run python -m venv venv
after you did that, add call "DIRECTORY\venv\scripts\activate.bat to your batch file and remove unnecessary flags @spring fulcrum
Hello everyone,
I'd like to know if I can participate in the SDXL image contest on civitai.com using the samples generated by my bot here in the room. Unfortunately, I can't use the XL with my 2070 GPU as the model is too heavy for an 8-gig GPU. I've been getting great results using the bot, and it would be awesome to be able to use them in the contest. However, I'm unsure how to obtain the information that the bot uses in my outputs, such as seeds and other relevant details present in my images. This information is necessary for participating in the contest.
Thanks for your help!
fairly sure the contest requires meta data parameters on the image which the bot doesnt attach, so that wont be possible
this is what it looks like when I start it up
I don't see any reference to venv. Is that normal?
yes, it's active
oh wait, no
I understand, that's what I thought. The bot doesn't provide the essential metadata required to participate in the contest, which is a shame. A new GPU would be amazing.
have you tried ComfyUI? 8gb is possible
yes, that's good
?
your system ram is barely enough, probably why its slow. Comfy will use 20gb of system ram
You should be able to do comfy with 8gb, maybe a tweaked workflow but possible nonetheless
The system ram is 128GB DDR5
that's amazing
oh yeah, I read the number wrong...lol
you removed a zero..
silly numbers tricks are for pure fire lol
I did
I tried with automatic 1111 and my rendering became extremely slow, so I ended up uninstalling the XL model. I have never used ComfyUI, and I couldn't find any clear tutorials in Brazilian Portuguese to teach me how to use it.
Ok Ill be back later to mess this up further... I need to eat.. havent eaten anything since I woke up this morning.
I forgot to eat
@spring fulcrum mad lad that spec is fire
yeah, easily some of the best specs
that's why I'm confused why AIT didn't boost @spring fulcrum's performance by atleast 2x
Check dem specs... I'm going to go eat something.
not often you see 128gb.. very nice
This SDXL ControlNet works with ComfyUI out of the box: https://huggingface.co/diffusers/controlnet-canny-sdxl-1.0
Only has Canny right now.
Yes I downloaded that one already, but there are other ones in the link I posted, but don't know if they are working really.
also they are way lighter-weight
I read they work standalone but there is more to do to get them compatible with Automatic1111 or ComfyUI and the developer is working on InvokeAI intergartion first.
@spring fulcrum I need to go. If you would still need help, @upbeat summit might help you if they're up for it, they also got a 4090 and the AIT workflow I built works for them flawlessly.
I will try to install SDXL and ComfyUI. Do I need to remove my previous files and also automatic 1111 from my system to proceed with the XL installation?
nah make a new folder and install comfy there. its separate
thanks buddy
also, symlink the model folders aww baby
@upbeat summit I changed the nodes.py file as Tdg8uU mentioned from "cpu" to "device" and Im seeing better results, for your reference. The faces are a lot better
Very interesting. I should check that out as well. Thanks for letting me know!
that does look a lot better
Oh wow I haven’t done that yet but I will be when I get back to command center
what do you mean? what is this
its related to AITemplate
mentioned a little bit back in the conversation on what to change
you mean this #✨|sdxl message, right?
currently in nodes.py
line 1154
noise = torch.zeros...
and change that to
noise = torch.randn...
fancy lads
yeah because the code was different. all I changed was the last part where it says CPU, to device
ah ok. just cpu = gpu
yeah I was wondering to replace the whole line
so you have the same as Td?
I have noise = torch.zeros
yep same
Im not a coder so not sure what it is. but logically cpu to device would make sense
it makes sense, i wont pretend to know like i know what half those lines are for
which cpu do I change? found 6 of them. just hit em all and see what happens?
Line 397: latent = safetensors.torch.load_file(latent_path, device="cpu")
Line 857: def __init__(self, device="cpu"):
Line 1154: noise = torch.zeros(latent_image.size(), dtype=latent_image.dtype, layout=latent_image.layout, device="cpu")
Line 1268: i = 255. * image.cpu().numpy()
Line 1327: mask = torch.zeros((64,64), dtype=torch.float32, device="cpu")
Line 1374: mask = torch.zeros((64,64), dtype=torch.float32, device="cpu")
hmm... yeah I don't know how any of this science fiction technology works, but it's a revolution :D. I'm not quite sure why you get so much differences in quality though because of that change
yours didnt?
how do I force them all to run on my onboard video?
doing it now and will compare seeds - if it's possible. the seed might be totally different
1154
awesome, thanks1
yeah it definitely has a different seed between AIT and non AIT. Other than that Im not sure what its doing. Maybe its possible that all the images were just bad seeds for the comparison?
hmm... I don't think so. it could be an unstable prompt build
but if you are now doing good images all the time with the same prompt, it must have something to do with it
or you relaunched comfyui and it was something else
tech is complicated 😄
lol it can be
testing now
well it sure didnt break anything changing 1154 lol
faces are worlds better
impressive
but i gotta say this is hilarious
sdxl HILARIOUS with it
cool, so its not just my imagination
yeah i was generating trash for faces earlier testing
going back to old prompts to compare, this is almost no prompt fu
i do think mine slowed down ever so slightly
from 6.9it/s to 6.44it/s
Hey guys. Can anyone confirm to me whether or not this (Text:1.2), is the same as ((Text)) This?
Or do they have slightly different results when in use?
so... not a big deal
in theory or in practice hehe
i believe they are handled differently
https://civitai.com/models/126239?modelVersionId=137968 just upload character rola model on civitai and huggingface hosted inference API
this is a realistic sdxl 1.0 base model trained lora of maggie Q,first of all,i training 68 raw images of maggie Q with kohya python scripts on 16G...
Like in practise. I have used both in the the past. But i have a feeling that ((())) These gave me different results to (:1.0) these.
hmmm - I guess it must already work in gpu mode on my system?
really really minor difference in this one.
Windows 11, 4090, dpmpp_2m karras, 70 steps, cfg 8 (that's high I know, but it works with some prompts)
doing some more tests
1: device="cpu" / 2: device="gpu"
i like using the (:1.0), i have found better control
it likes to ignore my ((whatever))
yh. thx. it had me wondering. lol
my faces are defnitely better
I need to head out. I will be back a little later
ttyl!
yeah I totally believe you
guys, why not change all the cpus to device? push the limits
guess you could always just git pull if you break it
I made a new LoRA for perfect faces
@indigo carbon @upbeat summit @heady vale Ok so with the venv setup I'm getting like 12 it/s @upbeat summit does that sound about right or are you still getting higher numbers than that?
im not liking refiner tho
what's your gpu?
on a 4090 that seems about right?
4090
that's awesome 😄
thats from 8 to 12, thats decent
@upbeat summit they said you have a 4090... How many it/s are you getting?
those outfits probably cost 20k or something
I don't know if stable diffusion could outdo what kanye just wears in real life
so with the Tdg8uU's unaltered workflow it takes ~17 - 18 seconds per generation
One image, 80 epochs, SDXL is quite good at learning.-
oh... mine is at 14 seconds per image.... I was just wondering what the it/s was while it was generating?
so does changing that cpu to device do it for all the samplers, or only that particular one?
nah bruh, i think i got u
beats me - probably all of them
those are his pajamas
how did all the cool adidas sneakers turn into these ugoh things he marketed?
narcissism
i started buying nikes cause they're still doing sneakers
and various other mental illnesses
no but, people buy them. thats why the whole market shifted for the product
i heard that they sold more than air jordans
who knows. I like the ridiculous old school nike yeezys
thats insane to me
Is there any documentations somewhere that talks about all prompt commands for SDXL 1.0?
Just wondering
I'm super new to this LOL
the red octobers
I'm back =]
look like some 1998 k-mart charles barkleys or something
@spring fulcrum did you fix it?
this isnt official documentation but a solid start
SUNNY’S SOLID PROMPTING LIST Ahahahahahahhahahaha, have fun :< Last Updated: 8/11/2023 Some of you might recall,not too long ago, that I made two installments of prompting for SDXL, with the first being here. This, dear friends, is the third, and I am about to open this jam jar right open and ...
@strong fieldThanks a lot!
converse used to be a kmart brand lol
I think so... I'm getting like 12 it/s now maybe it could still be better
i still cheer those kids on when i see them with their litemup heelies
12it/s sounds right. I think that's the same as @upbeat summit 's speed?
I remember an episode of cops where dude was trying to run from them with the light up shoes
I ran a test with just this image and its embedded workflow by Tdg8uU
11.56 it/s on base model
10.75 it/s on refiner model
Prompt executed in 14.87 seconds
Windows 11 22H2, 4090 (driver 536.67), latest ComfyUI and AIT build (20230812)
@spring fulcrum this is what you should be seeing if you have set it up right
Seems like I am on point now
great! it worked =]
nice - have fun 😄
I'm not sure I quite understand the civitai prompt submission contest. am I allowed to do anything other than just base/refiner submit? I mean, I want a 4090
I think my numbers might even be a little better... but not by much its really close
shit this too funny
too soon?
you need to share the workflow - which will be funny if you use wildcards
also they probably just want a lot of workflows and new metadata
Its a ton better than the 5 it/s I was getting before doing this
amazing tech 🙂
no doubt its about the data
yeah, it's an AI breakthrough if I ever saw one
over 2x boost for you it seems
ya its really awesome... thank you for the help with that 🙂
added some info to my post, just in case for later referencing
huh, I'll add this to my documentation. interesting how much faster AIT gets with pytorch 2.0.1
how do I update to pytorch 2.0.1
?
I still think I'm on the wrong pytorch version btw... I think it was the change to venv that made the difference here
pip uninstall torch
Comfy I2I is now public if anyone would like to play around with it. Please let me know how it works for you.
you're already on it, the reason they had issues is because they weren't on 2.0.1
ah - it makes sense now. I started with a fresh comfyui setup for AIT
what about bitsandbytes? I forget why I uninstalled it. I had reasons
seems to work fine without it
some nodes wanted it
but then it made something else angry
so just uninstalled it
@indigo carbon used the aitemplate nodes with a custom workflow i am playing with. The flow is pretty much the same (without upscaling)...i just have a lot more primitive nodes to control other nodes input. the only thing that is a bit different is, that i am using ksampler adv instead of simple and i am using the sdxl clip encoders (for base and refiner). with this setup i "only" get ~6.5it/s (base) ~4.41it/s (refiner) instead of the ~12 with your workflow. i am wondering what exactly is making your workflow that faster now?! is it using the simple clip tenc for base? or using the simple ksampler? im confused ....
since I'm testing a lot of fine-tunings has anybody done tests with X/Y plotting (efficiency nodes etc) in ComfyUI? Or is it still better to use a1111 for it?
yeah i do my testing with those
lots of setup time but once you get a workflow going its easy to spot what you like
I think for major xy plotting just run it when you're doing something else
set it to run when you go to the store or something
yeah overnight for that
I have my outputs folder synced to my google drive so I can watch the progress from afar 
man ive been trying to get the --listen args to work on my home network so i can just watch from my phone but cant get it done
I did it! I finished installing SDXL and also ComfyUI. I'm not sure what magic they did here, as the render time is much faster compared to automatic 1111. I just conducted my first test, and everything worked perfectly. I'm really excited to be able to use XL in a local host setup. Thank you for the suggestion; ComfyUI seems amazing. As someone coming from 3D, I'm accustomed to nodes, so this is great. Thanks to everyone for the help!
hardly even worth loading comfy on your phone
yeah. I mean x/y plotting is very important to find out if a fine-tuning does what you want. I need plots to help test stuff and variations of models.
so in a1111 it's super easy and I'm thinking about doing a setup in ComfyUI but I have not really checked it out yet and I'm not sure if it makes sense or just to switch to a1111 for the plots.
unless your has some sort of capabilities that mine doesn't
Is this yours by any chance? If so, would you be able to export to HTML? Google docs loads content after the page loads and loses scroll position 😦
not mine, the owner hangs around in ehre
its a WIP
you don't need SDXL clip encode. the ones built into ComfyUI are way better. main difference is - the SDXL ones are made for multiple positives, which is a bad idea IMO. one positive prompt seems to make better, faster results.
okay let met try that. my custom wf uses only one prompt as well. it is just attached to _g and _l...ill replace it and do another check
as I understand it there are use cases to seperate the G and the L, but for most of us it's probably not beneficial
still, I do it, lol
can you do XY with AIT though? I havent found batch is supported yet
what do you guys do for the refiner though? that's the magic question in my book
would you say it's lot harder in ComfyUI to run x/y plots or are you getting all the options you need?
I've been sending both prompts, combining them, and overwhelming the refiner, lol
You don't need to do batches with XY, you just create each image in sequence.
maximim quality
a1111 is def easier to get up and going
but comfy just makes every little option available
i like to tinker 🤷♂️
I legit cannot wrap my head around the comfy hate on reddit. it's like hating on photoshop or ableton or something
but if they were both free
pass latent without VAEs in between. so much better and more efficient this way
because you want to use ms paint
lol i thought it was batching this whole time
don't mean to sound like a dumbo, but how do I do that? I'm not sure I have VAEs in between though
are you using efficiency nodes to create the plots?
i started with the efficiency workflows on their page
IMO, the best option for people that hate nodes is ComfyBox, which is just like A1111- but it's only a frontend
yeah and their nodes I suppose
If you don't like people disliking ComfyUI, don't compare the alternatives to MS Paint please.
yes
thanks! I will check it out
they are actually really really good haha
@indigo carbon ah i think i found at least one factor, what makes it slower...i am using the sdxl clip tenc because i am passing in a custom widht height (2048x2048 --> taken from sytans workflow). If i replace the tenc node with a simple one, i cant pass in custom sizes anymore though ...
From what I understand, it's supposed to apply Comfyui Metadata, but it's a new feature on civitai, and works terribly inconsistent. I see some people's posts with nodes applied (Comfyui Metadata), but I haven't gotten it to work yet either.
I don't dislike people disliking it. that's fine. it's their trash attitudes and entitlement that gets me. but I guess that's the world we're in
My first render using SDXL and also ComfyUI. Thanks guys for the support.
XY doesnt want to load SDXL, hmm, interesting
@upbeat summit
heres one of my tests, but cant get it to work with sdxl for some reason
good plot! ok I will try to set it up shortly
interestingly enough AIT works with SD1.5 checkpoints lol
time to go hunting
this is what i get with sdxl
I think because sdxl is not supported yet https://github.com/LucianoCirino/efficiency-nodes-comfyui/issues/69
ayyy thanks for the info, i didnt bother to look and i was about to start googling pooled output lol
this... hmm.. makes me sad now 😉
but I can run it without refiner?! maybe?
but for simple basic stuff like samplers/schedulers/etc. StableswarmUI has a grid generator. U also can bind your custom comfyui workflow to it....works pretty well for me and i don't need to switch to a1111
ah, thanks for the tip!
thats a good idea, do they already have a workflow built or do you just drop in a comfy?
I need plots and grids - I can't help it heh
lol
i think i know why it wont work with sdxl, but i gotta go diving into this monstrosity
You can use comfyui as backend (self started or your own instance via api). by default it uses a built in comfyui workflow, BUT you can provide your own workflow. the ui will adapt the parameters of your workflow. it may take a bit to get used to it, but i like it more and more and it has the benefit, that you can use your own comfyui workflow
How do you install AIT and is it compatible with RTX 2060 ?
I think thats SM75 and I think AIT is SM80 only
yeah id definitely rather try this than spelunking in efficiency_nodes.py
Error occurred when executing MiDaS Depth Approximation:
MiDaS_Depth_Approx.midas_approx() missing 1 required positional argument: 'image'>>> What is wrong?
yeah try it. at least when you want to have x/y grids it is very handy. u still can use basic comfyui for your standard stuff or switch between them whenever u like. i think thats what i like so much about it ^^
kanye goin crazy with the new yeezy's
Someone asked him to put out some frogskins.
https://www.reddit.com/r/StableDiffusion/comments/15oeymz/depth_sdxl_controlnet_coming_soon_brace_yourselves/ i've been braced for hours
like, how much time do you need to call brace for?

Same prompt, different seed. I can still see Seagal here. 🤣
Auto1111 doesn't support ddim for SDXL, just use one sampler and stick with it, there is no point really to test lots of different samplers.
I remember now! I wanted to add the date time the filename, but your time string did not include the three letter month abbreviations. If you add %Y-%b-%d %H:%M:%S to 'time_format', in line 257 of your Python script will it allow for the abbreviations?
Yeah I agree once you do the tests then no need to do them again
Maybe some step plots but def not samplers and schedules
huggingface inference API developed by Rust run fast
Best Seagal frog I could make
I can still see it.
that armour suit is awesome
is this you? 🙂
yes
Can someone explain how you do the two step workflow with img2img? Its not working for me for some reason
let's put this in the living room
indeed. these are interesting
beautiful work
might need some refinement
thanks! 
strange hodgepodge of loras and styles
those hands, lol
divine steven seagal
It can't be bargained with, it can't be reasoned with. It doesn't feel pity or remorse or fear and it absolutely will not stop
Except when they roll out the buffet.
You could hear it in his whispery voice. "Remember, I go first."
"It's four or nothing!"
@upbeat summit solo detailed (ukj:1.4) (sci fi goblin:1.2), upper body portrait photography, (sitting anthropomorphic creature by Jim Henson, sci fi rogue, Mos Eisley cantina in Star Wars (1977) by Jim Henson, bokeh, cafe booth:1.3), (masterpiece, highest quality, extreme resolution character art:1.3), (photon mapping, physically based rendering, global illumination, area light, indirect lighting, transparency, reflection:1.2)
I like it when people for real go nuts over the buffet. that might be a quality theme for some images
buffet wars
battle at the buffet
I guess I should run that face detailer after some of these. although not sure how much it'd really help
wish I could figure out why I can't load all the models with it
lots of options with this one
Darth Seagal holds the lightsaber by the blade.
that hypertension
But Darth Seagal, that buffet was supposed to be for everyone. By eating so much, you've altered the portions.
Darth Seagal: Pray I do not alter them further.
Redness isn't always hypertension
that looks real
Help us Obi Wan Seagal. You're our only hope.
Judge Seagal. "You know what. I am the law."
I like to believe that everything we make here comes to be in another universe
and that there's a steven seagal universe somewhere
The Seagalverse.
yes, like when john malkovich went into his own mind in being john malkovich
they just say "Seagal" to each other over and over.
theres an universe where seagal is our earth president
because that's the only word they need
Judge Jedi: "I am....the force!"
Steven Seagal went in for the role of Kingpin. They told him he could get it if he lost 100 LBS.
Palpatine: "At last, we have our revenge on the....WTF!!!!!????"
Anyone know why I would be getting shit results?
top 10 chewbacca quotes
my favorite is "burrrUrrrrr!!!"
Thanks for sharing it! I remixed the prompt a bit and used NightVision XL as a model
Rowwrrrrrrrrrrrrrrrrrrrrrrr reeerrrrrrrrrrrrrrrrrrrrrrrrrrrrrrrr grrrrrrrrrrrr!
gorgeous, care to show prompt? I am going to try that model next
of course!
hyperrealism photo portrait of a solo detailed (ukj:1.4) (sci fi goblin:1.2), upper body portrait photography, (sitting anthropomorphic creature by Jim Henson, sci fi rogue, Mos Eisley cantina in Star Wars (1977) by Jim Henson, bokeh, cafe booth:1.3), , face symmetry, intricate accurate details, cinematic color grading, cinematic, artstation, 8K
NightVision XL makes it so realistic - it's crazy
(solo:1.3), detailed bald (ukj:1.4), space jockey, (wearing square black spectacles:1.3), (upper body portrait character photography:1.2), facing the camera, crossed arms, (alien sci fi horror film 1987, by Donato Giancola and Anna Liwanag, ornate, hyper realistic, bokeh, alien mothership interior:1.3), (masterpiece, highest quality, cinematic still:1.3), (photon mapping, physically based rendering, global illumination, area light, indirect lighting, transparency:1.2),
original video of : Star Wars Minus Williams - Throne Room
star wars
original Chewbacca scream
original Chewbacca cry
original Chewbacca screech
we can rebuild him we have the technology
@upbeat summit your prompt, base sdxl
pretty cool 🙂 the prompt build is very powerful
what model, what sampler/scheduler, steps? what aspect ratio/resolution
Would all those apply to how there's multiple shitty arms?
/ legs
I just want some cool dinos/
There's multiple eyes on this...
@upbeat summit how to get nightvision xl? I see it referenced in DynaVision XL description
hmmm, might be a prompt thing
It's not yet released. I'm currently testing it for SoCalGuitarist
😢
prehistoric Brachiosaurus, by Daniel F Gerhartz, greg rutkowski, asher brown durand, Johan grenier, Anna Dittman
Out loud laugh. 4 eyes are you!
not quite Seagal likeness + inflated lora
this is what im getting
What's the prompts?
If I wrote a script to generate an image a few hours after I typed the prompt, it wouldn't be a prompt anymore
not sure which workflow you using but this is what im getting with your prompt exacly
lol
SDXtinct
you might have good luck with the Searge workflow, lots of good optimizations there
ffs
Maybe add " bad arms" to negative?
Thanks will give all those a try,
T-Rex: "Very funny!"
cool
So, it takes about 3x longer to generate, I can live with that, but it's throwing an error forget exactly what it was but it's not able to remove the "metatdata", any ideas?
Some sort of meta data flag, I don't know enough to know if its a property in the workflow or not.
well hard to say without knowing what node it is
or what you're using
seems like a node issue to me
Searge workflow
you should update ComfyUI to the latest version
I like that you low key have pipes in your node pack and I'm just now realizing it
yeah, makes for a cleaner workflow
pipes are the future. now just need someone to come up with collapsable group nodes. nested nodes are alright, but really doesn't do it for me. it'd be cool to have them all in place but then minimized into a node until you need access
Thank you that was it.
can't really complain as I'm not the one making these things, and so don't want to sound needy or ungrateful
FINAL question, f
I think I saw an extension that adds the ability to group parts of your graph into a single node
nested nodes? or something else?
yeah
For Searge wf, do I have to use an image / how could I not use it as a prompt?
you only need those for img2img and inpainting, in txt2img they are ignored
the readme file explains it well
Okay I assumed since it looks completely diff than the outputs I was getting:
Yeah it's just been a long day on my side will read tomorrow.
really just waiting on that advanced sampler now. although I haven't quite figured out how to set all the parameters in those
which sampler?
advanced sampler. how to set up the start from and stop steps, as well as add noise and return with leftover noise
I kind of just wing it and it works. but might be better approaches
ah, I see. I just made my own sampler for SDXL to take care of it
yeah. I normally use that. but then if I want to add a second round of sampling is it ever reasonable to start the base steps after the refiner steps in the previous sampler? so say first one goes 30 steps, 24 base/6 refiner. is it ever a good idea to start another round of base at 30 steps and then refiner at 54?
hopefully that makes sense
Not sure, just experiment with it. I tried so many different variations I can't remember anymore which ones were best. I just took the setup that made the best images and turned it into a node.
that's fair. I do a lot of weird experimentation. now just need to step it up and start experimenting with making some nodes myself
yours are pretty on point though, I must say. they're my go to default nodes
nice! which ones are yours?
sdxlmixsampler. For introduction https://civitai.com/models/108594/sdxlmixsampler-comfyui-jason-node-workflow-included
To support my work, you could buy me a coffee https://www.buymeacoffee.com/JasonAICreator SDXLMixSampler 1.1 The node updated to accept loop:number...
I might have that. haven't had the time to really check out all the things yet
I really wish I could figure out how to get this sort of control over cfg using comfy. one thing I miss from a1111
I've tried to vary cfg using a cosine function, but kind of hard to really do with the refiner. not sure how that would work. and then not exactly sure how mimic cfg works either
you could replicate it with nodes, but it can get messy. I'd just implement that in a node
would an external node be able to have the sampler's cfg change over the course of the steps? or would it need to be implemented within the sampler?
that is awesome.. Did you use a different model or lora for that?
default XL and prompt
wow that is awesome
those hands
well anyway, I realize there's a lot more to it than just the cfg. should probably learn how mimic cfg works before trying to make anything myself
And I really like the results that I can get if I prompt in a non-sensical way, like feeding unrelated and completely different prompts into each of the 2 clip models
I tried thing like putting a description of a fantasy scene in the main prompt, prompting for the end of the universe in the secondary prompt and mixing in some disney style cyberpunk as style prompt
I've been using sdxl prompt styler. use the same base prompt but feed it through different stylers for the G and the L
@hardy cipher do you have any lora settings suggestions, all my loras haven't been through
been through? not quite sure what you mean
in general my approach is to only add one lora at a time and then see how it impacts the overall structure of things
default is to start with the set last layer node and then the noise offset lora, then build from there
normally more trial and error than anything else. some things just don't work together. other things that seem like they shouldn't work together do well together. and once you get a feel for how a particular lora changes things you can guage if it's overbearing in a particular setup
i mean, it didnt work out
you suggest fidling with the settings?
yeah, just add one at a time and feel out how it interacts with your model and prompt. and then add another when you're satisfied with the first. some of them vibe well together, others don't. you just have to feel it out and no one right answer really
oh ok, but i want my lora to be perfect so that it can generate the face even using the simplest settings
@hardy cipher why do you have so many loras
added them one by one to see how they worked out
I wouldn't suggest that many if you're going for photorealistic
you willl probably have a number of other similar errors relating to other bits and bobs as not all the "enablers" are turnd on. Even says on the next lline "ourour will be ignored"
Should still have run ok though
I think I've inadvertently been using a 1.5 lora to make a bunch of these images, lol. but it works somehow?
its either working or (and more likely IMHO at this point) just being ignnored
well getting errors like this
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.0.attn2.to_v.weight shape '[640, 2048]' is invalid for input of size 983040
ERROR diffusion_model.output_blocks.5.1.transformer_blocks.0.attn2.to_out.0.weight shape '[640, 640]' is invalid for input of size 1638400
I'll drop the weight to 0 and see if it changes the output
The COmfyRoll LORA Loader has an "odd" switch 🙂
ugh, still getting that error either way. doesn't seem to be ruining the output, but curious what's causing it
If its not doing anything to the output then its a warnning not an error (even though it says error) lol
maybe because some loras are trained on 640 resolution?
makes sense. I guess I'll just figure it out through trial and error. not sure how else to do it
yup. I loaded up a lora made for 2.1 and got a whole screen of the same diffusion errors
Guys, this is my first day with SDXL and ComfyUI. Everything is new to me since I was used to the 1111 interface. Sharing my first promising result. I can hardly believe my 2070 Super is doing all of this!
this was using a tarot card lora for 2.1
nice one
starting to wonder if it's the model itself
Thank you for encouraging me to use ComfyUI buddy!
bypassed all but the noise offset lora and same error
hahaa
nice
I'll run another model and see what's up. if not that it's some kind of crazy mystery I guess
not the models, not the lora. ugh. wtf? I guess I could just ignore the errors. but I don't like taking that approach
geez mr 14 pack
look how he eats
valid point
This is going to be a lot of fun. Friends, currently I only have 4x_NMKD-Siax_200K as an upscaler model, do you recommend anything else?
That's my preferred one
I use ultrasharp normally. is that one better?
I not tested ultrasharp
goodness, so many node updates. that's probably my issue
I tested a ton a while ago, just personally liked the siax one the best
NMKD Siax is usually better , yep
i tested 4x nmkd siax 200k on anime and it works there too,its a great upscaler
always forget I need to fecth updates in order to see them
Thanks guys, I'll download them to test
I haven't explored them much. prior to using comfy really just used a1111 defaults. esrgan 4x, anime6x, esrgan5billion, etc
theres also one called "1x_NMKDDetoon_97500_G" it removes the 2D or anime like effect from some pics
these crazy names
thanks guys for hard support
is there an "update all" option I'm missing with node packs? or do I just have to click each one every 30 seconds?
Btw I can see in your image that you might be using the 1.0 vae, I would use the 0.9. It doesn’t have those weird artifacts when you zoom in
some of us might like weird artifacts
Ur right my bad😂 I’m just sayin as a helpful tip is all
alright, countless nodes updated, very exciting
I like how the manager says update so I click update, and then it says it needs updated just like before
thanks buddy
you guys ever go to huggingface.com instead of huggingface.co?
a real feast for the eyes, let me tell you
not even sure where to get the 0.9 vae
gracias
So as a demo to the question that was been asked somewhere at some point.......... @high skiff I believe is doing "research" on that.
Does changing the target sizes in th Text Condirioners make a difference?
I give you 3examples using same seed/promppt etc which from Left to Right are
a) a factor of x1 (ie same h&w as latent space)
b) a factor of x2
c) a factor of x4
Now to m y eye all 3 are pleasing (although I do have a slight preferecne for b) simply because the beer in the glass is a nicer colour)
As my observation, higher value would give the model more freedom to generate the image. Incorrect value would scale the latent to the actual image.
just redownload SDXL. They provide it now with the 0.9 vae
"incorrect value" hmmm that almost makes it feel like saying " this is wrong" (and I'm not syaing thats what you are saying, Im just having an early morning not had a bacon & egg sandwich yet ramble) /
ILike many things related to SD I don't think there is such a thing as anyone being inherently wrong.
Yes to my eye there is a difference but again (to my eye) all 3 are acceptable
no need to download and configure a separate vae
Anybody using this at all? https://youtu.be/H5103u5uRII
We're building the MEGAZORD of image generation power. AUTO1111 and ComfyUI unite with the sd-webui-comfyui extension from ModelSurge. Do you use AUTOMATIC1111’s Stable Diffusion webui? Do you want to add the power of node-based AI workflows to the tools you already use? We're committing STABLE INCEPTION today on Building Dreams.
📌 Links:
sd-we...
Sorry for the misleading term, "Incorrect value" means empty latent image with w1024h832 and the target width and heights set as w4096h4096. The ratio of target width/height doesnt match with the output image.
and to that end I have now added a "Conditioner Scale" input to my workflow (attached) which sits in between the Height & Width Inputs to the Text Conditioners to ensure all Aspect Ratios are kept aligned tegardless of which scaling factor I choose to select 🙂
You might want to test more on non sequre image like 4:3, 3:4, etc. Sometime exact same value works better than 2x, 4x
when you follow bing's advice on fixing an error, 
luckily I know LLMs are psychopathic make believers so backed up the file beforehand
tbf I'm no longer getting any of those errors, so I guess it technically did what I asked it to
it does look totally usable
Werf!
What is your LoRA trained on? Looks very much like base sdxl
I don't mean the composition but most other aspects of it
Its trained on realism, and more specifically portrait photographs
I have several different epochs of it which affect the images differently, from small changes to realistic lighting to huge changes to composition
Base SDXL vs my super light realism LoRA
that LoRA is only 1/10th of the actual final LoRA
1/1th ?
Isnt that like 100 % ??
;o)
you could just have said 10% in the first place...............
10% makes it sound like its the final LoRA at strength .1
its only 1/10th the actual training
so its 10% trained?
hehehehe
basically, yeah
thing is its far more consistent once you get say a quarter of the way in to say "25%" than to have to siwitch units because you can't ereally xpress 1/4 as x/10
percentages are so much easier to work with, same as celsius and mm 🙂
yes, but it also interferes with LoRA strength, which is also extremely important
If you can't 1.0 a LoRA it was trained badly, thats my opinion
if I say "my LoRA at 10%"
People are gonna assume its the lora at only 10% strength
this can be the case sometimes, but not others
says the man who lives ina country where we pay for fuel by the litre but express fuel consumption in miles per gallon (thats proper gallons not the smaller American ones)
bur where beer & milk can still be sold in pints
Don't get me wrong there are uses for other strenghts but the ideal training should have the LoRA "working" at a strength of 1, regardless of if its the desired results.
the final Epoch 30 of my LoRA runs good at about 0.75 strength, but it still has a very different look that is beneficial over the properly fitted previous versions at 1.0
So its not really a one size fits all
its insane how different every 3 epochs of my LoRA look
also, the validation images are fucking hilarious lol
gotta love validation images lol
so goofy lol
which one?
I like the second one more cause of the contrast, but I think they could both use some work for realsitic skin and stuff
second one on the RH has been upscaled
yeah
Spoiler as its a zombie with blood on it and missing skin, so click at your own discretion
pretty dope
I trained a lora on zombies and found out sdxl already knows them pretty well lol
moral is, check your subjects, styles and concepts before training
Ive gone down the corporate rabbit hole lol
Just found out the gligen doesn't work with sdxl 😦 anyone got any other alternatives which allow to place certain objects at specific places for sdxl?
Photoshop? As far as AI though, we don't have segmentation models yet I don't think
Sad.. too lazy to use manual work so just gonna use area composition and latent diffusion for now as alternatives
technically I guess the woman is next to the man
i just found some lora named faces so used it
@high skiff Would be very intresting to know how your dataset is captioned.
I am still experimenting with it
I have plans for another set of changes for my next training
Where is this node type (seed With text) from?
Install missing custom nodes does not find it.
Even if it's not final still get a glimpse of direction
its honestly just like bulk tags
Wish you could run SDXL in reverse, like BLIP
Let me give you a clue!!
doubt it as its not a slef contained script, needs the core comfy etc to run
there's this here:
https://github.com/comfyanonymous/ComfyUI/blob/master/script_examples/basic_api_example.py
but i cant get it to run
did you enable api?
I stopped using that one and now just use the web hook method listed there