#✨|sdxl
1 messages · Page 94 of 1
either way, it sucks for me that Comfy is on its way to become the standard because I have a shittier experience with it
I wish - not demand, just wish - that something like a111 was funded instead
such a childish response
Voldy def has deals going on. Hosts like civit might throw cash his way to build the community up
I hope they all make money, a1111 people included
Be the change you want to see. It's not a childish idea at all. It's something children often can't grasp
Open source software really relies on people doing it themselves
Sorry, you think it's realistic for me to hire a dev to work full time just because I'd have prefered if a different thing was done
should I make my own Samsung when I prefer for them to not take the sd card away, come on
it's an extremely lucrative industry that is going to explode with time. so hoping it does trickle down to the developers of these UIs
all of them
Python coding is far from android engineering
lol
is the base of what youre upset about that comfy has a payroll so now he's constantly updating his software 24/7 because he's being paid to do so and a1 isn't doing the same? i dont think this has any merit if so
it's pretty high level
I'm starting to learn python just so I can make nodes that don't seem to be available
the base of what I am 'upset' about is that Comfy is likely to become the standard in part due to a push by Stability and that I don;t like to work with Comfy
that's it
I could have come in here and complained those nodes don't exist
Comfy made node API so simple an nice
compare comfy's activity on comfyui: https://github.com/comfyanonymous/ComfyUI/activity?ref=master
with a1:
https://github.com/AUTOMATIC1111/stable-diffusion-webui/activity?ref=dev
they're both active. they're both good devs. one being hired on by SAI does not change anything for you
man, I have to say, he really thought of everything
surely some glitches and stuff still
It's just not stability's issue that voldy hasnt built auto effectively
but the fact that it's even put together as well as it is
confusing perspectives
voldy is actually SDnext, a1's main dev is actually a1
you know who's a real hoot? thelastben
if youre referring to something else tho, disregard
colab a1111, lol
whichever one becomes the standard will get the extensions first so that changes things for me. Now seperately you can argue that funding devs for a project doesnt mean more work is done on that project but that seems either naive or too pessimistic
When I resize a reroute node then connect to it, comfyui resizes it back to default. Completely unusable..what am I even paying for????
Can we all just agree that development takes time, sdxl is still fairly new and it’s gunna take time to get all the qol features we need for sdxl in comfy and a1111. Ai is moving faster than we can develope software for it😂
we live in a strange society
That we do..
could you explain how this works? I guess I don't understand very well
those of you that aren't happy with A1111 not properly supporting SDXL whilst at teh same tme moaning about ComfyUI would be better off IMHO just getting a MJ subscription
when you hire people to spend more time to work on something, more time is spent on that thing on average
this take is even sillier. How does getting a MJ subscription make it easier for me to use things like deforum or ControlNet
you do realize comfy isn't releasing most of the extensions and what not right?
neither is a1111
they're largely 3rd party
devs are more likely to release extensions on top of whatever UIs are most popular, typically on the most popular first/only
they take care of core implementation
during 1.5 era a ton of things got released only for a111 and maybe later for some other ones with less hours spent on it because users were on a111
wooohooo give that poster a gold star for noticijng the abusridty ingherent in the statement lol
still lots of things only on a1111
OK, well we all get it. You're upset something you like isn't working anymore for you.
Adapt, or stick with it.
my favorite extension ever still stuck on a1111. but I don't complain too much about it
I've actually looked into replicating it but I'm not smart enough, lol
ComfyUI didnt exist back then , It only launched back in May or June of this year.
which one if you don't mind sharing?
I mean same, and it's because a111 was the most popular. It'll just be annoying for me if the next deforum or something else I end up liking is stuck on Comfy next time around. that's all
well here's the thing, tenoke, with swarmui you can use comfy on the backend and their a1111-esque front end if you're not into that. you can also use the front end with comfy workflows. so essentially you could find a workflow that uses whatever tool you want, load into swarm, and then you never really have to look at it. maybe that's not ideal for you, but lots of moving parts at this point. so kind of hard to have it all clean and streamlined for everyone
This is just an irrational fomo
Visual scripting is what creative work needs. Node editors are huge for this field.
That's why most studio software has some kind of node graph
A111 vs Comfyui conversation likely happen everyday. Why don't people just use whatever they love and do not need to care about another tools.
sure a better UI built on top of Comfy (whether swarm or something else) is my main hope for comfort if Comfy becomes the standard
Comfy was always made to be a back end. There are a half dozen different front ends for it already
@polar epoch
Wow, does it for ip adapter?
more LoRAs? or do you think 3 is enough?
yeah, lets just ignore all the tools ;o)
Forgive my naivity, what is clipvision? Seen the node, haven't seen what it's for
I currently I use up to 3 loras, but my old TI embed workflow went up to 8 :D. you can always chain more together right?
Sorry, I don't know.
as far as I can tell its for loading standalon clip models
well yeah. I'm just wondering if I should make a deluxe 10 LoRA version
for the big dawgs
Curious under what circumstances that would be utilized
well now theres a question..........
afaik clipvision models are used for style transfers
As in?
add a specific model with a specific style to your image gen
I think it is available for a1111 for some time but I haven't tried it
ah gotcha, sounds like something I'll have to do some more reading up on
so many questions
Where do the Clips come from
where do style models come from
Tried following that node path myself at one point and the undefined selections was as far as I got lol
"I'll look at that another day"
there is no way this could be made to work. Is there??
No clue, personally don't even know what models goes where to change those undefined selections
I don't know if it is related. https://github.com/tencent-ailab/IP-Adapter
is that jimmy carter?
Jimmy Flobber and his Meatball Cavern
there's this video by SECourses showing it as a ControlNet addon in a1111: https://www.youtube.com/watch?v=tXaQAkOgezQ
Are more repeats the way the model learns more detail in individual image? My model so far still doesn't understand textures properly and im using only 5 repeats.
ive went back months before when i needed information on very specific things lmao
i just checked my computer today and that was the first message on my screen lol
hmmm, I do recall doing that once or twice in A1111 with style transfer now that I see that. Will have to see about that in Comfy now sometime
Ok these daily checkpoint releases are killing me. Comcast is gonna be knocking my door down
6 gigs here, 6 gigs there, pretty soon your talking about a lot of data
I won't say how my bandwidth looks like then 😅
Copax has gone from 1.0 to 4.2 in 2 weeks
lol
Pretty sure I’ve snagged every one of those
Delivering bits luckily costs Comcast nothing
But they have you convinced it's a huge load
And yet, they have data caps everywhere
doesn't mean comcast will let you just do it, lol
Corporate propaganda wins again
Also there’s no marginal costs but building out the system does cost capital
not so much propaganda as it is virtual monopolies for a lot of these companies
And if everyone downloads 2-3 6 gig models nightly, they will run out of bandwidth at some point
Another gooder. Listerine was house cleaner before they convinced everyone their breath is nasty all the time without it
your choices are xfinity or dsl that's 5 times faster than dialup
So now we all swish with house cleaner
I can get an ATT fiber line but it’s slower than xfinity I believe
never used mouth wash nastiness
I keep yelling at my local fiber ISP to PLEASE expand to my area, but they don’t want to trench neighborhood lines if att already serves
It does seem unnatural but propaganda sells it
I lived in a deadzone for a while that only had at&t dsl. I could barely watch youtube videos, lol
There is some really questionable swimwear happening in the background there lol
Government here paid Telus to lay fiber. I live in the deep woods and have fiber. But Telus tries to convince me it costed them money to lay
well of course
Yeah, red shirt at the beach? I'd never
Corporations love to socialize their expenses
privatized profits, socialized expenses
Sick of perfect AI Images? Then use this Lora to make some terrible FanArt! Weights 0.8-1 If you want to donate: https://ko-fi.com/proomptengineer
this is brilliant
We're live with ComfyUI Ep3! Join us at #1005545148211527800 and #1029055412764422214 🔴
my sd keeps telling me memory error when creating above 512x512 resolotuion, is it due to my 1060 gtx with 6gb or is there a way to allow me to create higher resolutions even with this low tier graphics card ?
Don't think 6gb cuts it
I can do 512 resolution with the model i downloaded, and someome here told me sdxl is also possible but i wonder what settings i can change to compensate low vram ?
--lowvram
It generated 2 people
how to make it only 1?
cant see prompt, try 1boy or 1man
how about 2 people for negative prompt?
or make resolution to be tall like 512*768
Hmm okay
@craggy ibex i think it will not work.
My friend sent me his oc picture
he want me to use an ai to turn him to different art style (or this stable diffusion)
probably controlnet is beter for this, dont know how it works in comfui
Yeah thanks for reminding me 6gb is low in 2023
Idek what is the best img2img workflow
Holy shit I remember when 4gb was a lot
that could magically turn some characters to anime style like magically
Like that tiktok user...
Idk how..
Civitai has a lot of workflows
I didn't know i2i was a thing on comfy btw
Not work
the prompy 1man not work
so try taller layout.
Set resolution lower, then upscale when you have something you like. Put (1man,solo) at beginning of prompt
Also add descriptors for the background, SD will redraw prompts with higher CFG and empty space
is there a preprocessor for softedge sdxl?
How to upscale?
lol
@craggy ibex do you have A1111 still installed? Probably with controlnet and older models.
you are working in A1111?
o.k. i forgot, what do you want to add remove?
Can you fucking generate better?
for fucking sake.
Why am I always have skill issue in Stable Diffusion
I don't fucking get it
Anger is never the answer, young one.
I have 0 help for this ngl
Yeah I know
but I'm very jealous of someone who can do this
This is impossible
I need to find an answer
I like this training as it has that Ralph Bakshi 1970s look to them.
Welp, my masterpiece is finally done, lol
I wish I can do something like yall do :(
Prompting is everything and negative prompting even more than that.
Yes.
did that but only allows 512 , how much do u need for 2048x2048 ?
is that your own model/lora? i love it
I have 6gb of ram and I have no problem with larger resolutions. if you're using a1111 that might be the issue currently
naw
How to upscale?
I like this one
I've been playing around with this style trying to get it right
Such a disappointment
Do I need to know of how to coding?
no
in order to create perfect ai generated image?
there is no perfect, my friend
It's 3 AM in here
It's all in prompting and matching to a style of checkpoint you find fits your style preferences
I can't sleep until it's not beautiful yet.
FUCK YOU
Load upscale with model node, load upscaler node, plug the two with image input and output, done
FUCK YOU
one day you'll be able to make beauty like me
Looks like your doing image to image with high denoise values, lower denoise value or do what I said
I’m telling you what to do
Stretched beyond XL's limits
I'll decrease to 0.8
No to much. Decrease it to 0.25 or 0.3
but it's impossible for A.I. to generate perfect hands
I like that art style, btw.
Okay just ignore me then whatever😂 no help for you
often it's a luck of the seed
Has this been Harry on ketamine?
think of it like photography. if you're going for "perfection" there will be lots of fails before you get a win
Your welcome. Next time maybe take some help from others rather than say fuck you with a picture🤣 I gave you two options to upscale and got ignored lol
No I would use a upscaler model with the upscale with model node
#✨|sdxl message. It’s like this if this makes sense
Discord is the easiest way to communicate over voice, video, and text. Chat, hang out, and stay close with your friends and communities.
Alright
Guys
I made a video, using the A.I images of SDXL, it's a sad story, do you want to see it?
It's a 1 minute video
If I don't cry, I'll be disappointed
Hope I will get angry issue after this
I think you will laugh instead 🤣
but let's see..i m uploading it here
It is so bad, that I am cringing right now. I need suggestions how to improve
Here is it guys ^
Need suggestions, how to improve
It's a sad story, that I tried making using A.I
Like this?
awful story
How can I improve?
ask an llm to write a story, then make images based off each section
stick with an art style
Yes now drag the upscale model point to the left and add a upscale model loader node. Then if you have one they will show up, if you don’t look up modeldb in google and find one you like. And add it to the upscaler models folder in your comfyui directory. And it’ll show up.
i see, you mean chatgpt for a story, and images based off each section, but only 1 art style?
Yeah, that's one option
Yup! You got it. That’s upscaling!
You’d hit queue like normal, and no no sampler.
That would just do a basic upscale with upscale model, there’s also other upscalers made for more anime style such as that one ultra sharp is a good general one though
@tiny walrus gotcha
Are there any prompt to prevent this?
It's REALLY fucking annoying
booru life
IMMA DO IT
IMMA CLICK IT
As soon as someone responds to this with an image of a ship
We'll release our ControlNets
Send it
It's ALIIIIIVVVVE

YESSS
collect all 4! or 8 if you want both sizes!
I'm getting the big size, it's got the other size in it too
Revision is a novel approach of using images to prompt SDXL.
👀
any plans on doing something like this with comfy? https://github.com/mcmonkeyprojects/sd-dynamic-thresholding
Wait those aren't controlnets
what could they be 
The Good Place was such a fun show
haha loved it
Woo hoo!! https://www.reddit.com/r/StableDiffusion/comments/15uwomn/stability_releases_controlloras_efficient/?
I reimplemented this as a node some while back for some testing. Could prob clean it up and push it out if thats fine with you @wicked frigate
@hard fractal
i should probably redo it and release it myself. there's been feature updates since then
it was my most favoritest a1111 extension
which, uh, have been largely ignored cause they've been on auto lol
So will this help me improve my noodle ships? 😛
All yours, mine wasnt the best implementation anyways haha
@hardy cipher follow this git thread https://github.com/mcmonkeyprojects/sd-dynamic-thresholding/issues/54 i keep meaning to reply but forgetting cause busy and/or sick, will post there when it's up
Give it a try 😀
So there are only 4 CNs or are the others gonna drop in few days?
I think it's too Gore for this server
no
Unless its generated through SD
used it to make stuff like this with 1.5 in a1111
And now some minor qol nodes: https://github.com/Stability-AI/stability-ComfyUI-nodes
No guarantee on others atm though these base ones are pretty nice. We have some in progress but atm we're looking to switch up our training system for em to work more direct with the control-lora format to try and improve the process a bit
edit: nevermind it works now
sweet, thanks! I understand life takes priority over such things. but, man, I really had a lot of fun with that one
So it might take few weeks?
the second i posted that, it was fixed 😅
Can't really put a timestamp, its a bit r&d atm. Could be much sooner for some
Im the meantime it would be great to hear how people like this new format
I will for sure cheers nice release, been checking out canny / depth / pose so far for XL glad to give this a spin and compsare. 🙂
Ah, got it
Can we expect a lora which specializes in QR code generators?
Excited to hear what you think 🙂
these 4 are the major ones for sure, i think the only other controlnet that i used often was openpose
Openpose is og, it can get perfect hands
Got it on the list, love those qr/luminance ones. Some out there prob have nice data for it already so a regular ol qr one might get trained out in the community first. Focus here is trying to improve the format first (control-lora) to make this all easier into the future
man, the cfg and mimic cfg scheduling stuff really does some cool stuff when combined with noise offset and what not. still have no idea how mimic cfg really works beyond how it impacts render. but really great tool
So you are basically working on the algo atm
I've messed with qr codes a bit. only some of the squares are important. that's the key
or vital rather
Yea so these new ones aren't "controlnets", they are basically full encoder weight diffs (loras for all weights) to allow the models own encoder to be its own controlnet so you dont need a seperate model
QR codes are fun but barcores some more usefull to me.
It's too technical for me
People scan QR codes but not barcodes
For users it will be the same on the interface, just means smaller files and less vram required
Embed barcode all over a object almost not visible, scanning never fails, win win?
Barcodes are only used by businesses
they are controlnets just that the weight of the controlnets are stored as a kind of lora
That's kinda better considering SDXL vram consumption
yeah the main goal was reducing vram usage
say, where does one get this CLIPVisionAsPooled node?
what sort of vram requirements? I apologize if that's been stated already
use this workflow
I'm a bit deficient in that department
@sour obsidian which text encoder does clipg use?
It doesn't use blip/clip?
It would be cool if you added a switch to select what model to load on what cuda device and then also select a checkbox that allowd to keep the model into VRAM.
what's clip vision?
Clip g is the text encoder 🙂
Were SDXL images trained on blip/clip or was it trained on completely diff captions which were long and a bit a more detailed?
Why is it used?
It enhances the clip encoding?
so is clip vision like unclip? or am I misunderstanding how it works?
does comfy looks for the controlnet lora from the lora folder or the controlnet folder?
Ultimately generating bit more relavent images?
yeah the community canny/depth are basically instantly obsolete now lol
lol
XL was training on clip l + clip g text embeddings made from the captioned images
The new revision is something new. Like unclip but far more conceptual vs literal. We are still exploring it as well
awesome. I always liked unclip tbh. but then it kind of did some wacky stuff sometimes
i wish model pages will come with sample workflows as well 🙂
Oh unless you meant actual clip vision. Its part of the clip model as clip was designed to compare text/image similarly
There should be some 🙂
(Or were you commenting that all should come with workflows, in which case I totally agree 😎)
dropping crazy new things right before the weekend
rule #1: always release on a friday so if there's any complaints you can run and hide during the weekend
release on friday so it gives the weekend to play 😛
brilliant!
mm, very strange unrelated results... 🙂
I hate it when the powers to be decide to release a new software package on a Friday for something thats mission critical (which at least this isn't)
i've taken these to pics 🙂 guitar and burger...
unclippin
intersting nipples for Superman
i dont get it 🙂
lol
just the concept...
it helkps if you click reply on a message so people have got a chnace of understanding what you're referring too
sorry, it's late here and im trying to wrap my head about everything
its late(ish) here as well 🙂
@upbeat summit nice!
any text or just that?
I left the text from comfy's workflow in beautiful photograph 😄
Was new stuff released today? If so can someone mass link it?
How do i get SDXL?
internet
download the model from civit or hugging face
And how do i use model?
Just tell me one way
Okay thanks
yeah - the one example I posted worked great. but now I'm only getting random results. not sure yet how to control this
@visual glade are there preprocessor nodes for canny and the rest?
i can't see the ones that ar ein the samples.
do I describe my input images like in img2img?
if you actually want to use it on your own you'll need to download some software, and you'd need to learn some things. so might want to read up on it a bit. can't really explain it all here
What is the clip_g for? Is it just to have it work?
or do I need input images the clipvision model can understand?
@upbeat summit can't wait to see your images with sdxl
hmmm she go "boom" boss
Yeaaaah that's all I'm getting
did you update comfy after?
make sure you are updated
hmm works great here - updated comfyui earlier
oh not in the last couple hours haha
anyone was able to run canny edge?
*SIGH here we go again, the curse and the blessing of OpenSource, everything being out of date if you blink!!!
earlier i took the burger into community canny
I had issues with comfyroll this last update and it was because I never installed ComfyUIManager correctly. Didn’t show up as a problem until the most recent update >.> Was totally user error in the end in my case.
and did this
redwall, just ran update .bat file
i now want sushi burgers
Seems to be tiunning COmfys example workflow with red walling after I updated
prompt: cinematic still
ah shiet, updated my wrong instance
Hi KaliYuga 🙂
control lora is working with auto1111 ?
howdy 🙂
could you install the stability nodes?
i dont have the preprocssors
I ahven't gone through everything yet, so unsure
I dont know what I expected really lol
context works out.
the context is that Comfys example workflow
with my images n lol
naaa, the context is the struggle of the mind between the inner joker and the bat.
for that I'll share them 🙂
when it works it works!
so this should be a Blend kind of option?
like Midj'?
I stopped paying attention for a couple days and new things appear 😮
couple hours is all it takes
Well thats a better result if I dont uise the basic SDXL1 model.
Not sure wherre this little darling came from given the input images though lol
such a surprise in this scene 😄
Do you need a model for revision?
They talk about it on the Hugging face page, but I don't see a model for it.
dont blink 🙂
the only thing you need is CLIP-G Vision: https://huggingface.co/comfyanonymous/clip_vision_g/tree/main
put it in your ComfyUI/models/clip_vision/ folder
Ok, seems like maybe an oversight on the Hugging Face Model card then?
As it tells you about it, but doesn't explain how to use it
Weeping Angels, Classic episodes of New Who
color grade and pattern to jacket transfer
yeah it's going to be added to the huggingface page
sorry to nag, what about the rest of the nodes? we want to play with canny edge
for canny there's a node in the base install
Now do it with her skin instead. 😉
is there a way for us to know what the clip vision sees?
like a preview for what ever input it passes?
Answers on a Postcard as to what this will produce??
Manbat. 😄
spoiler ;o)
amazing
id eat that
no prompt
I am excited to see all the combos you all come up with 😄
any tips for noise_augmentation?
ah I think thats actually just a vestigial piece from a real unclip. Dont believe it has an effect here. It was meant as a sort of feathering between clip img and a clip txt embedding made with a prior model (real unclip). Correct me if it is doing something though @visual glade
id just leave it at 0 since these are operating on their own
anyone can help me with the settings for canny?
im trying to understand why my preprocessor can't "see" the image and how to setup the low and high thresholds
thank you 🙂
I don't think I'm using it right lmao
noise augmentation will have an effect but it's not useful in this case
so it should be zero
Generally it just converts is to a white background I believe
It was just an image I had in my downloads folder
great experiment
Ah bummer figured it would use it as a mask
Yeah I think if you use in in img2img it just white backgrounds it
what is your prompt if I may ask?
every time I get beautiful landscapes my settings aren't dialed in yet, right? 😄
As something simple such as: a guitar made out of PBC, and motherboard traces and computer parts
will do.
hey this is pretty fun
IF you want the van gogh one try: a very abstract painting of guitar in the style of Vincent van Gogh, starry night backdrop
ok so you are customizing the prompt depending on your input images
Thats how to use CN right?
I guess so 😄
playing with the strength values a lot right now
and what differences are there where the image is placed and how it's all connected in comfy's workflow
honestly might be better than prompting.
I've no idea what im doing, where I'm goig etc (other than too bed) but........
dat order though
i just zerod out the conditions and it started working without generating women lol.
I personally found 1.75 on the unclip node when combining with text input tends to give a nice balance between em 🙂 might not be perfect in all cases though
but which model?
which node?
because I just looked at how it's all connected and there's of course a difference between both unclip conditioning nodes (how they are connected)
ah this would be for single image input. so an example might be spiderman text input + starry night image as the one img input to revision. For two it might be worth bumping them both up a touch since text tends to over ride the image input at base values when not 0ing it out
single input... gotta try this 😄
thng is you can Combine, Concat (which seems to be the default in the example provided) or Average the Conditioning .
Meanwhile zeroed out the top 2 text inputs and got this
thanks for explaining! lots of experiments to do heh
you could always prompt the "new" way
. Revision can read text 👀
thinks about 3..no 4...no 5! image inputs
this is going to rurn into a classic Oil Type talking point about what is allegedly the best way to do something.
Which example workflow should we be using, there seems to be multiple
this is amazing!
ive noticed that it takes elements in a weird way. i could definitely tell how it would be able to do this. this clip vision thing is very cool.
yea its pretty interesting to play around with. its very conceptual, merging the concepts as a whole vs just the image features
my mind was also blown when @runic granite tested the text input the first time with something about a tree and a tree showed up haha
yeah! thats a way to put it. i honestly would have had a hard time describing it but that was well put
so when do we get subdirectory support in /ComfyUI/input/ - this will get a lot of new content 😄
toss a coint
To your Witcher?
yea we noticed single words seem tougher for some reason. So just dog would rarely work, but a cute dog showed up much more often
like these outputs are so insanely good its like a whole new way of prompting. with images!
should we get a dataset of texts on images? ---- wait i've got an idea!
haha it might help a good bit. There is likely some good potential in tuning things for revision specifically
I just loaded the lyrics in as the positive prompt and got this
zero out the conditions if you dont care to prompt
like this. and it works great
haha, sometimes 🤣
No prompt for my Woody and Mona Lisa merge
Ah. The word bubble around it makes it think it's a comic book
yea its neat how it interpreted it
yea, i guess i dont get it
did you zero out the conditions? or do you have to prompt? if prompting, describe exactly what you want
ah yea youre just 0ing out the image strength there so its not using them. Gotta add that little node in between
nice!
such a nice series as well
indeed. thanks so much to the team. this is like the ultimate wildcard system. i never have to worry about silly prompting again! hehe
lol
We can add it to the bot now 😄
Before, people would've been very confused about the new "ControlNet"
sorry, i'm gonna do a few more in excitment of how nice it comes out 🙂
blantly borrowed from MJ's community page (sorry OP)
Pretty cool eh?
can't read hebrew though...
thanks for this! 😄
@tribal gale we need to fix this
your welcome! however i want to try and find a way to implement a prompt for additive details instead of having to prompt for the whole thing.
changing the aspect ratio is none issue as well
Is it because I have a perverted mind??
i wish
supportive prompting
yeah thats a good name for it lol.
those neat workflows make me feel so bad for my spaggethi
only @high skiff &and @visual glade quickly mashed together and straight lines turned on
yeah, makes sense. since both images (and both unclip nodes) are connected in different positions in the pipeline I'm trying to figure out what the differences are
so i tried to implement supportive prompting. annnnd no. but there is red hair
beautiful 😄
im waiting to see you all mix revision and controlnet stuff too 😄
really fun to give a base with say depth and toss image concepts to mold into it haha
soo it worked???? but i can tell mid generation that its realllly struggling
straight lines? is that a plugin?
click the little gear on the menu and change the line render mode, should be in the base comfy 🙂
I'm considering subscribing to Collab Premium to generate a few images per day. The free plan is always disconnecting, which seems to be a common problem with free accounts. I'd like to know your opinion if this seems like a good idea. My goal is to just generate images, nothing too heavy like training models or something like that. I'd like to know if XL works well on Collab Premium?
my life kind of changed right now...
can only use canny right now, right?
straight lines = satisfying, but when they overlap your like. wait where is this line going? lol
depth is pretty great too. Just need to grab one of the pre-processors out there for it
Alright. Does anyone have a workflow they want to share yet?
SDXL remix possible?
for what?
The newest set of plugins
I'll drag that and see how much I understand it.
i was trying to find one 🙂
@upbeat summit Klinter did already but do you think that you could also share yours?
so i definitely got it to work guys. supportive prompting. no text = zerod out condition and works like normal. and with text adds to the image, prompt may need to be a bit specific sometimes
for clipvision I'm using the one comfy posted earlier #✨|sdxl message
Also should I download windows_portable and uninstall the old Comfy Windows version? I still have the one that is just labeled Comfy for Windows.
Alright rad
did you try to inject lora there?
havent tried lora yet. i guess i could.
I definitely recommend cranking up the strength on the unclip node if you are mixing revision and text. I found ~1.75 strength to be good for most cases but didnt test for too long. Text still has more inherent strength over revision in most cases
is it normal for the VAE step to take a very long time
the main thing i was trying to solve was that without the zeroout, not having text produces basically nothing. i wanted to be able to do both with no problems. so text adds details, without text is basically zeroout so it works either way. if that makes sense
there is definitely a difference between no text and 0'ed text. 0'ed will get you even cleaner images when purely image prompting as it totally 0s the embedding where as no text still produces a non-zero embeddings. Still there is a lot to try with these things
agreed. its very exciting. ill definitely be doing lots of tests
clip vision!!
damn i gotta try that now
@empty orbit just grab any of these images
with Aether?
base sdxl
nice
no positive but some fidelity tweaking with this negative
photoshop, video game, ugly, tiling, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, mutation, mutated, extra limbs, extra legs, extra arms, disfigured, deformed, cross-eye, body out of frame, blurry, bad art, bad anatomy, 3d render, photograph
@sour obsidian can i assume that clip recognize instagram icone and prdocuve pretty girls because of it?
(also, yea, i've added a 3rd one)
hahah its totally possible
I'm offended, I gave it images of London and it gives me the eiffel tower 😢
Still don't fully understand how this is working, but it's interesting
latent space style transfer image mixer interpreter
put that on the poster
That makes me no closer to understanding lmao
on a bumper sticker
id rock it on my car
Ooooo
had to be chirped
is there a fix for when dropping workflows and/or images onto the comfyui webpage doesn't update?
not everyone is sharing their info when they are uploading the image
but here is one from me 🙂
yeah, this is the workflow @hard fractal dropped earlier
load nor drag are updating
definitely need subdirectory support in Load Image nodes and ComfyUI/Input/ - like the Load Checkpoint node
these arent' droppable i assume since just pure json?
worst case just download and hit the load button. I moved the revision stuff under its own folder since its a tad sep along with the G vis model so its all in one spot https://huggingface.co/stabilityai/control-lora/tree/main/revision
with prompt?
sweet
so differnt toy, did anyone tried to do a fancy QR with SDXL yet?
Does anyone publish a "what's new today/this week in stable diffusion" newsletter? The speed of releases is 😵💫
only with SD 1.5 - very cool. starting to use it for clients
yea, me too, but with the sdxl it looks so much better
i tried with canny but no good yet
yeah - have not used controlnet yet with SDXL 😬 I'm also a total noob at it. but will learn it now
well, it kind of came out today 😛
You guys are killing it with the revision images! Nice stuff 🤩
hey @runic granite 😄
Hi, I need a little help with the new stuff please! I'm trying to use the control lora workflows from here: https://huggingface.co/stabilityai/control-lora/tree/main but i seem to be missing these nodes: ScribblePreprocessor, CLIPVisionAsPooled, ColorCorrect,MiDaS-DepthMapPreprocessor,CannyEdgePreprocessor
Could anyone point me in the right direction please (I already installed the nodes from https://github.com/Stability-AI/stability-ComfyUI-nodes)
yeah - I meant ControlNet in general. But the time is now 😄 and also training. You know - always focused on prompt engineering but it's time to do some fine-tuning myself 😉
@sonic vortex grab a glass and relax (the workflow is included)
you're going to be awsome in that as well ah?
you can't leave something for us mortals?
Thank you 😄 and you are being very awesome yourself, Klinter!
hey, i've learnt everything i know about prompting from you.
2.1 was a great school for me
Thank you so much! Now to try and work it all out 🤣
lol
but today's MVP is via with the zero prompt
that's awesome!
Just changing the weight of each image can change the results a lot
That's with the space like image having higher weight
I'm doing nothing else 😄
Why does it like the Eiffel Tower so much
strong in the latent space
what's this error mean for clipvision?
https://civitai.com/models/130726 i uploaded a revision workflow to civit lmao. i know that seems silly but it has additive prompting and i know a lot of people check there
update comfyui
update after
Bruce Leroy, the last dragon, has the glow and joins the Avengers
gotcha
it was an awesome time 🙂
any update on preprocessors missing for the official comfy controlnet workflows?
I think I can see where it's getting this from. Space Ship, Ship, Building, oh lighthouse.
lmao Long boy
@autumn forum got AIT working with it yet?
lmao i just tried it and it gave me an error hah. but no not yet.
Error occurred when executing KSampler:
Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!
hmm okay
it still is! 🙂
wow
huh, is this SDXL clipvision?
Yeah
@eternal fog which model?
link? I didn't know there is an SDXL clipvision thingy
2nd one has a workflow with it, so probably get it from there
comfy's workflow for ClipVision: #✨|sdxl message
lmao
oh wow
I put a negative weight on the woman
Less woman more Alien I guess
she's an alien
these are so awsome!
nice - I think you got it 😄
It's not what I was expecting, but it's cool
This is better for me because I'm useless at thinking about what I want to prompt
Hey guys, I have been using the Searge 3.4 workflow and I keep getting the following error. I'm not sure how to fix it. Any help would be greatly appreciated 🙂
almost...
does the strength of the unclpping get normlized to 1?
(is that a smart question? it's very late here and my brain is not 100%)
Try to update comfy and then update the searge nodes, likely a version mismatch
If you guys want to get really fun and confused, combine two images and put another image in as a negative 😁
That seems like a good way to get cursed images
Trying that now
what is this DRLX announced in #📣|announcements ?
I'm not really that into software / coding so if anyone (who's obviously way smarter than me) wouldn't mind sending an explanation and their thoughts about it I would more than appreciate it!
What tools are currently working for training textual inversion embeddings for sd-xl?
I've mainly been using auto1111 sd-webui.
Which doesn't have the training working yet for sdxl
Gnight eveyrone. amazing new tool! thank you for this.
sleep well! ttyl 🙂
kinda fun, you can just toss in a simple sketch with no prompt and get better sketches of the same concept
In my quest to find a better color transfer method for I2I I've found 3 that I've managed to get working. Semi Discrete Optimal Transport, Linear Monge-Kantorovich, and one that uses a HistoGAN model. Working on cleaning the code up and popping it into a ComfyUI node.
(Woman + Power Armor) - Santa on the beach = Anime Girl... Sure! 😄
I missed the discussion; Do the control loras offer advantages beyond efficienct/compactness?
PSA: to get the official CLoRA workflows going you will need this node pack:
https://github.com/Fannovel16/comfyui_controlnet_aux
Canny works with the build in preprocessor, just swap it out and tune the values accordingly. We found low_threshold 0.100 and high_threshold 0.200 does a good estimation.
We will add a note on the HF Repo pages as well.
so clean! great characters
thanks! left it running while I was gone. got all sorts of interesting things
Hey which folder will these go into?
Just posted some info on how to get it to work above. 🙂
controlnet folder
thanks
the revison file goes here correct?
Using ProtoVision XL by @delicate kelp
sweet
Thank you that worked... 🙂 It took a while but it is working...
Now I have a new issue. I keep getting the following error:
Warning: Ran out of memory when regular VAE decoding, retrying with tiled VAE decoding.
Now I do already have --lowvram
Are there any other command line arguements I can use to get this to work better?
I just switched over to windows_portable. Do I have xformers automatically enabled?
what's a bit odd is the same setup also created these images, lol
ooh, put the one in twice
It says using xformers in your screenshot
That's what I thought. Thanks.
the latent space just wanted to have some fun!
indeed. I've always liked the unclip/clip vision stuff. really takes on a life of it's own
@upbeat summitCan you help me understand what works best when using revision?
revision?
Clip
clip vision?
The UI I think you said I should use.
yes?
I think?
I read somehere "revision"
it basically creates conditioning data from an image
hmmm... I don't think I mentioned that
Revision is what SAI are calling the Clip Vision G stuff on their hugging face
oh ok 😄
I mean how can I make it actually look the way it was meant to look? I'm trying to use the workflow Comfy sent had the tree as the reference image.
similar to what you'd get from a prompt encoder
You put 2 images in and they act as prompts.
You can change it up and use 1 image, or images and text
you use an image as input, attach the model to the clip vision
It's only just come out, so play with it and see what it does.
https://huggingface.co/stabilityai/control-lora/blob/main/revision/revision-basic_example.json
This is a more simple workflow that uses no prompting and just images.
and then patch it into conditioning data from your prompt encoder
you first need to get images... like your inputs. styles, people, artwork, film screengrabs. anything you want to play with and put those images files in your /ComfyUI/inputs/ folder
I mean it looks awful. Not at all the way it's meant to. I'll send in an example in a second.
@upbeat summit What checkpoint have you been using with it?
There's some more revision examples on this page: https://comfyanonymous.github.io/ComfyUI_examples/sdxl/
You don't need to do that. You can just drag and drop images into the Load Image nodes.
I used based earlier. I'm not using ProtoVision XL
true - I just like to have them all there to switch between them
That requires planning on what images you are going to use, I just find random stuff and throw them in.
I mean I just got this from thjese.
What are you expecting to get
https://civitai.com/models/130726 use this workflow and it fixes the problem without prompting anything.
I don't know. I don't know how it works. From what I've seen though it should've have mixed the two styles together to make something new. It shouldn't be entirely heavy on one style or subject.
sorry i keep spamming my workflow heh. just want people to get good results out of the box
hey Via, I updated the clip vison model onto the main repo. Any chance you'd be willing to switch the link at the bottom https://huggingface.co/stabilityai/control-lora/blob/main/revision/clip_vision_g.safetensors
@upbeat summitOh Comfy called it revision.
It doesn't really mix, it takes what's in the images and creates a prompt from them, as far as I can tell.
You also have the added problem that you aren't using the workflow with the nodes to Zero out the conditioning, so even though your prompt is blank, there's still technically something there. And that has much higher strength that the images.
yes no problem will do that right now
you can set it up like this
the unclip conditioning nodes require conditioning input
no prompt involved. All just image concepts 🙂
I need me some SDXXL 2048 models 
I guess does it actually use "prompts" in the way we'd think about it, or does it just mess around with latent space.
I wonder if it could be quantified into actual tokens
I get what you mean, like what prompts is it putting into the model to get the results, but I don't think it's using text tokens.
Igga thinks I know the first thing about setting up nodes myself. No offense to you but I'd rather have a workflow that someone else makes for me (or they share it with the rest of the communtiy).
I am lacking the mental strength to learn that currently. The last time I tried a node set up myself I sweat so bad and got a headache.
oh i thought that's what clip did was that AI would attempt to describe what's in the images, is that wrong?
its using a shared latent space where clip compares images and text (pooled embedding space) so its not a prompt but it acts more conceptual like one
so its not simliar to a1111 clip interrogate?
its very different, this is directly using concepts from the image, no tokens involved
I guess it could be used in a similar way but this is direct
I have my workflow attached here, but I'm using a couple custom nodes I made, so you would either have to swap those out or download them
Plus a lora right?
Have an issue getting the Controlnet nodes to work. I run the install.bat for dependencies and everything is satisfied but have this error appear repeatedly
DEPRECATION: torchsde 0.2.5 has a non-standard dependency specifier numpy>=1.19.*; python_version >= "3.7".
@wet nacelle Use the workflow that me or Via posted. That will be the easiest way to start without using any promptng.
Of course the mutated stray ears are perfect 
The one I posted is SAI's example, Via has done their own.
you don't really need a lora, but I'm using one currently
updated torch, updated comfyui, updated controlnet nodes. but no dice
damn, name module pack please
The thing about that is I fear that I may need to rewire the nodes if the Lora is active.
I just made it today.
. you can download it here if you want https://github.com/picturesonpictures/comfy_PoP
Neat, thanks! :D
no problem. it's something I'd wanted for a while, so had to teach myself a bit of python to make it, lol
there's the lora stacks, but none of them have that input output that I'm aware of
along with the on/off switches
Aye, i sadly lack skills on where to even start to learn python, so i'm at dead start for eternity as i don't know any way i can effectively learn it 
Holy shit...I just hit a baller combination.
That's what i've been waiting for for weeks. A simple switch to turn off an entire section, for instance img to img, upscalers etc
Ugh, I'm running into this exact issue right now
Could I be doing something wrong?
not sure what the problem is. still ttroubleshooting here
im not sure. lol idk how that could happen
I'm updating a few things now but yeah, not sure what's happening
Noice, found a somewhat highest native res i can make without too much artifacts, now to just shout at negative prompts to not screw up tails or ears 
same. it was a bit of a hassle to put together, but future nodes should be easier for me
unless it doesnt like the alpha channel on that image? idk
Should I try to find a way to update Comfy?
have you not? lol if so you definitely need to lol. this got updated today
Would I do it from the update folder?
do you have a update_comfyui.bat file?
Aye. If you get it all running nicely, i have a node request, and that's a model and lora combi module :P Were base, refiner, as well as lora's underneath. And if possible, a plus button to add as many extra loras as one wants with lora priority for each 
I just clicked it. It showed the latest update date which was today.
okay cool then it should be updated
hmm. I'll see if I can figure it out. the plus button part might be tricky. but could add like 5 lora spots
Can you link where you got your clip file?
I'm sure adding loras like that is possible. but not sure how I'd do it
its on my civit page where you got the workflow
Indeed 
if successfull, do like 15 or 20 lora's, i made a image once that had 19 lora's
I'll make a deluxe lora stack, lol
Here's my 19 lora generation from sd 1.5 using just armor loras 
You've no idea how many generations that went for that one clean ish gen xD
well I could probably do a larger lora node later tonight. but your other request would take a bit more time. but I can do that too. good practice for me
If you go crosseyed, tail will fit /s

I'll be back here in a few hours. if you're still around and I get that figured out I'll send you a link
Ping me when done, it's 4 am, so soon heading to bed 
Also, how the heck did i get gamer role lol. It's not in roles
this would be sick if the pignose grill wasnt f'd
@visual gladeDo you think you can help me set up Clip Vision? I'm on the latest version of ComfyUI and have all the needed files to get it working. The issue is that each time I use it in a workflow it seems to only understand the prompt and not the image nodes.
I've also just uninstalled the UI and reinstalled it to see if that was the issue.
I've tried several workflows now.
