#✨|sdxl
😉
k nevermind i think this is actually something here
👍🏻
shit i didn't realize it wasn't saving but yeah anyway, batch of 4, three had LILLY, all had VILLAGE, one had LIY VILLAGE, and then there's this one
chose that cuz there's a lot of opportunity for it to f it up
with counting letter parts that iterate
k holy shit
a billboard that says "IlEMWUNHS3"
first try
lost the U and that's it
needs shark teeth lol
wow that's neat
I mean...all the elements are there, but wild combination
@copper kraken
yeah this is pretty great for getting letters down
sometimes you need a few generations or to drop down to exponential... font is something to consider to stop merging but yeah
looking good
definitely
hah it'll put your eye out kid
fk'n great
new ipadapter is god
don't even get me started on the style transfer weights
input, input, output
you should be able to just drag one of my workflows in
here's a style one
composition and style, combining two images for each
yes, thank you, then I first have to copy them over into the appropriate folder (?)
just save the images, then drag them into comfyui
I would still need the models in Comfy is what I meant
just go ahead and drag those workflows in, if you don't have ipadapter the nodes will be red
i'll help you get the models etc
Alright ty 🙂
np
I opened it in Comfy and as expected it shows a few reds 🙂
So which models would I have to get from that long list at github?
alright, do you have comfyui manager? got a manager button?
yep
k do the whole install missing custom nodes thing
Installing 🙂
https://huggingface.co/h94/IP-Adapter/resolve/main/models/image_encoder/model.safetensors and this, rename it "CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors" and put in /ComfyUI/models/clip_vision
the other goes in /ComfyUI/models/ipadapter
https://huggingface.co/h94/IP-Adapter/resolve/main/sdxl_models/ip-adapter_sdxl_vit-h.safetensors i'd get this second, it can be handy when it's overwhelming the image in some way and dialing down the weight isn't fixing it
works differently, it's more subtle and doesn't hammer the composition as much
the rest... kinda just depends if you want to do all the other stuff with sd1.5 and faces etc
I would like to mainly use SDXL
sounds good, then get those three things and you're set
really only need the first two
cool
Where do I put this one (almost finished loading)?
/ComfyUI/models/ipadapter
alright, ty
once you have that you should be able to run the workflows i dropped in those images
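For anyone who would rather script the file placement than do it by hand, here is a minimal sketch. Only the two URLs, the rename, and the two target folders come from the messages above; the function names and the `comfy_root` argument are hypothetical, and any model whose link isn't in this log is left out.

```python
from pathlib import Path
from urllib.request import urlretrieve

# (url, subfolder under ComfyUI/models, filename to save as), taken from the chat above
IPADAPTER_FILES = [
    (
        "https://huggingface.co/h94/IP-Adapter/resolve/main/models/image_encoder/model.safetensors",
        "clip_vision",
        "CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors",  # renamed from model.safetensors
    ),
    (
        "https://huggingface.co/h94/IP-Adapter/resolve/main/sdxl_models/ip-adapter_sdxl_vit-h.safetensors",
        "ipadapter",
        "ip-adapter_sdxl_vit-h.safetensors",
    ),
]


def plan_downloads(comfy_root):
    """Return (url, destination path) pairs without touching the network."""
    models = Path(comfy_root) / "models"
    return [(url, models / folder / name) for url, folder, name in IPADAPTER_FILES]


def download_missing(comfy_root):
    """Download each file that is not already in place (these are large files)."""
    for url, dest in plan_downloads(comfy_root):
        if dest.exists():
            continue
        dest.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(url, dest)
```

Calling `plan_downloads("ComfyUI")` first shows where everything would land; `download_missing("ComfyUI")` then fetches whatever is absent. ComfyUI may still need a restart or a refresh before it sees the new files.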
np
btw for anyone into anything alike 🙂 https://youtube.com/watch?v=bKmQrrIjGYo
Support my Work / Download`s / https://www.patreon.com/Dub_Element
Picture: Ramy - https://unsplash.com/de/@ramyig
00:00 Warmth & Faidel - Zenit
06:20 The Venusian - Human Decadence (Moodeep Remix)
12:23 Mown - D111
19:40 Markus Masuhr - Junosuando
25:25 Space Scavenger - Observatory
32:38 Paolo Lucchi - Episode 4
38:52 HiWstre, Dubliss - Ruby...
loading the CLIP model now
15 mins 😄
nah, it´s fine 🙂
good time to grab some snacks i guess
it's pretty hard to get up and deal with bodily functions etc once you discover this
I still can generate 🙂
😄
you laugh now. you'll be using a water bottle to go in three hours lol
😄
if any of you notice anything really interesting in particular with the weights, how to combine images with the composition weights above all else... def share
and uhhhh
embeds scaling - i haven't been systematic but i'm not aware of any reason to use V-only vs any other option... i've just been using the one at the end of the list with C penalty etc
@copper kraken even though it shows this now in the manager, the IPAdapter nodes are still red (after restarting)
i remember i had some weird issue with that too
i forget what the exact solution was but i feel like i needed to restart comfyui twice for some dumb reason or something like that...
first thing to check though is if any of those new nodes show up in your node list
in the ipadapter folder
the models are in place already 🙂
yeah there was something weird before
try reloading the image too
the workflow that is
yes did it twice (after it didn´t work with the first one)
possibly completely rebooting could help?
Or look at the console for why those nodes may not have loaded.
yeah copy paste everything into notepad and search for ipadapter
i wish i could remember what the issue was but i remember having this issue for sure when i updated
what i do remember is it either fixed itself invisibly or it was easy once i spotted the problem
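The "paste the console into notepad and search" step can also be done with a few lines of Python. This is only a sketch: the `find_lines` helper and the sample console text are made up for illustration, not real ComfyUI output.

```python
def find_lines(log_text, keyword="ipadapter"):
    """Return (line number, line) pairs whose text mentions the keyword, ignoring case."""
    needle = keyword.lower()
    return [
        (number, line)
        for number, line in enumerate(log_text.splitlines(), start=1)
        if needle in line.lower()
    ]


# Hypothetical console output, pasted in as a string for illustration
sample = """Starting server
Cannot import custom_nodes/ComfyUI_IPAdapter_plus module for custom nodes
To see the GUI go to: http://127.0.0.1:8188"""

for number, line in find_lines(sample):
    print(number, line)
```

Searching for `import` or `pytorch` instead of `ipadapter` works the same way by passing a different `keyword`.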
do you see these?
how do i fix this img2img? i am using aihorde so sdxl (what generated this) doesnt have img2img, so i have to use sd1.5
No, and I got this showing at the console @native knot
Couldn´t load the custom nodes for whatsoever reason
ImportError: cannot import name 'clip_preprocess' from 'comfy.clip_vision' (F:\AI-Art\comfyUI\ComfyUI_windows_portable\ComfyUI\comfy\clip_vision.py)
Cannot import F:\AI-Art\comfyUI\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI_IPAdapter_plus module for custom nodes: cannot import name 'clip_preprocess' from 'comfy.clip_vision' (F:\AI-Art\comfyUI\ComfyUI_windows_portable\ComfyUI\comfy\clip_vision.py)
AH! i remember now
clip vision
it was my version of pytorch i think
Do you know which one is needed?
how can I check? Been a while? 😄
search the output for pytorch
the console output?
yeah
Doesn´t show any 😄
k one sec lemme reboot comfy
managed to scroll all that stuff off the top here by now lol
delicious
When was the last time you updated ComfyUI itself?
I think your version is too old for the new IPA nodes.
That was in your screenshot.
End of October (already pressed "Update all") 😄
oh holy shit
Yeah...WAY too old, bruh
😄
that's like realizing you haven't updated since the carter administration 😄
in AI years
shit, a month is a decade in AI years
I thought it might update automatically on start 🙂
Nope
nah, gotta hit the button
You have to do it.
i check the github pretty much daily to see if there's updates that i'm interested in
Comfy is kind of an "update early/update often" thing. lol
is there a yaml file I could edit for automatic update checking/installation like I could with other UI?
idk
Current version:
how big is an update?
😄
Most of the time they're tiny updates...but the new IPAdapter+ stuff was a bigger change. It changed the fundamental structure and required a ComfyUI update in order to run the new nodes.
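A quick way to tell whether a ComfyUI install is new enough is to check for the exact name the error complained about. The sketch below assumes it is run from inside the ComfyUI folder so that `comfy` is importable; `has_symbol` is a hypothetical helper, and the only names taken from the log are `comfy.clip_vision` and `clip_preprocess`.

```python
import importlib


def has_symbol(module_name, attr):
    """True if the module imports cleanly and exposes the given attribute."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return False
    return hasattr(module, attr)


# Names taken from the ImportError quoted earlier in the chat
if has_symbol("comfy.clip_vision", "clip_preprocess"):
    print("ComfyUI looks new enough for the IPAdapter nodes")
else:
    print("clip_preprocess not found: update ComfyUI first")
```

If the second message prints, updating ComfyUI itself and then "Update all" in the Manager is the fix that worked here.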
I basically installed comfy end of October, did a few things, and simply dug it out again a few days ago
ipadapter plus will ensure that you open it daily for a looong time 😄
😄
I´m currently more like in a softer AI phase, still generating basically daily, yet not as intense like during the 18 months before 😄
ha nice
i had a few lulls, then comfyui clicked
80k+ images and 3 months later here i am
me neither, something like 350k?
yeah eventually i'll be doing that
i've looked back at what i was making a couple months ago and i'm like dayum, this is trash
I was barely going back to it for anything.
though only the ones saved, generated quite a bit more (via Wombo´s Dream) at the time
there are some things i really want to salvage though
and digging through all that shit would take days
it's cheaper in terms of time to just buy another bigass HDD and put a full one on the shelf lol
wait for AI-image-recognition-search-tool, like where you can prompt your search 🙂
yes that would be fabulous
mostly i want to salvage styles
made lots of weird stuff like this with sd15
where i just love the style
and haven't ever been able to replicate it except with that one specific workflow and the silicon sd1.5 checkpoint
well IP Adapter could help in that regard, no?
the style ipadapter is bringing these back to life in a new way
I keep the stuff as well, alone for "nostalgy" 😄
ah yes, it´s nostalgia in English 😄
yeah, well, the steam library is basically enough here and I rarely to never replay games I played through
man you don't want to see my garage
i've already accepted i'll never be able to park in there again for the rest of my life
think my workflows get chaotic? it's like 3D abstract art man
i have three cats that all want to get in there so i can't take the time to get in and out carefully
Would you have a CBS Coleco Vision in there?
the process of storing stuff now literally consists of look both ways, rip the door open and heave and slam it fast as i can as all three try to dive through the gap
I was never terribly into the Colecovision or the Intellivision, though I did play them.
Liked it a lot at the times 😄
I was more interested in my Commodore64 and Odyssey2 at the time those things were around.
yes, yet those were slightly later (only had C64 + Amiga)
The C64 and Colecovision both launched in the same month.
Odyssey2 came out about a year before the Intellivision.
But they were close enough.
got a 64 in 84 while a coleco in 82 and Amiga in 87
The Odyssey2 was my first system, but the first one I really loved was the C64 followed soon by the NES. I think the NES is probably my favorite right now, but it sometimes flips to the SNES.
Amiga in 89 I feel
played zelda and secrets of mana on SNES and a bit of Donkey Kong Country
I ❤️ my copies of Zelda.
I have a boxed copy and a 2nd cart-only copy for playing for both the original Zelda and for Link's Awakening. TLOZ:LA is the best Zelda, don't @ me.
so is it enough to update comfy only or would I have to click update all?
it's usually a good idea to update everything too
You should update everything.
Gets the other add-ins up to date, too.
yes, ty 🤭
@copper kraken @native knot first IPA+ image generated 🙂
these are VQGAN images that work very well for input:
all I did was loading new images
nice
awesome
yeah we def need to get a lil shared collection of good style images started
i've been trying to set aside some good ones
@native knot These work particularly well because they are similar to a diffused image, they are generated with the VQGAN AI (I use it via Wombo yet would like to use it locally as well, haven´t checked whether it´s that easily possible)
Interesting
@native knot
If you are referring to me I simply loaded the images in, just started dabbling with the weights
loaded them into ipadapter standard?
btw why do you send the upscaled image again to a sampler before saving?
cuz i'm resampling it to add detail
I simply loaded the images into the four "slots"
the straight up upscale basically just expands the pixels... bigger image, but no more detail, just like hitting ctrl-+ on your browser
so with the sampler it gets AI-processed once again?
yup
have to be pretty careful with the settings for that second stage
i usually use exponential as the scheduler and cap the denoise at 0.5
otherwise it changes too much, which would be great, except usually it looks stupid cuz it's not at sdxl's trained resolution anymore
i find tiled resampling is better, but this is a lot faster
if you're wanting to add NEW details, def want to use tiled
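The point about a plain upscale adding pixels but no detail can be seen in a toy example. This sketch uses nearest-neighbor scaling on a tiny grid of numbers as a stand-in; real upscale nodes interpolate more smoothly, and `upscale_nearest` is a hypothetical helper, not a ComfyUI function.

```python
def upscale_nearest(pixels, factor):
    """Enlarge a 2D grid of pixel values by repeating each pixel factor x factor times."""
    out = []
    for row in pixels:
        wide = [value for value in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


small = [[0, 255],
         [128, 64]]
big = upscale_nearest(small, 2)

# Four times as many pixels, but the same set of values as before
print(len(big), len(big[0]))
print(sorted({value for row in big for value in row}))
```

The second sampler pass is what actually invents new detail on top of the enlarged image, which is why its denoise setting matters so much.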
In Easy Diffusion I usually simply upscale via prompt strength 0 and steps 1, that way no alterations occur (currently testing with your setup in Comfy)
there's shit i can do to make it change even less
here i'm looking to add subtle detail
yes, for those cases it´s different
that´s already pretty different though 😄
ah, I see 😄
yes, improved
@copper kraken so well, thank you very much for your help @native knot
@copper kraken so how would I automatically adjust the image size to let´s say the first of the IPA-images without having to do it manually? When using inputimages it naturally chooses the given dimensions, yet how can I route the image dimensions to the actual image generation when using IPA?
You mean read in the dimensions?
yes
I already changed the empty latent image´s width and height to input, yet don´t know how to get the value output of the loaded image
@copper kraken and have you already checked on the VQGAN images along IPA?
the node you need is in this image
it's in impactpack but i stuck it in here so you can just do install missing custom if you don't have it 🙂
going to in a sec here 🙂
Looking forward
so what's the idea with the vqgan images and IPadapter - that they're similar to something that's partially denoised? curious about the theory behind it
just kinda muddy and weird with weights = linear, but really interesting with some of the other ones
reverse in/out
that's with the latent image being automatically set to the size of the image
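Outside ComfyUI, the same idea (read the input image's size and use it for the latent) can be sketched in plain Python. This is not the Impact Pack node, just a standalone illustration that reads the width and height from a PNG header; `png_size`, `snap`, and `latent_size_for` are hypothetical names, and rounding down to a multiple of 8 is a choice made here because the SD VAE works at 1/8 of the pixel resolution.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def png_size(data):
    """Read (width, height) from the IHDR chunk at the start of a PNG file."""
    if data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    return struct.unpack(">II", data[16:24])


def snap(value, multiple=8):
    """Round down to the nearest multiple, never below one multiple."""
    return max(multiple, (value // multiple) * multiple)


def latent_size_for(path):
    """Return a (width, height) suitable for an empty latent, based on a PNG on disk."""
    with open(path, "rb") as handle:
        width, height = png_size(handle.read(24))
    return snap(width), snap(height)
```

Inside a workflow the node approach is simpler, since it feeds the width and height straight into the Empty Latent Image inputs.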
Well, they aren´t actually partially denoised but a fully generated image (that´s what the VQGAN-images look like), just saying, even though those are fully rendered they look and sort of behave similar to diffused images
ahh gotcha
yeah idk anything about vqgan
got into this stuff in the first week or two of december
me neither except for using it via Wombo
gotcha
it´s the model they used prior to DiscoDiffusion (which in turn they used prior to SD)
Looks like it´s doable locally as well (like I remembered it was)
In this video we are implementing the famous Vector Quantized Generative Adversarial Networks (VQGAN) paper using PyTorch. VQGAN is a generative model for image modeling. It was introduced in Taming Transformers for High-Resolution Image Synthesis. The concept is build upon two stages. The first stage learns in an autoencoder-like fashion by enc...
Interesting 🙂
@copper kraken This is what I get when installing missing nodes. It speaks of renaming the node yet I wouldn´t know where exactly
i think that probably means within the workflow itself
but i'm not sure
i'll be honest, i've ignored all of those messages
every time
probably not very wise, but yeah
lol
From emad on sd3: Should be sometime this month per CTO comment, expect API shortly
Model went to all the chip makers for optimisation too and the distilled versions all done
good. praying that all goes through
some really cool results with timestepping the ipadapter
@copper kraken so I got everything working except for these:
Jeez, reading the Reddit thread, various things are not so great. 77 token effective limit still in place. Context window is 512 tokens, but the model is only trained on images that had a 77 token limit. Accusations that beta testers were booted who called out shortcomings. Oh well, better than what we have.
those aren't essential for this workflow
and it says this :
you really want to get the one on the left eventually though... unsampling is extremely useful
ah, so I can simply ignore them?
wow
yeah, for this workflow
where are they getting that info? was there an announcement, or is this a leak from beta testers?
that certainly would explain the very vanilla tests
i guess the best case scenario is the main issues come from incomplete training as a result of their financial problems
I'm scared to post it here lest something bad happen to my beta access. Saying they have access to beta testers and their experience is that it's barely better than sdxl
wait you have access to sd3 now?
People on Reddit
O I don't want to copy paste what they're saying.
I'll get the link.
k
here's what i really want... something more trainable, better controlnet/ipadapter support, some upgrades for comfyui, etc...
So scroll down in the comments. A couple people who claim to know things expand on what they've seen
text would really be nice and actions would be really nice but if we can train that or controlnet it then we can work around it
I really wanted more than 77 tokens. My local llms have such a hard time staying within that limit
yeah it sure would be nice
guess we need to get serious about thinking about ways to prompt with images more effectively
Well, my hardware will get better, so the llms will get better. 🙂
true that
but yeah, i really think that's the secret in the end
generate components with just enough information on what the other components will look like style-wise that they can be integrated
then bring everything together somehow
what you said about regional prompting with one subject per region was pretty interesting on that note
the other thing i wonder about is this
" Even today some claim that SDXL is no better than SD1.5, so I am not surprised that some people say that about SD3 vs SDXL. People look for different things in a model, and I would not be suprised that a fine-tuned SDXL model can produce better looking images today compared to SD3. "
and seeing some flashes of genuine greatness from cascade
the biggest thing that stood out with cascade tbh was what it does with the texture of water during rough seas
yeah, the details are shitty and fucked up, but they are also at the same time fundamentally much better
the structure of the lighting is so much better
" They should have never announced and let us enjoy Cascade " <-- very true
people would be way more invested in getting whatever they could out of it if they hadn't followed up a week or two later with a way too early SD3 announcement
So I checked the file and there isn´t any specific node from what I can tell, instead it uses the Empty Latent Image Node as well (?)
hmm well it's in the ImpactPack
and when generating it ignores the size of the input image but uses the one from mentioned node
it's called "image info"
cannot find it 😄
Have you actually used it in that workflow?
yeah, it works
i've used it before
when going through the categories of Impact I couldn´t find any image info in the shown nodes (quite a few). It´s installed though
when going via gitclone it says the directory Impact Pack already exists. Gonna check on it eventually, currently I just don´t feel like deleting the folder and reinstalling after all the shizzle already, prefer actually testing IPA for now 😄
Like said, it appears within the node list (Impact Pack) yet within its sub-categories I cannot find an image info node and also not via the search
Love all those
thank you 🙂 Thanks to your help 🙂
I use sdxl img2img and I use these keywords at the start of my prompt: poor quality photo, photo taken by an amateur
For my pfp is sdxl turbo i2i
Hi Everyone,
Hope you all are doing good.
Is there any way to merge two models trained on different concepts. Like in one model we trained with cats and other we trained with dogs. Need to merge both these models keeping both dogs and cats without affecting output of each based on prompts we provide.
My unprofessional opinion on that is, since cat and dog are 2 different tokens/concepts, you should be able to just merge the models together and keep both. As I said, unprofessional. You can just merge them on the fly with comfyui and check for yourself.
Thanks for suggestion. I'm using automatic1111. Is there any option on automatic1111 to do so?
To be honest, I have no idea. I've lost track of A1111 since I use comfyui.
Oh okay. No problem
Thanks
On the fly, no
That's only available in comfyui
You have to dump it out to disk and reload it
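For a rough picture of what a merge does, here is a minimal sketch of a weighted-sum merge, the simplest of the common merge modes. It uses small lists of floats as stand-ins for real checkpoint tensors; `merge_weighted` and the toy `cats` and `dogs` dictionaries are hypothetical, and this is not the code A1111 or ComfyUI actually run.

```python
def merge_weighted(model_a, model_b, alpha=0.5):
    """Blend two checkpoints key by key: (1 - alpha) * A + alpha * B."""
    if model_a.keys() != model_b.keys():
        raise ValueError("checkpoints must share the same architecture (same keys)")
    return {
        key: [(1 - alpha) * a + alpha * b for a, b in zip(model_a[key], model_b[key])]
        for key in model_a
    }


# Toy stand-ins: real checkpoints map layer names to large tensors.
cats = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
dogs = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}
merged = merge_weighted(cats, dogs, alpha=0.5)
print(merged)
```

Because every weight ends up somewhere between the two models, a blend may dilute what each one learned, so whether both cats and dogs survive is something to test with prompts rather than assume.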
Also, use forge, it's much faster and better with vram than a1111
Thanks for your suggestion. will try it
Made a huge diff for me on my 3080 12gb. I couldn't do batch sizes over 3 with SDXL without oom. Then for whatever reason it was down to one
And sometimes it went oom anyway
Forge I could do eight and it was twice as fast
Sure, will try it
wow this looks cool!
So where is it and is it that good?
I did apply to test it but I guess I'm not artist enough for it.
Or i gave a wrong discord name idk
whatever
It better not make inappropriate number of fingers.
Nowhere yet. It'll certainly be mentioned here if it ever comes out in a month
Hello everyone, I need to do a small task of merging two images. I have to merge the image of a baby with an image of their grandparents. Is this possible? How do I do it? Thank you in advance, best regards.
😄
Is it safe to make the statement that SDXL is "too" real?
I haven't done a lot with SD, but one of the things I work on from time to time is an angel in a nice landscape setting. I have some solid prompt and setting variations that are very workable for me. The biggest trouble was getting the wingspan in the frame, plus some longshot distance around the subject.
I read that XL solves the framing distance / cropping problem. My "angel" is not meant to be completely realistic from the beginning, and is to be refined once some other things are corrected.
So the framing issue is better, but I cannot get any creativity out of the angels dress or the landscape surrounding. It looks like pictures of Xena princess Warrior during a filming break in a boring back lot, worn and tired. All the charm and fantasy is gone. Even the angel wings are like neglected and withering houseplants. In 1.5 I was getting less perfect skin and facial features and more interesting armor and robes, capes, boots with all sorts of grandiose design, clothings and accessories, gauntlets, etc. Now it's just blah.
Pffff too real.... I wish, we're not there yet!
Pretty good tho, at first glance.
Avoid the eyes...
And fingers.
if you want epic fantasy weird whatever awesome results try the midjourney_xl lora
or the sd_xl_dpo lora
=0
together they work wonders
@glass forge I'll check those out; thank you!
😏
Can you link that midjourney Lora you like? I know of one called midjourney mimic on civitai.
thats the one
0.5-0.8 strength is best in my experience
Ok cool thanks. Yeah it really makes stuff look great. I was playing around with merging it with my favorite model and I found that it brought down prompt adherence for the sake of centering single subjects quite a bit. Something a lot of Lora's suffer from I think.
yeah things do get smudgy if you use a lot of loras with high settings
it's almost like cooking
you need to be careful with how much you add of each
like seasonings in food
For what I was doing, when you reach the strength that makes the image special, is also the point where it's not what you asked for anymore. I'll have to keep playing with it though.
dpo lora does bring adherence of prompt up a lot tho
but again experiment with strength
Cool I'll have a look
👍
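Why strength behaves like seasoning: a LoRA adds a low-rank correction on top of the base weights, scaled by the strength value. The sketch below shows that arithmetic on a tiny 2x2 matrix in plain Python. Real implementations also apply an alpha/rank scale factor, left out here for brevity, and `matmul` and `apply_lora` are hypothetical helper names.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def apply_lora(weight, up, down, strength):
    """Return weight + strength * (up @ down), the usual low-rank LoRA update."""
    delta = matmul(up, down)
    return [
        [w + strength * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


base = [[1.0, 0.0], [0.0, 1.0]]
up = [[1.0], [2.0]]      # 2x1, rank 1
down = [[0.5, 0.5]]      # 1x2
for strength in (0.5, 1.0):
    print(strength, apply_lora(base, up, down, strength))
```

Each extra LoRA adds its own scaled correction to the same weights, which is one plausible reason several of them at high strength pull the result far from what the base model would do.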
Thank you, those were basically some of the very first images generated with IPAdapter+ after @copper kraken helped a lot on installation and also workflow 🙂
1girl
generate a bee
Here is the image you requested.
how do i gen stuff?
Where can we generate pictures?
Any suggestions on how to get the Ren and Stimpy cartoon style?
I can't seem to get it 😄
```Ren and Stimpy, The Ren & Stimpy Show theme, beach background, hd, 4k, detailed.```
How generate image ?
What’s a great sdxl model for realism I wanna recreate irl images with just prompts as a challenge
hi
Galacticgears V10
Generate cartoon pictures. In a warm town, there lived a cute little cat. The kitten has big eyes and striped black and white hair. There was also a little blue bird, a little blue bird, full of feathers, and big eyes.
The picture shows a cat lounging in a cozy bed, outside the sun is shining, but the cat is indifferent, next to the alarm clock indicating the time to be late. His mother looked anxiously at the watch at the door of the room, her expression anxious. By contrasting the warmth of the interior with the vitality of the exterior, this picture vividly depicts the dragging habits of the kitten.
The kitten's face is beaming with contentment and pride as his mother cat and little bluebird cheer him on. The picture not only shows the success of the little cat, but also conveys the positive results brought by advance preparation. Return seed value
@native knot Do you know where I could find the model for clip?
All those are in the list of models to install under the Install Models button in ComfyUI Manager.
Which model is it?
There's several. Just grab all the ones for ipadapter plus.
Here's the picture you didn't request
@native knot I'm still confused. I downloaded these models from the manager and I'm still running into the error.
those aren't clip
This is why I'm lost. I have no idea what to download
/ComfyUI/models/clip_vision
CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors, download and rename
grab that one
it'll be called model.safetensors so you have to rename it appropriately
Thanks
np
might need to restart comfyui and/or hit the refresh button
make sure you hit update comfyui and update all as well
Sorry. What am I renaming it to?
CLIP-ViT-H-14-laion2B-s32B-b79K.safetensors
Ah I understand now. Thanks.
The clip models are available in the Manager, too though.
I had trouble finding it. I have stupid.
main reason i download things manually is just cuz my connection sucks and it locks up comfyui till it's done
but yes, def good advice for finding stuff if you're on a reasonable connection 🙂
is ipadapter working now?
I just gave up on it as soon as I started using it. I thought I was going to be able to transfer style from one image to the other but that was a different workflow I can't copy from.
you can totally use it to transfer style
drop whatever workflow you have that gives you something you like in here
i'll pop in the IPA for ya 🙂
thats a cool car, too 🙂
SVD
whats that and how does it work ?
Can't really explain, it's a txt or img to video model by stability ai, can create stunning animations, I'm using it through comfyui
can i just throw that thing you made there into my comfyui like i can throw in images ?
nice, animatediff?
so made a lora
is 160 steps enough for 25 images
i mean it works.
i try to train just like clothing items
whatever consistency is not there 100% even with the loras
everything works "at first glance"
but dont look too close
@copper kraken You made the original right? Here you go.
yup that was me
Hi everyone, I'm new here, I would like to install SDXL on my computer locally, I've been reviewing the minimum requirements and I think I meet them well, but I don't know where to start, any advice, where should I start?
type that same question into google and click on the link which is most recent and follow whatever steps it gives you
https://github.com/AUTOMATIC1111/stable-diffusion-webui Go to the Installation and Running section.
is my prompt just too long at 61 tks?
ipa style?
Nope
really cool regardless
Nice...that last one. ❤️
pretty much the statue of liberty right there
Here is the image you requested
Here is the image you requested.
Lightning and black jungle and red sky
Here is the image you requested.
Heaven.
Does anyone know how to load a "controllite" controlnet in Comfy? I can load normal controlnets and controlnet loras using the loader and loader(diff) connected to the apply node or apply advanced node, but neither of those works for "controllite" models.
These models work normally in A1111 with no special settings, so I have no clues about what makes them different.
https://huggingface.co/bdsqlsz/qinglong_controlnet-lllite/tree/main
Nevermind found it https://github.com/kohya-ss/ControlNet-LLLite-ComfyUI
street view of the front of an adirondack sytle coffee shop in summer
Do it yo self
Your image is now loading...
been a while since I have been on here, I forgot the command to generate 😄
Here are the images you requested.
Awesome, thanks Doc!
ah new inpaint method with object detection https://www.youtube.com/watch?v=X89IQop_0dM
Inpainting with the latest BrushNet models for AI generation using stable diffusion is lots of fun! Get great results quickly thanks to both random and segmentation mask models - all for free on your own computer! Installation is quick and easy, so why not give it a go today?
Want to support the channel?
https://www.patreon.com/NerdyRodent
Lin...
Do one with Winnie the pooh holding the us flag.
Simple line drawing of a mobile phone
I can finally run Deforum + SDXL thanks to Forge UI !
no idea who are the dev of Forge Ui but thanks for that gift !!!!
thank @visual glade for the backend 🙂
draw chinchilla with the text happy birthday
This is how we gonna make Murica great again.
s
Testing my app and it's looking so artworthy. 🙂
great work so far for real
having done a fair amount of testing on this, anyone with an android tablet is going to be real happy 🙂
@meager canopy a girl
A man, wearing a suit
Will push a commit today with a new app name and icon (and a bunch of fixes / improvements). I've been prepping material to explain the app better. Alpha version will be done today. 😊
i'm excited to see what it looks like!
Can I request 24 and a half girl?
Guys i have a question
what lora should i use to get this style of line especially?
something about these images is so damn beautiful
stuff like "strong sunlight" really help
This is actually really great 😂
He's a Yakuzard.
@meager canopy A man ran out of a piece of 4a paper
Does anyone know if @primal vault is still developing anything for SDXL? He hasn't updated his page in almost 5 months but I remember him being in this channel quite a bit. I hope I didn't tag the wrong person. Sry if I did.
Does anyone have a good ComfyUI workflow alternative to Searge? Or has everyone moved on to something different from ComfyUI? Its been a while since I've been on the channel.
Just make your own workflow that has the parts you need? It's not like his templates are some holy grail or anything. What do you specifically need for your workflow?
I need mostly the top portion of his workflow. Prompt areas, customizing image settings and, a place for multiple loras, upscaling and fixing.
I'm not very good at messing with all the noodles and boxes.
If you have a good workflow that you can share I would appreciate trying it out.
AI is a really fast moving field, it might help you out a ton more in the long run if you take some time now to learn piecing workflows together for yourself. There are tons of guides and videos about it all that range from super elementary to extremely advanced
Once you can read noodle, you can make spaghetti
lol... that is fair.
IDK maybe I'm just biased though since I spend a lot of time developing with ue4 and ue5 lol
I've been a pastafarian for a long time due to blueprints
I'll give it a try but I'm far from a developer. I am a medical receptionist. This is all greek to me.
Sadly, I believe Michael may have passed away. 😦
Not only did he have great work here, but he was the creator of MCP (Minecraft Coder Pack/Mod Coder Pack) and even ended up working for Mojang AB. The last time there was any activity across literally every account I know of for him was back on December 8th.
wow 😦
You'd be surprised what any average or greater intelligence person can learn, given some time and mild dedication. Just start out simple and small and work your way up. Learn the terminology and try to grasp what each step is doing and why you're doing it.
You'll screw up a ton, but that helps you learn
when github fork? 🙂
"The reports of my death are greatly exaggerated." - I'm just busy IRL and can't spend much time on SD/SDXL stuff right now.
A dog is running towards an old man in Memphis style with 4k image quality
Here is the image you requested.
Chinese girls wearing bikinis, big eyes, small face, on the beach
Here is the image you requested.
awesome, thanks
Idiot, what kind of flag is that?
I am a robot, and can only place the flag of China's true ruler beside the Great Wall of China.
Chinese opera characters
extremely minimalism portrait
geometric shapes
cartoonish lithographs
Here is the image you requested.
Posted alpha last night! https://github.com/QuintessentialForms/ParrotLUX
Amazing! Instruct demo before SD3. Wonder if Lightning merge would work.
Why are you just copying another post here? Stop being a fool and leave if you have nothing to contribute.
Ok, the new Edit model is kinda cool
I tried it, but don't think it's something I'd use. Couldn't figure out what the non edit model was for.
The non edit model is a riddle for me too
Bro, I can't say how relieved I am to see this post. So glad that the rumor can be busted from the source itself.
I definitely have use-cases for the edit model though
Cool! Was that just using the demo workflow?
Yes
And if you hook up a styler you don't even have to prompt
Use the styler output as prompt and it will just change the style of the image. Done
The architectural design of the Ark at Sea presents a magnificent ocean painting. On the vast sea surface,the ark stands tall and towering,blending with the shimmering sea water. The architectural design style is unique,combining modern simplicity and ocean elements,with smooth and powerful lines. The appearance of the ark adopts a deep blue color scheme,symbolizing the depth and vastness of the ocean. The details are even more exquisite and subtle,such as the sail shaped building structure,cleverly converting wind power into clean energy. At night,the ark is brightly lit,like a beacon on the sea,guiding the lost navigator. This picture not only showcases the harmonious coexistence between human wisdom and nature,but also outlines a grand blueprint for a future ocean city
Not sure what happened to mine 😦
Let me change my WF so it outputs png images again
I only added the styler
And changed the sampler
Yes, and changing the CFG to try and balance it better.
Getting there
They all look washed out
Better
Still weird artifacts going on
Yes, when I change the style, it looks bad again
This is with the default workflow, just adding the prompt a pencil line drawing of a law enforcement robot
I just fired up the default workflow to reproduce
The workflow is in my image. I haven't changed anything
Oh, ok
Nope...still sh!t. Something is very wrong here.
This is using your image workflow
You sure? Why is your image called "image.png"?
Because I just copied it out of the save window
Ok, that makes sense
Have you updated comfyui lately?
Every time it starts 😄
Can you send me the last image with metadata so i can reproduce it on my machine?
It is literally what you are using
oh
Please use #1092446741984444416 to advertise your project
I dropped your image into my UI and ran it
Then there is really something odd going on on your side.
Yeh, maybe I should try the portable version
Perhaps all of my gens would have been better 🤣
Can sdxl run on colab?
@noble shoal First run on windows portable version... 🙂
Now that's waaaaaaay better
hello, i watched this video to generate an sdxl image: https://www.youtube.com/watch?v=9k-yb83ZHfc&ab_channel=MattWolfe
why is my image a pure blue image?
vae: https://huggingface.co/stabilityai/sdxl-vae/tree/main
base: https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main
refiner: https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0/tree/main
my comfyui log is:
got prompt
model_type EPS
Using split attention in VAE
Using split attention in VAE
clip missing: ['clip_l.logit_scale', 'clip_l.transformer.text_projection.weight']
Requested to load SDXLClipModel
Loading 1 new model
Requested to load SDXL
Loading 1 new model
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 20/20 [00:08<00:00, 2.34it/s]
Requested to load AutoencoderKL
Loading 1 new model
Prompt executed in 18.44 seconds
Here's how to install and run Stable Diffusion locally using ComfyUI and SDXL.
thank you
I don't believe this! The image quality has gone back to how it was before I installed the portable version. Maybe it's related to some custom nodes I installed 🤷🏻♂️
this is great, looking for a way to add a remote comfyui ip address/port 🙂
Yeah that's been on my todo list for ages... Do you have a cloud comfy api running somewhere?
i run it on a local pc with a 4090 i use for testing, have it setup so i can reach it from a public IP
mass replaced all the 8188's with my port number and the IP address as the host
no luck yet tho, still playin around, great app so far tho!
If you install/run the github repo on the same PC as comfy, then accessing your public IP on port 6789 should work?
just modifying Comfy-SC.json right now to see if i can get it to connect 🙂
yeah, running great on my cloud VPS server for the frontend
just sits here while trying to reach back to my comfy server via those changes i made in the comfy json file
Yeah, it won't work I think. Let me recheck the code to confirm though.
Huh. Actually it should work. In Comfy-SC.json you would change "host" to the ip address and port to the port. I've never tested it, but the code handling it looks right at a glance.
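For anyone following along, the "change host and port" step can be scripted instead of mass-replaced by hand. This is only a sketch: the real layout of Comfy-SC.json isn't shown in this chat, so the sample structure and the `calls` / `path` field names below are made up for illustration. The only thing taken from the conversation is that the file has "host" and "port" entries to point at the remote ComfyUI machine.

```python
import json
from typing import Any


def retarget(node: Any, host: str, port: int) -> Any:
    """Return a copy of parsed JSON with every "host"/"port" key rewritten."""
    if isinstance(node, dict):
        out = {}
        for key, value in node.items():
            if key == "host":
                out[key] = host
            elif key == "port":
                # Keep the original type: some configs store ports as strings.
                out[key] = str(port) if isinstance(value, str) else port
            else:
                out[key] = retarget(value, host, port)
        return out
    if isinstance(node, list):
        return [retarget(item, host, port) for item in node]
    return node


# Hypothetical stand-in for the file contents (NOT the real Comfy-SC.json schema).
sample = json.loads(
    '{"name": "demo", "calls": ['
    '{"host": "127.0.0.1", "port": 8188, "path": "/prompt"},'
    '{"host": "127.0.0.1", "port": "8188", "path": "/view"}]}'
)

# 203.0.113.7 is a documentation-only address; substitute your own IP and port.
patched = retarget(sample, "203.0.113.7", 8188)
print(json.dumps(patched, indent=2))
```

To apply it to a real file you would `json.load` it, call `retarget`, and `json.dump` the result back, keeping a backup of the original first.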
mass replaced all the x.x.x.x with my external IP, nothing in console, comfy didn't throw any info either on the other server
will keep tinkering, i enjoy this stuff 🙂
i know my external is working tho, my other app is still connecting via IP/port
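A quick way to separate "network problem" from "bug in the app" is to check that the frontend machine can open a plain TCP connection to the ComfyUI host and port. A minimal sketch using only the Python standard library; the function name `can_connect` is made up here, and a successful connection only proves the port is reachable, not that ComfyUI is answering correctly behind it.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable hosts.
        return False
```

Run it from the VPS with your own values, e.g. `can_connect("x.x.x.x", 8188)` with your external IP and the port ComfyUI listens on. If it returns True and the app still hangs, the problem is more likely in the app's request code than in the firewall or port forwarding.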
Yep. Looks right then. Can't debug till after work. If there's a bug in my code (sounds likely), it's in server.js somewhere after line 177.
Be warned though: I have an implementation of formdata there that looks intimidating. 🙂 But comfy uses formdata, so there it is.
Easiest fix is just to run it in your local PC, but I'll debug it tonight.
no worries 🙂 don't get in trouble at the office
LOL found it. Line 295 and line 338 request.port ? ... should say requestOptions.port ? ... Don't think I can push a change from here tho.
have you explored adding support and widgets to a pre-existing application?
Not sure I follow. What did you have in mind?
(This is a webapp, so technically it could be a node in Comfy or a tab in A1111.)
i am saying that there is a lot of demand for high quality openpose posing inside e.g. photoshop, which could be connected with, whatever
or if there were something that had a hope and a prayer of supporting extensions on ipad for drawing purposes
but then you wouldn't get mired in the product development of writing a paint application
there are a lot of existing ones
changed both lines, gonna restart node
have you tried any of the photoshop plugins?
Make sure you change both POST and GET lines.
i have not
I have never used photoshop, and I'm only personally interested in a tablet UI, something like Procreate (also never used that though.)
which drawing tools have you used on a tablet?
There are zero open-source apps with a tablet-based UI like Procreate / InfinitePainter that I could integrate this into. It's a very deep integration you know. Have you tried the Krita plugin? That thing is clumsy and tiny. 😕
Infinite Painter on Android. That's where my UI inspiration came from.
Any success??? 😮
not yet, gonna add some console logging to see where it's hanging
Also, if you have an ipad, please test my app. 🙏 I'm really curious to know how far it gets through loading in Safari.
i see. so what is your goal?
everything makes sense to me
Goal is to be able to use AI while I'm making art, but forget about the tech. Seamless experience so I don't break my flow / lose sight of my art. 🙂
for example, "compile krita to wasm, write a shim for the pen attributes of the pointer / touch events" has a 100%, known, defined goal that requires basically no product development decisions. then, you can make a plugin that "just" works inside krita
it's a ton of work
but suppose you had unlimited manic energy. lots of people would use that
that would also solve the issue of using krita on an ipad
Krita runs on Android, I've used it for animation. I always go back to InfinitePainter because the UI just "disappears", so I really feel immersed in my art.
My goal here is about the experience, not about the exact tech, if that makes sense.
gotchya
i wanted to recreate the SD doodler thing, need to get back to working on it, so many side projects lol
but you can modify krita to make the UI better, no?
and if you did that
Yeah sketch -> render has been the main way I've found myself using my app so far. Works so smooth. 🙂
then, you would get everything else they've figured out
No, definitely can't. Would take years to rewrite the codebase.
(This uses CSS, the most powerful UI spec out there.)
hmm who said anything about rewriting the codebase?
i mean surely making UI elements in it is pretty straightforward
guess i need to set mine up with a cascade workflow, would probably get better results, might try that after dinner tonight
this is just food for thought
there is only a canny controlnet for cascade
no idea how img2img would work with it for this problem
ah yeah, true
@uncut gull like if you have unlimited manic energy for programming tasks, why not focus on krita?
That is absurdly horrifying. Seriously. UI coding is the hardest part of any software. It's 80% of the time investment, easily.
And you can't make this UI with Krita's toolkit. It's just not compatible.
this is something you determined from reading their code?
i know you mean to use the word "impracticable" and not "can't"
but i am aware there are already krita plugins for the automatic web ui API
and that has custom UI
and yes, maybe krita doesn't have a robust way to define new widgets / manipulators, but i don't know. i guess the important thing is that answering such a question requires zero product decisions, it is only an engineering question
The UI touches every single tool in the codebase. It's the deepest part of any integration. You have to change literally every piece of code there is. Sorry, but I don't think you've tried something like this before.
so i am hearing you have not read their codebase
which is okay
or at least explored this
that's why we're talking about it
i haven't either
I have hacked on Krita before. You're talking about an 80% rewrite.
what was your goal with krita hacking?
Debugging the SD plugin back when it was new.
okay, and you didn't find any pre-existing browser based drawing tool that you could modify? i am sincerely curious which ones you've looked at and why you didn't think they were useful to work with
and finally, you are 100% certain you don't want to build a plugin for photoshop instead, which has a really robust and thriving plugin ecosystem? or something else on the PC / macOS device that has first-party remote pen / tablet support, such as photoshop?
i wouldn't want to work on krita either, don't misunderstand me
here's an example of a product decision: maybe there is a reason there aren't widgets for making pose vectors. presumably it would make more sense that the user draws a pose, maybe with a stick figure, and then something else turns that into the pose vector for controlnets
because moving around pose widgets is kind of tedious
i don't know though.
Because, again, you have to integrate this into the UI. Plugins let you add tools at best. You realize my app had an open-ended UI called APIFlows? That means whatever your AI tool input needs, you can write 1 json file and have a seamless art experience. I had zero prior art to draw on building this. Just the idea of connecting layers to an open-ended tool UI as node inputs is completely novel across all art apps. This cannot be done without rewriting an app's UI, and that's 80% of any art app, and there are zero tablet art apps with even basic features.
okay i really respect what you are working on don't misunderstand me
it would be really helpful to answer the question
even if it is tempting to give me a bunch of opinions
This would require training a new AI model, but that dataset doesn't exist.
i kind of want to know what you already looked at
and either the answer is "nothing" or a list of pre-existing web apps
I tried to improve the Krita plugin, then went looking for open source art apps on github, was shocked by their lack of features, wondered if I should fork something, then realized the UI code was something never done before. It couldn't be patched in. 😕 So here I am 2 months later with a new app.
photoshop already has pretty robust generative tooling, and it will ostensibly get better and better. it is arguable if controlnets even make sense, in the long term, there's a reason dall-e3, imagen/google, ideogram, midjourney do not have them
okay, so you're not aware of any pen based open source web apps?
you haven't looked at any
simple yes or no correct?
I am trying very hard not to be offended, so I'll sign off here. Have a nice day.
i am sorry, i am not trying to offend you. i am not talking about krita, just other applications that you may or may not have looked at
i found two that might interest you - https://github.com/LHRUN/paint-board and https://github.com/bitbof/klecks
@uncut gull both are very high quality and have a lot of great product development in them
@hoary saddle this lhrun paint board is really nice and has pre-existing, super slick google "ai doodle" integration
like it's very simple
LOL yep those are both shockingly bad apps I saw. Here's some more. 😛 https://opensource.com/life/16/5/open-source-drawing-applications-android
hah, i was just reading that github
this thing you linked me does not have either of these projects in it
it sounds like the answer was no
i like yours better if i can get it to run on my setup 🙂 gonna finish up some work at the office and tinker more tonight
yeah i think it's all good
i am just wondering
there is so much effort going into so many different, uncoordinated places
and before i am painted as the bad guy here, you know, it is pretty bold to say those really high quality, well polished, maintained apps with audiences and some adoption are "shockingly bad"
i don't think a paint app needs to have adoption or be bug free or have an audience or whatever to be good. there is no "right" app or opinion. but something something, log in your own eye first
It's about art. Krita vs. Infinite Painter: Krita has more features, fewer bugs, a larger community, and is free, and has a SD plugin. So, why don't I use Krita on my tablet? It's something about the "immersiveness" of Infinite Painter (which was basically copying Procreate). It just feels natural. You forget about the tech and lose yourself in the art, just like with physical media.
That's the goal of my app. That flow-like experience. Nothing else. 🙂
yeah, this is a big deal imo
there's a big gap right now... if you want to use drawing in SD, you have to transfer stuff back and forth and break your flow with too much time on the tech stuff, as much as i enjoy that
I understand you are very opinionated about the UX, and I agree with you
It’s just that I question the wisdom of writing a layered paint tool from scratch
When that LHRUN repo has literally three onscreen UI elements
The journey to removing or adapting them to suit your needs is small
That’s an objective fact
Then you are free to try, friend. I've never written typescript before.
(Did you know that LHRUN app you just mentioned doesn't have pinch to zoom-pan-rotate like Procreate does? That's because in order to code that functionality you need to understand matrix inversion. Instead, they have a "hold two fingers" to activate a pan and zoom mode, no rotation. Easy vectors. What else. They don't have layers, much less layer groups, so obviously no layer blending. They aren't GPU accelerated; they're limited to SVG drawing, so you can't paint at all. They don't have undo/redo. Do I really need to go on? Do you get what I mean by "shockingly bad"? I'm not offended though. You just didn't take time to do research, and you also wouldn't have my dev background to guide you if you tried to research. Thanks for your interest in my app, and if you do get to test it on ipad, I'd love to know how it goes. :-))
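To make the matrix-inversion point concrete: a two-finger pinch defines a transform (scale + rotation + translation) from where the fingers started to where they are now, and to paint at the right spot you have to map screen touches back into canvas coordinates, which means inverting that transform. The sketch below is a generic illustration of that idea in plain Python, not code from any of the apps discussed; all function names are made up.

```python
from typing import Tuple

Point = Tuple[float, float]
# Affine matrix as (a, b, c, d, tx, ty):
#   x' = a*x + c*y + tx
#   y' = b*x + d*y + ty
Affine = Tuple[float, float, float, float, float, float]


def pinch_transform(p1: Point, p2: Point, q1: Point, q2: Point) -> Affine:
    """Similarity transform moving finger starts p1, p2 onto positions q1, q2."""
    zp1, zp2 = complex(*p1), complex(*p2)
    zq1, zq2 = complex(*q1), complex(*q2)
    if zp1 == zp2:
        raise ValueError("the two starting touch points must differ")
    s = (zq2 - zq1) / (zp2 - zp1)  # encodes scale and rotation together
    t = zq1 - s * zp1              # translation that pins p1 onto q1
    return (s.real, s.imag, -s.imag, s.real, t.real, t.imag)


def apply(m: Affine, p: Point) -> Point:
    """Apply the affine matrix m to point p."""
    a, b, c, d, tx, ty = m
    x, y = p
    return (a * x + c * y + tx, b * x + d * y + ty)


def invert(m: Affine) -> Affine:
    """Inverse affine matrix, e.g. to map screen coordinates back to canvas."""
    a, b, c, d, tx, ty = m
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    ia, ib, ic, id_ = d / det, -b / det, -c / det, a / det
    return (ia, ib, ic, id_, -(ia * tx + ic * ty), -(ib * tx + id_ * ty))


# Fingers start at (0,0) and (1,0), end at (10,10) and (10,12):
# that is a 2x zoom, a quarter-turn rotation, and a pan, all at once.
view = pinch_transform((0.0, 0.0), (1.0, 0.0), (10.0, 10.0), (10.0, 12.0))
screen_to_canvas = invert(view)
print(apply(view, (1.0, 0.0)))                 # canvas point -> screen
print(apply(screen_to_canvas, (10.0, 12.0)))   # screen touch -> canvas
```

A "hold two fingers to pan and zoom" mode without rotation can get away with a scale factor and an offset; once rotation is in the mix, the forward and inverse mappings above are what keep brush strokes landing under the pen.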
it is true, i am not really paying close attention. i suppose klecks is pretty close
it has layers
i mean you had to write the layers from scratch no? might as well have written them from scratch for LHRUN
pinch to zoom? same thing
if you feel strongly about it, implement it maybe for this existing thing
i agree with you that this stuff is challenging
and i am not minimizing what you are trying to accomplish
I've never written typescript before
you should try it soon
What, exactly, would I be reusing from LHRUN? Because bear in mind, I would need to spend a month at least learning its codebase. Three months for Klecks, and a year for Krita (if I'm even capable of learning a codebase that big).
I started this app around February 22nd, and I've already surpassed both those apps, and I know my own codebase inside and out, so I have unlimited, fast iterating power on it.
i did a very light search on github. maybe 5 minutes. it's your time, not mine
I started this app around February 22nd, and I've already surpassed both those apps
hmm
I strongly oppose typed languages. I'm in the camp that you waste 50% of your time debugging typedefs and only save maybe 10% preventing type-bugs.
all i am advocating for is to focus your manic energy
LOL yeah with caveats. Klecks probably has a couple things I don't. But I could never rewrite Klecks's UI. Not my code.
to just consider it
because it can yield great things
it's like a water cannon
you choose where to point it
while in principle you will learn a lot by building a layer-based editing tool, even a high quality one
like it's good to be very opinionated about the UX, but delivering those opinions into applications people actually use, like Photoshop, it will be very high yield
i mean nobody uses krita either
if you feel strongly about tablet based tools
it's a lot of work. there's a reason there isn't a good github project for native ipad drawing
it's a ton of work and in a darwinian way, all the people who tried to do it for free have failed
i'm sure people have tried
For whatever reason, all my YT artists use Krita. I'm really like zero-exposure on PS.