#✨|sdxl
1 messages · Page 170 of 1
thank you malicor, you make me 
wow nice
HDR photo of woman, dark-brunette hair, light-blue eyes, (f_stop 5.6), (focal_length 28.0), f/5.6, 28mm focal length, sitting in bed, contemplating, sunset, romantic, warm, revealing clothes, alluring, white sheets . High dynamic range, vivid, rich details, clear shadows and highlights, realistic, intense, enhanced contrast, highly detailed

I set out to create a 5K desktop wallpaper for my Studio Display.
I used SDXL base to generate the initial image in 5K resolution. The resulting render was lacking in coherence and exhibited all sorts of perspective issues. However, I liked the general feel and the overall composition so I decided to continue working on it.
I wanted to maintain the 5K image size without resorting to any latent or pixel upscaling. I decided to do a little inpainting to see where it would take me.
I used ComfyUI for image generations and I used Affinity Photo on a PC to create the individual PNG mask files and to handle the compositing of inpainted elements back into the master image.
Once ComfyUI had generated a suitable replacement element using inpainting (e.g. a person, banner, scooter, etc.) I would paste that element back into the master image file in Affinity Photo and then I would create a new masked section of the image for the next element.
Rinse and repeat quickly turned into an inpainting-palooza.
No fine-tuned models, loras, or control nets of any kind were used. Everything is base SDXL.
I created a short timelapse video to illustrate the process for those of you who might be interested. I've posted a higher-res version to the Stable Diffusion sub-reddit.
Here is the image you requested.
is this using fooooocus?
A book cover titled "Principles of Marine Sediment Dynamics", designed with bright colors and a simple, abstract style.
Please draw a picture. The ratio of the picture is 16:9, and the main color of the picture is dark blue. The theme of the picture is technology and intelligence. Thanks.
Here is your image...
Chinese city with a distant view of a rolling mountain range with snow on top, the image is of a boulevard lined with traditional Chinese buildings, skyscrapers in the distance, and an ancient well with a Chinese dragon hovering over it in the near distance, the image is colorful and Pixar-esque.
#1047610792226340935 bots offline. You can run Stable Diffusion locally or use online services.
I don’t know if this is the right place to ask , but any good SDXL instalation tutorial? Thanks
https://github.com/AUTOMATIC1111/stable-diffusion-webui you can try this web ui first
Its as good a place as any, but first you gotta choose a platform/interface , SDXL is a model , to interact with it you may choose one platform or another considering ease of use, hardware, expectations, etc... you might wanna follow some YT guides to get started , apart dad jokes I like the https://www.youtube.com/@sebastiankamph , he is concise and slowly breaks things down , but there are plenty others
Professional Art Director teaching you AI Art and Generative AI.
Your go to for Stable diffusion guides and tutorials.
- Dad jokes.
Get early access to videos and help me, support me on Patreon: https://www.patreon.com/sebastiankamph
This channel is about artificial intelligence and machine learning news and content related to the creative ar...
一个女孩被一只棕熊追逐
#1047610792226340935 Bots currently offline, please use Stable Diffusion locally or use online services
whats the status of stable diffusion 3 by the way ? anyone got access yet ?
there are some people got access to sd3 but afaik they won't release the whole model yet
Are you an administrator or?
no, just a normal user here.
So there’s no problem chatting here, right?
yes. but i assume you are asking for a image generation
haha,It seems that there are only xl models here, not 1.5?
yes. this is a place for posting and discussing sdxl
Which one is the 1.5 model? Or is 1.5 outdated?
Which country are you from?I mean do you want to come to my place to play?Bring you barbecue and skewers😋
wait a minute you are chinese right
ang,Chinese pure man
两个中国人用英语讲话讲了半天草
那就是 中国纯爷们😆
哈哈
hi
Heyo, please keep it to english only. Thanks!
There is no need for any trigger. The strength should be 0.8 or 1.0 depending on the subject (do some testing). "The line" style refers to a form o...
Awesome
thank you
Lol looks like fun, might check it out
it's even better
holy shit this is amazing
better than the cascade clipvision fun we had recently
With so many cretaive crazy amazing things people come up with you'd think consistent characters and stuff woudl be like the most basic most boring thing they'd just do to get it over with.
fuse watercolor and cubism a portrait and add some rodent teeth and floating cucumbers and a dragon in a glass oven. got it. Get the image of the same caharcter from two angles... nope
that's not really that hard to do, all the tools exist
yeah but not 100% its always kinda wonky
you get the pose kinda but not really i mena try getting squatting people or people laying in bed. and the angles... forget it anything that veers too far from flat frontal angles gets messy fast and then of course the textures, skin blemishes the minute details... forget it
yeah, if you want total consistency then you need to actually train a lora
its like were halfway there, this is awesome for spitballing and coming up with amazing stuff but when you want to use that stuff for anything things fall apart fast. imagine a 3D program and you made a nice 3d object and everytime u move the camera the object changes a bit
when this will be sorted THEN this will change the world
until then not really
there's no way around that, no genius trick with diffusion models that can accomplish that without independently rendering a 3D model of a character first with more traditional methods
i did loras
made some for 1.5 and some for sdxl
they dotn give total consistency
yes but if we go and just do the old way than how will this improve our speed and productivity? were back to square one almost lol
and even if we do use some 3D poses as references still we cant "smear consistent AI over it"
like clearly ai is having an understanding of 3D objects at this time... couldn't there be a way to promt something and select it with inpaint and "name it something like woman in red dress" and lock it down and really tell the ai to keep that "concept" as is and not change it anymore except the pose and lighting. and then whenever we promt that "woman in red dress" it will give that character. lol this could work for objects too maybe? and then you could generate up a bunch of props and characters and then mix and match them in photoshop or whatever and bring them back and just fix the lighting with AI and late paint in motion / lipsync, etc. feels like something that should be possible maybe... idk lol. wishful thinking. or is it
i never know with this, things that used to take 10 years take 2 weeks now
||dog||
#1047610792226340935 bots currently offline, you can run it locally or use online services like https://seaart.ai
A boy
I'm pretty sure it's not a boy
Some say it breaks ComfyUI? Has that been fixed at all? 🙂
The screenshot is in ComfyUI, so clearly didn't break it there 😄
@copper kraken I'm experimenting with Ip-Adapter and Photomaker, If you dont mind,wich IP-Adapter model were you referring to?
copy space, ultra detailed photo, forest silhouette in the shape of a wild animal wildlife and forest conservation concept in the winter night, northern lights dancing. Beautiful design for wildlife preservation, environmental awareness. ¯_(ツ)_/¯
Breaks how so?
Sdxl plus vit H
I trying to make a Lora now. And I am kinda confused on if I should make it on 1.5 , sdxl, 2.0 or 2.1. Or 2.1 blob vs 2.1 base
Also I am running dreambooth v23.0.15 and my layout is completely different than other videos I’m trying to follow.
I’m trying to make a hyper realistic model and I’ve gathered a bunch of photos 50+
Apparently it overwrites the old IPAdapter, so stopping all the projects using the old node?!
correct
tbh, it probably should've been released as ipadapter v2, so people could keep the old one too and eventually uninstall it once they were done
ok now THIS ended up being very epic
can't seem to get metal claws though
adding wolverine into the mix just makes him look like wolverine of course lol
Nah, better to not split the user base up like that. It's mostly all done by a single dev, so he shouldn't have to deal with the headaches of separate repos. You should always consider everything in ComfyUI to be subject to change. If you like the old version, redownload it and put a gitignore in the directory to prevent it from being updated
true
im fine with it myself
its really not hard to tweak a workflow for the new nodes
takes 30 sec
yeah the prompt coherency is really failing on me, even when doing this regional prompting
im trying to replicate something I have in my SD3 prompt list
and no matter how much I tweak the settings or the prompt (or even use a base image with high denoising to get good composition) it just won't work 90% of the time
whah is it
look at my previous images here, two fighters, one with metal (wolverine-like) claws and the other with a katana
looking at eachother whilst doing a fighting pose
I bet festivalman could get this one right, but I just can't
epic
lol, this is what happens when I accidently paste the giant instruction set with examples into the regional prompter.
but i post the stuff here that works. there's a fair amount of stuff that doesn't.
this regional prompter stuff can do amazing things, but it still hits the limitations on interactions that sdxl has unfortunately. regional prompter with sd3 will be epic.
yeahh, well thank you anyway for trying
it's definitely a weird one. i can get it to do all sorts of stuff but can't get it to do both of them. even if i make one of them a cat (so not a man at all), it only draws the cat 1 out of every 4 tries.
look at that. sometimes claws guy, somestimes samurai, sometimes neither.
not sure what regional prompter extention is really thinking here.
SD3 got this on first try (even if the result is a little distorted)
also idk why it's pixelart, I asked for digital artstyle
OOOH. i have the answer. one sec.
maybe "video game" confused it
getting closer. 🙂 back in a few
cool
There is always Interrogator: "a woman in a hooded jacket and hoodie walking through a muddy sea of water, splashing dynamic pose, long brown hair blowing wild in the wind elden ring cinematic lighting, monks!!!!!!!!! fire, tarot card goddess of death, wearing brown jedi robes, portraits of a woman enraged, dangerous cliffside, sansa, power pose, 2 0 0 0 ad magazine setting, by Sigurd Swane"
Interrogator never does a great job though.
What about vision models
It can be a good base to work on.
okay no that would be OP with SD3 though
Nice
not happening with sdxl, even with regional prompter. sd3 is the answer
yessir
I've learned that you can still do a LOT of amazing stuff, but interactions? nope. possibly with inpainting, but I ain't got time for that
can you show the ideogram result in like SPOILER or #🏞|general-with-images
absolutely
as you can see, ideogram results are pretty rough, but they give a great basis for img2img to clean it up
well, sd3 should look a lot better too
but unlike ideogram we can do highresfix, which will fix the faces!
or just adetailer, but I don't use those
yeah exactly
if I can do highresfix on the 8B with 12GB of vram I'd be fine for like.. almost ever?? idk how much the finetunes will bring as well
this plus this with juggernaut 9, prompt steampunk machinery in a city
yeah with tiled upscale, i can even do it on my 3080 with 10 gig.
equals this
one last one then back to work for me, one from a while ago, ideogram but with tons of img2img sdxl upscaling. i have high hopes for sd3.
yeah im just afraid of tiles ruining coherency between tiles of course
unless there's gonna be some controlnet tile which is really good at stiching it all together
idk
your images look good though
I've found that I need to latency upscale 2-3 iterations to maintain consistency, THEN I move to tiled ultimate sd upscaling. Then it works well, but yeah, those first couple generations need some vram.
decided to sing up to ideogram to test stuff, here it is with my exact prompt
It's pretty good actually, I expect this quality from even the 2B model
same inputs sdiff prompt
you know the style of the images and faces kind of remind me of 2.1 for some reason :|
for Ideogram
It's fun to use though
are these the standard or base line sdxl models making these?
not entirely either exactly, but juggernaut9 which is sdxl
im on comfyui, load these images and youll get the workfflow
I had no idea merging models and lora was so easy. I combined all my favorite things, dark arts images, proteus-rundiffusion, and andrea75c's cute3d cartoon lora into one checkpoint that does it all quickly. from left to right, dark arts, proteus in middle, final combined 3 way merge on right.
hah I should do that midjourney mimic lora onto this as well for gawdy over stylized rainbow brightness.
works with regional prompting well as well.
this merge looks good
yeah it combines 2 models that have good prompt adherence, with the smoothing out factor of the cute 3d
wow
can you try the fighting one with this
can you copy/paste the prompt that you liked from ideogram the best? the magic prompt one?
I'll see if that helps at all. My hopes aren't high for that one in particular. I think this giant at the gate works well because there's no subjects interacting with each other.
I had this detailed prompt in my SD3 prompt list, I hope this helps:
A game screenshot of a fighting game in digital painting style. There are two yellow health bars. The characters are both black silhouettes against a colourful background. The background is a beautiful landscape of a lava mountain. The left silhouette character is a ninja holding metal claws and the one on the right is a japanese samurai holding a katana.
you could opt out the "yellow health bars" and "game screenshot", those might get in the way
so this is just as is, that original prompt pasted in without changing anything or regional prompting.
not bad
SuperPrompt V1 gave these:
In the center of a vibrant canvas, a fierce battle scene takes place against a backdrop of lava-covered terrain. The left silhouette character is a majestic ninja, holding wolverine claws and the right behind it is a japanese samurai, holding a katana. The black background is reminiscent of a vivid landscape, with the lava mountain contrasting sharply against the stark white background. The painting exudes an aura of calmness and serenity, making this an unforgettable representation of martial arts in the most unexpected way.
In the center of the canvas, a fierce battle unfolds before your eyes. The black silhouette against the vibrant backdrop of a mountain range is depicted in striking contrast against a captivating backdrop. The ninja holding metal claws and the samurai holding a sleek japanese samurai holding a sleek katana are both prominent figures against this stunning visual masterpiece.
I'm gonna try running it at a higher cfg and see if that helps with drawing the second person.
these look stunning
yeah other than the regional thing, the quality looks very aesthetic.
i'm running an x/y/z plot, from 4 cfg through 11.
I'll run the super prompt after this is done.
thanks
nope, all those cfg values, all at 1920x1080 native res (sometimes that helps). only 1 person ever. the extention just doesn't want to do it.
this was with the super prompt thing you pasted.
rather good actually, that first one.
that's a really good prompt actually.
oooh
if you want I could run a couple of prompts through it for you
but I can also show you how to run it offline, it's all on CPU and it's super fast
it seems proteus is more coherent
I think it gives the best combination of them all..
it certainly is for that.
im going to try merging it but in real time using comfyui
here's the workflow if you want to make it for yourself. you just have to plop in the checkpoints and lora files.
yep, amazed at how easy it was.
honestly darkarts reminds me of illuminatidiffusion
I used to use that model all the time
back in the 2.1 days 👴
yeah, that's the only part that makes me a little sad. I worry that some of the people who made/make amazing stuff for sdxl might not be around for sd3.
Me when 2.1 had 1 step images without any special implementations

models almost done downloading 
oh I have to have the one WITHOUT clip?
wow
is it cause of the ponyXL stuff?
yeah he went nuts with ponyxl and everyone revolted.
so he quickly dropped it. version 2 might come out if he decided to release it. I think he wants to try and make money off an image bot for a while first.
it's funny, midjourney mimic lora makes it look even better, but the prompt adherence goes right out the window.
actually this isn't bad with MM at 0.3 strength
I'm finding that dpm++ 2m karras 20 steps, followed by a latent upscale of area/1.5x and then another dpm++ 2m karras with 20 steps at 0.5 denoise cleans up any messed up swords etc.
lol
hah that's really good, although i have that love for the slightly pixarish look
@copper kraken
kind of unrelated but still looks cool
that also with the new model?
yes
inpainting?
because I could never get that even with regional prompting.
this however, is amazing. the previous generations from either of those models could never pull this off.
get what
This shit is espec good with the res momentumized custom sampler and a custom scheduler
that with the merged model or the ip adapater stuff?
Oh nice, what'd you do to merge
Any specific weights etc or just straight up model merge simple?
proteus-rundiffusion-without clip, dark arts, and 0.5 of cute 3d render
there's the merge json
Ohhh that's a comfy workflow, flew over my head ha
Def will check this out
Try the params in that nuke image I posted btw
That sequence is giving me great results with the res sampler
Includes a non tile upscale which you might like
ok looks like i need another update
do i need to download any new models compared to yesterday?
Nope
Oh oops you don't need that
Forgot I had junk off to the right
Go ahead and delete that entire island lol whoops
That's the new SDXL tile controlnet
that is a wonderfully random image compared to the sources. 🙂
Awesome
Yeah this blows cascade clipvision outta the water
Now if only we actually had non shit sdxl CV...
It'd be really interesting to hit from both the unet and conditioning side
that is some weird wild stuff
nice
Still need to try attn masks with this
hold on, I'm having a apocalyptic picture-off with trustory here.
Need some advice, I want to turn this ugly looking image into prettier photo, the text can go away, but i want to prettify it, make it seem more 3D instead of being like a 2D drawing.
Is it possible to make it happen?
is that my house when all 3 nodes are running?
No kidding
reordered some stuff, so now it's just 30-40 seconds for 9 images.
That's awesome
Btw did you try forge?
And also. That exploding house.... Ipadapter is god
I didn't. i should. 🙂 but I have to not ignore the fam so I should stop for today.
another day, another no sd3.
Exact same params with ipadapter weight = 0
This was the image I fed into it
Compare to the violence of this image
Incredible improvement
hah it is definitely more splodey
Vastly
The first one is what I hate that it loves to make
An intact structure surrounded by fireballs
Destruction/flame/whatever tending to be adjacent to the image with no real connection
No splintering structure, charred paint adjacent to the flames etc
Then feeding one crappy low res images in via ipadapter and boom, literally
Prompt: house exploding
it's not gonna make this kind of nonense though. 🙂
I think the regional prompter command is good, I just need to get forge going on the nodes so it's actually fast though.
Forge was a lot easier to get set up than I recall with a1111
Came with more shit ready
oh ffs, i figured they would have fixed the VAE for the lightning models... in case someone doesn't know, the sdxl1.0+1.0vae is bugged and makes these horrible artifacts like on the right side of this image. they even reverted it back on the HF a while back so that it's sdxl1.0+0.9vae. i guess the lightning models are using the bad vae. the left side of the image is decoded with the 0.9vae and the right is with the baked in vae from the 8step lightning model
this is super zoomed in btw
though i guess to some, the "chromatic aberrations" are a feature and not a bug lol
yeah youre tellin me. the only reason why i noticed is because it will absolutely destroy your image after even a few rounds of inpainting. it will get progressively more and more blown out like a bad feedback loop
other than that, i actually really love the 8step lightning model(i dont do a ton of stuff with people people though, so i dont know how it stacks up against the base model at that)
great ! my message ignored
thank you, exactly what i needed ❤️ can you please do this for these 3 ones as well?
another update to deep blue is out. https://civitai.com/models/128397/deep-blue-xl?modelVersionId=400487
Deep Blue XL Just keep mixing the successive BluePencil XL About This model is a illustration merge model. このモデルはイラスト系のマージモデルです。 simply a model tha...
thanks!
i'm going through my merge stuff, i thought the cute3dcartoon lost prompt adherence and it just turns out it's because i was doing square aspect ratio. it's fine at 0.5
so now i'm testing merging with deep blue instead of proteus, but I think deep blue is too animated and pulls away from the style of dark arts too much, whereas proteus is complementary.
gotcha
yeah there is def some of that
there's some more advanced merge nodes too where you can pick out the different weights within the unets themselves or whatever it is
tbh, just baaarely have the weakest understanding of that stuff right now
but i've poked at it like a caveman a bit and noticed a pretty big difference depending on which parts you use
that was with loras but i believe it translates over
It's really neat to play even with just the ratio numbers and strengths on the loras. to keep hitting render and see how things change and see which is better or not. some unexpected results, where I thought a mix would be awesome but it totally wasn't.
yeah, and using the clip from any of these loras just outright destroys the prompt adherence of a good checkpoint. I had no idea until i started playing with this.
but it might be possible to avoid pulling over the animated style and get more of how it structures the image... idk
the midjourney mimic one basically just makes everything into a portrait with clip on. gets rid of anything in your prompt other than the single centered subject.
if that's the case, carosello is a top candidate imo to throw in
with it off, things come back to life
you can go pretty high with the model, with the clip off on the lora and still be in good shape.
Please send me feedback
This guide is still work in progress. Any and all feedback is highly appreciated, it doesn't have to be suggestions, even questions regarding things you didn't understand can help me figuring out what to refine. For the moment I can be found in /sdg/-threads, but I might m...
this is what i'm referring to
something i think that would be really beneficial for us to wrap our heads around eventually
exactly how to manage this that is
but yeah i saw some really interesting effects with loras by dropping various blocks
i'm ready to get an ipadapter tattoo on my forehead
haha yeah
concerning that block merging, I think that goes beyond what I'm willing to sink time into. I'll leave that to the people who make the major models. they're clearly very good at it.
well if you look at that first scheme
it appears dropping the first few blocks from deepblue might prevent the animated effect from carrying over too strongly
I think datavoid with the new proteus model he has on his paid bot has really struck gold with what you're talking about here. luckily I got the version just before that at least. hopefully he'll release this new one at some point. it's a pretty big leap even over what we have publicly from him.
ahh i haven't seen it, is it a closed model now? no weights?
so let me say this before I forget. I just ran my big giant at the gate on my channel. one before it is with 350, now with 40. hard to know what's what because of different seeds, but it looks like it's little more detail oriented. high detail anime is awesome.
i think i heard you or someone else talking about it the other night when i was in a daze from overwork + SD induced sleep deprivation
yeah proteus-rundiffusionv2 and now v2.4 is closed... he says he needs to make money to be able to keep doing this.
with sd3 right around the corner, doesn't seem practical.
yeah
I'd pay to be able to download the model, but I'm not going to use his bot.
I think a good number of othr people have said the same thing.
yup i'm on that list
it's local or it's something i'll mess with for a few hours at best
yeah, and i want my llms and all that other stuff. otherwise i'll just use midjourney
i want discord bot stuff AND comfy, not only one.
yeah exactly
all the tools are what makes it so addictive for me
https://civitai.com/models/143043/starlight-xl-animated don't know if you've seen this one, but this is another very good model
⚙️ ComfyUI workflows included 🇯🇵 Danbooru tags supported 💬 New Discord, come play with us: https://discord.gg/rHCnjX9cW9 🌟 Starlight XL 星光 Animated 🔮...
doesn't as hard toward animated style as deepblue
can do a lot of photography
that was my go to forever. then i found proteus, and now i've found all these other new ones.
gotcha, cool
i did a lot of A:B testing with that one, and proteus, even the early ones, were way more prompt adhereing than starlight, even though starlight looked the best. it makes me said that starlight hasn't been updated in a long time
agreed
lots of stuff that's just sitting around that sucks to know it's not getting updated...
hey what was that new aetherverse or some such that came out recenlty? have you tried that?
Make sure to check out my new Lightning version: Aetherverse Lightning XL This is Aetherverse XL - a multi-concept model that does everything from ...
supposedly does good darks. gonna try it now
downloading, thanks
my SSD doesn't say thank you..........
haha, i really need to just buy a 4TB
honestly, i'm procrastinating because i don't feel like having to take the 4090 out to stick that in
need to just do it
well, do you really need all those? 🙂
yes >_>
lol
honestly part of the reason i hoard is because my connection isn't that great
the main issue is a lot of these models i know are good at doing one single thing
for example, thinkdiffusion i fire up for one thing only, nukes
it's funny. i'm sitting here slowly increasing the cute3d render. and it goes from stone giant off on the side, which is especially prompt following, and it just gets more and more centered and portrait-ey
that first one is aetherverse, the new one, which is rather prompt following for what i put.
the higher deepblue i add, the more centered and portraity it gets.
but more unrealistic.
they def have their tendencies
that's where the block merging looks so interesting
def going to spend a bit more time looking into that once my schedule clears up a bit
the experts are def going to run circles around us on that probably forever, but it'd be really nice to be able to get custom models
stuff like... noticing you really like a certain type of composition a model is giving you, but not liking the style
so extracting the part that's leading to that composition and swapping in blocks that will guide it toward a diff style
hah yep. I can't wait until sd3, where I feel like I can really describe a scene where something's actually happening, so what you just posted means something as part of something bigger
so you could have reactions from other characters to that, or where that thing is climbing in the window...
or crawling out of the television
hah we need a mecha made out of exploding housing.
that's one place where dalle3 doesn't
anthropomorphic robot made of exploding houses is crawling out of a television.
sigh.
my channel
anthropomorphic robot made of exploding houses is crawling out of a television.
i have playground, my proteusrundiffusioncute3d merge, and deepblue40 on slash image now on that bot, so it's one of those. 🙂
awesome
looks really good
wow, this is from my merge.
that's actually got it all.
robot/ houses/ exploding/ tv.
this is that same combo but deepblue40 instead of dark art images
it does a really good job too
yeah that's really impressive
my merge , merged with carosello is just nightmare fuel
🤪
i like that one. a nice place to live
so many possibilities with the new idadapter
one i'm finding i like is subtracting the positive embeds, adding the negative embeds, using ease in-out for the first pass
then a img2latent2img2img 1.5x upscale
switching to style transfer for that one, and changing some of the params for the res momentumized sampler
zero clue what i'm doing but it's awesome
image of your endeavors
res momentumized is something absolutely no one is talking about afaik
you've used it, aimingfall has used it a bunch, but i don't know if anyone else has really dug into it?
it's a really special sampler
just throwiing my old images into this
those were the inputs
my cats compete to be the AI Art Cat of the hour
my PC is set up so all the heat blows behind my UW screen
best cat warming station ever
it's really good, just takes time. I think it gives SDE like results but faster, but its all still slower than 2 passes of dpmpp_2m. alone, it messes things up. second pass of 0.5 denoise, and that crappy fast sampler tuns amazing.
hah
i think they're a lot more interesting results
the compositions are different from any other sampler
those are neat
haha
50/50 my merge with deepblue40, another 0.5 of cute3d, and 0.8 of midjourney mimic. it's style overload
merging aetherverse and deepblue with 0.6 towards aether is actually giving some off angle shots for once. and is actually a stone giant, not just a giant, and actually looking down on little people.
this might be a good one for the bot instead of plain deepblue.
wow masking is really nice with ipadapter
you can use masks to mask out stuff you don't want to incorporate now
as well as still using attention masks to specify where things should be generated in the image
oh nice
this was a good prompt. might work for your stuff
astronaut in a faded space suit covered in samurai armor that is adorned with noh era trinkets and meija era pieces. the helmet is covered in sleek shiney black accents with various antennas and cameras.
lots of detail that your stuff could riff off of
used modelmergeblocknumber with juggernaut and carosello which obv is pretty much only illustration outputs
if you knock out input block 1 and output block 1 you don't get that style anymore, you get photographic
so swap in block 1 from juggernaut and you get photographic style
prompt: a shark car
juggernaut
carosello
carosello with blocks 1 from juggernaut
carosello
juggernaut9
juggernaut blocks 1-3 input & output, rest carosello
carosello, then carosello with blocks 1&2 from juggernaut
这里可以生成图像吗
I'm excited to try out any textloras/models with SD3, cause SDXL is pretty decent already with certain methodology
这里可以生成图像吗
Here is the image you requested
Here is the image you requested.
5
1
Here is the image you requested.
Here is the image you requested.
one dog icon
Here is the image you requested.
Wow very neat
pure base model with a slight influence of 3 input images
not as detailed/clean
Very cool. I used to work for an art gallery that had a Japanese noh mask exhibit. There's probably more cool stuff with that keyword
yes, can imagine
nice one👍
Humanoid robot with clear structure
Create a photorealistic image showcasing the 'Fire Guardian' smart fire safety system, based on the AIOT platform. Include a variety of fire sensing devices within a small venue setting like a retail store or office. The devices should be depicted in both close-up, to emphasize their design and technological features, and in use within the setting, illustrating their integration into a comprehensive monitoring and alert system. The composition should also hint at a user interface or data visualization that represents real-time monitoring and analysis. The lighting should be clear and bright, emphasizing the system's reliability and the safety it provides. Highlight the system's ability to safeguard life and property through advanced technology.
pure base model with a slight influence of 3 input images
cats
生成一张树叶
Here's the image you requested
Here's the image you requested
Is SD3 out yet?
that is a gif of a dog
^
create an image of a cat playing a recorder
Beautiful pixel art of a Wizard with hovering text 'Achievement unlocked: Diffusion models can spell now
cat
Two huge monsters fighting each other brutally
No
lol
Yeah but it looks like a dog looking for soemthing. So yeah... funny
Image
Lots of colourful balloons in the sky
Here is the image you requested.
Politics in a nutshell.
美女
Here is the image you requested.
Whay can;t I use Playground 2.5 with Fooocus? Isn;t it a sdxl model? It's sposed to work yes?
@glass forge It's an sdxl architecture, yes, but it uses a different noise scheduling method. EDM I think. Comfy has since it came out. Foooocus hasn't implemented it yet
cat
who's gonna clean up that mess?
Here's the image you requested
Wow
Prompt was : Exploded cat (by nychos-(by Van Gogh:1.5))
finally sitting down to do some methodical tests with the new ipadapter since we have no documentation still
Wow that's a really good prompt
holy shit
crazy
results of round one testing: all "Combine Embeds" methods are commutative except subtract.
Wow really good
Try this one quickly : Exploded cat (by nychos-(by Van Gogh:2))
clipvision doesn't seem to have any effect on the output so i'm not sure why that input exists... hm
(Shark by Zdzislaw beksinski:0.5), (shark by nychos:0.5)
not sure what the neg embeddings do...
with concat, the answer seems to be nothing
character (by Zdzislaw Beksinski-by Esao Andrews)
Made with sdxl turbo
I think I've seen your post on reddit
Yess thats me
You applied image to image and image to video ?
ot
it's an animatediff thing
nice images
Reminds me of liminal spaces, i love em
i love the concept of liminility and feel it's been hijacked by backrooms memes. The idea of thresh holds and gates entering new spaces. You've probably felt it when you enter a room and something switches in your mind. not only a new physical space but a new head space at all. even goes back to ancient times. Towns would have gates. Chinese would have lion sculptures guarding their gates. very cool stuff. Gateways.
TIL that theres a specific LCM for animatediff. very neat.
Yes, I generated this image with sdxl in order to make a liminal space
heyyyy i got it working in automatic1111 animatediff extension. 4 step animations neato!
My best is this one
Wow those are amazing
theres a lot of panic about the coming eclipse that'll hit eastern north america. i think it'll line up with the devil's comet being in the sky too. oOoOooOoooo spooky
solar eclipse instead of a lunar but 'm still reminded of it
Yess thats terrifyng ambiance
My technique is to start my prompt with “a photo taken in” and use the key words “urban horizon”
Amazing it's surreal
reminds me of control
@rich olive Try the prompt: "A photo with a lot of image noise and visual artifacts taken from the top of a building, night sky, clouds, moon, urban horizon, dark nothingness"
With sdxl base model only
having some difficulty with the i2v model in a1111 . looks like that's not supported as well
Base SDXL is powerful
The render is amazing
Saving
With better hands
In the future, rather than trying to wrangle seeds and RNG, just Marigold a depth map out of your own hand, takes like 60 seconds
LETS GOO Comfyui node
prompt: a cat watching tv. superprompt: a cat watching tv, 8k, masterpiece, in the style of greg rutkowski
honestly it's all about length. giant prompts don't matter all that much if they can't also restrict themselves to stay inside the 75 token limit. i know the really big models have that issue.
hopefully they put in logic in the node to force the issue.
yeah
It's quite dumb but it's fun
a fluffy orange cat sits on a windowsill, its paw resting on the television. Its t-shirt is folded neatly around its body as it watches the latest scene of a bustling city skyline. The cat's eyes are fixed intently on a tv set in front of it, and its tail wags excitedly as it watches the tv with curiosity.
certainly gets the job done.
yeah i'd say that works fine.
and that prompt was 74 out of 75 tokens. 🙂
Yeah I always trimmed my superprompt prompts to 75
But with SD3 we don't have to trim anything 😌
I hope so. I heard lykon say sd3 is 512 tokens, and then someone else said it's still actually 75 and deals with them in the same way as sdxl where they all just get averaged. I have t read the paper on it
Well if T5 applies conditioning to the image separately then yeah it's 512 Tokens, even if there might be a fade off towards the end of that number when it comes to efficiency
I mean even if it only doubles or triples the context length we'd be happy
especially if it actually listens
there's already not much point in hitting 75 now
you hit 15 and it's often already ignoring stuff
Most of my SD3 test prompts are hovering about ~50 to ~120 tokens
idk how to count prompts though so I'm just doing gpt2 tokenization
well, this is an interesting new interpretation of "supermassive black hole in a city park"
This apparently(?) just snuggly fits into the prompt limit, no idea though.
Old analog film photo of a giant room with a big monitor in the center. The monitor has a giant head in it with an ominous stare whilst speaking. There are massive rows of chairs with people in them, mindlessly watching the giant head. There is a caption for what the head is speaking on the bottom: "No Fun Allowed"
And the rest are just longer and more complicatd
(It's a 1984 meme described so that any intelligent image generator like Ideogram can do it)
Ideogram did it with flying colours, so I suppose SD3 will do just fine with this much
Festivalman could easily do this with a simple regional prompt
(Big giant monitor with head in it on TOP grid and rows of people watching on the BOTTOM grid)
interesting, SDXL almost did it actually
with no regional prompting
with regional prompting
That's very good
ipadapter shines even more with diff diff
really improves consistency if that's the goal
I think this is one of those times where regional doesn't do much for you. Base works well enough, other than getting that text subtitle on the screen which sd3 would probably do really well with. I tried throwing it at my regional script, but I think it understands big monitor as just one that's close to you.
maybe something like projector screen instead of monitor?
these are great
Hah, zoom out. Zoom out more.
oh no nipple!!!! 🤯
An old, grainy analog film photo of a colossal screen on a front wall, in the style of a dystopian movie still ADDCOL The ominous, enormous head of a man projected on the screen, in the style of a dystopian movie still ADDCOL The giant head staring straight ahead, its mouth moving as it speaks, in the style of a dystopian movie still ADDROW Countless rows of chairs filled with people facing the screen, in the style of a dystopian movie still ADDCOL The people sitting motionless, their eyes fixed on the giant head, in the style of a dystopian movie still ADDCOL A caption at the bottom of the screen reading 'No Fun Allowed', in the style of a dystopian movie still
I changed up a bunch of words like "in the front of the room"
that is SO epic
got timed out for 12 hours a few weeks ago cuz i posted one on my phone and didn't notice you could see nips poking through the black dress (some bat freak girl i made in response to someone who was spamming prompts)
💀
just the shape alone was enough, apparently
makes sense, you are a massive criminal
lol

i don't even know which i like better
this is part of the fun with this diff dif workflow
you get five shots to have an interesting output with each run
darkarts works really, really well with diff diff
deer
he want a huge
bro im chair 💀
Emily a beautiful woman 30 years old, long brown hair, light eyes, frontal portrait
have you heard the good news
I asked for a vampire in red... 😆
these head wings keep haunting my images
giger
Here is the image you requested.
A polaroid photograph of modern day jesus christ hunting for easter eggs while holding a basket radiates golden light
Here is the image you requested @copper kraken
Here is the image you requested.
hahaha so good
that regional prompt feature is great
yeah, even with the regional stuff doesn't quit hit, the regular often does with the same prompt.
@copper kraken For the ones that didn't, I tried running it manually on sd through lots of seeds and about 1 in 5 was perfect, and the rest were just a mess. Not really sure if that's fixable or not. we obviously don't want to wait for lots of images to generate. (or at least I don't)
yeah def don't want to wait forever
i usually do small batches (if any) cuz it's often better to change parameters than to get extra dice rolls
but, that is also without the benefits of llm enhanced prompts
i'm starting to think i need to post this workflow on civitai
i'm enjoying how it's producing such wildly chaotic images while becoming increasingly chaotic itself
definitely.
hah I want to see the comment section on that.
i need to somehow ensure that no nodes are just sitting there
must ensure all nodes are essential
the heart of the action
you don't need to generate images, that's already art 
very cool!
dare you to try the embedded workflow 😄
with a diff prompt, but everything else the same
interesting world, lots of details to explore
Rookie numbers compared to some of the blueprints I've seen in UE4 and 5 lol...
Where people don't know the sacred art of creating functions or macros
This is scary and impressive but does it really yield better results than Foooocus?
really weird
it doesn't make sense
Oh my god yes
Wipes the floor with it
thats me in the chair
APPLE
@uncut steeple @smoky patrol Scams ^ (leads to an incorrect link)
Thank you!
So what tool are you using for your images? I assume it's not just txt2img, but some kind of regional painting or some such? Clearly it's something I need to start using. 🙂
Pls where are u guys generating these images from
I find it difficult
from our GPUs
Okay
But it's not in this server
no
not gore. Just a bit much
It´s simply txt2img along SDXL´s base model and in this case 1 step Refiner (which I usually refrain from using) on top 🙂
Wow ok neat
playing with img2img as well, mostly for variations of an image or simply as a creative tool, in this case it's prompting only though. Haven't yet touched IPAdapter and regional yet. Next on list 🙂
I'm impressed at what you're able to get on screen straight. I've been working on regional prompt stuff a lot.
Thank you 🙂 Regional prompting I heard about not that long ago, then I just recently learned it´s about regions of the image while before I thought of it potentially being naming actual regions of the world like countries for region-specific outcomes 😄
lol that would be pretty funny. Yeah check out the rpg diffusion extension for a1111 that works with the regional prompter extension. Does amazing things when it gets it right.
It's scary, it's great
Having just started using Comfy more intensely and Easy Diffusion otherwise, not too much of a fan of A1111 in terms of interface, I feel like I´m gonna experiment with the approach of how I think it works for starters, thank you anyway 🙂
Yeah I only use a1111 because nothing beats its regional prompter extension for text to image ability. Clownshark does it via images in comfy, but the text method allows for llms to do cool stuff for you
yes, can imagine it being easier, then I like the manual approach experiment for its nature anyway 🙂
I mean, it's super easy implementing that in comfyui same way. I just think using images is more precise most of the time
it's definitely more convenient than the one in comfyui
can't say i've explored either in a great deal of depth - afaik, the comfyui one is more powerful in the end, but you obv either need a workflow thrown your way or to spend a bunch of time making one, and comfyui's api is a lot less friendly from what i'm hearing
certainly more work to tweak than ADDCOL etc
comfyui could really use a node that can efficiently generate mask grids etc without it turning into a big chunk of the workflow
and a regional prompter node that can actually take those as inputs or something like that
Holy shit...the focal distance on this one...this looks like it's legitimately floating in front of me.
Oddly enough, the thing I find most disturbing in this image is the fact that he has a button-up collar on a shirt with no buttons.
it's always the subtle things that make a creepy scene so unsettling
Yeah...exactly. Not phased by the makeup or anything else...it was the shirt.
there's probably a subreddit for that, fashion horror
Darth Haus?
He's got this crazy knack for figuring out exactly what totally rare force choke you need.
I updated my InstantID, and produced this "Splendour" SDXL Turbo/Pearl Mountain Collage Maker
dunno, I don't think that these things turn out as big workflows. It's just that the workflows people publish are unnecessary huge, because they add too many features to them.
I would say ComfyUI needs more modularity, a way to easily share and save parts of a workflow
That would be really handy too
Save and share functions/group nodes
You could just build group node components and share them 🤔
I guess I mean something through the manager but yeah
Components you can just re-use in every workflow
Yeah
Not a big deal to me anywaym
Im not afraid to go wild making workflows that imitate my art lol
@river stratus Goku in his super sayen form
Goku became tentacled
Doesn't this already exist?
kinda, guess i had some vaguely diff in mind at the time
With comfyui manager you can pack your group nodes. I guess you could share those packs if you want.
Yes, there's a Component Builder that allows you to do this.
yeah, i guess i meant more share them as separate subworkflows that can be neatly opened and closed in a separate window without ungrouping the entire thing into a mess all over the current workflow
The on-site background wall design of the headphone exhibition, with the headphone products as the main focus, and the overall yellow color tone.
Here is the image you requested.
Here is the image you requested
composition idapter
kiss a week of your life goodbye now that you've also discovered how insane that update is lol
i have no idea what you're talking about. i can stop whenever i want.
will it do SD3 or is it still stubborn about a number following a letter?
still stubborn... lost most of my men but i'm winning the war
oooh that's cool
can get it to be a bit more subtle by skipping the first few steps with the controlnet, then hammering it hard after
the qrcodemonster with sdxl is not anywhere near as good as with sd15... with sd15, i wouldn't need ipadapter to help
this does come with the advantage though of some control over what goes where
Still waiting on the next version of SDXL:
SEXL
@copper kraken
i don't know if it's ipadapter or just this art model, but it does text really well (click the second one)
that was the source image on the left?
yeah moofi posted that one
that was just "movie poster stability ai" obviously it's using what's after the movie poster as part of the image basis
think you might've gotten lucky
like i did with my MEAT BAG image a few days ago
it's generally not spitting out any text with "CLOWNSHARK"
did get this one though hm
Am I also to assume that your pfp is also stable?
Can you share the details on how you gend that?
ok, best shark feet ever
true