#✨|sdxl
1 messages · Page 167 of 1
generate a logo from the letters JO
Zoom from eye into the brainnnnn - https://twitter.com/HikariUchu/status/1764705840240746659
Hi!
What is the difference between Base and refiner?
you use base and you don't use refiner
Why?
it is a model that was never upgraded, most civitai models will tell you they don't need refiner
in my experience it is mostly messed up the generations adding crap
but it works in the final steps of the generation, you can set the steps it takes
I see, thank you very much!
I think Base is enough for a start.
refiner is supposed to improve fine details on the image. Its trained on high quality images and only at the low-noise timesteps. Anyways, most custom trained models nowadays are better in doing fine details than using base+refiner, so you can ignore that

sorry for the answer I was trying to be edge. I did some good generations with the refiner but mostly it killed details. At low steps it just kind of messed up what was there, at more steps it killed details and replaced by some "high quality" texture, but it didn't know how to make real detail. So it gave some soft saturated kind of image. You can try it, or just download some of the highest rated SD XL model on civitaai
imagine a rat
Imagine thinking you could just give a command and art would appear without reading around the server.
😆
Guys, how to get download this file " DPM++ SDE Karras"? I not find this in Civitai.
I not have this
I decided to install Forge for Stable Diffusion, I'm testing things out here. It's a little different for those who are used to ComfyUI Nodes
Human skull looking forward, drawing, black and white, white background, 2d, no cracks, flat
Any tips would be appreciated.
dpmpp_sde is the sampler. Karras is the scheduler.
Hi all!
I am learning to finetune a sd model via dreambooth. I just need a small dataset for getting through basics. If anyone has any resource where I can find small image datasets of the same object, pls share them
Ex: 25 images of the same animal/thing/place
"Generate an evocative and poignant image of the legendary composer Karlheinz Stockhausen crafting his final musical note in his iconic studio. Capture the atmosphere of this momentous occasion, blending the essence of Stockhausen's avant-garde spirit with the emotional resonance of bidding farewell to a lifelong creative journey. Use your artistic prowess to convey the significance of this last note, encapsulating the fusion of innovation and reflection in the twilight of a brilliant career.""
Design a modern and minimalist coffee shop logo with the English letters ‘US’ as the central element.”
Color and Emo
Becoming warmer
all three are good kittens
reasons like this i don't know why people hype comfy so hard as "betteR". A smidgeon faster on a few hardware configurations, but worse UX than a hacked together gradio
you have to do a ton of homework to use comfy. same with automatic but the interface is more intuitive and doesn't do shitty abbreviations like pp for ++
if you can even call that junk an abbr
It's an abbreviation used elsewhere and it's usually because you can't use special characters for some reason.
Comfy is fine, but it could do with better and clearer documentation, especially for making nodes.
I use it because it's nearly always got the latest stuff on it, either natively or via nodes. Auto is always fairly far behind. Maybe that will change with Forge, as that seems to have quicker development, but I've not used it or A1111 in ages.
gets things first because the node architecture allows coders to do less work. i like that . gets tech out faster. now theres the unet patcher in forge. layer diffusion coming there first is a sign of the tides
comfy put himself in a position. can't copy the unet patcher without looking like he's copying forge. after accusing it of copying comfyui. don't expect it'll get a patcher architecture for nodes anytime soon
in the spirit of open source it would be nice if everyone just copied each other, was open about it, and encouraged it
then everyone wins
Or maybe.... the author of the layer diffusion paper also made forge...... (and has a bit of a big ego so won't create the library for other UIs, but meh no one seems to do, stupid fragmentation, why can't everyone just agree on huggingface diffusers as a base)
lllyasviel and LayerDiffusion accounts are the same ? Got evidence of that allegation?
I don't think there is a paper for layerdiffusion either
Diffusers is the worst format i've ever dealt with . lots of issues. why agree on that format that already is one of the worst for compatibility?
even docx had more compatibility
if you ask me though, ego's a little deserved when you're the author of the original controlnet models.
I don't even think illy is the one with the ego. Clear to me it's comfydev
while ego may be deserved a little there too
Well, there is. https://arxiv.org/abs/2402.17113. Lvmin Zhang = controlnet author (https://arxiv.org/abs/2302.05543) = forge dev
We present ControlNet, a neural network architecture to add spatial conditioning controls to large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large diffusion models, and reuses their deep and robust encoding layers pretrained with billions of images as a strong backbone to learn a diverse set of conditional...
And no, you can't just rip the core code from another opensource project, then cast shade on said project and happily announce you don't use it 🤡
of course, license permits it as long as you atribute what goes on in someone head to say "not based on comfy" while actually reusing all comfy code, it's weird at best, malicious at worst
comfydev created such an air of toxicity aroudn forge. so disapointing
stability should over rule him and recognize the contributions of those authors to the community loud and clear. shut everyone up and muzzle comfy on the issue
egos getting out of control and need to be leashed
actually, had a brainfart, comfy being gpl, means any project using it must attribute the authors (comfy). so what happens in forge reusing the code like this is worse
yeah you're blocked. clearly can't be reasonable. " it's gpl so you CANT copy it!" what?
but meh, in my experience the whole ecosytem is a bit messy, let's just think of it as growing pains
its so much worse than growing pains. It's a toxic direction being steam rolled by highly recognized representatives of Stability AI.
Leads to community members with clown hot taks
it's a copyleft license... so yeah, you can't copy it without attribution and keeping the new code under the same license
then @visual glade should take him to court. is he serious about it or just memeing? pretty sure licenses are attributed too
court for what? sue for what? waste of time and money for both parties
guy here is claiming forge doesn't attribute licenses
I have the feeling you enjoy this soap opera and "air of toxicity" way too much
naw. i hate it. want to see comfy get it together and be more FOSS spirited
forge is a hackjob that just took comfy code and slapped the a1111 interface on top
I wouldn't have any issues with that but he went and denied it
Emad needs to muzzle you lol. You got your fanboy entourage going after people recommending forge on forums tehse days.
we're in a post "linus quit linux to work on diplomacy" world
You talk about "air of toxicity" and just insult everyone. Maybe thinking about a muzzle yourself? Or just dial it down a notch
Can't confront toxic topics without hurting some feels. Medicine shouldn't taste good. Poppins got it wrong.
the problem is already that you think what you say is "medicine"
Gaslighting. That's a cool way to approach it. Abusers protect abusers i guess. same ol story.
The future will continue as i predicted. The small niche of comfy users will continue "punching down" from their faux position of elitism, enabled by culture leaders.
🙄
you are right, ego is the problem here 😉
maybe just switch the topic. The paper for SD3 came out today
seems like the T5 text encoder is optional. So you can use it for complex prompts, but you can also just use CLIP. Should help a lot, in particular you could leave out T5 for inpainting, upscaling and so on which should save a lot of time and vram
the modularity is a good idea
i just hope the modules are named appropriately instead of the first stage being c and the last being a
now if you could still use a lora trained on a smaller model with a bigger one (just losing some ability to precisely reproduce the training set) we're in biz
i hope the smallest model is named Y, the middle size one is named X, and the smallest one named P
parameter count would cause issue there potentially but we got xadapter tech now a days
I would say that's unlikely, although not impossible. But I guess the loras won't be compatible
i'm sure the numerous models will be impossible to understand how to load in an intuitive manner, but i can hope for a positive future
Hey which sdxl tool/model for placing a specific clothing product on a person/multiple differet people? As well as seeing it in different poses.
For example, given the (1) mockup image --> getting (2), (3), (4)
/jeune fille sur une balancoire
cinematic film still of a Beautiful Rococo Princess sitting on the Japanese metro, elaborate gown, massive curly red rococo hair, head and shoulders portrait, pretty eyes. Crowded metro with onlookers
?
what?
To infinity and beyond! - https://twitter.com/HikariUchu/status/1765399437264998472
did you promise to someone that this was possible?
0.8 of strength seems fine. No special prompt (see samples for good examples). "The Pulp Session" evokes a scene steeped in the bold and dramatic f...
that looks really great! nice work
thank you very much
It should be used with a strength of 1.0. No specific prompt is necessary, but you can refer to the examples to achieve drawings with more or less ...
you've been busy 🙂 I really enjoyed the work you've been posting over the last couple of months. Keep it up :]
doing in the night when family sleeps......
的
办公桌子
这满是脚气的鞋子是怎么做出来的?很牛的样子。
So will all SD1.5 and SDXL xheckpoints and promt method be totally obsolete in 3 months with SD3?
i need help
NansException: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try setting the "Upcast cross attention layer to float32" option in Settings > Stable Diffusion or using the --no-half commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check.
what should i do to fix it
Hi, have prompt or Lora suggestions to limit the number of colors or shades? Eg. I want simple images, with as less color shades as possible in order to be able to easily vectorize
Why not just prompt for monochrome images?
that may be an option, however I would like to have color images, just somehow get ride of the shades/gradients
Hey, sorry I'm new here,but ,Can I generate image in discord if yes where and if no,where should i ?
A breath of fresh air, though...someone actually asking before just lobbing random thirst prompts into the chat channels. Good on you.
Most of us run it locally but there are lots of online generators! I don't know what they are though sorry
I bring to you "Loathsome XL" to bring cheer to your life. cough Feeling generous? https://www.buymeacoffee.com/generalawareness
jesus and lucifer fighting , photorealistic
Here is the image you requested.
resting dog
Here is the image you requested.
Good Bot.
On the next episode of Naruto....
Hi there. Is stable-diffusion-xl-1024-v1-0 that I'm using with your dreamstudio/api equal to stabilityai/stable-diffusion-xl-base-1.0 from huggingface? I'm trying to get same results with your api and local build but images are different (same seed, prompt, negative prompt, cfg, checked different samplers).
iridocyclitis
made a goofy ahh image a while ago
Please make it so we can "lock" certain elements of a generation and reuse them... characters objects backgrounds...
Can't you do that already?
how?
most directly, inpainting or outpainting, less directly, ipadapter, clipvision with cascade, or training a lora or embedding
say you want a movie scene with 2 characters talking in a room. how do you lock each character so you cna generate them from different angles and how do you lock the room and the furniture in it so you can film it from different angles? coz if this was doable then you cna generate all this and put it in a motion video ai thing and hopefully promt up different areas like "lift hand" "walk here" etc and add lip-sync music and thats a movie in essence lol
more or less
small discrepancies could be ok , real movies have continuity flaws too...
maybe in 6 months ? a year? or maybe doable already idk with this AI stuff it moves so fast
you can generate up amazing things, almost anything but you cant make it move... no stories, just sad still images..
Hey question
if im making a lora is it normal to only have 300 steps for a sdxl model with 100 images
and it taking 4 hours to do so
given its doing high quality over quantity
Real nice. Very Patrick Nagel to my eyes.
madness
Does anyone know how to get A1111's break to work in comfyui
Cowboy. Use from 0.7 to 1.1 weight depending on the model. Some prompts refuse to become a full on cowboy but will wear the leather. Hint. Feeling ...
regional?
hah that's impressive.
It's using the base Tempest checkpoint.
wow the clarity of this, since it can render at 1920x1088 natively is really impressive.
wow. incredible
Yeah...that first bat pic I put up above was pretty sick
i'm amazed that it did it without subject detail bleeding
no kidding
Time to test something...
yeah im running through my promtps
it looks like it doubles every now and then, but for the most part it's good
heh
can get some crazy shit by cutting down the early steps
this was ksamp adv steps = 20, run 0-5
then ksamp adv steps = 40, run 10-40
so fraction of denoising was the same but denoise rate per step went way up
I don't think puny man has a chance, but boy does he have hutzpah.
A Mercedes luxury sports car. Open top. Red
||A Mercedes luxury sports car. Open top. Red||
||A Mercedes luxury sports car. Open top. Red||
||A Mercedes luxury sports car. Open top. Red||
A Mercedes luxury sports car. Open top. Red
||A Mercedes luxury sports car. Open top. Red||
||A Mercedes luxury sports car. Open top. Red||
A Mercedes luxury sports car. Open top. Red
||A Mercedes luxury sports car. By the sea||
A Mercedes luxury sports car. By the sea
||A Mercedes luxury sports car. By the sea||
lol
Here is the image you requested:
a ControlNet tile model for SD XL https://civitai.com/models/330313/tplanetsdxlcontrolnettilerealisticv1
Thanks for your attention. contact me if you want, discord with "ttplanet", Civitai with "ttplanet" you can also join the group discussion with QQ ...
still testing it
Bumblebee has sure grown up.
he went back in time to defeat megacybershocktron
really amazing. did you have to region prompt for the mirror?
this was differential diffusion
posted a few more in show n tell
the mirror was indeed in the prompt
holy cow that joe87. those images where he's got rule of thirds going with the man and the dogs. I can't believe I haven't been sitting doing regional prompting 100% of the time. instead just sitting here getting angry at sd...
I could have been doing this the whole time. 🙂
lol
seriously this darkimages model is better than most I've seen.
it's fn amazing
it's really versatile too
though i'll admit i'm hesistant when i hear versatile
the high res nature and incredible small details. i keep finding stuff in the images.
that word to me screams jack of all trades too often, overtrained
but this one is flexible
wow that's really cool.. i did learn 2 things tonight concerning this stuff. the 75 token limit still applies. I was just putting whole giant prompts in each section and it was dropping stuff. when i dialed it back, things reappeared. second, I was finally able to get the mecha to raise his fast and grab something by dedicating a whole column to just "closeup of mecha fist"
awesome!
yeah i kinda figured that'd be the case... i assume that applies even if the regions don't overlap
my guess is you'd need the two samplers node to get away with 150
(i often have these massive prompts that are way overflowing so i can cut and paste stuff back up at the top... my sloppy way of having crap ready to fire off)
that's an interesting trick... having them rendered completely separately to get around the limit.
yeah, separate samplers
with differential diffusion you can go way over the limit
cuz you're using separate samplers over and over
now what i'm curious about too... this will have to be a tomorrow thing
is what if you use regional prompts WITH differential diffusion? will that allow differential to be even more coherent in terms of the final image cuz it knows what's supposed to be outside of its masked region? idk
darkarts + juggernaut merge
right now I'm just using a1111 with regional prompter. creating rows and columns on the fly.
Although I'm blown away by your work, I don't think I can keep all those comfy nodes straight when it gets to that level of complication.
hah man, who needs sd3
I'll prolly be doing that soon
Forge on my phone is a great way to ensure my SD induced insomnia can continue in the luxury of being horizontal instead of vertical lol
yeah, i'm at the point where I need to make sure i CANT hit it from my phone. losing sleep (like right now) is the biggest issue.
Lol
i'm looking through that show and tell thread. incredible stuff. none of that is ever shown on the stable diffusion subreddit. I made a post a couple months back on how regional prompter is the key to everything, but even then I had no real idea what it was capable of.
Guessing it didn't get much attention?
Subs always totally suck compared to affiliated discords IME
All the ppl who are hardcore about whatever it is will be on discord and so the best content always accumulates there
I think the people on there are the general people, running off a 3060 or less, only willing to use a1111 in its base configuration with probably 1-2 models, if I can't do it with straight prompting, then that's just the end of it.
Yep
I'm not surprised but I'm still impressed how many ppl here have the 24gb vram
when sd3 was announced, 90% of the conversation was "will it run on my machine from the 80's"
A lot more life on this discord than most of the others I've hopped on
hah soul is just posting pictures of his night activities as a way to tell everyone where he hid the body
much better merge
mohawk instead of juggernaut
mohawk can gcet pretty creative
now every time i look through the samples section of a model, i look to see who probably regionally prompted. this was on the mohawk one
oh the latent noise lol
cascade has given me an eagle eye for that
kinda wish i didn't have that now
What is regional prompter?
That's something completely knew to me, so it allows to have control over composition
yep, it's amazing
differential diffusion is what i'm doing with these recent examples
this was regional prompt
but it's not that complicated.. you just start with adding 2 columns, then rows, then more columns, until you have something neat.
It has a certain vqgan+clip vibe with all the stuff thats happening, if you know what i mean.
more regional prompt
anddd back to diff diffusion
that's 80% darkarts 20% albedobaseXL2, just simple merge node
That's... insane
now that ollama-webui has built in text to image, I can check the promtps before moving them into a1111 or comfy. and it lets me say "change it to a mall" or "add more beach balls" to shape it
That's fn awesome
What set of 1111 bleeding edge plug-ins do you use?
a dog
Also, which SD model?
regional prompter, reactor for face swap. sometimes controlnet. I've been using a lot of the clipvision stuff that batwing has been making workflows for in comfy
Dark Arts Images THIS MODEL IS NOT FOR EVERYONE. READ BELOW. Any liability arising from the improper or illegal use of this model is the sole respo...
I see
this is the new hotness for the day.
it has an really impressive amount of detail, and I like action shots, which it does better than many
tomorow it'll be something else. 🙂
Cool thanks
Here is the image you requested.
Here is the image you requested
I'm sitting here trying to recreate this ideogram image of the cat in the plane. aaaaaand how it's going.
Lol figures
That's something I feel requires in painting when we're talking sdxl
Maybe fenrisxl would work
When it doesn't work, man... Some of those damn prompt fails have eaten up an entire evening
Then I look back and wonder why i even cared about that image and realize it was only because I couldn't do it
exactly. i feel like i'm making progress.
it's funny though. when ideogram first came out, i put every thought and idea I had into it for the first few days. one and done, i'd put it in, and it would do it immediately. no countless generations to get it right. I realized I wasn't spending any time with this stuff anymore because I always got what I wanted immediately. Then I started realizing that the image quality is significantly below sdxl. and i eventually ended up back here.
which kind of goes against what I said a week or so ago.. that prompt adherence is king. it is, but I want it all.
Yeah, it is... Kinda
Ultimately there's shit like Photoshop and firefly
The ultimate I think is gonna be a bit of both
create a dot
Here is the image you requested.
Lol
ok, gonna call it done. looks way better than the ideogram one anyway.
A Chinese woman wearing a sexy bikini,by the drive
Here is the image you requested
I think I've determined that the badbot paired with that dark images model is a match made in heaven. the image it just made... jesus
It did this one perfectly, with amazing detail: Surrealist, darkly lit scene featuring a bloated, disheveled man with flushed face and bloodshot eyes, clutching an empty bottle in his clammy hand, surrounded by broken shards of glass and stale cigarette butts.
Man you got me curious now lol
Amazing holy crap haha
yeah gonna delete that one, don't want to get canned before sd3. 🙂
Lol
In this Impressionist-inspired depiction, a powerful Chinese woman radiates confidence and grace as she strolls down a bustling city drive, bathed in soft sunlight. Dressed elegantly in business casual attire, she embodies strength and self-assurance while cradling an endearing litter of kittens close to her chest. Her love for these feline companions shines through, highlighting the tender side of this formidable woman who seamlessly balances ambition with compassion.
an image even Google Gemini could make.
Self-assurance lmao
Here is the image you requested.
hah the one on the right looks like one of Naruto's enemies
ok, i'll see ya guys later. ran out of steam
@gloomy lark caroselloXL gamma is another very good checkpoint so far
? Do you have an answer or not, idk what you saying. I just asked a question
The orange cat is wearing a bachelor's uniform, standing hopefully in the middle of the campus, holding a graduation certificate in his hand, with a joyful graduation ceremony in the background
any advice to get photorealistic stuff a lil more smooth
like it sometimes look to sharp for me
use euler sampler.
like here
the SDE samplers are better with skin texture, the euler ones are worse.
so they'll be more rendered looking
yeah, same line though.
kk
i might try, but it only happens a few time
i like skin texture, but sometimes it looks oversharpen
thats more my problem
i dont mean this, nono, i want fine skin details 😄
its more like some fotos get too sharp, too much contrast maybe
oh, then use dpmpp_SDE if you want the best details. it's also the slowest sampler though, so people often go to the turbo models which take less steps of dpmpp_SDE
what CFG number are you using? if images are too high contrast or blown out, lowering the CFG can sometimes help.
i feel like its hard for the AI if u have a light (ice for example) background
i get way better result if i prompt dark room or something like that
different models have different CFG tolerance. some will get blown out at 10, some at 30
so if you drop the CFG to 5 that might help
i tried several guess its my prompt atm
and in general always struggle when i prompt something with ice / snow
porque no los dos. 🙂
No keyword. 1.0 is good. The "A Plastic World" style is characterized by 3D animated characters that appear almost tangible, akin to living sculptu...
This could be super cool to translate to svd1.1 to add motion and speaking. Nice!!
Skilled archer, bow and quiver of arrows, standing in forest clearing, intense, detailed, high detail, portrait
Skilled archer, bow and quiver of arrows, standing in forest clearing, intense, detailed, high detail, portrait
/Skilled archer, bow and quiver of arrows, standing in forest clearing, intense, detailed, high detail, portrait
Here is the image you requested.
This is a basement of Qingdao Ocean University in 1950, The owner of this room is Professor Wen Shengchang, There are a large number of books and materials in the room, 16:9 horizontal composition, 4K resolution, wide shot, panorama, high details, 4K --stylize 0 --v 6.0 --ar 1:1
redraw the picture with 90 percent similar
Here is the image you requested
I smoked so much spice I learned how to fold space.
nice stuff!
I present to you, Artistic Grandeur for SDXL. This does not require an activation word to use, BUT if you give it the word "dystopia" without the q...
you don't want to mess with the boxing cat, he's just all fist
if I ask it to punch, it won't. i just have to say boxing match and just regen until the seed does what I want.
have you gotten it even with seed hunting?
seems like a good time for ipadapter maybe
yeah.. the ability to do things where the subjects aren't upright, like an upside down face isn't possible in sdxl. I'm running into that. trying to get animals falling into a vortex, but it all just looks like they're dancing in the sky.
as you said, i'd have to controlnet it or something
Loras have issues with that and that's proof you're right imo
If you denoise an upside down face... Yikes
dark arts images model + cute 3d render lora makes a good combo
lol great idea
that is such a great checkpoint
what i really like to is instead of tending toward soft images at times ilke most models
it has a tendency toward grittiness
yeah the detail that comes out, even before upscaling is incredible
@copper kraken you see this one on their sample page? from clownshark, countless renders burst forth https://civitai.com/images/7420332
i'm really liking that other illustration checkpoint ig ot two nights ago
3d render lora takes the edge off. 🙂 which illustration one?
ILLUSTRATIVE NEGATIVE: (Photography, raw format, photo:1.1), (Realistic, Photorealistic, Photographic:1.1), (canvas frame, watermark, signature, us...
prolly the best one i've used yet
almost everything i've thrown at it has given really great stuff
https://civitai.com/user/andreac75 this person posts on here as well, and they made the cute 3d render lora. they've also got some interesting illustrative ones
i really like the look of some of those... gonna have to check them out!
dark arts vs. carousel. purple one is carousel
carosello?
technically neon chemicals is in the prompt, which the purple is probably more adhering on
interesting how similar the outputs are there
yeah carosello
so this is the carosello one with the cute 3d render lora, really brings in the qualities of her face and hands, without going overboard on the cutesy part of things.
it does, that's interesting
Paths in ancient forests, ancient symbols --auto --s2-imagine -
darkarts and carosello working together
although you have to spice it up with something in the scene
the mask for diff diff for darkarts, the inverse used for carosello stage
hah the roses are an interesting addition
I love seeing the engine trying to make sense of it. the mechanical tendrils into the snak
yep really interesting to watch
darkarts
weren't those the enemies in the aquaman movies?
google images. 🙂 apparently none of these models knows black manta so SD wasn't helpful
/prompt Tang Dynasty style icon design
Here is the image you requested.
/prompt Tang Dynasty style icon design 2D
yeah this bad image bot prompter thing definitely requires a cleanup in aisle 10
dark art images
that prompter and that model are made for each other
"Post-Apocalyptic Expressionist" depiction of a grotesque, deformed character covered in filth and grime, illuminated by a sickly green neon light amidst a desolate wasteland littered with debris. A scorched and rusty sign in the distance reads "Welcome to Tang Dynasty."
lol the prompt on that is drippin with sarcasm at the end
Here is the image you requested.
hahahah
in all seriousness, if you want icons or 2d art like icons, midjourney is really good at them, probably the best
do not taunt happy fun ball
i dunno man, that one at the bottom got pretty close before a trapezoidal mask veered it away
verified, no nip slip!! lol
wild as f image
obv some issues in the lower middle part with some incomplete merging of the image but yeah
that thing at the center, the "head" was a giant shark bat
so strangely enough, I put emad's tweet about there not being another major text to image model after sd3 into the prompt generator and it made this.
i've always loved this kinda surrealistic art
yep...
i just noticed that
i blinked it and it went from 1 to 3:30
totally in the zone here
img2img
hah yeah i saw that yesterday. the left looks like a north korean propaganda poster
good point, let's see what it does with this prompt
"Neo-Surrealist" depiction: A luminous, otherworldly creature emerges from a swirling vortex of digital colors, its ethereal form shimmering in the electric glow. A massive neural network sprawls behind it, adorned with intricate patterns and glowing nodes that seem to pulse in sync with the cosmic energy. In the distance, towering structures of abstract geometric shapes pierce through the surreal skyline, casting long shadows on the bustling cityscape below.
holy hell that is great
i really like the compositions it generates
def listens to prompts better than most
this one is dark arts
i've found in genearl the anime models are pretty good there
same seed
also spectacular^
I just tried this on a couple other models and it doesn't make the "creature" part of it, so these 2 are definitely more prompt adhering.
yeah
what i'd like is a comfy node that would allow me to just dump a list of my models from the terminal and copy paste them in then gen the xy plot
no way i'm going to be bothered clicking through 100+ of them on the efficiency node one
so these are ideogram and MJ for comparison. SD does a better job in a lot of cases.
no kidding
yeah I can do that with my scripting, where I have an sdxl comyui api json, then I convert that to a powershell object, change the model name, convert it back to json and run it. one could pretty easily create an array of all the model names, and then just have it run it.
I think there's going to be a LOT of similarity between models though, so there may not be a benefit other than ones that are very different from each other
so much for the great dalle3... waiting for one more turd
awful lol
generate 4 images with starting the batch, showing 1 and ending the batch and starting a new one with the next one and no text in between each batch and so on til all in all i see 4 images Topic: "Neo-Surrealist" depiction: A luminous, otherworldly creature emerges from a swirling vortex of digital colors, its ethereal form shimmering in the electric glow. A massive neural network sprawls behind it, adorned with intricate patterns and glowing nodes that seem to pulse in sync with the cosmic energy. In the distance, towering structures of abstract geometric shapes pierce through the surreal skyline, casting long shadows on the bustling cityscape below.
that was the prompt
the gap between quality of those outputs vs what you just posted in here is greater than sd15 vs sdxl
these are dalle in highest quality mode
those are badddd
they like the swirls.
yuck
lol
those are also awful imo
also poor prompt adherence
no sign of circuits in the first
what they are, is censored. anything that might look scary etc isn't going to be shown on it.
what model?
Playing around w/ training/mixing - but I have a couple of LoRA on civitai that are the same concept (just could replicate exaclty): https://civitai.com/models/313770?modelVersionId=352057
Dreamyvibes Artstyle Alternate is a followup to a the "Dreamyvibes Artstyle" LoRA I uploaded a few months back. This version was trained from scrat...
SDXL LoRA trained on 50 HD images of explosions. (Dreambooth trained / extracted to LoRA). Trigger w/ "Explosion Artstyle"
now THAT is my kinda lora. holy shit
in the words of the great @noble shoal , day is ruined now lol
you can't be dropping this stuff at 3:42am my guy
volcanos explode good with that lora
Almost Nirvana at the end of the world
Cali......although clocks jump tonight...
that's quite the scene
yeah dall-e has more to say about that one than the previous one
mine was: anthropomorphic volcano island belching forth green smoke. Sea creature warriors surround the volcano in anger / it expanded it to: Imagine a scene where an anthropomorphic volcano that takes the shape of an island, spews forth green smoke. The volcano, dramatic and imposing, appears to have human characteristics, giving it a sense of sentience. A range of sea creature warriors of different descents such as an Asian serpentine sea dragon, a Middle-Eastern giant sea turtle equipped with armor, a Black mythical kraken with a multitude of tentacles, surround the volcano in an uproar. Their expressions filled with assertiveness, reflecting their combined force and anger against the erupting volcano.
I was watching the new aquaman movie so it's based on one shot from the movie
yeah the exploding lora is pretty neat.
haha it's poster art for the second matrix movie
That's a killer image - if you ever post comments on civitai in reply would be cool if you did w/ that one.....dig it....
dling myself
sure, it's now posted to your page
img2img
lol that's awesome
no army would want to face off with that thing
this is with the dreamy lora added. that poor boy wants to add fuel to the fire.
img2img
early iteration from that lol
swapped some of the masks and got this instead
that's quite the chin strap beard on that middle one.
with and without dream vibes lora
hahah and now with explosion lora
he's having quite the blowout
I'm loving those middle few. that dripping glass under the cork is really cool
yeah sometimes the midway through ones are the best
i make sure to save at every step
The best ones would be ones where you stopped at every step
Reconfigured the masks, rewrote all the regional prompts, then did the next step
Etc
What I've done here is kinda approximate something that works so I can industrialize art production lol
didn't even prompt for bat sharks... lol
what's your ability to change resolutions? if you wanted to go widescreen, could you?
haven't tried but i bet it could be done
giving it a shot with a sloppy attempt
1280x768
1152x696 is typical
i've done tiled upscales with these np
veeery nice
got one native render coming then i'm off to bed too
that widescreen opens up more room for chaos
wild party of barbie clown freaks dancing in a zoo with nuclear bombs going off and german tanks storming a village of penguins next to the ocean at night with a burning comet impact
^^input image
that last one i just stretched one
random question. do you know if there's a way to fuse a lora with a checkpoint? so i could just render against that new product and have it be at checkpoint speed instead of lora + checkpoint speed which is slower?
that is indeed a good q
i feel like i heard that could be done?
i know you can extract them...
yeah looks like you can.
swarm has a gui for that
ok neat, thanks
?
wild party of barbie clown freaks dancing in a zoo with nuclear bombs going off and german tanks storming a village of penguins next to the ocean at night with a burning comet impact
joker
Her ya go
Bataman 👌
@native knot I have improved my workflow. Example is in #🏞|general-with-images . I hope the users are satisfied.
I am very proud of my newly written "Save JPEG at 30% quality" node.
You know that already exists in the WAS pack?
Well, obviously not, i would say. 🤣
Ok, now i have to say something bad about the WAS implementation to feel better. It's way to bloated.
It's "feature rich" 😄
@native knot There is a guy who has send a prompt in #🌠|show-and-tell . I can't serve at the moment, because i am finetuning a 1.4 checkpoint (don't ask why)
Here is the image you requested
That's actually a rather good prompt : Art Deco style scene, featuring garishly colorful Barbies dressed as wild, manic clowns, flailing about in a dismal zoo amidst deafening explosions from nuclear bombs and thunderous German tank rampages over a devastated penguin village by the moonlit ocean under a blazing comet impact's smoke plume.
Lol that person echoed the random prompt I punched in to generate a 1280x768 image for my img2img workflow
Kinda weird to have an echo lol with no other comment
The Bot Seekers are a strange lot
indeed. i expect a new DM from them. that said, one of the ones I responded to the other day sent me a dm with "fatso"
Lykon keeps posting these portraits (he doesn't post literally anything more complicated, sigh) but yeah, that 16 channel vae is something special
it's funny, we've been chasing skin detail, but with that sd3 vae, even the ones he posted where it's a little blurry and there's no skin texture, still look way more like a photo than anything from sdxl
I believe, it is possible to say yes to more detail too many times.
Haha remember last night we were like whoa, wtf, how is it suddenly 3:30? ...daylight savings
I bring to you Digital Neon. Use various weights (even past 1) depending on the prompt and/or the model used. Feeling generous? https://www.buymeac...
Can't see the workflow (jpg)
that's a diff style of color bleed than i'm used to seeing with sdxl
⚠️ Light Zombie Ketchup Gore Alert ⚠️
Open Original and zoom in for the intricate details
Ohhh, yeah, that jpg is part of the workflow. I meant more like: You can see the product of that workflow in....
It's littered with self-written nodes that are available nowhere.
dang
in a month or two i'll have a lot of extra time on my hands and get into that
writing nodes, that is
there's so many things i want to clean up or change
countless nodes that desperately need that One Extra Feature (customsampler... my god, that needs an advanced version)
i think i have some idea of what kind of "fine" tuning it is you're up to.... 🤣
this is my message to the bots
😬 Oh yeah?
just curious, what kinda nodes did you add? wondering if we're seeing the same gaps
-A prompt style hat makes everything just worse.
-A 30% JPEG quality save node (got to hear later on that that already exists)
-I think an altered version of an Text on Image node.
Defiantly not the same gaps, i guess.
gotcha
yeah the lack of an advanced custom sampler drives me nuts... you can do really cool shit by changing the rho value for karras and polyexponential schedulers
the other thing i would really, really like to have is some kind of model merge node, even a simple one, with timestepping
I think we have that. Not with timestepping, whatever that is
^as in, first 0.3 fraction of steps it's 0.3 strength, next 0.3 it's 0.6 strength, etc.
starting vs ending strength
there is a tileddiffusion node that ports over the style from webui but i swear the tiled ksampler gives better results (at least without regional prompts)
You mean strength of the model?
the strength of one vs the other, yeah
Oooof. Ok, that's a usecase i personaly have no use for. But if you simply would like to merge them or merge loras into the models, you can just do that.
yeah, you can timestep by chaining advanced ksamplers
but then the interface starts to lag with too many nodes out
If you want to mess w/something interesting, do some model subtraction.
It's essentially a way to create a LoRA that you pull out of a trained model.
You grab the trained model, subtract the base, and whalla! You've got a LoRA.
oh wow that's all there's to it? makes sense i s'pose
we do already have this: https://github.com/asagi4/comfyui-prompt-control which does allow timestepping with a lora... so that's really good to know
There's a few interesting use cases...so for what aimingfail is doing, he could re-merge the LoRA that is the net-result of something he didn't want back in at a weight of -1, pushing the model further into bad results.
Is it effective in keeping characters and styles well?
merging within comfyui or outside of it?
yeah, just wondering if there's any downside to using those nodes vs something else
It's all quite interesting.
very!
so merging with a weight of -1... which node were you using for that? subtract again?
Just search for subtract...you should see the merge subtract node in the list.
gotcha, yeah just checking that's what you meant 🙂
anything interesting you've seen with modelmergeadd?
That's the basic merge model, right? I've used it in the past. But ttn makes a much more interesting multi-model merge node.
that one i have but haven't used
Some of the conditioning nodes are interesting but I haven't fully elucidated their use
Conditioning multiply, concat vs combine... Etc
Oh they are interesting
i'd love to hear if you've found specific uses
but how's it diff from combine?
i've done stuff like that a bunch... but with combine
You have one conditioning from 0 to 30% of the time. And then it switches to the other
yeah, i mean
this works pretty well... just don't know why/where/when i'd use concat instead of combine
guess i really need to just start looking at the code
I don't know what the Impact Pack combine node does. I am talking vanilla nodes here
i think there's a vanilla combine too
the impact one is identical except just keeps adding inputs instead of being stuck with two
I think it just smashes it together
is order of operations important with concat, like how the order of tokens matters?
I am honest and don't even pretend. I have no clue. But back in the days, i made weird stuff with concat and setting time step ranges
Because you can just switch the whole prompt in the middle of it all
By the way. I hope the guy can use your images for his scientific work. Good work there.
two pos prompts combined using the combine node: "a car" "a shark"
order didn't matter here
concat node: "a car" -> to, "a shark" -> from
swapped: "a car" -> from, "a shark" -> to
a single prompt "a car shark" "a car a shark" "[a car|a shark]" "a [car|shark]" etc don't come close to replicating the result of course
"a car AND a shark" in a1111
https://github.com/comfyanonymous/ComfyUI/issues/1403 ah here we are
a car
BREAK a shark
^^in a1111
someone was asking about break in a1111 being avail in comfy... gonna go get them their answer now
https://github.com/comfyanonymous/ComfyUI/issues/1403 conditioningconcat node
icon for video maker app
Thnkx
Here is the image you requested.
Can you help me generate icons
One other thing...if you either adjust your KSampler so that the sampler/sigmas widget(s) are inputs or if you use the SamplerCustom node, you can then apply the KarrasScheduler and PolyexponentialScheduler nodes to adjust the min/max for the sigma and the rho. 😉
conditioningaverage is pretty interesting but very finicky with the strength
i will be doing that for sure
actually just started looking at the code
not a python programmer, almost all my background is asm/C/C++... did a lot of RE and sploit stuff back in the day
but should be doable
If you pull down the ComfyUI Extra Samplers custom nodes, there'a a neat little SamplerCustomModelMixtureDuo node you should check out.
ooo, anything in particular you've done with this?
I just discovered it not too long ago...trying to set up some tests.
i've installed every sampler i could find
don't know wtf to do with a few of them yet
ksamplercycle is on that list
Check the node out:
slightly diff results but otherwise yeah that was easy
samplercustom however does replicate ksampler there
Ugh...keep getting a 2 device error trying to use this.
I wonder if it's hitting a vram limitation and then moving something to cpu. :/
how much vram you got?
10
drop a workflow here and i'll run it and let you know what i hit
It's a first gen 3080.
just installed the mikeynodes will test in a sec
did you modify your sampler like i was talknig about? the custom thing
I just retitled the nodes...nothing code-wise.
i just realized my naive as f hack change just causes it to default to euler/normal
Error occurred when executing SamplerCustomModelMixtureDuo:
'SDXL' object has no attribute 'model_options'
hit 15.6gb vram
Woahhh mikey's got his own nodes now?? Niceee
wait... did you mean that you got a ksampler advanced that accepted samplers/sigmas as output or are you referring to something else
You asked if I modified the sampler, I said no, I just re-titled it.
what effect did re-titling have
They sound cooler
It literally changes the title of the node. That's all.
ahh k just making sure there wasn't something i was overlooking
or misunderstanding
Can you explain this a bit more?
oh, i'm just looking to make samplercustom be able to behave like an advanced version where you can turn off add noise, and do something have it run steps 10-20 out of a total of 30 steps, etc
return with noise
that or make kadv able to take samplers/sigmas as inputs and actually pass them on correctly
Ah...yeah, I think the KSampler (Advanced) node's sampler and scheduler input will only accept primitives. @visual glade could really stand to adjust those to accept the KSamplerSelect and various Scheduler nodes.
I don't quite understand why they trumpet the reduced latent resolution if they increase the depth
Shouldn't the cost be proportional to channel * internal height * internal width
If width and height go down, but channel goes up, is there really a net savings
Yeah spent a bit looking through that code, gotta admit I prolly don't have time to fully wrap my head around it for a while yet
Not sure which approach would be easier - modifying the custom sampler or kadv
Guessing the former
@elder plume would this be something you might know about ?
I guess it does kind of make sense if they use self attention, then the cost would scale to the square of latent height/width
Dreamshaper XL
No special prompt. "Melting World" style combines dystopian cityscapes with characters that appear to be disintegrating or melting, suggesting a se...
anything is possible it's always a question of what will be merged 🙂
I'm gonna need another node to clean things up before VAE...
I think this lora help me get to where I wanted it to be. 🙂
Now put Will Smith there.
I only prompted for hair made of noodles, and he doesn't have any
Here is the image you requested
will smith goes through an impressive transformation for his next role as he uses method acting to become, the spaghetti.
The only roll he should ever play in again.
😆
in this candid shot, will smith goes in for the slap, but realizes he's much taller than chris rock
Shit, what's this? MIB 3? I mean mib 5 coz there is a mib 3 qand 4 apparently? Idk i sotpped watching movies after Tranformers came out... more or less
Cinema died at that point XD maybe AI will revive it
i forget if he was in the last one, it was chris hemsworth and the woman from westworld
Last pic before bedtime, just wanted to extoll the virtues of image to image in SD. the bad quality one is from ideogram, which has amazing prompt adherence, but generally pretty low quality visuals. SD to the rescue (dark arts images checkpoint) with an amazing restoration with iterative 0.35 denoise upscaling.
got one for ya in case you need any help getting insomnia
that's with my dual regional prompting + diff diff workflow
decided to take a crack at my original simpler diff diff one again...
diff diff is king
also, used my mistral llm to enhance the prompts... def was a quick and easy way to get better results
alright now i'm really liking this
used conditioning concat for the third pass, instead of a new conditioning
This neighbourhood was so nice before the demons and constant lightning 😦
Love these!
A lot of chocolate sauce you got there.
Yumm!!! Where's my knife & fork?! 🥳
Some already have a knife hanging somewhere close to the toilet if you're lucky
good ol poop knife
Here is the image you requested.
just crazy how good differential diffusion is
What exactly is differential diffusion? I've seen a few references to it
https://differential-diffusion.github.io/
It allows you to use masks to edit small portions of images in a way that doesn't distrupt the things around it, if done properly.
Editing different parts of a picture by varying amounts, as specified by a map
Nice, I'll check it out
Are these from you diddling around with the regional prompting workflow again?
The differential diffusion does work pretty well most of the time
diff diff and rotating the image between multiple iterations
shit, it figured out it was upside down
i always wanted to work in an office building
1950s Photography as a style. Use it for more than just photography like images. Feeling generous? https://www.buymeacoffee.com/generalawareness
been making a few environments and really loving nature compositions, anyone have any tips for upscaling strange perspective images like these?
This was done by SVD?
nope, both it and the sea turtle gif next to it are random tenor gifs lol
||gil||
19/2000 Logo with talent element
Here is the image you requested.
The fractal art of a pentagram
can you let me know which model is it? Does we need to use ADetail to get this result?
It's my UltimateBlendXL v2 on civitai...which doesn't seem to be working. Just search my name on there, when it works.
It was a while ago since I made those, and I honestly can't remember if I used the face detailer or not.
prompt: In a room, there is a man wearing sunglasses, a gold necklace around his neck, and a big cigar in his mouth in front of the coffee table style: Anime aspect-ratio: 16:9 character: None
hairy snake

how to use?
Which one is the beginner's room? How do I send messages and draw pictures? Thanks!
You either install it locally or use online services. Right now the bots are down, cf #1047610792226340935
@slim wren Oh! So it can't be used here? Thanks!
My only grievance is that it VAE encodes the whole image and thus also changes the "black" parts of the mask.
Did anyone create a nice Workflow to circumvent that?
yeah i've been doing that for a while
just use latentmaskcomposite to patch it back in
Can you share a simple Workflow showing how you set it up?
Why did I see low tier god here
lmao
it's the lightning bolt
sure
workflow embedded
a fish is playing game in the ocean
Kind of neat. I've got a prompt that attempts to split an image up into 6 pieces and create a regional prompter compatible prompt automatically for me based on a given regular prompt. GPT4 works on it maybe 20% of the time. Claude 3 just did it perfectly on the first shot. The requested prompt: a cat with a red hat playing baseball with a dog in a green jacket.
New here, is this a good place to ask for poses/etc?
I see all sorts of human ones, but I can't seem to find any animal poses for 4-legged friends and whatnot.
reason #1 million why comfyui is such a good thing to learn how to use... this image
tried like 15 times to inpaint the shadowy figure on the porch in forge/a1111 without luck
got it in the first shot with the first seed in comfy and i'm done
Hah nice
could always just region prompt it. 🙂 claude 3 is apparently awesome at it. i'm sdxl'ing one of the SD3 shots
I've had issues trying to region prompt something like that that is that small
Claude 3v
?
This was the #SD3 shot that lykon posted
A black panther leaping, in the style of John Howe ADDCOL
A blue elf with white flowing hair, in the style of John Howe ADDCOL
Yellow eyes and a blue scarf , in the style of John Howe ADDROW
A black panther leaping, in the style of John Howe ADDCOL
Blue armor and a wielded sword, in the style of John Howe ADDCOL
A stone valley, in the style of John Howe
the key is having vertical columns that are the same subject, so you can have horizontal granularity, but have a subject inhabit a whole 1/3rd of the left to right
I want you to create a terse text to image prompt. For that image, I want you to split the image up into 6 pieces; 2 horizontal rows of top and bottom, and 3 columns per row, left and middle and right. I want you to describe what is in each piece, starting from top left, then top middle, then top right. Then for the bottom row, bottom left, bottom middle, and bottom right of the image. Use the word ADDCOL to delimit columns and ADDROW to delimit rows. Don't add words denoting which image piece it is. Don't mention more than one subject per image piece. Put each image piece prompt on its own line. A subject is allowed to span vertical image pieces. Determine an appropriate artistic style for the overall image and mention ", in the style of " with that style at the end of each prompt line, but before the row or column delimiter. Please make a text to image prompt for: a black panther is leaping to the left of a blue elf who has white flowing hair, yellow eyes, a blue scarf, and armor while weilding a sword. backgorund is a stone valley, which they're both at the bottom of
gpt4 couldn't really handle this complex a prompt. but claude did it
yeah it was just released a week ago and it's probably 20% better than gpt4, which is obviously a big deal since that was the king before this.
good question, I don't know. 🙂
I'm wondering if it was able to handle that prompt because it understood the concept of a grid and could count
Gpt 4 def can't
apparently transformers based models can't tell how many tokens an answer will be, or conform to that, so getting sub 75 token prompts from these generators is the tough part.