#✨|sdxl
1 messages · Page 183 of 1
yeah there's all diff kinds... all are valuable in some contexts, bad in others
there's apparent detail... which can be enhanced by shit like film grain, sharpening filters etc that's not really hallucinating anything real
real detail... adding complexity on a fine level that should be there... then compositional detail, which is what we're usually thinking of when we mean "add detail with an upscael" etc
"real detail"
train model on only last 100 timesteps. Then use it as a refiner model, which only takes over the last 10% of steps.
works 10/10
poor mans refiner model 🤣
looks like a legit photo of a gold leaf artwork
Thanks, i'm working on a specialized Mangled Merge for styles. In the process of merging a bunch of TBH style loras to Output Block 1 and found some new techniques that are working out really well.
@copper kraken your stuff is amazing it keeps getting better and better. Makes me wish I could run comfyui (which we'll probrably get at one point in DT). Are you using your own model, if so are you sharing it somewhere?
is a 6.7gb sdxl model fp16 unpruned?
Kolors
thanks! last few have been base cascade with ultrapixel. i need to overhaul these nodes completely from the ground up so this can actually work in comfyui properly and support things like loras, alternative checkpoints, different samplers, etc
this is really the floor of what we can prolly do with ultrapixel
Loooks like it's getting the best out of Cascade. Does it need a lot of vram?
nope, these are all being run in something like 10-11gb vram at the peak
it requires far less vram than sdxl at the bigger resolutions
Still too much for my poor 8gb, but I think our dev can make it work for the smaller devices.
oh great!
We have tiled decoding and tiled diffusion 🙂
cascade is really memory efficient if you're using the "lite" checkpoints
usually the peak vram is with the vae decode (stage a) step but you can tile that too
def look at the full version of that one^
anyone knows what it does mean black and red dot in segment anything?
If I remember correctly, the red dot is what you want to select. It's been a while since I've used it.
thanks buddy
TypeError: 'NoneType' object is not iterable using controlnet openpose in SD WebUI Forge
anyone would help me, please
Right skeleton doing sum quite odd to that frog
Fooocus
Hi there, I've been out of the loop for a while. My models are still SDXL, using SDForge. Is there anything more up-to-date or better than this? thanks
cascade is great, espec with ultrapixel... working on that myself lately
I'll check it out on YouTube, thanks for updating me.
Here is the image you requested.
Can you recommend any tutorial that teaches how to install this?
Yes, I have ComfyUI, but I liked using Forge. Is it available there?
I have 8gb vram, is possibile to rum?
you could prolly gen something smaller than i'm doing here
Jesus Crist, this is crazy
cascade is pretty mem efficient
def more efficient than sdxl if you use the lite checkpoints
I'm watching this video, and it looks like it will install UltraPixel directly in ComfyUI. Can I do that? https://youtu.be/4Gf9lkgZ0SE?si=sfgAxOn54kGwki04
00:00 Intro
02:03 Local Installation
08:08 Control Net
My workflow https://openart.ai/workflows/-/-/hO6cXzfOxTQK61tnsAXg
Comfy Node for Ultrapixel
Github https://github.com/2kpr/ComfyUI-UltraPixel
paper: https://drive.google.com/file/d/1X18HH9kj7ltAnZorrkD84RJEdsJu4gDF/view?usp=sharing
All thanks to the team that made UltraPixel:
https://jin...
no idea with the video but if you git clone the 2kpr repo then swap in the stuff i addressed in a pull request you should be set
download what i converted, those two safetensors, into models/ultrapixel
then replace the python files in the custom_nodes ultrapixel folder with these
the workflows that use these are embedded in the images above
@willow stone
@copper krakenIs this delay normal the first time I'm rendering something? Or is this a mistake, something wrong?
yeah it's downloading the checkpoints
again just make sure you used the .py files i gave ya and the safetensors in models/ultrapixel
mitigates potential security risks cuz the ones they provided weren't really safetensors
Numpy 1.23 eek, will that mess up other stuff?
no idea
it didn't fuck up my install
i don't have a ton of shit on here though
Can you use SD 1.5 Loras with SDXL models or even sd3?
Ok, well will try it out in my dirty comfy that is spammed anyway
i'd just duplicate your comfyui venv, and if it gets fucked then just revert
lol nice
I have a clean version too, so will see : D
iirc, numpy 2.0 has been causing a lot of headaches lately. I wanna say even comfyanon rolled back a req on it to like 1.26, but I'd have to double check
Yeah true, numpy 2.0 single handedly killed my entire VoC env lol
I know a lot of other things have been rolling back, so it's possible I'm getting it mixed up. But all I know is I've seen it in quite a few repos lately for various AI related things
Ye, but numpy 1.23 in this case should be fine
UltraPixel raw renders, Idk if it'S worth it to pixelf***k it by downscaling and upsampling with a detailer again? : D But so far it doesn't like complex prompts
prompt adherence is not gonna be cascade's strongest area
I mostly use jugger cos it has one of the best prompt adherences
but sometimes losing some prompt adherence for looks is okay
I would use kolors as an example cos it looks great but can't follow prompts that well
Kolors follows prompts really well if you prompt correctly. The problem a lot of people have with the newer style models is they still, prompt, like, this, rather than using correct grammar with them. They are using datasets where large portions of the data are captioned with natural wording. Long story short, 80% of the time, it's poor prompting
Using an LLM to take some 1.5 style tag prompt and expand it into a real prompt, can help alleviate a lot of the issues people have with adherence in the newer llm style models
for context I am one of the worst prompters around
I've just never paid attention to learning prompting properly
so yeah good prompting may get way better results than I did
I do use natural language rather than tags though
for the encoders that take natural language
so at least I avoid that mistake
So I'm at the 2750 lora mark of Mangled Merge. I've spent the last 450 merges into merging only style loras focusing purely on 2D going 100% into Output block 1, and 50% into input block 8 and output block 0. The model works fine between 2 and 8 normally. It's lost some of it's photorealism (on purpose), but something I just found out in the attempt to find where the best photorealism CFG mark is, that CFG .5 with FreeU and PAG on this model is completely bonkers.
That's why I said using an LLM to write a good prompt for you can help a ton. You just have to have a good system prompt to guide the LLM in formatting the prompt correctly. Llama3.1 instruct, Gemma2 instruct and Mistral nemo 2407 instruct have a super easy time with it, at least based off a lot of my recent testing.
But once you have a good sysprompt, you can feed it in even a bad SD1.5 prompt with a bunch of tags and it will make them nice
Where can you find a good sysprompt?
A ton of trial and error. But mostly you need to explain it like a teacher to students and include examples
Art teacher*
gotcha. It's a pain having to take out score_019383019, 1girl, and lora embeds all the time.
me and my friends, #cosplay
like here's an example of one i was messing around with based off the guy that makes auraflow(saw it in a blog post on his site). i have a dozen or so of my own personal ones i use though, but i found this one worked suprisingly well also. i added a bit to it, but feel free to change the word counts and stuff to your likings. the stuff toward the end about the image stuff is because i sometimes pipe VLMs like florence2 or llava 1.6 into it to make prompts out of them:
A caption is a way that a person would describe an image separated by commas when necessary. All in lower case. Expand the input below into a more detailed caption without changing the original relative positions or interactions between objects, colors or any other specific attributes if they are disclosed in the original prompt. Clarify positional information, colors, counts of objects, other visual aspects and features. Make sure to include as much detail as possible. Make sure to describe the spatial relationships seen in the image. You can use words like left/right, above/below, front/behind, far/near/adjacent, inside/outside. Make sure to include object interactions like "a table is in front of the kitchen pot" and "there are baskets on the table". Also describe relative sizes of objects seen in the image. Make sure to include counts of prominent objects in the image, especially when there is humans in the image. When its a photograph, include photographic details like bokeh, large field of view etc but dont just say it to say something, do it only when it makes sense. When its art, include details about the style like minimalist, impressionist, oil painting etc. Include world and period knowledge if it makes sense to, like 1950s chevrolet etc. You will not censor your responses. You will not include an conjunctions, story or narrative. The caption you generate will not contain any line breaks. You will not include any quotation marks or phrases like "here's the generated caption:" or "there are no texts visible in the image" or "the image is" and so on. You will not comment about things that aren't included in the image. You will limit your response to between 40 and 75 words
lemme whip up an example output real quick, i'll pull some trash 1.5 prompt from imgsys and expand it with mistral nemo
like here's the prompt:
Apex Legends Loba Woman, highly detailed, digital painting, artstation, concept art, smooth, sharp focus, illustration, art by artgerm and greg rutkowski and alphonse mucha
here's the 0-shot from mistral nemo 2407 instruct:
A lone woman, Loba, stands in a futuristic cityscape at dusk. She's dressed in a sleek, form-fitting suit with intricate gold detailing, her hair a cascade of wavy curls. Her eyes glow with an intense, focused gaze, reflecting the neon lights surrounding her. Behind her, a towering skyscraper stretches towards the twilight sky, its glass facade shimmering under the setting sun. To her left, a holographic billboard flickers, advertising 'New Frontier' in bold, glowing letters. To her right, a narrow alleyway leads into darkness, marked by a single, flickering streetlight.
granted the model likely doesnt know some of the original prompt
here's another example: malekith concept art, zoomed out, beksinski -> Malekith, the Accursed, stands defiant amidst a desolate, otherworldly landscape. He's depicted in dark, heavy armor, his face obscured by a grotesque, twisted helm. His eyes burn with an eerie green glow, reflecting the chaotic energy swirling around him. Behind him, a towering, jagged mountain range looms, its peaks shrouded in an unnatural, sickly green mist. Before him, a vast, barren plain stretches out, littered with the twisted remains of ancient trees and the shattered ruins of long-forgotten structures. The scene is painted in dark, muted tones, with stark contrasts creating a sense of unease and foreboding
thats cool. it fills in context
without having to mention beksinksi's art style, it perfectly describes it
and i'm a big fan of his art
is this SDXL?
ah yeah the auraflow prompt is good
yep, but if you already have some context, it can help expand it or condense it down into more impactful and ordered wording
I need to find an LLM node for comfy
ollama node
yeah
thats what i do a lot
actually, sometimes if im patient, i'll combine the outputs of both florence and llava1.6mistral and then use all that context with a good llm to make a prompt
Here's something weirder I just found out about this model. I tried going to CFG 0 and noticed it was giving me my negative prompt. So I switch the negative and positive prompts around. Somehow, my model is working amazingly at CFG 0 with prompts switched? How?
But here's the thing. If I put the positive prompt into the negative prompt, it's working.
run more seeds, it will likely flipflop btween them
nope. pretty consistent
I also have PAG and FreeU. but that's so weird that the prompts flip at 0
I'm gonna put some styles in the positive and see if it gives me any
yeah there's some kind of clamping going on with those nodes then, zero cfg with a default workflow will generate weird shit that doesn't follow anything
lemme flip the inputs though
ahh nvm, yeah it's working now with the neg in the pos and vice versa
at any rate, i wouldn't go that low with cfg
Tried the same prompt at different seeds, and putting illustration, 2D, artwork by sigeru miyamoto, into the positive, didn't give any illustration.
Looks good for randomness too. I'm gonna test more.
so whats controlling the uncond_pred?
ahhh. ok that makes sense now. weird.
that's why it's working though, but yeah, it should never be zero, if yout set it to even 0.01 it will allow the positive to work again
negative prompt yes
and stuff like control net, IP adapter, SAG etc can also go via negative conditioning
think they all get cfg multiplied before sampling though, so they'd end up being zeros as well, except their negative prompt portion if you use advanced nodes (for cnets i mean)
is it possible to go to .01 in comfy? Im trying, but it keeps rouding to 0
yeah in the settings you can change precision
cfg will snap to two decimal points though on stock ksamplers
so 0.01 will be as low as you can go
found it. at CFG .01 with the prompts normal it's going back and forth between the negative and positive as you described.
but yeah, use a normal cfg. honestly, don't go below 1.0. think of 1.0 like 0
I just like to experiment 🙂
yeah for sure. at 1.0 cfg, you have no negative prompt
it will also diffuse much faster too since it's not having to handle unconditioning
like stable cascade used 1.0 and 1.1 for cfgs i think
These are at CFG .01 with positive and negative equal.
right, but some of those other nodes as likely doing clamping stuff on the cfg
bypass them and you'll see what i mean
yeah, it probably wouldn't work without PAG and FreeU
so what i meant earlier was that you cfg isn't your actual raw cfg
like if you use automaticcfg, for instance, the cfg on the ksampler isn't your cfg anymore
and PAG is cfg+pag amount
so by default, PAG wants to be 3.0, so 0+3=3
makes sense
which is another reason why a lot of people dont like pag because they will use a cfg they normally use like 7, with pag 3, it makes an effective cfg of ~10
so your target cfg with pag is target=cfg+pag
so if you want 5.0, with pag of like 2, then cfg would be 3
and so on
might be better for me to just stick with FreeU then.
FreeU really is a free lunch yeah
These are CFG 1 without FreeU and PAG.
i personally dont care for freeu, automaticcfg and/or pag is pretty much all ill use anymore with sdxl. freeu just doesn't do the right things for me i've found.
These are CFG 1 with only FreeU
FreeU is just really subtle and weak
its not a very powerful tool
but sometimes it can help
what do you guys use?
FreeU can also harm
I've found that with the right settings. This is how I usually keep mine.
if you want some real fun though, start messing around with beta noise schedule. shits cash
I do want real fun lol. I'm gonna look into that.
where can I find info on that?
but it might with other images
helps a ton for the early steps and fine details at the end. it's in the default ksampler now
click scheduler and it's in the lsit
outside of test images I use adaptive samplers anyway
cos that just removes any sigma issues
yes and no, they still use sigma curves afaik
i dont really mess with adaptive samplers that much though
they take it as input but they ignore it
I guess I need to update my comfyui. gonna try this out.
yeah do that. obviously, don't expect magic or anything, i've just found that it works excellently for images in the 30-35 step range, which is all i do
oh yeah it might be good for 30 steps
I was testing it as like 12 steps
where AYS/GITS do better
oh yeah for sure
those sigmas are centered around that specific task
using that malekith prompt example from earlier, this is hunyuandit1.2. fucking nailed it(helm might be one of those face helms)
I kinda only do two kinds of generation
12 steps UniPC GITS in a few seconds
or adaptive implicit midpoint, adams or RK4 for nearly an hour
I don't really like the middle ground
hunyuandit is good sometimes yeah
i think that might be AI generated
ok, so its definitely very well conceiled
there is lot of lighting in the photo, do you suggest any keyword/ method to lower the lighting? I am trying to achieve iphone camera quality picture, but cant seem to work with lighting, I am using realvisXL 4.0 lighting (baked)
IP adapters with photos with a lighting level that you like
@copper krakenHi, my knowledge is quite basic, I don't know how to apply that command you showed me. I still can't run Ultrapixel, but also, I'm not sure if my computer can handle it. I have 64GB of RAM 3200 2x32, 2070SuperFTW3, and a Ryzen 5700X. I really want to create images like yours, but if the 8GB of GPU limits me, do you think it's better to keep using Forge while I wait for a LOW GPU version of Ultrapixel? Forge benefits from low GPU, which is my case.
oh, I will try
Cool what model is this
cool
It's the next version of Mangled Merge I'm working on. Hoping to release sometime this week.
W
Can't wait @ me when it comes please
Will do. ☺️

such cute spiders!!
Spider I'm your father.
One message removed from a suspended account.
Lion
XXMix_9realisticSDXL Model + ADetailer (person_yolov8n-seg, face_yolov8n, mediapipe_face_mesh_eyes_only) + 4x-UltraSharp Upscaler 
NFT? That's still a thing?
is this also ultrapix?
No, this is SDXL
It's using my image2image detailer. It started with this image
Now that's a dirty girl!
As a depth map? Looks like what I'm always talking about when I tell people to just mock up a simple scene in blender and export a depth pass
No, just as a starting image
Ahh yeah looking at it now, it's an ambient occlusion map anyways
But if you have the 3d scene, you can export a depth map and even a basic render with flat colors as eell
I don't, and there's no need when I can create images like I have above.
I see
this is what i'm talking about btw. in ue4/5 you can make an extremely quick postprocess volume material and get perfect depth maps if you want and even adjust the cutoffpoint distance.
obviously, render it out to the resolution you want and jack up the antialising on it. shit works really well with the union cnet
Looks nice, what checkpoint do you use for this?
Boltning
here's an example of using the technique with the union cnet (i like making fake cryptid/ghosty stuff)
UE5 is great for it because you can get a shitload of free assets off the quixel bridge and mock up scenes you want. props, materials, characters, etc. there are also meta humans you can make and drop in, pose and so on. so you can get a ton of control over the scene and then use SD to do its thing with cnets and/or just img2img at some denoise amount
(for people that want strong control and direction over a scene they want to make)
Sounds like too much effort for my liking 😄
well i've been the vfx world for decades, started doing 3d shit as a teen around 2000. i can tell you that this hybrid approach absolutely rocks compared to having to manually model/texture/tweak everything for a 3d scene
but i guess not everyone is down with it and that's fine too
just like to show options and workflows for people that might have similar skillsets
Reminds me of There Will Be Blood
Depth is probably more interesting for AD or so. Actually not even sure if comfy would clip 32bit depth renders to 8bit
BTW also coming from. VFX
Mr Originality - nice to see someone breaking barriers and exploring beyond the typical!
Yeah idk, but 256 levels should be enough for diffusion since it's a "fuzzy" process anyways. In theory, people could train a 24 or 32 bit grayscale depth cnet model.
I was actually talking about that with some peers a couple months back for terrain generation. Literally make a diffusion model that spits out height maps with good erosion and flows. The RGB channels would be depth, flow and deposit. But I don't have a handy cluster of a100s and thousands of man-hours for captioning height maps to train a model from scratch :/
Vs using apps like gaea, world machine and world creator
They become painfully slow once you get into the 8k range. I wanted to make some kind of model that would take the course output and then use diffusion to finish the rest
Obviously it could be expanded to other tasks like making other splatmaps for things like vegetation, rocks, debris, etc
yeah i really need to try this
Yeah kinda as you said. Currently on mobile so don't have example images, but we have a setup that is ultra old-school. Took hq stuff from https://gisgeography.com/usgs-earth-explorer-download-free-landsat-imagery/ fed it into https://www.bluemarblegeo.com/global-mapper/. It's 2d kinda since sat data from above. But when you camera project ai renders in nuke over a displaced point cloud you get really cool stuff. Workstation in repair atm, but can send some images if you're interested
The USGS Earth Explorer is quick and easy to download free Landsat imagery & other remote sensing data. Enter ROI, define timeline and download for free.
Powerlines 😀
@uncut steeple @smoky patrol
There are two cryptocurrency messages above this one. Can one of you remove them? ❤️
And all their posts as well (they spammed the crypto thing in multiple channels)
Thank you ❤️
Hey guys, is there any model that performs well with typography?
Can I use this for personal projects where I'll be paid? From what I understand on the website, I can, as long as I'm not a big company. Is this correct? Thanks.
up to 1 million
Thanks
how do i set clip skip to -2 in webui?
or is it just 2
because it with go in the negative
not sure but in comfy there is a node for it
yeah, its just 2 as you expected
Alr thanks
Does anyone know if there's a way to run adetailer on a batch of images in a folder?
@spiral bridge https://civitai.com/models/447902?modelVersionId=684665
cute faces, how well does that model work on half body or full body images?
i see the author released v1.0, user posts on it look good but they are mostly half body
which one do u think is better?
both of the image textures look good each with unique difference in perspective
no such thing as better
better would be strictly subjective opinion
that's true
what's your opinion though?
i dont offer that kind of opinions
those 2 images have good quality in tone and texture is all i will say
it would really help me out because 1 of those is my attempt at an improvement
but thanks either way
its up to you how you wanna apply your idea in context of what your actual objective with those images are
my objective was to get a more varied image similar to how i feel smaller resolutions of images offer a better look variation wise but is worse quality wise
the right image is my attempt at taking a 1024x1024 image and improving it by using controlnets and downscaling it to 512x512 before upscaling it to 1024x1024 and further upscaling it afterwards
the left is a standard 1024x1024 image
there you go
imo the background on my version is better but that's only my opinion
your objective = your opinion that will matter
thank you :)
np, so yeah both are good quality
here's another try with an improved background
well the right one has more depth cause it incorporates the background and foreground
im not saying better between either of the two, if you want close up left is good too
that's the thing because a 512x512 image is cray good at doing that for some reason but it's always very zoomed in compared to a 1024x1024 image. to combat this i used a 1024x1024 image and a controlnet to force the 512x512 generation to be zoomed out, then i used that to generate a 1024x1024 image
any reason why you are choosing square canvas?
you could mix the height and width like 768x1024
now that you mention it i saw that faces became a bit deformed when generating the first 1mp image to use as a reference for the 0.3mp image
i'm not sure why that is
the top one is for the 1024x1024 image
technically choosing 512px with sdxl model is not a good idea
i made it so it always ends up being 1mp regardless of the width and height
but it doesn't seem to like that for some reason
the problem happened with the 1024x1024 image before going over to the 512x512 image which is what kinda confused me
sounds like you are attempting something that goes beyond image dimension
because 1024x1024 or anything close to 1mp should be the best but it's somehow deformed
just something like 1080x1920 messes up faces, let me try to show it
here is a 1024x1024 image for reference
Hey guys I've got a question I feel like I might not be in the right place to be asking us and so you guys can just tell me to go fuck right off if I'm intruding on your space but can I pose my question?
SDXL
1024x1024 default
Variants:
640x1536
768x1152
768x1344
832x1216
- 896x1152
use those image resolution for basic render
it stretches the features to fit the frame
thx :)
which resolution did you set that?
So the thing I was trying to do was take a picture of some stitches that someone had on their forehead and I added to it a little bit until it occurred to me it looked like a nine inch nails cover. And I wanted to lay the graphics over and everything and I know that sounds super simple but, I couldn't figure out how to get the rights to overlay tools trademark over the top and then just create an album cover
the base model training data set requires you apply those mentioned resolution then upscale later
1080x1920 and plugged into something that sets it to 1mp
or is it just the resolution combination that matters and not that the final resolution ends up being close to 1mp?
yes the resolution combination matters on rendering image by pixels
you can't apply random dimension
image would look distorted
i thought it would be fine as long as it ends up being 1mp
Hey look guys I'm a little bit new to this place. I mean I got on here years ago but I just haven't been on a lot so help me out I'm not even sure who I'm talking to at this fucking point
To be honest I'm not even sure if anybody is actually speaking to me
i'm so confused
You tech wizards are way above my pay grade but what I was doing was I took an image and I tuned it up a little bit and just for a gag I was trying to turn it into a fake 9 inch nails cover and when I was trying to find a way to overlay good vector graphics of nine inch nails graphics over that image it just wasn't working out for me because they're protected or maybe something like that or I don't know maybe you guys are better at that than this that's why I came here Jesus fucking Christ
it feels like you're being very hostile and i haven't even done anything
i would gladly try to help but at this point it feels more like being held at gunpoint lol
Not at all my friend you know in fact if I could get any help at all or if you could even lead me in the right direction I would appreciate your help I'm sorry if I came off at Host as hostile I didn't mean to come off that way
light show inspired by Song Dynasty aesthetics, soft blue and cyan hues creating a tranquil and elegant atmosphere, gentle blue waves and reflections integrated into light projections, delicate flowing particle lines, showcasing modernity intertwined with elegance, minimalist modern design, intricately expressing Song Dynasty beauty traits, unique visual effects that resonate deep emotional connections with the audience, serene ambiance, immersive experience, 4k --version 6 --ar 16:9 --chaos 0 --quality 1
i can't do it directly for you because i gotta go to work soon but if you could download comfyui (which is a UI that makes it easier to generate images locally on your own pc without dumb copyright stuff restricting everything) then you could make whatever you want 😄
Well anyway I could show you the image that I had. It just sort of like when I saw it it reminded me of one of the earlier cover images and it just made me think how can I repackage it into a another you know faux label. Not to try to fool anyone but just to you know share with my friends who also love tool as much as I do
sure ofc
I mean I don't have any problems sharing the image with you because you could probably do a way better job of it so here you go. That is is there a way for me to just upload this right now I'll try it
if you click this button you get a bunch of options and then there should be one that says upload file or something
this one
but i can't make anything right now because i gotta go really soon. wish you best of luck with it all though and feel free to ask if there's anything
As I have said I'm I'm pretty new to this format because I haven't been back here in a long time it's probably been four or five years since I've been on here and I'm sure a lot of things have changed since then and I was already an idiot even back then
generally the way to go about it would be to do this:
- download comfyui
- go to civitai, find a checkpoint model (.safetensors or similar file types :))
- in the default workflow inside comfyui you can change the checkpoint model to another one if you download one
Yay I did it
amazing :)
So do you see why I thought that would be a great background for a tool cover?
In case you're concerned, that's not pornographic. It's just a picture of my forehead this very weekend
it does look like metal yeah
They blocked my last message because I said they stapled it up with metal instead of some weak floss, some weak ass loss
They didn't like the word puvvs axx floss
Anyway my stupid idea was just to take that picture that I took of my stapled up forehead and turn it into the tool album cover that never existed and not use it for any kind of ulterior motives but just to just you know have a laugh my friends and go you know hey here's an album they never released it's called the weekend when I broke my fucking head open and had to go to the hospital etc I thought it would be a great joke
Guys, in which folder in my ComfyUI should I add the file "pytorch_model-00001-of-00002.bin" for Ultrapixel? I'm downloading this manually, since I always get an error message when running Ultrapixel for the first time, according to Chat GPT this is the file I'm not able to download.
Are you telling me that this is the first time you can't find an uploaded image and I'm the one that's provided you the image that you've never been able to find before or am I miss hearing you
this error
I am not sure of the answer because its Cascade architecture
if you wait until the Cascade guys come online they will know
Okay I'm so out of the loop here I'm sorry everybody's laughing at me Jesus Christ fuck me okay well but I am also the only asshole that split his own head open on the table and went to jail for splitting his own head open fuck me
I skipped Cascade completely and went from SDXL straight to SD3
nah man nobody's laughing
we're all in this mess of constantly having to learn stuff we thought we knew
Hey vipraz I appreciate that message you know I love that attitude
of course man
No you don't know you don't know that I love that attitude but I'm telling you right now I certainly do love that attitude
There's a guy here in the group, he's posting the most incredible images I've ever seen, he helped me, he recommended using Cascade within ComfyUI + Ultrapixel.
Hey guys can I ask a really uncomfortable question here what's the age range can everybody know that I'm going to ask that that's I'm pretty much already know it's early twenties to late 30s maybe if that I'm not that but what I was trying to do I'm sorry I stumbled into this room to just try to figure out how to do something simple but like I'm a little lost do you mind that you guys kind of point me in certain directions maybe??
i'm not sure how old everyone else is but i'm 21
yeah sure what are you trying to do
I have 36, old man
Yeah that's about what I expected so I'm 44
All of your Ultrapixel models should go into {your ComfyUI}/models/ultrapixel, but that pytorch model isn't one of them. The controlnet directory has a canny.safetensor file (for Cascade).
This is not a dating group, so your age doesn't matter.
Thanks Galaxy
Okay I appreciate that I know it's not a dating group come on guys!!
But for some of us blokes we feel like you know I mean come on now I'm 44 I'm kind of not where it's happening anymore so I understand that if I'm outside of the spectrum of everything that's cool now I start feeling like you books dads and I'm I probably could have wanted to be one of you guys dads Jesus Christ that fucking oh my God I should just leap into the river right now
I don't mean it I'm not going to leave in any kind of river but you know what I mean maybe
before you do, download comfyui or automatic1111 and see what it can do 😆
I get this error when trying to run Ultrapixel for the first time here within ComfyUI, I already installed Cascade today, and everything is working fine, do you know how to solve this?
Okay well what are those things designed to do please tell
I mean only in a sketch you don't have to say it all
those are both UI's (user interfaces) so they have a bunch of buttons you can click and they will do most of everything you want
At least give me enough so as to not throw a hitch over the timber and turn over a stool
Not really. Do you have a Huggingface account? Sign in, delete the ultrapixel model and custom nodes directories, reinstall the custom nodes and try again to let it download the files.
I know what the fuck a UI is, as well as a GUI what do you take me for. And that'll be the last thing that I wrote my note after I have credited you Galaxy time machine
Hi now I don't have a hugging face
i don't know what you do and don't know so i'm starting at the start
I'm not sure how this interface works are we in private message right now when I respond to you right now?
I have so much to learn
The message isn't addressed you. Were you like this before the bump on the head? Perhaps simmer down a bit.
And now who am I speaking to GTM?
Okay guys to everybody here's what I don't understand I think I'm I'm okay here's what I've here's what I think is happening right now. I think everybody is having a laugh at me because I have no idea about the format of this stuff and you guys are absolutely right I have no idea
But there was no there was no cohesiveness or cohesion towards any of this or between any of him and my siblings and so it all fell apart and I continue to hear about this platform through all the podcast that I would listen to and where they would try to like really get this thing off the ground and it seemed to me like a sort of fell apart in large part
But I I feel like it still held together and kind of an underground way and in a way that I would like to have been a part of but I just never really got into it and found a way to you don't like I didn't ever really find a way into it so I never did
i'm not sure i understand
Why does it matter to you? This is how "social platforms" work.
Okay I'll just answer you from my point of view I don't know what we're talking about necessarily when we talk about the social network but I mean I got on the socials when they first popped off and then I got burned off of them I got tired of them I got off of them at some point because they are not serving me.
oh quite a lot of people quit social networks by now yeah
they are not as big as they used to be
I hear stories about back when people were on VBS stuff and people who were early adopters and that sounds like what I would have really rather been on and I I was not that early of an adopter but I was on earlier than even that because I was on the phone tag stuff do you guys even know about that?
not sure what phone tag is
Yo it was wild it was like, you would pick up your phone and you would call a certain number and then you would leave a voicemail. And then other people would do the same thing and then after a while you could call into that voice bank and you could press certain numbers and they could press certain numbers to be like oh let me hear your response to the thing that I said and so forth and back and forth and so people would just get into these riots in local neighborhoods within their local areas and just fight back and forth and that was before the internet it was even popping off although the internet did exist back then just nobody knew about it because it hadn't because the Internet hadn't popped off yet
Yeah that was the crazy alert back then because that's when it started it's like when everybody was started but that was like the earliest social media there ever was
almost like an argument except you could leave it until you had a solid enough thing to say lol
That was in the early 80s when that started happening and if anybody could have had the foresight to you know see the future but you know who could have predicted it
i wonder what would have happened if modern social media as we know it today wasn't introduced
Were you even old enough to use a phone then? 🤔
me? no. but if i stopped wondering about things that existed before i did, it would be a boring place
Hey Galaxy I guess you're asking this Perez because I'll tell you right now I was born in 1979 but, I think you both are much younger
I'm 58
Oh wow I'm surprised to hear that Galaxy cool I guess that makes me not the oldest person in the room
i'm still the same age as before, guess there's not much i can do about that 😂
The thing that makes me most upset in the world is that I'm not going to live to see what comes, I want to see all the things that comes and the fact that I cannot see those things it's the most bummer thing that I can possibly imagine
This is a channel about SDXL, can we get it back on track? 🤷🏻♂️
Well tell me GTM what is sdxl?
Because I just wandered into this room looking for something that was apparently completely unrelated
If anybody's interested I just had a hell of a day. I'm just happy not to be in jail
But please tell me what is sdxl
Software that lets you describe an image and it then creates it.
Google can tell you more
these were testing vectorscope node
and that one was FreeU node but with the settings pushed the wrong way
(lower B and raise S)
@coarse kelp hey I guess we got disconnected because I said some things that got banned I didn't know they were so sensitive over here my bad
Hey but I was really enjoying some of the chit chat we had going on earlier
I hope I didn't buy me out with anything I said I didn't mean to be offensive honestly I was just really seeking out answers but you know it was you and that one other guy really seem to give me some feedback and I really appreciated that
Your work looks really great for whatever that's worth I'm sure you get that a lot but it does.
You know I'm really not in the space but I really enjoy the conversation so if you'd ever like to get on a meet up for coffee
Strictly speaking in a techno space I just mean I really enjoyed kind of back and forth but I'm kind of a goofball myself and I don't also anyway I don't get out a lot it was nice to have some conversation I'll shut up now
I speak better in real life this was just voice to text so please forgive
not really getting good results with xxmix9realistic ..
Don't forget to use trigger words in positive prompt 
hmm i totally overlooked that
i'll give it a try again

but then again, a checkpoint shouldn't require a trigger word
i mean its not a lora
but thanks for pointing that out, im copying the model again from my backup drive
Yes, but the owner of the model said that there are trigger words like lora, so you have to use them, haha. 
ps. If you use it with Hires fix, I recommend adjusting the denoising strength = 0.3 and then using Upscaler (Latent) + 4x-UltraSharp (Pixel). If the denoising strength is 0.4 or higher, maybe the anatomy is wrong sometime. If using only denoising strength = 0.3 + Upscaler (Latent), the picture will be blurry as well.
ok ...i will make note of that
atm, im just trying a render w/o upscaling at 896x1152 res
preview looks better with xxmixgirl
what a big difference, thanks for the vital mention on trigger
for best result using with ADetailer (person_yolov8n-seg, face_yolov8n, mediapipe_face_mesh_eyes_only) as I wrote in the previous post.
for demonstration only (not cherry-picked
)
i haven't been using all those detailer feature, been using it for only face, curious to try those 3 features together now
Maybe 2 features are enough (face_yolov8n + mediapipe_face_mesh_eyes_only)
im gonna play with it to see the difference, any thought on using adetailer for hand ? i haven't much luck with hands
As for the hand, if it doesn't come out well, using Adetailer is useless. You may have to send it to img2img >> inpaint and then mask the hand and use it with Controlnet openpose full.
there is a tool that can help hands
more process = more gen time haha.
i get fairly good results with pony models when it comes to hand but sdxl seems to fall short
meshgrapho
meshgraphormer
I haven't used it yet, I'll give it a try. 
nice.. ty for that, gonna try it out
using adetailer for face and eyes seems to do a pretty good job ...
its a long video but he shows that we now also have separated detailers for eyes, nose and mouth as well
I often use Face Analysis in my workflows but we never actually talked about how it actually works. Here all you need to know. Remember to upgrade the extensions, these are all new features!
Check my Discord for the workflows, they are all free for everybody to use.
Discord server: https://discord.com/invite/W2DhHkcjgn
Github sponsorship: http...
they're pretrained detectors so they don't need any fine tuning
I was going to sleep, thanks for posting this, when I wake up I want to study this.
Is it better to use hires fix, or upscale a piece after it's already made?
Ty so much u did it ill 100% try it today and let u know how it is
Looking forward to seeing some great images from it. 🙂
I'll try my best depends how creative my brain works for prompts when I try it
Depends on time of day I'll try my best to not do it so late otherwise my brain dies

I know what you mean lol
Later it gets my brain dies and I can't think of anything creative to do so prompting gonna be a challenge for me basically
So I'll try ur model at 4 pm so that's when I'm most creative
Sweet! I should probably do the same and stop releasing my models so late at night. Lol
😆
Hi guys, I tried SD3, but honestly, my impression is that SDXL is a few years ahead. Is that right?
Sd3 is a base model, not a fine-tune. Most of the sdxl models you've used have been very fine tuned for certain common concepts like portraits of pretty women. In doing so, those models tend to lose a lot of potential variety in things. Base models are meant to be a jack of all trades and sd3 is very versatile for almost anything you throw at it. Compared to sdxl base and 1.5 base, it's light-years ahead
Hello what do you think about this image ?
maybe try an img-to-img pass with a lot of denoise
you might get something out of it
Yeah faces come out good with adetailer but help me please with the hands...
hands are a hit and miss at best
i like it its different, not typical ai vibes
Good idea
I typed uncanny valley
meshgraformer can help
*meshgraphormer
Nice ones
Thank you!
Cool but strange, reminds me of gmod with that weird jpeg mods that follow you 🤣
/prompt a woman
(Buzz Lightyear from Toy Story, design schematic). (detailed line drawing, black and white outlines, colored 3D main body, annotated labels, dimensions). (DSLR camera). (macro lens). (midday). (technical blueprint style). (Kodak T-Max 100 film) --ar 3:4 -- v 6
Next bots 😂
anyone knows how to make an inpaint model in comfyui?
powerpaint v2 and brushnet
/prompt a man
New model from the original makers of stable diffusion, Black Forest Flux
https://huggingface.co/black-forest-labs/FLUX.1-dev
(I added the boxes.) First model to understand shadows.
Transparency and reflections.
Very solid, not perfect. close up of the G chord being played on a guitar
Oh! So close. These are not cherry picked though. 1 generation each prompt is all I'm doing.
a painting by John William Waterhouse absolutely flawless. 😮
Perfect text of course.
More challenging text and it doesn't even blink.
"A word fitly spoken is like apples of gold in pictures of silver." At first you think it messed up and left out the word apples. And then you realize it put actual apples there. Is it really closing in on AGI??? 😐
Can I use FLUX with comfyui with 12GB vram?
You can but you will ran out of vram and ofload it into ram (which will be slow)
Im hungry now 😁 nice images
yeah, i'm running it on a 2080 with 8gb vram and it's slow, but works. 15 seconds per iteration lol... not really worth it, but it's an option
and im here with my 20sec/it with just loras
I'm getting good at styles.
Ooh this looks really good
More like a Asus2 
Damn, it is good with adherence
flux is like ideogram at home
Simply stunning
After ClownSharkBW
hello
...is it me you're looking for?
Flux on 4090: 22 seconds per image. It's a base model that does hands and eyes and keeps geometry pretty stable. I can knit-pick one missing wire on the leftmost wing strut and the right wheel's stalk joining the chassis too far back, but otherwise this is legitimately a well-constructed biplane. The model understands solid things and puts them together like block pieces. It feels so surreal.
(And this is the quantized fp8 version. The full version is probably better, but it's not even like even the 5090 is gonna help. This is as good as it gets consumer-side probably. 😐 )
three dice on a table, the dice have all rolled sixes Zero number-bleed! It didn't really get the concept of dice rolling sixes, but it understands that I want three dice, and it understands that I want three number-sixes, and it put them all on a table. This is just a case of it not having seen enough properly-labeled dice in its dataset, but the architecture is 100% capable of separating number-quantities from number-labels. 😊
Oh that is sooo good! 😱 No question it understood the concept of a reflection, not just two people standing next to each other. An anime girl leaning against a mirror. A girl beside her reflection.
(Reflections and counting are emergent intelligence. They're not behaviors you can get just by adding more data and labels. They only emerge if the architecture is intelligent enough to build generalized models of physics.)
Gah so close! No! 😭 This was really just so beautifully close. It gets the concept, it just doesn't want to write text backwards. I feel your struggle, little model. Gotta pass those text benchmarks. A sign that says "MIRROR" sitting on a bathroom counter beside a mirror. A sign and its reflection. The sign is reflected in the mirror, with mirrored text.
(Also bear in mind I'm one-shotting these, no cherry picking. You can always get better performance out of a model by running multiple generations and picking the best output.)
I can't get it to generate trading cards. 😦 They're not in the dataset, or they weren't labeled. That could easily be fine-tuned though.
But I'm still amazed by its sense of symmetry, the solidness of these costume parts, the fingers and the eyes and the overal body proportions despite using a non-standard aspect ratio, the structure of the sword, the correct folds and drapes in the fabric. This model just understands physical objects so well.
a trading card game trading card, gameplay element, card design, tcg card
512x512, 20 steps, 19 seconds
512x512, 10 steps, 11 seconds
512x512, 4 steps, 14 seconds? Why slower? 🤔 Also not usable quality, but it does seem a lot stronger than SDXL was before lightning.
Sampler / scheduler make a huge difference, no time to test all those now.
Bottleneck on my 4090 is swapping the VAE out of VRAM, so you don't get any meaningful speedup by lowering the stepcount at 1024x1024. 😦 It's always going to be 20-40 seconds.
It understands underwater hair and lighting so well.
swimming underwater, long hair
That's some flawless riding anatomy. It's just so perfect everywhere. The constructed lines, perspective, shadows, anatomy, lighting. Didn't give me motion blur though? Hmm. Well, no more tests until tonight after work.
derby, horseback riding, anime girl, motion blur
Had to try one more. pixel art anime girl fox
But styles can always be added with fine-tuning and loras, so they're really not important. But yeah, that's some absolutely flawless-looking pixel art right there. Doesn't look 100% accurate grid-wise. Not sure how well it would hold up if you actually scaled it down. Gave her two tales, but the style is good.
Gotta come up with some more physics tests since it's aced my normal ones. See what I can think up before tonight.
dream
With a massive 12b model, I'd hope so. That's a whole ton more room to store information, assuming they use it efficiently and correctly
I haven't updated it in a year, but try my old workflow
Eventually flux will have a cnet and loras and life will be good again 😂
OOOOOOOOOHHHHH
Are you having a smorgasbord at all?! 😄
Yes
🥳
Ryan McCree
@copper kraken I'm interested in learning more and using Ultrapixel. Can I dm you for details, custom nodes, etc., etc.,?
Impresionante! Que PC se necesita para poder crear una imagen de este calibre?
cnet or a lucky prompt?
yeah that's fine, also fine to just ask in general with images since we don't have a stable cascade channel anymore for reasons no one understands
might be other ppl with questions
Ok
feel free to ping anytime
Thanks!
sexy girl
that's a latent lol
You know what
to this day sd15 is the best for the last moment finetunes
like the last few steps to kind of finish the skin and eyes
😮
Yup looks fantastic on the hair, lighting et all.. except the left leg 🦵😅
Yep. Some weird calf distortion.
One message removed from a suspended account.
Go away Garfield. It's not Monday!
It's the weekend! Leave evil cat!
Why.
looks at hand the robots just don't get you...
These are not the hands you are looking for. XD
Wow, amazing!
The second one made me spill my beer
which model is this
that one's cascade
The story of the three detectives, in a universe similar to ours. Being closer to other universes in the extra-space, more anomalies happen. Commonly golden fishes glide through existence, as if from another place. The bureau of the Blue Rose is investigating this matter, trying to capture one of these ghostly appearances. The blue box was specifically designed for the purpose. Then more gold fish gather, showing up like an easy target. Goldfish slowly took over, grew bigger in size and numbers. An even larger shark suddenly attacked and snatched a goldfish. But instead of being devoured full, the goldfish got into the spirit of the shark. In this new body, goldfish had the power to take over. A massive war between the flying goldfish-shark and mankind resulted, which continued for many years into the future. TBC
This better be an AI generated promt or someone is doing a lot of weed.
puts the lighter down
😆
you can only really see the seam if you look for it, but i'm still gonna try to fix it
ok much better
upscalers amirite
we got a freaking scooter now
they take up like 80% of generation time
true 😂
Yes, a scooter with freaking flowers! XD
from this far away nobody can tell xDD
sigh
this is gonna take a while
lol

see you in 10 minutes 
0.43
0.5 was too high and 0.4 is too low because some of the details don't get fixed imo
i stick to around 0.2 because i try to maintain original details
theres no magic formula
true, and the most confusing part is that each image needs a different value lol
if you have lots of patterns you can get away with a higher value
a plain color with a slight variance in color will look different to everyone
but a square of slightly different leaves or bricks won't stand out as much
Favorite SDXL model for anime?
A1111 + SDXL
Which model?
And this.
These are from a personal mix i made. It is cleaned and i also removed the noise and finetuned it with 15k images.
This one is from a new version of Mangled Merge XL that I'm working on. But if you're looking for pixel art, the current version of Mangled Merge works great.
Still need some work, but the details are good. Not many loose ends or washed out details.
Fine dust particles from charcoal barbecues, as recorded with an electron microscope, dense
Here is the image you requested
Trippy
draw a dog
The world is so beautiful when one looks at it with the innocent and hopeless eyes of a child: the moon rings and stars in the night sky are beautiful, the streams, the beaches, the forests and the rocks, the goats and beetles, the flowers and butterflies are beautiful.
anyone got this canny working with flux on comfyui? https://huggingface.co/XLabs-AI/flux-controlnet-canny
Nice!
artem skibidi toilet
I think it's finally got "holding things" down.
almost lol
the platforms are an issue though ...
Hi! this image is Upscaled with Ultimate SD Upscale, generated with Flux.. I like the upscale exept for the color bleeding ang visible seams? any tips on upscaling images?
How to create images like realistic and high quality like that? When I want to increase quality, it moves away from reality.
Like this. You can see it in its iegs.
And wings.
How can you do that? This quality.
What is flux?
Can I use that in A1111?
With xl models?
Its is a model, like SD3. Yes, I think there was an update for that. Search your custom nodes.
You can also try a lower CFG. A lot of sdxl model tend to show more realism at a lower scale.
It is all about the model you use. As written above, this is a custom made one.
Try the tile_colorfix preprocessor to avoid color bleeding. What I also like to do is to have two ControlNETs on—not just ControlNET Tiles, but also with IP Adapter, as IP Adapter dramatically reduces the hallucinations and allows for much higher denoising strengths to be used, allowing for much higher realism. Here is my example:
Here is the initial image.
This is the initial upscale. I used IP Adapter at full strength and let it stop at 0.5 so that the details would look right, using the DPM++ 2M Karras sampler. I had ControlNET Tiles at full strength (1) here and full time (1), and used a Denoising Strength of 0.75:
After that, I manually fixed some errors via drawing and digital tools, then inpainted them to make it more natural:
After that, I upscaled with the same settings I said above, while also using the same initial DAT upscaler in Ultimate SD Upscale (4xFFHQDAT). I didn’t mention this for the previous one, but for all of them I have the mask padding and the mask blur turned all the way up to reduce seams:
After that, I did some more inpainting and corrections, blurring any seams in unfocused areas, and then it came to the final result:
@raven elk You can also do this using SDXL upscaling too (I used a 1.5 model for the previous upscale), like in the examples below:
Flux is SD3 if it was good.
Like this. You can see it in its iegs.
Like this. You can see it in its iegs.
Like this. You can see it in its iegs.
Like this. You can see it in its iegs.
I bet she knows kung fu.
