#✨|sdxl
1 messages · Page 179 of 1
they mention it on their page
sdxl 0.9 is great still
which means it's time for Loras on SD3 😉
It's hands down better than SD3 (pardon the pun) I can bang out artistic abstract images all day in SDXL just using prompts and my custom workflow. SD3 just can't produce things like this currently
@verbal coyote Is your AnyLoRa XL model trained with tags? Or is it trained with natural language captions?
both
not for much longer tho
faces in sd3 are really varied and realistic
sdxl and sd15 look samey
so lol so much for consistency!
which we never had relaly so far to begin with
Will this channel be archived too like cascade and SD15? 😄 :))))
holy shit whats going on here, nothing makes sense, is this a michael bay movie? XD
it fucking better not
we are running in all directions sir and shooting and exploding buildings
/dream,set up a minimalist yet magicaforest
Nope, but SD3 channel will be.
Waw!
What are ya'll thinking about my realism approach. Think the skin texture should be a little bit less
make sure to watch in full ress
A Indian student angry on University
Here is the image you requested.
Congratulations! As my 1000th prompt, you have won a free upgrade to SD3 for this image.
Man on moon
Did you lose? 😄
is anyone here an exepert on installing stable diffusion?
I keepp getting this message
you do know that posting a jpeg error is an insult, don't you? There are even countries that sentence people to death for daring to post error logs as images.
(pastebin.com)
well i'm sorry but i need help installing stable diffusion and I need to show someone what the problem is
Holmes in search of sharks
I can tell you have the cosxl going. Colors really pop with it.
No I have not (it's not available on DrawThings), this is Mobius
Wow really ok I guess my phone is just making it look extra special
It is very vibrant rich colors, contrast... really good stuff
i've gotten crazy color with my dora too
really helps when the convolutional layers are trained
Makes me wonder if your stuff is like cosxl where it goes outside the normal bounds of sdxl colors
problem with training on base vs cosxl is you have to be really, really thorough to cover all bases if you want that pop
yeah, it def does
it's capable of much much darker images than any finetune i've tried too
the issue is if your training set is limited (as in, 691 images like mine, not millions) and not very broad in scope, you end up picking up the muted sdxl base colors the second you step outside the bounds of your trained captions
is mobius capable of really dark images?
Yeah, I really want to try your dora. It looks outstanding, very curious how it will do in my DT workflow...
trained on base
haven't tried it, I have a darkart mix which can do that
good with default stuff but my nodes/workflow really brings out the pop and detail
thanks, downloading it
can do shit like this if you have the nodes too https://github.com/ClownsharkBatwing/RES4LYF (WF embedded)
I'm not able to run comfyui, don't have a windows pc
mac?
intel mac and ipad pro m2
https://github.com/comfyanonymous/ComfyUI You can install ComfyUI in Apple Mac silicon (M1 or M2) with any recent macOS version.
idk if you have silicon
not too familiar with mac stuff tbh
ahh gotcha
That's why I'm using DrawThings which is actually pretty good
@uncut steeple Check PMs please
Is there an extention like latent coupling for sdxl?
love it 😄
so sdxl still top dog?
What's the best canny controlnet for xl finetunes right now?
until a community non merge model and/or sd3 large shows it is better.
cat
for some reason shrek is completely intact
4k
Can we run it in DT?
What is DT?
ah ok
is this the distilled version?
I haven't tried that. I assumed that's only for phones and very small devices
Full size is only 3 gigs
sorry but where did you see that? the distilled model is 6 gigs in their repo and they say it only reduces inference time, not vram usage
What are the best settings to train an SDXL character model with kohya at the moment? All the videos on YouTube are pretty old. For example, how many steps are necessary? What other settings should be made? Thank you ...
Is RES4LYF a workflow?
it's my node pack
whoa... downloading it now.
that would be huge if it's twice as fast and no performance drop
you probably already have math anyway
if not, just pip install it directly
Do you have a wf png or json?
Thanks
https://huggingface.co/ClownsharkBatwing/CSBW_Style/blob/main/csbw_225tkn_r96a48_12epochs_base1-0-fp16_save-fp32_mixed-no_prodigy_decoef08.safetensors and this is the most recent version of my dora that's used for that image (3 days ago)
np
you prolly have that i think
so the distilled version of hunyuan is like hyper-sdxl or lightning. it lets you run with half the steps, so twice the speed. seems to still look good.
i'm using a refiner...
so the speed is much faster, and the output still looks great.
do you see any difference in quality between the two?
distilled / original
although the seeds aren't going to perfectly line up
knowing that it's going through a refiner, i don't think there's an effective difference.
they both come out looking great.
is it possible to use sdxl with a 3060m?
Too many missing nodes - Manager can find none of the missing nodes. Update.bat was run however no change!
those are all my nodes... can you check your console log for errors?
my guess is there is an import missing
Create an action-packed scene from the movie "Rampage" featuring Dwayne "The Rock" Johnson. The scene shows The Rock standing heroically in the foreground with a determined expression, holding a high-tech weapon. In the background, a massive, genetically modified ape (George) is rampaging through a city, with buildings collapsing and debris flying. The sky is filled with smoke and explosions, and helicopters are circling overhead. The overall atmosphere is intense and chaotic, capturing the high-stakes action of the movie.
Cinematic, realistic, high-detail, with a focus on dynamic action and dramatic lighting.
g
ok but next time do it yourself
democracies and other political regimes in the contemporary world
Juggernaut xl + luma ai = liminal space found footage
cool!
Hi🐣
how long does it take to gen one?
Thanks !
Depends on the queue
Old 'something riding something' with SDXL :
I am just trying to see if I can replicate those more complex prompts from sd3 channel. Never really tried with xl to do stuff like this..
Omost is what you want.
Seems to work without omost just fine. But I've heard of it.
Looks very good. I'm curious about the workflow for this upscale
It's from afro_man and should be in the image.
Workflow is on Civit https://civitai.com/user/Afroman4peace
Turn off your room lights and see this image for 10 secs
btw give it a shot now... i updated requirements.txt to contain:
matplotlib
pywavelets
numpy```
and nothing else this time
Manager cannot locate these missing nodes, sad to say
I have too many conflicted nodes not importing - prolly my bad!
i was wondering with the requirements.txt command that doid'nt work for you before
in the RES4LYF folder: when you ran "pip install -r requirements.txt" you got an error
Adjusted my upscale to 9.5MP
No error
k cool... do the nodes load now
wow amazing 🙂 workflow Comfyui?
I'm generating from a discord bot using A1111 API
It's basically Hires, then Ultimate SD Upscale with high denoize (0.5+), high padding (256+) for consistency and Soft Inpainting to be seamless.
The one above is not seamless tho, was testing.
The images is always split in 9 tiles for USDU, whatever the ratio.
thanks for info
Does anyone here use SD 1.9?
There is no SD1.9...is there?
Raw generation / Details++ option
Ah, this is the webui, not 'SD'. As SD is the model
a okay
"I believe"
Just some SDXL cuties for the kids 🙂
Amazing, idk how but Luma AI won the race against sora 😁
Night sky. (Zoom-in)
that's fn awesome
Yes haha
Now we need a localy and open weight model with the same level
True, I hope svd gets updated and could achieve something close (but the requirements would be huge i think)
This is from a upcomming SDXL model I am about to release
Looks great
any suggestion for a retro anime style Checkpoint/Lora for Pony or SDXL??
thanks.. this one is from a released one.. (Colossus Project XL 10B) with the workflow I published for it.. I am still working on it though (WF)
Hô, very cool !
What denoize are you using while upscaling ?
With a1111 and Soft Inpainting, i can use a higher value (0.48-0.52).
Also always splitting the image in 9 tiles, it helps to keep consistency as well. (between left and right eyes on a portrait, for example).
Before my last webui update, i was also able to run Adetailer in each tile.
Finally, Soft Inpainting puts everything back into the image in a perfect seamless way.
The workflow should be inside the image. I use comfy ui
It also works well with Forge though
this one is a raw image from Colossus 10B I made with Forge without upscaling
GN its getting late here
SDXL+Deforumation https://www.youtube.com/watch?v=78QZtbduvbM
Deforumation
Rendered in 2.5K, upscaled to 4K
Fast food slogan
Big fan of the CosXL model that Stability AI released. One final model variant for SDXL 😄
Very good bright and dark generation ability
Visibly improved from SDXL original
This still makes great images.
Using my CosXL merge from https://civitai.com/models/240590/ultimateblendxl
绿色眼睛猫咪
Here is the image you requested.
在哪里创做的,我怎么没看到有创作的地方
Because of the current licence of SD3 I made up my mind and do something unreasonable.. As a checkpoint creator I normally holding back some checkpoints. to release later.. this is one of them. Its based on my own research. Today I will release DEMONCORE 4.1 MIDGARDBEAST. Try this with SD3 🙂
guys here is a way to add T5 to sdxl by i'm not a programmer so i don't know how could we do it : https://github.com/AIGText/Glyph-ByT5
@high skiff To not spam SD3 channel
Here is another test with the guy.

Before / After
Took 3K film grain against a grey background and re-interpreted/upsampled it with prompts in SDXL. It then renders in 3K straight from SDXL. Those examples are not upscaled. But very hard to control. A 0.01 decimal will change the output.
It's a merge with cosxl
Hey. So I learned something about cosxl merging yesterday because I'm a dummy that never did the math before. 🙂
Those input blocks 0-4 should be 0.1, not 0.5 or 1. That means it will still work but give the highest weight to the existing model you're folding into it. Definitely better results.
I mean output blocks.
Nice! I merged it a while ago and can't remember what I put in there.
The default workflow has 0.5 and I had done 1 thinking that was max my model, it was the opposite.
Putting 0 is bad as well, so 0.1 is the best of all worlds s
I did it with mobius yesterday and it's massively more prompt following than 0.5 or 1.
I didn't want to block out what CosXL is capable of though. It is sometimes better at composition. I'll give it a try later, thanks.
Möbius is probably better at prompt following than most any other model. Cosxl is better than sdxl base, but these other fine tunes are massive better than it. I've now got a workflow that does mobius cosxl and refines with aventishorizon cosxl and the output is great. @meager canopy
庆余年网页设计
Did you mean input or ouyput, because input looks terrible if I change those.
Output are the only ones defaulting to 0.5, all the rest are 1.0
Yeah output only. The same blocks that default to 0.5 in that workflow. Just set those 0.5s to 0.1
What did you set this too?
I left that alone
Also depends on your input model. Only looking at the hands, your default looks better than the 0.1. But maybe your own model isn't great with that. In side by side with mobius and Adventis, they both turned out better
This is Mobius
Setting min to 0.001 look better
...or not. I'm not sure if any look better or just different.
Hah that's like 99% of what we mess with. Better or just different hah
Big difference between the fp32 and fp16 Mobius too
Those were all fp32
After the recent developments I foresee a long and prosperous future for SDXL.
fp16
So as far as I know, mobius has a fp32, but cosxl only has an fp16, so the resulting merge is fp16. I used the fp32 input. Are you aware of a cosxl fp32?
NO, but the output of using the 2 are different.
...after merge
Ah. Ok well I'm glad I used that one then. 🙂
Just realised my seed was still on random 😦
Fp16 and 32 are identical in output
Independent of cosxl, there is a difference between those, but the majority of the time it's not something you'd care about.
It's not like a concept is different, just that a pattern on clothing is a tiny bit cleaner.
I meant after merge
My model after merge
Mobius with same seed/prompt after merge
It's a bit more "dynamic" 😄
Oh, I forgot. I like every other model, mobius hates perturbed. You have to get rid of the nice completely for it not to be messed up.
Unlike
Node
I was getting the same issues until I got rid of it
Not using it, that was during the merge.
Can you share the prompt, that merge probably just has lame settings by default.
cinematic film still Power Armor, sexy young woman with an angry face, ready to fight, On a rainbow bridge to Asgard, Energy gauntlet, power-enhancing technology, Powered up in brilliantly shining Power Armor, a sexy young woman stands with an intense expression, her fist raised and ready to strike. The armour gleams under unseen magical lighting, hinting at its advanced design. The setting is a majestic fantasy rainbow bridge that stretches across vast distances between two distinct realms
The image above (Mobius) and the one below (UltimateBlend_XL_v2.6) are with the same prompt/seed.
dynamic panorama of A mystical, glowing portal that seems to lead to another dimension, guarded by a group of ethereal, glowing creatures with flowing capes and staffs. by Hayao Miyazaki and Pascal Campion
Negative:
signature, logo, watermark, colourless, featureless
So one with no hands. One with good hands. One with bad hands. So it might just be seed dependent.
Yes, I'm not too fussed about that. Mine followed the prompt better for the other examples though.
Trying the other one now
I do like the output from Mobius
I added the negative I use
That said, some good outputs on that one from hunyuan and pixart as well
Do you plan to put the new 0.1 ultimate blend 2.6 up on civitai? I could try using that with my automation for refinement instead of aventis.
Really good
Heh 😛
I wasn't going to, it'd probably be quicker for you to merge v2.0 yourself than me upload it to Civit 😄
Do you feel that 2.0 with the merge would have the added prompt following you mentioned over mobius?
Wow, so much flatter colors
No, it's already pretty good.
That was with a lora and upscaled a lot.
The flatter tones look much better imo. Many models do images that look like as if someone just discovered Photoshop filters
Boo this man! 🙂
Haha ❤️
its not bot?))
Ugh, just imagine what SD3 would do with this...
Haha!
Reroll of my irridescent squirrel with extra noise
I have just uploaded my probably best SDXL model so far.. https://civitai.com/models/155977?modelVersionId=576344
Very cool gallery. Neat
Can anyone recommend a good outpainter for SDXL? One that doesn't get all morphed and blurry 😛
SDXL is sorely lacking in the in/outpaint department. if its not a gigantic tableau i'd honestly say use something SD 1.5 to get the base down then force and img2img over the top of that to add in details. lotta work but SD1.5 just seems more into in/outpainting than SDXL.
Oh hey, you here o// Thanks, will try it out, just to lazy to do stuff by hand. I also have Photoshop, but the results ain't great either.
oh lol. didnt even read your uname.
pix art sigma + sdxl refiner. Last one SD3 + sdxl inpainting
Awesome work. I love the details
ahah thanks, trying to adjust some values
Since you're going to train an own lora anyway, go with SDXL, you will just have way more options on how to refine etc.
Well the main advantage that Pony gives you is that it's trained on all kinds of cartoon characters, animes etc. so it recognizes them without using loras. By usng your own lora you kind of overwrite it's advantage. Ther are just way more models out for SDXL to get your desired look.
⛏️ ⭐ Amazing but @limber lynx and @freekhitman have done it and we have a new golden pickaxe leader for the SDXL Top 10 Models! Demon Core Midgard Beast (a bit of a mouthful but worth it!) totally smashed on the prompt adherence test). Make sure to follow the recommended settings and you'll be impressed for sure! 🥂 😱 https://docs.google.com/spreadsheets/d/1IYJw4Iv9M_vX507MPbdX4thhVYxOr6-IThbaRjdpVgM/edit?usp=sharing
iterative unsampling
Jugernaut xl v8 + dream machine
omg thats wowz
Awesome details, which model Is It?
Wow these look amazing compared to SD3. It is good to have a good model, whatever else comes. SDXL is good enough.
Thank you! realisticStockPhoto_v20.safetensors as a base and realvisxlV40_v40LightningBakedvae.safetensors as a refiner. Lora on 0.2 weight: SDXL_FILM_PHOTOGRAPHY_STYLE_BetaV0.4
Before/after refinement
Those are some very good models.
Yeah for people I usually use stock photo as a base, because it does not screw up faces, then refine it with different models depending on what I want
generate minimalist logo for a plumbing business named "H2O Plumbing", where a Tap is present and a drop of water is falling from it
SDXL still top dog.
/Uncomfortable /Kappa
So does anyone here remember SDXL launch. I wasn;t around. Was this channel on fire like the SD3 channel is now when XL dropped?
I don't think so. People was busy enjoying the model, actually.
One of my very first SDXL pics on Civitai :
When I try controlnet canny SDXL img2img it gives me these weird blotches on the wall. (When I do it for depth, it gives a repetitive fine dotted grainy pattern). What to use for buildings and walls instead?
the only issue i ever had with controlnet sis they make the image more blurry
SDXL+Demofusion 4096x2048 raw render wihiut upscaling. View in full screen
wonderful but why 2 suns..
Haha was just my very first test render. It's crazy slow : D
Will see how good demofusion is
well that was a hot mess
4096x3072 Freaks out completely lol. Will try an other model
i was told to work in resolutions of 1MP and then upscale thats for sd3 my bad.
Resolution too high for my blood pressure
lolol
Thank you very much for testing it. took me by suprise 🙂
4096x3072 raw render
#✨|sdxl message here's a link to the last message on July 25 at 11:59pm Eastern time. Sdxl dropped on the 26th
4096x3072
Cat with 4gb vram send help cute bunny in a tea-cup bilateral quad by artist "zentangle",tesselated polygons, afremov hues; by artist "tricolor";
Here is the image you requested.
pixart sigma because i had my app up while i worked on it(the cat with 4gb ram thing is a user on here, so i omited it)
And here's a second one from HunyuanDiT1.1 this time:
That's one of the greatest nodes in history
so fun
Is there a way to use controlnet with ipAdapter? I'm getting an error with controlnets selective range of applying conditioning
i use them together a lot, what is the error?
hmm thats basically what I was doing, maybe I got some connections mixed up
Double check your models. You might have an sd1.5 in your ipa loader
Or in your controlnet loader
Basically, make sure they're all sdxl models being loaded
seems to be good now that I redid it from the beginning
must have had something in the wrong place
Cool beans. On a side note, damn is mistoline+IPA PLUS a strong combo... I didn't expect that troll waifu to be spit out so close to the input one. I had the cnet turned way the hell down too
im gonna have to try that one out
I really like the dust on the hood in this one
SD3? Where did his legs go?! 😄
Grid-Girl
She's wearing her parachute 😄
Location is good, clothing...not so much
Lol
More SDXL Demofusion tests, watch in full screen/full res
What is this?
It's this: https://github.com/PRIS-CV/DemoFusion?tab=readme-ov-file It's a highres bucket renderer that runs with SDXL models. The images above have 400 iteration steps each.
Let us democratise high-resolution generation! (CVPR 2024) - PRIS-CV/DemoFusion
This is lacking the same detail though
How did you get those so sharp though?
Just seen the embedded flow
I run 240-400 steps on each image. The sharpness it does automagically. It's so sharp that it's kind of uncanny super real. If you know what I mean.
It also samples completely differently somehow. Everything looks so clean lol
Did you do it with Demofusion?
Yes
Could be the model, so I'm using the same Jug as you
Hm maybe you didn't install the package right? How did you run it?
Try setting steps to 30 at least
It was set to 80
😮
I run the cmd line version without gui, maybe it downloaded stuff that comfy doesn't have
Yeah it's slow lol. It's just a pity that even though it could, it doesn't add like many objects. All scenes look like super sterile at that res
My 4090
It's liquid cooled, and not been that hot for that long before
One image and I think I'm done with this node 😄
😛 MY heart tells me to abandon i too, but I just can't resist.
Any tip for fantasy skin color? If I do get one then the entire scene is that color, clothes, background 😭
Heh interesting 😛
40 vs. 120 steps, same seed
No it's the detailed one. It does 4 phases, so 4x30=120
The low res one is 10 steps per phase
No, but I don't know how it handles it in comfy. In standalone it does4 phases per default. Interference steps I usually set to 50-100
Oh same for you then
The inference_steps is just how many it does each step.
Ah... The following part of your input was truncated because CLIP can only handle sequences up to 77 tokens:
It does kind of a weird thing that it decides on what to detail and what not, so steps 50 is kinda required. Here an example where it just detailed the chrome ball
Yeah some models work better than the others. In realvis I got this:
Could be because it's a Hyper model and we're using too many steps...? 🤷🏻♂️
I have no idea haha. It's just so tempting. I love how it looks, but prompting it is just impossible.
10 steps is very bad
Yeah. Anything above 40 is fine. 10 doesn't work. https://civitai.com/models/155977?modelVersionId=576344 seems to work bst with it so far
You somehow don't get that clean look I do. It's more of what I'd want
Could be this prompt
What did you prompt?
Two majestic three-decked sailing vessels engaged in an intense maritime conflict, their hulls adorned with intricate carvings and flags fluttering in the wind; the sun casts long shadows across the rippling waters as flames from exploded cannons illuminate the horizon; a heavy mist swirls around the ships, ropes and sails, the ships are silhouetted against an orange and crimson sky as the first stars appear on the horizon
20 steps takes 1 min to run
No control over the noise at all though 😦
Hm you get a much grittier look. What model do you use?
The dragon one is nice!
I wonder if AR affects the doubling up and ships in the sky?
You managed to avoid them 😄
Oh, you're not using the same node 😄
I thought it looked a bit low-res...lacking detail
Same playground scene, same seed, 70 steps
Not just background, the ships are messy too
Speaking of HDR. WHat is the best way to save real 10or 12 bit images?
I got 2 HDR screens, monitor and TV
Don't compare with an image that I demonstrated as bad! 😄
I'm trying to improve your image, but it's too lacking in detail to start with.
Extra noise is the secret
To unleash the model
One of you mentionned 400 steps for a pic
How is this amount distributed ?
It's like xx steps per tile on a tiled upscale ?
Or xx steps per pass on the whole pic ?
Wizz sad when good things are standalone and not as an extension/nodes
It is a node, Demofusion
You enter steps and it runs 4 iterations over the image.
ah, thought it was a standalone gradio gui
how much time for a pic ? regarding the card
I'm also running a lot of steps, but result may vary
That was 10 steps in the ksampler 🙂
Also using Soft Inpainting in a1111, so i don't know how many steps SI is running. But i'm at 220+ without
Simple. Lovely !
Woooh that'S a difference
I have a 4090 and that one takes about 1 minute when I use 20 steps, so 80 steps total. That does need a Hyper model for low steps, otherwise twice as many
Hô i see.
It takes me ~850sec per pic with a 3060, kill me
I'm going the slowest possible way on purpose, tho 😂
I'm downloading boltning right now, will test with 70 steps for fun
You may fry your GPU 😄
For some reason my gpu runs at 78%
Perhaps because you're using CLI and different drivers?
Or cause of the res !?
I think you can reach the maximum usage only with square pics ~1MP
Hm will see, maybe I'll install the comfy node too. Currently running standalone, needs 17gb of ram too
It can do really large resolutions. The last pic I posted shows it was creating at 3072x2048, but it can do larger.
I'm still looking for the best upscale using a1111, limited with 12gb vram
I'm still improving my pics every month
You've been posting some great images!
70 steps, 2x2k. For some reason it did only 2 phases though
With a1111, when using Ultimate SD Upscale, i'm able to run adetailer in each tile, using the API.
I can't anymore, since i updated ...
I can, but i have to choose between ADetailer and Soft Inpainting, can't run both.
I split my pic in 9 tiles, whatever the ratio, and i use a high padding to keep the tiles consistent each others. So this way faces and hands can get 2, 3 or eventually 5 adetailer passes
@timid garnet read about this yesterday, but haven't tried yet https://github.com/deroberon/demofusion-comfyui/issues/9#issuecomment-1967333169
As you know demofusion can be used as a way to detailize current low-res images. So the image to image is quite a great implementation. The source of img2img from demofusion is like this: https://g...
Any of you guys tried lumina?
Nop
Installing demofusion via VoC now, see if it makes a difference.
Got this today. I'll just leave this here. 
ALso @timid garnet will try this today: https://www.reddit.com/r/StableDiffusion/comments/1cbaxsu/introducing_hidiffusion_increase_the_resolution/ looks promising
Ah, btw.
My current upscale process is at the limit.
My "only" way to improve the result, using the same process, is to improve the input image. So to improve the base generation and the hires parts.
Do you have a1111 installed ? and are you familiar with the 'noise_multiplier', or the 'extra_noise_multiplier' ?
I do, but only used a1111 to run deforum animations, don't really use it outside of that
If you can open it, i can show you what i mean.
You'll need to create a decent pic, well prompted, using hires as you usually do, and ping me.
The "how to create details" should be obvious at this moment.
Then, report and improve the logic on any UI
As a Deforum user, you should also understand what's going on with the noise multiplier.
The normal 'default' generation process that we're all using heavily limit the ability for the model to actually draw what it wants to draw.
Comfy and other solutions already use what i'm talking about, to be clear, but it's not on your control
I cutrrently have to render stuff for work, will ping you for sure later or DM you. Very interested for sure
Let me find some comparisons
Yeah would be awesome
Found them
These comparison are using hires, then my img2img payload for ultimate SD upscale and cie. Not Raw to upscale
The only difference is a single setting in the Settings tab
Random non realistic pic :
Some prompts are way more impacted than others.
You can feel what part the model is done with, and what part wasn't done yet.
This literally 'unleash' the model.
This setting apply to hires and img2img
Settings/img2img
The reasonable range is 0.075 - 0.1
This is exactly like Deforum, you have the same settings doing the same thing in Deforum
Ah, this is for normal SDXL models, for ~30 steps or so. Need to adjust for lightning or Turbo as well
Here is one without a huge impact :
Particles in the air aren't a bug or noisy artefacts, it's on purpose. The model want to draw them.
Very nice! The example with the girl and trees is heavy
Yes !
I'm like a 'noise expert' with deforum, i'm playing with SD since like 1.5 year... and i just discovered this a1111 setting last week.
My life is a lie
Also, as mentionned, it's not under your control in comfy or other solution, like Demofusion. It needs to be under your control.
I know that feeling. Every time I think I figured stuff out, 10 new rabbit holes appear out of nowhere
Are you sure about that? 🤔
For my pics, i'm using overweighted prompts, and only hires x1.4 (vram limited).
The pics could be, as i tested, wayyyy better if i coulb hires to 1.6 or 1.7. Wayyyy better
Noise injection in Comfy is a thing
My current problem is the pre-img2img part, it need a huuge improvment now.
this
Underated and not known enough by the pleb.
My other way to improve would be to downscale and redo the img2img process. But would need 25-30 min per pic, this is insane.
I'm using 0.48-0.52 denoize, for hires and img2img. I wanna keep as possible the raw generation unaltered.
So my only way to be seamless is to use Soft Inpainting... adding 50sec per tile. Also blocking the webui, for some shady reason.
If someone have a solution to be seamless, while using extra noise aand high denoize, it would be the best day of my SD life
Raw generation at 1MP / Details++ at 9.5MP
I mean, i need to replace Soft Inpainting, so i can run ADetailer for each tile.
Huh, stop bullying me 😭 😂
I'm using a Discord bot as UI. This bot is using A1111 API.
I won't rewrite the bot, so i'm stuck with A1111.
To be clear, i love it. But maaaan, why everyone is doing things for comfy 😭
😢
I can use Comfy in a1111 tho. But it's limited. Never tried
Because it's much easier to customise and do what you want.
There is an extension for this
@timid garnet Your image upscaled x1.5, a little noise added and sharpened
That's using your upscaled image as original
Damn buddy
And it's only one node. This is so sexy
No, that's a whole workflow 😄
The better the base image, the better the upscale. But i'm limited. meh
This is really good and consistent
Not a huge difference, but it was a good upscale to start with 🙂
My last generated pic :
When i see it, i'm like "Meh".
But then i compare :
Still, it's not enough...Wizz frustrated
Why do you need more?
Nobody will ever look at it as closely as you, and then it's forgotten about 🤷🏻♂️
I also tried to run several models in ADetailer, for several classes.
Running after Hires, then in each tile. That was super long, but a cool experiment.
True.
But I know 
This is actually my third option
Lower the res would improve immediatly the result.
I delete all my images after uploading them anywhere.
As i'm using discord, all the pics are in my server, and it's easy to search to download the pics from it
I keep them tho.
I'm uploading to Civitai, but the pic is not really the original
How's this one?
Also really fast lmao
Compared to me
As usual. I'm often "meh-ing", but the comparison is speechless
Is that demofusion?
No, I can't use demofusion to upscale in Comfy....can I?
The best workflow i've (not) seen so far is Zavy's workflow
But he don't wanna send it to meeeeeeeeeee
Riot

Oh sorry. Standalone has img2img with a scroll comparison , forgot you're using comfy
@meager canopy boltning 4kx4k 80 steps. at many steps it starts dreaming 😛
That's its biggest problem...and probably why I won't be using it. 😦
I'm also addicted to semi-random prompting
Trained a GPT2 model for this months ago. Still using it. So small, it's almost instant using the cpu. Text-completion mode, not as chatbot. Lack of variety over time, but still decent i think.
I have a comfy workflow with my prompting group. Need some manual edit to fully work if someone is lazy as i am.
cf. My civitai gallery on my profile.
Found you. Very cool gallery ((:
This looks amazing. Might have to learn how to write an a1111 extension.

Yes you should.
miyazaki mustang
I notice there are already 4 comfyui node implementations. I'm guessing a1111 integration must be much harder, and I barely know python. 😐
ahah jk jk
For Hidiffusion I need to downgrade numpy, but get errors. Anyone knows hot to solve it? Attached error log
Out of my (small) skillset
Maybe dumb, but make sense to me
Can find the file here:
https://pypi.org/project/numpy/1.19.5/#files
So
pip install "numpy-1.19.5-cp38-cp38-win_amd64.whl" is want you want ?
Hm will reset my voc env and see, but thanks
Had trouble with numpy recently anyway with other tools
Must have messed sdomething up along the way
Not a dev lol, i usually shut up when talking about code
Jesus voc downloads at 340 kb/s 😢
I have a new workflow
I'll post pics here, and you'll upscale for me.
I can pay with love
Interesting fail. (just a swimsuit, the woman is half in the water)
pg13
Is there an SDXL model that works really well for rpg battlemaps?
Blue viking tatoo, burning man, futuristic
Blue viking tatoo, burning man, futuristic
4.8x3.3k SDXL Hyper+Deep Shrink
whats the best web ui for SD if I have 6gb vram
currently using foocus and foocus mre
I am pretty new to ai image generation
which webui wont be your main concern I wouldnt think
but if you are new, automatic1111 is pretty user friendly
I heard its heavy?
I know there was a memory leak in it at one point, but the webui shouldnt be using your vram itself
I see, and it has ample guides on youtube?
absolutely
Since I really want to learn
nice nice
a lot of web uis tbh
very confusing which one to start on
auto1111 will get you on your feet pretty quick. learn how things work, what different settings do, the general workflow, then move on to something more advanced like comfyui. thats what I did
node based one?
yeah
that must be a steep learning curve
whats the real benefit of using node based system
making your own workflow that runs in one press, rather than having to do it in multiple runs, with each part in a different part of the UI.
also many many custom nodes to do things auto1111 doesnt have.
I see
so web uis dont have any impact on vram usage? Saw a few videos on youtube claiming that SD next leads to a 60% faster generation compared to a1111
auto1111 is like a toy car that you can get addons for to change how it looks and rolls.
comfyui in this example would be like a box full of lego car pieces that you can assemble in any way. you can make a limo 6x the length of the toy car if you wanted to.
running the webui itself wont make an impact on vram, but the settings that affect how the webui handles models can.
Alright then. Thank you for the guidance. I'll run foocus as well as download a1111 to learn more
about ai generation
I thought you were concerned about the vram usage of the webui itself eating into you available memory that would be used to hold models and whatnot
I was just concerned about the image generation time
but yeah, try em out, follow guides, and change parameters for the hell of it to see how it affects things
right
yeah, I normally just keep this up on my second screen
more style transfer fun
Yeah I undervolt my 2080 and lose maybe 5% performance in short benchmarks, but actually gain performance in long throughput because it stays cooler and doesn't have to thermally throttle at all(like if you're generating a bunch of images back to back). Drops my max wattage by ~50w
A girl, walking on the grass, full body, smiling
This card is super old, it's a 2080 FE if that gives any clues lol. I've never repasted/repadded it either. It will get to around 70C(the hotspot is probably 80C, but task manager doesn't poll that temp, have to use another app to see it) when doing long generations.
But ironically, the fans never spin up higher than 60 or so percent
Foreground: On the steps next to the circular sports ground of a rural school campus, a young male dressed in a white T-shirt and jeans is walking alongside a young female wearing a white long skirt and a pink top. The two are wandering on the steps, surrounded by many flowers and plants that sway in the wind. Background: A spacious rural school in China around 1997. Styled in Chinese animation. Long-focus lens. Side-tracking shot. ┬─┬ノ( º _ ºノ)
圣旨 龙纹 金色
Maybe it's better to speak english, so more people can answer you 🙂
@storm rootthanks!
i feel you, but i usually just tinker with stuff while i'm busy with other things. oh and here's what my gpu looks like after 20 minutes straight of generating a batch:
so as you can see, the undervolting lets me stay under the thermal limit (dunno if that 88C limit is for the hotspot or the edge though). i think this is a ~250w card btw
not bad. and no, undervolting is different from power limiting. undervolting is hard to tune, but if you do it right, you'll end up being able to hold the same high clocks, for less power. my power limit is set to 100
actually, even higher, 124% lol
see the custom curve for the voltages? it took a while to tune correctly, without causing crashes and stuff under sustained or transient loads
the thin line is the default curve
basically, i can get higher clock rates at lower voltages, which means you draw less power overall. anyways, it's helpful to do shit like this if you're going to be redlining your hardware a lot
i also did the same to my cpu, but just did a flat -0.125. saves a shitload of power under avx loads like rendering (i do a lot of 3d rendering and am in the game industry). and no, i don't OC ever, there's no point. like you said, just buy bigger. not worth the ~5% more performance if it means your shit's going to fail in 1/4 the time
how to set in automatic 1111 to generate one image in several resolutions ? that is, for example, 2 images in resolution 512x512 and 768x768 ?
in X/Y/Z plot there is no option
Does anyone know how it would be done ?
You get a different image created with any change of resolution. You'd have to do an upscale of the original image to change the size of it.
How were you creating the large image without duplicates? Didn't you say you weren't using upscale?
Automatic1111 is beginner friendly but if you're a bit used to it id recommend SwarmUI as it has a comfy UI backend but also a simple generation interface. So you can usea easy interfaces and if you're up for a challenge can learn comfyUI
I'd post examples of my stuff but i currently got it setup for R34 stuff requests so cant post
Did a quick search, focuuus is nice for "i got an idea image but no idea on how to customize the parameters"
Id say for beginners its good
Automatic1111 is beginner friendly and allows for some customization
SwarmUI is like automatic1111 advanced and optional comfyUI
I did try swarmUI but I am not trying to learn comfy rn
And comfy ui alone. Lol im not touching it
Oh the comfy is optional, the "generation" tab is basically all you need
Granted its some getting used to
I see, what does the generation tab have over a1111
Swarm: buildin video, easy resolution, a logical UI, civit ai downloader, active devoloper that constantly fixes or adds stuff based on user input, image history tab that easyly allows you to "reuse parameters" (model, prompt, seed, sampler etc)
Automatic1111: addons, less likely to break but less freedom
I can show you some stuff but im currently omw to a wedding
So if you got time later im happy to hop onto vc and share my screen
Later as in 6 hrs lol
Chilling in a passenger seat
what about youtube guides
etc
https://www.youtube.com/watch?v=HKX8_F1Er_w has a great guide but the discord is also very active
Do not skip any part of this tutorial to master how to use Stable Diffusion 3 (SD3) with the most advanced generative AI open source APP Stable Swarm UI. Automatic1111 SD Web UI or Fooocus are not supporting the #SD3 yet. Therefore, I am starting to make tutorials for Stable Swarm UI as well. #StableSwarmUI is officially developed by the Stabil...
The disc has more vids but i dont wanna spam
nice nice, I'll download swarm then and once ur back we can hop on vc
Oh apperantly the dev is now an ex stability ai dev
Yes, now part of comfy.org
Gonna watch&read the announcement rn
Do you have random ideas that you generate or are you doing requests for people
I generally just have a few ideas, mess with it for a hour and move on
I have a list of old prompts that I revisit to compare with how images used to be 😄
This is the best improvement I've seen yet!
I tend to focus on the "2d" images but yeah sdxl is getting better and better lately
It's a new workflow I've been working on.
it's not about that, it's about, for example, in X/Y/Z plot having the ability to generate several resolutions. It is not about scaling. It's about the same thing as with sampler or CFG.
I know, but if you change the resolution and keep everything else the same, you do not get the same image but larger.
Yo @meager canopy o//
Gave up on Hidiffusion yesterday lol It ate all my vram for some reason even at 2k and was not able to fix it : D
Niice!
I deleted it and beuilt my own workflow for these images I've been posting.
Yeah was just about to ask, they look good.
Thanks 🙂
I tried to break down what Demofusion was doing and recreate it in a workflow.
I did some tests too:
I saw those
I have my own
Gradual denoise with latent upscale in between, also Noise injection in some stages
Fancy. Are you using deep shrink?
Nice, the focus pops out really well
It is really sharp. Even hands and faces come out almost perfect. There's no detailer for either.
Yeah it looks $$$ ❤️ gj man
Really want to try hidiffusion but can't get it to run
The standalone kills my gpu and the comfyui node gives an error I wasn't able to fix
Error occurred when executing HiDiffusionSDXL:
'transformer_block' object has no attribute 'use_ada_layer_norm'
YOu know what that is?
None
😦
How did you get these to look like they're inside the glass?
Probably by writing better prompts 🤣
Artistic fail
Looks like an artificial horizon
Old father time...
The fight to get a place by the water was getting ridiculous
Oh I like the birds. Very nice
From the family album
