#✨|sdxl
ok
A1111. Comfy looks unnecessarily complicated to me, because in A1111 I get exactly what I want. But if Comfy is much faster, then maybe I need to think about a change
I thought A1111 was already fully ready for SDXL after such a long time
Which GPU are you using?
6900xt 16gb
If you have limited hardware and waiting so long for sdxl, you might want to try comfyui.
Even ppl leaning on A1111 are using ComfyUI because of SDXL performance. A1111 desperately needs a new update to handle SDXL
To be honest, it's still fast (I have a gf 4080) but I always try to gain time
I guess I'll just wait for the new update to A1111, Comfy somehow discourages me with the way it looks - thanks for the answers
i have 3,14it/s with 3070 1024x1024 🙂
hi, trying to run sdxl on vlad, even with diffusers, with/without refiner and not getting the model to even load...
getting this error...
ERROR Diffusers failed loading model using pipeline:
C:\Users\USER\SD\stable-diffusion-webui\models\Stable-diffusion\sd_xl_base_1.0.safetensors
Stable Diffusion StableDiffusionPipeline.__init__() got an unexpected keyword argument 'text_encoder_2'
right comfy @south frigate
comfy on the other hand...
upsample and downsample because of unet
I mean ComfyUI can be made to look reasonably decent IMHO, this is my daily driver view as it loads (and yes I have turned off the spaghetti which helps)
It all depends on what you're looking to achieve. I mean I would be the last person in the world to recommend my workflow to anyone who wants to tinker, however if someone just wants to have some point & click & select boxes ??
Then maybe ...................
i think seeing all those @south frigate workflows can cause panic in the human mind. It works perfectly with the simplest default workflow.
I did a quick test and it actually renders almost twice as fast as A1111. I will look on civit for some interesting "build" for Comfy and see what comes out of it.
also you can gain 20-50% using AITemplate between model and Kdiffuser only.
This is the first time I've heard of what you write.
Is it for A1111 or Comfy or both?
AFAIK only for ComfyUI, but can be for A1111 dont know
This may be a different way of asking. What workflow do you recommend downloading to Comfy for me as a newbie?
I recommend playing with the simplest default workflow to get familiar with it. For me it's good enough.
If anything, then probably @high skiff 's workflow from github. But you will be missing some nodes. It is possible to get them all by using the manager, though.
But as i said, i would play with default.
This is the thing that accelerates it; normally the load checkpoint model output is connected directly to the KSampler model input.
*on 3000 series GPUs, on 4000 series it's around ~95%-120%
But it is not lossless.
we already confirmed that. however, you might get a bigger speed boost on 3000 series GPUs if you compile the AIT modules yourself- the precompiled ones that are shipped with the node are meant for 4000 series, therefor lower boost on 3000 series
yes i am happy with speed i have 🙂
can i use those boosts in A1111 somehow? tbh i dont really know what aitemplate and kdiffuser are yet
only ComfyUI, sorry
any simple tut how to use it in comfy?
nope, I'm sure there will be in upcoming days though
love the effects!
Just reconnect. But you will probably need to install the manager. I have the manager, dont know how to install custom nodes any other way.
Has anyone used this?
amazing
is this allowing you to use ComfyUI as a backend with any configuration?
Sorry, I don't understand what you mean, because my native language is not English.
But recently, many new ui interfaces have appeared on the basis of comfyui
yes
-same for me. I started looking at real people distantly. Like actually looking at facial features and studying them, admiring them
The stableswarmui is the official one to do that.
"Just reconnect" what? I have manager atm. Where can I find it (aitemplate/kdiffuser)?
i think loaders AITemplate
no wait a min
@south frigate in manager type AIT and download it. Then right click, loaders, load AITemplate
yes
yes as picture i posted. RMB loaders, load AITemplate
ah, sec
also dont forget to switch it to disable, and set the resolution in the empty latent node to 1024x1024
keep_loaded has to be disabled?
ok, its working
i think
8it/s and 15 sec render
from near 30 sec
what a magic
xD
now you can generate or examine nodes 🙂
is there anything else I should install to start with? is this all I should know to start with comfyui?
What card you using?
4080
thats fast or slow?
Thats right in expected values and further confirms all the latest gibber jabber about AIT workflows and the expected performance increases
AIT was compiled for 4000 series gpu, so thats good for you
any link to ait?
https://github.com/FizzleDorf/AIT (originally written by https://github.com/hlky)
i hope they add it to A1111 fast then, so good
i want to like a1111 but comfy much faster/efficient
@south frigate also you can drag and drop an image generated in ComfyUI into it, and you get all the settings and the workflow that was used to create it
even renders that were not made in comfy?
only for comfyui
yes only for comfy
yeah thats one of my fav features lol
What add-ons do you recommend for beginners? Is there anything worth downloading?
i used adetailer in a1111, but like I see in manager there is only facedetailer similar to it
Too bad the AIT is only for Comfy as I can't use that.
@south frigate if you are used to watching the preview you can enable it in the manager window, but it costs some performance of course.
currently testing stableswarm, how do I make it see all the stuff in my ComfyUI install? it doesn't see the models
My workflow includes a few custom nodes that you'd need to download, but it does then give you controlnet and a face detailer. I'm just working on a new version, that doesn't use the refiner.
The SDXL workflow includes wildcards, base+refiner stages, Ultimate SD Upscaler (using a 1.5 refined model) and a switchable face detailer. Now wit...
What has Stable Swarm got over ComfyUI at all?
looks a little simpler than most I've seen, I'll try it soon
where should i add AIT if I want to render faster in ur workflow? (https://github.com/FizzleDorf/AIT)
in comfy_nodes
anyone doing portraits getting weird teeth artifacts?
Just point the path from stable swarm comfy install to old comfy install
Sd1.5?
sdxl1.0 0.9vae
i try to put neutral expression
only 1 in 10 images comes out right
I've not tried it
huh, awesome! stableswarm is just a fancier ComfyBox! by far my favorite
using my AIT workflow in same speeds as normal ComfyUI and stableswarm, no reason not to use stableswarm
base is kinda weird, crystalclearxl better for me
Need comfyui touch control
i can think of a few
fair enough, but still, it's awesome
hehe, i cant wait till they implement those targets, it will be a great frontend to the comfy backend
I might work on it..
Not sure if comfybox is touch responsive
for now I'll stay with my nerdy ComfyUI setup, but stableswarm is way better than I anticipated
thats good to know
Just added v4, which now doesn't use the refiner, which means less overhead.
i ditched the refiner and have started using other models to refine, initial tests are interesting
would love a technical explanation on right/wrong from someone that knows
Same here.
They don't seem to be sure themselves.
I don't think there is a correct way, the answer is always "just experiment".
what i have found really really interesting is that SDXL and SD1.5 work very well together in model stacks
of course you have to upscale/downscale accordingly in the workflow but its wild what it can do
Yes, I've been using it for weeks that way.
can you stack sdxl with itself?
Yes
is it any good?
The UltimateSDUpscaler, in the middle of that, uses a 1.5 model to upscale, and then another one for face detailing on the right. An SDXL model can be used here, but I've not found it as good yet.
LOL you doing the same shit im doing
C:\ComfyUI\models\upscale_models
can i just drop the sdxl base there or needs a new one?
different models for upscale
use comfy manager for easy
i could seed a screenshot of my shit but its UGLY lol
maybe one day ill make it pretty
i have two versions, one with regular US and another with USDU
Thanks! Mine has the workflow embedded in the image 🙂
will check it out, since you have a similar workflow i might just adapt haha
May as well. I'm sure that's how mine started out 😉
why reinvent the wheel
where can i get one?
does anyone get considerably better results with upscaling ?
do you have comfy manager?
Comfy Manager or https://openmodeldb.info/
not sure, i just loaded the sdxl_simple.png workflow and moved it around
You NEED this:
is that nvidia only? the readme mentions it
SD best works with NVIDIA
the manager works with any
i am saying that SD works better with NVIDIA
not the manager
works fine
also these teeth artifacts i mentioned earlier, anyone can help me get clean pictures?
perfect teeth in positive prompt might help
however, this may lead to hilarious expressions depending on the seed
i mean i'm trying for a neutral expression
alright cool
Just inpaint the teeth 
teeth 😄
That's pretty much the way to handle any detail
how do i do that?
You should figure out how to inpaint. It's a useful skill. Find a yt tutorial for whatever you're using.
use refiner
what kind of steps yall using?
30
I prefer to think of it as a MultiSampler approach to creating AI generated art rather than as a (in my case) pre-condition, base, refiner pass
Effectively all you are doing is multiple img2img steps with different models, but using the latent space between steps rather than the pixel space
So you could have one sampler sending all the steps through using one model, or you could split the creation process using 3 samplers and 3 models if you really wanted to (personally I use the same model for Step/Sampler 0 & Step/Sampler 2 and then the "main" model in Step/Sampler 1)
So for example these 3 images all follow the same format and were generated with the same seed/prompt etc with only the model changing
LHS :: SDXL1 Base+SDXL Refiner
Centre :: SDXL1 Base + DynaVision
RHS :: DynaVision+DynaVision
This is Stable Diffusion, ultimately there is no right or wrong way just the way you're happy with.
This is the way
with refiner?
My workflow doesn't use refiner
If it's with refiner, I used 26/4
cool
I get better results without it
i tried around 30 and 100 didnt see a lot of improvement
30+ is decent. I feel like I get improved results up to around 60.
It depends on your sampler as well.
lol good point
you think it understands that kind of grammar tho?
yes
anyone have a good workflow for outpaint, i have a subject but want to change the background
correct way to use refiner ?
don't do full denoise
the refiner is like an img2img pass and should use 10%-25% denoise strength
also a seed of 0 sounds like a bad idea. Use some random number
usually you use the same text conditioning as for the base model
For my trained model, can I manually input a noise and generate an image when I use it
hey i was wondering where you went. I went down the same path wondering if it is worth the extra time to upscale images by default in SDXL. As for the current finetunes I'm using, it is just not worth it, and it doesnt improve the image much.....not sure about base
And you know the cool kids from the 2.1 chat are in the SDXL show and tell chat? Come by
this part
Is there any option to install a graphical interface on ComfyUI? It seems to me so difficult to use normally that I don't want to look at it.
Yes.. but my pc is wasted right now 😭 using free colab from my phone to run comfyui ... comfyui has no touch control. Barely managing with a halfbaked touch implementation..
Use ComfyBox
I 've ended up with a daily view that acts as a pseudo traditional web page view especially with spaghetti turned off
(BTW ComfyUI does have a Graphical Interface, it's just node based)
why are all these example images always full of pornography 🤔
If you think that's pornography you really need to get out more
just look at your prompt
First comfyui
I can't even write it here cause its so full of pornographic content
I need to learn how to use the refiner model btw
I'm guessing it helps with distortions 😅
How is breasts porn?
it isn't if you don't show nipples right
it's about the prompt. I don't want to blame Winston in particular. It's just something I see very often here
They are just words.. 🤷♂️
IMHO you're being very small minded. I do agree that there are some dictionary words in there that are used in some contexts to describe pornographic scenes, however every word used is "proper English" rather than "vulgar slang" and they are written as "words", not "paragraphs & sentences"
New to danbooru tags?
how to "Start the ComfyUI backend with python main.py --enable-cors-header." i dont really know this installation... can u help? (from here:https://github.com/space-nuko/ComfyBox)
yes, I'm still new and disgusted by this.
Don't look on CivitAI
Dont use the internet!!
I know CivitAI and yes, I find it really sad that such a cool technology like SD is mostly used for porn
Or any NSFW Stable Diffusion discord servers
if you have an item that's transparent PNG and you want to add it to a table display, how would you do it? dont really want the original image to change but maybe around the edges so it blends with the ai generated background
mostly is a false pretense
hahaha it's not small minded to feel aversion to pornography. you don't need to be a hedonist to have open minded thoughts
again, there is no pron in my screenshot , is there??
hedonism is such a weird philosophy. it's very incompatible with stoicism
one fun thing about comfy is that the workflow is saved on the image, so you can easily load someone else's if you don't want to make yours from scratch
anyways, I never said "block these images to protect our children!!!!1111". I just find it always a bit strange when people post their workflows and then you see stuff like masturb***, submissive women and stuff in their prompts.
is there lowVRam option like SD
Os windows?
hi! have been mostly disconnected for 1 week or so. Any technological revolution lately?
you're very fine and i see a lot of stoicism in you and that's a-okay. don't let people cut you down because you're not into the same material they are
inpainting?
yes and are those words in themselves pornographic?
having fun with the restart sampler, it's only in dev build for a1111 and requires a funky node in comfy but it's nice
used the setup for inpainting on comfy github, not getting great results yet
windows 10
oh it's bdsm. no wonder he's so defensive when people show aversions
I have nothing against pornography in general
Most people can imagine what the face looks like when you describe it as an "orgasm face"
That phrase doesn't mean it's pornographic.
Orgasms are a perfectly natural occurrence
@rustic garnet join unstable diffusion server.. u will find new meaning in life..
me neither! i just don't tango with it
ty, will take a look!
and yet that was your immediate reaction, to highlight what you perceived to be pornography (when it isn't)
but in SD its often everywhere and this is something I find just sad. Like even many SD tutorials are using prompts of women in sexy pose. Why?
do you know what clickbait is?
what else can you use sd for
horny furries are the backbone of tech development
i got kicked off of there because i didn't want to post images in their hardcore channels so i posted in general imagery. a softcore princess peach. something you'd see in maxim maybe. they told me i had to post in their nude channels. i said "i don't like looking at closeups of buttholes so i don't go in those channels" and they banned me
Do u run using the bat file?
lol thats funny
thats often claimed, but I doubt that the people who develop the methods are the same as the people who use it for jerking off 🤷
not really. they sort of just pick up trends. if anything they gave us discord. that's the pinnacle of furry accomplishment
I thought you guys would have taken it half seriously
yeah, nothing happened prob cuz i dont understand "Start the ComfyUI backend with python main.py --enable-cors-header." - idk what to do with it
I think the nsfw aspect of generative models still brings a lot of interest in it, if you look at civitai for example
the whole "pornography drives markets" thing is a myth. vhs never won because of porn. it's because betamax machines were expensive and had a lot of compatibility issues and in many cases were uglier and bulkier. porn went with vhs because that's where the sales were. not the other way around.
you're presuming that all furries are simply in it for the purposes of masturbation? (BTW I find it interesting that earlier you self censored the word "masturbation" but are happy to use the phrase "jacking off")
Double standards sir/madam delete as applicable
Open the bat file using notepad and copy the text inside send it to me..
what a topic
one context was a prompt to create images. the other was a conversation. different contexts. self censoring is something everyone does and it isn't something we should be lambasting people for
I think you are right, but its not a very good example
can we just drop the whole "porn is important and you must like porn" debate. it's never been high level
meanwhile to keep the OP happy here is an image with a large feline in it
(apparently 9u55y is blocked although masturbation & jacking off aren't)
nothing changed xD this is what's inside the bat file
Did you get it then..
Let me know if you managed to run ComfyBox
yeah, tried a lot of things, i dont know how to install this
w/e
SwarmUI is an option which uses ComfyUI backbone and has a more common UI
It's just two clicks ... you run comfyui in the background with enable cors header argument. And in comfybox just run the run.bat ... that's it
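For anyone stuck on the same step, here is a minimal Python sketch of what "start the backend with --enable-cors-header" amounts to. The `main.py` entry point and the flag come from the ComfyBox README line quoted earlier in the chat; the helper name `comfyui_command` is made up for illustration, and the sketch assumes it is run from inside the ComfyUI folder. On the Windows portable build the equivalent is presumably adding the same flag to the python line inside the .bat file, but that is an assumption, not something confirmed here.

```python
import sys


def comfyui_command(extra_flags=(), python=sys.executable, entry="main.py"):
    """Build the argument list for launching the ComfyUI backend.

    --enable-cors-header is the flag ComfyBox asks for; everything in
    extra_flags is appended unchanged after it.
    """
    return [python, entry, "--enable-cors-header", *extra_flags]


# Show the command instead of running it, so nothing is launched by accident.
# Passing this list to subprocess.run(...) from the ComfyUI folder would
# start the backend.
print(" ".join(comfyui_command(python="python")))
```

With the backend running this way, ComfyBox's own run.bat is the second click.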
+1, please and thank you
feel free to take it to DMs if you'd like to discuss it further, and not place it in the #✨|sdxl channel

actually there's a related question that keeps it on topic.
There are any number of NSFW specific models for SD1.5 such as "Uber Realistic Porn Merge", has anyone found any good equivalents for SDXL yet?
Dynavision works and is great if your particular taste veers towards everything looking Pixarified
Sorry, this is not a NSFW server and we do not encourage conversation about the like
thanks for the link. and I do note there is a difference between posting NSFW content and simply discussing it. None of the former has been posted ;o)
wanders off back to his hole
run_nvidia_gpu.bat right?
and when comfy is ON, i try the .bat from comfybox and its window shows for 1 sec and instantly closes, nothing happens
That should work.. maybe try running both as admin
Well I did try contacting you directly but it appears I cant
SIGH
can contact me bruv
Did u git clone comfybox? Then u will have to build it... here is the link for built version https://nightly.link/space-nuko/ComfyBox/workflows/build-and-publish/master/ComfyBox-dist
i've just uploaded a new architecture LoRA model on huggingface,enjoy it https://huggingface.co/frank-chieng/sdxl_lora_architecture_siheyuan
nah you're fine, it was flower brought it up 🙂
sorry fruit
fruit/flower related
at first i thought this was a new lora architecture and was getting ready to dig into documentation
You can contact using a #1010934719455707218 ticket here on the server-- I don't accept DMs 🙂
but since you aired it out here:
I'm not here to debate you on the server rules, just kindly asked us to move the conversation as we don't facilitate this sort of convo
it's not a debate, argument, or something for me to get granular about with you in what or what not is classified as "nsfw" or "pornographic"
So, let's kindly move on
ty
still looks cool. thanks for uploading
Ta
/me waits patiently for rule #4 to get updated so that it also says "No Discussion of NSFW content will be tolerated"
cos at the moment it doesn't ;o)
would you like to move on?
is there a way for comfyui to update all my node packs automatically at start?
it is tiresome to be git pulling every pack manually
Not really but I probably will 🙂
Great, thanks!
opens door
NPs, consider the threat understood ;o)
Gee willy winkers, an online threat!??!
Hey everyone, does anyone have an optimized workflow for a low-resource system to share? Something simple without additional nodes, as I'm not familiar with working with them yet. I just want to generate some cool images, but my 16GB of RAM is running at its maximum capacity. I just encountered a Windows blue screen, and I don't want to stop using SDXL because of this. If anyone has a simple, optimized workflow for a modest PC, please reach out to me.
there's probably a rule about that somewhere lol
amount of VRAM?
I have 8 gb vRam and 16 gb ram
Yeah with 6GB of vram I could get the base model working, idk if the refiner is really possible with 16GB of RAM :(
maybe if the models are unloaded from ram everytime you switch the models, but that would be dog slow
and I'm assuming you're using comfyui, so idk how much less resource intensive it could go
you could run it GPU only so it doesn't swap back to system RAM, but with only 8GB VRAM that's probably not a wise move.
I would get on Amazon and buy a couple of cheap sticks of, say, 32GB to install as a 64GB pair
I have a 7300MB SSD; I think it can handle this quickly. My issue lies in the VAE pass and also during the Refiner model; they are consuming all of my RAM.
Tiled vae?
yes
Fp16 vae?
Yes, I think I'll need to do that, stop trying to squeeze blood from a stone and get a new RAM kit. Unfortunately, I don't have the money at the moment; that will have to wait. I just wanted to try creating something cool in time to participate in the Civitai contest. It would be nice to have a new GPU as well.
one moment
This is the only workflow that manages to open without red windows. I adjusted the pass to (Tiled), but even so, there are moments of high RAM consumption, peaking at 99% during the VAE pass.
I'll see that, thanks buddy
@mossy canopy should I replace the 6gb model with it?
Models/vae folder
ok
serious now, what does this model at the top do? like its not a thing like openpose or canny and the like?
?
It's one of the new and fancy controlnet model for SDXL 
Yes..
but what though? like all of them in one?
Depends what you downloaded. For some reason they have the same name 😆🤦♂️
Run comfyui with this.. --force-fp16 --fp16-vae --dont-upcast-attention
Load vae separately in the workflow
ok, I work on it
im so lost and i think you've made it worse 😂
Or You could just try my colab notebook
Dl new ones! They are small and cute https://huggingface.co/models?pipeline_tag=text-to-image&other=controlnet&sort=modified&search=diffusers/controlnet EDIT: adding openpose https://huggingface.co/thibaud/controlnet-openpose-sdxl-1.0
Dl new ones and rename
Everything is new to me, I was used to 1111. I don't understand much about how Colab works, however, I have a premium Google account. I'm thinking of running SDXL in the cloud until I can upgrade my RAM. Let me know if this seems like a good idea?
You could run it on free colab.. no loss there.. per day 2.5-3 hours max on a single account. Never tried colab pro.. so no clue ..
Try running a few colab you will understand how it works..
I'm not sure how it works, I bought cloud storage from Google a while ago. From what I can see here, SDXL is allocated in this space. I'll follow the tutorial and see if the workflow becomes more comfortable. Well, I guess anything is better than blue screens, isn't it?

Double click empty space, enter sampler
Is the checkpoint merger in A1111 the best option?
or SuperMerger
That would be because @high skiff spent a lot of time ensuring it was written using only standard OOTB nodes included in ComfyUI .
As long as it doesn 't crash with an OOM error I wouldnt worry about it peaking at 99% usage
peeng
Ah, yeah
It took me more than 2 weeks to figure out how to get a high quality functioning high-res fix equivalent for SDXL using only stock provided nodes to ensure compatibility
stay in latent space?
No promises on when, but I'm currently working on the 1.1 release of my workflow, which should come complete with three separate pipelines.
One will be a very light standalone version of SDXL that can run on weaker computers, or just for people who want something light and simple
Another will be a full featured workflow with a secondary high-res fix that I am beta testing right now. From what I've seen, it's better in most ways, but you'll be able to choose between them
And then finally I am also prototyping a dedicated image to image workflow as well
No, staying in latent space causes a ton of issues
In the meantime i was just using ultimate upscale node
My workflow is: base image, 4x pixel upscale, downsample, encode to latent, send to a high resolution sampler, and only resume sampling from the high frequency details, in order to preserve the shapes and underlying textures of the base image
especially for @high skiff
I found that in almost every way comparable, ultimate upscale is unfortunately not as good as my current or even new high-res fix solutions
SDXL is not fine-tuned enough to deliver consistent images across tiles without having to use high denoise values with seams fix
My new high-res fix workflow that I have been working on uses a very stupid trick that shouldn't work, but clearly does lol
"resume sampling from the high frequency details," - explain, are you using step control?
Yes, I'm using the advanced k sampler from comfy UI in order to resume from a later step rather than starting at the beginning of the diffusion process
It also injects noise before it continues from that later step, which only influences the higher frequency details rather than the lower frequency details
Overall, it takes the general shape of the original image sprays a little bit of light noise onto it, and then applies just a few steps of additional diffusion to bring back those crisp and clear high frequency details
My new high-res fix also works in that way, except I'm using additive/multiplicative noise fields, which are very finicky, and I'm messing around to find the best values
It's the same speed as the original, but it greatly increases texture preservation, and overall high frequency detail
what are higher/lower frequency details? i'm not grokking this analogy
its not an analogy, thats actually what they are called
they're wavelengths?
low frequency is big shapes, high frequency is fine shapes
effectively, yes
diffusion is based off of refining low to high frequency noise, which is what we call the "steps"
that seems like an analogy. nevermind. i dont want to argue i was just trying to understand
no worries
i still don't understand why big shapes are a frequency
but not going to argue this
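A toy illustration of the low/high frequency idea, in case it helps. This is only a sketch on a single row of made-up "pixel" values, not what any sampler actually does: blurring keeps the slow, large-scale change (the big shape), and whatever the blur removed is the fast-changing fine detail. The function names and the box-blur choice are assumptions for the example.

```python
def moving_average(signal, radius):
    """Box-blur a 1-D signal; windows shrink at the edges."""
    blurred = []
    for i in range(len(signal)):
        lo = max(0, i - radius)
        hi = min(len(signal), i + radius + 1)
        window = signal[lo:hi]
        blurred.append(sum(window) / len(window))
    return blurred


def split_frequencies(signal, radius=2):
    """Return (low, high): the blurred signal and the detail the blur removed."""
    low = moving_average(signal, radius)
    high = [s - l for s, l in zip(signal, low)]
    return low, high


# One row of "pixels": a slow ramp (big shape) plus an alternating texture.
row = [i + (1 if i % 2 else -1) for i in range(12)]
low, high = split_frequencies(row)

print("low :", [round(v, 2) for v in low])   # rises steadily, like the ramp
print("high:", [round(v, 2) for v in high])  # flips sign pixel to pixel
```

The low part changes slowly across the row, which is why it gets called low frequency; the high part wiggles every pixel. Adding the two back together gives the original row.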
dammit thats the step im missing, i was using base image > upscale > down > send to sampler with lower noise values
but you refined it,
I'm using the advanced k sampler from comfy UI in order to resume from a later step rather than starting at the beginning of the diffusion process
any advantages of doing this over just fully denoising and starting a new ksampler with less noise multiplier or is this just a shortcut
never considered only changing high freq
That's what I was doing for a long time as well, but I ended up finding out that the advanced k sampler does not work how I initially thought
I thought that because it had no denoise slider, it would either entirely rediffuse the image, or continue off of where it left off
When instead, it injects noise on the step that it's on, preserving the already diffused details and only refining the finer details above
Id be very curious the answer here
i guess i gotta study up on fourier transforms. i had no idea it was a big part of the diffusion process
The benefit to it is that instead of rediffusing the image from the lowest steps all the way up to the highest steps, it's able to dedicate that entire diffusion range just to the finer detail steps, and it also prevents the model from changing the underlying shapes or deviating too far from the original image
hmmmmnmm
Try this node with this workflow.
https://huggingface.co/faisalhr1997/comfyui/resolve/main/2workflow.json
ok im going to test your hypoth, see if get something similar
So, effectively, the way my high-res fix works is it upscales, downscales, encodes, and then only diffuses the high resolution or small details, preserving the underlying composition much more reliably. And because it only has to do a few steps to achieve the full diffusion spectrum, it's also significantly faster
this is the important bit imo
nd then only diffuses the high resolution or small details,
So, my high-res fix only needs about three steps to be successful
......i must see this in action
interesting, ya i kinda wanna test it out to see the differences myself
That is the most important part of the workflow, in order to get properly refined high resolution details, you have to turn up the denoise, but when you turn up the denoise without specifying what step to continue on, it starts to change and misshape the underlying composition of the image
@mossy canopy comfybox is finally working, can I edit this graph somewhere? idk how yet but I want to add AIT (https://github.com/FizzleDorf/AIT) for faster renders, is it possible?
how do you decide which step?
I would explain how my new high-res fix works, but I am not too confident I can properly explain it, and it's still not a finalized process
In my old high-res fix, which is the one that is available to the public, you just treat it as the denoise basically
Whatever step you start on out of the total image is the inverse of the denoise
So if you give it 50 steps to diffuse, but only start at 45, that means 5/50, or 10% denoise
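The arithmetic from that message as a small Python sketch, going both directions. The function names are made up for illustration; the only thing taken from the chat is the rule that the steps left after the start step, divided by the total, act like the denoise value.

```python
def denoise_from_start_step(total_steps, start_at_step):
    """Fraction of the schedule still to run, which acts like denoise strength."""
    if not 0 <= start_at_step <= total_steps:
        raise ValueError("start_at_step must be between 0 and total_steps")
    return (total_steps - start_at_step) / total_steps


def start_step_from_denoise(total_steps, denoise):
    """Inverse: which step to resume from for a wanted denoise strength."""
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    return round(total_steps * (1.0 - denoise))


# The example from the chat: 50 steps, resuming at step 45.
print(denoise_from_start_step(50, 45))
# And back again: 10% denoise over 50 steps.
print(start_step_from_denoise(50, 0.10))
```

So starting at 45 of 50 leaves 5 steps, i.e. 10% denoise, and asking for 10% denoise over 50 steps means resuming at step 45.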
i believe i understand with your explanation already, i can build myself
The new one uses a different process: you don't select which step it continues on, and it has some weird multiplicative multi-sampler noise jitter effects. All I know is that it takes the same amount of time and it produces even better results than my current one
Sure.. why not.. ComfyBox is fully customizable with custom nodes
I always get these ideas that fundamentally shouldn't work, but end up working lol
this i can deal with "weird multiplicative multi-sampler noise jitter effects,"
whatever works, works is how i roll
thats the beauty of comfy
the beaumfy *TM
That's how I got my original mixed diffusion implementation, which is where the base model doesn't finish diffusing before it sends the unfinished, still noised latent to the refiner, thus saving time on the diffusion, and also preventing the refiner from damaging the bigger shapes/composition
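A rough sketch of that hand-off as step ranges, for anyone who wants to try it. It assumes two advanced KSampler nodes sharing one schedule; the dictionary keys mirror the advanced sampler's inputs as I understand them (start_at_step, end_at_step, add_noise, return_with_leftover_noise), and the helper name is made up. The 26/4 split is the one mentioned earlier in the chat, not a recommendation.

```python
def split_base_refiner(total_steps, refiner_steps):
    """Return (base, refiner) settings for a shared schedule.

    The base stops early and hands over a latent that still contains noise;
    the refiner finishes the remaining steps without adding fresh noise.
    """
    if not 0 < refiner_steps < total_steps:
        raise ValueError("refiner_steps must be between 1 and total_steps - 1")
    handoff = total_steps - refiner_steps
    base = {
        "steps": total_steps,
        "start_at_step": 0,
        "end_at_step": handoff,
        "add_noise": "enable",
        "return_with_leftover_noise": "enable",   # keep the unfinished noise
    }
    refiner = {
        "steps": total_steps,
        "start_at_step": handoff,
        "end_at_step": total_steps,
        "add_noise": "disable",                   # latent is already noisy
        "return_with_leftover_noise": "disable",
    }
    return base, refiner


base, refiner = split_base_refiner(total_steps=30, refiner_steps=4)
print(base["end_at_step"], refiner["start_at_step"])
```

Both samplers see the same total step count, so the refiner only touches the tail end of the schedule where the fine details get resolved.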
And then my fractional offset which is still something I'm looking into, that was another little happy accident where I tested something that fundamentally shouldn't work, but introduced some very interesting results, of which are not as important as they were back in 0.9, but still something cool
isn't that how the original sdxl workflows were made?
About Depth controlnet, am I doing it right?
Add custom nodes in comfyui as usual... it will automatically update in comfybox after restart
first ones i saw had the refiner chewing on the last steps of the latents
And then I came up with the first generation high-res fix, which isolated the second diffusion pass using only the base and not the refiner, while also only diffusing high resolution details on top of the original image to preserve quality as well as increased speed
the SDXL report talks about that process too, the pdf they released during the beta
And then the new high-res fix is a similar workflow, but fundamentally very different in how it diffuses the last few steps, and it produces even better results while not taking any more resources/time
Here, I have some images comparing the difference between my old and new high-res fix
just out of curiosity are you using this with offset lora?
SDXL has a weird latent shifty goings on
with high denoise it doesnt matter cause it will smash the latent together and redefine the space
no, I don't use the offset lora at all
@mossy canopy Yeah, but... cuz i am newbie here, where should I add this https://github.com/FizzleDorf/AIT I mean, before control net, after etc. i have no idea how to add it well
i was curious if it would help the jitter but i will test
my new high res fix relies on the jitter
but I am forcing the jitter
again, its a lot to explain
doesn't work with sdxl controlnets
whats a controlnet?
a gift from prometheus
model loader --> AIT --> next
base image vs old high res fix vs new high res fix
You can see from the base to the old high res fix, it smooths out and loses a lot of the high frequency detail, and the new version greatly enhances it
base vs old vs new
ohhh thats right, new folks entering the scene from the hype of sdxl wont know what controlnet is
gonna be fun to blow their minds
- middle looks like trash now
- this has the latent shift i was telling you about from 1/2 to 3, but your method seems to have denoised the artifacts, cycle through 2 and 3 quickly and you will see what i mean
for some images, the smoother appearance is preferred, but for anything textured or realistic, the new one will be preferred, which is fine cause they will both be included in the new 1.1 release
you can choose between them by simply changing one node connection
it does magical things for realism
mindblowing things
scroll down to the examples section https://github.com/lllyasviel/ControlNet-v1-1-nightly, in older SD models we had way more controlnets. some of them are just being released for SDXL now
i can only imagine haha!
base vs old vs new

and when my high res fix dropped, it was really the best option, while also being super light to run, but now I was able to crank it up to 11 with really no downsides
but in comfyui or comfybox?
its even MORE amazing for textures and skin
base vs old vs new
that is EERIE
comfyui
base vs old vs new crops
have you had success using ait with sdxl controlnet? i thought it still wasn't working
maybe i need to update
also, i hate the way the middle looks, thats why i stopped using USDU
base vs old vs new
it holds onto crunch very aggressively lol
which is why both will be an option
cant you control how much it holds onto tho? very customizable i imagine
base vs my new workflow
also, the contrast difference is outside of the upscale
I also implemented a small image processing pipeline
so with the old way, yes, with the new one, no
The trick to hold on to high frequency detail ONLY works at a single step value
Any higher or lower and it nukes
I am still looking into it, so we will see
ooooooo
so latent > sampler 1st step?
the old one let you choose how faithful it was cause you could change the split
This one is the balancing of 10 plates at the same time aligning the stars to make it work
but it always works at the value I have now
yes
ah now it makes sense why you said 3 steps
where did I say 3 steps?
@steady grove
low to high frequency noise/detail
low frequency is big shapes, high is fine details
generally speaking, diffusion figures out the low frequency before the high
so I cut it and restart it again at just the high to preserve the low and medium
this one is more accurate to how image gen works
ohhhh, no thats for the original
lol
that was the minimum you could run it at for speed save
ah ok, i was confused
so at first I have to put it from ComfyBox into ComfyUI - but saving this file (.json) and trying to launch it in ComfyUI nothing happens, it just doesn't work. why? Everyone else is loading.
i use this "save" to pull this .json
try to make a picture with your workflow and load it in comfyui
i dont use box so im not sure how they wrap the comfy metadata
I'm curious too
excellent results, is Aether Real SDXL available now?
Not yet. Soon 🙂
looking forward to it
🤗
K
where is model loader? i cant find it also
I'm using 10x steps and 5x cfg
Works better
Not realistic though, need to work on that
already made a workflow that incorporates that, if they want they could just load one of my gens
they trying to get comfybox to do it
these are waves
Lacks realism, but I'm getting closer
I'ma try at 20
gimme link to ur with AIT
if u can
this image is 25 steps
so samplers do actual wave forms on the schedule? neat
finally something that doesn't look mega-complicated, thanks, I'm going to test it now
that one is more accurate to what it looks like, yeah
i thought it was more gaussian random distribution each step
this is what an incomplete latent looks like
@steady grove This is what an image looks like as it diffuses
well, thats with the residual noise, at least
looks like unipc
actually here, I can make a visual of it, its not that hard
@high skiff - Really want your new upscale steps to improve this image I've been working on today:
Latent image tricks https://youtu.be/OdMtJMzjNLg
Here are amazing ways to use ComfyUI. This node based UI can do a lot more than you might think. Especially Latent Images can be used in very creative ways. You can inject prompt changes. You can combine latent images to new results. Stop render steps and finish the rendering after you changed the prompt, sampler and settings. A world of possibi...
thats old news by now, you can do a lot more than just that
Now would you please help me to install custom nodes? 😭😭😭
kinda a dumbed down video of how it works, IMO
use comfy manager to install the nodes, thats easiest
no need to run the scripts of a custom node
comfy will pull them in at startup
great stuff
Thanks! I've stumbled upon a few words recently that add some real nice impact to images.
can anyone help me with depth controlnet on comfy why is mine like this
try lowering the strength
Guys, I managed to replicate ComfyUI in Colab, but I'm unsure about how to add upscaler models. How do I install the upscaler models on Colab?
fun
Works better with base + refiner in my tests
can someone please share their depth CN comfy file
Guys, in Colab, is there a way to obtain another upscale model besides the three available for access? I want to know if I can use 4x_NMKD-Siax_200k?
!wget -c <link to any model> -p /models/uspcale_models/
Hooo Thanks buddy
#!wget -c <LINK HERE> -P ./models/upscale_models/
caution, i missed the period
take out the sharp to uncomment
i mean hashtag, music brain...
I did it
after much manual labor
fully functioning step by step frame diffusion workflow lol
you can take all of the images and put them together in a gif sequence to see how the image diffuses
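(One way to stitch the saved step images into a GIF, as a sketch only. It assumes Pillow is installed and that the file names end in a step number such as step_12.png; both the naming scheme and the function names are assumptions, not something from the workflow.)

```python
import re
from pathlib import Path


def step_number(path: Path) -> int:
    """Pull the last run of digits out of a file name, e.g. step_12.png -> 12."""
    digits = re.findall(r"\d+", path.stem)
    return int(digits[-1]) if digits else 0


def ordered_frames(folder: str, pattern: str = "*.png") -> list[Path]:
    """List frames sorted by step number so step 2 comes before step 10."""
    return sorted(Path(folder).glob(pattern), key=step_number)


def write_gif(folder: str, out_path: str = "diffusion.gif",
              ms_per_frame: int = 200) -> None:
    """Combine every frame in folder into one looping GIF."""
    from PIL import Image  # Pillow is third-party: pip install pillow

    frames = [Image.open(p).convert("RGB") for p in ordered_frames(folder)]
    if not frames:
        raise FileNotFoundError(f"no frames found in {folder}")
    frames[0].save(out_path, save_all=True, append_images=frames[1:],
                   duration=ms_per_frame, loop=0)
```

Calling write_gif("saved_steps") would then produce diffusion.gif in the current directory; the folder name here is a placeholder.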
Good lord lol
I can make it compatible with LoRA's too, but that will take over 20+ reroutes
so screw it, I'm in
@strong field Does this sound good?
also, it doesn't save the images just yet, so I have to redo all of the image nodes
they are just previews for now
capitalize the P
you misspelled "upscale"
under the folder name at the end
then should be good
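(For anyone who would rather do the download from a Python cell than with wget: a minimal sketch using only the standard library. The example URL is a placeholder, the function names are made up, and unlike wget -c this does not resume partial downloads, it only skips files that already exist.)

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve


def destination_for(url: str, models_dir: str = "./models/upscale_models") -> Path:
    """Work out where a model file should land, keeping its original name."""
    name = Path(urlparse(url).path).name
    if not name:
        raise ValueError(f"cannot infer a file name from {url!r}")
    return Path(models_dir) / name


def download_model(url: str, models_dir: str = "./models/upscale_models") -> Path:
    """Download url into models_dir unless the file is already there."""
    dest = destination_for(url, models_dir)
    dest.parent.mkdir(parents=True, exist_ok=True)
    if not dest.exists():
        urlretrieve(url, str(dest))
    return dest


# Placeholder URL, only used here to show where the file would be saved.
print(destination_for("https://example.com/files/4x_NMKD-Siax_200k.pth"))
```

Run it from the ComfyUI folder so the relative ./models/upscale_models/ path lines up, same as with the wget command.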
should I do 1-10 on top and 11-20 on bottom, or should I do 1 top, 2 bottom, 3 top, 4 bottom?
makes more sense 1-10 top
alright, i can do that
so you can naturally see the progression
0r
0-9 top row
19-10 bottom row
so it runs clockwise
1-10
11-20
it now saves the images as well
not just previews anymore
as it is now, the samplers are actually set up in that
1,3,5,7,9
2,4,6,8,10
order, so I need to rearrange the whole thing to not have an aneurysm lol
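(A small sketch of the two layouts being discussed, just to make the orderings concrete. The function names are made up for illustration.)

```python
def row_major(n_items: int, columns: int) -> list[list[int]]:
    """Fill left to right, then move down: 1-10 on top, 11-20 below."""
    return [list(range(start, min(start + columns, n_items + 1)))
            for start in range(1, n_items + 1, columns)]


def column_major(n_items: int, rows: int) -> list[list[int]]:
    """Fill top to bottom, then move right: 1,3,5,... on top, 2,4,6,... below."""
    return [[i for i in range(1, n_items + 1) if (i - 1) % rows == r]
            for r in range(rows)]


for row in row_major(20, 10):
    print(row)
for row in column_major(10, 2):
    print(row)
```

The first layout reads like text, which is why the progression is easier to follow with 1-10 on top.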
...wow!
um...
You can run comfy with --preview-method auto if you just wanna see it.
If you wanna save it, add a lil' something to the end there.
e.g.
for idx, x0 in enumerate(latent_list):
    preview_image = previewer.decode_latent_to_preview(x0)
    save_path = f"saved_images/image_{idx}.jpg"
    previewer.save_image(preview_image, save_path)
This way, each image will be saved with a unique name in a saved_images directory.
I oom if I try to do preview in the ksamplers
aha, I see
could also save each steps as a latent
and pull 'em out of the VAE later.
@strong field I did it! Thank you very much for your patience in teaching me. I did it!
definitely use this:
there we go, added LoRA Support
lord almighty
gonna add a primitive for sampler as well, this is fun haha
oh, that's not the right reaction at all
is gorjus
yep
Oh, now I gotta spam my Ritas.
Ah, i had Joe blocked, my bad, I thought you were somebody else @hard fractal
I saw you recommend the viewer
this is for making gifs to share with people

Nice
are you using the LoRA on CivitAI?
@hard fractal Oh, Joe, I made yet another high res fix which is even better than before. It runs in the same time and outperforms other methods even more, and also still runs stock
and supports LoRAs from end to end
I thought you had blocked me!
nope. a new fine-tuned SDXL model by @icy brook not trained on famous people (focusing on skin textures). rest is prompting - post is just upscale and some film grain.
I did, i am not sure why

I unblocked you, I thought you were somebody else
Guys, I received this error as soon as the image was sent to the Upscaler 4x_NMKD-Siax_200k. What should I do to make it work without errors?
Got a workflow?
whats the model brutha?
I stand before thee, thy tiny knight.
Working on 3 new ones for my 1.1 release
New workflow with a tighter pipeline, and 2 different high res fix options
Another one dedicated to just img2img
and a third light and easy to run pipeline with no fluff
but man, the new high res fix way outshines the other two
it preserves textures and enhances clarity much better
doesn't sandblast skin texture
Any ControlNet workflows?
None yet
Haven't needed controlnet yet
I am also working on a realism LoRA
which is in the early steps, but shows a lot of promise
Nice!
base SDXL (with my LoRA) vs old high res fix vs new high res fix
a new process I found yesterday
it shouldn't work, but it does haha
still testing it and learning more about it
the difference is small in that image
here, have this one
I would love to generate much higher resolution images. Without coherency lost that is.
base SDXL vs old high res fix vs new high res fix
new one absolutely kills for realism and texture
Yeah, that looks quite good.
it also does fantastic for textured things like water color
these are crops
base vs old vs new
new one massively enhances texture and tonality, while also picking up more grit like real water color
dumb question but where do i find diffusion_pytorch_model.safetensors
Allow me to get it for you.
its specific to comfy UI
it relies on comfy's special advanced sampler
I do not use nor want to use that UI.
been googling for like 10ms
🤷♂️
can someone share their comfy ui depth CN workflow
base vs old vs new
you can see how much better it enhances texture and tonality, while not hallucinating duplicates and such
1825x1080?
thanks landed on the HF page and wasnt sure that was what i needed, appreciate it
2704x1600
Wanna see a comparison?
This is on the bot:
weird that the nose, ears, and eyes don't align
Nice
still much more control than before tho, thats cool to see
this is using my realism LoRA with my new high res fix as well
There it is:
actually, let me convert that one so you all can see it at full 2048x
ahhh, ok, much better haha
I was like, that other one did not match lol
double depth.
looks super good!
there we go
full image converted to JPG
I have not tested this new upscale to 4096x, I should try that
I can't see what is wrong this way
Do you have all the nodes? Screenshot the whole thing
Maybe trying to update comfy ui and the nodes, there's not much to go wrong, the midas is working fine
ok will try, thx for help
This is the end of the error, let me see the beginning

i don't know why
You're using a SDXL model, that's the problem
Loving ControlNet
It works but i get this error trying to use the same prompt encoder.
steampunk dia de los Muertos
how do i solve this problem
Why is controlnet not working for me with the bot?
Make new prompt and prompt encoders, make new everything, you will get more freedom, you will be able to use a different model
OK
I try it now, thank you for your help

guys i made skibidi toilet
does anyone know what this means?
```
adm 2816
making attention of type 'vanilla-pytorch' with 512 in_channels
Working with z of shape (1, 4, 32, 32) = 4096 dimensions.
making attention of type 'vanilla-pytorch' with 512 in_channels
missing {'cond_stage_model.clip_g.transformer.text_model.embeddings.position_ids'}
model_type EPS
adm 2560
making attention of type 'vanilla-pytorch' with 512 in_channels
Working with z of shape (1, 4, 32, 32) = 4096 dimensions.
making attention of type 'vanilla-pytorch' with 512 in_channels
missing {'cond_stage_model.clip_g.transformer.text_model.embeddings.position_ids'}
```
wow, thanks for the unsettling circa 2004 cgi guy
5-7x smaller controlnet model? Hurrah. Still not supported in SDNext...boo. I'm about to try that comfy front end thing.,...can't remember the name
Having fun with controlnet running my old Disco Diffusion png's with the same prompt even keeping trending on artstation
hoodlums
Ruuuun !
Well, lol
reassembling in progress
I wonder how that happened?
very high res make this kind of stuff, or refiner
No refiner used
weird dimensions?
1920x1080
Worked with 2.1 so did we take a huge step backwards?
everyone so quick to blame the models. I don't know. depends on the prompt, seed, etc, I'd imagine. I was getting all sorts of 2 headed abominations with the beloved 1.5
this looks awesome - if you need a soundtrack let me know 😄
producer??
yeah - musician and producing
well fact is all the models are trained on specific pixel counts. so if we deviate from that specifically then they can do weird things. I don't know the science
I have music things myself
excellent, same! genre/instrument?
I have keyboards that make various sounds. and things with light up buttons that make sounds too
learned the piano - very mixed genres. from blues to ambient, from hybrid orchestral to electronic genres. the latest stuff is mostly demos for virtual instruments by my buddies and used to promote their products: https://soundcloud.com/masslevel
I'm pretty big time. let a guy online use one of my songs in the opening sequence of some corporate instructional video he was making.
sweet! those are nicely mixed
yeah, those are pretty high quality tbh
I really like Magic Castle 😄
prompt: Jiving to retro synthwave produced by Masslevel
Thank you 🙂 Glad you like it
that frog lives a cooler life than I do
Hahahaha
Haven't tried, it but this is supposed to reroute "for real"
And some other cool stuff that would be helpful if they work as advertised
Here is a variable based reroute system ? Like setting a variable, then using this variable as an input of another node ?
Is there a better way to upscale than this way with SDXL? Like throw upscale image to set res, then Ksampler to take last gen's latent to continue generating the higher res where the lower res left off.
how to get it working?
Not sure about any of it tbh. 30b too
well what do you use to run them?
happy with my controlnet interface
I'm starting to realize more potentials with SDXL.
Using Img2Img to correct some minor issues at higher resolutions is useful.
Starting 1280x720 then upscaled to 1920x1080
So long, and thanks for all the fish.
good news everyone. for what I believe has been several days now, I've been using the wrong noise offset lora. it's been giving me some pretty subpar results. but I just assumed I'd been using it wrong. which I was, since it's for 1.5. I have the right one and I've had it the entire time. I just, at some point, started defaulting to the wrong one
and now I know why I kept getting tensor size errors, lolol
Well at least you know why now.
indeed. tbh it really didn't break images like some of the loras probably would. but definitely didn't do me any favors
the lack of naming standardization in general as far as loras concerned is extremely frustrating
I'm at the point where I might just purge about 75 percent of them since it's often a huge chore to figure out what they even are
If you know what each is made for, create sub folders in the lora folder. You can do the same with models
oh, I did for sdxl models. but didn't really separate things much beyond that. problem is, for whatever reason, I just started using the wrong noise offset lora. it's not in the sdxl folder and I should have known better. but here we are
voxel xl
yeah some errors with the suit but pretty good
SD very rarely draws both wings. sad
I suggested fictiverse check out the voxel lora
and then he told me to look at who made it
loool
I dunno
He looks right out of an 80's film lmao
Yeah, LOL
I don't mean to judge, but he might have fetal alcohol syndrome
That nose, though
it's not a bad idea
Is that his wife or sister? Yes.
beautiful
SDXL is so...refreshing.
Being able to just generate nice 1080P images.
A welcome change.
1080P seems to be the most reasonable
4K is a letdown
I have no idea but I think there is the back of his head
We now know
that it is not human
I am afraid.
LOL
are you trying to render in straight 4k or upscaling/
?
Are these the aliens that guy mentioned in the court case?
1280x720 img2img upscaled to 1920x1080
Maintaining the original 1024x1024 pixel count for coherency
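(A sketch of how one might pick a width and height for a given aspect ratio while staying near the 1024x1024 pixel count. Rounding to a multiple of 64 is an assumption made here for illustration, and the function name is made up.)

```python
import math


def resolution_for_aspect(aspect_w: int, aspect_h: int,
                          pixel_budget: int = 1024 * 1024,
                          multiple: int = 64) -> tuple[int, int]:
    """Return (width, height) close to pixel_budget for the given aspect ratio.

    Each side is rounded to the nearest multiple (64 is an assumed default).
    """
    if aspect_w <= 0 or aspect_h <= 0:
        raise ValueError("aspect ratio parts must be positive")
    # Scale factor so that (aspect_w * s) * (aspect_h * s) == pixel_budget.
    scale = math.sqrt(pixel_budget / (aspect_w * aspect_h))
    width = round(aspect_w * scale / multiple) * multiple
    height = round(aspect_h * scale / multiple) * multiple
    return width, height


print(resolution_for_aspect(16, 9))
print(resolution_for_aspect(1, 1))
```

For comparison, 1280x720 is 921,600 pixels, a bit under the 1,048,576 of 1024x1024, which may be why it stays coherent before the img2img upscale to 1920x1080.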
I was using 7b for captioning and damn, 80-90% hit rate
GGML or GPTQ?
Minigpt
MiniGPT. Interesting.
7b with 50 beams rocks and I was told 13b is even better
I would've passed you https://huggingface.co/TheBloke/Stable-Platypus2-13B-GGML
I looked at his
aye
don't like it and no batch

