#✨|sdxl
yeah I stop at 40
I need SDXL generated images of WOMAN for my training on my gf
WOMAN
why not generate them then?
is there any link with that done?
well I don't know how many should I use either now that I think about it
going around asking for woman is a good way to end up on a list buddy
there's no guide for sdxl yet - on how to do it properly.
There are a few bad guides for teaching only the face, and nothing else.
Or you can train the body - but get other faces on the body 🤣
#priorities
just go for it and figure it out on the fly. its actually kind of easy
when you say "get other faces on the body" I just imagine elon musk's head on some plastic doll asian girl body
bonus points is she has absolutely stuffed up milkies
I'd joke about using 2 loras - but that's still a bad idea unless you've got your captioning down to a science
?
all depends on many variables
At least I've had no luck
like what vram, what kinda training, and more
GPT 9000 is alive
aw caith you never responded to the sec tutorial guy, i had my 🍿 ready
From my quick testing it seems about batch size 1 without gradient checkpointing and batch size 4 with it
it depends on sampler too. Some sample twice per iteration which kinda throws off the numbers
link broken XD
That's weird, I'm using use_8bit_adam too
10~12 is the theoretical maximum on 24gb vram with adamw8bit
I use 8 cause I still like using my pc
with gradient checkpointing in your case? Or how?
these settings should get you there.
except you can now use fp16
(fp16 wasn't working in 0.9 - when I wrote that message)
Thanks
I'm not seeing big differences. I'm using bf16 and caching latents to disk, unet only
I guess I should stick to gradient checkpointing
But even that was around 23GB of VRAM and bs 4
bs 8 should give you around 18~20gb vram without overhead
on a linux PC that I access through ssh
sure your vram cache is empty, and you're not running comfy or A1111 in the background?
is it just me or are the 2M samplers totally broken
Is this okay? Or do I need to uninstall those modules? I don't understand what those modules are, or whether I can break something.
check your install. you probably missed something like xformers, diffusers, bits and bytes, or something
also cudnn
this for me or him
I'll check the logs
I don't use them. wasn't so happy with it - but probably just my settings. I had some nice images in the beginning with 2M but I'm on dpmpp_sde_gpu normal / karras now
was meant for dream
I don't mean like "not nice" I mean like totally broken
since linux and servers often dont work with the default install
dpm++ non-2M version
ok... steps, cfg or broken 😄
what in the world XD
all settings same except sampler
it almost looks like you had return with leftover noise at the end on
I'm on dpm 2m, and getting good results
I don't though
using the timestep for extra details on skin & background
can be removed though, since difference is very minor
That's lack of steps
Enable xformers for U-Net
import network module: networks.lora
create LoRA network. base dim (rank): 64, alpha: 32.0
Seems to be using xformers and bitsandbytes
I'll do a test run with bs 4
cudnn working? cause without that bigger batch sizes dont work
but fucking how? Literally using the default workflow with a 2M sampler. No advanced node, no refiner, no return noise, etc
CUDA SETUP: Highest compute capability among GPUs detected: 8.6
CUDA SETUP: Detected CUDA version 117
CUDA SETUP: Loading binary /home/bot/miniconda3/envs/kohya/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cuda117.so...
checks out :/
odd
you using kohya with gui?
send me your config file. I'll check if some weird setting is added or missing
try mimizukaris setup with 2m, see if it happens there as well
you can apply my minimal changes from the screenshot afterwards, to get a bit more performance out of it
I'm still with the old command line and variables. The only thing I had to change was the caching of latents, because if you use 1024,1024 the max res seems to default to 1024 anyway
accelerate launch --num_cpu_threads_per_process $num_cpu_threads_per_process $repo_path/sdxl_train_network.py \
  --pretrained_model_name_or_path=$pretrained_model_name_or_path \
  --train_data_dir="$image_folder" --output_dir=$output_dir --logging_dir=$log_dir --output_name="${train_name}-${versionname}" \
  --train_batch_size=$train_batch_size --unet_lr=$ss_unet_lr \
  --max_train_steps=$max_train_set --lr_warmup_steps=$lr_warmup_steps \
  --use_8bit_adam --xformers --mixed_precision=$mixed_precision --persistent_data_loader_workers \
  --network_dim=$network_dim --network_alpha=$network_alpha \
  --shuffle_caption --keep_tokens=1 \
  --caption_extension=".txt" \
  --lr_scheduler $lr_scheduler --min_snr_gamma=5 --network_train_unet_only \
  --resolution=$max_resolution --min_bucket_reso 512 --max_bucket_reso 2048 --enable_bucket \
  --save_every_n_epochs=$save_every_n_epochs --save_model_as=safetensors --save_precision=$save_precision --seed=$seed \
  --network_module=networks.lora
here's mine for comparison
accelerate launch --num_cpu_threads_per_process=2 "./sdxl_train_network.py" --enable_bucket --pretrained_model_name_or_path="A:/models/SDXL 1.0/sd_xl_base_1.0.safetensors" --train_data_dir="A:/Datasets/Concepts/nier/2B" --resolution="1024,1024" --output_dir="A:/Datasets/Concepts/TRAINING" --logging_dir="A:/Datasets/Concepts/TRAINING" --network_alpha="1" --save_model_as=safetensors --network_module=networks.lora --unet_lr=0.001 --network_dim=8 --output_name="2B_v1" --lr_scheduler_num_cycles="40" --learning_rate="0.001" --lr_scheduler="constant_with_warmup" --lr_warmup_steps="50" --train_batch_size="8" --max_train_steps="35800" --save_every_n_epochs="2" --mixed_precision="bf16" --save_precision="bf16" --seed="1234" --caption_extension=".txt" --cache_latents --cache_latents_to_disk --optimizer_type="AdamW8bit" --max_data_loader_n_workers="0" --keep_tokens="1" --bucket_reso_steps=64 --mem_eff_attn --flip_aug --shuffle_caption --gradient_checkpointing --xformers --network_train_unet_only
since I also use cli
oh right, buckets can escalate if you have too many of them
no issue if you have a few buckets, but each one does add a small amount of vram
When using loras and SDXL, would you need lora between both base and refiner?
I try to do all my loras with up to like 2~4 buckets - since that has multiple benefits
do you have your scheduler set to karras? that helped me on that type of artifact
base only. refiner is currently not compatible
karras fucks up the refiner when using return noise
loras help the base a lot and the refiner often isn't needed. if you want to use one with a refiner you'll need to specifically make one. i've used the refiner on loras too for the last 5 steps and its actually fine
Did you get to check if it cached the latents with the correct resolution? Because I had to add min_bucket_reso and max_bucket_reso
i'll have to take your word for it as i'm only using the base atm
otherwise the upper max was 1024 for some reason
Now it's this with repeats:
bucket 0: resolution (704, 1408), count: 10
bucket 1: resolution (704, 1472), count: 5
bucket 2: resolution (768, 1280), count: 70
bucket 3: resolution (768, 1344), count: 30
bucket 4: resolution (832, 1216), count: 320
bucket 5: resolution (896, 1152), count: 10
bucket 6: resolution (1024, 1024), count: 5
bucket 7: resolution (1152, 896), count: 10
bucket 8: resolution (1216, 832), count: 25
bucket 9: resolution (1280, 768), count: 5
bucket 10: resolution (1344, 768), count: 140
bucket 11: resolution (1408, 704), count: 60
bucket 12: resolution (1472, 704), count: 85
bucket 13: resolution (1536, 640), count: 40
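For reference, the bucket list above is consistent with a simple enumeration: step the width in multiples of 64 and pick the largest height (also a multiple of 64) that keeps the area within 1024x1024. A minimal Python sketch of that idea follows. `make_buckets` and its parameters are hypothetical names for illustration, and the enumeration rule is an assumption that happens to match this log, not a copy of kohya's code.

```python
def make_buckets(max_area=1024 * 1024, min_size=512, max_size=2048, step=64):
    """Enumerate (width, height) buckets whose area fits in max_area.

    Hypothetical helper: the defaults mirror --min_bucket_reso 512,
    --max_bucket_reso 2048 and --bucket_reso_steps=64 from the commands
    above, but the enumeration rule itself is an assumption.
    """
    buckets = set()
    width = min_size
    while width <= max_size:
        # Largest multiple of `step` that keeps width * height <= max_area.
        height = min(max_size, (max_area // width) // step * step)
        if height >= min_size:
            buckets.add((width, height))
            buckets.add((height, width))  # same shape, other orientation
        width += step
    return sorted(buckets)


# Resolutions copied from the bucket log above.
logged = [
    (704, 1408), (704, 1472), (768, 1280), (768, 1344), (832, 1216),
    (896, 1152), (1024, 1024), (1152, 896), (1216, 832), (1280, 768),
    (1344, 768), (1408, 704), (1472, 704), (1536, 640),
]
buckets = make_buckets()
missing = [b for b in logged if b not in buckets]
print("logged buckets not reproduced:", missing)
```

Calling `make_buckets(max_size=1024)` never returns a side longer than 1024, which lines up with the upper max being stuck at 1024 until the min/max bucket flags were added.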
eh yeah sry, that config from that job was without buckets. I also have to add them like you did
how much vram are you hitting with this?
on my specific setup - 19+overhead
1st pic is using the default steps "Kitten". Can't really see artifacts but just reducing the steps to 30 without changing anything else for the 2nd pic you get artifacting really bad. 3rd pic is unipc 30 steps.
the artifact is that the cat is brain damaged or had a stroke
XD
more like infected with VAE 
I feel like the mimizukaris workflow uses so many steps the artifacts just slowly fade away
yeah. umm gimme sec - I have an amazing image I made to prove you right
but when you use an actual sane step count like 30 it shows fucked up
it got the VAEds
hmm... I'm thinking. I had this too a week ago... I'm looking for the metadata
bit big - but it shows exactly what you're experiencing
I just take the performance hit and go for 66 steps now
yea I think 2M samplers and LMS is bugged in comfyui
For 2M are you using Karras or Exponential?
maybe lms is just shit tho idk never used it
I always do with those
normal
You have to set both base and refiner to the same scheduler
Hmmm
lms was always a step hungry sampler
I'm not crazy I've tried just about every combo by now
keep in mind, multiple setups used to rely on 100~150 steps in 1.5
ok well, somethings broken, so like, double check
I'll give it a whirl when I'm back at a PC
I use my Principled node which physically does not allow you to have different samplers/scheduler for refiner/base
its why euler a, and other ancestral setups became so popular, because they run damn fast
so what is everyone using for sampler/scheduler on a base/refiner setup?
check yourself!
xD
euler/normal. debating switching to either ddim/normal or dpm ++ sde / normal
Use this workflow
20 steps 😂
sad racoon
I'm not about to render a bunch of 90 step test images or whatever to show that the artifacts are invisible if you put in 3x the horsepower
gonna go eat
fyi, I rendered 2500 test images to check what works best with my loras 🤣
yeah it looks like _sde without the 2m is the best at the low steps when using the refiner
@boreal bough Thanks for the help! It seems like I can do batch 8 with gradient checkpointing now. Added this:
--cache_latents --cache_latents_to_disk --gradient_checkpointing --mem_eff_attn --bucket_reso_steps=64
I always use 100% base on 2m sde karras
if you run base -> refiner
then you're essentially handing the refiner a chaotic unfinished image
meaning refiner does more - which has its positive and negative parts.
but for many artstyles & photos, refiner is just way too biased, so 2m sde stops being an option, since we want the base to do exactly 80% of the work
I think the refiner kinda limits the output to its bias. Why would we use the refiner when the target is outside the refiner's scope? If we throw away the refiner, the SDXL two-stage architecture has no point.
Should we train the refiner to expand its scope and solve the bias issue?
If you're using 2m karras ofc the refiner is gonna look like garbage. The karras scheduler ruins the partial diffusion technique and 2m has noise problems anyways
if you're addicted to 2m karras coming from a1111 try unipc / normal. Does similar things but isn't broken
ah no. not what I mean, I was only referring to control. Refiner is an expert at the last 20% of image generation - so ideally we want him to focus on exactly that - not more nor less.
2M SDE causes irregular noise patterns which the refiner wasn't trained for. So it can end up causing some glitches to persist, and at the same time cause certain refiner bias to pass through, when it does too much work. (but on a positive note - it will significantly add detail if it works - albeit the details won't be varied nor very original.)
"photograph of a brown haired woman" essentially always creates the same face for the woman, if the refiner is given enough steps & chaos to do its thing. (this is a bit of an extreme example - but the thing is that you won't know if bias is happening or not)
I'm not up to date with the conversation, so which samplers are recommended?
turns nostrils into eyes
I come from A1111 using sde karras
decide yourself!
that's without the refiner right ^
Seems there is a market for a really good GAN just for making people. SD is a diffuser which is somewhat good at many things but not excellent at anything. Instead of base and refiner maybe the next model should be art model + person model.
yep - but refiner only adds 'details', it doesn't like change composition
that's what finetunes are for. SD is really good at people, just bad at generating the exact same person again and again
maybe you should open an issue on comfy for the 2M + SDXL compatibility issue
just upload that picture and it'll speak for itself
but is it a comfy issue? O: didn't even consider that till now
thought it was just the fact the weights got readjusted + pruned model (less accuracy)
cool side effect of the lora im making. if you crank up the values. you get very liminal pictures...
noice
maybe if @visual glade is here they'd know why 2M samplers are broken on XL
SDXL 0.9 breaks with 2M as well
so I assume its an sdxl config issue
also thanks to masslevel for helping me with this but my lora also improves coherence by a ton.
meanwhile I feel like my lora guide is turning into 90% FAQ and 10% guide 🤣
I don't see how it could be a Comfy issue. Seems to be more of a side effect of how the samplers work.
nothing wrong with that! faq can be a guide.
Have you released your guide? I am very interested in it.
it's up to the maintainers to implement the samplers. IIRC some samplers on a1111 are kinda broken.
before and after. look at the tiles on the ceiling
not saying comfy's is but maybe 2m needs a compatibility patch for SDXL
yep - I'll post a prerelease version here to get a bit of peer review in, then it will go up on reddit & civitai as well
will take a bit though, as I'm doing a comprehensive guide - that mentions all the different kinds of loras that can be done with sdxl, and a full walkthrough of each type
always fun seeing commits like this
comfy: "I release and check sdxl chat if there are complaints."
also comfy: "there were no complaints again. nice."
you mean 2m_sde?
both
cause 2m_sde and lms are just bad samplers
dpm++ 2m and its sde version
regular 2m should work well (with karras scheduler)
with normal scheduler it produces super broken images on SDXL.
karras scheduler doesn't work well when using the refiner
yes and no. half of the community is using a variation of sytans workflow, which isn't compatible with karras
sd 1.5 you can use dpm++ 2m normal without any goofy ass artifacts
so I assume it's an sdxl base model compat issue
i mainly use dpm++ 2m, without issues
my 3 mains so far with testing are Euler 32+8 (all normal), 2m_SDE_GPU 48+12 and uni_pc_bh2 48+12
i do not use sytan's workflow
same. but it assumes above 50 steps to get rid of the artifacts
... i also rarely use the refiner at all lol
run it with the normal scheduler on sdxl base for like 20 steps
turns everything into stained glass
I am using Comfy's basic Base+Ref
Is there some sort of 'Interrogate CLIP' image reverse lookup thing that is more applicable to SDXL?
Vit-H model
works especially well for real life photos. works okay for most artwork/renders
Hmmm, is that something I can use on a site?
can't help there - as I run it locally
dpmpp_2m_sde karras 20 steps (just base)
oh yeah. 2m x normal is pretty fucky. But karras is fine.
just use the base is ok.
Oh well maybe I can do that too then, do you call it in comfy or a1111 or is it its own webui or pythony something something?
I tend to use simple for everything
a1111 + "interrogator" extension
whoot, you rock!
dpmpp_2m_sde normal 20 steps (just base)
uff 🙂
should look like that once you have it setup
wait why does karras break on sytan's?
Shouldn't that be 40 to compare it to karras?
don't know. It just makes the refiner not work well
because of how noise works
maybe it denoises in a weird pattern that the refiner isn't built for
karras does things differently - which is good when it works alone, bad if you try to interrupt it midway
So what is the test? 2m+karras at 20+5?
yea. I use karras on all my 1-pass stuff
never when refining
does sytan do anything funky with noise? his workflow looked fairly simple to me
his workflow stops the base before it finishes denoising and has the refiner finish the noisy image
which is how the refiner is supposed to be used iirc
Rainbow colored nodes in Comfy 🙂 ❤️
Because those default colors just don't satisfy me
yea in the latent preview it always looks more or less fine till the last step
instead of 1+1/3
it does something like 2/3+1/3
and by doing that it both saves time, and adds detail as the 2/3 is just the right amount of chaotic left
awesome! works like a charm, thanks!
yeah now it's more like 4/5+1/5 - which checks out for how the refiner was trained
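The fractions above translate into step counts like this. A small Python sketch, assuming the handoff is expressed as the refiner's share of the total steps. `split_steps` is a made-up helper, and the `start_at_step` / `end_at_step` keys only mirror the field names I remember from ComfyUI's advanced sampler node, so treat them as an assumption.

```python
def split_steps(total_steps, refiner_share=0.2):
    """Split one denoising schedule between base and refiner.

    Both models are configured with the same total step count. The base
    stops early and hands its still-noisy latent to the refiner, which
    runs the remaining steps.
    """
    if not 0.0 <= refiner_share <= 1.0:
        raise ValueError("refiner_share must be between 0 and 1")
    refiner_steps = round(total_steps * refiner_share)
    handoff = total_steps - refiner_steps
    base = {"steps": total_steps, "start_at_step": 0, "end_at_step": handoff}
    refiner = {"steps": total_steps, "start_at_step": handoff, "end_at_step": total_steps}
    return base, refiner


for total, share in [(40, 0.2), (60, 0.2), (25, 0.2)]:
    base, refiner = split_steps(total, share)
    base_steps = base["end_at_step"]
    refiner_steps = total - refiner["start_at_step"]
    print(f"{total} steps at {share:.0%} refiner: {base_steps}+{refiner_steps}")
```

With a 20% share, 40 total steps come out as 32+8 and 25 total steps as 20+5, which are the kinds of splits people quote in this channel. Dropping the share to 0.15 is the tweak suggested for artwork.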
remember to use Vit-H in the clip selector!
20% is nice for photoreal
but on artwork it keeps turning my dragon nostrils into eyeballs
so 15%
dpmpp_2m_sde
normal
38/40 steps
yea I think something near the last step rogers it
92% base, 8% refiner also works well
do 39/40
I've essentially gone the mcmonkey route and base only.
since that works best when messing with loras
only when I do film like prompts do I switch back to full setup
yea this happened to me a bunch as well
dpmpp_2m_sde - normal - 39/40 steps
dpmpp_2m_sde - normal - 40/40 steps
XD I smell a conspiracy!
I mean we had that issue in a1111 before
10/10 vs 10/9
try doing a few steps with a low cfg like 5.5, then passing to a base sampler with a high cfg of 9.5 - 10.5
Hi, is there any advice on the VAE for SDXL 1.0?
Should we use the reverted VAE on Hugging Face?
Or use the default VAE in SDXL 1.0
use the reverted one which is the VAE of SDXL 0.9. the 1.0 VAE introduces some artifacts
uhhh I did dpm fast for 10 steps wtf
personally, I don't notice the rainbow edges unless i zoom in past 100%
it's beautiful
Does anyone know why sdxl makes a .npz file during lora training? What does it even do? Just curious
latent cache
any way to change font size on comfyui for my prompts?
@west breach ooo
scroll wheel
that just zooms in and out, i was wondering if there was actually a way to change it
2m_SDE Karras 20+5
Thanks for the advice @upbeat summit @west breach @boreal bough
I'll overwrite my save file of SDXL with the reverted VAE on Hugging Face
post the actual output file not a copy-paste
Since you mentioned dragons:
samurai with sword fights white dragon, epic, legendary, award-winning illustration
Although I am surprised they haven't fixed the SDXL 1.0 checkpoint with the reverted VAE
Seems like it would make sense to just replace the VAE in the Hugging Face SDXL 1.0 repo as well, rather than just the VAE repo
Probably taking a few slow days after sprinting to get 1.0 released. They'll probably fix it once they definitely identify the issue.
It doesn't happen to everyone.
not using the karras scheduler increases the detail the refiner makes tenfold
also used the non-2m sde since the 2m version is borked
I don't normally use Karras. I was merely doing so to see if something strange happened
I'd say the dragon scales and his face looking like soup is pretty strange
No love for the UniPC scheduler?
unipc is good
what does the Epoch setting mean here? I usually just set it the same as the max epochs
I didn't use it so the seeds would look similar
Ah I see
unipc looks like this on that seed
UniPC converges I think
It is one of my top 3: euler, 2m_SDE_GPU and uni-pc-BH
against itself yes. not against other samplers I don't think
maybe needs more steps
like so?
I got inpainting working if anyone wants to try. Just upload the image, take it to mask editor and save a mask, then run. Still rudimentary, but it works.
thanks for sharing it. I will try it out for sure
50 euler 50 ddim 50 unipc 25 dpm2.
unipc uses dpm stuff under the hood iirc so they converge similarly.
what does the Epoch setting mean here? I usually just set it the same as the max train epochs
anyone know how to properly feather a mask?
vaseline
I added it to my extensions collection: https://github.com/failfa-st/failfast-comfyui-extensions/tree/main
Enjoy
Nice, seems like with enough steps UniPC & DPM++ 2M are very similar
bless your soul
that's dpm2 not 2m since 2m is broken on sdxl
which one do you prefer?
Ah, damn
I was using 2M with the diffusers library, and it seems to look fine
Perhaps broken in AUTO?
Or comfy
I'm using comfy
so bright!
Can you rename a custom node without breaking an old workflow? E.g. If I improve a node and I want to use the same name, but don't want to break an old workflow that used the old version of the node
I've been using euler the whole time. Seems like the safest option.
Kinda want to play with the non-2m SDE see how the refiner likes it or probably doesn't
so here is an image with 2M_SDE and 20+5
and this is with 25 and no refiner
I prefer the one with the refiner
some of my workflows broke because a node was renamed. also sometimes when a parameter / widget is added, everything gets jumbled up
do a 9 batch of your 2m_sde (karras?) against non-2m sde normal so you can get some actual statistics going
but I was going to suggest giving it an abbreviation or something. like HMN (humblemikey nodes)
Look, as much as I sympathize with whatever issues you are having, I am not seeing it
I hoped there was a way to deprecate previous versions of the node without breaking things. Currently I'm adding V2 V3 to the newer nodes and just leaving the old ones as is
Look harder then.
Certainly nothing that screams: refiner is broken, or 2m is broken
yeah I'm not sure if there is a better way to handle that. just experienced it from the user side
most just update their nodes
so if it breaks, it breaks
2M is objectively broken on XL
and karras just makes the refiner worse but it has the nice side effect of making 2m not look broken.
try karras alone using euler and euler karras and it's very clear it breaks the refiner's denoising.
WAS updated his save image node and then all my widget settings were like:
Quality = PNG
Format = 100
because he added a widget parameter
so the queue wouldn't run, before I fixed it
So let me get this straight: Euler + Karras 20+5 and then plain 25?
yeah it seems to use the old order of the outputs even if you name the outputs
(the second being with no refiner)
yeah that whole system could be improved. I guess some node guidelines could make sense - more standardization. but that's a lot of work. because that parameter jumble doesn't need to happen. there are ways to handle this
@upbeat summit with V3 you can use multiple styles and multiple loras by using the lora:lora_filename style:style-name syntax instead of selecting them. was starting to find the very long list of styles annoying to scroll, and you can only pick one
fun idea. comfyui could be turned into a factorio like experience if you're willing to program an insane amount 🤣
that's great!
oh and you can select a image size or set a custom one
too late already did using your prompt. normal scheduler on left, karras on right, everything else is the same
all the small details look substantially worse on the karras
like look how the dragon's beard is soupy on the karras but clear individual strands on normal
scales are uneven in karras as well. Lumpy and mis-sized in some places
poor dragons were in the fukushima reactor meltdown
Both with refiner?
yes
you can open the images in comfy to see
50 steps, 42 base 8 refiner euler sampler
only difference is normal vs karras scheduler
this just gave me a revelation... WE DONT NEED A REFINER TO REFINE XD
oh god why didn't I think of this sooner
Base + Lora -> Base + Lora (trained on only last 20% of steps) to effectively make a new refiner
but will it work
would insta solve face issues, and only doubles training time, rather than x6, while avoiding overfitting
bold claims
it will work if we use it exactly like the refiner model - with denoise set to 20%
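To make the "last 20% of steps" idea concrete: with the usual 1000 training timesteps, where timestep 0 is the least noisy, the final 20% of denoising sits at the low-noise end of the range. A hedged Python sketch, assuming sampling progress maps linearly onto training timesteps. `final_timestep_window` is a hypothetical helper for illustration, not a trainer option I can vouch for.

```python
def final_timestep_window(final_share=0.2, num_train_timesteps=1000):
    """Return (min_timestep, max_timestep) for the last part of denoising.

    Assumes timestep 0 is the least noisy and that sampling progress maps
    linearly onto training timesteps, which real schedulers only
    approximate. max_timestep is exclusive.
    """
    if not 0.0 < final_share <= 1.0:
        raise ValueError("final_share must be in (0, 1]")
    return 0, round(num_train_timesteps * final_share)


low, high = final_timestep_window(0.2)
print(f"train the second lora only on timesteps {low} to {high - 1}")
```

A lora restricted to that window would only ever see lightly noised images, which is the same niche the refiner fills when it is handed the last 20% of the schedule.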
@nimble heart the issue of samplers messing up an image on the last steps has also been round in automatic1111 for some time
https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/3483
https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/10193
interesting
is there a purpose to fitting all lora training images to one size, or is it okay if they are different sizes because of buckets?
whats the difference between the 2?
would also save vram funnily enough
top one borken get bottom
get 0.9. it fixed an error
vhs lines
dont think ive had it
I would suggest getting the 0.9. the 1.0 VAE introduces artifacts to the image
ok
They both contain the same unet. But the bottom one has an older version of the VAE. Some people were getting strange artifacts in images with the 1.0 VAE.
buckets work great. one size saves a bit of vram (but really not much) and makes working with big batch sizes easier
used a SDXL init image?
Yes sir
should i still use the sdxl_vae?
mk cool. cuz i really dont want to resize these haha
you dont want bucket 1 to have 60 images
then bucket 2 & 3 with 1 image each
other than that, all good
also what about these
The VAE is already contained in the model.
have at least as many images as your batch size, in each bucket
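A quick way to check that rule against a bucket report like the one posted earlier. Minimal Python sketch; `underfilled_buckets` is a hypothetical helper, and the counts are copied from that log (they include repeats).

```python
def underfilled_buckets(bucket_counts, batch_size):
    """Return the buckets that hold fewer images than one full batch."""
    return {reso: n for reso, n in bucket_counts.items() if n < batch_size}


# Counts copied from the bucket log posted earlier in the chat.
bucket_counts = {
    (704, 1408): 10, (704, 1472): 5, (768, 1280): 70, (768, 1344): 30,
    (832, 1216): 320, (896, 1152): 10, (1024, 1024): 5, (1152, 896): 10,
    (1216, 832): 25, (1280, 768): 5, (1344, 768): 140, (1408, 704): 60,
    (1472, 704): 85, (1536, 640): 40,
}

batch_size = 8
small = underfilled_buckets(bucket_counts, batch_size)
for (w, h), n in sorted(small.items()):
    print(f"{w}x{h}: {n} images, fewer than batch size {batch_size}")
```

Whether an underfilled bucket actually hurts training is disputed in this channel, so treat this as a sanity check rather than a hard requirement.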
Look at the names and compare them with the other ones. I think you will understand.
k
ahh very cool... I had some images I made with MJ using a text prompt and three image prompts. I ran the three image prompts through Vit-H and then copy/pasted all three results from the image prompts along with the text prompt into SD and it is giving me some really close looking results.
your images look cool 🙂
what is Vit-H?
i've heard that, but i've never really cared about it and my loras always come out half decent
It is a model that you can select in an image interrogator add-on for A1111 that seems to work best for SDXL - https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/7968
i think there's gradient checkpointing / batching code that solves those issues
it doesn't by definition "break" anything, but it does duplicate those images until there are enough to fill up the bucket, introducing unnecessary bias
in my experience it's safe to just throw all the images in a folder with whatever AR they've got, and go for it
ah cool. I am on Comfy due to performance @rich coral
yeah i looked into that. i think early versions of kohya did that before i came along
i can't find any evidence in the current code that it's doing that
i'm no expert coder but i can comb through and find out whats happening to some extent
@upbeat summit check your messages
it's evolving
denoise is too high
https://github.com/darkstorm2150/sd-scripts/blob/main/docs/train_README-en.md#aspect-ratio-bucketing doesn't seem to mention the limit either. i've combed all through these manpages looking for that. i decided just to not care about it since i can't find any evidence of the theorized bias
my assumption is that early versions had it and it became engrained in the lexicon
kohya is releasing updates at a rate where no gui can keep up 😭
I can't just tell people "don't use the gui cause its old"
https://github.com/bmaltais/kohya_ss this stays up to date with the main branch
he's outdated :"D which is why I'm complaining
I don't blame him either
I mean kohya is damn productive
(dev branches - since main is ancient, and sdxl branch still has the typical flaws that are all fixed in dev)
in simple terms, what is ascore?
a prompt (embedding) you dont see that is always applied to both negative and positive
okay im just not gonna play with the values and hold thumbs 😅
I should be too. having to switch models in A1111 is painful but then again I do need SD Ultimate upscale... I don't think that is on comfy... although... hmmm, maybe it can do it
well i mean, if you mean dev branches, don't recommend those to people in the first place. lol. if people want those they'll find them. they should get experienced enough with main branch to have to want those
the kohya gui has a dev branch too. dev2
negative is perfect as is. positive value can be raised and lowered by 1, to achieve more real/creative results
i hadn't checked out the dev branches though. what's good about them for sdxl?
i see o0
if they get the main branch, then sdxl training is partially broken
at the very least the sdxl branch
so, a score positive 8 is more real but less creative than a 1
i dont know what partially broken means
funnily enough the other way around higher = more artworky, lower for more photo like
tx 
I saw one merge from sdxl to main, back when sdxl training still had a lot of issues, such as sample gen, clip completely breaking, lots of errors, and performance issues
i use main. works for me fine. no errors. finishes without nans. /shrug
i been doing it that way since about a week before 1.0 dropped
lora training.
an 2 TIs
dev has TI working, clip working (though I still wouldn't recommend it), sample generation, speedups, better bf16 support, full bf16 support
auto dev?
i have all that in my branch too.
full bf16 only works on linux with updated bits&bytes last i checked
https://github.com/kohya-ss/sd-scripts/commits/main i don't know what you mean by main is ancient either. it gets updated daily
No need for control net with this workflow. It creates an edge detection mask and blends with a handmade mask in mask editor.
it was before 2 days ago. I see that he's adding all the new updates now though. cool 👍
no but i mean, i've never left main and i trained a ti like, over a week ago
on the gui
while 0.9 was out
well TI was always working. it just wasn't working as intended
hmm. i still don't see any good reason for recommending a dev branch to people, or for saying the dev branch of the common gui isn't kept up to date.
it would be more encouraging to people to hear how accessible all these methods are now and today instead of saying they're not, when they are
I'm not gonna recommend dev to people. I'm currently running main to see if I run into any problems while I go through the 7 loras
but for context on how active it is
https://github.com/kohya-ss/sd-scripts/commits/dev
is it possible to reproduce a single image from a batch in comfy?
i'm not sure what this was about then. /shrug.
you don't need to make things complicated for new users. FOSS lets you see the development process. don't have FOMO over it
/installmissingcustom nodes
i looked at that too. i just don't see much that's essential. and when i asked, you talked about TIs
for this
I don't remember the exact situations, but multiple times in the last two weeks I got bugs in the kohya gui, that weren't present in the dev branch of kohya
so yes I'm pretty damaged from those experiences
are you sure it wasn't a configuration issue? you don't seem to have spent much time on it if you can't recall
my fix was to take the 'command' and then use it in kohya directly. which then worked as intended
switching branches usually comes with deleting the venv, right?
cause I checked commits and often saw my exact issue addressed
turns out not just me having the issue
that's a big reset button that kind of allows a whole lot of confirmation bias in. a fresh venv could work elsewhere too. anyways, now it seems suddenly you do recall a lot about this process. see where i'm going with this?
moral of the story, why the dev branch?
yeah. cause I stopped writing on the side so I can properly focus on this conversation as you're clearly not letting it go
people recommending dev branches to newbs concerns me is all
i'm just saying, for me personally, it's been fine the whole time on the main branch kept up to date. same install i've always used. same venv even. it's kept on keeping on. i run the setup.bat file when ever i pull and the venv gets kept up to date. if there were a bug i were having, you'd best believe i'd figure out why before i deleted and rebuilt that venv. If only to avoid it in the future.
sometimes we can read "issues" and find our own. Then we delete the venv and rebuild it, albeit on a new branch, and it works again. 😮 . when really just staying on main and rebuilding the venv would've been fine.
i've had most issues when installing other guis and they screw with something in the global scope that breaks things.
i asked specifics about the issues you were talking about. that's all i ever wanted. not vague "i don't knows" and "lookmans"
fair enough
I did give kinda useless answers XD
🤣
chicken signifies nothin it was just first good bushwacker gif i picked xD
what setting do i use in kohya ss to train SDXL models? 1.5?
SDXL in automatic1111
offtopic but does anybody have any good controlnet workflows?
cant figure it out
(comfyui)
thats in beta i think
was asking for pre xl models
alr
sorry dont mean to argue i'll go
I really like the inpainting process, thank you. I want a node to do batch tries, do you know where and which node i can change?
thank you! WAS Node Suite has an image batch loader. Probably switch the load image node with that.
Not for the input but for outputs, i want several results at once
you would want to set up multiple samplers and image nodes each set to their own seed
Trying to use SDXL to make some cyberdeck concepts for a physical one I want to build. Fun to dig into what sort of future tech images live in the model
ooh, what resolution did you generate that at?
Morning from the depths!
@visual glade Sorry to bother you, but I don't know who I should ping for this question. After multiple conversations with different users, the conclusion seems to be that a custom fine-tune of the base model should have a custom refiner to achieve the best result. A LoRA trained on the base should also have a refiner version. Otherwise, the original refiner would ruin the base LoRA output. The original refiner has its limitations and is not universal. How should we train the refiner? Will SAI release proper training tools? My premise might be wrong: maybe the refiner is designed to be universal and we just misunderstand the workflow. Please correct me if we are doing it wrong with the base + base LoRA -> refiner workflow.
Hey could anyone tell me how I can use SDXL? Please
ComfyUI, SwarmUI, SD.Next, A1111 and all those countless websites
That water looks extremely good! ❤️
Uhh, and would I get better results if I run it local?
most customizable results local, yes
better/best are relative concepts
ty ty 🙂
clipdrop/dreamstudio might be faster or have quality advantages from using experimental new models
At least from a far 😄
@late marsh
comfy's an odd choice to ping for training questions. Kohya trainer is the go-to trainer rn for most things, idk about refiner
I see thanks, could u tell me how to run it locally? Or a source where it's explained
Thanks!
you can train loras on the refiner, they should be trained on the last 200 timesteps
I don't know who should I ping.😢
but really it depends on what you are training
Damn that's fire
thanks for the answer.
the second one is amazing, the third one has a perfect photo filter on it
99.9% sure I just cracked high res fix
like as if it was edited in Luminar AI
@visual glade got a better upscale workflow, and it only uses stock nodes!
Thanks! I have some basic guidelines for prompt but the rest of it is just from One Button Prompt script's randomness 😛
well they work lol
Yeah
cool
ooo? do share
base vs 12.5% fix vs 25% fix vs 50% fix
base image
50% fix
2048x res too
sending to some colleagues to test
hi i am here
why do ya'll homies put 10k?
10k*?
because we set the total steps with an integer
just end with total steps
and we set the start and end steps as well with an integer
okay thnx thnx thnx
epic scene
it's at 10k because that's the default and it gets ignored if it's equal or bigger than steps
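A rough sketch of that behaviour in Python, assuming the sampler simply clamps end_at_step to the total step count. The function name is made up for illustration and this is not ComfyUI's actual code:

```python
def effective_end_step(steps: int, end_at_step: int = 10000) -> int:
    """Return the step where sampling actually stops.

    Assumption: an end_at_step that is equal to or larger than the
    total number of steps is ignored, so sampling runs to the end.
    """
    return min(end_at_step, steps)


print(effective_end_step(30))      # default 10000 is ignored -> 30
print(effective_end_step(30, 20))  # a smaller value stops early -> 20
```

So leaving the field at 10000 just means "run until the last step", whatever the step count is.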
k thats what i was looking for thnx
hey which do you guys recommend me the most, ComfyUI or SwarmUI?
swarmui lol
dissecting all of yr workflows 😆
i'm biased but i'm also right
be a big dawg and get comfyui. your powers will become limitless
do you know what swarmui is lol
@young mantle 
nope
i dont trust that link bro 😂
here's a hint
literally a link to the stability official github
jkkkkiiinnnggg 
that is looking sweeeeet
tbh I actually don't know a lot about swarmui. I was an a1111 peasant until recently
amen
SwarmUI = the full power of comfyui, including node editor, with the ability to use an easier UI mode (similar to auto in a way) and bonus power tools like grid gen, and also multi-gpu support for ultrarapid generation
why isnt SAIs official Motto " This is Stable Diffusion, This is The Way"
hmm. I'll have to look into that. so I can do all the comfyui stuff though? it's not restricted?
yes correct
awesome. I'll check it out
I do miss some of the features from a1111. but it just got too restrictive. I very much prefer the ability to put together a workflow. but I do miss some of the plug and play aspects. and using both is a hassle since my video card can barely handle running one of them
Would we have official extension manager for swarmUI?
i feel like im so close to figuring out something good with comfy, just need to push through the pain and frustration a bit longer
not yet. Waiting on, uh, community extensions to be built first
What's ur videocard?
Improved inpainting controlnet workflow. still a mess, but it works.
swarm is literally perfect for you
Great. Waiting for it
i am here
it just closes without going further
From the SwarmUI
I'm trying to install it
i am not here
Is there already a tool to train refiner? Does kohya already support this?
hi
hi!!!!
did you install DotNET?
try running launch-windows.bat?
6gb 3060, lol
uh, maybe, i don't thiiink so though?
new high res fix workflow I am working on seems to be fast, efficient, have controlable levels of fix/faithful, and also considerably fixes faces
does the job though. to a degree
its not perfect, but
can't really train stuff. but easy to do online
Download The Install-Windows.bat file, store it somewhere you want to install at (not Program Files), and run it.
I'm following this step
but
it doesn't work
it should
try running launch-windows.bat?
is there a StableSwarmUI folder?
hair changes color?
nope, it doesn't install anything
Anyone had longer generation times using the base 1.0 VAE 0.9 combined model than using the normal base 1.0 model?
can you... open a command prompt window, and run install-windows.bat through the command prompt window?
lemme see
'winget' is not recognized as an internal or external command,
operable program or batch file.
'git' is not recognized as an internal or external command,
operable program or batch file.
The system cannot find the path specified.
'.\make-shortcut.bat' is not recognized as an internal or external command,
operable program or batch file.
'.\launch-windows.bat' is not recognized as an internal or external command,
operable program or batch file.
ooooooo Okay.
Uhm.
First of all, are you, uh, signed in to an up-to-date windows 11 (or 10) computer? 0.o
yep but I'm in a modded one
oh. Well. Your mods broke important things? So... maybe don't do that?
But uh
https://git-scm.com/download/win if you install git manually that should resolve that
then run install-windows.bat again
Which should I use as Git's default editor?
eh doesn't matter you're extremely unlikely to ever use that part lol
I prefer to be careful XD
defaults are all good, only 'add to path' needs to be on
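If you want to double check that the tools from the error output above are actually reachable after installing, here is a minimal sketch using only the Python standard library. The helper name is hypothetical, and the tool list is just the two commands the error output complained about:

```python
import shutil


def missing_tools(tools=("git", "winget")):
    """Return the names from `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


missing = missing_tools()
if missing:
    print("not found on PATH:", ", ".join(missing))
else:
    print("all tools found on PATH")
```

If git still shows up as missing after the install, open a new command prompt first, since PATH changes only apply to newly started windows.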
so, how long until you just start shipping a standalone?
.> i have two things externally installed, they are the easiest things possible to install, and yet still somehow they're constant issues. Standalone might be necessary .-.
they're installed through winget aka the free automatically enabled built-in windows installer magic tool that just does it magically. And somehow Zutamy modded that out of his windows install???
aaaaaaaaaaaa
hahah fax
I installed it
lemme see what happens
:D
:D
now I have
a StableSwarmUI folder
that's what u did say
and another .bat in my desktop
Any Comfy users know the right way to use the base/refiner models to do an upscale or img2img? Wondering if there's a way to effectively manage denoise since there isn't a denoise parameter
Am I supposed to configure the steps a certain way so that it makes less changes to the original picture?
lower the start step of the base
and give it an image instead of an empty latent one
I mean increase the start step
Will the base be doing anything prior to its start_step? For example, if there are 30 steps total and I want to start the refiner at 20, and I set the base to start at 20, will it be doing anything?
I see you got an inspiration 🙂
haha
@late marsh they finally took him down
everyone is inspirating now XD
Oh noes!
damn the cops are amazing on it
phew - what an adventure 😄
er -- set the refiner to start at 20, base starts at 0 still lol
But the Comfy man himself told me to set the start step of the base, not the refiner
base starts above 0 only if you're using base for img2img
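To make the step bookkeeping concrete, here is a small sketch of how the two advanced samplers would divide one schedule. It only computes the start/end numbers you would type into the nodes; the function name is made up, and base_start defaults to 0 because the base only starts later for img2img:

```python
def split_steps(total_steps: int, refiner_start: int, base_start: int = 0):
    """Return ((base_start, base_end), (refiner_start, refiner_end))."""
    if not 0 <= base_start <= refiner_start <= total_steps:
        raise ValueError("expected 0 <= base_start <= refiner_start <= total_steps")
    base = (base_start, refiner_start)       # base hands off where the refiner begins
    refiner = (refiner_start, total_steps)   # refiner finishes the schedule
    return base, refiner


base, refiner = split_steps(total_steps=30, refiner_start=20)
print(base, refiner)  # (0, 20) (20, 30)
```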
how do u think the model will work on a RX580 8gb?
on windows? very badly
Well, I'm just happy if my images inspire others 🙂 Take a couple more!
Just the raw output
damn urs are awesome:O
no but it won't be adding noise either if it's not doing any steps
Thanks, so effectively it boils down to how many steps for the base and then how many steps for the refiner. More steps from the base = more changes.
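One way to think about it, as a rough approximation only: when you feed an existing image in and start the sampler partway through the schedule, the fraction of steps that still run behaves somewhat like the denoise value on a regular sampler. The helpers below are hypothetical and assume that simple linear relationship, which real samplers and schedulers do not follow exactly:

```python
def approx_denoise(total_steps: int, start_at_step: int) -> float:
    """Fraction of the schedule that still runs when starting at start_at_step."""
    if total_steps <= 0 or not 0 <= start_at_step <= total_steps:
        raise ValueError("expected total_steps > 0 and 0 <= start_at_step <= total_steps")
    return (total_steps - start_at_step) / total_steps


def start_step_for(total_steps: int, denoise: float) -> int:
    """Start step that roughly matches a target denoise between 0 and 1."""
    return round(total_steps * (1.0 - denoise))


print(approx_denoise(30, 20))   # about a third of the schedule remains
print(start_step_for(30, 0.5))  # 15
```

Under that assumption, a later start step means fewer remaining steps and fewer changes to the original picture.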
Is there a reason that if you do no steps with the base, the refiner produces a very blurry image?
which do u recommend? linux?
for your card it's going to be very painful on both
sad to hear
but if you can handle linux that's probably going to be faster
Finally managed to get a LoRA to load. Seems as though Diffusers SDXL might not support LoRA scale yet as I get an error about the scale property not existing in the __call__ function when using StableDiffusionXLImg2ImgPipeline. It normally gets added via cross_attention_kwargs
it's just going to be painful because that card isn't officially supported by ROCm and I'm not sure how well it works unofficially
uhh sadly, well I'll see how it keeps going
Just tried. Kohya script doesn't support the refiner training yet.
Posting an example of an upscale where I ran the base for 1 step and the refiner for 10 steps
It's definitely similar but it's not quite as much control as you can get from the normal method of setting a low denoise
dayum
Nice one
LG!!!!
knollingcase SDXL 
how do you get steps and end_at_step as parameter in the ksampler ?
and how/where do i get that brown step-control nodes from ?
convert the widget to an input
i dont understand, how do i do this ?
right click at the node
ok, i found, thanks
do you know what the guy in that example is using in the brown nodes called "steps" ?
that's a Primitive
how do i make one in my comfyUI ?
Would it make sense to add a parameter that sets the amount of noise added per step by the base refiner? Wondering if that would allow for a bit more control over how much the picture changes (functionally a denoise param?).
can one of you tell me how to make a primitive number in comfyUI pls?
hey, dumb question, but is it possible to train an SDXL Lora in Comfyui? (automatic1111 doesn't work for me)
you can use this here: https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
Here is a few tutorials from the inhouse dev: https://www.youtube.com/watch?v=AbB33AxrcZo&ab_channel=ScottDetweiler
it's just a primitive node. it does it if you connect it to the steps pin of a sampler
how do i make one in comfyUI ?
double click -> type primitive or right click -> add node -> utils
thx, that worked
there are also several node packages with all sorts of numerical nodes
I need to figure out how to use all that boolean nonsense so I can automate things
nice
I'm using stable-diffusion-webui with the DreamshaperSDXL model - at first it generated images of this quality. After some time it generates only low quality images (second example, same prompt). I haven't changed any settings/prompts, and I already tried reinstalling the venv. All other non SDXL models work fine and produce the same nice results as usual. Does anybody know what may cause this and how to fix it ?
im sure ive experienced that too on comfy
are you getting any errors?
did you try dragging the good pic into comfy to make sure it can't reproduce the same pic with all the settings exactly the same?
sometimes when my images do that I'll look and it'll be just an endless stream of errors in the terminal window
it looks ok in terminal window
you should post what you're working with. might be easier to figure out
sharpness looks really good. it might get a bit too "hot" if you check the color levels
https://github.com/AUTOMATIC1111/stable-diffusion-webui/ - I'm using this gui , not comfy
yeah, I'd pass that back through and blend it with the original
maybe overlay or something
2048x images, reliable, high resolution, consistent, mended deformities to a certain extent, all stock comfy UI nodes, running in sub 40 second times on my 3080 from initial prompt trigger
if you drag the good pic into the positive prompt field, then click the little blue square with a white arrow button under generate to populate the settings, then try running again?
I like to get the difference between the original image and the over-sharpened image, then use the result to do a dodge or burn on the original
I am messing with the post processing, as the img2img style work I am doing tends to wash out the levels, and I am trying to use the built in comfy UI tools for it
its not sharpened, its just more detail in general, its 4x the pixel count
nice!
fair enough. looks sharpened to me. but I'm not that smart
still trying to find the right color levels ATM
also, you can choose how much "fix" is in your high res
I have tested from 10% all the way to 50%
with no considerable deformities
what do you think the fix is doing other than sharpening and adding detail?
I assume you'll be putting it in your github once determined what's best?
does anyone have a preset .toml file for this?
Base 1024x crop vs 12.5% fix vs 25% fix vs 50% fix
yessir, and on the comfy UI wiki, now that its all stock nodes <3
I see
crisp!
so as you can see, you can choose how faithful to the original it is, or how much it changes it
how does it work?
All of my best finds always come from a leap of faith lol
effectively: upscale, downscale, special diffusion on top, post processing with image x image layering, and that's about it
not much more to share ATM as I find better values for it
sorry, my bad - a different model was used. still, it's a mystery to me why SDXL results are that ugly with the same prompt ((
its even very good at fixing faces
nice. I'm a bit tardy to the party I guess. I should probably scroll up and read
these are both crops, but you can see here
not perfect, but its a lot better
fixed her shoulderblades very nicely as well
even more extreme example here
by changing how much "fix" is in the high res
I really go nuts with these nodes. I've spent years doing random weird things with photoshop so I can bring all sorts of things out of images beyond just the ai aspect
not so hot in the bottom left, lol
we were all stuck on the A1111 workflow before, plus any extensions we added. now we can experiment with all sorts of crazy workflows
I need a workflow to make me a coffee in the morning 😄
I just need to figure out how to do more cfg stuff with comfortui
scheduling, rescaling, etc
comfort, lol. close enough
did I mention my new high res fix has the potential to help text considerably as well? Only tested one time, but after seeing all of this... I have hope
you're going to be like that dude from csi miami when he has the techs upscale license plates
I used to make fun of that
mr cyberpunk over here
I am using Lykon's Greg Rutkowsky LoRA, but using his way of upscale fix results in the upscaled and refined version looking much less crisp... any suggestions on how to get it as crisp as the base version?
it is genuinely only a matter of time lmao
from pencil to 3d model
The refiner often produces a blotchy effect
looks like the base and refiner steps aren't lined up right in the 2nd pic
I can't decide what to think of the refiner
I certainly can't say I know how to use it correctly
what do you mean
the blotchy look
the future of shooting your weapon.. lol
Projekt Red still have a few bugs to iron out
I'm not sure if there's a good way to use the refiner as an upscaling method, using it with the base XL model changes too much and using it standalone doesn't seem to work (just makes the image blurry)
maybe I'm not understanding something
Obviously I can just use the base model with a regular sampler and just lower the denoise, so I'll probably just do that for now and only use the refiner during the low-res image gen
I am messing more with my upscale workflow... and Midjourney should be genuinely scared
for realism specifically
2048x2048
i can feel the fur from here
if all goes well, expect this 1.0 workflow out tomorrow
on my github, and the official comfy wiki
I found if you use the sd ultimate upscaler, you set the tile resolution to 512 or 768. The refiner model is more coherent at lower resolutions
yep 768 is the sweet spot
What exactly does sd ultimate upscaler do? Its a tile based upscaler?
yes
cuts the image into tiles at a more standard size, does them individually, then stitches them together
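The tiling part can be sketched like this. It only computes the crop boxes for a plain grid; as far as I know the real upscaler also handles padding and seam fixing, which this ignores, and the function name is made up:

```python
def tile_boxes(width: int, height: int, tile: int = 768):
    """Return (left, top, right, bottom) boxes that cover the whole image.

    Tiles on the right and bottom edges are clipped to the image size.
    """
    boxes = []
    for top in range(0, height, tile):
        for left in range(0, width, tile):
            right = min(left + tile, width)
            bottom = min(top + tile, height)
            boxes.append((left, top, right, bottom))
    return boxes


boxes = tile_boxes(2048, 2048, tile=768)
print(len(boxes))  # 9 tiles in a 3x3 grid
print(boxes[-1])   # (1536, 1536, 2048, 2048)
```

Each box would be cropped out, run through the sampler on its own, then pasted back at the same coordinates.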
its what I first used for my SDXL upscale workflow, but my new method is faster and yields much better results
Would the sd ultimate upscaler be doing base+refiner for each tile?
sytan, do you have any suggestions on how to implement the refiner model properly in comfyui? I can get it to work, but I have no idea if I'm doing it "right"
went for the black magic path I see
AFAIK it only uses one model, but the image has already been constructed so the refiner is used to fill in the small details
only refiner
https://comfyanonymous.github.io/ComfyUI_examples/sdxl/ drag and drop the top image into comfy ui
my method on my github uses my mixed diffusion, which uses part of the base model and part of the refiner to do one image, rather than running the refiner on top as a second pass
a more refined version should be out late tomorrow, if all goes well
thanks. you know, I went to that page and forgot I could drag images in like that. and couldn't figure out what was informative about looking at those bottles
@boreal bough Wanna play with my beta upscale workflow, see how it works? :p
yes
looks very comic like
lol
I've been using the searge nodes to do the sampling, but not sure if it's necessary
I feel like I've barely scratched the surface of what I'll be able to do with comfy. and I'm probably doing things very inefficiently at this point. but the nodes satisfy me so I must keep going
anyone know how to fix this within kohya?
No data found. Please verify arguments (train_data_dir must be the parent of folders with images) /
use https://github.com/derrian-distro/LoRA_Easy_Training_Scripts/tree/SDXL
using that you can just use normal folder structures and can ignore the typical kohya folder structure
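For anyone who wants to stay on plain kohya: the "No data found" error above is about folder layout. Here is a small sketch for checking a dataset folder before training. It assumes the usual kohya convention of image subfolders named `<repeats>_<name>` (for example `10_mychar`) inside train_data_dir; the helper name and the extension list are my own:

```python
import re
from pathlib import Path

IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}
FOLDER_PATTERN = re.compile(r"^(\d+)_(.+)$")  # e.g. "10_mychar"


def find_image_folders(train_data_dir):
    """Return {folder_name: (repeats, image_count)} for usable subfolders."""
    found = {}
    for sub in sorted(Path(train_data_dir).iterdir()):
        match = FOLDER_PATTERN.match(sub.name)
        if not sub.is_dir() or match is None:
            continue  # loose files and oddly named folders are skipped
        count = sum(1 for f in sub.iterdir() if f.suffix.lower() in IMAGE_EXTS)
        if count:
            found[sub.name] = (int(match.group(1)), count)
    return found
```

If this returns an empty dict for your train_data_dir, you are probably pointing at the image folder itself instead of its parent.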
wake up Samurai, weve got lego to build
whats ur cfg? seems like it's too high
also that's the first time I've ever seen a recognizable signature...that's definitely the cyberpunk logo
that one did blow out a bit, same as Im using for all images so its OK
yeah I am seeing more variation in that effect
whats your prompt on that I am curious
hmm, let me check. probably some randomness,
my explosions are floating away :(
can you train loras with 8gb vram?
that's one I made on dreamstudio. I like to mess with that on my phone when I can't get hardcore on my computer.
Positive
"Bach's infinite analog synth symphony in a subatomic singularity
Negative
Mundane, earthly, blurry, smudged, cgi, low resolution
wow interesting xD
I think I used one of the filters too. but I have to see which
ah don't bother too much, don't want to give you work ^^
either way. it was either the line art or analog photograph style. those two might be my favorites
analog photograph can make some really uncanny stuff
rainbow goblins in analog
tele-goblins
I'm using Scott Detweiler's Base-then-Refiner setup at ComfyUI - here is the Base photo, and then the Base+Refiner Photo - much much more detail and definition in the Refiner!!!
I'm just going to add a second Refiner stage - so the whole ComfyUI process will be Refiner-Base-Refiner - don't know if it'l be any better ... but here's hoping 🙂
first successful 4096x upscale
extremely reckless on a 3080 lol
but on a 3090, very practical
base 1024x image crop
vs the 4096x crop
so much detail
stahp
zooms into the atoms
16.666 million pixels
my kinda demonic number
curious how practical the middle stop is
somebody said atoms?
trying 3072x3072
ok, still too high res for 10GB VRAM
maybe 2662
WOOO
looks like barely lmao
9.8GB VRAM
that is 7x the pixel count of base 1024x1024
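Quick arithmetic check on those ratios, just plain Python on square images. The helper name is made up:

```python
def pixel_ratio(side: int, base_side: int = 1024) -> float:
    """How many times more pixels a square image has than the base size."""
    return (side * side) / (base_side * base_side)


for side in (2048, 2662, 4096):
    print(side, side * side, round(pixel_ratio(side), 2))
# 2048 -> 4.0x, 2662 -> 6.76x, 4096 -> 16.0x the pixels of 1024x1024
```

So 2662x2662 is a bit under 7x the base pixel count, and 4096x4096 is 16x.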
it's interesting when running diffusers cuz it has two prompts chained to different text encoders, and the combination can result in mind blowing images
my mind is indeed blown, not sure if in a good way tho lol
LoRA training for sdxl always hits CUDA out of memory, even when run with deepspeed VRAM saving.
need to get an a100
idk, but the previous diffusers LoRA training can run under 8GB GPU VRAM. sdxl can run on A100 40G for sure
I tried to get gpt4 to tell me how to build a laptop with dual a100s
hope you got a strong lap then xD
i use this trainer too and it's my favorite so far
it started going on about how laptop cases were too small. I told it that I wasn't looking for reasons it couldn't be done, just to tell me what to do. I think I'd need to carry around an external lithium battery to keep it running
batch size 2, activate gradient checkpointing (that works with my 3090)
Does a higher batch size in lora training use less or more vram?
surely more, right?
more
cool, thanks
this
my dude got some serious medical condition
is it even possible to train lora with 8gb of vram?
that's just the moon's gravity distorting your view
theyre doing the moon walk
What's strength model and clip for loras in comfy? Do I set value for both? I know how loras work in a1111, can someone explain?
dunno, i used to have a 2070 super but never trained with it
maybe with some heavy gradient checkpointing
Yeah it just keeps giving me this
try with 4 accumulation steps
go higher and see what works or not
so consensus is we shouldn't be running loras on the refiner model?
i run lora on base model yeah, then i run the refiner for 5-2 steps
will do thanks
in your case always batch 1
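The idea behind batch 1 plus accumulation steps, as a tiny sketch: gradients from several small batches are summed before each optimizer update, so the update sees more images than fit in VRAM at once. The function name is made up, and the assumption here is that VRAM use follows the per-step batch size rather than the accumulated total:

```python
def effective_batch_size(batch_size: int, grad_accum_steps: int) -> int:
    """Number of images that contribute to one optimizer update."""
    return batch_size * grad_accum_steps


print(effective_batch_size(1, 4))  # batch 1 with 4 accumulation steps -> 4
print(effective_batch_size(2, 1))  # batch 2 without accumulation -> 2
```

The trade-off is time: each update needs grad_accum_steps forward/backward passes instead of one.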
I just ran the same thing through without refiner loras. it's slightly different, but can't tell if better or worse, lol
Detweiler's triple Refiner-Base-Refiner - works, but as I do a lot of art - I'm not convinced that the extra Refiner-B4-base does any good? 🙂
where is this detweiler triple refiner?
My RTX2070 is always maxed-out by A1111 (8Gb VRAM). ComfyUI is much better at memory management. SDXL is making RTX2070 GPU's look underpowered!
I managed to start the training on my PC (4070 12G) by setting
set PYTORCH_CUDA_ALLOC_CONF=max_split_size_mb:128
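That line is Windows cmd syntax. If you launch training from a Python script instead, a minimal sketch of the same thing is below; the assumption is that the variable has to be set before torch initializes CUDA, so it goes above the torch import:

```python
import os

# Only set the value if the shell has not already provided one.
os.environ.setdefault("PYTORCH_CUDA_ALLOC_CONF", "max_split_size_mb:128")

# import torch  # the torch import would go here, after the variable is set

print("PYTORCH_CUDA_ALLOC_CONF =", os.environ["PYTORCH_CUDA_ALLOC_CONF"])
```

Whether 128 is the right split size for your card is something to experiment with; it is just the value from the message above.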
I have a 2070 Super, you have to use --medvram
if you dropped the cfg by one or two I think it'd pull out more detail there
Yes, --medvram is no use on my rig - always warned High Memory Usage - VRAM AVailable 8Gb - VRAM Used 8GB 🙂
or maybe not, but maybe
idk,the diffusers official page sdxl lora training run on A100 40G
Even on 10 steps it doesnt work lol
Without medvram, you won't be able to go very high in resolution with A1111.
I have abandoned A1111 for SDXL - now using ComfyUI 🙂
What's strength model and clip for loras in comfy? Do I set value for both? I know how loras work in a1111, can someone explain?
then you might just not be able to, training at 1024 res eats a LOT of resources
I have just implemented Scott Detweiler's triple-process of Refiner-Base-Refiner - and its producing some stunning results
we know, bud
Share the workflow?
Wait what, how does that work
I guess I should google that name
anyone have problems with ComfyUI running out of system memory? my 32GB has a tendency to run out and --gpu-only runs out of VRAM (RTX 3060 12GB)
If I post a photo, and if it holds its metadata - you can drop it onto the ComfyUI Desktop?
nice
yes, the .png has the metadata
This was done using Refiner-Base-Refiner - I hope the graph comes up when you drop it onto your CUI Desktop?
Here are some more Triple-Process R-B-R
As a general Rule - the base photos look as if they need a good sharpening; while the refiner photos are crisp and clear
Do these use different denoising percentages for each step? Like 0%-20% then 20%-80% and 80%-100% or something similar?
no metadata for me at least, try to save and upload the .json file instead
why is it that the service can do portrait similarity and the SD has to be trained on the face?
OK, will save the .json
Detweiler Triple-Process-R-B-R https://drive.google.com/file/d/1LLIY5IGwGFQ7fwiAg1VpysPD6g2Xorpo/view?usp=sharing
Hope it works?
my compact LoRa workflow with Ultimate SD Upscale (you may need the aspect ratio custom node)
seems good (did not have SDXL 0.9 models downloaded, so had to replace with 1.0)
Oh! Silly me, I should have used 1.0 as well --- I will try again 🙂
hmm, I guess I'll try it out
anybody know why I get an error on the ksampler advanced?
could anyone tell me what "clip text encode SDXL Refiner" does?
I see you have batch size of 10, anyone know how to reproduce a single image (i.e. say I liked image nr 5, how do I re-run the workflow for only that image?)
some of this stuff still goes over my head
batch size 1 at Empty Latent Image - then at KSampler set noise seed to your own value, and change the setting (control after generate) underneath to 'fixed'
drag the image into comfy. it will have all the settings for that image
nvm looks like WAS number nodes are a bit fucked
What sampling method should I use for SDXL 1.0
I think I read ddim might not work well. maybe just try a few out and compare
is it possible to modify existing nodes so I can do things like have seed numbers sync up in all the samplers? is there some obvious method I'm missing?
KSamplerAdvanced
Find its SEED and reuse that
and what about iterative functions? is it possible to run it through the same sampler x amount of times? or is it easier to just copy the sampler and repeat it?
im using a1111 not comfyUI
Just update A1111
If you have Image Browser in A1111 you can find the SEED there
gotta git pull a1111. it doesn't do that for you
i got 1.5.1 the latest version
Dpm2sancestral, dpm sde, DPM 2m Karras, ddim, Euler a
Dpm 2m sde
So you should have an error message, post it on pastebin and send it there
error message?
You were talking about Ksampler error?
A1111 - go to Extensions/ find the URL and direct d/load Image Browser
Terminal?
no
Ah, right
He was asking which ones to use
personally i didn't change my habits
I got to admit I like 0.9 much better for artistic output - 1.0 is so faded and pastel-like 🙂
if I'd known I could run it on my subpar 6gb video card I would have tried to get 0.9. but read it required 16gb
I have 1060 6gigs, and I could run sdxl 0.9 not pruned, I'm guessing so could you
all good though. I'm figuring it out now. tbh I can make things at almost the same level with 1.5. but it takes a lot more, and there are a lot more misses
trying to create a more rugged look, not exactly what I'm looking for yet
liminal spaces
What's strength model and clip for loras in comfy? Do I set value for both? I know how loras work in a1111, can someone explain? Man someone tell me
well it's liminal for sure. very nice image degradation in the first one
Yeah using prompts like: hot pixels, extremely poor quality, low res, pixelated, worst photograph, dim light, Nokia camera photograph
