#✨|sdxl
1 messages · Page 68 of 1
I mean a friend of mine bought two full brand new active backplate block sets for his two A40s for AU$1200 total
see, damn expensive
they're solid copper and low volume
That is the only reason I never grabbed one of the server cards
or you can use two $10 fans and a 3d-printed fan duct 
AYAYA
I ceased 3d printing 4 years ago but the fans are loud
people sell the ducts on ebay for about 10 bucks
and the fans don't have to be loud
yep
they don't actually need as much airflow as you might think
every one I saw using them you have to leave the room
would like some noctua 120mm work?
ehhhh
😦
generally you want a blower or two 40x28 server fans, which can be found in "reasonably quiet"
it's just not straightforward
it has to blow down a channel it has. These were build for datacenters
ah fair
they're built for servers like the R740XD that my two are in 
wow, nice
on a 4090? there's no backside VRAM on 4090 or 3090Ti, and an EK block is ~$390 USD https://www.ekwb.com/shop/ek-quantum-vector2-fe-rtx-4090-d-rgb-abp-set-nickel-plexi
the A40 heatsink is actually identical to the A6000 one
and the shell is almost identical
you can swap the cover and fan from an A6000 on
but the bastards depopulated the fan header and PWM drive components, it's a Weird Tiny Connector, and the VBIOS won't drive a fan anyway
you can duct a big ole blower onto the back of it if you have room in your case but a pair of the quieter AVC 40x28 fans is usually the go
Just saying what Jay used and some I have seen are 500ish. I just purchased my first AIO and when it goes I will go open loop.
yeah 3090 is a bit different
as are the 48GB cards
because you do need to cool both sides of those
A40 seems to be about 5-6k used being a Tesla A40 ampere.
also, little PSA: if you're running a 4090 and you're using xformers, either don't (and just use torch SDP attention) or compile it yourself; the precompiled wheels for xformers are built with a stupid TORCH_CUDA_ARCH_LIST that does not include binaries for Ada or Hopper
i bought two for US$4k total
I've got no real tech skills, but is it possible with double GPU's, and if so even useful to add an old NVIDIA GeForce GTX1080 8GB GPU?
GTX 1000 series are a waste of time for AI
tell me about it with my 1060
yeah 1080 is pretty slow
Thank you. In my MX Master 3S Logi Options+ app, I had it set to "smooth scrolling", which is great for websites, but no so good for ComfyUI as it turns out. Too bad I have to sacrifice one for the other, but Comfy takes precedence at the moment.
Pascal can't do fp16 worth a damn
check out the advanced mouse options control panel thing where "enhance pointer precision" lives
technically it's not even emulated
it just runs as fp32
Which is what I meant
hands it off to fp32 but accepts the fp16
1650/1660 will not even do that
all GPUs should be able to load fp16 weights as fp32
Someone did some bad meds when making 1600 cards
the only exception to "pascal considered harmful" being the P100
which was explicitly built solely for fp16 zoom
and is unfortunately kneecapped by being stuck at 16GB
Yep, I almost purchased the P100 in Feb but the fans and blowers from hades I didn't
glad I didn't now with XL
textures can be stored in fp16 so GPUs can load them efficiently
yeah, you get the memory savings of fp16 but no performance benefit
it's just a question if pytorch is smart enough
also yeah by default CUDNN on a TU11x will decide that fp16 doesn't work
have to do some shenanigans to make it accept fp16 weights and run them in fp32
I had to compile xformers for my 1060 around 5-10 times and it was not an easy task. A1111's wheel finally worked but for a while there I had no choice.
Thank you, Neggles. Good suggestion. I've got it working fine now. Just had to deactivate a custom setting in the mouse manufacturer's software.
but realistically what you should do if you gave a GTX 16xx is sigh, throw it out the window, and buy a used 3060 

yeah the smooth scrolling stuff is not really worth it most of the time
idk why people keep trying to make mice behave like touchscreens
what is so aggravating is that businesses are complaining they can't get the 4080 or 4090 as they ask for 5 and get 0-2 because Jensen has stopped making Ada cards to control supply to keep prices high. At the current supply side there is a year's worth of cards in their warehouses.
I could see the supply dwindling on pcpartspicker
Zotac, no thank you. Gigabyte, I will pass. Not really much left.
MSI ceased to make the air cooled Suprim so damn
shrug another friend of mine has appx. 280 zotac 3090s deployed with waterblocks in machines in a datacenter in sweden
they've been spinning since launch and they've had a total of 3 failures, all from the same batch within the first 3 months, zotac preemptively replaced the entire batch (about 40 cards)
EU has far greater consumer protection laws than the USA 100%
shrug YMMV but I mean, ASUS and Gigabyte have always made garbage cards, MSI's 4000 series are... underwhelming
and have mad coil whine issues
Yep, to all of that
zotac were slightly cheaper than everyone else and actually put some effort into industrial design so here i am
also didn't fall for the 600W power limit meme which is nice
I used to be a Gigabyte fanboi then I had a run in with them and will never touch them again.
after the whole PSU debacle they're completely dead to me
Another option is 4090 Founders Edition cards from Best Buy (at least in the US and Canada).
FEs have pretty mid cooling though
I was going to do FE but never in stock in either BB or Nvidia so I ceased looking last month
yep, snagged one a few months ago from bb, even got a 10% coupon in , was like 1420 with tax
Zotac GAMING AMP Extreme AIRO vs Zotac GAMING Trinity OC
AIRO is the higher end one but they're pretty much the same
i have the airo, it has a slightly better VRM but not in a particularly meaningful way
price difference was $30 if it had been much more i wouldn't have bothered
30 USD difference
yeup
it's very quiet too
no coil whine, fans don't have any harsh tones
runs cooler than my friends' STRIXes 
I am waiting until after gamercon in a couple of weeks as I may just skip this nonsense and settle back to a good life again with an XTX. Not much caring if it can do SD or not just I need a card and not willing to be a victim to gouging.
why on earth you'd pay a $300 premium for a STRIX card that looks like an emo house brick and performs the same as every other 4090 is beyond me
but people keep doing it
yep, LOL
people pay for the brand, they like Asus
Asus with "issues" scared me off
asus is a cult not a brand change my mind
I have their monitor that will soon enough be upgraded after years and years of running. Nice IPS mon
with a panel made by LG and electronics made by Samsung 
the only good ASUS products are ones they barely made
yeah...no touchy. I SAID no touchy. Now sell it.
and after the absolute insanity they pulled with AM5
I will not touch them for that "issue" I mentioned.
"ah yes lets just blindly believe the values encoded in DDR5 SPD and completely fail to enable any overvoltage protection whatsoever in our voltage controllers what could possibly go wrong"
"oh, that"
Mushroom cloud
"well here's a beta bios to fix it but if you install it it might not fix the problem and also you won't be covered by warranty if it still explodes"
how about no.
yeah, what a load of you know what
it was very vindicating tho
I hope someone can come up with an inpaint extension for comfy like auto has then it will be great
i've been calling out ASUS' garbage build quality and approach for over a decade and now there's a Big Shiny Fuck-Up I can point at when people ask why
What is wrong with Asus cards? I plan to buy STRIX 40xx so pls tell me 
huge waste of money
Asus Strix are always top performers in benchmarks so wheres the problem?
performance is identical to every other 4090 on the market
there are no performance differences between Ada cards
they all run right against the redline
Oh, I had one of those 2008 ASUS boards that they saved 3c per electrolytic cap cause they went to off brand and years later, with no power they blew up as soon as I plugged them in. All of them vented.
So all of these benchmarks online are scams?
the only difference is cooler performance and the STRIX runs hotter than cards which are $300 less
show me a benchmark
all the benchmarks carried out by sites that actually follow a documented, repeatable benchmarking process show performance differences in the 1-2% range which is purely silicon lottery
buying a STRIX card is spending $300 to buy a card that's so huge you can't install anything else while it's in there, you'll struggle to fit it in most cases without dangerously bending the 12VHPWR adapter (so realistically you need a 12VHPWR modular cable or a longer extension adapter cable), and it performs the exact same
The amp airo is that a single connector or the new 12V one?
12VHPWR
good, my new PSU has that
comes with the good-quality adapter but I've got a modular 12VHPWR cable here just not gotten around to installing it
My monitor doesn't have DP only old school DVI-D and HDMI so I need a cheap cable to convert DP to DVI-D
dual link
The one with water cooling AIO doesn't look so big 🤔
and is even more money for no performance benefit
So theres no difference even in water cooled vs air cooled? 🤔
nope!
size and loudness
every single 4090 in existence performs effectively the same as every other one as long as you can keep temps below 85C
they run hard against the redline because they can™️
(and because they're afraid of AMD)
Another reason the AIBs are mad at Nvidia
yep
but jensen wants his (your) money
or had bad other issues
FE was the only 4090 that would fit my case. I have a new case/chassis now.
What about electronics quality? One of the comparisons mentioned Asus uses higher quality internals as some others
some brands, like zotac, do scrimp on how many chokes and vrms are used
uh yeah, I saw them open them up to show
the AMP Extreme AIRO has almost as many phases and the same voltage controller as the STRIX my dude
Gigabyte is bad about it
gigabyte are just as bad as ASUS, if not worse
You didn't let me finish as I was about to say if you get one that mentions OC on it get that it will have more
extreme or OC
they drop a few phases on the Trinity cards but the VRM is still capable of delivering over 500W perfectly comfortably
and since these cards lose basically zero performance when capped to 350W, who cares
also the AMP is $30 more and has a 24-phase VRM that could deliver 700W no trouble
I had my run in with lack of phases on my old mobo. I prefer to get as many populated as I can
zotac are using a 55A drMOS driver vs the 70A SPS in the STRIX, but that 70A SPS is over-rated and in both cases it's irrelevant as they can both deliver well north of the max the GPU could ever want
yeah no gigabyte are worse
a zotec had 3 or 4 but not AMP
i would buy an asus card before a gigabyte one
they drop to 20 phases on the trinity yeah
yep
but the card's power limit is lower as well so that's quite reasonable
that is what I am talking about
YET it say O/C which I presume is thanks to having 2 more than Gigabyte
except cutting my 4090 Trinity from its max of 495W down to 350W knocks so little performance off that it's within margin of error
how did that affect SD?
immeasurably small difference
25% less power 10% less FPS in games
495W is slower than 450W by about 1%
maybe
margin of error
350W is slower than 450W by about 2%
I have heard that before and that makes no sense
giving the GPU more power doesn't help when it's limited by memory bandwidth
495W is slower because it raises the temperatures enough to knock about 50mhz off the clocks
yeah, I will be bottlenecked as it is so doesn't matter
So whats the lowest reasonable wattage we can run them lol
below 300 you start to see significant perf differences
Alright I was thinking that the 495 was cooking it to death. Got it right
you limit that in afterburner or how? In Linux greenwithenvy is gone so I hate that
yeah, my pascal can't even take a minimum setting as it is not supported smi said
there's a min and max limit
not on my 1060
nvidia-smi -q -i 0 and look at the power limit section
my 1060 6GB goes down to 50W
600 watts lol. Thats what my whole setup drains on max right now
min setting I mean for frequency as 135mhz is stupid and makes my desktop sluggish
eg you can't set an A40 below iirc 100W? because the VRAM pulls 60W all by itself
card idles at 78W the moment you light it up from P8
I was hoping to set 2GHZ max freq and 600mhz min but it would take the max but not the min and afterburner doesn't work with min freq on this card either.
there's not a lot of benefit to adjusting frequencies even on pascal
yeah, there is
pascal there can be some benefits from overclocking
for me and the desktop
but pascal was the last gen where you could really make a meaningful difference
I fall down to my entire system taking 65 wattrs at idle
apart from some 2000 series cards
ampere and up it's a wash, half the controls/sliders don't actually do anything
voltage control on Ada is almost entirely gone
LOL
and dragging the slider up from 0 to 100 only slows your card down by making it run hotter
well, if I set my card to run at max always, and reboot my desktop runs so smoothly
afterburner settings don't persist across reboots 
card gets too hot though so setting a minimum would help but can't on pascal
you can turn down the PL or thermal limits
PL is most effective
but unlike Ampere/Ada you do actually lose performance for every notch down
yeah, but that only lowers my max which is counter productive
yeah
I wanted to raise my min
dropping idle power is difficult at best especially given how many programs decide to light up the GPU these days
On Turing you can
you can force pascal to sit in the first non-standby power state with a semi-undocumented nvidia-smi command
but it's usually counterproductive
would that let it sit at more than 135mhz?
fun fact: loading yahoo.com puts more load on a GPU than running half-life 2 at 1280x1024 did at launch
1060 is slow to wakeup
yes but it will bump your idle power up 20W for the privilege
fair tradeoff
aite lemme find the command
I would presume ADA wakes up faster than Pascal?
yeah
you shouldn't really notice in interactive usage though
this flag is intended for cases where you're spinning up lots of very short-lived bursty gpu processes
once it hit 1600mhz in web pages it is snappy but 135mhz it can take 1-2 s
h-uh. absolutely shouldn't take that long
as long as I can put it back if it isn't what I need
yeah
if your cpu is half decent turning off hardware accel in browser might be worth doing
One more question for the crowd. What do you think is the best value card right now? If you were to upgrade from something that is veery old and should last a few years again?
if you're doing AI stuff? used 3060 12GB or bite bullet and buy 4060Ti 16GB
or the ARC A770 which is massively underrated especially for linux
on the other hand alchemist is about to get refreshed
with the memory controller bug fixed
I am still holding out hope for Druid and Celestial now that the ceo is canned.
his stink is no where on the cesestial.
*celestial
wha?
ARC is Gelsinger's pet project he's killed entire business divisions at intel to keep it alive
The one Intel fired
that was years ago tho
Battlemage looks like it's going to be quite competitive at the mid-upper end and if they can keep it up Celestial will be right up there at the top with the best of 'em
the one who helped make the turd Vega series
they didn't fire him lol
nor did he make Vega
Raja is responsible for RDNA and RDNA2
he's done the same thing at Intel as he did at AMD - help them make the first gen, get them properly going on the second gen, then get bored and go elsewhere
They fired him and it is well known they did. His stench is gone now and good.
the only source on that is MLID who is questionable at best
Raja Koduri, the former head of the Intel's AXG unit is leaving Intel by the end of March. As part of the company's reorganization. Corporate speak for they let his ass go which is fired. No one liked him and none cried when mentioned he was leaving.
and he literally quit because he spent 3 months sick in bed and in and out of hospital + his work was done & they're folding AXG in with the habana group over the next few years anyway
shrug he's literally the reason that AMD is competitive today so

QUESTION!!!
I HAVE A QUESTION!!!!
I have a question*
How much Vram do I need in order to train SDXL?
does it already have dreambooth support?
12GB and yes
WAIT!?!?!?!?!?
WGHAT!?!?!?!?
SO I CAN WITH A 3080TI!?!??!?!?!!?!?!?
3080 TI?!?!?!!?!??!!?!?!?!?!?
iirc those came in 10GB and 12GB but either way yep
I HAVE 12!!
then yes
use this https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
use bfloat16 mixed precision and don't train the text encoder, 1024x1024 train res batch 1 will work
YYYYYYYYYYYYYYYYYYYYYEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEESSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS
why not just try it first?
training a DB for XL can a 4090 do it yet? Last I saw was just unet was taking 23GB at BS1.
lmao
I had 0 hopes
in what world
4090 you need to either not use xformers and use SDPA, or compile your own xformers
waaaaaaaaaiiiiiiiiiiiiiiiiiiiiiiiiiiiit
it's LORA!!!!!!!!!
dreambooth works then to fully train G/L and the TE as well as Unet?
you do not want to train the TE
why not?
the SDXL TEs are already very well trained and attempting to train them almost universally results in Worse Output
they're very, very picky
you don't need to train them
just training the unet works exceptionally well
wait
lora
is not the unet
is like an extra thing, no?
how much ram is needed to train a dreambooth model for XL then and at what BS?
dreambooth is just a word for doing lora training basically
technically you can apply the same approach to a full finetune but
(the prebuilt xformers wheels are built for 5.0+PTX 6.0 6.1 7.0 7.5 8.0 8.6 which doesn't include Ada (8.9) so you're better off using torch SDP attention)
The dreambooth I am talking about came long before loras
when I dreamboothed 1.5 I didn't Lora it
yeah but it's a bit silly
full finetune works fine as well resource requirements are about the same
making a full new model is never silly
so I can't?
you can
I thought the reason why they didn't build it for 8.9 is because it didn't change anything
it sure as hell does on my system 
mostly because see that 5.0+PTX
for the forward pass?
I'm the creator of OREN-4 by the way
or the backwards?
the world's best model
if it was 8.6+PTX it'd be minimal
i lose a solid 12% of it/s
on this card xformers prebuilt public wheel is significantly slower than SDPA but if i use a container built on top of nvidia's NGC container and compile xformers for 7.0 7.5 8.0 8.6 8.9 9.0+PTX it's significantly zoomier
the problem is 5.0+PTX which limits the PTX intermediate code to friggin' Maxwell functions
PTX 3080 TI
this is a silly question and im 95% sure the answer is "no", but, there's no backward in inferencing yea?
backwards is training
thought so
I haven't compared training perf on the differently-compiled xformers wheels, i just recklessly abuse github actions to build it for me (it does take almost 3 hours to compile on an actions runner)
works out smaller too, at the expense of maxwell and pascal support which (personally) i consider a waste of time but ymmv
If you guys wanted to train something
something good and big and important
something like OREN-4, the world's best model
would you
a. Train SDXL LoRA?
b. Train 1.5 with everything
c. both
Which one first assuming you only have 1 GPU?
yeah I built xformers on github actions for comfyui once and it took a very long time
order wouldnt matter to me unless you have a deadline
i'd do the one you're more interested in
Which one am I more interesed in?
if you have 0 preference then flip a coin
I think that I hate LoRA and love making SD 1.5 models, but maybe LoRA is good now and 1.5 is ancient
does LoRA training even make sense for a giant dataset???
Lora was great on 1.5 ...
like 3000 images with captions
not really, it couldn't do what I needed
it could put a style or a something in the image, but not make a different thing
like how I made OREN-4
i often too would get much better results with a full dreambooth model on 1.5 training, but i didn't dump tons of time into it once i had something that worked. But as far as 1.5 even the diffuser docs and scripts only mention lora's as far as dreambooth goes... https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/README_sdxl.md
LORA can absolutely make something that isnt in the base data set.
It isnt a checkpoint, but it does its job excelently
yeah, a thing, or a couple of things, but not like a different thing in terms of different things
like OREN-4
It couldn't do something like OREN-4
you've really been out here repeating oren-4 being the best model for half a year
Well, it is technically LoRA using dreambooth method. Dreambooth is furthwer up the food chain than a lora.
nobody thinks that, just get over it
because it is
I asked people objectivly and they said it is
people who hated me said it too
want a proof?
like the people on your reddit post
bet, let me give you a prompt
they didn't see the comparison and weren't objective
Maybe I should dial the noise down on the upscaler lol
give
Positive: An anime drawing of an astonaut riding a dinosaur in space
no
It's a photography model
Positive: A woman wearing a red shirt, blue pants and green shoes with black hair
NO regional prompting
yes i understand this, but i'm just saying they don't give an example in the docs, so unless you really know machine learning or find someone that might, they don't give an example of it
a photography model can do good anime art as well
Yeah, I miss Dreambooth from Shiv.
not oren 4
High-altitude photograph of (nido dog) on a mountain climbing adventure, rugged peaks, expansive views, gear detail, dynamic weather, photography inspired by Jimmy Chin and Renan Ozturk
Negative: anime, deformed, glitch, blurry, noisy, off-center, cross-eyed, closed eyes, bad anatomy, ugly, disfigured, mutated
replace (nido dog) with (springer spaniel)
do the second prompt I gave
I'm using A1111. For sdxl should I be using the base model and generating 20 steps then sending the generated image to img2img and then running the refiner model for 30 more steps? What is the typical workflow? Thanks
I'll do
too much bloom
"she is the hunter of pets"
just because
nice image with too much bloom, oversaturated, too many arms, weird hands and feet, too smooth and not much detail
during the livestream today, a dev was suggesting 32 base 8 refiner (total of 40 steps) but there's no "best" if thatll ever be a thing
but Oren-4 automatically make any IDF soldeir with almost correct uniforms and always as a female
I want my woman wearing a red shirt, blue pants and green shoes with black hair already
that's a very niche application
Thanks I will try it. Did he mention anything about the denoising strength?
not sure on that one sorry
loading SD now
ok, and remember:
- no after detailer
- no regional prompting
- screenshot the A111/comfy ui and prove its 3 randomly generated seeds aswell
SD 1.5 models aren't going to be able to get the colors right and asking it to will probably make it mess up the face too.
thats the plan
SDXL can
no after detailer is upscaler?
yes, raw output
The best model can handle regional color prompts and generate a good image every time with any seed
I expect both
My model isn't good without the latent thing, it's the best because it overcokes the image in a cool way
then your model is not the best in the world
SDXL is
colors are all wrong
Check out that it made all of the women hot
This only my model can do
Even if the colors were right, the images were terrible...
No
let me upsace and you'll see
not my fault, SDXL has better clip, thus is better model
You're going to use something else to replace the faces aren't you?
NOOOOOOOOOOOOOOOOOOO
NO!!!
NO
I'M NOT
just latent upscale
lol
could always check the png data
Seee????
I think there is one person in the world that thinks your model is the best.
where are her pupils
IT IS THE BEST
She doesn't have them
notice that the pants look almost like IDF soldier pants
A. She is not hot
B. It’s clear your trolling
because it's trained on IDF soldiers
Here!!
this one came out better
since when do IDF soldiers wear bright red shirts
Notice the boobs
Shoes are supposed to be green
looks like a mannequin
They don't but it listens to the prompt and not just IDF soldiers
it's a huge datasets with captions
the captions talk about images like they are normal
but the images are insane
it's the best dataset
so what you're saying is that the pants could be blue if the model was better
NOOO
But, purely a question, is it against the rules to advertise a paid model?
It gives you a better output than your prompt
green pants are cooler
Feel free to give me more prompts
Wait... he charges money for this? 😆
you'll see
I made money on this once
According to Reddit, yes
Maybe it’s changed now to be fair
Someday you will make 1 more money and then you can say you made money on it twice.
Probably not today
Anime drawing of an old man
It's a photographicccc modelllllll
do Pippi Longstocking
A photo of a woman drawing an anime drawing of an old man
epiCRealism can do great anime outputs
Epic realism is just the best on 1.5 tho, that model is so good
a girl in the style of studio ghibli
true, my favourite model
hopefully they'll train it on SDXL soon
I hope so aswell. All my mediocre Lora’s were 10x better by just using their model
Pippi Longstocking in the style of studio ghibli, generated in epiCRealism, a ''photographic model''
somehow it works
better than yours
notice the IDF pants color
see, now you have no excuse for calling it a photographic model 🙂
Is this regional or a single prompt?
MINE IS BETTER!!!!!!!!
well first pass SDXL + second pass WD1.5, each have their own prompts
that's not studio ghibli style and it lacks some detail in the brackground
WWhile You Were Partying, I Studied the Braid.
Is that a prompt?
No, she just has two braids in her hands like swords.
There was a lot of hair in the dataset
Used it as a prompt
Oh no
I’ll give you that I’ve had some fun in the last hour
Quite enjoyable
oh so you are promoting your model
no
which is against the rules
oops
They have been the whole time
I know, I just wanted him to say it
Ahhh
Also it's against all logic, nevermind the rules.
To be fair, you did link your patreon
non-objective 
messing about with upscaling. 9.5k square image in 150 seconds on a 3080. now I need to work out how to get more details
More details? That image is pure fire
at 1:1 the faces arent corect
are they drinking orange juice with their burgers
XL seems to like juice a lot when you mention just "drinks"
Are alcoholic beverages not trained on?
ive seen shot glasses, so I guess so
nudity and juice, an AI's guide to the perfect breakfast
I have yet to find a workflow I like. Something simple where G/L goes to the sampler and I can see an image then another image is made post refiner. Seems that first image is a horrible mess with everthing I have tried. I thought I could just tap in to the link from sampler base to sampler refiner but nope.
iu have a dataset of 40 images
im sturggling with overtraining
idk what epoch to use
well that's mildly disturbing
I thought that was we were doing now. People heads on robots.
made this while tipsy, damn idk if ill ever get the ai to recreate this actors likeness so well again
idk I've been doing this one
Human ears on animals. That's a good one!
I think human teeth on animals is also very weird. Like smiling and stuff.
Yes like that!
ah yes, 8192x8192 because why not
anyone know the specifics on how caption dropout works?
haven't used it till now
so if i do caption dropout every n epochs = 1
caption dropout = 0.5
A.) so that means 50% of comma separated tags are dropped?
B.) on epoch 2, are 50% of all captions going to be dropped, or 50% of all remaining captions? (essentially until not single token remains)
C.) is keep n tokens immune from this setting?
don't zoom in too close though, because then you will see the imperfections
I’d highly recommend setting up a custom node for your inputs. Have that node do the math for you and then connect it to ksamplers and an SDulti upscale node. I did this today and it simplified my workflow massively.
Do you have any idea to use a input image as reference in sdxl like the same way in midjourney?
How does it work in Midjourney?
Midjourney will generate creativate results but these results are very related from the input image, like style, object.
oh god never mind. found kohya's officialy docu and translated it
Dropout caption every n epochs
Usually, images and captions are learned as a pair, but it's possible to train just on "images without captions" every certain number of epochs.
This option allows you to specify "drop out captions every ○ epochs."
For instance, if you set this to 2, you will conduct image training without captions every 2 epochs (2nd epoch, 4th epoch, 6th epoch...).
By training on images without captions, it is expected that your LoRA will learn a more comprehensive feature set from the images. It can also help prevent the image features from being tied too closely to specific words. However, if you use captions too sparingly, your LoRA could become ineffective at prompts, so be cautious.
The default is 0, and in the case of 0, caption dropout is not performed.
Rate of caption dropout
This is similar to the "Dropout caption every n epochs" mentioned above, but during the entire learning process, you can train on "images without captions" for a certain proportion of the time.
Here, you can set the proportion of images without captions. 0 means "always use captions during training," and 1 means "never use captions during training."
Which images will be trained as "images without captions" is determined randomly.
For example, if you train LoRA with 20 images, reading each image 50 times for just 1 epoch, the total number of image learnings is 20 images x 50 times x 1 epoch = 1000 times. If you set the rate of caption dropout to 0.1, 1000 times x 0.1 = 100 times, you will train on "images without captions."
The default is 0, and all images are learned with captions.
I was very wrong XD
One way would be to interrogate with Vit-H using the “best” preset and use it as a caption. This gives an image that feels like the first image
Last that you may need control nets when they release
I don't actually know how and spinning my wheels getting nowhere only frustrates me. No upscaling for now anyway just something simple.
okay, that canbe a good assumption。
Have you tried Comfy’s basic example?
That’s just inputs and ksamplers
yes
tbh, this comfyui is not for me only I am forced into it because auto's is DOA for XL, and limping for much else now.
I like to dissect things only in this I can't do that. I can't click a wire and see where it goes and most times it just a bag of frustration.
on the input of a node left click and hold, the connection is highlighted
in some of the complicated ones that is highly inferior and makes one easily lost, at least for me.
dragging a box to move it out of the way of the wires only makes it worse.
Oh, and another thing I miss from other node based systems I have used is click the wire and press delete and it goes poof, and ctrl-z to bring it back.
now thats some fine tuning if your usin that lol
have you thought about trying SD.Next?
literally steamrolling the sdxl model XD
It's a branch of A1111
to reroute connections. some use them to cleanup the spagett but you can have multiple connections off them
i think i have one.....i will find it
find the worlds smallest doggo in the background! XD
This has a bunch of them but didn't help, lol
lol yeah I accidentally stumbled upon a way to make Where's Waldo, and the tricky part has actually been eliminating them from the output.
the joys of tiled upscalers
they still give you best quality tho just need cleanup sometimes
most simple with and without refiner workflow i have
Hi everyone, so I just download the XL model along with the refiner, put it in the model/stablediffusion folder, but when I'm trying to choose the model, it processes it, but then it doesn't change the model at all but stay in the previews model I was using. Any idea what's happening or how to fix it?
winner winner chicken dinner
Hey Nido is back 🙂
Thank you. /fingers crossed
just finished testing a combination of upscalers to stick on the tail end of that giant workflow I screenshot earlier
A few more prompts couldn't hurt...
That looks remarkably similar to the one I made but the image before refiner (the base image) was always funky
i wont say its great, it is the very first one i used when .9 was out....i just modified the look of it. I dont use it anymore, but it works
i seem to get better results from another one i can send you if you want.....but it doesnt show the image before and after. I feel like it just saves a little time
it also has great notes that describe what is going on
Thanks, I thought the same as you that it dropped a certain portion of the captions at certain epochs.
Though now I want to be able to train captions without images so I can reduce the size of my datasets.
yeah, I was using the before and after for testing and learning
I am wanting to see what various prompts do for refiner
ah....then that one may be the one to use.....and it is simple to modify
send me the other, if you would
i like this one because there is a nice way to tell it total steps, and where you want the refiner to start

2704x1544 in 98s
@vital ermine I am working on getting one put together that doesnt use refiner at all because the brand new finetunes do not need it, and im going to add image to image. all very simple.
pretty good!
im learning how to do that as we speak
I think I might have figured out a little workflow trick to improve image output quality. or maybe someone else has done it before. but I haven't esen it
Guis guis guis
Is this it?
very nice
controlnet on SDXL, yes please 
but anyways, I'd have to test it out more to actually confirm it improves things, but thus far every instance has been positive
Not finished or not tested or summin like that
Just wondering if anyone has tried yet
I miss a viable inpaint with a brush like the one in Auto1111
it could be cleaned up quite a lot, but it seems to work
me too. I'm using auto about half the time, but the results just do not look quite as good to me, and its just slower for me
huge mem leak now too as evry single time I loaded refiner my page file grew until my 48gigs of ram was using a 128 gig pagefile
i only use base in auto....the extension to add refiner takes a shit ton of memory
exact same prompt and settings
the second one with your workflow? yeah, that coherence is a lot better
I went to img2img and loaded it there
that is what i prefer with finetunes. thats what i do
besides I just updated auto from feb release and the new one is so sluggish but I had to havce it for sdxl
ALMOST
took the first prompt and basically combined the negative leaving the L blank. then inverted it and sent it through the sampler upside down. so positive prompt in negative. then interrogated that and put it in the negative L prompt of the final image
that image to image in auto is nice....i like to change resolutions there. just not as easy to do all in one place in comfy,
basically had the ai make the opposite of what I want
When you sign up for a space gig but end up on a boat behind a turret
and then tell me what tha tlooks like
and use it as a negative
and you can still put your own negative in neg G
lol
so just put the positive in the negative and negative in positive, create an image
I get it - sounds fascinating. but I don't know the implications 😄
either insanity or genius! looks like the latter in your example 😄
I would do it with a1111 and not once did it make an image worse
every single time it was better
but it was a hassle as well
basically you're getting what the ai thinks is the opposite of your prompt and putting that in the negative. and I have it set up so it's unique for each seed number and all that
it's not super exact, but doesn't need to be
yeah it sounds super interesting. please share more of your findings and examples if you want 😄
I just like what the opposite images are, lol. they're not what you'd think at all
the ol switcharoo eh?
it's fun doing weird stuff like that.
you should also try emojis, leaving things blank, using other languages, and I'm sure there are more ideas
you can also leave the prompt field blank 
Anything will tokenize and convert to weights, even complete gibberish. With no prompt I think you are starting with fixed weights and then the random seed is fully responsible for the image.
Have your cat walk across the keyboard. The options are limitless.
it did make a thing
that's the beauty of it 😉
"its too small"
I'm not saying this has anything to do with it - my workflow with SD2 was using the hires fix to enhance fidelity and details in general.
are you giving your last step kind of a "low detail" image as guidance?
here's the negative image
makes sense 
they're identical settings other than the negative L
and I'm using some junk booro interrogator
so I made some guidance / reference image tests with SDXL. I think I noticed increased fidelity with some things... I know you are doing something different but you are also feeding it an image that it uses as guidance
or anti-guidance?!
well sort of, just using the description of the negative image, although there might be a better way to do it
essentially. just sending the output of the interrogator to the negative L input
since its just a list of words
and then you can still put in your normal negative to be sent to negative G
yeah, that sounds really interesting to explore. you are already showing some very promising results.
I was trying to do aerial urban photography earlier and it is very hard to get the details right - so I can tell you the details in that image are really good
what's cool with this setup is it's dynamic, so it'll change with each seed
nice
so this method obviously takes longer, but I'm thinking I wouldn't really need to run the negative image for that many steps. just enough to make it coherent
I also wonder if having it overlay on the initial output image would be preferable
Alright! Got the upscaling all implemented. No missing wire connections. Time to go watch an episode of a show lol
is there any facedetailer or restoration on it?
or isn't that image part of your research?
lol, nah.made that earlier
lots to explore in this one 😉
so I can say that this workflow doesn't necessarily always make the pictures "better." but they are definitely much more well defined and more true to the prompt
told it to make a lego metropolis and it went full lego
it's like it fills it out
some strange legos there
looks like it emphasized your prompt
yes. the optimal approach might be to have the negative L from the interrogator have variable weight
so less impactful to start but then increase influence through the steps
lots of unknowns and I'm just starting to learn about a lot of these tools. but I've had ideas bouncing around in my head for a while that I had no idea how to implement
yeah same here. that's how you discover things 🙂
heh
General Awareness has arrived
well with a1111 it was a bit more daunting to consider coming up with anything because I'm not much of a programmer. but the nodes are a nice happy medium for me. haven't even had comfy 2 weeks, but learned a bit in that time
Yeah you can build custom workflows without writing code. I enjoy it a lot as well 🙂
So all this hubbub about only using safe tensors is that attackers can't just stuff any ol data into the file format , right?
I wouldn't mind getting back into coding, or just taking it more seriously. took some classes in school, but never really did much with it. but I've been messing with python more lately so who knows
yet safetensors still have metadata in their format, and any dangerous data could also be stuffed into the metadata, so why are safetensors a solution?
this attack would likely occur through image metadata anyways, like it always has in the past since it's not a new attack.
what sort of things are you doing with the noise?
I don't know, bud. I don't make the rules
those images were the same prompt and same seed only the noise was different
how are you modifying it?
changing the offset?
well that wouldn't do that much actually
I'm just curious
i've successfully rola training of realistic character with sdxl 1.0,total steps 3500,10 epochs,the base model with fine tuning is not bad,this is epoch 3,looks like michelle yeoh already
ooh, there's a perlin power fractal? wat?
yep
I don't know what that means but I like it
perlin is a good one
3500 steps?? tjats a lot
used that in cinema4d years ago
something new to learn about. I've never even really heard of the concept before. I know what noise is, but not all these other things
if i'm not mistaken, the only thing the seed changes is the initial noise that's created. so changing the noise algorithm would be the same as changing the seed
still having issues with janky faces and no i2i to fix them
I think this workflow I'm messing with would help with face clarity. probably not 100 percent of the time, but mostly
yes,sdxl 1.0 base model you can set it to 9999 total steps as well,will not overcooked
actually this package has a very advanced face detection method https://github.com/ltdrdata/ComfyUI-Impact-Pack
it's huge, but my first results are looking promising. It has lots of settings... but it's the most advanced face fixer I've worked with yet.
I started to look into it since I have so many images that are interesting but the face makes them hard to pick
#✨|sdxl message sytan's workflow has a 2x i2i fix to see how this works. i've also added the facedetailer nodes to it which gives that specific face pass
that face detailer just seems like a lot, lol
i feel like that is just a long time for a character model. i can successfully bake a character in in about 300 steps probably with the right settings. how long did it take?
thats what i've put into what i've linked
As I understand it, .ckpt can contain code but .safetensors can only contain non-executable metadata. So it would be much harder to get that metadata to execute, as compared with code that is designed to execute.
it would be non executing data in either format. the attack is another process uses that extra data in it's execution
I haven't used a face fixer since 1.5 days so since 2.0 hit. I just i2i and done
takes around 2-3 hours,i just tested for different parameters and choose an appropriate config settings file,what's you dataset image number and batch size,repeats etc?
facedetailer is an i2i pass specifically masked onto the face
Probably need to put together a POC and send it to the developers so they can make .safertensors.
yeah I don't like the look of codeformer or gpfgan at all. I never use it. but this works differently since you can use any model for face restoration. it doesn't produce generic faces like codeformer etc. but still testing.
you will form a warp bubble soon
bro, are those all node paths? that thing is going to explode, lol
lol yeah
law of diminishing returns is the thing though, or in this case, return 0. a new file format doesn't prevent extra data, never will. a newer one couldn't do it either.
and your computer hasn't fallen over dead?
what's your fps :)?
jobs estimated to take about 30-35 min
Shhhh
lol i actually get a stable full screen video player on streaming services even with this 5120x1440 monitor
36 images, 7 epochs, 7 repeats, tagged of course, batch size 3, learn rate 0.002, adamw optimizer. 588 steps. learned the character very well. but take my knowledge with a grain of salt cuz i dont know much about loras i just feel like thats a lot of steps ya know?
but if I move my mouse around and it has to load the interface things get sketchy
without the video player everything is fine
except like dragging and dropping nodes/wires
during upscaling
are you running an a100?
4080 local
I was reading about the h100 earlier. those seem pretty legit
yeah but what's your ComfyUI FPS (fps are displayed in the bottom left corner)? 😄 Chrome based browsers give you definitely a better framerate compared to Firefox.
oh one sec
I mean if it doesn't feel sluggish - that's good
but mine started to lag a bit - not as fluid
number repeats should be at least 50,then it will show up more details,when batch size increase,txt2img results in higher weights becoming less saturated,but it take more training epochs to learn the target
mines 120 (monitor framerate) 😛
its hovering around 120
which is great until someone renames the Node to suit their workflow and then someone downloads that image , tries running and asks "where is this node from" ;o)
hmm interesting, ill try that out next time i train! but then again sdxl is wayy different at training then 1.5 and maybe the repeats arnt as needed?
ok so my node setup does kill the fps - and it's not even a lot of stuff compared to others. time to investigate 🙂 thx
good luck!
kittehhhh
I question whether I'll ever really understand how all these different samplers differ
thanks. other workflows do run with 120fps... so lets delete some nodes and see who's the culprit
The only way I'll ever "know" is if I build a workflow to let me test quickly every time I forget.
It's probably not SD Ultimate Upscale. That's the only non-default node I think I'm using
well with x/y plotting like in a1111 you could do a lot of parameter testing heh quickly
I'm going to see how much dpmpp_2m_sde and dpmpp_2m_sde_gpu differ
I think comfy can do that as well.
it can - but compare how it needs to be set up. not the same. but it works of course
I haven't loaded a1111 for a while. really not reasonable to run them both on my little barely sufficient video card, lol
I'm Commander Shepard, and dpmpp_sde_gpu karras is my favorite sampler in ComfyUI
I really thought it was decent until I found stable diffusion
omg no not that "curator"
I just pick whichever name is the longest
because that's usually the best once
whoa, dpmpp_2m_sde_gpu kind of deepfrying
f@ck ddim, too short
well you need to adjust your settings of course
yes, there have to be at leats 5 random sequences of numbers and letters for me to actually want to use it
so far haven't seen any advantage in the _gpu samplers though
I rarely use non converging samplers
heun is good for things like landscapes, but it's not all that exciting. I don't know how most of them even perform
even though they seem to work better on SDXL
I just like the noise more - probably placebo and voodoo thinking but I've been using it forever in a1111 etc
so are the converging samplers the ones that just get more clarity over time but don't more like euler a?
_SDE and _A are the weird guys, yes... where each iteration is different from the previous
it's not an area I've focused on much thus far
16 images at 2704x1544
goodness. it better be a banger
It's intended to be a production ready workflow for prompts ive tested in a smaller environment
when doing img2img I find "constant" samplers are better
But this was a trial run to ensure it all went smoothly
what are you doing exactly?
you made those?
they're definitely very clear and realistic looking
Goal!
but why so many node paths though? I still don't understand
because of the last few days and you posting images, I'm thinking I've subscribed to Nido's travel blog 😉 very nice
good lord. that'd take me a couple days
I need to get comfy working on colab again so I can do more crazy things with sdxl
nice. well I like really intricate things so it's right up my alley. just need to upgrade my vram situation at some point. not sure what the most cost effective method would be though
might actually be cheaper to just use colab honestly
but it kind of sucks
are you doing anything to the faces or is fixed through your sampler setup?
I have to try not to be salty about the efficacy of simply renting a google colab server vs owning local hardware but I am ultimately glad people without the hardware have options.
Renting A100s aint a bad life
truth
cant wait for that btw
Nothing manual, all through the setup.
I am anxiously awaiting controlnet for sdxl workflow though
sometimes I feel people just convince themself that they don't like or need what they don't understand, because everything has to be simple and god forbid educating themselves
The same people who will downvote you for not directly linking to a copy-pastable workflow that robs them of the educational experience they think they don't want
also anime boobs
I bet its a lot of anime centric folk complaining ftmp
least saturated node in blender
given I have a 3060 12gb... I don't find SDXL slow considering it's generating at 1024... it's like 4x images in 1.5
all this for a generative bowl of cheerios? ||jk||
People complaining that the newest tech doesn't run on their fridge
It wont run on my 3DS, I dont want it
unless it gets Celeron support I'm not interested
my nokia 6210 runs it just fine
dude when you are frantically jerking off on celebrity fanart p0rn those images have to come out fast!
oldschool collages
I think the user group with the biggest complains also prompts for the biggest boobs
they took a weird turn
it was like this when I found it
im tired of having to move around the thousands of strands of hair that is blender nodes just to render 10seconds of looped 3d porn,i can finally move away from that installs comfyui
I have seen many many millions of images in my life, and your images trick my brain quite often thinking at first glance they are real photographs. well done 😉
yeah the only thing that gives it away is
have y'all seen this yet: https://www.reddit.com/r/StableDiffusion/comments/15hag5s/sargezt_has_published_the_first_batch_of/?utm_source=share&utm_medium=web2x&context=3
can we use it in comfy?
yeah I downloaded a couple of them but too lazy try to 😄
yo - the QoL feature package https://github.com/failfa-st/failfast-comfyui-extensions by Pixelass added an option to hide all links / cables
Thank you so much!!!
That’s right. I’m just gonna reinstall then maybe that’s a fix for it
For real?
https://huggingface.co/SargeZT
that's pretty good
Good morning!
yeah the quality doesn't seem great at the moment
yeah but the examples look pretty poor 😦
Hi! Good morning 🙂
OMG each control net is 5gb
As he is an individual it might not be the best, but still he has room to make them better
Something is better than nothing
The reason I went through the hassle of setting up all those nodes is easier to see with this prompt in particular. I wanted a greater variety within the same theme from one prompt.
Each image varies slightly from its paired image, and each pair of images varies more from others the earlier the stage they separated from was
They are gonna prune ig, still waiting for them to release
they're diffusers too. it works good enough when i get it going in comfy ui but isn't super reliable. and i also can't get the annotators loading in comfy so well
absolutely and i really appreciate anyone pushing this technology forward
comfy's internal version of cnet isn't just pruning. whole new architecture so that you can use many without bogging xl down hard
(sorry I replied to you, it was meant for anybody 🙂 )
that controlnet in comfy? heh
lol nah just my neuroticism
train a lora with comfyui workflows
just use kohya instead
tools for what they're specialized for
it.. was... a joke
like take screenshots of comfyui workflows and train a lora with them
oh shit
that's genius
I thought you meant use the nodes to train a lora, and I got a good laugh
Yeah I have seen it, they are brilliant, but I don't think they would be out anywhere soon
stop killing unicorns!
-I was able to fix it. I had to extract the folder in in the modded_nodes one.
Yeah Joe said that they are moving to New arch
Are we part of this unicorn?
are pictures with sunglasses on a problem for training loras?
nope. one or two in the set are fine. caption "sunglasses" on them too. should work well
cool, thanks
Thats blender 
u right,if it was comfy it would have more nodes 🤠
who's a stable dog?!
when in doubt, drag from the node you want and pick primitive node under "util"
aside from that, break it til it works
i dont think rip. auto can make moves and keep things relevant yet. i think theres a ton of benefits to auto for new users still.
oh i have no idea
professional and prohobbyist workflows will move away from tools like auto though
