#✨|sdxl
Someone gotta train sdxl on loads of backrooms and liminal space images

im sure a lora will come out with it. i do like the look of pool rooms as i have done a few in blender so maybe ill train a lora for it 😛
Sdxl 
I'd love someone to train a deep focus LoRA.
1.5 never gets the poolrooms right
they used the laion dataset, so if those images are in there then it has seen them, otherwise no

do you want any specific lora?
im sure sdxl could make some pretty awesome ones. ive already generated a few and it did surprisingly well
split depth lora for trippy faux 3d animations..hmm i might try that
a generalist LoRA that's trained on deep focus photographs.
Yo guitarman, everyones beating around the bush, sdxl coming out today/tomorrow?
like landscape photography?
i'm still waiting for SD to learn what a razor blade is. not a disposable razor, not a straight razor, just a regular old iconic blade. For art. And also science. ;_;
the joke is to beat around the bush lol
it'll come out when Emad finishes playing Tears of the Kingdom
Next game of civ
can we get a guide on meaningful use of TE during LoRA training? cause the results of that are just chaotic unless you do TE first and then freeze it
How did they turn out?
Pretty sure the stage tomorrow is a good estimate lol
||<t:1690394400:R>||
The mystery is controlnet when
Great idea, we should ask @tiny walrus to help make one
could be! or, oo, I wonder how well SDXL knows split diopter
this kinda look
spielberg loves 'em
when you stare at it too long it looks like the foremost guy is photoshopped in lol
hot damn its the soggy bottom boys
Chilling in Chile eating chilli while it’s chilly
Tbh they both look photoshopped in or greenscreened
i think a model would be better
if you can suggest the settings, i would love to collect the dataset and train on my machine
nah.
can sd generate "magic eye" pictures, like the ones that used to be all over the mall in the 90s? because thats something the world needs.
you just want to add depth effect?
Some of the LoRA types have more params than the original SD 1.5 model 🤯

i got this with a film still from a movie of 2 men split diopter style
deep focus images, yeah.
ooooo
nice
🤷♂️
I'm just looking forward to the LoRAs that achieve finetune levels of changes in under 100mb
I dont mind a chonky lora as long as it can be trained on a 24gb card
In the future we will laugh at sdxl
I mean, you can train the full model on a 24GB card.
tbf, they dont need to be chonky.
I managed to fit over 100 concepts, fully learned, nothing overfit, into 43mb
just gunna one-up you here... 
wow, kaj, yknow what would be even better than one clean UI over comfy dropping soon?
agreed, sdxl should make perfect cats all the time..
I doubt I can on windows tbh
but wouldn't that do more damage than good?
considering batch size of 1. Unless something changed about finetuning itself, which I'm not aware of
Almost 2 years ago…
why no grad accum?
no no no we need BEYOND perfect cats, so those little nerds can get body image issues like the rest of us

That's sdxl 1?
If it's from 2 years ago, then no it's not lol
It’s from mindalle anyone remember that?
Can do--I was doing an exploratory one the other night. Totally feasible
aww
Kali, got any cool 1.0 pics?
so yeah it doesnt know split field diopter very well no matter how i prompt haha
make a miku lora 🙏
Anyone got nice car drifting pics
Love this one
because I completely forgot that that exists XD
good point 👍
yessir, comin' up
GA and Batch Size is like Peanut Butter and Jelly
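A minimal sketch of why gradient accumulation (GA) pairs with batch size: summing scaled gradients from small micro-batches gives the same gradient as one large batch, so a batch size of 1 with 4 accumulation steps behaves like a batch of 4. The toy model, data, and function names are illustrative assumptions, not taken from kohya or any other trainer.

```python
# Gradient accumulation sketch in plain Python (no framework assumed).
# Toy model: y_hat = w * x, loss: mean squared error over the batch.

def batch_grad(w, xs, ys):
    """Gradient of the mean squared error w.r.t. w for one batch."""
    n = len(xs)
    return sum(2.0 * (w * x - y) * x for x, y in zip(xs, ys)) / n

def accumulated_grad(w, xs, ys, micro_batch_size):
    """Same gradient, built from micro-batches that each fit in memory."""
    assert len(xs) % micro_batch_size == 0, "sketch assumes equal-sized micro-batches"
    steps = len(xs) // micro_batch_size
    total = 0.0
    for i in range(steps):
        lo, hi = i * micro_batch_size, (i + 1) * micro_batch_size
        # Scale each micro-batch gradient by 1/steps so the sum is a mean.
        total += batch_grad(w, xs[lo:hi], ys[lo:hi]) / steps
    return total

xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.1, 5.9, 8.2]
w = 0.5

full = batch_grad(w, xs, ys)                              # batch size 4, no accumulation
accum = accumulated_grad(w, xs, ys, micro_batch_size=1)   # batch size 1, 4 accumulation steps
print(abs(full - accum) < 1e-9)
```

The optimizer step happens once per accumulated gradient, so the effective batch size is micro-batch size times accumulation steps, at the memory cost of the micro-batch only.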
How do you enable this with kohya
oh, that I'm not sure.
I am also not sure fwiw
Idk if there are other trainers, psuedo said he was working on one but he implied you can only do batch size 1 on a 3090...
Waifu uses the official SAI code?
Or just their own trainer
Thoughts on a1111 becoming obsolete?
I highly doubt it will become obsolete
for Waifu Diffusion XL they used a modified version of kohya's trainer afaik
Overshadowed? Perhaps.
I thought Kohya-ss had grad accumulation. There is at least a setting for it.
re training and batch sizes and all: use lora lol
can set batch higher with lora
and like... why would you not use lora when lora is an option? Excluding the case of full heavy model retunes like waifu
not to be confused with --gradient_checkpointing
Anything where you'd expect to be able to do it at home anyway, is better done with LoRA
lora was already a better option usually on SDv1, it's much more so on SDXL
(both because the base model is a harder target to train, and because loras trained on it are more powerful)
tbf, I still wouldn't finetune on my 4090. just my high end lora will run a total of 30 hours
and that's a mere 5k dataset
I wouldnt mind if it takes a week lol I have two computers now anyway
at that point move to runpod and get that 8stack of A100s
I like full models, idk exactly how loras work
don't forget that 1.5 lora =/= sdxl lora
With full models I feel more comfortable doing merges and stuff, not sure how that works with lora
We do not need 100s of identical merges.
How is it different
i thought they were trying to say that sdxl loras are much more powerful and would basically eliminate the need for whole finetuned models unless its a really big change?
long term it's likely, short term probably not
I mean I have my own datasets with art styles that are not super well understood by the base model imo
Or community finetunes
would it be possible for each node to show a percentage count thingy
But I would still like to merge with other community finetunes
I think there are going to be a lot of great UIs for SD and they'll all have their places with their respective pros and cons.
The biggest thing holding back Auto1111 is the fact it's built on Gradio
LoRAs are perfect for this.
the SD userbase should migrate to Comfy and InvokeAI. that is all
I've yet to see anything better than Gradio
seconded
How do loras work with supermerger?

Id rather just use full models which I vaguely understand how they work
What do you mean? You can use old LoRas with sdxl?
You don't merge them.
Yeah but I want layer specific merging
it's not gradio but the code quality that's going to make adding new features more and more difficult
And Gradio's inbuilt --share is really nice.
output folder is just full of mangled kitties after a debugging session
Yeah, the code quality is a mess but it looks like Auto1111 has become a better developer and has long stopped pushing unstable commits to master.
Oh I know that one first hand! Even with a rewrite though Gradio is tougher to extend as people are going to want more functionality.
poor kitties 
a1111 didn't even have a repo license till somewhat recently. I don't think he ever intended it to blow up, but since his was the first "extensible" UI it just kinda did
you need to diffuse them to save em
the theory of how a lora works, didn't change at all.
but sdxl model has way superior clip models + base weights and more parameters.
therefore lora can:
A.) make use of those extra params
B.) concepts already exist, you're just improving & bringing out existing hard to prompt for information
lora can't:
• Turn sdxl into waifudiffusion xl
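A rough sketch of the LoRA idea discussed above: the base weight W stays frozen and a low-rank product B·A is added on top, scaled by alpha/rank. The layer width of 1280 and the tiny 2x2 matrices are made-up numbers for illustration, and the helper names are hypothetical.

```python
# Plain-Python sketch of a LoRA update; no ML framework assumed.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def merge_lora(W, B, A, alpha, rank):
    """Return W + (alpha / rank) * (B @ A), i.e. the LoRA merged into the base weight."""
    scale = alpha / rank
    delta = matmul(B, A)
    return [[w + scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(W, delta)]

def lora_param_count(d_in, d_out, rank):
    """B is d_out x rank and A is rank x d_in, so the adapter has rank * (d_in + d_out) weights."""
    return rank * (d_in + d_out)

# Tiny worked example: 2x2 base weight, rank-1 adapter.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]
A = [[3.0, 4.0]]
merged = merge_lora(W, B, A, alpha=1.0, rank=1)

# Size comparison for a hypothetical 1280x1280 layer at rank 8.
full = 1280 * 1280
adapter = lora_param_count(1280, 1280, rank=8)
print(merged, adapter, full)
```

Because the adapter only stores B and A, its size grows with rank rather than with the full layer size, which is why a low-rank LoRA can stay small while still steering a large base model.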
there's an obvious solution to everything. rewrite in rust
Problem with rewriting it in rust is that the people making extensions don't know rust.
on it..
I guess the main feature I want a full finetune for is layer specific merging with other finetunes
my rocm 5.6 benchmarking kitties are worse lol
Sure you can merge a LoRA into the base model but that defeats the point.
oh really? I didn't know that
big ol floof mess of kitties
Take Auto1111 frontend and put it on ComfyUI backend and it's perfect.
oh, btw... um. how BIG is sdxl? im sure i wont need a new drive, but... will i need a new drive? lol
til too
About 18.5 GB with base + refiner.
the a1111 frontend isn't very good in my opinion
you probably could tbh. All the rust UIs have wasm/web support
The auto1111 frontend is still the best out there.
but previously, we needed finetunes because the model genuinely couldn't produce certain images/artstyles/photos due to a lack of information & parameters
sdxl doesn't lack this anymore. sure it has its issues, but not on a level where you need a finetune for 95% of situations
This was done recently kinda, but you still have all the same overhead from Auto1111 which doesn't really give you the true benefit of Comfy
ahh, thanks. not too bad, i have plenty of room for it
Not a true Front only with Comfy back
a1111 frontend isn't maintainable and actually doesn't lend itself to a good iterative workflow believe it or not
and there's an egui extension that adds node graphs. could rewrite comfy in rust
I havent tested a full finetune with it yet but the lora with my favorite dataset was kind of underwhelming
i got comfybox working last night. i might try to put together an sdxl ui in it for funsies
I've tried doing iterative workflows with ComfyUI and it is terrible for that.
Maybe because it is only rank 8
and you're sure it wasn't a lack of lora training settings?
can I drop this tweet? I think I can drop this tweet, why not... https://twitter.com/kaj718/status/1683985659172892672
It could be
How big was your dataset, and can you post the full captions of a single image as a sample?
Is this the project mcmonkey has been talking about?
he's probably talking about a different thing actually
that a native UI or is it a web framework
yes
Maybe but I dont want to share the artist I'm using lol
I'm optimistic to whatever mcmonkey is hinting towards.
will definitely be using that ui heck yeah. hope it has inpainting implemented too at some point
replace artist with <artist> or something. just want to get a feel for your captioning style
is 1.0 good at generating faces when in distance?
better than 0.9. that's for sure
this is awesome
okay 99% its a web framework.
or you just really like material design
it's react, running on a native webview using tauri
its open source, you can take a peek
damn you actually did rewrite it in rust
that's right, two new clean UIs over comfy lmao
I think I cry. fiiiiinnnnaaaalllllllyyyyyy a ui that doesn't remove the results after queuing more
@wicked frigate ❤️
oh wait. is this the work of kaj? O:
i was waiting to infodump about my one til tomorrow's stage event
yeah there's still some more cool stuff to come
kaj is building the super duper easy/beginner friendly StableStudio integration for comfy, I'm building a more poweruser oriented thing
that is the best ui ever and cant wait to try it out on 1.0 . i like comfy;s nodes, i like invokes style inpaint, and ease of use like a1111. BEST DAY EVER
are you using libtorch through ffi or is the backend still python
in that case. I take my love away from mcmonkey and give it all to @knotty lotus
❤️
no rust but we're all moving away from "python everywhere all the time"

no it's a forked version of comfyUI, same backend as everyone else, means you can use your same custom nodes
alright fine you don't get my one then
😭
Tauri is rust I thought
mcmonkey: Does yours support prompt-editing and x/y/z/...?
Tauri has rust stuff but coding is usually typescript
tauri just serves static files in a native webview and manages IPC/events/etc
you are
XYZ and all the other letters too yes lol
just make sure you sync your fork with the latest code
and kaj and the other engineers!
if its bad at distant images, then it would make no sense to train a lora on cinematic photography
ooh!! ooh!!
there's some fixes that are important if you want it to work on older GPUs
I'm either sticking with Python forever, or moving back to assembly.
this took a couple of seeds without controlnet or img2img heh
if I had time I would rewrite comfyui in C
Let's just say @wicked frigate 's can do a/b/c/d/e/f/g/h/i/j/k/l/m/n/o/p/q/r/s/t/u/v/w/x/y/z and depending on your setup VERY fast too
i'll teach you C# myself
and you'll ignore me and use ChatGPT to write it for you
plz no.
lol
i'm a mac boi.

we tortured the bots a bit, to test distant faces. results were significantly better than expected. I'd give it the highest score of any AI image generator that currently exists
Plot twist: They specifically trained a bunch of images of text that says SDXL
AND I NEED A MAC DEV TO ENSURE COMPAT IS PERFECT thank you for volunteering
are you sure? 
btw, saw your reddit post! Try ✨[Prompt/style]✨ instead of ~~[Prompt/style]~*~ *
and lmk what you think
you don't read your DMs apparently
should use stableLM instead
😅
50% chance of very rich programmer / 50% chance of did his fair share of game/app hacking XD
That's their next release, they snuck in SDXXL images in the training data to build up hype
that'd make custom nodes super annoying unless you included a whole scripting language interpreter in it
@hard fractal i have the image links of entire 1x.com, so if someone's interested I can share them, do you want them?
Cool, that you saw it 🙂 Thank you again for sharing your findings. I guess discord somehow changed the syntax. I will try it out for sure!
at least I wouldn't have the memory management issues of pytorch + python
SDXXL when?
yeah, I've done some emoji experiments back when SD 1.4 was in beta heh
What about SDXS?

I'm jp
a1111 changed the default allocator to tcmalloc a while ago which supposedly helps a bit
they even worked with VQGAN. an under-explored universe in my opinion 🙂
no dont think of it,
why to release a mini ver, SDXL can run on 6gb cards(probably) so is there a point to get SDXS
I don't understand how that would help
supposedly fixed their memory leaking problem
I think it just delayed it a bit
July 18th
If there is a user friendly front end maybe I will release my crazy animation workflow
SDXXXVIII is the one with janetjackson's nip slip
the band aid treatment is starting to become impressive XD
I really like the detail in this one. great images!
I don't even think it fully fixed it, I think it just delayed it long enough that it's not a massive issue anymore
starting to look like a final fantasy series game
I doubt regular malloc has memory leak bugs so switching to another one won't fix anything
it's probably one of the most tested functions
maybe the romans did have a real oracle that was a bunch of guys abacussing really hard
and malloc doesn't have anything to do with python or pytorch memory management
1it/month
with enough abacuses, you can simulate the whole world
sounds like a monty python skit
I don't think an abacus is turing complete is it?
it is with a human attached
If you can create a nand gate with an abacus it is turing complete
Funny enough? We do have one of those, haha
SD2049
lil baby model
Are you going to release it SDXS?
Is that the famous 30 FPS model
fair
That's the famous distilled model
Is the refiner a 512 model?
is the hype train over? i needa go to bed lmao

We'd have to give you all a couple days warning before putting it up on the bot 
People are like "he STILL HASNT RELEASED IT"
Like... ya. You think distilled 1.5 base was any good?
go sleep! XD
that's just what I read in this issue which caused them to set LD_PRELOAD to the system's tcmalloc dylib
https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/9323
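For reference, a minimal sketch of what setting LD_PRELOAD for a child process looks like from Python on Linux. The library path and the launch command in the comment are assumptions for illustration; the real tcmalloc location depends on the distro, and this is not the webui's own launcher code.

```python
import os

def env_with_preload(lib_path, base_env=None):
    """Return a copy of the environment with lib_path prepended to LD_PRELOAD."""
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("LD_PRELOAD", "")
    # LD_PRELOAD entries may be separated by colons (or spaces).
    env["LD_PRELOAD"] = f"{lib_path}:{existing}" if existing else lib_path
    return env

# Hypothetical usage (path and script name are assumptions):
#   import subprocess
#   subprocess.run(["python", "launch.py"],
#                  env=env_with_preload("/usr/lib/x86_64-linux-gnu/libtcmalloc.so.4"))
```

The preloaded allocator replaces malloc/free for the whole child process, which can change how memory is reused but does not by itself fix a leak in application code.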
Now, distilled SDXL 👀👀👀👀
are you really sure (sure)?
🚂 🚃 🚃 🚃
😅

Distilled SDXL 30fps 3D hologram video will be out July 18
SDXL distilled 
hey, thats when controlnet drops
SdXL
i think i am, ill see yall tomorrow when sdxl, maybe, possibly, probably, releases. even though ill be at work im definitely tuning into the super stage..
holograms are actually a really cool physical phenomenon and 3d projections aren't holograms
Isn't it about 14 hours till release?
Release of what?
So you can sleep with good conscience.
🤓
any estimate when control net would release?
July 18
Release of SD 3.0 of course, SDXL has just been a distraction from the real truth
In the year 20xx
it probably helps if tcmalloc is more efficient with memory with lots of small allocs and frees but it's not a real fix
We all expect SDXL but instead we get something even better.
but in reality, no - we'd rather reinvent ControlNets
Why reinvent it when it already works well?
@visual glade had an INSANELY BRILLIANT idea
because thats how tech improves
I posted about why on Reddit
Sure reinvent it eventually if you can do it better, but the original is still good.
That sounds like it will take a while
@hard fractal a simple question, how do controlnets get trained? sd is trained on images and captions, so how are they trained?
original would make a 1.3 billion param model for every single ControlNet
SDXXX is the one with VinDeisel an IceCube
That's bigger than SD1.5
controlnets are not very efficient, they can be more efficient
I wouldn't mind
There are too many controlnets, most of them are not really needed.
I hope someone trains controlnets other than SAI then
the various models i don't think are the issue at hand
Then you're free to train one -- id imagine some people will from the community, the code is out there
Most CN are trained with the preprocessor image along with the original image and caption to teach the model what it should do
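A small sketch of that training data layout: each sample is a triple of original image, preprocessor output, and caption. It assumes the conditioning image shares the original's filename in a separate folder; the folder names and the class name are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class ControlNetSample:
    target_image: str        # the original image the model should learn to produce
    conditioning_image: str  # preprocessor output (e.g. pose skeleton, edge map)
    caption: str             # the text prompt paired with the original image

def build_samples(captions, image_dir="images", cond_dir="conditioning"):
    """Pair every captioned image with the conditioning image of the same filename."""
    samples = []
    for filename, caption in sorted(captions.items()):
        samples.append(ControlNetSample(
            target_image=f"{image_dir}/{filename}",
            conditioning_image=f"{cond_dir}/{filename}",
            caption=caption,
        ))
    return samples

samples = build_samples({
    "b.png": "a man waving on a beach",
    "a.png": "two people dancing in a studio",
})
print(samples[0])
```

Generating the conditioning images is the hard part in practice, since each original has to be run through whatever preprocessor the ControlNet is meant to follow.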
<t:1690394400:R>
anyone can. the OG were all made on the author's 3090
But, we care about normal folks running it.
I tried looking into how a1111 is able to eat 60 gigs of ram but their backend is split into so many weird semi-abstraction things that I have no idea where to begin with it. Think it'd actually be faster to just write a new gradio UI than trying to fix it.
How do you get Auto1111 to eat 60 GB RAM?
Feeding XL outputs directly into img2img animate diff in comfy.
XY grid with different models
You probably have a lot of caching turned on in Auto1111
nope
I'm set for 2 models and 2 vaes
Time for C0NTR0LN3XT
StableControl
My whole workflow is gonna be on hold since I hardcoded in the 1024x image size and there's no controlnets for SDXL 😢
ComptrollNet
StableNets
Joe, can we get few more preview pics of the control net?
any pose pictures?
Are you guys looking at something like this for controlnet?
https://canqin001.github.io/UniControl-Page/
No.
You just have the line Controlnet?
I hope there will be a good model for facial expressions
Not a fan of it anyway so I don't care too much that you're not releasing it.
ahhh, you are still figuring it out.
so it wouldn't be released anywhere under a month
We really want to innovate and provide new amazing stuff for everyone
How does SDXL scale with controlnets? 1.5 every layer of controlnet had a pretty substantial performance hit.
a shame might be harsh language. 2.0 certainly didn't have controlnet anything at release
It had a depth img2img model
@hard fractal will you release like a 100 image dataset sample with your training captions for both clips attached?
2.0's depth awareness is 256x256 pixels iirc
Which was fine, it was a good model
it's neat but experimental and kind of, meh
Really? I thought it was solid
It was attached to 2.0 which meant I never used it.
seems a weird thing to shame them over
I mean it's just a step backwards. And I'm not shaming anyone, just saying it's a shame there won't be controlnets ready on release when Emad implied there would be a while back
It's because we took a different step forward
they trained a new clip model?
We had them
apparently you can disarm the a1111 memory bomb with some cursed super user commands
How do you know that you're actually taking a step forward without an original version to compare against?
What's the harm in releasing if there's going to be community weights anyway
Ok, well I'll trust the community then lol. Hope the SAI version is good too
if i knew how to make useable controlnet datasets, i'd do some training myself
The info will be out there and you definitely can!
what is cr?
iirc either the controlnet github or the a1111 controlnet extension github has instructions on training your own controlnet model
There's a guide on how to do it, obviously generating the dataset could be difficult depending on what you're training
custom node looks like
i imagine it's a lot of image pairs. photo and a corresponding openpose read on it
yeah i've looked into it. doesn't seem like it'll take much to work out
can't wait! well i guess we'll have to, but i don't wanna!
oh shit, how did I not see this already being released
https://github.com/Stability-AI/StableStudio/tree/tauri
where can I find that custom node?
thought it was just a preview
if you want to know what's up in the world of ComfyUI just bookmark this url - everything mentioning comfyui on github sorted chronologically: https://github.com/search?o=desc&p=1&q=ComfyUI&s=updated&type=Repositories
its not what i thought it was. it's a front end for stability's online services. open sourced
It's just been kinda sitting there... being quiet 🙃
Take a look at that specific branch though
you can connect to localhost
Can you post an image that you made with your setup... then I can just drag it into the UI
image embed meta only has the workflow, not any custom nodes
if you already have the node ig this would work then
looks exactly the same to me. it generates locally now?
I just need to put your two python files into the custom nodes folder right?
yea
😏
damnit XD lemme use my existing installlll
@knotty lotus ^
guess you gotta build that tree yourself? the actual release page takes me back to main branch. i'll figure it out
symlink everything
Yup Just tried it out, wonderful app
told it to install to my H drive and it just downloaded 5+ gigs to my C drive -_-
I worked too long in IT to ever use symlinks
why even ask me where to put a 22mb exe file if you are just gunna fill up my C anyways?
what does that even mean
because thats where appdata lives. cant you move your user folder to a drive you prefer if you want control over c like that?
symlinks only hurt if you're trying to sort out someone else's
yes, there is a folder called appdata on my C, which a lot of programs use for small files. If that was where the entire FREAKING PROGRAM was meant to go, then program files wouldnt exist 😛
no longer an issue - but in the past many apps (and multiple versions of windows even) would never end some routines due to a never ending browser agent
also other reasons
appdata, yea. you seem pretty upset about it but it's being used as designed
I took the rich alternative, and got a 4TB SSD just for AI
anything that walks directories without a maximum depth safety is straight up insane and should be locked away
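A minimal sketch of a directory walk with that kind of safety: it stops descending past a maximum depth and never follows symlinks, so a link loop cannot make it run forever. The function name and the demo folder layout are made up for illustration.

```python
import os
import tempfile

def walk_limited(root, max_depth):
    """Yield file paths under root, descending at most max_depth levels, skipping symlinks."""
    stack = [(root, 0)]
    while stack:
        path, depth = stack.pop()
        try:
            with os.scandir(path) as entries:
                for entry in entries:
                    if entry.is_dir(follow_symlinks=False):
                        # Only queue subdirectories while under the depth limit.
                        if depth < max_depth:
                            stack.append((entry.path, depth + 1))
                    elif entry.is_file(follow_symlinks=False):
                        yield entry.path
        except OSError:
            continue  # unreadable directory: skip it instead of crashing

# Demo on a throwaway tree: top.txt, a/mid.txt, a/b/deep.txt
with tempfile.TemporaryDirectory() as root:
    os.makedirs(os.path.join(root, "a", "b"))
    for rel in ("top.txt", "a/mid.txt", "a/b/deep.txt"):
        open(os.path.join(root, *rel.split("/")), "w").close()
    found = sorted(os.path.basename(p) for p in walk_limited(root, max_depth=1))
print(found)
```

With `max_depth=1` the walk covers the root and its direct subfolders only, so `a/b/deep.txt` is never visited.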
that is in fact, not the point of appdata. its for stuff like settings, configs, small tools a program used. not the program
if 95% of your program is in appdata, you failed the point of appdata lol
minecraft?
Where do I get the SD vae.safetensors?
when did I ever talk about minecraft
one of the most successful games in history and is 100% installed in appdata
I did this too but python and huggingface cache, appdata, etc bloats up my boot drive anyway
that's why I like portable standalones
and oblivion on the xbox restarted the whole fucking system sometimes when loading... doesnt mean its right lol
that took weirdly long but yes yes it does work in colab
yup. you find work arounds instead of blaming the developers. layer 8 problems
official SDXL vae is on huggingface. Alternatively you can just hook the vae decoder straight into the Base Checkpoint vae
I just like having my VAEs separate
will it let us use existing comfy installs? please
yes yes it does
❤️
classic overheating issue. people would keep their xboxs in an unventilated cubby hole or in the sunshine
i've been working on it half the time with a hacked internal version of comfy so it kinda had to work with existing installs from the start lol
now I wonder... am i redownloading 0.9 weights? 🤣
not the overheating you're used to either. this was the psu tripping in the xbox
no it wasnt, it was intentional because of memory use
just make sure it also works with the public version
cause this is taking a lot longer than it should
it does, i got it fully running out of the box on an empty windows 11 VM by autodownloading the official public comfy ref
appdata on windows is where you're supposed to install software that isn't system-wide. Not sure what the issue is. It's like linux' ~/.local/share/
nice, should be good then
Yeah probably to some mystery hidden folder on your boot drive that only power users know where to find lol
So annoying
there's only a couple games on the xbox that caused memory leaks. and only a couple versions of those games. oblivion wasn't one of them
I mean, Xbox 360 would overheat even if you kept it completely clean and in open air. They all needed a reflow too because of the red ring of death
did i mention stablestudio is best for beginners and my ui is better for the experienced powerusers lol
https://www.gamesradar.com/the-elder-scrolls-3-morrowind-restarts-your-xbox-and-microsoft-showed-bethesda-how-to-do-it/ phil spencer lied huh?
if you ever planned on having a leak, now would be a good time XD
hacking and repairing og xboxs was one of my side hustles and i cleaned about $30-50k from fixing them up over the years
Did you do PS optical drives too?
and yes I realize this is morrowind and not oblivion, I dont play bethesda games so I got the names mixed up, but point still stands
bro that wasn't oblivion and it was a literal software call to a reset function.
less so.
" if you're running low on memory you can reboot the original Xbox and the user can't tell."
read the article, use your eyes 😛
only because i knew a lot less people with ps2s in my region
don't tell SD UI devs about this trick....
"SD UI devs HATE this one trick..."
ssssh, stage event literally like 13 hours away
too bad they already know it. webui 1.4 literally added a "restart" button to the settings page.
Yeah then maybe you'll get more info
but I'll be DMing a DND session 😭 I wont be there until the event is over
specifically intended to clear the memory leak
lmao
did you read the article? it wasn't ever while loading. it was while playing the game. geeze.
auto's working on an actual fix for mem issues
"sometimes you get a very long load, that's us rebooting the Xbox"
talk about misrepresenting the problem
hmmm 😛
@visual glade possible to have a "x" "y" tiled latent node so we could create seamless images with XL in comfyUI? Like the Tiled box in A1111
tiled never worked well for me which is why I didn't implement it
i just still don't understand the original point. that your c drive is small?
what is a1111 going to do to try to fix the mem issues?
tiled is super useful for certain niche cases, eg generating textures to put on 3d models
I said it shouldnt be dumping the program into the appdata as that is not the point of appdata, you said but minecraft does it so thats ok, and I pointed out other programs in the past do fucky things. Doesnt mean its the right way to do it xD
I have some ideas to lower comfyui memory usage by a bit more but I would have to have it implemented a week ago to have it properly tested and ready for 1.0
@knotty lotus
it ignores the location I gave it for the install :/
this is a real issue, as that is a lot of space for the C drive. Especially for non-power users
Ayyy, just like I was saying
oh i misremembered what RAM issue there was a fix about: https://github.com/AUTOMATIC1111/stable-diffusion-webui/pull/11958
the memleak it had on linux was solved by integrating the tcmalloc fix thingy
treesize is ur friend, finding big files 'somewhere'
other memleaks idk about
goodmorning my fellow excited enthusiasts 😄
Also for anyone else reading, its not that my C is small, its a 1TB drive
Its that my H is 16TB and I would like big things that dont need to be fast, there 😛
sdxl leeks now in imax 3d
Ok Ill try that
For Windows I like using Windirstat
ufff. it even downloads the whole SD2.1 checkpoint :/
(hope that changes tomorrow!)
remember after install, run as admin -> else you can't find files that are behind admin folders
i used to, treesize is newer and multithreaded, waaay faster
I would recommend looking at wiztree, if you have not. Instead of going folder by folder bruteforce, wiztree uses the file table to do it really fast
You should try running XL and AnimateDiff in the same node tree while watching youtube videos 😅
why test when you can push straight to main
yeah wiztree can do my 1tb C drive with 4 million items in about 5-10 seconds lol
i'll be checking out this one too, always good having more options
has the same fancy box view at the bottom and everything
winget windows mvp 😄
I'm glad these issues are coming to a head.
Yea its a pretty fun one. Could put it on the list after the next couple things
@visual glade @high skiff Do either of you know what this is or if it might be related to why my gens are slow?
Just need some training data.
and maybe a QR-loss.
haha
Hahah
think tcmalloc was only a partial fix as well. More just delayed the bomb
I mean, I'm sure there are python QR decoders.
yeah knowing pytorch I give that a 50% chance of not fixing anything
comfy bringing the snark, I love it.
i hear tomorrow there's gonna be a UI that gives you a choice of what, if anything, you want to autodownload while it's installing, yknow something like
(made up image, not real, mockup, fake news 2023)
tested it, i think i like wiztree more than treesize 😄
no that's just a harmless torch warning
Applied Team: "Comfy is better."
A1111 Stans: "Nuh uh."
SDXL 0.9 is, uh, released.
A1111 Stans: "But... halp?"
hops back into bot 10
sees someone asking for 'a little yellow boy'
Damn we gettin racist up in here huh
let me see
wiztree and Everything are 2 must haves if you do a lot of files lol
Yeah this tool is basically telling me my whole C drive is completely filled by appdata, .conda, miniconda, and huggingface cache folders. Getting into ML was a mistake
the 10 hours after webui 1.5 released were... interesting
sdxl turned into tech support for a while XD
tf you have installed
now I dont know if that user meant 'a little boy wearing yellow' and their english is bad but xD
i'm a madman, i don't need everything: i know where my stuff is lol xD
and this is why I've reduced my life to venv
i'm still a madman xD
hm. plausible that they meant blond
will keep an eye on future prompts.
yeah thats what im hoping
I could see if english wasnt your first language making that oopsie so
thats why I didnt instantly go full report mode lol
Doubt benefitting.
the simpsons are sad with your assumptions!
tomorrow?
oof. each version of torch with rocm is 1.6 gigs and ig it adds up
what's tomorrow?
July 18th is tomorrow
The applied stage, right?
ye
Who are they?
Yeah its annoying
I was strapped for cash when I bought my hard drive too
^ did you see that stability dev that said they were gonna talk about a new ui during the stage?
Someone told me to get a small SSD for the boot drive and a big HDD, big mistake lol
whoever that 'mcmonkey' person is, they seem really cool and handsome
that was true maybe like 8 years ago
ssds are so cheap now.
Sounds like they might be hanging around that @hard fractal guy too much
sadly he's not allowed to talk about it for another <t:1690394400:R>
that's what i've been doing -> it just got worse progressively
I would still recommend that but 2TB SSD and 20TB HDD
I do believe it is the day after today my good sir.
sdxl 1.0 is overpowered 😮 no man should have this power!
hey nice that's a bot gen?
yeah
I've got 20TB SSD total, and 10TB HDD. I'm ready for SDXXL
Ok youre right, we will delete it
i got a fast ssd for my boot drive and a slow ssd for my data drive
oh and rest assured everyone, the bot is pure 1.0 at the moment and will stay that way as I am already in bed
4TiB C drive and 8 TiB E drive
(QVO drives are awesome, affordable super high cap ssd)
double check what's actually using the space. 99% of my huggingface folder is just DeepFloyd models which I haven't used in like 4 months.
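A quick sketch for checking what is actually using the space: it sums file sizes per top-level subfolder and lists the biggest ones. The function names and the demo folders are assumptions; to inspect a real cache, pass its path (for example wherever your huggingface cache lives) instead of the temp folder.

```python
import os
import tempfile

def dir_size(path):
    """Total size in bytes of regular files under path (symlinks are skipped)."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(path, followlinks=False):
        for name in filenames:
            file_path = os.path.join(dirpath, name)
            try:
                if not os.path.islink(file_path):
                    total += os.path.getsize(file_path)
            except OSError:
                pass  # file vanished or is unreadable
    return total

def largest_children(parent, top_n=5):
    """Return (size_in_bytes, folder_name) for the biggest direct subfolders of parent."""
    sizes = []
    with os.scandir(parent) as entries:
        for entry in entries:
            if entry.is_dir(follow_symlinks=False):
                sizes.append((dir_size(entry.path), entry.name))
    return sorted(sizes, reverse=True)[:top_n]

# Demo on a throwaway folder with one 10-byte and one 3-byte "model".
with tempfile.TemporaryDirectory() as root:
    for name, size in (("big", 10), ("small", 3)):
        os.makedirs(os.path.join(root, name))
        with open(os.path.join(root, name, "blob.bin"), "wb") as f:
            f.write(b"x" * size)
    report = largest_children(root)
print(report)
```

Skipping symlinks keeps linked files from being counted twice when a cache stores one copy of a file and links to it from several places.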
brb messing with twoperc's bot files while he's asleep
All the bots channels 1-10?????
plz no 
but what about your Dataset drive?
does it go clonk clonk, or are you using a slow SSD for long term storage of images you'll use once every blood moon as well? 🤣
though honestly I bet if anyone actually opened up what I'm using right now code wise they'd throw bleach on their eyes, set their house on fire and run into traffic
80% of programmers be like
I would love to burn my eyes.... send it to me
In all seriousness, maybe one day 😉
lol
just run the IDE autoformatter on it it'll be fine
lmao
Yeah it'll nicely format all the gazillion things I have commented out from all the crazy and wacky things we've done over the last 4 months
go on vacation for a week and then re-open your code. you'd do the same, probably!
just ask chatgpt to rewrite it so it allllllll works with just one line of code 😉
also make sure you set your formatter line break to 80 characters for tty compatibility as well
I store everything on SSD lol
HDDs are so last decade
Do you all remember when you were REALLY MEAN TO US WHEN WE NEEDED A FEW EXTRA DAYS TO FINISH UP SDXL
nope
... or today....
No idea what you're talking about
He did, I was there
Did that really happen.... I only joined this discord server like a week ago.
Some people were menaces.
ummm when is he not a troll... hehe
True.
perfect. that's just enough time for me to finish DND XD
Lets just say when you have the settings all nice and people are getting great images for awhile and you suddenly start randomizing things like CFG, Samplers and Steps in wide variety... People get a bit shell shocked
we're counting from absolute beginning of time, right? that should give you enough time to finish up nicely
We're counting back to the beginning of unix time starting now
always weirded me out to see regulars not quite getting it when that happened
Which will come first? SAI's controlnet weights or the heat death of the universe?
Tough to say
the randomized settings ptsd was real. also the influx of 'sdxl bot is trash' users was... painful
CompTroll-N3T
You have no idea how much I appreciate you and the others who whole heartedly have been in this and understand the flux! I'm seriously going to talk to the community leaders here about getting a special role for those who contributed like you
Okay everyone I would like to introduce you all to my wives....
Community Archive Net?
Part of the reason I want to restore full stats
aww shucks. i really don't want anything extra, it's all for the science
i mean, it was stated that the sdxl bot was rlhf, so bad images were bound to happen...
ill take the role.. xD
oh shoot im supposed to be in bed.
So am I. Something about a stage and needing to look alive tomorrow...
early sdxl badges wen
"omg I knew it, they're censoring SDXL at the last minute, gimping it before ||the stage event|| on July 26"
Good morning
Where's Sytan? I need to change his roles
391 days be like
we need a pre-ban role
saw someone on reddit (of all places) asking for a nsfw fix for sdxl, because clearly it's only broken/censored
we finally got rid of all the non-cat generations :)
SDXL 1.0 will finally be the catpic generator we all wanted
I mean, SDXL 1.0 is absolutely amazing at cat pictures
SDXL only makes one image now, but you can Drag it like a GAN
Even when it's all jangled it always manages to pull off good cats...
fun fact: I actually have that in another discord
Isn't the true black and true white much better in sdxl 1?
it's better at posing cats in action poses than it is with people!
Send a photo
this is too good
It only makes pictures of food if youre hungry
'frienemies' role confirmed?
the refiner sometimes messes up cat eyes..
SDXPelled
almost
Parkat
one of my fav's
i'm not censoring you, i'm customizing you
What's nice is honestly 1.0 is much better at detail, so even though the refiner still brings it to another level and helps in pictures with lots of small faces, most of the time stock 1.0 is actually great or sometimes even better
noice.
Whats sdxl?
Something about July 18th?
i'm sure she's a nice girl
SDXL is a sad-laughing emoji with something on its chin and an eyebrow perked up
cats will sit on anything
hi, today xl release or delay?
I'm so excited for July 18th
<t:1690394400:R>
Delayed by 2 months
nice
@hard fractal @dapper current I am genuinely asking... What is it about hands that makes it super hard to generate well? Is it that there aren't enough images of hands to model from? Is there a way as a community we could actually help to get hands to improve more?
I wish we had a July 18th every week
me when i get up every morning lmao
hands don't have a standard "shape" they're in so it treats them like scales or fur and just pattern-fills fingers into the image
All the different angles and positions hands can be in from any angle in my best guess.
The problem with hands is due to the number of positions and angles they can be in combined with fingers. It's hard for AI to come up with a rigid "pattern" that is correct every time.
Also, go look at LAION one day.
sadly i would have to celebrate my birthday every week
so having more pictures of peoples hands wouldn't really help much to improve the model?
probably not it might just give you more misshaped hands lmao
Fine tunes do better with hands because they can ensure the pics are clear, high quality, and have similar poses for them
yes, sdxl does cats 😄 nice calico 😄
More does not necessarily mean better
@dapper current do you know any large finetuner?
i am willing to share my huge dataset with them, around 2million images
SDXL does much better with hands mainly due to its sheer size
Well I do appreciate the info... I was always curious about how I could somehow help with that.
but i wouldn't agree upon that on hands and to generate realistic image
now that 1.0 is finalized and pushed to the bots 1-10, can we know which of the 3 models won? 😄
also many community finetunes are overfitted so i guess that helps with good hands since its just spitting out stuff close to the dataset
what's a bot
think my GPU just hanged.
all my kitten pics are gone now...
who said anything about sdxl? I just said 1.0 😉
Is that the new name of Twitter?
oops darn I've been figured out
did you ever do art?
When you draw hands '100% realistically', you'll first notice the weird thing, of when you look at your own hand in a normal non-posed position, it looks anatomically incorrect
that's issue No.1
next, fingers look similar, just having different lengths - but the lengths depend on view, bending and many other things
that's issue No.2
lastly, while you have 5 fingers, it's not like you can always see 5 fingers. often less are visible. so as an ai, it would under normal circumstances learn "5 or less fingers" (bad) or "there's a high chance of a finger being next to a finger" (still bad)
that's issue No.3
So how do we solve this using TI, LoRA or Finetuning?
The Answer: we don't
What we do is reduce the sample size to only 'posed' pictures where all 5 fingers are visible, or popular poses such as 'fist' or 'thumbs up' or 'pointing' - which essentially introduces extreme bias - but it's bias we want, at the cost of flexibility
probably, fits musks naming rules (always put an X in it)
#1100170312106127410 message someone is making ai art of 9/11 in bot 1
Paypal used to be called X iirc
We are officially renaming sdxl as SDTwitterL
Twitter did take an L, that's true
Hahah
is clip guidance not possible to do locally?
Totally possible, I think diffusers has a decent implementation
seemed to me having that clip guidance in the api via auto made some truly killer images
just imagine people searching twitter video downloader, now they have to search x video downloader 💀 💀
Guys where I can find the notebook file setup in civitai / github pages?
That... is an interesting thought
To run them in Google colab
until musk makes a bad pun about buying SD. Then is forced to buy it for inhuman amounts. Then you all move to Fiji, and SD gets renamed to SD𝕏
I would be careful, he might try to charge you for using that character.
I'm safe as long as I stay in middle Europe XD
anyways, twitter doesn't restrict porn so its legit to change its name to x
I'm sure someday it will get there... I realize its not an easy issue to solve... But at what point does the Intelligence part of AI get to where it actually will think about all the points you just made and put it all together on its own... I also realize it learns the way we program it to learn. Which is why it will take time. I'm sure it will get there at some point in time. I am excited for that day. But yes I agree with everything you said.
to run what now
comfyUI install location is located in settings.json in the roaming directory it makes for itself
😉
there are definitely ways to train hands correctly. essentially do what SAI did for faces this time round.
its not an issue of the machine learning anymore, nor model limitations. Just a matter of painfully making that dataset where you finetune the full anatomical knowledge of a hand
plan is to allow changing install location for comfy & allow for downloading whatever weights you want
well that's when we announce the change twoperc
much effort for little return, when overfitting gives perceptually similar results for 80% of the usecases
Models
oh i just realized you posted in every channel lol
If someone would be willing to help me learn how to do that I would be happy to try and train something that can finetune it.
speaking of hands, i remember sdxl really struggling with power tools in the past, so i tried some just now
https://github.com/comfyanonymous/ComfyUI#colab-notebook https://github.com/TheLastBen/fast-stable-diffusion
wow a few of those are actually close to real tools
Tools, airplanes, pianos (still not perfect)... but much better in SDXL
will still request an 'auto' or 'manual' button during install, avoid polluting the poor C drive
yes
its not even the finetuning.
just the act of gathering roughly 200 images per anatomical part of a hand. hand captioning all those images
it doesn't sound bad, but this is the act of turning 1 concept "hand" into roughly 300 connected concepts. And if you still think its doable, remember that you also need to obtain a fairly equal sample size separated by age, ethnicity, gender & accessories such as rings. Dataset also needs to be high quality enough, so you can have a closeup of the hand, and one uncropped to show the context of the image
its a monumental task that would take years.
It's a good business idea if nothing else - to then sell well averaged datasets for machine learning, to reduce possible bias
ummm... listen... I am tooo fat to climb mount everest... ok. I dont mind going for a walk to the kitchen but dayum... I see why it hasn't been done
That really does sound like a lot to do.
there's a fun conversation to be had here, about how challenging it would be to get a good nsfw finetune XD but wrong discord server to have that conversation
I bet if my wife wouldn't kill me that I could probably get that done in like 10 minutes compared to hands
I'm happy that that nasty dotty texture is gone from images in the bot now.
i could dig through some saved images and probably find an example, it was everywhere, but i don't think it was a watermark.
I am having trouble getting more in my images than like shoulders and up.... Maybe its the aspect ratio.. What is the closest resolution to 1920x1080 that is under the 1024 pixel mark?
I mean the 'invisible' SAI watermark, so that sdxl gen images that find their way online won't be used for future model training
720p
Which is 1280x720
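A quick sketch of where 1280x720 comes from, under two assumptions that are not stated in the chat: the "1024 mark" is read as a pixel budget of 1024x1024, and both sides should be multiples of 8 (the usual latent downscale factor for Stable Diffusion). The function name `largest_resolution` is made up for illustration.

```python
from math import gcd


def largest_resolution(aspect_w, aspect_h, max_pixels=1024 * 1024, step=8):
    """Largest exact-aspect (w, h) with w*h <= max_pixels and both sides multiples of step."""
    g = gcd(aspect_w, aspect_h)
    aw, ah = aspect_w // g, aspect_h // g  # reduce e.g. 1920:1080 to 16:9
    best = None
    k = 1
    while (aw * k) * (ah * k) <= max_pixels:
        w, h = aw * k, ah * k
        if w % step == 0 and h % step == 0:
            best = (w, h)
        k += 1
    return best


print(largest_resolution(1920, 1080))           # (1280, 720)
print(largest_resolution(1920, 1080, step=64))  # (1024, 576)
```

With the stricter multiple-of-64 rule the exact 16:9 answer drops to 1024x576, so which one applies depends on what the UI accepts.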
well I wanted a pikachu being photographed in the wild but I guess that works too
the nikon must be so heavy
If i use the 1280 is my card going to shit itself when I upscale it to 4k?
its a bit of a biased aspect ratio. You'll have to get your prompts and negative prompts right to make that aspect ratio work consistently with people
How do you plan on upscaling
here's the "worst" one i could find. You can see it in the asphalt kinda. Many times in places of grainy-ish detail, you'd see these too-large bumps
probably why he looks so grumpy
a lieka dis
Hmm Idk
ah I see it. also isn't that from prompts?
it's the thing you get when you scan high end artwork
I think I've seen those dots before yeah, is this with refiner or no refiner out of curiosity?
their cheeks always look like seedless strawberries
if you get down to a defining a set of rules that are specific to making hands accurately, you have an atlas of a rule book
so prob an artwork tag got mixed in
from what i could tell, it was one of the contender checkpoints, but maybe from settings? hard to know from this end
does anyone know that comfyui plugin/node that allows inline lora calling? like lora:name:1 in the prompt itself and not having to load it separately
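Not an answer to which node does it, just a rough sketch of what inline calling involves: pull the tags out of the prompt text and hand the names and weights to whatever loads the LoRAs. It assumes the A1111-style `<lora:name:weight>` tag form; `extract_loras` is a hypothetical helper, not part of ComfyUI or any plugin.

```python
import re

# Assumed tag form: <lora:name> or <lora:name:weight>
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::([0-9]*\.?[0-9]+))?>")


def extract_loras(prompt):
    """Return (prompt_without_tags, [(lora_name, weight), ...])."""
    loras = [
        (m.group(1), float(m.group(2)) if m.group(2) else 1.0)
        for m in LORA_TAG.finditer(prompt)
    ]
    cleaned = LORA_TAG.sub("", prompt)
    # Collapse the gaps left behind by removed tags.
    cleaned = re.sub(r"\s{2,}", " ", cleaned).strip()
    return cleaned, loras


text, loras = extract_loras("a cat <lora:poolrooms:0.8> swimming")
print(text)   # a cat swimming
print(loras)  # [('poolrooms', 0.8)]
```

A tag without a weight falls back to 1.0 here, which is a choice made for this sketch.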
also, adding ", descreened" at the end of your positive prompt removes it
bot, pretty sure it wasn't the refiner, i saw a lot of it during the dark times of testing
though it shouldn't show up in first place
I feel like life is great right now... I'm going to wake up like a little kid and it will be Christmas morning with SDXL 1.0 nestled safely under the huggingface Christmas tree... What a time to be alive. 😉
14 hours to go?
12
<t:1690394400:R>
That's the event yeah? Not necessarily the release time
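For anyone reading the raw `<t:1690394400:R>` tag instead of Discord's rendered countdown: the number is a Unix timestamp in seconds, so it can be decoded directly. A small sketch follows; `parse_discord_timestamp` is a made-up helper name and the style letter after the second colon is ignored.

```python
import re
from datetime import datetime, timezone


def parse_discord_timestamp(tag):
    """Turn a Discord tag like <t:1690394400:R> into a UTC datetime."""
    m = re.fullmatch(r"<t:(\d+)(?::[tTdDfFR])?>", tag)
    if m is None:
        raise ValueError(f"not a Discord timestamp tag: {tag!r}")
    return datetime.fromtimestamp(int(m.group(1)), tz=timezone.utc)


event = parse_discord_timestamp("<t:1690394400:R>")
print(event.isoformat())  # 2023-07-26T18:00:00+00:00
```

That is 18:00 UTC on July 26, 2023, the time of the event being counted down to.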
Chinese?
heyy, i just want to think happy thoughts before i go to sleep tonight
I’m pretty confident they will release it. The devs in here earlier were talking it up pretty hard. ‘‘Twas a good time lol
I was here... and I'm hoping they do
It seems that I don’t need to sleep tonight,
German
but it auto translates into everyones native language 😄
Oh yeah hehe sorry I’m so tired
Is there a live broadcast when SDXL is released?
thats what I got... not sure if that is the translation
wait a minute. why you not sleeping? xD go sleep @autumn forum you can play after work!
Can’t sleep, too excited lmao but yeah okay 💤
Go to bed early and get up early to be healthy, don't stay up late, my friend
thank you
be ready for any announcement
Ye
for comfyui should I be setting my Clip Set Last Layer node before or after loading all my Loras?
doesn't matter
ty 😛
I'll probably be asleep during the broadcast it looks
bloody europeans
yes, tovarich
sdxl can nsfw?
sure can
wow
afaik the base model isn't trained with those kinds of images but it doesn't have strict filters like sd2.1 did
sdxl has not been trained on explicit nsfw data. That's as far as conversation in this server should go.
Shouldn't there be a situation where it cannot be popularized like sd2.1?
yeah they should really train it on NSFW, as far as ive seen SDXL is just worse than SD1.5 in every way because they didnt train it on NSFW /s
SDXL is worse than sd1.5?
yeah people dont know it yet but SD1.5 is and will always be the only good txt2img model
/s translates badly for non native speakers
I don't agree, I feel that SDXL has made great progress compared to sd1.5
he is joking.
Why can't we have the conversation, what's the big deal?
小智?
• cause for many people this is a server that is accessed at work
• there is no age limit to join here
• while completely valid to talk about, it's just in bad taste to do so on the official sai server, in front of sai employees who are negatively impacted by media coverage of nsfw in relation to SD
I mean I wouldn't want to have a graphic discussion, and I don't think it's in bad taste at all considering this is a community server about an open source model, no one should feel afraid to be critical of SAI in here of all places
why ?
With "portable" install you can save the file whereever you want, extract the contents, move the folder onto whatever drive you want and then run it.
Similarly when doing a git install, just start in a location that isn't (for example) your C drive
oh definitely, when it comes to being critical. I'm only referring to specifically "make nude photos", "train sdxl to make nude photos" & "can current sdxl be circumvented to make nude photos" nsfw conversations.
anatomy topics should be fine - but that's not what people want to talk about. especially since sdxl gets 10/10 when it comes to anatomy understanding
When will sdxl release?
10/10? Reproductive organs are anatomy too
when Emad finishes playing Tears of the Kingdom
?
yes. I agree.
but that's not what happens
it disregards your current settings, and first installs it to C appdrive folder regardless of what you do
SDXL is bad at drawing chair compared to some fine tuned 1.5 model
like a finetune specifically made for chairs or a general purpose finetune?
lemme rephrase that. contextual understanding of body proportions
which was a huge issue in 1.5 & 2.1
general
<t:1690394400:R>
I would still disagree with 10/10 but yeah it is a lot better
Thanks 👍
Looking through my output folder I really feel like the refiner model is not doing that much good for my gens
ok so what the flip is in this that the bot has moaned about ??
I'm rereading it multiple times and I cant see any words used that havent been used elsewhere
I think it was the 'dragged over coals' part 🤣
Yeah I agree with you... I feel like people are conflating nudity with hardcore p*rn
4 random seeds dreamshaper vs sdxl for product photo of a wooden chair, studio lighting. SDXL followed the prompt a lot better but in turn it has much more non-euclidean geometry so pick your poison
totally. I mean I lived in Greece for 10 years - so that should tell you enough about what I consider to be nsfw and what not XD
but yeah - stuff like bikinis sdxl does real well. so that shouldn't be an issue, right?
Yeah breasts are mostly what people want to gen so it will not be a huge problem for the model gaining adoption
my issue is that SDXL is a bit like many of the 2.1 models in that it has problems with nipples
this man does science 🤣
nope just the one word
not lingerie
how about lingerie?
more steps seems to help. could also be rng
Some nice ones here actually
so you can't say 3rotic but you can say PORN okay
But that sort of geometry SD will always struggle with
to keep it short. loras are real easy to train.
like I said many times before, finetune levels of improvement are now doable with a simple 43mb lora.
I don't say that in theory, I say that because I've made a fair share of loras for 0.9 to test the limits. so just give it 2 weeks or less - and then see how happy you are with sdxl
so nipples is ok , sexy is ok, but 3r0t1c isn't?
Where it looks wildly different depending on the viewing angle
cos that makes complete sense lol
Might be I was biased on bad SDXL photo vs good 1.5 photo
bruh
the word snowflake itself is more likely to trigger IMHO lol
lol
My fear is they are all from roughly the same viewing angle. Diffusion models have no real concept of 3D, if you asked for the camera to be between the legs it would have no clue what it's doing, even asking for a top-down view would probably be too much for it. It's the same problem SD has with hands
this was solved though? at least with my lora of 2B Cosplay, I made gens from all angles, and could control them consistently
unrelated but i really really cant wait for a good text to 3d large scale world
obviously cant say the same for chairs XD
same
For human figures maybe, enough training data can overcome the problem
train a lora for upskirt photos of chairs
I feel called out 🤣
nsfw chair images
damm i was really looking forward to making chair porn with SDXL
nsfw chair
going to sleep now, see you guys tomorrow for the big event 🥳
I mean its only an hour of work. tempted to make a "upskirt chair" lora for 1.0 release 🤣
comfyui just added this as a feature. Using the conditioning timestep you could blend chair and pornography prompts to give the chair a massive horsecock or something
make a point that any dumb concept can be trained
lol i didnt expect a serious response
I'm the most serious person you'll ever meet
Ok this inspired me to see if SDXL is any good at guns
Yeah it seems pretty good 👍
Of course the fingers holding the gun are bad
irl people can't decide how guns should look so the ai 100% can't. gonna get 2 magazines 3 charging handles and 17 selector knobs
2 barrels
maybe if you specify something specific with a really strong presence like the 1911 it'll work better
I think it just assumes the gas block is a 2nd barrel
add a rear facing barrel to kill people behind you
concept art + rifle makes for fun guns
concept art of a super soaker M60 machine gun
WW2 scene of a battlefield in germany, with everyone holding super soakers
artist's rendition of a Nerf guillotine during the french revolution
what in gods name 🤣
im asking the bot rn
Nothing too crazy
ew handshake grip
with full auto trigger
I wonder if there will be a model like niji for sdxl in the future
there will be like 50 of them
tf is 'niji'
I hope so, cause base and refiner is still not on par with niji.
anime model I think
no
It's a collaborative model between spellbrush and midjourney which mainly focuses on anime
nijijourney
Trained, and nothing comes closer to it when it comes to anime as of now
Hmm from testing it seems like Comfyui's conditioning-average and similar nodes only work with the old CLIP
does LAION have NSFW images?
I used conditioning average with SDXL 0.9
It only works with CLIP-L though
Yes but they don't train on those
In fact it seems to totally disregard your CLIP-G prompt if you use it
1.0 day is finally here!!!
Message 19.00 UK time?
delayed to 2024
20.24 ... 24 minutes after 8 p.m.? 🙂
any new release?
is today the day we get jebaited?
im hyped
also yeah cant wait to see if its moved.
if it is, good, more time for me
B
and I want TensorRT SDXL, that ain't happening either
who said its not
I have been trying to alter the code to make it work. TensorRT only works on 1.5
wait do i need an extension to run a TensorRT model on Auto?
yes
it's worth it though
id assume it works on a 3090
obviously
what am i looking at in terms of speed improvements?
Its an optimization made by NVIDIA. almost double the speed if not even more without precision decrease
so its faster than the speed of light
pretty much, it's black magic
I bet after this community figures out how to use this with SDXL it might become a default optimization
it's not relevant for you. 1.5 is nothing compared to SDXL. I advise you to wait until it supports SDXL
i can use the main AutoBranch right... not the old Dev branch?
.
maybe i want super speed 1.5
fair enough.
wait but you have a 3090.. it's not already super speed with 1.5?
yeah... but why would i say no to more extra super speed
it doesn't work with high resolutions.. the A1111 implementation isn't coded correctly.
ugh... fine
however, I bet TensorRT for SDXL will show up eventually =]
Will building Loras work the same with SDXL and do you need that "refiner" thingy when it releases today?
LoRAs should be easier to make, the refiner is not as crucial, but it will be used by most people here I bet
oh will they work? no. SDXL LoRAs for SDXL
Ty for the answer
what are you using to make these?
SDXL0.9
are these SDXL 1.0
Has anyone noticed these red dots all over darker parts of images coming out of SDXL?
First time seeing that 
watermark
using Diffusers?
huh? stable diffusion is a diffuser
wdym?
yeah, a1111
a1111 im not sure. Vlad's uses Diffusers which has a watermark