#✨|sdxl

1 messages · Page 23 of 1

halcyon tusk
#

also on the bots 1 to 10

true wing
#

this was my understanding as well

urban fjord
#

Keep in mind that memory use scale differently with SDXL than 1.5 so reducing res won't help as much now as before.

kindred plinth
urban fjord
#

McMonkey mentions in the gist that you will exceed 8 GB VRAM at certain points which is why an updated Nvidia driver is important.

urban fjord
civic sigil
#

So SDXL isnt good for the grid method like 1.5 is?

urban fjord
#

High resolution won't add details SDXL hasn't learnt.

#

Caq: If you're talking about what I think you're talking about it is just less needed.

kindred plinth
#

wait is this thing based on an actual prompt?

true wing
kindred plinth
civic sigil
civic sigil
kindred plinth
#

oooh yeah idk then i thought u meant tiled upscaling

kindred plinth
eternal fog
#

Yeah @wicked frigate No idea what's going on with the trainer. Regardless of what size I set my images to, it always uses 12GB+ of VRAM and starts to use system memory and goes insanely slow, or with the old Nvidia Drivers it OOMs

urban fjord
fresh path
urban fjord
#

In 1.5 you needed different tricks to achieve higher resolutions due to VRAM usage but with SDXL scaling more gently that would of course not be needed.

delicate grotto
true wing
tawdry panther
#

comfyui is so clever wtf it caches the parts if they dont change

delicate grotto
#

ah

eternal fog
#

Something has broken with the training scripts

#

And it seems like you can no longer train LoRA on less than 12GB VRAMish

#

Or at least that's what happens to me

true wing
tawdry panther
#

its so good wtf

#

automatic1111 feels clunky in comparison

#

it might just be that i dont like gradio

sage basin
#

When I was training last week on a 12GB 3060, I had to turn the text encoder training off (and cache the text encoder inputs) otherwise it'd try to allocate like 60GB

fresh path
trim orbit
#

juggernaut is a trained 1.5 model

sage basin
#

I tried it again just now with the text encoder on and it works, but it's painfully slow

eternal fog
sage basin
#

I can definitely feel your pain

true wing
fresh path
#

okay, thanks, I believe sdxl will replace 1.5 entirely

lusty raptor
paper phoenix
#

voting question: are people voting on preferred generation between two images or how well it followed the prompt?

eternal fog
true wing
trim orbit
lusty raptor
trim orbit
#

1.5 has so much momentum to it

fresh path
#

idk, the speed is pretty good

civic sigil
trim orbit
eternal fog
civic sigil
true wing
eternal fog
#

Yeah it's not

civic sigil
#

I imagine gradient checkpointing is a big reason

trim orbit
#

12 and 16gb gpu sales will skyrocket

eternal fog
#

I can tell it to train 1x1 images and it still OOM, there's something broken

#

trying to force full fp16 training and see what it does

fast barn
paper phoenix
eternal fog
trim orbit
#

when i first bought an 8gb gpu, people thought i was nutter cause it'd never be needed for years. then gtav pc version came out.

soon as theres a need, people upgrade in huge droves

civic sigil
shy kelp
#

seems like some of the A/Bs by the bot is just using the same model with refiner at different strengths

trim orbit
true wing
trim orbit
#

i look at steam hardware survey instead

#

most gpu owners run steam

civic sigil
true wing
paper phoenix
civic sigil
shy kelp
#

but i think 35% is generally too high

#

so they might be testing what exactly is the best %

true wing
fast barn
urban fjord
trim orbit
#

bitcoin never really made a hardware arms race. there were some software competitions. Ethereum mining was more about deploying as many gpu's of any performance as possible

wicked frigate
trim orbit
#

bitcoin sure accelerated ASIC development for a quick bit

fast barn
trim orbit
#

Machine Learning has significantly upgraded Nvidia's hardware design

civic sigil
eternal fog
paper phoenix
true wing
kindred plinth
trim orbit
#

market is so fucky

kindred plinth
paper phoenix
#

selling cycles as service could bump it up but i might as well be writing the worlds most boring scifi now

kindred plinth
#

also inflation lol

paper phoenix
#

its off sdxl chat but just to bookend my thought, i can imagine web servers having AI and accessing sites being generated on the fly by specifically tuned AI.

true wing
paper phoenix
#

but again, boring fanfic.

grand canyon
#

Node based UIs are very common when you get into the more technical side of creative software. Look at Nuke, Unreal, Resolve, Blender, and many many more. More technically oriented artists will likely feel very at home in ComfyUI. Less technically oriented artists probably aren't going to be using Stable Diffusion directly in the first place and will more likely use text to image through higher level tools like Photoshop.

I think one of the biggest weaknesses Comfy has right now is using it for highly interactive workflows. This is not an inherent issue though. It can be improved. Even then, artists that do anything involving non-realtime rendering may find that iterating on comfyui flows is not that foreign. Comfy will need the ability to run workflows programmatically so that it can be integrated with render farm management software though.

trim orbit
#

training a 3rd friend on sdxl. can't share iamges causee my friends are private types and don't want me doing that. good learning experience though.

Using kohya to train loras with the new prodigy optimizer. it's a dadaptation based one. works nice. 15 epochs with 20 images repeated 10 each. takes about 2 hours each time.

#

its pretty remarkable how easily good it is

civic sigil
tawdry panther
#

this is wonderful

paper phoenix
#

what machine do you have @trim orbit

#

i know someone training rn too and they're getting glacial numbers

trim orbit
true wing
soft bone
vast narwhal
eternal fog
trim orbit
#

yeah if i launch it wrong, it slags and estimates 30-40 hours. gotta have bf16 on, caching the text on, --network_train_unet_only on

paper phoenix
trim orbit
kindred plinth
trim orbit
#

it'll be cool to see what Jim Keller's TensTorrent pushes out next. Their original hardware line was a good start.

#

maybe he'll end up back at AMD again

true wing
slender coral
#

In 7 years we're going to have biotech GPUs 😄

trim orbit
civic sigil
sudden cliff
slender coral
#

And we won't be using vectors for graphics processing.

true wing
kindred plinth
vernal cloak
#

What GPU did you use to render that? A Quadro?

grand canyon
# civic sigil It needs more nodes built in too tbh, the experience of developing custom nodes ...

Probably, but there are some really good custom node libraries available now. I've been dipping my toes into developing custom nodes, and the biggest issue I see is lack of documentation, but that will come in time. I find custom node development pretty trivial in most cases. I've worked on plugins for various DCCs though, and am very familiar with node based UIs, so this is pretty familiar territory for me.

sudden cliff
kindred plinth
#

between models getting easier to run, more specific accelerators, and the more demanding stuff being able to be run remotely its deffo a toss up as to how important buying expensive AI accelerators for local use will be

true wing
trim orbit
#

i was a good enuffer for years and years

civic sigil
trim orbit
#

thats why i always bought amd

civic sigil
#

And primitives in general seem to have bugs

#

That I cant quite figure out

#

Like weird incompatibilities

vernal cloak
true wing
# trim orbit thats why i always bought amd

Honestly if Nvidia wasn't so absolutely dominant in the ML and VR space I would still be with AMD. I started out literally getting scrap computers from a local college my uncle would give me and just upgrading the GPU and ram.

kindred plinth
true wing
#

my frankenstein machines ran pretty well

trim orbit
civic sigil
trim orbit
#

i love frankenpc's

kindred plinth
true wing
vernal cloak
trim orbit
#

for a while i was turning it on by leaning way over with a paper clip and getting into the mobo just right to touch it to two specific pins and to close the circuit

true wing
civic sigil
latent zodiac
#

just use ziplm 🤣

trim orbit
true wing
hard fractal
kindred plinth
true wing
hard fractal
true wing
#

thanks for that

vernal cloak
# hard fractal A100

Ahh..... There is a few people who use that GPU on here. Yh. It's around 48Gb Vram isn't it?

trim orbit
#

a100 has 40 and 80gb models

eternal fog
#

@wicked frigate Ok yeah confirmed, there's something that's been added to the code between that commit in your gist and now that has caused the training to use more VRAM.

I've noticed with this - ImportError: cannot import name 'StableDiffusionXLPipeline' from 'diffusers'

The newer script uses that StableDiffusionXLPipeline, but the older script doesn't

kindred plinth
#

cant it also do shared memory?

trim orbit
#

nvlink capable yeah

vernal cloak
#

80Gb. blimey... That is nuts

kindred plinth
#

yeah and you can get even more by linking them lol

rustic temple
kindred plinth
trim orbit
kindred plinth
#

Yeah kk that's what I expected

true wing
visual glade
#

6000 series from AMD get more performance than that with ROCm

kindred plinth
visual glade
#

6800XT is about 7it/s last time I tested and it might have increased a bit with latest comfyui changes

trim orbit
kindred plinth
#

Yeah GH100 is fucking sick

trim orbit
visual glade
#

and 3090 TI is 24it/s or slightly more depending on how many windows I have open, at least that's what I get on ComfyUI

true wing
kindred plinth
#

Is comfyui considered the fastest?

true wing
trim orbit
#

the 4080 numbers are about what I see still currently. i've been getting those since i got it and optimized that hog as much as i could. think it was about february i hit 23it/s with the right config. ever since python 2 i've had it steady 23-24it/s

kindred plinth
#

Me with my 4.5 it/s (sd1.5) rtx 2060

trim orbit
#

relied on it with my vega64 for 1440p gaming

kindred plinth
#

Yeah dlss is awesome and deffo better than fsr

visual glade
kindred plinth
trim orbit
#

i've got sdxl working on the /dev branch of automatic and i think vladmantic's sd.next has support in the main branch

kindred plinth
#

Cuz the huge customizability of node based isn't always necessary

trim orbit
#

comfy is faster still

#

refiner doesn't hook up well in auto either

kindred plinth
#

Yeah which is why I'm curious if it can work with a simpler ui

#

But keep the comfy backend

visual glade
#

yeah you can easily make a simple ui on top of it and there are a few of them

trim orbit
#

the dream ™️

grand canyon
#

Backwards compatibility is always difficult with these sorts of things. For production ready custom nodes you probably need to release an entirely new node and deprecate the previous one if you are changing the interface in a breaking way.

This is also why VFX studios lock down their software during a project and for the most part only upgrade between projects.

In terms of issues with primitives, I'm curios what issues you're running into.

trim orbit
#

don't f with the tool chain

kindred plinth
trim orbit
#

there's always that one technical engineer guy in a design studio who is panicking over anyone suggesting new changes

civic sigil
trim orbit
#

"CHANGE?!"

civic sigil
#

And changing the inputs in general often breaks my nodes on existing workflow, it works for new nodes tho

kindred plinth
#

thx

grand canyon
civic sigil
#

And last time I did it it still broke for unknown reasons

soft bone
true wing
grand canyon
civic sigil
visual glade
#

when I update a node in a way that it breaks something in the frontend I add code to fix it

soft bone
civic sigil
#

I'm mainly concerned about primitives not being compatible but its whatever, my workflow is almost complete

delicate grotto
livid basin
#

For reference, in academia at least I have never met anyone using AMD/Intel GPUs for research ever

upbeat summit
#

are we doing monsters?

livid basin
#

It is either Nvidia GPUs (consumer or commercial) or Google TPUs, that's it

urban fjord
#

Monsters?

grand canyon
true wing
visual glade
#

no a custom node can't do this unless it also adds something to the frontend

#

by that I mean a file in the web/extensions folder

livid basin
upbeat summit
livid basin
#

I have never met anyone using AMD or Intel to train large scale networks

trim orbit
#

there's already a bunch of 0.9 loras on civit

#

gogo go go go go

civic sigil
#

1024

#

Yes way

trim orbit
#

a1111 has support already if you install /dev branch

true wing
#

just make a copy. That's what I do before any A1111 update.

upbeat summit
soft bone
trim orbit
#

just make a new folder then

true wing
eternal fog
#

Who is it that made that 2b Lora for SDXL. I've forgotten who it was.

upbeat summit
upbeat summit
livid basin
#

Basically it seems that the prompts that are in showdown/pantheon are more tightly clustered than the prompts in Bot-X

true wing
#

It does a lot of styles but I think people like the mid journey 'look' so the model has a bit of a bias in that direction

olive flare
#

Hey what's the best current workflow to use in comfyui ?

true wing
#

they put a lot of work into it and it shows

olive flare
#

Yeah! That's the one I was using last week, has it been updated ?

summer grove
#

I pruned the model from 13 g to 6 g this is the output seems to look pretty similar and took a lot less time. Not sure if there are other pruned models out there or not

upbeat summit
#

@west breach I've send you a message

livid basin
true wing
visual glade
#

you can use comfyui to "prune" models

visual glade
summer grove
#

just a model convert down to fp16 and no ema

visual glade
#

do this but without merging

eternal fog
#

It's not really pruning is it for SDXL, it's just converting it to FP16 instead of FP32

livid basin
#

Ah so just casting, not real pruning

kindred plinth
#

christ it hasnt even been a year since SD 1.5 what the fuck

livid basin
#

For the love of god, people need to stop upvoting midjourney style pics in the bot-X channels, that's how we end up with a base model that can't do photo realism

true wing
#

Just to be clear, these were all generated by the bot but a lot of them were while the refiner was on. Just because 'base model' has two different meanings at this point.

grand canyon
eternal fog
#

ok well I've found a specific version of the training scripts thanks to mcmonkey that lets me train 1024x1024 at batch 2

soft bone
true wing
eternal fog
# true wing Nice. How much VRAM is it using?

I think it might be using a tiny bit over 10 and going into system RAM. I'm just checking at batch 1 to see if it changes the speed. As it was still running 3s/it even with all the VRAM being used.

west breach
somber hill
#

First Ever SDXL Training With Kohya LoRA - Stable Diffusion XL Training Will Replace Older Models : https://youtu.be/AY6DMBCIZ3A

How to install #Kohya SS GUI trainer and do #LoRA training with Stable Diffusion XL (#SDXL) this is the video you are looking for. I have shown how to install Kohya from scratch. The best parameters to do LoRA training with SDXL. How to use Kohya SDXL LoRAs with ComfyUI. How to do checkpoint comparison with SDXL LoRAs and many more cool stuff.

...

▶ Play video
eternal fog
paper phoenix
#

thats p tight

eternal fog
#

ugh, I'm getting NaNs during training now I'm a bit in though

paper phoenix
#

dang

eternal fog
#

I have had this working before, so I'll try with some different settings

olive flare
#

Are you using no half vae ? or the updated vae for half precision ?

eternal fog
#

Was using the updated vae

olive flare
#

oh okay! I don't know then 🙂

eternal fog
#

It might be AdamW, it was doing this before, but prodigy worked, so I'll try that again

olive flare
#

no

#

Has any of you been able to produce sampling images during the training ? no matter what parameters I'm using, it's always... weird

#

SDXL lora training yes, on kohya

urban fjord
#

Sampling during training isn't working for me so I just hope for the best and test afterwards.

magic dawn
#

Is there any tool for training refiner yet?

olive flare
soft bone
amber fulcrum
trim orbit
#

i've made a few loras. they're good. might run a large set i have over night

soft bone
#

eyes :/

livid basin
strong bramble
soft bone
#

they dont pass for real

livid basin
#

Oh wow, you really gotta squint to see that. I glanced at it and thought it was fine

#

I even zoomed into the image and took a look at the eyes, but forgot to look at the pupils

lusty raptor
#

we're a bit used to seeing them with reflections, so maybe that's why it's not immediately obvious

livid basin
#

Really speaks to how good these models are, even after you told me the eyes were messed up it wasn't obvious to me

soft bone
trim orbit
#

dof is a little funky. shoulders in focus but the chest look like it's miles away

eternal fog
#

lol so @wicked frigate it will start training on that commit. But it NaNs almost instantly regardless of settings.

soft bone
olive flare
#

if you want I have a hack to produce sampling images during training of LORA but it requires messing inside 2 python files
But it's really slow and needs more VRAM

trim orbit
#

adetailer in auto1111 is actually helping my lora gens. not to it's best effect though. it's just straight img2img ing through the base model again, without any controlnet

agile quarry
inner ruin
#

I have mastered the LoRA

#

I am become LoRA, destroyer of animes

fair crow
#

Fucking weeb 🥲

visual glade
agile quarry
#

colab T4

soft bone
inner ruin
soft bone
#

training time?

primal vault
inner ruin
#

(on an A100 80gb lol)

soft bone
soft bone
inner ruin
soft bone
#

Its not very good lol

inner ruin
#

which defeats the purpose (+I used reg images)

#

now I'm trying to bring it down to rank 16

#

LR was 4e-4, trying 1e-4 as well

soft bone
#

storage isnt an issue for my purposes. I wonder if thats the only big downside of high rank

inner ruin
#

so it slows down inference. It's an annoyance mostly, totally fine

inner ruin
soft bone
inner ruin
soft bone
#

or when i squint

true wing
inner ruin
#

I thought someone would have nailed a face params by now but it's a bit hard

marble dew
#

I got a NVIDIA GPU laptop at last what should I run SDXL on frens

eternal fog
inner ruin
marble dew
#

we'll all be hyperdreamboothing in no time tho I'm sure

inner ruin
#

(would sincerely love some tips)

inner ruin
true wing
soft bone
#

in the spirit of the new For Honor hero

livid basin
#

I'm very glad the style prompts got released, to me that's just as important as having the SDXL 1.0 weights

#

No joke

#

It makes reproducing the bot-X/showdown/pantheon stuff easier

urban fjord
#

Though with the random models going on we can't exactly reproduce much.

soft bone
livid basin
#

Fair, I don't need to reproduce the images exactly, just get close enough

#

For research

urban fjord
#

I hope things get better when I can run 1.0 locally as most of the prompts I've tried seem to be worse in 1.0 than 0.9.

soft bone
civic sigil
#

Tbh most of the benchmarks for vram use are done on linux with gradient checkpointing too

latent zodiac
urban fjord
#

Face LoRA's are boring

eternal fog
olive flare
#

it's funny to see that "deformed" made it to the list of the negatives in the style prompts 😄
"ugly,malformed,deformed,strange,2345_fingers,two_heads"

inner ruin
civic sigil
soft bone
true wing
trim orbit
#

bf16 works best for no nan's

latent zodiac
trim orbit
#

everytime i use fp16 i get nans 10min in

civic sigil
#

But I was able to get up to 768x 1 batch size on my 3060 12gb (8 rank lora)

#

On windows

eternal fog
livid basin
#

Stop voting for midjourney style pictures and we all end up with a better base SDXL 1.0

latent zodiac
#

Especially if all you have is a M1 lol

civic sigil
#

Yeah I still dont get how mcmonkey got his results

urban fjord
trim orbit
#

difference with MJ is we can train our own embeddings

#

or loras or whatever

#

base model looking like MJ for short prompts? oh well

true wing
eternal fog
# civic sigil Hmm ok

Hmm Kohya just updated his scripts after I logged in issue and it's training with the newest version now

latent zodiac
#

1.0 is awesome on the #bots so much easier!

urban fjord
#

I was trying some styles in 0.9 and one of these hit the mark and the other seems quite off. If all the differs here is the model then it seems like they differ a lot when it comes to understanding prompts/styles.

livid basin
#

Let us pray to the SD gods, if the base model ends up like midjourney it is going to be fun in a not so good way

eternal fog
latent zodiac
#

SDXL has produced the funniest image i could never get in any flavour of 1.5

eternal fog
civic sigil
#

Hmm that does look promising

civic sigil
#

I forgot ab that

eternal fog
#

Well it's not NaNed yet

#

And it would have by now

urban fjord
#

I'm a bit worried to see that the 3 models were that close in the piechart.

civic sigil
#

Thanks for reporting the issue

eternal fog
#

I'm using Prodigy as that's worked for me before, so I'll let this run and see what it does and then try with AdamW because that's never worked for me on SDXL

civic sigil
#

Same

eternal fog
#

Prodigy also heats my VRAM up less

latent zodiac
#

no Lion ?

eternal fog
#

I'm just trying to get this damn thing to work

latent zodiac
#

anyone know if SDXL will work for us mere mortals running a M1 pro lol ?

livid basin
#

Get a Nvidia GPU and run linux like the rest of us 😄

#

No Mac/Windows/AMD bugs

true wing
#

you could try running 0.9 and see?

trim orbit
#

prodigy has been working good for me

soft bone
#

spooderman

soft bone
urban fjord
#

18 seconds per its for this round of LoRA training. Everything is fine...

#

If only I had 13 GB VRAM instead of 12 GB

eternal fog
#

holy shit the Lora worked

#

15 images at 1024, 15 minutes and it's got the style

livid basin
#

What are you training on?

eternal fog
livid basin
#

Sorry, I mean what hardware

eternal fog
#

3080 10GB

livid basin
#

Ah oof, that 10GB

eternal fog
#

It's finally worked though and it's worked pretty well for a no effort attempt

#

The art is slightly horny though, so looking for something suitable to post

#

It's not perfect by any means, as I just used some random settings. But you can tell it understands the style.

primal nimbus
#

kohya?

eternal fog
#

Yeah

#

He just pushed an update that made it work on less than 12GB

primal nimbus
#

what do you run the checkpoint on after that

#

auto?

eternal fog
#

ComfyUI

nocturne dove
#

I've been using loras with comfy and training with kohya. Works great.

eternal fog
#

Something I have noticed though is it breaks out of the style really easy if I change the aspect ratio to one I didn't train on

#

But that might be the settings I used

#

Plus the fact that I only ran it for 15 minutes

nocturne dove
#

sdxl 0.9 is pretty good at styles. Out of the box, it's better at "by Frazetta" style than when I fine-tuned 1.5 for it.

livid basin
#

Images seem a bit deepfried and over saturated

#

Too high cfg?

nocturne dove
#

Mine? They are supposed to be that way.

livid basin
#

Nice

soft bone
#

i was born in the wrong generation 🥴

livid basin
#

We don't have the vietnam war thou

#

No vietnam war, no hippy culture

fresh path
soft bone
eternal fog
fresh path
#

I want to know if lora on specific characters works well

eternal fog
#

It should do

#

I've got some old datasets I can try, but I dont have enough time today

nocturne dove
#

I trained a lora on a person and it was as good as when I do the same for 1.5.

eternal fog
#

They all have Booru tagging and I'm not sure SDXL likes that

livid basin
#

1girl as a single word skews the image towards nsfw

#

Just from my analysis of the images in showdown

eternal fog
#

Now I don't know if this is just the low training time, the dataset, or the lack of training the text encoder. But if I prompt anything that isn't in the training data, it loses the LoRA style completely

nocturne dove
#

In addition to painters, 0.9 also understands the styles of a lot of famous photographers. So if you have not tried that, then you should. It's fun.

eternal fog
#

I have to crank the weight a lot

livid basin
#

I wonder if it has anything to do with dual text encoders

eternal fog
#

It could just be crap training settings

livid basin
#

For the failure to adapt to unseen words

fresh path
eternal fog
#

Nah, well trained style LoRAs work with pretty much anything

#

It does sort of work with the weight cranked though

fresh path
#

she’s wiggly

eternal fog
#

Yeah I think that is training time + Small dataset

fresh path
#

and .9

latent zodiac
#

do we know what artists we can use ?

nocturne dove
#

I like "painting by rembrandt" for portraits.

#

I like "Photo by Herman Leonard" for photographic portraits

eternal fog
#

I might try a run with Booru tags tomorrow and compare and see what the difference is.

#

Now I know it works I can try stuff

lusty raptor
#

but I had to run simple prompts in G, and most of the tags in L

eternal fog
#

Did you train the text encoder?

#

Because I wonder if that would also make a difference

lusty raptor
#

i did

eternal fog
#

I don't think I have enough VRAM to do that

lusty raptor
#

been meaning to try one without, but been too busy

trim orbit
#

when you guys are making loras, are you doing --network_train_unet_only ? does that mean captions don't work?

eternal fog
#

Captions do work, because I used my captions to test the LoRA and it did what I expected

#

I think you just aren't training the bigger text encoder

nocturne dove
#

I use the UI and I just set the learning weight for the text encoder to 0. No idea if that's correct, but it has worked and I only use captions in txt files.

urban fjord
#

WaifuXL is out 😄

autumn forum
eternal fog
trim orbit
#

yeah i'm training with captions too and they seem to work. i was just curious if it's confirmaiton bias or what

nocturne dove
#

I trained a person that the model didn't know. So the fact that it works is a lot more obvious than with a style.

eternal fog
#

I've been picking specific styles so I can tell

latent zodiac
#

i like this SDXL style guide!

lusty raptor
#

will be cool to see how an early finetune looks

civic sigil
trim orbit
civic sigil
#

Unrefined chad

lusty raptor
#

any of you tried refiner on anime and had any luck?

#

i tried it on some akira-ish images and it was no bueno

eternal fog
soft bone
autumn forum
livid basin
#

So what do people think of SDXL 0.9 vs 1.0?

#

Based on the bot so far?

vast peak
#

What is SDXL exactly?

upbeat summit
# soft bone

very cool - really like the styles you are exploring. I also like making analog photography images in general

vast peak
#

When exactly is the release date?

livid basin
#

1 week

vast peak
#

What happened exactly?

shy kelp
#

doing some extra fine tuninigs i guess

autumn forum
soft bone
urban fjord
# livid basin Based on the bot so far?

Based on local generations vs 1.0 on bot then 1.0 is a step down. I hope this is just do to settings being wrong and that 1.0 will actually be better than 0.9.

eternal fog
#

ok the training with AdamW came out a lot better than with Prodigy

upbeat summit
# soft bone you're the analog wizard have you used the analog film style in pinned messages?...

I created this analog photography portrait prompt build for SD 2.0 - it also works great with SDXL. Just fill in the placeholders:
cinematic movie extreme close-up still of an epic scene of a [ETHNICITY] [OCCUPATION] in the [SEASON] at [DAYTIME], centered, looking into the camera, fog atmosphere, volumetrics, photorealistic, from a western movie, analog, very grainy, film still, kodak ektar, fujifilm fuji, kodak gold, cinestill 800t, kodak portra, photo taken by thomas hoepker

#

but there might be better ways with SDXL. I just refactored a couple of my old prompts

soft bone
eternal fog
#

Hmm ok, I think when you don't train the text encoder the prompts work better in the text_L

eternal fog
#

Putting them in Text_G seems to then pull the style away

upbeat summit
soft bone
eternal fog
dusk mica
#

Sdxl 1.0 looks much better for me imo

eternal fog
soft bone
#

MC face reveal

vast narwhal
#

Besides Big boobs, what is impossible for SXDL to do?

dusk mica
#

Wat

eternal fog
#

It can do big boobs fine

trim orbit
#

youtuber told me it was unpossible so i'm sticking to it. i love youtuber!

vast narwhal
vital python
#

Is the sdxl release delayed by a week ?

vast narwhal
dusk mica
#

What does style do technically? Is it added to the prompt or smth elese entirely?

upbeat summit
urban fjord
soft bone
#

had to run my own test. big enough?

dusk mica
#

Deviantart: Larger

sudden cliff
#

prove it to the man

upbeat summit
vast narwhal
latent zodiac
autumn forum
soft bone
latent zodiac
#

I am sure it's a matter of time

trim orbit
autumn forum
eternal fog
#

roop faces are only about 128x128

soft bone
#

rare photo of me

trim orbit
#

roop is very low res yeah. thats even more apparant on sdxl codeformer helps a lot

latent zodiac
#

yeah downsamples to 128x128

#

there is talk of a 256 and 512

soft bone
vast narwhal
#

The developers made a ranking of the best and worst samplers, did they release it?

solemn bear
#

for sdxl?

vast narwhal
latent zodiac
#

anyone got a list of known artists ?

#

@solemn bear sup man!

soft bone
solemn bear
latent zodiac
#

@solemn bear loving your work on PAI man

solemn bear
latent zodiac
#

yeah artists names would be good

undone bloom
#

So, from what I can tell, one of the final candidate SDXL models has zero terminal SNR in it?

latent zodiac
soft bone
#

I will pay for someone to make an eye-fixer node

latent zodiac
#

@soft bone codeformer?

#

inpaint ?

#

lol

soft bone
#

like a one-shot method tho. inpaint takes cherrypicking, codeformer ruins likeness once I'm generating on trained faces @latent zodiac

lusty raptor
#

it would be nice

undone bloom
#

I really do hope the zero terminal SNR model is selected as the final one... because if it isn't I'm going to have to figure out how to shard SDXL on a TPU to add it back in...

lusty raptor
#

had this one a bit ago, it's great except....

topaz snow
#

personally i enjoy having my generations' eyes look like they're tripping on acid

gritty moat
#

Why it looks like that only difference between two variants are clip skip value?

soft bone
shy kelp
#

@soft bone how you upscaling these?

eternal fog
#

ok, so I think character LoRAs are going to be easy

urban fjord
eternal fog
#

I just did a test with a dataset of 120 images, ran it for 15 minutes, so about 600 steps, my tagging is dogshit, but the likeness is pretty much spot on

trim orbit
#

i dont think loras for the base work for the refiner. so i wonder if thats what interoperability means

eternal fog
#

That shouldn't really matter

urban fjord
#

The refiner doesn't need to know your subject to refine it.

undone bloom
# urban fjord I don't know which you're talking about but one is so bad at following prompts t...

I have been finetuning a zero terminal SNR model off of SD1.5 and I can say that prompting it is absolutely different, most likely because it doesn't have the learned average brightness from the broken noise scheduler holding it back from doing what it wants to do. The upside is that you have control over the brightness. The downside is that you have A LOT of control over the brightness.

latent zodiac
urban fjord
#

I'm happy with 0.9 abilities when it comes to light

eternal fog
#

for the small amount of training this LoRA is extremely good for a character

trim orbit
#

doesnt take much with phoenix optimizer. so simple

undone bloom
#

yeah but zero snr models can do this

eternal fog
#

I used AdamW Optimiser

urban fjord
#

So can 0.9, all you need to do is to set aeshetic score to 1 billion in ComfyUI drhead.

#

Another 0.9 image. Manages bright and dark portions of the image well.

soft bone
#

0.9 is great at dark

undone bloom
#

this is what i managed to get out of the discord bot for the same prompt

soft bone
urban fjord
#

Solid black background is a gimmick, while it says something about the model what matters is the images you can create with it.

#

I don't actually want images where I can't see anyting.

undone bloom
#

i've also noticed that models with zero terminal snr perform better at inpainting

#

you know how often times even with the inpainting model you'll get a sort of "halo" cast around an area where it's obvious where you inpainted? even without a proper inpainting model zero terminal snr models don't do that

#

and they can handle higher denoising strength without hallucinating

soft bone
deft quiver
#

is their an sdxl fork ?

undone bloom
deft quiver
#

very cool, this was something I was hoping would be an architectural choice but I'm glad to see ppl adding it in

trim orbit
#

note to self. read about snr

azure oxide
undone bloom
#

i do plan to release a training base SD1.5 model trained on laion aesthetics 5plus with duplicates and low quality matches removed. probably SD2.1 too, and SDXL if I have to and I can figure out how to cram it on a TPU...

undone bloom
#

it is not going to be easy to do full scale finetunes of SDXL. the best plan I have so is to store the text encoder and VAE on one core and to split the U-net into thirds for TPUv3 training. If that works decently fast it should allow some data parallelism as well.

boreal bough
#

hot damn I missed a lot while sleeping

urban fjord
#

That's why sleeping is bad.

boreal bough
#

emad showing up was not what I was expecting x_x

#

well - while the delay was expected, the general communication we got instead was pretty nice

#

after style prompt release - did they mention/promise anything else?

soft bone
urban fjord
#

Did you catch the LoRA stuff?

boreal bough
#

only that it will be released at a later date

#

or did the get sped up to today as well O:

urban fjord
#

Yeah that was what I meant, that there is something that will be released later.

#

Did you catch the free 4090 to everyone Opera style?

boreal bough
#

I have no idea what this means - but I'm intrigued

urban fjord
#

Just joking

#

But no, I don't think you missed anything.

#

I just hope WaifuXL is good, it's just taking me forever to download as the download stops and I forget to start it again.

fair crow
#

Like how many ppl are actually in this discord and then read and scroll through the sdxl channel

urban fjord
#

It should have been obvious that there was an misunderstanding as there was nothing really planned for today like a stage event.

livid basin
#

While the delay is unfortunate, in a months time no one will care about the delay

naive goblet
#

What does this mean? which model is which?

fair crow
#

We are on a bit of a time crunch because we have to demo our product soon - which is a bit of a pain when the model is non-comercial. I'm sure other ppl banked on it too

trim orbit
#

keeping it blind to not affect biases

hearty ginkgo
#

four images i was able to get last night before i went to bed 1000 steps each upscalled by 4x

eternal fog
#

So I just tested making a LoRA with 4 total pictures. It actually worked pretty well.

azure oxide
eternal fog
soft bone
eternal fog
#

It's done pretty well on the face

eternal fog
urban fjord
#

Looks like my latest LoRA just failed, I see no different when applying it or not and I don't know if I messed up training or if Comfy ui is bugged.

shy kelp
#

hey guys. as I understand it we are down to just a few release candidates as the devs figure out what will ultimately be the 1.0 release?

#

LoRAs for 0.9 may not work with 1.0 right?

fair crow
#

Just a bummer

urban fjord
#

You shouldn't depend on a unreleased open-source project for your commercial product.

shy kelp
#

yeah wait until 1.0

fair crow
#

Keep on thinking that

dusk mica
#

Lets just wait till 1.0

livid basin
#

Could always use SD 2.1

#

Or 1.5

shy kelp
#

release has been pushed back I here. Which is fine by me. I want the best release candidate.

livid basin
#

Cherry pick a few seeds

shy kelp
#

hear not here

eternal fog
#

@soft bone Yeah you should give that a go, this is 4 images that aren't even tagged properly that I ran for 600 steps and whilst it's not perfect it's very very close

trim orbit
#

i've got 0.9. i'll do fine till release

livid basin
#

An amazing thing I've noticed with SDXL, is that it seems to be much better at generating deep blacks and bright whites than SD 1.5 and SD 2.1

undone bloom
#

that's because it's trained on offset noise

boreal bough
eternal fog
livid basin
vast narwhal
livid basin
#

Would certainly screw up inference sampling

undone bloom
azure oxide
#

ya whatever emad was doing here was pretty good for a pure black background

fair crow
#

@boreal bough Maybe we should partner indeed 🤔 Just too short term

shy kelp
#

What I care about is hands and feet. Kinda tired of the monster 6 fingered mutants.

#

Hoping 1.0 addresses that.

livid basin
#

I've thought that maybe they are using a different noise distribution, but a simple offset should not be able to explain it

#

Or at least, they can't change the inference noise distribution

#

In my mind, the only possibility is if they use a different noise distribution during training

#

And it can't be offset

undone bloom
livid basin
#

That's very alarming

undone bloom
#

also the noise distribution during training and inference is already different, it's not pure gaussian during inference after a few steps

livid basin
#

It would mean information leakage in the training

hearty ginkgo
#

so whats the difference between true black and true white?

undone bloom
livid basin
#

Assuming a sufficiently powerful network that is

undone bloom
#

ill dig out the paper hold on a sec

livid basin
#

It should be 0

#

Log SNR can be negative infinity

undone bloom
livid basin
#

hmm

#

one sec

boreal bough
#

I bet this was a huge thing internally XD

#

I'm genuinely surprised they solved it

undone bloom
#

this paper is unfortunately somewhat sparse on details which is part of why people have done very little with it

boreal bough
#

now I just need to know how - so i can replicate it for some of my loras

livid basin
#

Ok, I've read it, and it isn't clear why epsilon-prediction is not possible with 0 snr at last step

soft bone
eternal fog
#

It was 4 images no repeats and 200 epochs

livid basin
urban fjord
#

That sounds a bit too much Arron.

undone bloom
eternal fog
urban fjord
#

Maybe it worked but it's a waste of time most likely.

soft bone
hearty ginkgo
eternal fog
livid basin
#

Yeah that's pretty white

#

@undone bloom Fair, but can you reference any paper where they claim offset noise

#

For SDXL?

undone bloom
hearty ginkgo
#

and this is true black correct?

undone bloom
#

i do believe the offset noise comes from the SDXL report

livid basin
#

Oh wow they do indeed

#

That's a weird choice

raw prairie
undone bloom
#

"finally, we implemented offset noise from the famous paper 'Don't Implement Offset Noise'"

warm hazel
#

Does anyone know if SDXL is only trained on 1024x1024 images?

livid basin
boreal bough
#

the advantage of true white/black is, that you can automate the next steps to turn it transparent, or use it as a background in a word document

livid basin
#

I've never seen a good justification for why offset-noise should work

#

There's hand wavy justification

#

But nothing theoretical

urban fjord
#

SDXL is trained on different aspects ratios.

livid basin
#

And intuitively, offset noise would break the training/inference distributions quite a bit

warm hazel
#

That's good.

boreal bough
#

sdxl offset noise need to be 0, when training in kohya

patent moon
#

I wish the models had an alpha channel to do transparency. Would be really cool for some things.

undone bloom
boreal bough
#

(then it uses the hardcoded values in kohya - which are the original sdxl noise values) 0.5~1.5 tooltip is no longer correct

hearty ginkgo
#

what about

livid basin
warm hazel
#

tigigeririget

boreal bough
hearty ginkgo
livid basin
#

holy shit

shy kelp
hearty ginkgo
boreal bough
raw prairie
undone bloom
#

At least from my experiences testing you really need to do more of a stress test for pure black/white backgrounds. Try to get a white subject on a white background or a black subject on a black background. That'll make any flaws apparent.

urban fjord
#

I don't understand anything, why is this LoRA trained on photos turning images more oil painty...

warm hazel
#

Polar bear in snow

raw prairie
undone bloom
deft quiver
#

if you use the sdxl refiner it does seem to superimpose patterns into the image

hearty ginkgo
raw prairie
#

Strange

undone bloom
#

it does warn you that v-parameterization is for SD 2.0. I simply don't agree with it.

deft quiver
#

yeah that confused me alot

#

lol

#

I thought it was an anachronism

urban fjord
#

Wait, what do v-parameterization do for 1.5 and SDXL?

undone bloom
warm hazel
#

I think it makes your prompts really precise?

deft quiver
#

for 1.5 it does nothing

boreal bough
#

...I didnt actually test emads true white either XD

#

lemme check that real quick if they even truly solved it

hearty ginkgo
livid basin
#

Fewer artifacts

urban fjord
#

I did check Emad's black and it is not perfectly black but blacker than my 0.9 results by a tiny amount.

hearty ginkgo
#

.

warm hazel
#

I've tried a v prediction model and from my experience it only generates what you prompt so you need to be very specific on what you want in the image.

boreal bough
#

I mean its close - but not "true white"

azure oxide
#

found it, here's the response on snr + sdxl

hearty ginkgo
#

@boreal bough can you check

boreal bough
#

off by less than 1%. that's true black

hearty ginkgo
#

now true white

hearty ginkgo
autumn forum
#

Closest I’ve gotten so far but I haven’t really been testing

patent moon
#

Can you just prompt for “white ffffff”

undone bloom
# azure oxide found it, here's the response on snr + sdxl

At least with my experiences... like, most of this is devoid of objective metrics, but my experiences tell me that there are a lot of ways that multiple parts of the SD workflow are actively fighting against the broken noise scheduler and it shows in so many places and usually manifests as hallucinations. Full-resolution inpainting is the most obvious one I know of and I really hope I do get my inpainting training code working to demonstrate it more properly.

deft quiver
#

I'm honestly suprised to see schedulers stick around in diffusion for so long

#

seems to be the source of alot of heartache

boreal bough
autumn forum
boreal bough
#

even on custom trained loras, where the bg is 100% white, and prompt is "white background"
after training its polluted into greyish again

hearty ginkgo
#

this isnt true white but its pretty good.

boreal bough
#

its definitely cool!

livid basin
#

Well, hopefully their engineers will make the right choice

#

Use 0 snr noise instead of offset noise

#

But given that SDXL 0.9 used offset noise, it is a minor disappointment

warm hazel
#

I'm wanting SAI to put more energy into Harmonai but I might be biased...

soft bone
eternal fog
soft bone
#

afaik lemme check

hearty ginkgo
lilac viper
#

Are bots 1 thru 10 using sdxl 1.0 to render or is it still .9?

lilac viper
#

Awesome. Thanks

eternal fog
#

So I just did a stupid test that actually worked resonably well

#

I took some random photo of someone online, it's a crap quality photo too 338x238

#

And created a LORA off the 1 image

#

This is the Original

heady vale
#

votes from the bot gens will determine which of the three 1.0 models is chosen for release

eternal fog
#

That's not bad for a single image

tribal gale
#

generating animals on black background has been a thing for me for a few days now

autumn forum
eternal fog
#

I didn't caption the facial expression, so she's sort of stuck with that expression, but it's worked really well

#

It's also captured the poor quality of the image though

autumn forum
#

i mean the facial expression is all the same

#

oh yeah haha

west breach
#

one of those images you randomly get. you're not sure what it is, but it looks really good 😄

boreal bough
# tribal gale generating animals on black background has been a thing for me for a few days no...

did you manage to get true white to work in any way though?
since that is one aspect I genuinely gave up on.
Perceptable white is easy - but the moment you paste it into a word document, it instantly shows.

While a post processing workflow can automate this, it essentially invalidates my dnd portrait loras, since they rely on true white backgrounds, for seemless intgegration into character sheets for printing

#

even emads true white post wasnt true white - still sepia polluted

eternal fog
eternal fog
#

But I think you could actually make a resonably one from 3 or 4 pictures

tribal gale
#

not sure if it's true white tho

autumn forum
boreal bough
sudden cliff
#

'stunning photograph of an orange landscape with a white tiger, 35mm photograph, professional, 4k, highly detailed'

eternal fog
#

I need to try it with a picture that isn't only 300x200

boreal bough
#

how do I replicate it? XD

urban fjord
#

Will we ever get SD using the alpha channel too for transparent backgrounds?

lament rune
#

Anyone have a good img2img pipeline that works? I'd love to try!

boreal bough
eternal fog
#

Kinda mad how good this can be from a single image

autumn forum
eternal fog
#

As in it's mad

#

Not I'm mad lol

#

works in other aspect ratios as well

autumn forum
#

ohhhh facepalm

urban fjord
#

I tried WaifuXL and I'm glad to see that 0.9 LoRA works on that one, that means there might be a benefit of training LoRA on WaifuXL and run them on 0.9

eternal fog
# soft bone yes

Honestly not sure then, it took me ages to get it working, Would OOM every time until that update.

undone bloom
knotty lotus
#

👀

soft bone
#

its infuriating to get the highest vram consumer gpu available and still not be able to train a little lora

soft bone
#

3090, just the 24gb i mean

boreal bough
shy kelp
#

okay.....I just spent a week's pay to get a used....USED 3090. Because I don't have a million dollars to get a 4090

#

and I won't be able to train with it?

soft bone
#

you will my shits just broken

undone bloom
#

24GB gpus have just barely enough to train LORAs on SDXL

boreal bough
#

lots of settings - where getting even one wrong results in nothing working, so yeah. I dont blame you ^^'

shy kelp
#

SDXL development on civitai is gonna be painfully slow over a number of months.

soft bone
#

everyone will just rent

boreal bough
shy kelp
#

Well i invested in a 3090...at least I will be able to generate images even if I can't train.....

undone bloom
#

general models are going to be a huge pain in the ass to train -- you'll either have to rent a cloud GPU or figure out how to shard the unet

shy kelp
#

okay so a 3090 can be used to train and a 16GB can also train...albeit slower

undone bloom
soft bone
shy kelp
#

Well can a 16GB card train or no?

undone bloom
#

also i'm quite a bit more concerned with FFT

inner ruin
eternal fog
autumn forum
#

anyone got good lora training settings for kohya?

#

for sdxl of course

boreal bough
soft bone
#

json for proof

urban fjord
#

With 24GB VRAM you've got a lot of flexibility and LoRA training should be easy.

shy kelp
#

any training on sdxl is referring to 0.9 correct? Y'all realize that's just for testing? 0.9 models will not be compatible with 1.0.

hearty ginkgo
#

@boreal bough how close is this?

undone bloom
inner ruin
urban fjord
#

I'll wait to see how incompatible they will be.

inner ruin
#

It’s not like the architecture is different

shy kelp
#

ah well okay

soft bone
inner ruin
#

They’re just RLHFing the model

undone bloom
shy kelp
#

great so you're implying a 5090 will be used? Assuming Jensen doubles the vram

vale eagle
#

After some test on WDXL, I think it might be good at 768,768 rather than 1024,1024

boreal bough
shy kelp
#

768 then upscaling after?

undone bloom
#

Guess I better figure out how to shard the damn unet then

urban fjord
#

I get 7 s/it but it doesn't matter when my LoRA dataset is small.

autumn forum
shy kelp
#

A100s? So basically you mean money to pay for renting resources then

boreal bough
#

yep

inner ruin
shy kelp
#

WEll....at least my 3090 will be able to generate SDXL images in somewhat of a timely manner even if it's not the best for training.

boreal bough
#

did the math last time, about 360$ to produce a finetuned version of 0.9 trained on 6.5k raccoon images.
didn't go through with it, since 1.5 equivalent finetune levels can now be achieved with lora.

shy kelp
#

Jensen needs to make a consumer cut down version of a H100.....which I guess is what the 5090 will be.

boreal bough
#

proper finetunes of sdxl should aim high, since a lot is now possible that wasnt possible before. just gonna be expensive T.T

urban fjord
#

With an A100 you can get a lot done in terms of LoRA for the price you pay.

shy kelp
#

What about the H100?

#

Or better yet......two 3090s using nvlink

#

but damn my 1200w psu won't be enough.

livid basin
#

A6000 Ada is cheaper and 48GB

shy kelp
#

what do you mean by "cheaper"?

boreal bough
livid basin
#

Like $7000, very reasonable

boreal bough
#

not sure if we can realistically talk about it

livid basin
#

A6000 Ada is well within the price range of many here I bet

shy kelp
#

oh hell well $7000 is a bargain then in comparison 😄

undone bloom
#

you should in theory be able to train it on two 3090s with NVLink

eternal fog
undone bloom
shy kelp
#

Someone out there will have access to an A6000 I'm sure

undone bloom
#

sharding a unet is an absolutely annoying pain in the ass, it isn't really efficient either, but it will work

shy kelp
boreal bough
#

depends on your goal - but you can set up a runpod with a single A100 if you wanna be serious about training. Its the least expensive option - unless you already have access to high end equipment

vale eagle
#

Same seed, same workflow. WDXL0.9 1024 vs 768

undone bloom
shy kelp
#

what's a runpod?

shy kelp
livid basin
#

Just FYI, the A6000 and the A6000 Ada are two different GPUs

boreal bough
undone bloom
boreal bough
shy kelp
# undone bloom yeah

Used they are going for around $700. It will take months cause I'm working paying other bills before I can swing a second card.

#

lemme check those runpod pricing

#

It makes sense to rent for training unless you have powerful gpus.

undone bloom
#

Also I can't really offer any guarantees on how much moving data across devices is going to slow you down with that. But you'd be doing all-reduce gradient updates if you were renting a multi A100 pod anyways, and with model parallel you only have to move the intermediate tensors I think.

boreal bough
#

how often do you plan on training?
training doesnt take long, you're essentially just paying to a fail a few times until it works properly, then its like 14~21min per lora

shy kelp
#

I'm mostly a user so I guess I will wait to see what civitai community comes up with. But I want to do a personal project which may require training models.

boreal bough
#

then you'll see if runpod is cheaper or not\

#

but most people dont make enough loras where its worth getting a dedicated card for it

shy kelp
#

I just want to finish a project a concept artist promised me he would.....but never did.

#

Figured maybe I could AI to copy his style

#

Aaron Beck is a concept artist known for his robots.

#

He did a piece called: "Robot with Laser" had a very unique style

fluid sedge
#

Hey are you guys voting at all? A/B? Dem/Pub? Red/Blue?

shy kelp
#

I asked him to flesh it out with pics with multiple angles. But he never got around to it.

#

Hoping SD could help me finish his work.

fluid sedge
#

I never know what to do when the prompt isn't followed as much, but picture is better. I mean do you want better pictures that don't follow your prompt?

undone bloom
boreal bough
# shy kelp Figured maybe I could AI to copy his style

A.) offer him more money
B.) learn stable diffusion, learn training, buy expensive equipment, spend 30+ hours learning the tools, then create a model based on the artist - to essentially commit art theft since its specialized to impersonate them, have your image
C.) pay someone on fiverr 15$ to make it for you in the style of the other person

XD

#

all are viable options - just saying that the alternatives are less effort

shy kelp
fluid sedge
#

Thank you Dr. Head I can see why they gave you the degree

shy kelp
#

I want to learn SD but I can't afford expensive equipment except maybe for renting runpod services I gues

fluid sedge
#

Why don't you pay someone like 20 bucks to do your whole project

fluid sedge
#

I just do SD as a hobby just throw me a few bucks and I'll work a dollar an hour. I do it anyway

#

4freee

shy kelp
#

Fiverr you mean those virtual beggars? "will make ai art for tips"

shy kelp
#

Lemme post a pic of what I mean...... one sec.......

boreal bough
heady vale
fluid sedge
#

There's a big enough gap between people that know AI art and people that don't, that you can still get paid

#

For it

boreal bough
#

oh god fiverr gonna be a bad place once sdxl is full release + loras

hearty ginkgo
#

its this true white?

fluid sedge
#

Probably will last fo another year... Yearish? And then I imagine there will be a very small demo of extremely tech-unsavvy elderly people but at that point... There's usually not a ton of that medicare/medicaid money being spent on small business/enterprise

autumn forum
#

yeah i feel like sdxl will start the existents of paid loras or something dumb lol

shy kelp
#

I want multiple pics from different angles of this robot.

boreal bough
autumn forum
#

rgb(247, 245, 248), f7f5f8

shy kelp
#

so I want SD to learn how to make robots in Aaron Beck's style

#

of this specific robot in fact

hearty ginkgo
#

rip

fluid sedge
#

There's not a really a lot of learning. It's not like chess, or a programming language. It's more like learning, idk, ProCreate.

#

There's some little bits here and there that aren't exactly handed on a silver platter but they're really nothing

shy kelp
#

If SD was a human engineer I would ask him to backwards engineer the robot and create detailed schematics from multiple angles.

#

That's what I want SD to do.

livid basin
#

I'm dead serious

shy kelp
#

Yeah I heard of DragGAN. That would be interesting to see how SD might interpret the legs.

#

the arms imply the style of the legs

livid basin
#

There's also dragDiffusion, but as I understand, dragdiffusion doesn't really work

livid basin
shy kelp
#

In my interpretation of that specific robot I picture him around 7 and a half feet tall like a Predator in size.

livid basin
#

Wow, you're in luck, SDXL loves making tall people/robots

#

I'm not even joking

fluid sedge
#

Inpaint can take care of 90% of what these specialty jawns can do, just without the snazzy affect

livid basin
#

SDXL with human bodies/robot bodies loves making them super tall and long

shy kelp
#

Right! Which is why I'm excited for SDXL and why I spent $700 on a used 3090 from a gamer who didn't mine with the card and treated it like a baby.

fluid sedge
#

What've you got RAM wise?

urban fjord
#

Most of the cost of training a single LoRA on a A100 in the cloud is just setting up the environment. Training itself is super fast.

shy kelp
#

64GB ddr4 and a 5800x3d

undone bloom
#

I am going to slap the SDXL VAE on SD 1.5 and train it until it is aligned.

shy kelp
#

i could max out the board to 128GB ram if i have to

boreal bough
livid basin
#

lol

fluid sedge
#

Wow I thought 16GB RAM and 3070 already spoiled me

urban fjord
#

Your racoon LoRA is looking good Caith

livid basin
#

Someone said SDXL 1.0 was better at not making super tall people, what do yall think?

#

because I still see tons of Suuuuper tall people/robots

fluid sedge
#

Not that RAM does a whole lot besides cache'ing models... I hear 0.9xl requires a station wagons worth though (of RAM)

shy kelp
winter raptor
#

SDXL currently specializes in slenderman proportions

shy kelp
urban fjord
#

You can get short robots too if you want.

undone bloom
# shy kelp What will that do?

SDXL's VAE is better at reproducing fine details. So if I train the model with it I'll get an SD 1.5 which can produce better fine details.

#

This comes at the cost of making it incompatible with everything though, I think.

fluid sedge
#

Maybe that's what the people wanted. There's a lot of demand of long-legged people in the art world.