#✨｜sdxl | Stable Diffusion | Page 150

hasty smelt Oct 18, 2023, 7:37 PM

#

I'm a new user, someone more qualified could give a better answer for this, but it seems the VAE pass is when most of the calculation happens. The model seems to pull all its files from the SSD and load everything into RAM at that moment, so more RAM is better. You can see the peak on the graph right where the VAE pass begins.

cyan crown Oct 18, 2023, 7:38 PM

#

hasty smelt I'm a new user, someone more qualified could give a better answer for this, but ...

there is an alternate vae that speeds up the process

#

look for sdxl_vaefp16.safetensors

hasty smelt Oct 18, 2023, 7:40 PM

#

cyan crown there is an alternate vae that speeds up the process

Thanks friend, I don't worry about Vae anymore, after the RAM upgrade I didn't have any more problems with it.

viscid warren Oct 18, 2023, 7:42 PM

#

cyan crown look for sdxl_vaefp16.safetensors

Does it work for dreamshaper too?

cyan crown Oct 18, 2023, 7:42 PM

#

I have 64GB of CPU RAM and 12 GB of GPU RAM and it speeds up the last step a lot

viscid warren Oct 18, 2023, 7:42 PM

#

hasty smelt I'm a new user, someone more qualified could give a better answer for this, but ...

I have 32GB of DDR5 RAM, plus a RAM cleaner that can free up 5-6 GB before a run

cyan crown Oct 18, 2023, 7:42 PM

#

it works for every checkpoint that needs vae

viscid warren Oct 18, 2023, 7:43 PM

#

Sometimes after closing many Firefox tabs my RAM is still used up, and it cleans that up

viscid warren Oct 18, 2023, 7:43 PM

#

cyan crown I have 64GB of CPU RAM and 12 GB of GPU RAM and it speeds up the last step a lot

Ddr5 is CPU ram right?

#

I also have 12gb vram on rtx4070

cyan crown Oct 18, 2023, 7:44 PM

#

4070 me too

#

DDR5 yes

viscid warren Oct 18, 2023, 7:44 PM

#

CPU is amd Ryzen 9 7900 x3d

#

Should I upgrade it to 64gb ram?

viscid warren Oct 18, 2023, 7:45 PM

#

cyan crown 4070 me too

Although I have the dual version idk how much that matters

#

Can overheat easier

cyan crown Oct 18, 2023, 7:46 PM

#

viscid warren Can overheat easier

not for mine

viscid warren Oct 18, 2023, 7:46 PM

#

cyan crown not for mine

You have 3 fans?

cyan crown Oct 18, 2023, 7:46 PM

#

22

viscid warren Oct 18, 2023, 7:46 PM

#

Gpu

cyan crown Oct 18, 2023, 7:46 PM

#

2

viscid warren Oct 18, 2023, 7:47 PM

#

Dual version too ?

cyan crown Oct 18, 2023, 7:47 PM

#

yes

#

66° at 100%

#

celsius

#

75 max

viscid warren Oct 18, 2023, 7:48 PM

#

Nice

#

What CPU you have for it

cyan crown Oct 18, 2023, 7:48 PM

#

5950x

viscid warren Oct 18, 2023, 7:49 PM

#

AMD Ryzen?

cyan crown Oct 18, 2023, 7:49 PM

#

yes

#

Noctua NH-U12A as cooler

raw saffron Oct 18, 2023, 7:50 PM

#

wont downloading models break my pc? each model is like 10gb agony

#

xd

cyan crown Oct 18, 2023, 7:52 PM

#

raw saffron wont downloading models break my pc? each model is like 10gb <:agony:10029611831...

😂 use sdxl base model! it's great for everything

raw saffron Oct 18, 2023, 7:52 PM

#

where can i download it? im so confused with all of these models xd
scared to download the wrong one

viscid warren Oct 18, 2023, 7:53 PM

#

raw saffron where can i download it? im so confused with all of these models xd scared to do...

Stable diffusion art website sdxl

#

Tutorial

cyan crown Oct 18, 2023, 7:54 PM

#

https://huggingface.co/stabilityai

stabilityai (Stability AI)

viscid warren Oct 18, 2023, 7:54 PM

#

They also have link

raw saffron Oct 18, 2023, 7:54 PM

#

ty 🙂

cyan crown Oct 18, 2023, 7:54 PM

#

download sdxl base, sdxl refiner, sdxl vae to start

raw saffron Oct 18, 2023, 7:55 PM

#

how do these 3 differ

cyan crown Oct 18, 2023, 7:56 PM

#

you use all of them together

raw saffron Oct 18, 2023, 7:56 PM

#

oh yeah nvm xD

#

and the ui

cyan crown Oct 18, 2023, 7:57 PM

#

for the UI look at this

#

https://www.youtube.com/watch?v=OVQnYaP9I6g

YouTube

Digital Art Guide

2 Min Install AUTOMATIC1111 for StableDiffusion on Windows Made ...

Welcome to this exciting new video on AI insights! Today, we'll be diving into the topic of installing StableDiffusion on Windows with NVIDIA GPUs.

Instructions :

Workflow for Nvidia Cards

Install Python (https://www.python.org/downloads/release/python-3106/). During installation make sure to Add Python to PATH.
Try on cmd if Py...


real vine Oct 18, 2023, 7:59 PM

#

i want to make people at a meeting table, what kind of techniques should i use?

cyan crown Oct 18, 2023, 7:59 PM

#

it's the same for 1.5 models and sdxl

cyan crown Oct 18, 2023, 7:59 PM

#

real vine i want to make people at a meeting table, what kind of techniques should i use?

a photo or a painting?

raw saffron Oct 18, 2023, 8:00 PM

#

cyan crown https://www.youtube.com/watch?v=OVQnYaP9I6g

yeah im actually just watching this xD

#

ty

real vine Oct 18, 2023, 8:01 PM

#

i'm aiming for photorealism

#

one person is easy, but with a group of people the results get weird

raw saffron Oct 18, 2023, 8:02 PM

#

wait is there a difference between sdxl base 1.0 and xl base 1.0

#

i think its the same?

cyan crown Oct 18, 2023, 8:03 PM

#

yes

raw saffron Oct 18, 2023, 8:03 PM

#

cyan crown download sdxl base, sdxl refiner, sdxl vae to start

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0

thats the correct model right?

stabilityai/stable-diffusion-xl-base-1.0 · Hugging Face

real vine Oct 18, 2023, 8:03 PM

#

@cyan crown can we make realistic faces with embeddings? are they meant for this kind of thing?

raw saffron Oct 18, 2023, 8:04 PM

#

raw saffron https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0 thats the corre...

is this model restricted?

#

and does it let me use characters? dalle3 doesnt allow that

#

so u have to trick it by saying "an image of someone familiar to... "

cyan crown Oct 18, 2023, 8:05 PM

#

dalle 3 is another thing......

raw saffron Oct 18, 2023, 8:05 PM

#

yeah ik

#

but is it restricted like dalle or na?

crisp owl Oct 18, 2023, 8:06 PM

#

Not if you run it locally. If you're running on a Colab or something, the person hosting it may or may not restrict things

cyan crown Oct 18, 2023, 8:06 PM

#

what you mean with restricted?

jade hill Oct 18, 2023, 8:08 PM

#

is there a sampling method considered as the "base one" for XL ?

cyan crown Oct 18, 2023, 8:09 PM

#

no

#

I use DPM++ 2M Karras

jade hill Oct 18, 2023, 8:09 PM

#

ok, thanks 🙂

raw saffron Oct 18, 2023, 8:10 PM

#

cyan crown what you mean with restricted?

blood for example
or celebrities

cyan crown Oct 18, 2023, 8:11 PM

#

#

#

ok?

lusty wolf Oct 18, 2023, 8:13 PM

#

Peeking in...

raw saffron Oct 18, 2023, 8:14 PM

#

ah ok ty xD

#

whats the difference between the base and the base vae

#

there are 2 files for the base model, the same except that one of them has vae in its name

crisp owl Oct 18, 2023, 8:17 PM

#

0.9 is the good one. 1.0 is borked

cyan crown Oct 18, 2023, 8:17 PM

#

look at the picture i posted before

crisp owl Oct 18, 2023, 8:17 PM

#

There's a base model released now specifically with the 0.9 VAE baked into it

raw saffron Oct 18, 2023, 8:18 PM

#

im confused

crisp owl Oct 18, 2023, 8:19 PM

#

https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/blob/main/sd_xl_base_1.0_0.9vae.safetensors

sd_xl_base_1.0_0.9vae.safetensors · stabilityai/stable-diffusion-xl...

cyan crown Oct 18, 2023, 8:19 PM

#

look at the picture!

#

you have 3 files to use

crisp owl Oct 18, 2023, 8:19 PM

#

1.0 base model with 0.9 vae is good

cyan crown Oct 18, 2023, 8:19 PM

#

raw saffron Oct 18, 2023, 8:19 PM

#

oh this

raw saffron Oct 18, 2023, 8:20 PM

#

cyan crown

oh so u need both of them
whats the "vae" for exactly? what does it do

crisp owl Oct 18, 2023, 8:20 PM

#

decodes latent space to pixel space
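To put numbers on that answer, here is a rough sketch of the shape bookkeeping around the decode step. It assumes the commonly cited Stable Diffusion VAE layout (4 latent channels, 8x spatial downscale); the function names are made up for illustration, and nothing here actually decodes an image.

```python
# Sketch only: shape arithmetic for a VAE decode, not a real decoder.
# Assumption: 4 latent channels and an 8x spatial scale factor.
LATENT_CHANNELS = 4
SCALE_FACTOR = 8


def latent_shape_for(width, height):
    """Latent (channels, height, width) the sampler works on for a given output size."""
    if width % SCALE_FACTOR or height % SCALE_FACTOR:
        raise ValueError("width and height must be multiples of 8")
    return (LATENT_CHANNELS, height // SCALE_FACTOR, width // SCALE_FACTOR)


def decoded_shape(latent_shape):
    """RGB (channels, height, width) the VAE produces from a latent of this shape."""
    channels, height, width = latent_shape
    if channels != LATENT_CHANNELS:
        raise ValueError(f"expected {LATENT_CHANNELS} latent channels, got {channels}")
    return (3, height * SCALE_FACTOR, width * SCALE_FACTOR)


if __name__ == "__main__":
    latent = latent_shape_for(1024, 1024)
    print("latent:", latent)
    print("image: ", decoded_shape(latent))
```

Under these assumptions the sampler works on a much smaller tensor than the final image, and the VAE decode at the very end is the step that expands it back to full resolution.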

cyan crown Oct 18, 2023, 8:20 PM

#

decode

raw saffron Oct 18, 2023, 8:21 PM

#

ah ok ty 🙂

shy kelp Oct 18, 2023, 8:21 PM

#

i recommend 1.5 before sdxl

#

at least to know how things work

nimble heart Oct 18, 2023, 8:22 PM

#

starting with xl is fine just dont use the refiner

raw saffron Oct 18, 2023, 8:23 PM

#

whats the refiner?

shy kelp Oct 18, 2023, 8:23 PM

#

mans is going to hit generate and wait 30 minutes for 1 image lol
start with 1.5

raw saffron Oct 18, 2023, 8:24 PM

#

why though?

cyan crown Oct 18, 2023, 8:24 PM

#

why 30min?

#

what is your gpu?

shy kelp Oct 18, 2023, 8:24 PM

#

exaggerated

nimble heart Oct 18, 2023, 8:24 PM

#

XL makes an image in like <10 seconds with the right settings

shy kelp Oct 18, 2023, 8:25 PM

#

and 1.5 is <5

crisp owl Oct 18, 2023, 8:25 PM

#

impatience nowadays

nimble heart Oct 18, 2023, 8:25 PM

#

why would you need an image every 2 seconds

#

XL produces substantially better quality

#

I understand if you're vram constrained and XL takes like 50x as long due to offloading but otherwise idk why you'd still use 1.5 outside of those animate tools

cyan crown Oct 18, 2023, 8:27 PM

#

and qr monster 😄

shy kelp Oct 18, 2023, 8:27 PM

#

people still doing qr stuff? lol

nimble heart Oct 18, 2023, 8:28 PM

#

it shouldn't be hard to train a qr monster for XL im surprised it hasnt been done

shy kelp Oct 18, 2023, 8:28 PM

#

there is one

nimble heart Oct 18, 2023, 8:28 PM

#

everyone's too focused on waifus

crisp owl Oct 18, 2023, 8:28 PM

#

you can use it to do other stuff than just QR codes

shy kelp Oct 18, 2023, 8:28 PM

#

and it works fine

nimble heart Oct 18, 2023, 8:28 PM

#

oh? is it good?

crisp owl Oct 18, 2023, 8:28 PM

#

And the official qr monster team is working on an sdxl model

cyan crown Oct 18, 2023, 8:28 PM

#

shy kelp people still doing qr stuff? lol

sometimes

rustic garnet Oct 18, 2023, 8:29 PM

#

shy kelp and it works fine

really? I thought the one released was bad

nimble heart Oct 18, 2023, 8:29 PM

#

I feel like we're getting pretty far from QR codes lol

rustic garnet Oct 18, 2023, 8:29 PM

#

yeah, but its great

#

you still see the mona lisa but only from looking at the image from far away

shy kelp Oct 18, 2023, 8:30 PM

#

pretty sure this is xl, i forget

cyan crown Oct 18, 2023, 8:30 PM

#

rustic garnet you still see the mona lisa but only from looking at the image from far away

yes that is the fun fact

kind pendant Oct 18, 2023, 8:32 PM

#

where do you guys get your models from?

shy kelp Oct 18, 2023, 8:32 PM

#

custom

crisp owl Oct 18, 2023, 8:32 PM

#

civitai mostly

cyan crown Oct 18, 2023, 8:32 PM

#

with SDXL you can have almost any style with base model

kind pendant Oct 18, 2023, 8:33 PM

#

ah.. thank you ^^

kind pendant Oct 18, 2023, 8:33 PM

#

cyan crown with SDXL you can have almost any style with base model

löl

crisp owl Oct 18, 2023, 8:34 PM

#

rustic garnet Oct 18, 2023, 8:35 PM

#

kind pendant löl

it's true. Base model is already very good. Custom models are not better in general, only for very limited styles

#

I would always try base model first before downloading custom models

cyan crown Oct 18, 2023, 8:36 PM

#

I'm trying all the styles of base model

shy kelp Oct 18, 2023, 8:36 PM

#

XL, no Loras

cyan crown Oct 18, 2023, 8:36 PM

#

there are thousands

kind pendant Oct 18, 2023, 8:37 PM

#

rustic garnet I would always try base model first before downloading custom models

the base model is the skeleton that barely follows the prompt at all... civitai is a treasure trove of awesome ^^ thx

rustic garnet Oct 18, 2023, 8:37 PM

#

that might be the case for 1.5

#

for SDXL I cannot agree on that

shy kelp Oct 18, 2023, 8:38 PM

#

kind pendant the base model is the skeleton that barely follows the prompt at all... civitai ...

i wouldn't say treasure trove of awesome

cyan crown Oct 18, 2023, 8:38 PM

#

kind pendant Oct 18, 2023, 8:38 PM

#

cyan crown

uh.. nais

hasty smelt Oct 18, 2023, 8:42 PM

#

guys, I'm following the tutorial to use --medvram-sdxl, but I can't find the file shown in the video (webui-user) in my folder to open in Notepad. Does anyone know what I should do? Thanks

crisp owl Oct 18, 2023, 8:43 PM

#

scroll down more?

shy kelp Oct 18, 2023, 8:43 PM

#

lol

cyan crown Oct 18, 2023, 8:43 PM

#

😄

hasty smelt Oct 18, 2023, 8:44 PM

#

crisp owl scroll down more?

I have it, but it's .sh, I can't edit it. 🥺

crisp owl Oct 18, 2023, 8:45 PM

#

.bat

cyan crown Oct 18, 2023, 8:45 PM

#

the .bat

hasty smelt Oct 18, 2023, 8:46 PM

#

habby

#

thanks guys

steady grove Oct 18, 2023, 8:46 PM

#

you can edit a .sh if you just drag and drop it into notepad or another editor. regardless, you'll want the .bat because .sh is a linux shell script

#

god speed pickle rick

crisp owl Oct 18, 2023, 8:46 PM

#

cyan crown Oct 18, 2023, 8:47 PM

#

steady grove Oct 18, 2023, 8:47 PM

#

crisp owl

reminds me of that weird final season of american gods, well, i mean, they were all weird i guess

hasty smelt Oct 18, 2023, 8:47 PM

#

steady grove you can edit a .sh if you just drag and drop it into notepad or another editor. ...

waow thanks

cyan crown Oct 18, 2023, 8:48 PM

#

crisp owl Oct 18, 2023, 8:48 PM

#

steady grove reminds me of that weird final season of american gods, well, i mean, they wer...

Just missing Jormungandr unfortunately, otherwise this would have been perfect
cinematic photograph, space view of earth with jormungandr serpent slithering under the entire globes oceans, massive yggdrasil tree distant in space with nine realms connected to its branches

#

it keeps trying to make the branches made out of jormungandr

shy kelp Oct 18, 2023, 8:51 PM

#

(jormungandr serpent:1) -> 1.4
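The suggestion above uses the `(text:weight)` attention syntax from A1111-style prompts. A small sketch of generating a range of weights for one term to compare side by side; the helper names and the 0.1 step size are assumptions for illustration, not anything from the thread.

```python
# Sketch: build "(term:weight)" prompt fragments for a quick weight comparison.
# Helper names and the step size are hypothetical.

def weighted(term, weight):
    """Format a prompt term with an explicit attention weight, e.g. (term:1.4)."""
    return f"({term}:{weight:g})"


def weight_sweep(term, start=1.0, stop=1.4, step=0.1):
    """Return the same term at each weight from start to stop inclusive."""
    count = int(round((stop - start) / step)) + 1
    return [weighted(term, round(start + i * step, 2)) for i in range(count)]


if __name__ == "__main__":
    for fragment in weight_sweep("jormungandr serpent"):
        print(fragment)
```

Each fragment can be dropped into the full prompt in place of the plain term, keeping seed and other settings fixed so only the weight changes.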

crisp owl Oct 18, 2023, 8:52 PM

#

Yeah testing weights currently

I just process batches of 5 to go through a full process and do work on the side while waiting for em to finish

#

~200 seconds for a full process including upscale

raw saffron Oct 18, 2023, 8:57 PM

#

is there any way to use sdxl on my phone? like maybe somehow run it on some kind of server

shy kelp Oct 18, 2023, 8:58 PM

#

like your server or from a host?

#

because if it's a host, you're going to be paying a lot

south horizon Oct 18, 2023, 8:58 PM

#

colab is free

shy kelp Oct 18, 2023, 8:59 PM

#

for xl?

#

lol

cyan crown Oct 18, 2023, 8:59 PM

#

for xl PC or pay

south horizon Oct 18, 2023, 8:59 PM

#

mm I dunno, never tried it with xl

shy kelp Oct 18, 2023, 8:59 PM

#

then why say it's free?

south horizon Oct 18, 2023, 8:59 PM

#

things like runpod aren't that expensive are they?

shy kelp Oct 18, 2023, 8:59 PM

#

for XL it will be, 1.5, whatever

cyan crown Oct 18, 2023, 8:59 PM

#

since it's like a drug....yes

crisp owl Oct 18, 2023, 9:01 PM

#

I've seen people using --listen 0.0.0.0 when they talk about using from a phone, but that's about all I know. No clue about the specifics.

cyan crown Oct 18, 2023, 9:01 PM

#

crisp owl I've seen people using --listen 0.0.0.0 when they talk about using from a phone,...

you can use on your LAN without any trick

#

adding --listen to webui-user

#

I tried and it works fine

crisp owl Oct 18, 2023, 9:05 PM

#

No Yggdrasil here, but still kinda neat

shy kelp Oct 18, 2023, 9:05 PM

#

that requires your pc to be on the same network though right?

cyan crown Oct 18, 2023, 9:06 PM

#

shy kelp that requires your pc to be on the same network though right?

yes

#

or you can do port forwarding on your router and use something like duckdns to reach outside

shy kelp Oct 18, 2023, 9:09 PM

#

my days of port forwarding on my own pc are long over lol

cyan crown Oct 18, 2023, 9:11 PM

#

well to reach your sd interface from outside your LAN you need to do port forwarding

#

on the router

fallow prism Oct 18, 2023, 9:13 PM

#

What about share=true and using the gradio url?

shy kelp Oct 18, 2023, 9:13 PM

#

people are too trusting, i just vpn or disconnect the internet altogether

cyan crown Oct 18, 2023, 9:14 PM

#

yes VPN is the best solution

#

because you don't risk people using exploits of the interface

olive perch Oct 18, 2023, 9:36 PM

#

OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB (GPU 0; 2.00 GiB total capacity; 1.61 GiB already allocated; 0 bytes free; 1.65 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

hey i keep getting this error when trying to generate using the base sdxl model or just the 1.5

#

what can i do about it?
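The numbers in that error already say a lot. A minimal sketch that pulls them out of the message text, assuming the message format quoted above (the wording can differ between PyTorch versions, so the patterns are an assumption):

```python
import re

# The error text as quoted in the message above.
ERROR = (
    "OutOfMemoryError: CUDA out of memory. Tried to allocate 20.00 MiB "
    "(GPU 0; 2.00 GiB total capacity; 1.61 GiB already allocated; 0 bytes free; "
    "1.65 GiB reserved in total by PyTorch)"
)

UNITS = {"bytes": 1, "KiB": 2**10, "MiB": 2**20, "GiB": 2**30}
SIZE = r"([\d.]+) (bytes|KiB|MiB|GiB)"


def parse_oom(message):
    """Extract the sizes from a CUDA out-of-memory message, in bytes."""
    def grab(pattern):
        match = re.search(pattern, message)
        if match is None:
            raise ValueError(f"pattern not found: {pattern}")
        return float(match.group(1)) * UNITS[match.group(2)]

    return {
        "requested": grab(r"Tried to allocate " + SIZE),
        "total": grab(SIZE + r" total capacity"),
        "allocated": grab(SIZE + r" already allocated"),
        "free": grab(SIZE + r" free"),
        "reserved": grab(SIZE + r" reserved"),
    }


if __name__ == "__main__":
    info = parse_oom(ERROR)
    for name, value in info.items():
        print(f"{name:>9}: {value / 2**30:.2f} GiB")
```

Here the message reports 2.00 GiB of total capacity for the device PyTorch is using, with nothing free, which is worth checking against what the GPU is supposed to have before tuning any flags.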

crisp owl Oct 18, 2023, 9:37 PM

#

If in A1111, make sure you have optimization flags set in your webui-user.bat file

olive perch Oct 18, 2023, 9:39 PM

#

whats A1111?

#

oh

#

what optimization flags?

crisp owl Oct 18, 2023, 9:41 PM

#

When I still used A1111, these were my settings.

TBH not sure if you still need the first two -- flags, but guessing you don't have a super beefy GPU, so at least keep the --medvram

set COMMANDLINE_ARGS= --opt-sdp-attention --no-half-vae --medvram

olive perch Oct 18, 2023, 9:42 PM

#

ill try ty

#

ill add the --medvram only, if it wont work ill just add the others

crisp owl Oct 18, 2023, 9:43 PM

#

the alternative to the first one, --opt-sdp-attention, is --xformers
So that's another you can try also.

#

If you want some light reading
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings

GitHub

Command Line Arguments and Settings

Stable Diffusion web UI. Contribute to AUTOMATIC1111/stable-diffusion-webui development by creating an account on GitHub.

olive perch Oct 18, 2023, 9:44 PM

#

oh ty

olive perch Oct 18, 2023, 9:46 PM

#

crisp owl When I still used A1111, these were my settings. TBH not sure if you still need t...

do i need to restart the entire program after adding a parameter?

crisp owl Oct 18, 2023, 9:46 PM

#

yah

olive perch Oct 18, 2023, 9:46 PM

#

okki

#

still get the error after adding the medvram parameter oof

crisp owl Oct 18, 2023, 9:49 PM

#

What size image are you trying?

hasty smelt Oct 18, 2023, 9:50 PM

#

What do you think is the best to use?

olive perch Oct 18, 2023, 9:51 PM

#

crisp owl What size image are you trying?

512x512 ;-;

#

crisp owl Oct 18, 2023, 9:52 PM

#

hasty smelt What do you think is the best to use?

I use currently dpmpp_3m_sde_gpu/exponential for my main processing, and either ddim/ddim_uniform or dpm_adaptive/Karras for my facefix node

crisp owl Oct 18, 2023, 9:52 PM

#

olive perch 512x512 ;-;

What gpu/how much vram?

olive perch Oct 18, 2023, 9:52 PM

#

how do i check that

hasty smelt Oct 18, 2023, 9:53 PM

#

crisp owl I use currently dpmpp_3m_sde_gpu/exponential for my main processing, and either...

Thanks buddy

crisp owl Oct 18, 2023, 9:55 PM

#

olive perch how do i check that

olive perch Oct 18, 2023, 9:56 PM

#

0.5GB/4

#

i mean im on a laptop rn though.. xd

#

didnt expect the images to generate quickly, but didnt expect them to not generate at all either

#

is 4gb not enough?

viscid warren Oct 18, 2023, 9:57 PM

#

olive perch is 4gb not enough?

Pretty low

glad grove Oct 18, 2023, 9:57 PM

#

for 1.5 it's barely enough, for sdxl no, especially on a laptop gpu

crisp owl Oct 18, 2023, 9:57 PM

#

so 4....that's probably not gonna work in A1111 at least for SDXL.
Maybe you can squeeze by with 1.5 models in A1111.

Comfyui is more memory friendly, so perhaps you'd have better luck with it, but, it's more complex also

olive perch Oct 18, 2023, 9:58 PM

#

i see

#

ty

glad grove Oct 18, 2023, 9:58 PM

#

post the name of the gpu here

crisp owl Oct 18, 2023, 9:58 PM

#

same screen I posted, top right name

olive perch Oct 18, 2023, 9:59 PM

#

idk the full name, its from intel though

glad grove Oct 18, 2023, 10:00 PM

#

if its intel iris its not good enough for ai

olive perch Oct 18, 2023, 10:00 PM

#

i see

#

ill try on my pc when i get back home

#

was curious how it would be on my laptop

#

ty

crisp owl Oct 18, 2023, 10:01 PM

#

super slow if you set everything up perfectly lol

#

tiled processing, low vram flag, a bunch of dumped stuff to CPU processing...it would be slow.

#

probably possible, but not worth it likely

hasty smelt Oct 18, 2023, 11:09 PM

#

crisp owl I use currently dpmpp_3m_sde_gpu/exponential for my main processing, and either...

Where can I download dpmpp_3m_sde?

half cedar Oct 18, 2023, 11:10 PM

#

You cant handle that sampler

#

#

spring fulcrum Oct 19, 2023, 12:06 AM

#

Has anyone found a way to use the GPT 4 idea2img in comfyui?

maiden gale Oct 19, 2023, 12:08 AM

#

I'm trying to batch process images in Comfy. I have a directory with images in name order, does anyone know of any custom node that has this feature?

I tried WAS suite, but it doesnt pull the images in name order.

crisp owl Oct 19, 2023, 12:52 AM

#

https://github.com/ltdrdata/ComfyUI-Inspire-Pack
Should have that ability, I haven't used it, so you'd have to read through the github and see what it requires

GitHub

GitHub - ltdrdata/ComfyUI-Inspire-Pack: This repository offers vari...

This repository offers various extension nodes for ComfyUI. Nodes here have different characteristics compared to those in the ComfyUI Impact Pack. The Impact Pack has become too large now... - Git...

maiden gale Oct 19, 2023, 1:02 AM

#

crisp owl https://github.com/ltdrdata/ComfyUI-Inspire-Pack Should have that ability, I hav...

It has it, but it just wont load in the name order

#

It has some random order, couldnt figure out what it is

#

I'm trying to batch prompt them at the same time so the order will matter

maiden gale Oct 19, 2023, 1:21 AM

#

I've found a solution by changing the script with the help of chatgpt, what a time we live in lol

#

thanks @crisp owl , I changed the load order used in the WAS suite loader
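The actual change made to the loader isn't shown in the thread. As a general illustration only, a name-ordered load usually comes down to sorting the file list with a "natural" key, so that `img2.png` comes before `img10.png`. The function names below are hypothetical and not part of WAS suite or any other node pack.

```python
import re
from pathlib import Path


def natural_key(name):
    """Split a filename into text and number chunks so 'img2' sorts before 'img10'."""
    # re.split with a capturing group alternates text, digits, text, ...
    # so strings are only ever compared with strings and ints with ints.
    return [int(part) if part.isdigit() else part.lower()
            for part in re.split(r"(\d+)", name)]


def load_order(folder, extensions=(".png", ".jpg", ".jpeg", ".webp")):
    """Return image paths in a folder, sorted by natural filename order."""
    files = [p for p in Path(folder).iterdir()
             if p.is_file() and p.suffix.lower() in extensions]
    return sorted(files, key=lambda p: natural_key(p.name))


if __name__ == "__main__":
    names = ["frame10.png", "frame2.png", "frame1.png", "Frame3.png"]
    print(sorted(names))                   # plain string sort
    print(sorted(names, key=natural_key))  # natural order
```

A plain `sorted()` or an unsorted directory listing is a common reason a batch comes out in an unexpected order when prompts are paired with images by position.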

crisp owl Oct 19, 2023, 1:38 AM

#

maiden gale I've found a solution by changing the script with the help of chatgpt, what a ti...

Nice, yeah I've used it for some crafty code changes at times.
Helpful for some things for sure

static berry Oct 19, 2023, 1:47 AM

#

ivory blaze Oct 19, 2023, 2:30 AM

#

ChatGPT is catching up to SD!

#

like for real LOL

#

I doubt I can decode that, no extended characters... but lol. I guess it thinks since I can supply images now it can make them some how

nimble heart Oct 19, 2023, 2:41 AM

#

copy paste the data into an html file so your browser decodes it for you

#

looks like a valid png header so it might actually work
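A quick way to check the "valid png header" part without a browser, assuming the pasted data is base64 text (the thread doesn't show it, so that is an assumption). The sketch compares the first decoded bytes against the fixed 8-byte PNG signature and also builds the one-line HTML for the browser route suggested above.

```python
import base64

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"  # the first 8 bytes of every PNG file


def looks_like_png(b64_text):
    """Return True if base64 text decodes to data starting with the PNG signature."""
    cleaned = "".join(b64_text.split())  # drop newlines and spaces from the paste
    prefix = cleaned[:16]
    prefix = prefix[: len(prefix) - len(prefix) % 4]  # base64 decodes in groups of 4
    try:
        head = base64.b64decode(prefix)
    except ValueError:
        return False
    return head.startswith(PNG_SIGNATURE)


def to_data_uri_html(b64_text):
    """Wrap base64 PNG data in an <img> tag so a browser will try to render it."""
    cleaned = "".join(b64_text.split())
    return f'<img src="data:image/png;base64,{cleaned}">'


if __name__ == "__main__":
    sample = base64.b64encode(PNG_SIGNATURE + b"not a real image").decode()
    print(looks_like_png(sample))
    print(to_data_uri_html(sample))
```

A matching signature only says the data starts like a PNG. The chunks that follow carry lengths and CRC checksums, so a model writing bytes from memory can still easily produce a file that viewers reject.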

static berry Oct 19, 2023, 3:01 AM

#

half cedar Oct 19, 2023, 3:21 AM

#

That would be my luck. Escaping an inferno in a Subaru wagon

indigo carbon Oct 19, 2023, 3:30 AM

#

do you see it?

crisp owl Oct 19, 2023, 3:30 AM

#

Frey's scalable ship

crisp owl Oct 19, 2023, 3:30 AM

#

indigo carbon do you see it?

https://tenor.com/view/troll-troll-face-gif-25116980

Tenor

indigo carbon Oct 19, 2023, 3:30 AM

#

I've hid something here

indigo carbon Oct 19, 2023, 3:34 AM

#

ivory blaze like for real LOL

that's more similar to how pixel diffusion works actually, it diffuses each pixel individually instead of diffusing the latents

#

I doubt it will make anything by writing PNG data, but it might be trying to color each pixel individually

#

very dumb approach to generating images though, but it'd be interesting to see an LLM being able to actually create a PNG

half cedar Oct 19, 2023, 3:38 AM

#

indigo carbon I've hid something here

Terminator 2 villain?

indigo carbon Oct 19, 2023, 3:40 AM

#

half cedar Terminator 2 villain?

nope, is it really too well hidden?

crisp owl Oct 19, 2023, 3:40 AM

#

ah

#

now I see

indigo carbon Oct 19, 2023, 3:40 AM

#

I can see it, but that's probably because I see the origin

#

I've hidden a word this time

#

it's really COOL that it's possible with SDXL now, but it's not quite as good as QR monster imo

crisp owl Oct 19, 2023, 3:47 AM

#

Theirs is quite good, wonder how long until they release their sdxl version

indigo carbon Oct 19, 2023, 3:47 AM

#

the one I'm using is also somewhat decent, but it really struggles when the inputs have more complex shapes

#

it also deteriorates quality way more than QR monster used to

crisp owl Oct 19, 2023, 3:48 AM

#

indigo carbon Oct 19, 2023, 3:49 AM

#

crisp owl

I'm assuming he's talking about quality deterioration? because that's the only flaw I see with the one I'm using (which isn't QRmonster)

crisp owl Oct 19, 2023, 3:50 AM

#

No clue, I can't make assumptions on that

indigo carbon Oct 19, 2023, 3:50 AM

#

link to that post? I'd keep track of that

crisp owl Oct 19, 2023, 3:51 AM

#

in this discussion
https://huggingface.co/monster-labs/control_v1p_sd15_qrcode_monster/discussions/63

monster-labs/control_v1p_sd15_qrcode_monster · SDXL

hoary saddle Oct 19, 2023, 5:11 AM

#

indigo carbon do you see it?

sneaky....

#

indigo carbon Oct 19, 2023, 5:14 AM

#

hoary saddle

oh, that's actually a model I already released a week or so ago. I just use it as a base model in my workflows

hoary saddle Oct 19, 2023, 5:15 AM

#

i gotta figure out how to train i guess, 2 4090 24G cards and i'm rendering 1024x1024 jpgs....boring

#

too lazy to watch a 2 hour tutorial on youtube

crisp owl Oct 19, 2023, 5:16 AM

#

I'll borrow one

hoary saddle Oct 19, 2023, 5:16 AM

#

will email it over to ya 😉

crisp owl Oct 19, 2023, 5:16 AM

#

Sweet, looking forward to downloading it!

indigo carbon Oct 19, 2023, 5:16 AM

#

hoary saddle i gotta figure out how to train i guess, 2 4090 24G cards and i'm rendering 1024x...

training isn't exactly what I did to make that. I calculated CLiP and UNET ratios to create a great (imo) model
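The exact method is in the model description, which isn't reproduced in this log. As a generic illustration of what "separate CLIP and UNet ratios" can mean in a checkpoint merge, here is a sketch of a weighted-sum merge that uses one ratio for UNet weights and another for text-encoder weights. Plain floats stand in for tensors, and the key prefixes are the ones commonly seen in Stable Diffusion checkpoints, taken here as an assumption.

```python
# Sketch of a weighted-sum merge with separate ratios per component.
# Assumptions: key prefixes below, and floats standing in for weight tensors.
UNET_PREFIX = "model.diffusion_model."
TEXT_ENCODER_PREFIXES = ("conditioner.embedders.", "cond_stage_model.")


def merge(state_a, state_b, unet_ratio, clip_ratio):
    """Blend B into A: ratio 0 keeps A, ratio 1 takes B. Other keys are kept from A."""
    merged = {}
    for key, value_a in state_a.items():
        if key.startswith(UNET_PREFIX):
            ratio = unet_ratio
        elif key.startswith(TEXT_ENCODER_PREFIXES):
            ratio = clip_ratio
        else:
            ratio = 0.0  # e.g. VAE weights: leave model A untouched
        if key not in state_b:
            merged[key] = value_a
            continue
        merged[key] = (1 - ratio) * value_a + ratio * state_b[key]
    return merged


if __name__ == "__main__":
    a = {"model.diffusion_model.w": 0.0, "conditioner.embedders.0.w": 0.0, "first_stage_model.w": 1.0}
    b = {"model.diffusion_model.w": 1.0, "conditioner.embedders.0.w": 1.0, "first_stage_model.w": 5.0}
    print(merge(a, b, unet_ratio=0.5, clip_ratio=0.25))
```

Keeping the two ratios independent lets the text encoder stay closer to one parent while the UNet leans toward the other, which is one plausible reading of the approach described.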

hoary saddle Oct 19, 2023, 5:17 AM

#

indigo carbon Oct 19, 2023, 5:17 AM

#

indigo carbon training isn't exactly what I did to make that. I calculated CLiP and UNET ratio...

I explained exactly what I did in that model's description

hoary saddle Oct 19, 2023, 5:17 AM

#

indigo carbon training isn't exactly what I did to make that. I calculated CLiP and UNET ratio...

interesting

#

#

that works great tho if you play around with the cnet settings

indigo carbon Oct 19, 2023, 5:30 AM

#

hoary saddle

huh, yeah, that's better than what I usually get with the available QR controlnet model.. what'd you do here?

#

this is what I'm currently working with. I was thinking the only thing limiting this from getting this better is the controlnet model?

uncut fiber Oct 19, 2023, 5:35 AM

#

how about white subject and black background, inverted image?
in controlnet it is definitely better.

indigo carbon Oct 19, 2023, 5:36 AM

#

uncut fiber how about white subject and black background, inverted image? in controlnet it i...

the QR cnet is trained on barcodes and stickers, those have white background and black lines/whatever

#

I tried the other way, it wasn't as good

uncut fiber Oct 19, 2023, 5:37 AM

#

o.k.

indigo carbon Oct 19, 2023, 5:38 AM

#

but idk, I feel when you're trying to control SDXL too much it loses the ability to make stuff like this

#

again, a flaw caused by CLiP

#

CLiP never likes being controlled too much, it's just really good at the things it's intended to do. CLiP is SD's bottleneck imo, that's the only thing limiting complex multimodal inferences that won't cause quality degradation

analog roost Oct 19, 2023, 5:51 AM

#

Hi guys, has anyone noticed weird color issues with dynavisionXL after installing and enabling the TensorRT extension and switching to the dev branch of Automatic1111?

#

More than half of the images generated now have the issue in my case

#

In some cases to kind of extremes like this

#

Even more ridiculous 😅

hoary saddle Oct 19, 2023, 5:58 AM

#

indigo carbon huh, yeah, that's better than what I usually get with the available QR controln...

just bumped cnet up to 1.2 and denoise on the first sampler to about 90

#

analog roost Oct 19, 2023, 5:59 AM

#

Though admittedly some images generated like this are cool like this skirt

finite thunder Oct 19, 2023, 6:26 AM

#

Hi all, is there a way to check the list of artists' images that are included in the dataset and haven't opted out?

crisp owl Oct 19, 2023, 6:28 AM

#

not quite, but you can find lots of resources from people who've done that and compiled the results on their own sites
https://weirdwonderfulai.art/resources/stable-diffusion-xl-sdxl-artist-study/

Weird Wonderful AI Art

Harmeet G

Stable Diffusion XL (SDXL) Artist Study

finite thunder Oct 19, 2023, 6:41 AM

#

interesting! thanks for sharing.

uncut fiber Oct 19, 2023, 7:28 AM

#

https://rikkar69.github.io/SDXL-artist-study/
or here

SDXL 1.0 Artistic Studies

Artist Studies

Latent exploration with SDXL

indigo carbon Oct 19, 2023, 8:40 AM

#

interestingly, it will always perform the same styles if you make up artist names

#

I used a made up name once, then did it again with an entirely different prompt that has the made up name; both images had a similar style

#

not sure if this is a good example:

#

"by jeff fuctional"

#

"bucket of water, by jeff fuctional"

#

similar color accents; but not as persistent with the made up style as I remember it being

icy brook Oct 19, 2023, 9:02 AM

#

solar merlin Oct 19, 2023, 9:38 AM

#

I keep getting red spotting visuals on my texture generations using SDXL. I have tried using the 0.9 VAE and different schedulers but they all still seem to produce this type of distortion, any ideas?

uncut fiber Oct 19, 2023, 9:40 AM

#

There used to be a tiling option, which I believe can be found in quicksettings. Are you using it?

solar merlin Oct 19, 2023, 9:44 AM

#

This is using diffusers so I am manually applying the circular convolution for the Conv2D function. But the tiling is fine. It is the red artifacts that appear with SDXL in both tiled and untiled images
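For readers unfamiliar with the circular convolution trick mentioned here: seamless textures are usually obtained by padding each convolution input with pixels wrapped around from the opposite edge, instead of zeros. PyTorch's `nn.Conv2d` exposes this as `padding_mode="circular"`. The sketch below only shows what that wrap-around padding does to a small grid in plain Python; it is not the code solar merlin is running.

```python
# Sketch: wrap-around (circular) padding on a 2D grid, the idea behind
# padding_mode="circular" in a convolution. Plain lists stand in for tensors.

def circular_pad(grid, pad):
    """Pad a 2D list by `pad` cells on every side, wrapping rows and columns around."""
    height, width = len(grid), len(grid[0])
    return [
        [grid[(r - pad) % height][(c - pad) % width] for c in range(width + 2 * pad)]
        for r in range(height + 2 * pad)
    ]


if __name__ == "__main__":
    tile = [
        [1, 2, 3],
        [4, 5, 6],
        [7, 8, 9],
    ]
    for row in circular_pad(tile, 1):
        print(row)
```

Because every border cell sees its neighbours from the opposite edge, the left and right edges (and top and bottom) end up consistent with each other, which is why the result tiles without seams.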

#

a better image

#

(watermark is also off)

uncut fiber Oct 19, 2023, 9:47 AM

#

i dont know. Probably try different SDXL model with baked in VAE

solar merlin Oct 19, 2023, 9:48 AM

#

I get it using the default baked in VAE too. But I guess maybe trying some other models is a good idea

minor glacier Oct 19, 2023, 10:08 AM

#

Have you got invisible-watermark in your environment? That looks a bit like the watermark.

solar merlin Oct 19, 2023, 10:11 AM

#

minor glacier Have you got invisible-watermark in your environment? That looks a bit like the w...

nope, that is removed and deleted from all code

strong copper Oct 19, 2023, 10:45 AM

#

mellow tendon Oct 19, 2023, 10:47 AM

#

I'm loving the 2x faster generation speed with the TensorRT SD Unets.
But now I just find myself running the Heun sampler at the same speed as I used to run DPM++ 2M Karras, just because I slightly prefer the output of Heun (but it is 2x slower than others).

fierce hollow Oct 19, 2023, 10:49 AM

#

is that with the new nvidia extension?

#

guess that doesn't work with comfy for now

mellow tendon Oct 19, 2023, 10:54 AM

#

Yes, the new Nvidia Extension.
The ComfyUI developers are planning it but with a low-priority
https://github.com/comfyanonymous/ComfyUI/discussions/1775
because of all its limitations.

GitHub

Last NVIDIA drivers claim a 2x speed increase with TensorRT Extensi...

NVIDIA just released the 545.84 drivers and they show a 2x speed improvement on image inference on this page: https://www.nvidia.com/en-us/geforce/news/game-ready-driver-dlss-3-naraka-vermintide-rt...

uncut fiber Oct 19, 2023, 10:55 AM

#

is it working for SDXL for you as well?

mellow tendon Oct 19, 2023, 10:56 AM

#

Yes it works in SDXL.

uncut fiber Oct 19, 2023, 10:56 AM

#

i have this driver but i also need the github extension
https://github.com/NVIDIA/Stable-Diffusion-WebUI-TensorRT

GitHub

GitHub - NVIDIA/Stable-Diffusion-WebUI-TensorRT: TensorRT Extension...

TensorRT Extension for Stable Diffusion Web UI. Contribute to NVIDIA/Stable-Diffusion-WebUI-TensorRT development by creating an account on GitHub.

mellow tendon Oct 19, 2023, 10:57 AM

#

if you want to upscale to 2k you need to generate an extra Unet (well for just about everything you need an extra unet)

uncut fiber Oct 19, 2023, 10:57 AM

#

yes but it doesnt work for me with SDXL

mellow tendon Oct 19, 2023, 10:58 AM

#

Did you generate the Unet with the SDXL model you want to use selected?

#

you need to do it for every model and lora you want to use.

#

I tried optimising a Lora and just got a load of errors, I think that is in Beta.

frozen terrace Oct 19, 2023, 11:00 AM

#

Does SDXL + Controlnets (IP-Adapter, depth etc) work when using TensorRT in A1111?

uncut fiber Oct 19, 2023, 11:00 AM

#

yes and it doesnt work

#

@mellow tendon yes

#

https://github.com/NVIDIA/Stable-Diffusion-WebUI-TensorRT#common-issueslimitations
@frozen terrace probably closest answer

frozen terrace Oct 19, 2023, 11:08 AM

#

Thanks, don't see a limitation there regarding my needs.

mellow tendon Oct 19, 2023, 11:13 AM

#

uncut fiber yes and it doesnt work

When you say it doesn't work, do you mean you don't see a speed-up when you have the Unet selected? Or you get an error?

mellow tendon Oct 19, 2023, 11:16 AM

#

uncut fiber yes and it doesnt work

Have you correctly installed the Extension so you see the TensorRT Tab?

fierce hollow Oct 19, 2023, 11:36 AM

#

does every resolution require compiling a new module? or is it similar to ai template where, say, a 2048 module can be used with 1536 resolution

#

or wait, more like, can I do 2048x1536 and 2048x2048 with the same module or would that requires 2 different ones

#

exporter seems kind of confusing

vale eagle Oct 19, 2023, 11:53 AM

#

Using LLM (GPT-4V) to self refine the image. https://arxiv.org/pdf/2310.08541.pdf

mellow tendon Oct 19, 2023, 12:06 PM

#

vale eagle Using LLM (GPT-4V) to self refine the image. https://arxiv.org/pdf/2310.08541.pd...

Yeah, that looks crazy good, I cannot wait to get my hands on that if it ever releases publicly. (Not sure if it needs to use GPT-4 in real time to work?)

mellow tendon Oct 19, 2023, 12:07 PM

#

fierce hollow does every resolution require compiling a new module? or is it similar to ai tem...

Yeah I read that anything within the max resolution setting will work.
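A rough way to picture the answer above, as a minimal sketch: treat an exported engine as covering a range of sizes and check whether a requested resolution falls inside it. The `EngineProfile` class, its field names, and the example numbers are made up for illustration; they are not the extension's actual API or settings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EngineProfile:
    """Hypothetical description of the size range one exported engine accepts."""
    min_width: int
    min_height: int
    max_width: int
    max_height: int


def fits(profile: EngineProfile, width: int, height: int) -> bool:
    """Return True if the requested image size lies inside the engine's range."""
    return (profile.min_width <= width <= profile.max_width
            and profile.min_height <= height <= profile.max_height)


# Illustrative numbers only: one engine exported with a 2048x2048 maximum.
engine = EngineProfile(min_width=1024, min_height=1024,
                       max_width=2048, max_height=2048)

print(fits(engine, 2048, 1536))  # True: both sides are inside the range
print(fits(engine, 2048, 2048))  # True: exactly at the maximum
print(fits(engine, 2560, 1440))  # False: width exceeds the maximum
```

Under that reading, 2048x1536 and 2048x2048 would share one engine, while anything above the exported maximum would need another export.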

fierce hollow Oct 19, 2023, 12:07 PM

#

ah sounds good, thanks

noble shoal Oct 19, 2023, 12:09 PM

#

vale eagle Using LLM (GPT-4V) to self refine the image. https://arxiv.org/pdf/2310.08541.pd...

That looks like an incredibly resource-expensive way to generate an image.

mellow tendon Oct 19, 2023, 12:11 PM

#

noble shoal That looks like an incredibly resource-expensive way to generate an image.

hmm, using an LLM to refine the prompt to a level where you don't need to use Controlnet or Loras or Cherry pick from a large batch seems like it would save loads of resources to me.

viscid basin Oct 19, 2023, 12:12 PM

#

Is it okay to generate images of my favourite kpop idol for personal use?

mellow tendon Oct 19, 2023, 12:12 PM

#

Oh, but I suppose that is generating a load of images, though that is happening in the cloud.

noble shoal Oct 19, 2023, 12:13 PM

#

mellow tendon hmm, using an LLM to refine the prompt to a level where you don't need to use Co...

If I go through the flowchart, it seems like it needs to generate N images to get to the point of generating the final output.

noble shoal Oct 19, 2023, 12:16 PM

#

mellow tendon Oh but I suppose that is generating a load of image but that is happening in the...

Unfortunately, the cloud is still using resources.

#

In my opinion, and I can only talk for myself. I think it's a bad idea 💡.

mellow tendon Oct 19, 2023, 12:19 PM

#

I still think if it can generate the image you want in one go (with a small batch of drafts) it could be efficient. I just ran a batch of 40 images and still didn't get quite what I was looking for.

#

I got some "happy accidents" along the way, with these learned dragons.

viscid basin Oct 19, 2023, 12:20 PM

#

Those are cool

#

Can you do cute dogs doing human things?

uncut fiber Oct 19, 2023, 12:34 PM

#

i got speedup in exported models.

#

i got error when tried to export SDXL

fierce hollow Oct 19, 2023, 12:46 PM

#

repo says you need to be on dev branch for sdxl

uncut fiber Oct 19, 2023, 12:51 PM

#

yes, and someone was on it and it took him 3 mins to generate an image, so it was 9 times slower 🙂

#

I think I probably need to be logged in to Nvidia and download that zip file. But SD is working OK.

mellow tendon Oct 19, 2023, 1:19 PM

#

uncut fiber yes and one was there and it took him 3 mins to generate image, so it 9 times sl...

@qwerty_qwer sounds like you might have downloaded the wrong extension, that is what I did first. If you try to search for it in Automatic1111, it finds the Automatic1111 TensorRT extension with the same name as the Nvidia TensorRT extension. Try deleting that and installing from the Nvidia URL.

mellow tendon Oct 19, 2023, 1:20 PM

#

mellow tendon @qwerty_qwer sounds like you might have downloaded the wrong extension, that is...

If there are tabs called ONNX or something, you have the wrong one.

uncut fiber Oct 19, 2023, 1:22 PM

#

It says it should be installed from URL?
I only have the TensorRT tab, but I haven't tried the advanced tab; I suppose I have to choose a converted model for it to appear? For SD models it is working, but I'm still getting some errors about entry points when starting A1111. Probably I need to download the zip from Nvidia, which means signing up and logging in?

#

It wasnt Qwerty_Qwer, it was @oblique swan i think

indigo carbon Oct 19, 2023, 1:24 PM

#

uncut fiber https://github.com/NVIDIA/Stable-Diffusion-WebUI-TensorRT#common-issueslimitatio...

yeah, that's identical in speed to AITemplate, except it's not as flexible. If NVIDIA had released this a month or two ago, it would be relevant to most of us

#

this would make sense if TensorRT is faster than AIT, but that's not really the case if both are done correctly

uncut fiber Oct 19, 2023, 1:26 PM

#

AIT is already working?

indigo carbon Oct 19, 2023, 1:26 PM

#

the next target for optimizing diffusion is pulling off something like exLLaMa for diffusion

indigo carbon Oct 19, 2023, 1:27 PM

#

uncut fiber AIT is already working?

for the longest time it's only been available on an old version of ComfyUI, but it's now compatible with the latest ComfyUI

uncut fiber Oct 19, 2023, 1:27 PM

#

o.k.

indigo carbon Oct 19, 2023, 1:31 PM

#

going from pure PyTorch to AIT or TRT on a modern GPU gives about 2-3 times the normal speed, with no degradation when done correctly. HOWEVER, with language models there's something called exLLaMa that gets a WHOPPING 8x boost by having actual optimized kernels. The day this happens to diffusion is the same day diffusion becomes an instant AI

whole kettle Oct 19, 2023, 1:41 PM

#

indigo carbon CLiP never likes being controlled too much, it's just really good at the things ...

Was kinda thinking about this before I joined. It doesn't really understand sentences right? I was reading the code and it seemed like it just sees keywords for the most part. Felt like it has to be doing something more than just that but thinking back on how the prompts turn out it really doesn't seem like it does.

indigo carbon Oct 19, 2023, 1:43 PM

#

whole kettle Was kinda thinking about this before I joined. It doesn't really understand sent...

unlike BLiP; CLiP just connects text to images, it won't have an understanding of language, and it's ONLY good at connecting text to images. however- more modern encoders such as BLiP are able to encode everything; making multimodal WAY easier and better

whole kettle Oct 19, 2023, 1:44 PM

#

yeah so it just tells it what nodes to go to without any understanding of what a direct object is right? It just sees "Dog is node 52521 in model"

#

or even their relation to each other

indigo carbon Oct 19, 2023, 1:44 PM

#

the only reason SDXL can have text in the images is because the UNET is a masterpiece, that's it. CLiP is a bottleneck

indigo carbon Oct 19, 2023, 1:46 PM

#

whole kettle or even their relation to each other

and CLiP is only good at doing that with just the text, which is why things like IPAdapter are required to make it able to blend/zeroshot images, and it will still lose plenty of quality

mellow tendon Oct 19, 2023, 1:47 PM

#

whole kettle Was kinda thinking about this before I joined. It doesn't really understand sent...

This video shows it well: "a plate without a banana on it" cannot be done with just CLIP right now https://youtu.be/TL2A8MYXsCE?si=MgROx4PW4kfHzocl&t=383

YouTube

MattVidPro AI

Mindblowing results! DALL-E 3 Quality AI Art using GPT-4 Vision & SDXL

▼ Link(s) From Today’s Video:

Research: https://idea2img.github.io/

rustic garnet Oct 19, 2023, 1:49 PM

#

the problem in my opinion is the limited training data for captions

#

initially, people used the ALT attribute in images to caption them

#

which means most image captions are rather uninformative

indigo carbon Oct 19, 2023, 1:49 PM

#

BLiP on the other hand, can encode both text and images just as easily; due to having an additional LLM component, it can have an excellent understanding of language. SDXL completely masters txt2img, but that's it. I can almost guarantee that if SDXL would have a more modern encoder, it would DESTROY other stuff completely

rustic garnet Oct 19, 2023, 1:50 PM

#

just think about an image of a battlefield with many soldiers and corpses, smoke and everything. The caption of such an image is not "battlefield, dead soldiers, corpses and blood on the ground, Napoleonic era", it's rather something like "Waterloo"

#

thus, CLIP was the best tool for this kind of data

#

it is trained to assign captions to images and embed them in the same space. This makes it extremely robust to badly captioned data
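To make the "same space" idea concrete, here is a toy sketch: if images and captions are embedded as vectors in one space, matching a caption to an image reduces to picking the caption with the highest cosine similarity. The three-dimensional vectors below are invented for illustration; real CLIP embeddings are much larger and come from the trained encoders.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def best_caption(image_vec, caption_vecs):
    """Return the caption whose embedding is closest to the image embedding."""
    return max(caption_vecs, key=lambda text: cosine(image_vec, caption_vecs[text]))


# Made-up embeddings, standing in for encoder outputs.
battlefield_image = [0.9, 0.1, 0.2]
captions = {
    "Waterloo": [0.8, 0.2, 0.1],
    "a plate with a banana": [0.1, 0.9, 0.3],
}

print(best_caption(battlefield_image, captions))  # Waterloo
```

Note that the terse caption wins here only because its vector was placed near the image; the point is that the matching depends on closeness in the shared space, not on how descriptive the caption text is.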

indigo carbon Oct 19, 2023, 1:51 PM

#

mellow tendon This video shows it well: "a plate without a banana on it" cannot be done with ...

that's a very ugly excuse of an attempt to improve diffusion. gluing an LLM with diffusion is a horrible idea

rustic garnet Oct 19, 2023, 1:51 PM

#

I'm not sure if we are much better in this regard nowadays. My experiences with BLIP are rather... bad. Sometimes it's okay, but most of the time BLIP gives me totally wrong captions

indigo carbon Oct 19, 2023, 1:52 PM

#

when it comes to the QUALITY of the images, SDXL pretty much destroys everything else, but it won't be as creative as Dall-e 3 due to CLiP being a bottleneck.

rustic garnet Oct 19, 2023, 1:53 PM

#

DALL-E, Imagen and DeepFloyd are models that use LLMs trained on text data only. So these models don't have the disadvantage of being trained only on bad captions; they are trained on the whole internet text corpus. HOWEVER, they are NOT trained on images and, thus, have no idea about visual components

#

they have a better text understanding, but probably a worse "style" understanding, as they don't have any knowledge about visuals.

#

I'm pretty sure the reason SDXL is using CLIP is because it turned out to be better than the alternatives

indigo carbon Oct 19, 2023, 1:54 PM

#

BLiP2 has knowledge about visuals

rustic garnet Oct 19, 2023, 1:54 PM

#

if you look into the SDXL source code you will see that they tried different text encoders, too, such as Flan T5

rustic garnet Oct 19, 2023, 1:54 PM

#

indigo carbon BLiP2 has knowledge about visuals

yes, but is BLIP better than CLIP? I'm not so sure about that. as said, for me BLIP often gave me really bad results

indigo carbon Oct 19, 2023, 1:55 PM

#

rustic garnet yes, but is BLIP better than CLIP? I'm not so sure about that. as said, for me B...

did you try BLiP2? it's really insane, I asked it about the expressions of the characters in an image and it even started making up their thoughts

rustic garnet Oct 19, 2023, 1:55 PM

#

in theory BLIP should have a better text understanding as it is instruction trained. But I wouldn't say this is guaranteed. As said, it always depends on the quality of your training data

rustic garnet Oct 19, 2023, 1:55 PM

#

indigo carbon did you try BLiP2? it's really insane, I asked it about the expressions of the c...

yeah, most of the time it cannot even distinguish males from females

#

also, BLIP2 is a really large model. You cannot have both, a model that fits into consumer hardware and a model that is state of the art

indigo carbon Oct 19, 2023, 1:56 PM

#

for me it even distinguished the styles and expressions. you must've not inferred it correctly then

rustic garnet Oct 19, 2023, 1:56 PM

#

indigo carbon for me it even distinguished the styles and expressions. you must've not inferre...

sometimes it does, sometimes it's really stupid

indigo carbon Oct 19, 2023, 1:57 PM

#

rustic garnet also, BLIP2 is a really large model. You cannot have both, a model that fits int...

SDXL is pretty much state of the art when it comes to the quality, but it won't be as creative as other things due to CLiP. HOWEVER; if your prompt is rich enough, it will make insane stuff

#

maybe if SAI came up with a new text/image encoder..

vale eagle Oct 19, 2023, 1:58 PM

#

#

SDXL base image

#

Just trying the Iterative self-refined Idea2Img prompt from https://arxiv.org/pdf/2310.08541.pdf

indigo carbon Oct 19, 2023, 2:00 PM

#

vale eagle

again, I wasn't complaining about the quality, it's just that CLiP is definitely a bottleneck. Look at IPAdapter for instance: that's the only thing that ACTUALLY enables it to get image input

#

if the text encoder itself had image input capabilities, IPAdapter wouldn't be necessary and there wouldn't be any degradation when doing image input

vale eagle Oct 19, 2023, 2:01 PM

#

indigo carbon again, I wasn't complaining about the quality, it's just that CLiP is definitely...

The paper explored how good the model (SDXL 1.0 base) could be with "good" prompt.

#

with current text encoder

indigo carbon Oct 19, 2023, 2:02 PM

#

vale eagle with current text encoder

that was kinda my point..

#

the text encoder itself is fine, it just won't have a good understanding feeding off of small prompts and it won't get image input

whole kettle Oct 19, 2023, 2:03 PM

#

Yeah if you roll the right seed and hit the right nodes in just the right way it does a good job.

vale eagle Oct 19, 2023, 2:03 PM

#

Using the new techniques that came out within the last few months could be a huge improvement.

indigo carbon Oct 19, 2023, 2:04 PM

#

whole kettle Yeah if you roll the right seed and hit the right nodes in just the right way it...

i'd say even the best job, it's just that CLiP is what's holding it back from pulling it off at a higher rate and having more capabilities

#

the solution to this seems to be SAI making a new text encoder that is the best of all worlds

rustic garnet Oct 19, 2023, 2:05 PM

#

indigo carbon again, I wasn't complaining about the quality, it's just that CLiP is definitely...

that has nothing to do with the text encoder oO

#

SDXL is trained on CLIP text tokens. In principle you can include images, as CLIP embeds images and text into the same space, but then your image would be only a single token, which does not make much sense

indigo carbon Oct 19, 2023, 2:06 PM

#

rustic garnet that has nothing to do with the text encoder oO

image input? of course it does, SDXL can't blend/zeroshot images without the assistance of IPAdapter

rustic garnet Oct 19, 2023, 2:06 PM

#

you can condition SDXL on images, too, like ControlNets are doing.

rustic garnet Oct 19, 2023, 2:06 PM

#

indigo carbon image input? of course it does, SDXL can't blend/zeroshot images without the ass...

because they haven't trained SDXL on image conditioning

#

that has nothing to do with the text encoder

#

you can train SDXL with image input if you want. It's just a decision

mellow tendon Oct 19, 2023, 2:07 PM

#

Dall-E 3's prompt following/understanding is simply amazing when compared to SDXL, when your output isn't being blocked by the filters...

rustic garnet Oct 19, 2023, 2:07 PM

#

Controlnets are doing that. IPAdapter is doing something similar. They encode an image into "text-like tokens" like CLIP and train a conditioning on that

indigo carbon Oct 19, 2023, 2:08 PM

#

rustic garnet you can train SDXL with image input if you want. It's just a decision

are you telling me that it's theoretically possible to make a version of something like SDXL that can get multiple image inputs and blend them?

uncut fiber Oct 19, 2023, 2:08 PM

#

@mellow tendon and do you know the minimum requirements for it? I am happy SAI keeps making it workable for, say, 4GB GPU cards or even lower.
And day by day there are more tags on the blacklist.

rustic garnet Oct 19, 2023, 2:09 PM

#

indigo carbon are you telling me that it's theoretically possible to make a version of somethi...

yeah, why not. The unet gets a conditioning as input. SDXL is trained on text conditioning, but you could use any kind of conditioning. ControlNets are a variant/finetune of SDXL that use images as conditioning

#

I'm pretty sure you could even train a controlnet to blend multiple images

#

the most natural way of blending images, though, is just using their CLIP embedding
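As a toy sketch of what "blending via embeddings" could look like: normalize both image embeddings, interpolate between them, and renormalize the result. The two-dimensional vectors and the `blend` helper are illustrative assumptions, not how any particular SDXL tool implements it.

```python
import math


def normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def blend(emb_a, emb_b, weight=0.5):
    """Interpolate between two embeddings and renormalize.

    weight=0.0 returns emb_a's direction, weight=1.0 returns emb_b's.
    Assumes the two vectors are not exact opposites (the mix would be zero).
    """
    a, b = normalize(emb_a), normalize(emb_b)
    mixed = [(1.0 - weight) * x + weight * y for x, y in zip(a, b)]
    return normalize(mixed)


# Made-up stand-ins for two image embeddings.
image_a = [1.0, 0.0]
image_b = [0.0, 1.0]

halfway = blend(image_a, image_b)
print(halfway)  # both components equal: the blend points midway between the two
```

The blended vector would then be used as conditioning in place of a single image's embedding, which is where something trained to accept it is still needed.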

indigo carbon Oct 19, 2023, 2:10 PM

#

one flaw with controlnet is it almost always causes the model's quality to degrade the more you try to force it to do something

rustic garnet Oct 19, 2023, 2:10 PM

#

indigo carbon one flaw with controlnet is it almost always causes the model's quality to degra...

this is also the case with text input

#

the further your conditioning moves away from the training data, the more difficult it is to get good results

indigo carbon Oct 19, 2023, 2:11 PM

#

rustic garnet this is also the case with text input

not really, I've been making 300 word prompts and the images come out incredible

rustic garnet Oct 19, 2023, 2:11 PM

#

it has nothing to do with the number of text tokens

indigo carbon Oct 19, 2023, 2:11 PM

#

probably token normalization, isn't it?

rustic garnet Oct 19, 2023, 2:11 PM

#

SDXL is always using at least 75 tokens

#

if you give a short caption it just fills it with blank tokens
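A minimal sketch of that padding behaviour, assuming the usual CLIP layout of 75 prompt tokens plus start and end tokens (77 in total), and assuming that longer prompts are split into further 75-token chunks, as some UIs do. The token ids and `PAD_ID` are placeholders, not real tokenizer values.

```python
CHUNK = 75  # prompt tokens per chunk; start/end tokens bring CLIP's context to 77
PAD_ID = 0  # hypothetical padding id, chosen only for this illustration


def pad_to_chunks(token_ids, chunk=CHUNK, pad_id=PAD_ID):
    """Split token ids into chunks of `chunk` tokens, padding the last chunk."""
    chunks = [token_ids[i:i + chunk] for i in range(0, len(token_ids), chunk)]
    if not chunks:
        chunks = [[]]  # an empty prompt still occupies one full chunk
    chunks[-1] = chunks[-1] + [pad_id] * (chunk - len(chunks[-1]))
    return chunks


short_prompt = [101, 102, 103]       # 3 made-up token ids
long_prompt = list(range(1, 81))     # 80 made-up token ids

print(len(pad_to_chunks(short_prompt)))     # 1
print(len(pad_to_chunks(short_prompt)[0]))  # 75
print(len(pad_to_chunks(long_prompt)))      # 2
```

So a three-token caption and a 75-token caption both reach the model at the same length; only the amount of padding differs.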

#

what I mean is if you force SDXL to follow a strange prompt then it will degrade image quality. Long texts give good results because they often give SDXL much freedom

#

if you make an image from a song lyrics there is no "correct outcome" you enforce. You are happy with any nice looking result

indigo carbon Oct 19, 2023, 2:15 PM

#

anyways, whenever SD3.0 comes out, I'm assuming it'll have image conditioning like @rustic garnet mentioned is possible, that'll probably be a huge step

#

or even possibly the next focus is mastering other components of the model? it seems like SDXL mastered the UNET, but idk about all the other stuff

mellow tendon Oct 19, 2023, 2:23 PM

#

uncut fiber @mellow tendon and do you know minimal requirements for it? I am happy sa...

Sorry I don't understand what you are asking??? I have 24GB of Vram so not sure about 4GB....

rustic garnet Oct 19, 2023, 2:23 PM

#

SDXL IS the unet 😅
all other components (vae, text encoder) are independent of sdxl

indigo carbon Oct 19, 2023, 2:29 PM

#

rustic garnet SDXL IS the unet 😅 all other components (vae, text encoder) are independent of ...

the VAE is also SDXL, SAI has a repo that contains just the VAE of SDXL for some reason

rustic garnet Oct 19, 2023, 2:43 PM

#

yes, but it's trained independently from sdxl and can be used independently

indigo carbon Oct 19, 2023, 2:53 PM

#

true, VAE is just pixel to latent and the other way around
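For a sense of scale, a small sketch of the pixel-to-latent size relationship, assuming the commonly cited SD/SDXL VAE layout of 8x spatial downscaling and 4 latent channels (neither number is stated in this chat):

```python
def latent_shape(width, height, scale=8, channels=4):
    """Return (channels, latent_height, latent_width) for an image size.

    `scale` and `channels` are assumptions about the VAE, not read from a model.
    """
    if width % scale or height % scale:
        raise ValueError("width and height must be multiples of the scale factor")
    return (channels, height // scale, width // scale)


print(latent_shape(1024, 1024))  # (4, 128, 128)
print(latent_shape(1920, 1080))  # (4, 135, 240)
```

The UNET only ever sees the small latent tensor; the VAE handles the conversion to and from full-resolution pixels.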

#

but I think conditioning is dependent..? if you use another model's conditioning on a UNET with a different architecture the Ksampler will fail

rustic garnet Oct 19, 2023, 3:04 PM

#

yes, the unet depends on the vae and the text encoder input. Use a different conditioning and you have to retrain the unet

#

I'm just saying the unet is the main component that is trained. All other components were trained too, but independently from it (and sometimes by different labs and on different data). If you look into the source code of SDXL you will see that many different conditionings were implemented. They didn't choose CLIP for no reason. I guess it was the best trade-off between hardware requirements and visual appeal

#

if you compare SDXL with DeepFloyd IF, which uses an LLM trained on pure text, you will see that DeepFloyd IF has a MUCH better text understanding than SDXL. However, I don't find the images from DeepFloyd IF visually appealing... maybe it's because they still haven't published the highres model. But I think it might also have something to do with the text encoder not being good with styles and aesthetics

vale eagle Oct 19, 2023, 3:25 PM

#

The requirement that the model run on consumer-level hardware is a limitation. They could do better without it.

tribal lantern Oct 19, 2023, 3:35 PM

#

Well, Dalle-3 has shown you can have both aesthetics (NOT really detailed styles) and careful prompt following, sometimes. But overall, styles/aesthetics are definitely SDXL's strength. Especially organic and fine details (a jungle) SDXL manages to create much, much better. At first I was impressed by Dalle-3's prompt following, and I still am, but even there I've started to notice it also has a tendency to fail once scenes get really out there. At the same time, it's awesome that it works for simple things where SDXL has a tendency to flat out ignore aspects of a prompt: at worst it won't do things, at best it needs carefully formulated prompts. Numbers 1, 2 and 3 on my wishlist for SD-next are better prompt following, especially for coarse "details" (fine details like eye color are solvable, e.g. https://rich-text-to-image.github.io/), but I've yet to find a good solution for coarser ones affecting the (composition of the) whole scene.

#

Maybe one model won't need to do all, just create the basic scene in a model that does the prompt, then enhance in sdxl. Kinda like sketch to image but the sketch is a different model

cyan crown Oct 19, 2023, 3:52 PM

#

Dalle3 is better with writings and understanding prompt. SDXL is better with quality

#

So one good idea could be creating base image with DE3 and then use it with Controlnet in SDXL for example

hoary saddle Oct 19, 2023, 4:10 PM

#

finite thunder Oct 19, 2023, 4:16 PM

#

uncut fiber https://rikkar69.github.io/SDXL-artist-study/ or here

This is great! I will review the Community Submissions on your GitHub to add the missing artist names.

hoary saddle Oct 19, 2023, 4:22 PM

#

#

cyan crown Oct 19, 2023, 4:38 PM

#

DALLE_2023-10-19_18.37.35_-_Foto_in_16_9_di_un_set_Lego_ispirato_al_film_The_Godfather._Sullo_sfondo_si_trova_la_scatola_del_set_con_una_chiara_e_ben_definita_scritta_The_Godf.png

DALLE_2023-10-19_18.37.22_-_Scena_in_16_9_di_un_set_Lego_tematico_su_The_Godfather._Sullo_sfondo_la_scatola_del_set_mostra_in_modo_prominente_la_scritta_The_Godfather._In_pr.png

DALLE_2023-10-19_18.37.15_-_Foto_panoramica_in_16_9_di_un_set_Lego_basato_sul_film_The_Godfather._La_scatola_del_set_con_una_ben_visibile_scritta_The_Godfather_domina_lo_sf.png

urban fjord Oct 19, 2023, 4:44 PM

#

The problem with using controlnet to blend images is that you still need training images for that, but if you have it then it is quite doable.

uncut fiber Oct 19, 2023, 4:54 PM

#

@finite thunder it is not my github 🙂 but there were owner presented. Mmnt
it is @cursive warren i think

wet nacelle Oct 19, 2023, 4:57 PM

#

static berry Oct 19, 2023, 5:07 PM

#

cyan crown Oct 19, 2023, 5:10 PM

#

SDXL Version

native knot Oct 19, 2023, 5:12 PM

#

cyan crown SDXL Version

Images you can hear.

cyan crown Oct 19, 2023, 5:12 PM

#

😂

noble shoal Oct 19, 2023, 5:47 PM

#

I once again screwed SDXL by training a Lora on mostly 80x112px images of Faces. Getting some glorious output here at the extreme resolution of 88x120px 😅

#

At least i get 11.51it/s 🤷‍♂️

steady grove Oct 19, 2023, 5:48 PM

#

cool i guess

noble shoal Oct 19, 2023, 6:02 PM

#

steady grove Oct 19, 2023, 6:03 PM

#

i bet he really nose how to party

noble shoal Oct 19, 2023, 6:04 PM

#

steady grove i bet he really nose how to party

Oh yeah, he can sense them from miles away.

hoary saddle Oct 19, 2023, 6:37 PM

#

dont be so picky

hoary saddle Oct 19, 2023, 6:37 PM

#

cyan crown SDXL Version

these are insanely good

crisp owl Oct 19, 2023, 7:01 PM

#

something smells fishy

upbeat summit Oct 19, 2023, 7:20 PM

#

masslevel-sdxl-20231017014958-1080107767194936-upscale-post-nm.png

#

masslevel-sdxl-20231016180444-485336132717779-upscale-post-nm.png

solar merlin Oct 19, 2023, 7:24 PM

#

I keep getting this dot mess on the white images I create. Is it something to do with trying to create white images?

#

I am using sdxl + refiner

native knot Oct 19, 2023, 7:27 PM

#

solar merlin I keep getting this dot mess on the white images I create. Is it something to d...

That looks like the noise that having the bugged 1.0 model+vae causes.

solar merlin Oct 19, 2023, 7:29 PM

#

@native knot I am using the base 1.0 model but then swapping the VAE with this code

#

but also, on the stability ai huggingface card they reinstated the 0.9 VAE so i am left confused https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main

stabilityai/stable-diffusion-xl-base-1.0 at main

#

im kinda stumped since I "think" I am applying all the various fixes

crisp owl Oct 19, 2023, 8:03 PM

#

Yeah so there's the base_1.0_0.9.safetensor model which has the proper working vae baked in. the 1.0.safetensor is the wonky one.

The standalone vae files, normal and fp16 are these
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/tree/main/vae
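For anyone following along in code rather than a UI, here is a minimal sketch of swapping in the standalone VAE from that folder, assuming the Hugging Face `diffusers` library is being used (the code referred to earlier in this thread is not shown here, so this is not a reconstruction of it). The function name is made up; the repo id and `vae` subfolder are the ones linked above.

```python
def load_sdxl_with_standalone_vae(
    base_repo="stabilityai/stable-diffusion-xl-base-1.0",
    vae_subfolder="vae",
):
    """Load SDXL base and replace its VAE with the standalone one from the repo."""
    # Imported lazily so this function can be defined without the heavy packages.
    from diffusers import AutoencoderKL, StableDiffusionXLPipeline

    # Load only the VAE weights from the repo's `vae` subfolder.
    vae = AutoencoderKL.from_pretrained(base_repo, subfolder=vae_subfolder)

    # Build the pipeline, overriding whichever VAE it would load by default.
    return StableDiffusionXLPipeline.from_pretrained(base_repo, vae=vae)
```

Calling `load_sdxl_with_standalone_vae()` downloads the weights on first use, so it needs network access and a fair amount of disk space.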

solar merlin Oct 19, 2023, 8:05 PM

#

crisp owl Yeah so there's the base_1.0_0.9.safetensor model which has the proper working v...

hmm, I still get the weird colours though when swapping out the VAE with the fixed version (my code above)

hasty smelt Oct 19, 2023, 8:06 PM

#

Hi everyone, I'm wondering if there is a command to update the "run_nvidia_gpu.bat" file. I've never updated it, so I don't know if it's necessary.

crisp owl Oct 19, 2023, 8:06 PM

#

Are you generating outside of standard size ratios? Adding any other models/controlnets/loras/etc?

solar merlin Oct 19, 2023, 8:07 PM

#

crisp owl Are you generating outside of standard size ratio's? Adding any other models/co...

No, res is 1024x1024, no other controlnets or loras. Just base for 30 steps, output as latent, then into refiner for 15.

crisp owl Oct 19, 2023, 8:07 PM

#

hasty smelt Hi everyone, I'm wondering if there is a command to update the "run_nvidia_gpu.b...

ComfyUI_windows_portable\update
then run the update_comfyui.bat file

hasty smelt Oct 19, 2023, 8:07 PM

#

crisp owl `ComfyUI_windows_portable\update` then run the `update_comfyui.bat` file

Thanks solesbeedude

crisp owl Oct 19, 2023, 8:08 PM

#

don't run the python_dependencies file

#

you don't need to unless you know what you're doing.

crisp owl Oct 19, 2023, 8:09 PM

#

solar merlin no res is 1024x1024, no other controlnets or loras. Just base for 30 steps outpu...

That's odd...
I'd still lean to a vae issue just given what it looks like in your shared screenshot.
I've personally never run into that issue ever though

hasty smelt Oct 19, 2023, 8:10 PM

#

crisp owl don't run the python_dependencies file

Can I update smoothly using the update_comfyui.bat file?

crisp owl Oct 19, 2023, 8:11 PM

#

Yup, you can run that file whenever, the update will only apply when you restart your entire instance

hasty smelt Oct 19, 2023, 8:11 PM

#

thanks buddy

rustic shadow Oct 19, 2023, 8:22 PM

#

#

#

icy brook Oct 19, 2023, 8:57 PM

#

lusty wolf Oct 19, 2023, 9:24 PM

#

Just waiting for things to get better in South Africa...

crisp owl Oct 19, 2023, 9:37 PM

#

You from there? Where at?

wet nacelle Oct 19, 2023, 9:57 PM

#

#

#

#

#

crisp owl Oct 19, 2023, 10:10 PM

#

looks like the passageway between Riverside County and Orange County in SoCal lol

#

wet nacelle Oct 19, 2023, 10:17 PM

#

crisp owl

HA!

crisp owl Oct 19, 2023, 10:22 PM

#

There we go, had to download the app to replicate the sun. Even sets in the same direction lol

soft bone Oct 19, 2023, 10:23 PM

#

i trained on a buddy's custom car 🤔

crisp owl Oct 19, 2023, 10:35 PM

#

hmmmm.....well, I'd guess somewhere close to 1024

#

but I haven't used ultimate upscale for SDXL

#

can always test and see if the outcome is wonky though

static prawn Oct 19, 2023, 10:37 PM

#

static prawn Oct 19, 2023, 10:38 PM

#

crisp owl can always test and see if the outcome is wonky though

felt like it doesn't make a big difference; the images look a little different overall, but not in quality imo

crisp owl Oct 19, 2023, 10:39 PM

#

Nice, probably if your pc can handle it, any speed increase between 1024 tiles vs 512 tiles?

static prawn Oct 19, 2023, 10:40 PM

#

i felt like it doesnt make a big difference

#

bec i have doubled tiles with 512x512

#

with 1024x1024 i have bigger tiles

#

having fewer tiles is definitely an advantage anyway

#

im running a gtx 1070, still happy with it 🙂

#

best gpu purchase of my life so far

crisp owl Oct 19, 2023, 10:42 PM

#

I notice with my vae tiled nodes in ComfyUI, if I keep the tiles at 512 vs changing to 1024, I lose about 30 seconds, and I haven't noticed a difference in quality.
I'm running a 2060S, this thing has been a trooper lol
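A quick back-of-the-envelope sketch of why tile size matters for the amount of work: counting how many tiles cover an image, ignoring any overlap or padding the actual nodes add. The 2048x2048 target size is only an example.

```python
import math


def tile_count(width, height, tile):
    """Number of square tiles needed to cover an image, ignoring overlap."""
    return math.ceil(width / tile) * math.ceil(height / tile)


print(tile_count(2048, 2048, 512))   # 16
print(tile_count(2048, 2048, 1024))  # 4
```

Halving the tile edge quadruples the tile count, so per-tile overhead adds up quickly, while larger tiles need more memory per pass.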

steep wave Oct 19, 2023, 10:44 PM

#

static prawn Oct 19, 2023, 10:44 PM

#

i just dont get why sdxl is so sensitive to prompts , i often have completely oversaturated, overexposed results

crisp owl Oct 19, 2023, 10:46 PM

#

I've mostly seen that with specific checkpoints.
I had protovision really disliking some prompts, but if I change to a different checkpoint, they'd be completely perfect

lilac wren Oct 19, 2023, 10:47 PM

#

I finally managed to train a LoRA model with my 8 GB of Vram, in 512x512, but I was looking for speed (2h40). The results are very impressive, despite the small number of images and steps.
(model : Leah Dizon)

12897-1663273333-close-up_to_a_nymph___lora_LeahLora_v4_SDXL_1___and_leafy_dress__on_a_mossy_log__near_a_crystal-clear_lake__in_a_magical_fo.png

12912-1280909775-a_woman_standing_in_time_square__lora_LeahLora_v4_SDXL_1__in_a_chic_sexy_dress_circle_lenses_medium_breasts_best_quality_1.png

#

12914-3216161951-a_woman_standing_in_time_square__lora_LeahLora_v4_SDXL_1__in_a_chic_sexy_dress_circle_lenses_medium_breasts_best_quality_1.png

icy brook Oct 19, 2023, 10:59 PM

#

#

Aether Bubbles & Foam, coming tomorrow on Civitai.

wet nacelle Oct 19, 2023, 11:00 PM

#

icy brook Oct 19, 2023, 11:01 PM

#

#

mellow tendon Oct 19, 2023, 11:10 PM

#

nimble heart Oct 19, 2023, 11:11 PM

#

Colored latents are fun

wet nacelle Oct 19, 2023, 11:20 PM

#

#

native knot Oct 19, 2023, 11:45 PM

#

And don't ever talk to my son again!

wet nacelle Oct 19, 2023, 11:51 PM

#

#

half cedar Oct 20, 2023, 12:48 AM

#

https://twitter.com/russellcrowe/status/1715118486262038776?t=b-pxlh3NtE9_WOtOXlHXKQ&s=19

indigo carbon Oct 20, 2023, 12:49 AM

#

tribal lantern Maybe one model won't need to do all, just create the basic scene in a model tha...

maybe future SD versions will have a pre-sampling step to just create vague shapes, then have a modern UNET like SDXL decode them?

#

I feel like that would make it even harder to blend images, this means that both modelA and modelB will need to have image conditioning to enhance the capabilities beyond what current SD can do

#

though I know Kandinsky 2.2 can blend images without needing something like IPAdapter; but I'm unsure if that's because of the different encoder, or because it is pixel diffusion

lilac wren Oct 20, 2023, 1:02 AM

#

13014-1032690323-a_woman__lora_LeahLora_v4_SDXL_1__circle_lenses_sexy_black_tanktop_with_a_mini_skirt_on_a_futuristic_Yamaha_motorcycle_a_p.png

indigo carbon Oct 20, 2023, 1:04 AM

#

also idk about SDXL having NO understanding of language; I just wrote here- "a polite and friendly octopus drinking tea"

#

it figured out the tophat on its own, so unless polite means wearing a tophat, it did get creative here

lilac wren Oct 20, 2023, 1:24 AM

#

13023-1095653720-a_woman__lora_LeahLora_v4_SDXL_1__circle_lenses_sexy_black_tanktop_with_a_mini_skirt_on_a_futuristic_Yamaha_motorcycle_a_p.png

#

12983-3353664928-a_woman_standing__lora_LeahLora_v4_SDXL_1__circle_lenses_sexy_transparent_draped_dress_golden_jewels_in_the_great_ancient.png

weary yacht Oct 20, 2023, 1:31 AM

#

nice.. got the Intel A770 doing 1920x1080 SDXL

weary yacht Oct 20, 2023, 2:41 AM

#

dude.. this might be my new wallpaper

nimble heart Oct 20, 2023, 3:19 AM

#

what's the speed you get on that?

weary yacht Oct 20, 2023, 4:10 AM

#

doing 1920x1080 is like 5-6 seconds per iteration... smaller SD1.5 stuff was under 1.3S/it

#

so an image like that above, just a straight, un-upscaled 1920x1080, takes like 9-10 minutes

#

probably 4-5 if I wasn't using an insanely high number of steps

#

10 minutes for this.. I'll use the same prompt and do it for 50

crisp owl Oct 20, 2023, 4:14 AM

#

#

Fenrir walking away from Thor who is busy making the second attempted chain

weary yacht Oct 20, 2023, 4:16 AM

#

3 minutes, 54 seconds, 1920x1080, 50 steps
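
A quick sanity check on those timings: seconds per iteration is just wall time divided by step count. This is a minimal sketch; the helper name is made up for illustration and the inputs are the figures quoted in the messages above.

```python
def seconds_per_iteration(minutes: int, seconds: int, steps: int) -> float:
    """Average sampling time per step for a run of `steps` iterations."""
    return (minutes * 60 + seconds) / steps


# The 50-step 1920x1080 run reported above: 3 minutes, 54 seconds.
rate = seconds_per_iteration(3, 54, 50)
print(f"{rate:.2f} s/it")  # 4.68 s/it

# At the quoted 5-6 s/it, a 10-minute render corresponds to roughly 100-120 steps.
print(600 / 6, 600 / 5)  # 100.0 120.0
```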

zinc cargo Oct 20, 2023, 4:39 AM

#

icy brook

foam is going to get so nsfw suggestive very fast 😛

crisp owl Oct 20, 2023, 4:47 AM

#

That was my first thought also 😆

#

But the bubbles are neat for sure

zinc cargo Oct 20, 2023, 4:48 AM

#

foam also gonna be cool, but you know 🙂

crisp owl Oct 20, 2023, 4:48 AM

#

people gonna people thomas

prime juniper Oct 20, 2023, 5:04 AM

#

Looking for a SDXL specialist to create a dreambooth Model for me Photorealism (no more cameras 🙂 or photography studio ). Can anyone help me create this?

pure crystal Oct 20, 2023, 5:22 AM

#

#

analog fern Oct 20, 2023, 6:32 AM

#

Curious to ask, can AI generate characters or animations like this?

nimble heart Oct 20, 2023, 7:07 AM

#

there's some 1.5 tunes/loras meant for making spritesheets

#

you'll have to either inpaint or doctor them to make them consistent though

nimble heart Oct 20, 2023, 7:49 AM

#

mermaid skeleton found by researchers in the abyssal zone

rigid lagoon Oct 20, 2023, 7:52 AM

#

mermaid skeleton found by researchers in the abyssal zone

nimble heart Oct 20, 2023, 7:54 AM

#

that's not the prompt lol

vale eagle Oct 20, 2023, 7:55 AM

#

One image explains Dalle3

nimble heart Oct 20, 2023, 7:56 AM

#

rigid lagoon mermaid skeleton found by researchers in the abyssal zone

grainy submarine footage of a skeletal mermaid in the deep abyssal ocean was what i used, but you probably wont be able to re-create it in the bot because it uses a black input latent

vital ermine Oct 20, 2023, 7:57 AM

#

nimble heart Oct 20, 2023, 7:58 AM

#

oh general the AMD Nod.AI acquisition completed

vital ermine Oct 20, 2023, 8:03 AM

#

I wish Nod was for training

#

Maybe now it will be

nimble heart Oct 20, 2023, 8:04 AM

#

sounds like they hired them for their general torch knowledge

#

not just SHARK

#

some of this black latent shit's kinda nightmare fuel lmao

#

she has the omae wa mou eyes

vital ermine Oct 20, 2023, 8:17 AM

#

black latent?

nimble heart Oct 20, 2023, 8:17 AM

#

yea

vital ermine Oct 20, 2023, 8:17 AM

#

never heard of it

nimble heart Oct 20, 2023, 8:18 AM

#

instead of feeding the ksampler torch.zeros() I feed it a latent with the approximate VAE values of black

#

its a custom node I wrote

vital ermine Oct 20, 2023, 8:18 AM

#

Oh, nice. Yeah, I barely touched latent

nimble heart Oct 20, 2023, 8:19 AM

#

it lets you get near-black images

#

I'm 99% sure its what Midjourney does

vital ermine Oct 20, 2023, 8:19 AM

#

Has to be

nimble heart Oct 20, 2023, 8:19 AM

#

it picks up on keywords like "dark, black, bright, white" etc and changes the input latent color

#

instead of just neutral grey like SD does by default

vital ermine Oct 20, 2023, 8:20 AM

#

dynamic latent iow?

nimble heart Oct 20, 2023, 8:20 AM

#

iow?

vital ermine Oct 20, 2023, 8:20 AM

#

in other words

nimble heart Oct 20, 2023, 8:20 AM

#

ah

#

yea

#

so I made a little node that creates colored latents in the 6 main colors + black/white

#

at any strength

#

it works super well

vital ermine Oct 20, 2023, 8:21 AM

#

yeah, MJ has always been about the tricks behind the curtains

nimble heart Oct 20, 2023, 8:21 AM

#

if you want just a night time scene you can use not-quite-black

vital ermine Oct 20, 2023, 8:21 AM

#

I have zero knowledge of node creating

nimble heart Oct 20, 2023, 8:21 AM

#

its like 50 lines

#

if that

vital ermine Oct 20, 2023, 8:22 AM

#

this one needs that

nimble heart Oct 20, 2023, 8:22 AM

#

import torch
import nodes  # ComfyUI's built-in nodes module, used here for MAX_RESOLUTION

# Approximate SDXL VAE latent values (one per latent channel) for solid colors.
XL_CONSTS = {
    "black" : [-21.675981521606445, 3.864609956741333, 2.4103028774261475, 2.579195261001587],
    "white" : [18.043685913085938, 1.7262177467346191, 9.310612678527832, -8.135881423950195],
    "red" : [-19.665550231933594, -19.79644012451172, 10.68371868133545, -12.427474021911621],
    "green" : [-3.530947685241699, 14.075841903686523, 26.489261627197266, 8.67661190032959],
    "blue" : [0.45569008588790894, 16.3455867767334, -17.67197036743164, 4.145791053771973],
    "cyan" : [12.434264183044434, 26.013031005859375, 4.298962593078613, 7.954266548156738],
    "magenta" : [-0.9616246223449707, -5.109368801116943, -12.062283515930176, -9.02152156829834],
    "yellow" : [-6.609264373779297, -10.563915252685547, 32.47910690307617, -8.209832191467285],
}

class BSZColoredLatentImageXL:
    @classmethod
    def INPUT_TYPES(s):
        return {"required": {
            "color": (list(XL_CONSTS.keys()),),
            "strength": ("FLOAT", {"default": 0.5, "min": 0.0, "max": 1.0, "step": 0.1}),
            "width": ("INT", {"default": 1024, "min": 16, "max": nodes.MAX_RESOLUTION, "step": 8}),
            "height": ("INT", {"default": 1024, "min": 16, "max": nodes.MAX_RESOLUTION, "step": 8}),
            "batch_size": ("INT", {"default": 1, "min": 1, "max": 4096}),
        }}
    RETURN_TYPES = ("LATENT",)
    FUNCTION = "generate"
    CATEGORY = "latent"

    def generate(self, color: str, strength: float, width: int, height: int, batch_size: int):
        # SDXL latents have 4 channels at 1/8 of the image resolution.
        samples = torch.empty([batch_size, 4, height // 8, width // 8])
        cols = XL_CONSTS[color]
        for batch in samples:
            # Fill every channel with the color's value scaled by strength;
            # strength 0.0 gives the usual all-zero empty latent.
            for i, value in enumerate(cols):
                batch[i].fill_(value * strength)
        return ({"samples":samples},)

entire code for it.

nimble heart Oct 20, 2023, 8:23 AM

#

vital ermine this one needs that

darker?

vital ermine Oct 20, 2023, 8:23 AM

#

nimble heart Oct 20, 2023, 8:23 AM

#

yea that's the node. 0 strength is an empty latent like you're used to. 1.0 strength is pure black

#

or pure white/blue/red/etc

#

only works on XL

vital ermine Oct 20, 2023, 8:24 AM

#

0.5 is the mid grey?

nimble heart Oct 20, 2023, 8:24 AM

#

No 0.5 would be like 25% lightness approx

#

SD is mid grey by default.

#

so 50% black is 25% lightness if that makes sense

#

cause 0% black is gray

#

it works like that because I cant blend latent colors, only multiply
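
The lightness numbers in the last few messages follow a simple rule of thumb. The sketch below assumes an all-zero latent decodes to mid grey and that lightness moves linearly with strength; both are simplifications taken from the chat, not measured VAE behaviour, and the function name is hypothetical.

```python
def approx_lightness(color: str, strength: float) -> float:
    """Rough lightness (0.0 black .. 1.0 white) of a black or white latent.

    Assumes the empty latent decodes to mid grey (0.5) and that lightness
    changes linearly with strength, which is only an approximation.
    """
    if color not in ("black", "white"):
        raise ValueError("only black and white are modelled here")
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be within [0.0, 1.0]")
    target = 0.0 if color == "black" else 1.0
    return 0.5 + (target - 0.5) * strength


print(approx_lightness("black", 0.0))  # 0.5
print(approx_lightness("black", 0.5))  # 0.25
print(approx_lightness("black", 1.0))  # 0.0
```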

vital ermine Oct 20, 2023, 8:25 AM

#

I already feed into latent and this has no pass through 😦

nimble heart Oct 20, 2023, 8:25 AM

#

if you have my whole pack, the Offset node has a -1.0 -> 1.0 node that just adjusts your existing latent

#

so 0.0 is gray, -1.0 is black, 1.0 is white

vital ermine Oct 20, 2023, 8:25 AM

#

I have your pack

#

oh, sweet

#

let me try that now

nimble heart Oct 20, 2023, 8:26 AM

#

might be more your style

#

use it before the noise is added

#

cause it's multiplicative

vital ermine Oct 20, 2023, 8:27 AM

#

what is it doing?

#

wow

#

just tried 0.5

nimble heart Oct 20, 2023, 8:28 AM

#

    def offset(self, latent, offset: float):
        # Work on a copy so the upstream latent is left untouched.
        samples = latent['samples'].clone()
        if offset == 0:
            # Nothing to blend; pass the copy through unchanged.
            return (latent | {'samples': samples},)
        # Positive offsets blend toward white, negative ones toward black.
        cols = XL_CONSTS['white'] if offset > 0 else XL_CONSTS['black']
        offset = abs(offset)
        for batch in samples:
            # Scale the existing latent down, then add the color's share.
            batch.mul_(1 - offset)
            for i, value in enumerate(cols):
                batch[i].add_(value * offset)
        return (latent | {'samples': samples},)

basically it adds offset% white or black and multiplies the latent by the inverse to compensate
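
Put differently, the offset is a linear interpolation between the existing latent and the target color. Here is a torch-free sketch on a single 4-channel latent "pixel"; the function name and the example values are made up, and BLACK is rounded from the XL_CONSTS table posted earlier.

```python
from typing import Sequence

# Per-channel values for SDXL "black", rounded from XL_CONSTS.
BLACK = [-21.676, 3.865, 2.410, 2.579]


def offset_pixel(latent: Sequence[float], color: Sequence[float], amount: float) -> list[float]:
    """Blend one 4-channel latent pixel toward `color` by `amount` in [0, 1]."""
    return [(1 - amount) * x + amount * c for x, c in zip(latent, color)]


pixel = [1.0, -2.0, 0.5, 3.0]  # arbitrary example latent values
print(offset_pixel(pixel, BLACK, 0.0))  # unchanged: nothing happens at 0
print(offset_pixel(pixel, BLACK, 0.5))  # halfway between the pixel and BLACK
print(offset_pixel(pixel, BLACK, 1.0))  # exactly BLACK: the original content is gone
```

That last case is probably why a full-strength offset is usually too strong on a latent that already holds content.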

vital ermine Oct 20, 2023, 8:28 AM

#

0.5 and 0. 0 is like it was without

nimble heart Oct 20, 2023, 8:29 AM

#

yea the node does nothing at 0

vital ermine Oct 20, 2023, 8:29 AM

#

-1.0

nimble heart Oct 20, 2023, 8:29 AM

#

except consume a few MB of memory by caching the latent I guess

#

yea 1.0 is usually a bit too strong unless you're literally making a black background with a single thing on it

vital ermine Oct 20, 2023, 8:30 AM

#

well, I like it

#

thank you for this

nimble heart Oct 20, 2023, 8:30 AM

#

sometimes -1.0 looks good though

vital ermine Oct 20, 2023, 8:30 AM

#

the -1.0 is perfect

nimble heart Oct 20, 2023, 8:31 AM

#

for positive values +1.0 will basically turn it into a digital doodle on a pure white photoshop canvas lol

#

so +0.5 is usually the limit unless it's like an angel in white robes in a blizzard

#

if you ever use 2.1 or 1.5 the RGBA node can do something similar since it has arbitrary colors.

#

for img2img or other non-empty latent scenarios you'd have to merge it though

#

comfy has nodes for latent blending/merging too btw

#

built in

#

they might be in _for_testing still

vital ermine Oct 20, 2023, 8:35 AM

#

yes, I think that is where I saw them

#

I needed this last night as I was fighting with way too dark

#

nimble heart Oct 20, 2023, 8:37 AM

#

yea hypothetically you could use that to mix my colored_latent_image node with an existing one instead of just using the offset

#

but idk that's a lot of effort hence why I just made the offset

vital ermine Oct 20, 2023, 8:38 AM

#

I just did

#

nimble heart Oct 20, 2023, 8:39 AM

#

but yea I'm playing with -1.0 black to make some sketchy research footage and its great

vital ermine Oct 20, 2023, 8:39 AM

#

I need to remake this now as I will use it to colour the latent instead of the image back into latent

nimble heart Oct 20, 2023, 8:39 AM

#

if you're blending then leave the colored latent at 1.0 strength and just change the blend

#

should work similarly maybe
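
One way to see why leaving the colored latent at 1.0 makes sense: with a plain linear blend, strength and blend factor multiply, so changing both just makes the effective amount harder to reason about. The `blend` helper below is a generic linear blend written for illustration; it is an assumption, not ComfyUI's built-in node, and WHITE is rounded from XL_CONSTS.

```python
def blend(a, b, factor):
    """Generic linear blend: factor 0.0 returns a, factor 1.0 returns b."""
    return [(1 - factor) * x + factor * y for x, y in zip(a, b)]


def colored(color, strength):
    """Per-channel color values scaled by strength, like the colored latent node."""
    return [c * strength for c in color]


WHITE = [18.044, 1.726, 9.311, -8.136]
existing = [0.0, 0.0, 0.0, 0.0]  # stand-in for an existing latent pixel

# Strength 0.5 blended at 0.5 applies only a quarter of the color...
half_half = blend(existing, colored(WHITE, 0.5), 0.5)
# ...which matches full strength blended at 0.25.
quarter = blend(existing, colored(WHITE, 1.0), 0.25)
print(half_half)
print(quarter)
```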

vital ermine Oct 20, 2023, 8:40 AM

#

yes

#

I would think

#

unless it overpowers it

nimble heart Oct 20, 2023, 8:42 AM

#

life pro tip if you clamp the values in addition to the black latent you can straight up make pure black values

#

also the refiner does indeed still work with colored latents without totally breaking down but I'd say its even less useful tbh

vital ermine Oct 20, 2023, 8:43 AM

#

I wonder if the post processing I removed for my look can be done in latent? gonna try

nimble heart Oct 20, 2023, 8:44 AM

#

i still post process

#

that's what that screenshot is

vital ermine Oct 20, 2023, 8:44 AM

#

see, I prefer to get rid of post and work entirely in latent if possible

#

yeah, I can.

nimble heart Oct 20, 2023, 8:45 AM

#

with pure black its not really necessary but with other values it's still allergic to black/white pixels

#

it'll limit itself to like 10%-90% lightness

vital ermine Oct 20, 2023, 8:46 AM

#

lol

nimble heart Oct 20, 2023, 8:47 AM

#

also some other values are fun for specific looks.
Yellow does good for golden hour sunlight
cyan for underwater coral reef stuff
etc

#

cyan can also be used for images that have a lot of sky

#

then white if it's a bright image and SDXL starts darkening things to compensate

vital ermine Oct 20, 2023, 8:48 AM

#

it sure did mix them as red + white is pink

nimble heart Oct 20, 2023, 8:48 AM

#

color mixing doesnt work though

#

if you mix red + blue it doesnt make magenta

vital ermine Oct 20, 2023, 8:48 AM

#

no, it was a nice accident

nimble heart Oct 20, 2023, 8:50 AM

#

sometimes it makes the ocean stuff have black bars. must source movie stills?

vital ermine Oct 20, 2023, 8:51 AM

#

I had that and it is weird

#

something from source, it has to be

nimble heart Oct 20, 2023, 8:51 AM

#

This one's fun. Has that effect I was going for of a research submarine finding some eldritch shit

vital ermine Oct 20, 2023, 8:53 AM

#

nimble heart Oct 20, 2023, 8:53 AM

#

also seems like the black significantly reduces the overtuning effect of "mermaid" to just make hot women with nylon fish tails. must force it to approach the image totally differently

nimble heart Oct 20, 2023, 8:54 AM

#

vital ermine

like a black/yellow mix?

#

also one thing I do if I'm really in the colored weeds is to connect the latent to an ImagePreview node before it samples so you get a little preview of the color you're feeding the sampler each time you run it

vital ermine Oct 20, 2023, 8:55 AM

#

Yep, that is what I do

#

#

WHOA

nimble heart Oct 20, 2023, 8:57 AM

#

white?

vital ermine Oct 20, 2023, 8:57 AM

#

I accidentally went from 0.3 white to 1.0 on the above image

nimble heart Oct 20, 2023, 8:57 AM

#

lol

#

bones r white

vital ermine Oct 20, 2023, 8:57 AM

#

vital ermine

on this image

#

I meant to go to 0.4

#

#

wish I could do 0.35

supple knot Oct 20, 2023, 9:00 AM

#

can I move this file anywhere and it will still work the same? ComfyUI_windows_portable

nimble heart Oct 20, 2023, 9:00 AM

#

ah I could probably change the steps to be 0.05 instead of 0.1

#

I didnt think it'd make a big difference tbh

supple knot Oct 20, 2023, 9:01 AM

#

or does it have to be at the top of a drive?

nimble heart Oct 20, 2023, 9:02 AM

#

vital ermine wish I could do 0.35

should I update the nodes to step at 0.05 or does it not really matter do you think?

#

unless you're talking about the comfyui node which I cant change

supple knot Oct 20, 2023, 9:02 AM

#

I'm getting a error with the qrcode system that I cant fix

#

Error occurred when executing ControlNetLoader:

module 'comfy.sd' has no attribute 'ModelPatcher'

vital ermine Oct 20, 2023, 9:04 AM

#

nimble heart should I update the nodes to step at 0.05 or does it not really matter do you th...

Well, look how HUGE of a difference 0.3 and 0.4 were.

nimble heart Oct 20, 2023, 9:04 AM

#

yea slight color changes affect the composition a lot

#

I pushed to git it should step at 0.05 now

#

I'm not gonna do 0.01 lol

#

should be able to just update it with the manager

vital ermine Oct 20, 2023, 9:05 AM

#

well, for testing I would like to see if .01 matters. I think it might

nimble heart Oct 20, 2023, 9:06 AM

#

you can change it yourself in the file it's just the "step": 0.1 value on the node input thingy

slow yoke Oct 20, 2023, 9:06 AM

#

Hi guys, I hate to interrupt and not sure if this is the correct channel, but I used Realistic Vision and ChilloutMix model, all works fine, but as soon as I switch to SDXL, everything becomes like this. Any help is appreciated!!!! :)))))

nimble heart Oct 20, 2023, 9:07 AM

#

#

just restart the comfy server and it'll take

rustic garnet Oct 20, 2023, 9:07 AM

#

slow yoke Hi guys, I hate to interrupt and not sure if this is the correct channel, but I ...

which sampler, scheduler and model?

supple knot Oct 20, 2023, 9:08 AM

#

in general SD 1.5 is 512 x 512 SDXL is 1024 x 1024 @slow yoke

nimble heart Oct 20, 2023, 9:08 AM

#

nimble heart

hypothetically you could also allow the min/max to be greater than they are too but that's actually insane

#

like how do you have 150% black?

slow yoke Oct 20, 2023, 9:08 AM

#

rustic garnet which sampler, scheduler and model?

sampler is Euler a, model is sd_xl_base_1.0

slow yoke Oct 20, 2023, 9:08 AM

#

supple knot in general SD 1.5 is 512 x 512 SDXL is 1024 x 1024 @slow yoke

thank you, will try

nimble heart Oct 20, 2023, 9:08 AM

#

slow yoke Hi guys, I hate to interrupt and not sure if this is the correct channel, but I ...

that's a VAE thing

vital ermine Oct 20, 2023, 9:09 AM

#

class BSZLatentOffsetXL:
    # {{{
    @classmethod
    def INPUT_TYPES(s):
        return {
            "required": {
                "latent": ("LATENT",),
                "offset": ("FLOAT", {
                    "default": 0.0,
                    "min": -1.0,
                    "max": 1.0,
                    "step": 0.01,
                }),
            }
        }

nimble heart Oct 20, 2023, 9:09 AM

#

idk what UI you're using but make sure you're using automatic or the XL vae

nimble heart Oct 20, 2023, 9:09 AM

#

vital ermine class BSZLatentOffsetXL: # {{{ @classmethod def INPUT_TYPES(s): ...

negative black doesnt work I dont think. makes like brown?

rustic garnet Oct 20, 2023, 9:09 AM

#

hm, weird, it looks like it is not fully denoised. But yeah, you definitely should use a resolution of at least 1024x1024, otherwise the image will look very ugly

#

see here for a list of resolutions that work well with SDXL: https://www.reddit.com/r/StableDiffusion/comments/15c3rf6/sdxl_resolution_cheat_sheet/
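
For a rough idea of what such a list looks like, the sketch below enumerates sizes whose pixel count stays close to 1024x1024. The multiple-of-64 step and the 5% tolerance are assumptions chosen for illustration; this does not reproduce the linked cheat sheet.

```python
def near_one_megapixel(step: int = 64, tolerance: float = 0.05):
    """List (width, height) pairs, both multiples of `step`, whose area is
    within `tolerance` of 1024 * 1024 pixels."""
    target = 1024 * 1024
    sizes = []
    for width in range(512, 2048 + 1, step):
        for height in range(512, 2048 + 1, step):
            if abs(width * height - target) <= tolerance * target:
                sizes.append((width, height))
    return sizes


for width, height in near_one_megapixel():
    print(f"{width}x{height}  (aspect {width / height:.2f})")
```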

nimble heart Oct 20, 2023, 9:10 AM

#

I'm 99% sure it's the 1.5 vae on an XL latent

#

which is why it works fine with ChilloutMix

#

somewhere in the UI the VAE is manually set to sd-ft-mse or something

rustic garnet Oct 20, 2023, 9:10 AM

#

for sampler I wouldn't use Euler A. If you want a non-deterministic sampler use some of the Karras DPM SDE samplers. Or simply use DDIM or UniPC for deterministic.

rustic garnet Oct 20, 2023, 9:11 AM

#

nimble heart I'm 99% sure it's the 1.5 vae on an XL latent

wow, I wouldn't have expected that this is even possible lol

#

I mean, technically, yes, but I would have expected you get pure noise back then

nimble heart Oct 20, 2023, 9:11 AM

#

they're both 8x latent space so they technically work

#

just XL was trained from the ground up so it's totally different
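
The "technically work" part comes down to matching tensor shapes: both model families use a 4-channel latent at 1/8 of the image resolution. A small sketch (the helper name is made up):

```python
def latent_shape(width: int, height: int, batch_size: int = 1) -> tuple[int, int, int, int]:
    """Shape of the latent tensor for an image: 4 channels at 1/8 resolution.

    The same formula applies to SD 1.5 and SDXL, which is why a latent from one
    fits the other's VAE decoder even though the values mean different things.
    """
    return (batch_size, 4, height // 8, width // 8)


print(latent_shape(512, 512))    # (1, 4, 64, 64)
print(latent_shape(1024, 1024))  # (1, 4, 128, 128)
```

Since the shapes line up, a mismatched VAE decodes without raising an error and just produces a wrong-looking image.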

vital ermine Oct 20, 2023, 9:11 AM

#

I guess I changed the wrong thing

#

it only does 0.05 now

nimble heart Oct 20, 2023, 9:12 AM

#

LMAO when you see your neighborhood abyssal demon on your way to work 👋

nimble heart Oct 20, 2023, 9:13 AM

#

vital ermine it only does 0.05 now

refresh the browser too sometimes it gets stuck

vital ermine Oct 20, 2023, 9:13 AM

#

nimble heart Oct 20, 2023, 9:13 AM

#

i think the comfyui web app caches what the nodes' value sliders are

vital ermine Oct 20, 2023, 9:13 AM

#

yeah

nimble heart Oct 20, 2023, 9:13 AM

#

so if you already restarted the server gotta F5 as well

slow yoke Oct 20, 2023, 9:14 AM

#

Thank you guys for all the help, let me give it a try

vital ermine Oct 20, 2023, 9:15 AM

#

I noticed this flash by

#

Failed to download lbpcascade_animeface.xml

slow yoke Oct 20, 2023, 9:15 AM

#

nimble heart idk what UI you're using but make sure you're using automatic or the XL vae

It worked, changing to Automatic erased the problem completely, thank you 🙂

vital ermine Oct 20, 2023, 9:15 AM

#

OUCH, 2011

nimble heart Oct 20, 2023, 9:15 AM

#

vital ermine Failed to download lbpcascade_animeface.xml

not mine

#

despite the stuff looking complicated my nodes are probably the most mundane of all the node packs. just slightly altered existing comfyui nodes for the most part

vital ermine Oct 20, 2023, 9:17 AM

#

yes, 0.33

#

slight change

#

I prefer that one

#

0.34

nimble heart Oct 20, 2023, 9:18 AM

#

that's mostly gonna be seed variance at that point

vital ermine Oct 20, 2023, 9:18 AM

#

0.01 is good stuff

#

my seed is locked as is no memory changes stuff (no-mem sdp)

nimble heart Oct 20, 2023, 9:19 AM

#

actual colors are the same

vital ermine Oct 20, 2023, 9:19 AM

#

don't care it made a good change because the latent color changes content as we know

nimble heart Oct 20, 2023, 9:19 AM

#

vital ermine my seed is locked as is no memory changes stuff (no-mem sdp)

I meant latent color is also a seed of its own

#

its like a variance seed

vital ermine Oct 20, 2023, 9:20 AM

#

yeah. I will stick with .01 changes as I like this the best.

nimble heart Oct 20, 2023, 9:20 AM

#

so the tiniest of changes will affect an image even if the actual color is the same

vital ermine Oct 20, 2023, 9:20 AM

#

yep

#

latent space is a weird, wondrous, and freaky place

nimble heart Oct 20, 2023, 9:22 AM

#

One thing i give XL credit for is most of my "mermaid" things dont have legs. in 1.5 you had to blacklist like feet legs knees etc and it'd still most likely fuck up. XL i dont have to blacklist anything

vital ermine Oct 20, 2023, 9:22 AM

#

2.0/2.1 had legs too

nimble heart Oct 20, 2023, 9:23 AM

#

never used it runs like shit

#

2 is slower than XL

#

if you autocast 2 it just NaNs instantly.

#

and fp32 is like 1/5th the speed of fp16

#

sometimes it makes them red 🤔

supple knot Oct 20, 2023, 9:30 AM

#

I was trying sand castles you all got any good ones

nimble heart Oct 20, 2023, 9:30 AM

#

i saw sytan with some earlier

#

when he was showing off his photography lora

supple knot Oct 20, 2023, 9:31 AM

#

on this channel?

nimble heart Oct 20, 2023, 9:31 AM

#

dont remember

#

think so

#

ugh

#

4k waifu

vital ermine Oct 20, 2023, 9:53 AM

#

nimble heart sometimes it makes them red 🤔

I hate how I am forced to use fp32 for the vae

nimble heart Oct 20, 2023, 9:53 AM

#

on XL?

vital ermine Oct 20, 2023, 9:53 AM

#

yes

nimble heart Oct 20, 2023, 9:53 AM

#

bf16 probably works

#

since its a scale issue on fp16
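
Reading "scale issue" as fp16's narrow dynamic range (an interpretation, not something spelled out in the chat): fp16 tops out at 65504, while bf16 keeps fp32's exponent range and gives up precision instead. The stdlib-only sketch below emulates bf16 by truncating a float32 to its top 16 bits, which is a simplification of real bf16 rounding.

```python
import struct


def to_fp16(value: float) -> float:
    """Round-trip a value through IEEE half precision; inf if it is out of range."""
    try:
        return struct.unpack("<e", struct.pack("<e", value))[0]
    except OverflowError:
        return float("inf")


def to_bf16(value: float) -> float:
    """Emulate bfloat16 by keeping only the top 16 bits of a float32 (truncation)."""
    top_bits = struct.pack(">f", value)[:2]
    return struct.unpack(">f", top_bits + b"\x00\x00")[0]


big = 70000.0  # just above the largest finite fp16 value, 65504
print(to_fp16(big))  # inf
print(to_bf16(big))  # 69632.0 (coarse, but still finite)
```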

vital ermine Oct 20, 2023, 9:53 AM

#

yeah, I use that on comfy now but still the same amount of vram being sucked up

nimble heart Oct 20, 2023, 9:54 AM

#

damn really

vital ermine Oct 20, 2023, 9:54 AM

#

on automatic1111 we have to use the fp32 for vae

nimble heart Oct 20, 2023, 9:54 AM

#

i thought auto had bf16 support

#

sd.next does

vital ermine Oct 20, 2023, 9:55 AM

#

yes, so I wonder why use it? I think speed as bf16 is faster than fp32? I dunno

nimble heart Oct 20, 2023, 9:55 AM

#

yea bf16 should be speed comparable to fp16 I think

#

thought it was supposed to use less mem though

vital ermine Oct 20, 2023, 9:55 AM

#

I despise vlad with a passion. I mean pure hatred the kind you probably do not know. in other words, no thanks.

nimble heart Oct 20, 2023, 9:56 AM

#

interesting.

vital ermine Oct 20, 2023, 9:56 AM

#

trust me, I have a legitimate reason for it so I don't go anywhere near him, or his work.

nimble heart Oct 20, 2023, 9:57 AM

#

I've only spoken to Auto in pull requests and never to vlad so im not sure what the whole deal is

vital ermine Oct 20, 2023, 9:57 AM

#

I have no idea. Auto isn't really in control, his ragtag band of devs is all over the place.

nimble heart Oct 20, 2023, 9:58 AM

#

sd.next has a HF diffusers backend so its pretty nice. UI is a little jank sometimes though

#

idk anything else that supports arbitrary Diffusers models

#

I guess i could write my own CLI script it wouldnt be too hard

vital ermine Oct 20, 2023, 9:59 AM

#

I can't even find any trainers that use diffusers BUT one from hugging face. Really janky but damn the quality of diffusers directly I like it

nimble heart Oct 20, 2023, 10:00 AM

#

but making a full UI with live previews and everything is pain

nimble heart Oct 20, 2023, 10:00 AM

#

vital ermine I can't even find any trainers that use diffusers BUT one from hugging face. Re...

simpletuner?

#

unless you mean loras

vital ermine Oct 20, 2023, 10:00 AM

#

been dead for a long time now

#

I mean all of them, yes and for xl

nimble heart Oct 20, 2023, 10:01 AM

#

vital ermine been dead for a long time now

??

vital ermine Oct 20, 2023, 10:01 AM

#

OneTrainer really took its place

#

I used to be on the ST discord then they said it was dead and I just left it

nimble heart Oct 20, 2023, 10:02 AM

#

this one? https://github.com/bghira/SimpleTuner

#

gets updated all the time lol. I might try to get it working with Lora later once ROCm isnt having a crisis

vital ermine Oct 20, 2023, 10:04 AM

#

Don't know now as it was a while back. They were being asked for someone to take it over but honestly it just never was my thing. for DB I used Shiv's. FT no way in hell am I going to hand curate 3k+ images and captions.

#

This was before XL was even being talked about

nimble heart Oct 20, 2023, 10:06 AM

#

ST can use DeepSpeed now for 24gig cards so if you're doing full checkpoints it might be spicy.

indigo carbon Oct 20, 2023, 10:06 AM

#

nimble heart ST can use DeepSpeed now for 24gig cards so if you're doing full checkpoints it ...

DeepSpeed is very similar to AITemplate

vital ermine Oct 20, 2023, 10:07 AM

#

DeepSpeed spanked me on Windows. Being Microsoft I was shocked but I didn't like being spanked by it. I decided it won, I lost, and moved on.

indigo carbon Oct 20, 2023, 10:07 AM

#

vital ermine DeepSpeed spanked me on Windows. Being Microsoft I was shocked but I didn't lik...

it's not compatible with windows, microsoft are stupid, didn't you know?

nimble heart Oct 20, 2023, 10:07 AM

#

i got rocm's deepspeed fork to actually work but it miscompiles if you use stage 2 cpu offloading

vital ermine Oct 20, 2023, 10:07 AM

#

indigo carbon it's not compatible with windows, microsoft are stupid, didn't you know?

I do now.

soft bone Oct 20, 2023, 10:07 AM

#

this accident has lotr level bloom

nimble heart Oct 20, 2023, 10:08 AM

#

accidents are always fun

indigo carbon Oct 20, 2023, 10:08 AM

#

it's just a shitty version of AITemplate except it's only for LLMs, best optimization for LLMs is exLLaMa; which is x8 speed

nimble heart Oct 20, 2023, 10:08 AM

#

one of my favorite 1.5 images was with the completely wrong settings

#

nimble heart Oct 20, 2023, 10:09 AM

#

indigo carbon it's just a shitty version of AITemplate except it's only for LLMs, best optimiz...

i mean it seems to train diffusion models fine

vital ermine Oct 20, 2023, 10:09 AM

#

I haven't heard a word, for the last month, about the new xformers being released. Supposedly it is done but was waiting on torch.

#

more mem efficient and faster

nimble heart Oct 20, 2023, 10:09 AM

#

yea it's based on Flash Attention 2 now

vital ermine Oct 20, 2023, 10:10 AM

#

problem is needs tensor cores of ampere and ada cards only

indigo carbon Oct 20, 2023, 10:10 AM

#

nimble heart i mean it seems to train diffusion models fine

maybe, idk. WE. NEEDS. exDiffusion

vital ermine Oct 20, 2023, 10:10 AM

#

well, I want it for training

nimble heart Oct 20, 2023, 10:10 AM

#

exllamav2 uses Flash Attention 2 natively but I haven't been able to successfully compile it on rocm yet

#

there's a PR to merge the Flash Attention 2 changes into pytorch 2.2 as well

#

so scaled dot product will get the same speedup eventually too

vital ermine Oct 20, 2023, 10:11 AM

#

sdp is worse than xformers for Nvidia cards

#

especially training

nimble heart Oct 20, 2023, 10:11 AM

#

hypothetically you could directly use the flash attention lib on stable diffusion instead of through xformers/sdp

indigo carbon Oct 20, 2023, 10:12 AM

#

maybe they could make something like exLLaMa for training?

vital ermine Oct 20, 2023, 10:12 AM

#

that would rock

nimble heart Oct 20, 2023, 10:12 AM

#

exllama is hyperoptimized for inference specifically isnt it

vital ermine Oct 20, 2023, 10:12 AM

#

I feel like it is the Commodore64 days again and every single byte counts.

nimble heart Oct 20, 2023, 10:12 AM

#

it compiles a microkernel for the model/context shape/gpu

#

so similar to AIT i guess

indigo carbon Oct 20, 2023, 10:13 AM

#

nimble heart exllama is hyperoptimized for inference specifically isnt it

the x8 speed boost comes from optimized kernels; this is why I first thought it will be easy to make something similar for diffusion

#

but exLLaMa for diffusion seems far for now.

nimble heart Oct 20, 2023, 10:14 AM

#

i mean you already have something like that with AIT. it just doesnt seamlessly compile the kernels for you and just work™️

indigo carbon Oct 20, 2023, 10:15 AM

#

nimble heart i mean you already have something like that with AIT. it just doesnt seamlessly ...

it does just work depending on your system..

nimble heart Oct 20, 2023, 10:15 AM

#

exllama works always. not depending on your system

indigo carbon Oct 20, 2023, 10:15 AM

#

if you have a 3000 series card and above it will work right away

indigo carbon Oct 20, 2023, 10:16 AM

#

nimble heart exllama works always. not depending on your system

true, and the speed up is even higher

nimble heart Oct 20, 2023, 10:16 AM

#

exllama hot-compiles a kernel if your gpu isnt included in the pre-shipped ones

#

so it works on AMD and everything too

#

the first gen takes +30 seconds while it compiles then it's gucci

indigo carbon Oct 20, 2023, 10:16 AM

#

AITemplate compiles engines, exLLaMa compiles kernels right?

nimble heart Oct 20, 2023, 10:16 AM

#

idfk

#

I have no idea what the difference is

#

they call it a kernel in the readme so

indigo carbon Oct 20, 2023, 10:17 AM

#

they are different, optimized kernels are more flexible than optimized engines

#

and faster in this case

vital ermine Oct 20, 2023, 10:17 AM

#

I do not get it. I trained YET again and still blurry but none of my data was blurry

nimble heart Oct 20, 2023, 10:17 AM

#

SHARK does the compiled kernel thing for diffusion models too

#

but it's pretty fiddly I've found.

vital ermine Oct 20, 2023, 10:18 AM

#

if I switch from base the blurry goes away but base is what I trained on

nimble heart Oct 20, 2023, 10:18 AM

#

blacklist "blurry"

indigo carbon Oct 20, 2023, 10:19 AM

#

nimble heart SHARK does the compiled kernel thing for diffusion models too

by how much is it faster than pure PyTorch? because exLLaMa is about 8x as fast in my experience

nimble heart Oct 20, 2023, 10:19 AM

#

comparing exllama to transformers is apples oranges

#

exllama runs on quantized models

vital ermine Oct 20, 2023, 10:20 AM

#

I can't release this like this having people type blurry in the neg

nimble heart Oct 20, 2023, 10:20 AM

#

you cant quantize SD afaik so it'll not be 8x

vital ermine Oct 20, 2023, 10:20 AM

#

let's see if it works

#

I typed blurry in the neg

nimble heart Oct 20, 2023, 10:20 AM

#

and if you're talking about exllama 1, that's a 4bit quant which is substantially smaller than a fully fp16 PyTorch model

#

so it's going to be a lot faster

#

exllama2 can use mixed precision quants, and 8bit cuts the speed in like half compared to the default 4bit
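
The size side of that argument is simple arithmetic: weight storage scales with bits per weight. A back-of-the-envelope sketch (hypothetical helper; it ignores quantization metadata such as scales and zero points, so real files are somewhat larger):

```python
def weight_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (10^9 bytes), ignoring quantization
    metadata such as scales and zero points."""
    return params_billion * bits_per_weight / 8


for bits in (16, 8, 4):
    print(f"7B at {bits}-bit: {weight_size_gb(7, bits):.1f} GB")
```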

indigo carbon Oct 20, 2023, 10:21 AM

#

idk, I feel like if we'll have something like exLLaMa for diffusion we could get close to instant image generation

vital ermine Oct 20, 2023, 10:21 AM

#

nimble heart Oct 20, 2023, 10:22 AM

#

i mean it might be possible to do an exllama2 approach mixed-precision quant by brute-forcing all the layers to find which can be tuned down without spitting NaNs

#

so you could have like 9.82bit SDXL

#

or whatever
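
A fractional figure like that would just be the parameter-weighted average of per-layer bit widths. The sketch below uses made-up layer sizes and bit widths purely for illustration; it says nothing about which SDXL layers could actually tolerate lower precision.

```python
def average_bits(layers: list[tuple[int, int]]) -> float:
    """Weighted mean bits per weight over (parameter_count, bits) pairs."""
    total_params = sum(count for count, _ in layers)
    total_bits = sum(count * bits for count, bits in layers)
    return total_bits / total_params


# Made-up layer sizes and bit widths, purely for illustration.
layers = [(300, 16), (500, 8), (200, 4)]
print(average_bits(layers))  # 9.6
```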

uncut fiber Oct 20, 2023, 10:22 AM

#

13b 5q is very slow in comparison with 7b
Only optional, otherwise 50% users cant use it.

nimble heart Oct 20, 2023, 10:23 AM

#

yea 7b models are lightning fast even at 8bit

#

7b 4bit is like 100T/s

uncut fiber Oct 20, 2023, 10:23 AM

#

yes should i install xformers?

soft bone Oct 20, 2023, 10:23 AM

#

vital ermine I do not get it. I trained YET again and still blurry but non of my data was bl...

i get the same problem. fixed when i switched to prodigy

nimble heart Oct 20, 2023, 10:23 AM

#

uncut fiber yes should i install xformers?

for what?

uncut fiber Oct 20, 2023, 10:24 AM

#

i got it as option in gradio for oob*ga

indigo carbon Oct 20, 2023, 10:24 AM

#

I get 8T/s with GPTQ and 62T/s with exLLaMa... I would love to see the day this will happen to diffusion models

nimble heart Oct 20, 2023, 10:24 AM

#

yea that's a pretty spicy gain.

vital ermine Oct 20, 2023, 10:25 AM

#

soft bone i get the same problem. fixed when i switched to prodigy

Is that something ADAfactor does?

nimble heart Oct 20, 2023, 10:25 AM

#

try exllama 2 with flash attention if you can

#

should be even faster

#

if it's a 4bit gptq model you dont need to convert it to exl2

indigo carbon Oct 20, 2023, 10:25 AM

#

nimble heart try exllama 2 with flash attention if you can

I tried that on Linux. it was a pain to do but it got to over 80T/s

soft bone Oct 20, 2023, 10:25 AM

#

vital ermine Is that something ADAfactor does?

no its an alternative to it. like adamw

nimble heart Oct 20, 2023, 10:26 AM

#

uncut fiber i got it as option in gradio for oob*ga

yea, just know that not all backends will use it.

#

like exllama 2 doesnt use xformers

vital ermine Oct 20, 2023, 10:26 AM

#

soft bone no its an alternative to it. like adamw

No, I know what it is just wondering if this blurry is an issue with adafactor?

soft bone Oct 20, 2023, 10:26 AM

#

oh idk i only used adam before prodigy

uncut fiber Oct 20, 2023, 10:26 AM

#

o.k. i am now using gguf models. llama.cpp

nimble heart Oct 20, 2023, 10:26 AM

#

not sure about llama.cpp

#

since it offloads xformers might not make a difference?

vital ermine Oct 20, 2023, 10:26 AM

#

soft bone oh idk i only used adam before prodigy

adafactor is adam it is a variant that tries to find the right LR as it goes.

#

ada short for adam

nimble heart Oct 20, 2023, 10:27 AM

#

lol the adam mixed-cpu kernel is exactly what miscompiled in DeepSpeed for me

soft bone Oct 20, 2023, 10:27 AM

#

could be rank or lr doing that to you as well. or an optimization like mem effn attn

indigo carbon Oct 20, 2023, 10:27 AM

#

so ideally; exLLaMa is 8x as fast as GPTQ, so if you have something identical to exLLaMa for diffusion- your it/s should also be x8 as fast

vital ermine Oct 20, 2023, 10:28 AM

#

prodigy is so mem hungry I can barely get BS2-4 (forgot now)

nimble heart Oct 20, 2023, 10:28 AM

#

indigo carbon so ideally; exLLaMa is 8x as fast as GPTQ, so if you have something identical to...

if you can somehow quant a diffusion model down to 4bits without fucking it up then maybe

vital ermine Oct 20, 2023, 10:28 AM

#

well, that is straight up Dreambooth

nimble heart Oct 20, 2023, 10:28 AM

#

else no way in hell

#

probably more like 2-2.5x

indigo carbon Oct 20, 2023, 10:29 AM

#

nimble heart probably more like 2-2.5x

that's what we get with AITemplate

vital ermine Oct 20, 2023, 10:29 AM

#

I finally got it to train and it blurs like that :/

soft bone Oct 20, 2023, 10:29 AM

#

vital ermine prodigy is so mem hungry I can barely get BS2-4 (forgot now)

yeah im using bs1 but its working great for styles

nimble heart Oct 20, 2023, 10:29 AM

#

indigo carbon that's what we get with AITemplate

then I'm not sure it'll matter unless you can develop a way to quantize diffusion models without producing NaNs

#

exllama's black magic is in its handling of the quantization

indigo carbon Oct 20, 2023, 10:31 AM

#

exLLaMa also has its own attention? I tried it with and without Xformers and it was a little faster without Xformers

nimble heart Oct 20, 2023, 10:32 AM

#

no diea

#

idea

#

exllama 2 uses flash attention 2

#

I've never used the original exllama

#

it uses flash attention or something else as a fallback. not sure what the fallback is

vital ermine Oct 20, 2023, 10:33 AM

#

soft bone yeah im using bs1 but its working great for styles

For prodigy what are you using for its Optimizer extra arguments?

indigo carbon Oct 20, 2023, 10:33 AM

#

I think exLLaMa 1 just has a built in version of Xformers

soft bone Oct 20, 2023, 10:33 AM

#

vital ermine For prodigy what are you using for its Optimizer extra arguments?

decouple=True weight_decay=0.35 d_coef=2 use_bias_correction=True
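
A minimal sketch of how a space-separated string like the one above can be turned into keyword arguments for an optimizer. Splitting on `=` and using `ast.literal_eval` is an assumption about how a training script might handle it, not a description of any particular trainer's code; `parse_optimizer_args` is a made-up helper name.

```python
import ast


def parse_optimizer_args(arg_string: str) -> dict:
    """Turn 'key=value key=value ...' into a dict with Python-typed values."""
    kwargs = {}
    for item in arg_string.split():
        key, value = item.split("=", 1)
        # literal_eval converts "True" -> True, "0.35" -> 0.35, "2" -> 2
        kwargs[key] = ast.literal_eval(value)
    return kwargs


kwargs = parse_optimizer_args(
    "decouple=True weight_decay=0.35 d_coef=2 use_bias_correction=True"
)
print(kwargs)
# With the prodigyopt package installed, these could then be passed along
# the lines of Prodigy(model.parameters(), lr=1.0, **kwargs) (not run here).
```

Parsing this way keeps booleans and numbers as real Python values rather than strings, which matters because an optimizer would treat the string "False" as truthy.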

indigo carbon Oct 20, 2023, 10:34 AM

#

nimble heart then I'm not sure it'll matter unless you can develop a way to quantize diffusio...

even if we'll be able to quantize SDXL in the future, I'm not sure if AITemplate can make engines for 4bit models

nimble heart Oct 20, 2023, 10:35 AM

#

i dont think so, which is why GPTQ and exllama became a thing

#

so we'll need something like an SDQ first then after an ExSD can be made

indigo carbon Oct 20, 2023, 10:36 AM

#

nimble heart so we'll need something like an SDQ first then after an ExSD can be made

hope this happens one day

#

I guess it's kinda bound to happen eventually, but the question is when

vital ermine Oct 20, 2023, 10:37 AM

#

wow, 0.35 for the decay?

#

constant for the scheduler or cosine? annealing is too much mem

#

I am stumped as to which one?

soft bone Oct 20, 2023, 10:40 AM

#

vital ermine constant for the scheduler or cosine? annealing is too much mem

this is what im playing with rn

📎 prodigy-styles.json

vital ermine Oct 20, 2023, 10:40 AM

#

soft bone this is what im playing with rn

Thank you

#

Those are 500 step difference checkpoints

nimble heart Oct 20, 2023, 10:43 AM

#

sometimes I wonder if I should've gotten the 16 core...

vital ermine Oct 20, 2023, 10:44 AM

#

Next year I'm getting zen 5, and 16c/32t is what I am after. For python stuff it will not help, but it does if I do anything else along with it

nimble heart Oct 20, 2023, 10:44 AM

#

i have zen 4 12c and it compiles pretty fast

#

tbh not often i can peg all 24 threads like that

vital ermine Oct 20, 2023, 10:45 AM

#

I went from 1600 to 5600 a few months ago and glad I did but I already have issues with just 6

nimble heart Oct 20, 2023, 10:45 AM

#

most of the time it only uses 24 for a few seconds then decreases as jobs complete

vital ermine Oct 20, 2023, 10:45 AM

#

6 cores 12t not enough

nimble heart Oct 20, 2023, 10:45 AM

#

so unless you're compiling the linux kernel or pytorch it'll fall off after 12c

soft bone Oct 20, 2023, 10:45 AM

#

interesting

nimble heart Oct 20, 2023, 10:46 AM

#

hey flash attention actually compiled?

#

now lets see if it dies

#

:O

vital ermine Oct 20, 2023, 10:47 AM

#

kill it, kill it real good

nimble heart Oct 20, 2023, 10:47 AM

#

okay it's not faster but exllama didn't bitch about Flash Attention not being installed

uncut fiber Oct 20, 2023, 10:47 AM

#

anything to enable using only gguf models @nimble heart ?

nimble heart Oct 20, 2023, 10:48 AM

#

no idea I don't use gguf

uncut fiber Oct 20, 2023, 10:49 AM

#

o.k. what model i can afford and you can suggest having 16GB RAM and 8 VRAM?

vital ermine Oct 20, 2023, 10:52 AM

#

double both of those

#

min

fierce hollow Oct 20, 2023, 10:52 AM

#

nimble heart hey flash attention actually compiled?

do you mean like compiled with visual studio, cuda, and the whole circus installed or did they add like windows wheels or something to pip

nimble heart Oct 20, 2023, 10:53 AM

#

i mean with ROCm on Linux

fierce hollow Oct 20, 2023, 10:53 AM

#

oh that's a whole other can of worms I guess

nimble heart Oct 20, 2023, 10:53 AM

#

should work just fine on nvidia

vital ermine Oct 20, 2023, 10:54 AM

#

I know my next pc will have 64 or 128gb of RAM right off, all slots filled.

nimble heart Oct 20, 2023, 10:55 AM

#

anyways the exllama "you dont have flash attention" warning went away but it's functionally identical. Same vram usage, perf, etc. So I'm guessing it fails and falls back later or the rocm flash attention is just all stubbed functions to pass tests currently.

uncut fiber Oct 20, 2023, 10:57 AM

#

will try sdp_attention and see if any difference

fierce hollow Oct 20, 2023, 10:57 AM

#

you can check by importing attn from exllamav2

#

like uhh

#

python -c "from exllamav2 import attn; print(attn.has_flash_attn)"
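
A related check that needs nothing beyond the standard library: ask Python whether the attention packages can be found at all. This is only a sketch. It reports whether a package is installed, not whether its kernels actually run on the GPU, so it cannot settle the "stubbed functions" suspicion raised earlier. The helper name `module_available` is made up for illustration.

```python
import importlib.util


def module_available(name: str) -> bool:
    """True if a top-level package with this name can be found by the importer."""
    return importlib.util.find_spec(name) is not None


for name in ("flash_attn", "xformers"):
    print(f"{name}: {'installed' if module_available(name) else 'not found'}")
```

If a package shows as installed but performance and VRAM use are unchanged, that points toward a runtime fallback rather than a missing install.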

nimble heart Oct 20, 2023, 11:01 AM

#