SDNext WebUI on Intel ARC | Intel Insiders Community | Page 3

chrome bone Aug 9, 2023, 3:38 AM

#

oh wait, i remember having higher it/s when using ipex thats not compiled with AOT

#

just slow start but speed is better

keen marsh Aug 9, 2023, 3:39 AM

#

yup

#

But it could be the commit though

#

they changed some stuff about memory etc.

#

likely still working on it so we are getting the pure upstream when compiling, some stuff might be worse atm

chrome bone Aug 9, 2023, 3:40 AM

#

ill patiently wait them to fix it then

#

though im sure a lot of ppl out there are keen to try SD in native windows

keen marsh Aug 9, 2023, 3:41 AM

#

if I am up to it....I may try and compile another day...nah, this is fine. Diffusers seems to work well, just the vae's don't show up

chrome bone Aug 9, 2023, 3:41 AM

#

yeah

#

pain peko

keen marsh Aug 9, 2023, 3:41 AM

#

I wonder if they have a diffuser version for some of these vae's?

chrome bone Aug 9, 2023, 3:42 AM

#

the models are the same

#

just the pipelines are written differently

#

afaik

#

so some samplers and plugins you cannot use when choosing diffusers backend

keen marsh Aug 9, 2023, 3:43 AM

#

I think it may be safetensors, some are still ckpt and pt files

keen marsh Aug 9, 2023, 3:44 AM

#

chrome bone so some samplers and plugins you cannot use when choosing diffusers backend

makes sense.

tall grove Aug 9, 2023, 7:34 AM

#

Hm so linux still quite a bit faster?

keen marsh Aug 9, 2023, 12:00 PM

#

Diffuser backend is just as fast, and if you use the prebuilt wheels its fast but you have to wait 10-15minutes to start.

coral mulch Aug 9, 2023, 10:54 PM

#

I'm still stuck with the same error I had before, as I genuinely have no clue how to fix it.

keen marsh Aug 9, 2023, 11:12 PM

#

Vipitis seemed to get it to work with just having the conda installed and not deleting python 3. See if he can help you out. A theory I had for why it didn't work for me is i couldnt update python to the latest version and could only use 3.10.6 while conda is newer. 🤷‍♂️

grave condor Aug 9, 2023, 11:12 PM

#

126 error?

#

I got it to switch to 127 for a while.

coral mulch Aug 9, 2023, 11:12 PM

#

When in WSL, Error 139. When I tried windows, it was a dll error.

grave condor Aug 9, 2023, 11:12 PM

#

no clue about WSL

coral mulch Aug 9, 2023, 11:12 PM

#

or one of its dependencies.```

#

For windows.

grave condor Aug 9, 2023, 11:13 PM

#

I got ipex to work for CausalLM inference today but the JIT delay is horrible

keen marsh Aug 9, 2023, 11:13 PM

#

grave condor I got ipex to work for CausalLM inference today but the JIT delay is horrible

I added a wheel up a few posts, requires one file edit in the instructions.

grave condor Aug 9, 2023, 11:14 PM

#

I might try that on Friday. Got a busy day tomorrow. And really need to get to sleep this time

#

Was up past 5 am the last few days

#

alarm at 7.30 which is in almost 6 hours

keen marsh Aug 9, 2023, 11:15 PM

#

They are compiling from a branch and not xpu master, so xpu master adds a file we dont need that causes an error since it doesn't exist in torch

grave condor Aug 9, 2023, 11:15 PM

#

coral mulch ```"C:\Users\dbs_5\OneDrive\Desktop\automatic\venv\lib\site-packages\intel_exten...

it's all about activating the one API environment with the servars script.

keen marsh Aug 9, 2023, 11:15 PM

#

grave condor I might try that on Friday. Got a busy day tomorrow. And really need to get to s...

No doubt, I habe another wheel that doesn't need an edit I may uplaod it

grave condor Aug 9, 2023, 11:16 PM

#

I did the ipex webinar today and it was completely useless. They just talked about CPU stuff and a hyperparameter searching script they implemented.

keen marsh Aug 9, 2023, 11:17 PM

#

If you want to compile yourself, change xpu master to the xpu 2.0 branch in the bat file

grave condor Aug 9, 2023, 11:17 PM

#

no useful information for GPU/xpu and my questions didn't get real answers either.

#

You can't save the JIT kernels to use them again or in other processes is what they confirmed to me.

coral mulch Aug 9, 2023, 11:17 PM

#

grave condor it's all about activating the one API environment with the servars script.

Which I already did before starting.

#

That, and MKL + dcp

keen marsh Aug 9, 2023, 11:18 PM

#

@grave condor#0 what version of python do you have?

grave condor Aug 9, 2023, 11:18 PM

#

3.9.4 I believe

proper cradle Aug 9, 2023, 11:19 PM

#

coral mulch I'm still stuck with the same error I had before, as I genuinely have no clue ho...

don't use --use-ipex if you don't want to troubleshoot ipexrun

keen marsh Aug 9, 2023, 11:19 PM

#

That may be the reason it wouldn't run right with python 3.10.6, i got the same error dan does. Had to I delete and add conda to path(doesn't matter for me since I dont do any real programming)

grave condor Aug 9, 2023, 11:19 PM

#

proper cradle don't use --use-ipex if you don't want to troubleshoot ipexrun

does ipexrun also work for xpu? I might need it for accelerate launch

proper cradle Aug 9, 2023, 11:20 PM

#

xpu mode is available but it's slower than no ipexrun at all

#

cpu mode is the fastest

grave condor Aug 9, 2023, 11:20 PM

#

eh, I will try accelerate launch for the eval script, I believe by accelerate config got the xpu registered.

proper cradle Aug 9, 2023, 11:21 PM

#

accelerate has --use-xpu cmd arg

grave condor Aug 9, 2023, 11:22 PM

#

I am using accelerate.Accelerator.device() right now for a simple device agnostic Implementation

#

haven't tried it on on all three options tho. But wanted to before I push it

coral mulch Aug 9, 2023, 11:44 PM

#

00004-An20apple20atop20a20wooden20table20Analog20photo.png

#

Got it working again.

#

On WSL*

#

00011-A20golden20knight20further20down20the20gravel20trail.png

#

My only issue now

#

Is that line that keeps showing up

#

00014-A20golden20knight20further20down20the20gravel20trail.png

proper cradle Aug 10, 2023, 12:28 AM

#

That line is a weird issue happens only at 1024x1024

#

Diffusers with attention slicing turned on doesn't have that issue

coral mulch Aug 10, 2023, 12:29 AM

#

I have attention slicing on and that is happening.

proper cradle Aug 10, 2023, 12:29 AM

#

Attention slicing off = Scaled Dot Product
Attention slicing on = Diffusers

#

Try turning it off and reload the model

#

It'a a weird issue and it doesn't go away without a complete restart

#

And it only happens at 1024x1024

#

768x1024 or 1024x1536 don't have this issue

coral mulch Aug 10, 2023, 12:33 AM

#

00019-A20golden20knight20down20a20pathway20lora20pixel.png

#

00020-A20golden20knight20down20a20pathway20lora20pixel.png

#

00022-A20knight20blocks20your20path20down20the20forest.png

#

768x768 works rather well.

proper cradle Aug 10, 2023, 12:36 AM

#

Try 1080x1080

coral mulch Aug 10, 2023, 12:37 AM

#

00025-A20knight20blocks20your20path20down20the20forest.png

#

Yeah, 1080x1080 is fine

proper cradle Aug 10, 2023, 12:38 AM

#

This happens in all models on IPEX with 1024x1024

coral mulch Aug 10, 2023, 12:38 AM

#

00026-A20knight20wearing20gleaming20golden20armor20guarding20the.png

proper cradle Aug 10, 2023, 12:38 AM

#

I couldn't find why this happens exactly at 1024x1024

coral mulch Aug 10, 2023, 12:39 AM

#

I'll probably do 1280x720 for nice 16:9 images

#

Or 1024x768

proper cradle Aug 10, 2023, 12:40 AM

#

I go straight for 1920x1080 on SDXL

#

Then regenerate it with Img2Img at 3840x2160

coral mulch Aug 10, 2023, 12:41 AM

#

I cannot do 1920x1080

proper cradle Aug 10, 2023, 12:41 AM

#

Here is an example image i generated on my A770

#

8GB?

coral mulch Aug 10, 2023, 12:42 AM

#

proper cradle Aug 10, 2023, 12:42 AM

#

proper cradle Here is an example image i generated on my A770

This needs 12 GB

coral mulch Aug 10, 2023, 12:42 AM

#

Model CPU offload is used

#

Mine still seems to over-use sysram

#

Vram isnt an issue

proper cradle Aug 10, 2023, 12:42 AM

#

No offload, all move options are on, VAE slicing and VAE tiling is on

coral mulch Aug 10, 2023, 12:42 AM

#

And not attention slicing?

proper cradle Aug 10, 2023, 12:43 AM

#

Attention slicing off = Scaled Dot Product

coral mulch Aug 10, 2023, 12:43 AM

#

And this is without LORA right?

proper cradle Aug 10, 2023, 12:43 AM

#

Without

coral mulch Aug 10, 2023, 12:43 AM

#

Hm.

#

#

proper cradle Aug 10, 2023, 12:44 AM

#

proper cradle Attention slicing off = Scaled Dot Product

Added a patch to dynamically slice it to keep under 4GB
https://github.com/vladmandic/automatic/commit/9d17cf4c122b98b25b8cb9e3388c1a75df68cdb2

GitHub

IPEX Diffusers fix can't allocate 4GB+ with SDP · vladmandic/automa...

coral mulch Aug 10, 2023, 12:44 AM

#

Do you use these?

proper cradle Aug 10, 2023, 12:45 AM

#

coral mulch

Like this

proper cradle Aug 10, 2023, 12:45 AM

#

coral mulch Do you use these?

NO

coral mulch Aug 10, 2023, 12:45 AM

#

Alright.

proper cradle Aug 10, 2023, 12:45 AM

#

They are FP32

coral mulch Aug 10, 2023, 12:45 AM

#

Well I don't use them.

#

I was just wondering is all.

#

Didn't know they were FP32.

#

https://pastebin.com/rVVNqRJ3

Pastebin

dan9070@dbs580:~$ cd automatic && ./webui.sh --use-ipexCreate and a...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

#

I get an out of resources error when trying to generate with those diffuser settings.

#

No refiner, just a pixel art LORA.

proper cradle Aug 10, 2023, 12:51 AM

#

Your RAM usage is 15 GB but it still runs out of RAM?

coral mulch Aug 10, 2023, 12:51 AM

#

WSL is set to a ram limit of 24GB with a swap of 40GB.

#

When fully loaded it looks like this.

proper cradle Aug 10, 2023, 12:53 AM

#

Wait, your GPU dies before it runs out of resources

#

Device Not Found error when trying to load a Lora

coral mulch Aug 10, 2023, 12:53 AM

#

Let me try running a 1920x1080 image without a LORA.

#

Generation was a success.

#

Something's wrong with LORAs.

proper cradle Aug 10, 2023, 12:56 AM

#

Lora support is still experimental in diffusers

coral mulch Aug 10, 2023, 12:56 AM

#

That's very disappointing.

#

Yep, I can do 1920x1080 with the refiner included.

#

I really wanted to use loras, though.

proper cradle Aug 10, 2023, 12:59 AM

#

Try lower res

coral mulch Aug 10, 2023, 12:59 AM

#

proper cradle Aug 10, 2023, 12:59 AM

#

1024x1536 is pretty stable

coral mulch Aug 10, 2023, 1:01 AM

#

Didn't work. Out of resources.

#

I don't even think it will generate a 1024x1024 image with a LORA.

#

Nope.

#

Only way to do it is through model CPU offload.

#

Yep. With model CPU offload I can do 1080x1080 images with loras enabled.

#

#

00036-A20knight20blocks20your20path20down20the20dirt.png

restive parcel Aug 10, 2023, 2:55 AM

#

my linux setup is so borked I get "out of resources" at anything above 1024, following SDXL optimization guide. 64gb sys ram, a770 LE

#

+nothing looks good, so i'm gonna have to review all of my setup thonk

restive parcel Aug 10, 2023, 4:35 AM

#

fixed my linux setup Honma_Yay except for hires fix is not working at all, but hopefully more stuff will get updated to diffusers backend and then i won't have to

#

if I could do training on my card, I could just make my own loras...

#

thonk

#

thas prolly not good right

coral mulch Aug 10, 2023, 4:56 AM

#

restive parcel thas prolly not good right

You renamed it according to the guide, I assume.

#

Rename it back to that, and set the VAE loading precision to be FP32.

restive parcel Aug 10, 2023, 5:01 AM

#

so the instructions have changed slightly since then?

#

ahhh shoot, getting black images with SDXL again

#

or maybe its only CounterfeitXL getting black images

#

refiner isn't working at all though...

#

refiner: disabled

#

tried rebooting a few times

chrome bone Aug 10, 2023, 5:19 AM

#

huh

#

you need to use fp16 fixed vae for counterfeitxl

#

the baked in vae never worked for me

#

also set precision to fp16

#

there is no need to upcast

restive parcel Aug 10, 2023, 5:28 AM

#

yeah i'm using the fixed vae and precision type BF16

#

i tried fp16 and it didn't work any differently

chrome bone Aug 10, 2023, 5:29 AM

#

then i have no idea why

#

specifically i didnt set anything about precision

#

so bf16 may or may not work, probably disty knows better

restive parcel Aug 10, 2023, 5:30 AM

#

I need to also ask disty about xpu-smi

#

I can't get it to output gpu stats

#

the whole output is blank except for frequency and power

keen marsh Aug 10, 2023, 5:51 AM

#

restive parcel I need to also ask disty about xpu-smi

You have to run it with sudo or root.

restive parcel Aug 10, 2023, 5:51 AM

#

I did

#

same output

keen marsh Aug 10, 2023, 5:53 AM

#

Not at computer, but are you running it with the watch command that pings it every few seconds? I found that it would kinda glitch out a bit at first before it started working.

restive parcel Aug 10, 2023, 5:54 AM

#

I am not, how do I use that?

keen marsh Aug 10, 2023, 5:55 AM

#

Its something like this i think $ watch -n <interval> <command>
Replace <interval> with time interval at which you want command to repeat, in seconds. Replace <command> with command you want to repeat.

For example, if you want to run top command every 5 seconds, type following command −

$ watch -n 1 (or the number of seconds to run it. I don't know it by heart I will look it up real quick

#

Dunno how all that extra stuff added to my post, but I think thats it

restive parcel Aug 10, 2023, 5:58 AM

#

oh that's really neat, thank!

keen marsh Aug 10, 2023, 5:59 AM

#

No problem

restive parcel Aug 10, 2023, 5:59 AM

#

I still don't get utilization, but at least its eventually giving me memory use

keen marsh Aug 10, 2023, 5:59 AM

#

Yeah, it seems a bit glitchy. Sometimes making the prompt window bigger fixes it lol

restive parcel Aug 10, 2023, 6:20 AM

#

oh yay, i'm starting to get stuff DinaKEK

00023-anime20naruto20shippuden20final20fantasy20Sephiroth20long20katana.png

restive parcel Aug 10, 2023, 7:06 AM

#

oh, I wasn't expecting to see the line on Original backend with an older model

#

and now I'm back where I was before, can't render using the old backend

#

thonk

#

and i'm going blind from a migraine, so I guess i'll try more another day

#

I thought it started working on its own but something is very wrong DinaKEK

00035-Zweihander20sword204k20ultra20HD20photography2055mm20lens.png

coral mulch Aug 10, 2023, 12:16 PM

#

restive parcel yeah i'm using the fixed vae and precision type BF16

Use FP16 precision.

#

Use 1080x1080 resolution. Do not use 1024.

#

Set channelslast as well

#

#

This genuinely seems like the best overall setup on WSL

tall grove Aug 10, 2023, 12:18 PM

#

Isn't the current sd xl implementation flawed in the web ui? Atleast I saw an open issue working on something to do with it.

coral mulch Aug 10, 2023, 12:19 PM

#

00053-A20knight20blocks20your20path20to20the20castle.png

coral mulch Aug 10, 2023, 12:19 PM

#

tall grove Isn't the current sd xl implementation flawed in the web ui? Atleast I saw an op...

Some things just don't work.

#

LORAs for example do not work with sequential CPU offload, and will not run without model CPU offload.

tall grove Aug 10, 2023, 12:20 PM

#

https://github.com/AUTOMATIC1111/stable-diffusion-webui/pull/12377

GitHub

Alternative refiner implementation by AUTOMATIC1111 · Pull Request ...

Description

two settings on Stable Diffusion page: Refiner checkpoint and Refiner switch at.
first lets you select a model.
second lets you select a ratio.
Runs two rounds of sampling: one for Ref...

coral mulch Aug 10, 2023, 12:20 PM

#

Well I'm using vladmantic's automatic webUI

#

Not that.

#

A different fork entirely

#

https://github.com/vladmandic/automatic\

GitHub

GitHub - vladmandic/automatic: SD.Next: Advanced Implementation of ...

SD.Next: Advanced Implementation of Stable Diffusion - GitHub - vladmandic/automatic: SD.Next: Advanced Implementation of Stable Diffusion

tall grove Aug 10, 2023, 12:20 PM

#

Yeah but its a fork from it

#

Did the sd xl implementation not come from the base?

coral mulch Aug 10, 2023, 12:21 PM

#

tall grove Yeah but its a fork from it

It's quite different, though. Vladmantic's fork has 1500 commits ahead, 1200 commits behind.

tall grove Aug 10, 2023, 12:21 PM

#

Yeah but did he make the sd xl support or move it from main

coral mulch Aug 10, 2023, 12:21 PM

#

I don't think it came from the original branch

tall grove Aug 10, 2023, 12:22 PM

#

Atleast the main branch seems to be working on a better solution

#

Actually not sure

coral mulch Aug 10, 2023, 12:23 PM

#

00056-A20draconian20knight20blocks20your20path20to20the.png

tall grove Aug 10, 2023, 12:23 PM

#

All I see is comfy ui us better for some reason

coral mulch Aug 10, 2023, 12:23 PM

#

Voxel and pixel art loras are great, man.

tall grove Aug 10, 2023, 12:24 PM

#

Well I cant mess with this for a month or so so hopefully everything is sorted by then

keen marsh Aug 10, 2023, 12:31 PM

#

restive parcel I thought it started working on its own but something is very wrong <:DinaKEK:90...

In native windows I have noticed that some samplers glitch in diffusers (so far with sd1.5) unipc doesn't work at all. Euler doesnt work with standard and Euler a with diffusers Maybe try messing with samplers, might not be the same issue though. Haven't gotten sny lines yet with sdxl, but I am using cpu model offload and have all the ipex optimizations enabled as they seem to help in windows.

proper cradle Aug 10, 2023, 12:33 PM

#

tall grove Yeah but its a fork from it

SDNext is a complete rewrite at this point

proper cradle Aug 10, 2023, 12:34 PM

#

tall grove Did the sd xl implementation not come from the base?

No

#

A1111 was 3 weeks late and A1111's SDXL implementation is terrible

keen marsh Aug 10, 2023, 12:36 PM

#

tall grove Yeah but its a fork from it

Sdnext was vlad diffusion which was originally a fork but changed the name because it became much different. Sdnext is better updated and keeps GPUs other than Nvidia in mind. Most extensions will work with both though, and there is a refiner extension but I have never tried it.

proper cradle Aug 10, 2023, 12:36 PM

#

coral mulch LORAs for example do not work with sequential CPU offload, and will not run with...

Works on the dev branch with Attention Slicing turned on and no offload.

coral mulch Aug 10, 2023, 12:37 PM

#

proper cradle Works on the dev branch with Attention Slicing turned on and no offload.

How do I git checkout the dev branch?

#

What's the URL?

proper cradle Aug 10, 2023, 12:37 PM

#

Akane lora, 2048x3072

coral mulch Aug 10, 2023, 12:37 PM

#

00060-A20draconic20knight20blocks20your20path20to20the.png

proper cradle Aug 10, 2023, 12:37 PM

#

git checkout -b dev but it will be merged soon anyway

coral mulch Aug 10, 2023, 12:38 PM

#

Very good.

keen marsh Aug 10, 2023, 12:39 PM

#

One thing I dont like, is sdnext disables control net when starting with sdxl, which sucks when switching back and forth to sd1.5.

coral mulch Aug 10, 2023, 12:39 PM

#

keen marsh One thing I dont like, is sdnext disables control net when starting with sdxl, w...

This is not something that happened to me.

#

It stayed enabled even when I swapped to diffusers.

keen marsh Aug 10, 2023, 12:39 PM

#

It disables when starting the ui

proper cradle Aug 10, 2023, 12:39 PM

#

It will get disabled if you restart

coral mulch Aug 10, 2023, 12:40 PM

#

Btw disty

#

once I checkout to that branch

keen marsh Aug 10, 2023, 12:40 PM

#

Waiting for them to add the controlnet sdxl models but seems they are too big

coral mulch Aug 10, 2023, 12:40 PM

#

would I just do ./webui.sh --use-ipex --upgrade --reinstall

proper cradle Aug 10, 2023, 12:41 PM

#

do a git pull and this should be fine

tall grove Aug 10, 2023, 12:41 PM

#

oh right

#

man sd xl has so much potential

coral mulch Aug 10, 2023, 12:42 PM

#

tall grove man sd xl has so much potential

An amazing model.

#

So far I've been exceedingly impressed with it.

tall grove Aug 10, 2023, 12:43 PM

#

just seems to be an ass to run

coral mulch Aug 10, 2023, 12:43 PM

#

Does the dev branch resolve the 1024x1024 resolution issue btw?

tall grove Aug 10, 2023, 12:43 PM

#

hopefully that doesnt damper community engagement

proper cradle Aug 10, 2023, 12:43 PM

#

coral mulch Does the dev branch resolve the 1024x1024 resolution issue btw?

That was an issue for a loong time

#

I couldn't find a fix for that

coral mulch Aug 10, 2023, 12:43 PM

#

Why does 1024x1024 specifically cause artifacting, though?

#

That's what I don't understand.

proper cradle Aug 10, 2023, 12:44 PM

#

Same thing happens on original backend too

coral mulch Aug 10, 2023, 12:44 PM

#

Weird.

#

Swapped to the dev branch, enabled sequential CPU offload

#

disabled model CPU offload

#

put on a pixel art LORA.

#

Black images.

#

🤷‍♂️

proper cradle Aug 10, 2023, 12:49 PM

#

coral mulch Swapped to the dev branch, enabled sequential CPU offload

I have fix for that

#

Don't use any offloading in the meantime

coral mulch Aug 10, 2023, 12:50 PM

#

Alright.

proper cradle Aug 10, 2023, 1:03 PM

#

coral mulch Black images.

Pushed the fix

#

git pull

coral mulch Aug 10, 2023, 1:23 PM

#

#

https://pastebin.com/wggGsuwH

Pastebin

10:53:15-061565 ERROR Arguments: args=('task(fasqofyyt9biiuk)', ...

Pastebin.com is the number one paste tool since 2002. Pastebin is a website where you can store text online for a set period of time.

#

Cannot copy out of meta tensor; no data!

#

Why am I getting meta tensor errors now?

#

I had this before, too. Not sure what caused it.

#

Can you not use sequential CPU offload combined with the sequential apply for LORAs?

#

@proper cradle Even with sequential CPU offload off, and LORA set to diffusers default

#

I'm getting meta tensor errors.

#

11:10:51-849166 ERROR    Arguments: args=('task(zf6p5v3burvufxj)', '', '', [], 20, 3, 0, True, False, False, 1, 1, 6, 6,
                         0.7, 1, -1.0, -1.0, 0, 0, 0, 1080, 1080, False, 0.3, 2, 'Latent', 20, 0, 0, 0.8, '', '', [], 0,
                         False, False, 'positive', 'comma', 0, False, False, '', 0, '', [], 0, '', [], 0, '', [], True,
                         False, False, False, 0, False) kwargs={}
11:10:51-850403 ERROR    gradio call: NotImplementedError```

#

Had to fully shutdown and restart WSL in order for it to generate an image with all offload types disabled.

#

Well model CPU offload + sequential apply LORA works.

00002-A20knight20blocks20your20path20to20the20castle.png

#

Yeah, meta tensor errors for sequential only.

#

Did a git pull, states I'm already up to date

#

So no clue 🤷‍♂️

#

dan9070@dbs580:~/automatic$ git pull
Already up to date.
dan9070@dbs580:~/automatic$ git branch
* dev
dan9070@dbs580:~/automatic$```

proper cradle Aug 10, 2023, 2:02 PM

#

coral mulch Did a git pull, states I'm already up to date

git checkout origin/master
git branch -d dev
git checkout origin/dev
git pull

coral mulch Aug 10, 2023, 2:05 PM

#

00007-A20portrait20of20a20knight20wearing20heavy20stone.png

#

HEAD is now at 417ef540 Merge pull request #1971 from Aptronymist/master
dan9070@dbs580:~/automatic$ git branch -d dev
warning: deleting branch 'dev' that has been merged to
'refs/remotes/origin/dev', but not yet merged to HEAD.
Deleted branch dev (was 0a7105d5).
dan9070@dbs580:~/automatic$ git checkout origin/dev
Previous HEAD position was 417ef540 Merge pull request #1971 from Aptronymist/master
HEAD is now at 0a7105d5 Fix SDXL LoRa offloading and SD 1.5 parsing
dan9070@dbs580:~/automatic$ git pull
You are not currently on a branch.
Please specify which branch you want to merge with.
See git-pull(1) for details.

git pull <remote> <branch>

dan9070@dbs580:~/automatic$ git branch

(HEAD detached at origin/dev)
dan9070@dbs580:~/automatic$ git pull

#

That good?

keen marsh Aug 10, 2023, 2:08 PM

#

That just means your not on the main branch iirc. You will need to switch back to update later though i think anyway

coral mulch Aug 10, 2023, 2:09 PM

#

I'm using the dev branch to utilize Sequential CPU offload with Sequential LORA.

keen marsh Aug 10, 2023, 2:09 PM

#

Sequential stopped working for me in windows native version, haven't tried latest commit though.

#

But model offload runs well. Sequential is slower now anyway

coral mulch Aug 10, 2023, 2:10 PM

#

Sequential is meant to be slower.

#

Lol

#

Apologies if I misunderstand git a bit, as I'm not usually one to mess heavily into repositories.

#

@proper cradle Is * (HEAD detached at origin/dev) correct for git branch?

proper cradle Aug 10, 2023, 2:26 PM

#

git checkout dev

coral mulch Aug 10, 2023, 2:27 PM

#

Switched to a new branch 'dev'```

proper cradle Aug 10, 2023, 2:27 PM

#

git pull

coral mulch Aug 10, 2023, 2:27 PM

#

Already up to date.```

proper cradle Aug 10, 2023, 2:27 PM

#

git status

coral mulch Aug 10, 2023, 2:27 PM

#

On branch dev
Your branch is up to date with 'origin/dev'.

nothing to commit, working tree clean```

proper cradle Aug 10, 2023, 2:27 PM

#

This should work

#

coral mulch Aug 10, 2023, 2:31 PM

#

proper cradle

I'm still getting the meta tensor error.

#

Where is this located?

proper cradle Aug 10, 2023, 2:32 PM

#

coral mulch Where is this located?

System info

#

coral mulch Aug 10, 2023, 2:33 PM

#

Identical settings to mine

#

updated: 2023-08-10
hash: 0a7105d5
url: https://github.com/vladmandic/automatic/tree/dev```

proper cradle Aug 10, 2023, 2:41 PM

#

What lora are you using?

#

Kurokawa Akane Lora:



Prompt: (masterpiece, best quality, highres, anime, pixiv), (1girl, kurokawa akane, blue hair, green eyes, medium hair, gradient hair, solo, full body, standing on an abstract water), (bloom, swirling lights, light particles, detailed, 8k), <lora:training_model:1.0>
Negative prompt: (worst quality, low quality:1.4, lowres, blurry), (3d, interlocked fingers, loli, 2girls),
Steps: 40 | Seed: 4107994374 | Sampler: Euler a | CFG scale: 10 | Size: 1024x1536 | Parser: Full parser | Model: SDXL_astreapixieXLAnime_v16 | Model hash: 432e15eb | VAE: sdxl-vae-fp16-fix | Version: 0a7105d | Pipeline: Diffusers | Operations: txt2img | Lora hashes: "training_model: efe6c5dadf89"

Time taken: 1m 46.25s |

GPU active 1016 MB reserved 1358 MB | System peak 341 MB total 16288 MB

coral mulch Aug 10, 2023, 2:42 PM

#

proper cradle What lora are you using?

Just a question.

#

The moment I swapped to this branch, it stopped detecting all of my SDXL safetensors.

#

#

The only one is 1.5.

#

Refreshing does nothing.

proper cradle Aug 10, 2023, 2:43 PM

#

Try setting pipeline to autodetect

#

And refresh

#

coral mulch Aug 10, 2023, 2:44 PM

#

#

Available models: /home/dan9070/automatic/models/Stable-diffusion 1

#

proper cradle Aug 10, 2023, 2:48 PM

#

Does it have the correct perms?

coral mulch Aug 10, 2023, 2:48 PM

#

I don't know what perms would've changed from the previous repo.

#

It detects only SD 1.5.

proper cradle Aug 10, 2023, 2:48 PM

#

Run ls -lh in that folder (Inside WSL)

coral mulch Aug 10, 2023, 2:48 PM

#

-rw-r--r-- 1 dan9070 dan9070 6.5G Aug 9 20:58 dreamshaperXL10_alpha2Xl10.safetensors
-rw-r--r-- 1 dan9070 dan9070 6.5G Aug 9 20:52 sd_xl_base_1.0_0.9vae.safetensors
-rw-r--r-- 1 dan9070 dan9070 5.7G Aug 9 20:52 sd_xl_refiner_1.0_0.9vae.safetensors
-rw------- 1 dan9070 dan9070 4.0G Aug 9 20:44 v1-5-pruned-emaonly.safetensors

#

I think I might know why

#

Set them all to rwxr

#

Nothing changed. They're still not detected.

#

1.5 is, though.

proper cradle Aug 10, 2023, 2:50 PM

#

Remove the dots from file?

#

I've never seen this before

#

remove cache.json

#

Voxel Lora + Pixel Lora with Sequential CPU Offload:
Time taken: 3m 6.21s | GPU active 1915 MB reserved 2242 MB | System peak 1526 MB total 16288 MB

coral mulch Aug 10, 2023, 3:00 PM

#

Fixed.

#

Json deleted, reinstalled the models incase of some weird corruption

#

All models once again are detected.

#

NotImplementedError: Cannot copy out of meta tensor; no data!

#

#

updated: 2023-08-10
hash: 0a7105d5
url: https://github.com/vladmandic/automatic/tree/dev```

#

#

Yeah, I'm stumped now.

#

OH wait

#

upcasting is still on

#

Nope.

#

Still same error, sadly.

#

Even without a lora selected, I get a meta out of tensor error.

#

Guess I'll stick with Model CPU offload for now

#

00002-A20portrait20of20a20knight20wearing20gold20etched.png

coral mulch Aug 10, 2023, 3:54 PM

#

📎 message.txt

#

It seems that with CPU model offload and the refiner model loaded I now get this error.

#

Removing the negative prompt, I now get "IndexError: string index out of range" with the same traceback.

#

I can't even seem to load the model refiner without CPU offload now either

#

Or even use it regardless of what offload method I use.

#

I just get that error.

#

Dev branch moment

#

Went back to main branch. Refiner works.

#

🤷‍♂️

restive parcel Aug 10, 2023, 4:19 PM

#

coral mulch Use FP16 precision.

this ones 1016 x 1016, FP16

#

original backend, sd1. 5 base model

coral mulch Aug 10, 2023, 4:20 PM

#

00023-An20ancient20underground20temple20The20temple20is20etched.png

#

Negative prompt: Mutated. Disfigured. Multiple Limbs. Disfigured weapon/sword. (((More than one)))
Steps: 20 | Seed: 3147534735 | Sampler: DDIM | CFG scale: 6 | Size: 1080x1080 | Parser: Full parser | Model: sd_xl_base_1.0_0.9vae | Model hash: be9edd61 | Refiner: sd_xl_refiner_1.0_0.9vae | Latent sampler: DDIM | Image CFG scale: 6 | Denoising strength: 0.3 | Refiner start: 0.8 | Secondary steps: 20 | Version: 417ef54 | Pipeline: Diffusers | Operations: "refine | txt2img"```

coral mulch Aug 10, 2023, 4:21 PM

#

restive parcel original backend, sd1. 5 base model

Is the same problem at 1080x1080?

restive parcel Aug 10, 2023, 4:22 PM

#

not sure, I went to bed after

keen marsh Aug 10, 2023, 4:53 PM

#

coral mulch Sequential is meant to be slower.

They were about the same for me for a while, but I mean slower than it was before.

proper cradle Aug 11, 2023, 9:12 AM

#

@coral mulch everything got merged to master and refiner issue is fixed too

#

Start the webui with --reinstall if you want to use Sequential offload

coral mulch Aug 11, 2023, 4:27 PM

#

Alright, thank you.

coral mulch Aug 11, 2023, 6:17 PM

#

proper cradle <@204342691964780546> everything got merged to master and refiner issue is fixed...

I actually had to git pull from the master branch, by git branch origin master

#

Then it updated.

#

Sequential LORAs work now.

#

00028-An20ancient20underground20temple20The20temple20is20etched.png

#

Thank you, again.

#

00031-A20closeup20of20a20knight20wearing20plate20armor.png

#

@proper cradle With sequential on, I can use 9GB of my VRAM with the LORA to generate 4096x4096 images.

mellow sparrow Aug 11, 2023, 8:57 PM

#

sequential slows performance down considerably though correct?

coral mulch Aug 11, 2023, 8:58 PM

#

mellow sparrow sequential slows performance down considerably though correct?

It does, at a huge VRAM cost decrease.

mellow sparrow Aug 11, 2023, 8:59 PM

#

have you found a good sweet spot between vram usage and performance? I have 48gb RAM and a A770 16gb VRAM card

#

I was about to check out that pixel art lora too. Pretty neat

tall grove Aug 11, 2023, 9:02 PM

#

how much can you get out of just using the vram?

mellow sparrow Aug 11, 2023, 9:04 PM

#

I've done up to 512x1024, but as soon as I hit 1024x1024 it starts throwing errors...But I havent been using the low vram flags. I tried once and saw a 11x incrase in render time for the same res

tall grove Aug 11, 2023, 9:05 PM

#

seems a bit low tbh unless sd xl uses that much vram

mellow sparrow Aug 11, 2023, 9:06 PM

#

comfyui is supposed to be alot more efficient than auto1111

tall grove Aug 11, 2023, 9:18 PM

#

idk sd.next seems to have diverged a lot from it

restive parcel Aug 11, 2023, 9:18 PM

#

sd.next is very different

proper cradle Aug 11, 2023, 9:39 PM

#

tall grove how much can you get out of just using the vram?

4096x4096 with model shuffling, attention slicig, vae tiling, fp16 vae and vae upcasting false

#

VAE tiling is a must unless you have an Nvidia A100

#

VAE upcasting false = FP16

#

No one should use FP32

#

Attention slicing fixes NaNs above 2032x2032

#

Model shuffling sends unused models to RAM so it won't sit in the VRAM, doing nothing, No performance hit.

tall grove Aug 11, 2023, 9:55 PM

#

Oh that seems more normal.

#

Was beginning to think 16gb was small 😞

#

*too

coral mulch Aug 11, 2023, 10:50 PM

#

I will state however that without model CPU offload or sequential offload it doesn't really work with Loras yet

#

At least on my side.

proper cradle Aug 12, 2023, 12:17 AM

#

coral mulch I will state however that without model CPU offload or sequential offload it doe...

Turn on Attention Slicing

coral mulch Aug 12, 2023, 12:38 AM

#

proper cradle Turn on Attention Slicing

it is on.

mellow sparrow Aug 12, 2023, 1:27 AM

#

is there a trick to getting sdxl refiner working? I get this error when starting up with refiner.py ModuleNotFoundError: No module named 'sgm'

#

base sdxl model is working fine

proper cradle Aug 12, 2023, 9:00 AM

#

mellow sparrow is there a trick to getting sdxl refiner working? I get this error when startin...

refiner.py?

#

Don't use random extensions

#

https://github.com/vladmandic/automatic/wiki/SD-XL

GitHub

SD XL

SD.Next: Advanced Implementation of Stable Diffusion - vladmandic/automatic

coral mulch Aug 12, 2023, 1:33 PM

#

00070-A20corrupted20tentacular20plated20knight20Voxel20style20lora.png

#

Looks like it decided to work now.

#

No offload, it works.

mellow sparrow Aug 12, 2023, 7:05 PM

#

Disty:thanks for the link. Will check it out later

pastel geode Aug 15, 2023, 1:05 AM

#

Any of you tested this?
https://youtu.be/GZLjbTPLCVk

YouTube

Owen Spangler

Stable Diffusion on Intel Arc A770 Installation Tutorial (Vladmandi...

The full list of commands and links can be found on my GitHub: https://github.com/ospangler/intel-arc-stable-diffusion-tutorial

Be sure to check out @Archive-pg2zn 's tutorial at https://www.youtube.com/watch?v=ub9150aOMMc on how to setup the wslconfig file, additional tips, error troubleshooting during Vladmandic installation, and improvements...

▶ Play video

restive parcel Aug 15, 2023, 1:25 AM

#

looks like theyr'e just doing a video tutorial for WSL2 setup?

coral mulch Aug 15, 2023, 8:40 PM

#

restive parcel looks like theyr'e just doing a video tutorial for WSL2 setup?

Seems there's some commands he uses that I do not have.

pastel geode Aug 15, 2023, 9:04 PM

#

im tempted to try it out since if it doesnt work out in the end, i could just unregister my wsl but im not really familiar with the commands he used

#

and regarding the OneApi toolkit, im curious if it will appear under programs in control panel even though he is installing it via wsl cuz the installation appeared on windows (15:08)

coral mulch Aug 15, 2023, 9:07 PM

#

I've gotten SDXL already working through Disty's method.

#

No Aivan, I don't think so.

#

It's still within WSL

pastel geode Aug 15, 2023, 9:08 PM

#

coral mulch No Aivan, I don't think so.

good cuz i dont want to keep tab of things i have to uninstall if this goes wrong in wsl hahaha

coral mulch Aug 15, 2023, 9:09 PM

#

I assume you meant the oneapi basekit GUI right?

#

The reason why that shows up is because he's running the GUI installer for the base kit.

#

It's the same on Windows and Linux

pastel geode Aug 15, 2023, 9:09 PM

#

coral mulch I assume you meant the oneapi basekit GUI right?

yea

coral mulch Aug 15, 2023, 9:09 PM

#

WSL2 supports graphical interfaces (WSLg)

#

Disty's method skips that entirely by directly installing what is needed through CLI.

pastel geode Aug 15, 2023, 9:10 PM

#

interesting cuz i tried ssh-ing to my university’s lab computer using wsl to open a program but no graphical interface appeared. I could use X2Go, but less software, the better. No worries tho!

keen marsh Aug 16, 2023, 1:07 AM

#

Its a little wonky to get going, but you can run it in native windows now.

novel sphinx Aug 16, 2023, 2:07 AM

#

i just tried the new openvino version, downloadin sdxl now to try it but for sd1.5 it is blazing fast

grave condor Aug 16, 2023, 2:13 AM

#

they got 11it/s on A770 https://youtu.be/a28Le2l4MA4 see around 12 minutes.

YouTube

Intel Software

Generative AI with OpenVINO | OpenVINO DevCon | Intel Software

Generative AI is exploding, bringing potential AI applications that could change everything we do. One example of this recent progress is the release of text processing models, which possess the capability to solve complex problems like passing medical and law exams, akin to human abilities. However, one critical question remains: can we run the...

▶ Play video

novel sphinx Aug 16, 2023, 2:20 AM

#

yeah thats what i acheived was 11.12 it/s

grave condor Aug 16, 2023, 2:30 AM

#

with 1 images or 4?

novel sphinx Aug 16, 2023, 2:36 AM

#

1

#

single batch

#

sdxl does not appear to work, although i set gpu in the openvino script settings it infers on the cpu with that model selected

grave condor Aug 16, 2023, 2:50 AM

#

does it move any of the models onto GPU?

restive parcel Aug 16, 2023, 2:54 AM

#

11 it/s on arc Inani

novel sphinx Aug 16, 2023, 2:54 AM

#

yes it works great with any sd1.5 based models

#

1st run is slow because it compiles the model

#

subsequent runs run at just over 11it/s

restive parcel Aug 16, 2023, 2:55 AM

#

oh it just handles the compiling for me? even more of an improvement over previous openvino xD

novel sphinx Aug 16, 2023, 2:56 AM

#

yes it has that baked in

coral mulch Aug 16, 2023, 2:56 AM

#

Imagine SDXL at that speed

#

AlchemistSad

novel sphinx Aug 16, 2023, 2:56 AM

#

works great on windows

#

yeah i mean idk if sdxl will be that fast but if they get sdxl working i would imagine 3-4

coral mulch Aug 16, 2023, 2:57 AM

#

Well no of course not

broken grail Aug 16, 2023, 2:57 AM

#

I had 11it/s or so working on arch, sd1.5, before a performance regression with pytorch 2 that brought me to 3 it/s at best

restive parcel Aug 16, 2023, 2:57 AM

#

i mean, you won't be doing 1024^2 at that speed of course

coral mulch Aug 16, 2023, 2:58 AM

#

I think I've underestimated sequential CPU offload lmao

novel sphinx Aug 16, 2023, 2:58 AM

#

openvino is fast and this is pretty easy to configure the guide in the wiki for a1111 is extremely straightforward and nothing convoluted to do

coral mulch Aug 16, 2023, 2:58 AM

#

It's slow for single generations

#

But amazing for large batch sizes

#

With model CPU offload, I can do 12 images per batch in 2 minutes

restive parcel Aug 16, 2023, 2:59 AM

#

sheeesh

novel sphinx Aug 16, 2023, 2:59 AM

#

wiht sdxl?

coral mulch Aug 16, 2023, 2:59 AM

#

Yes.

novel sphinx Aug 16, 2023, 2:59 AM

#

thats pretty good ngl

broken grail Aug 16, 2023, 2:59 AM

#

wow

#

~2it/s thereabouts?

coral mulch Aug 16, 2023, 3:00 AM

#

I'm going to test Sequential now

broken grail Aug 16, 2023, 3:00 AM

#

assuming 20 iterations per image ig

coral mulch Aug 16, 2023, 3:00 AM

#

to see how high I can get on batch size

broken grail Aug 16, 2023, 3:00 AM

#

I keep getting really weird artifacting past batch size 2

novel sphinx Aug 16, 2023, 3:00 AM

#

i wouild imagne single image is slow for sequential because how it processes form cpu to gpu but all the following images would be fast

coral mulch Aug 16, 2023, 3:00 AM

#

oh yeah uh

#

I forgot to mention

broken grail Aug 16, 2023, 3:00 AM

#

supersaturated colors and noise

coral mulch Aug 16, 2023, 3:00 AM

#

that's with sequential LORA on

novel sphinx Aug 16, 2023, 3:01 AM

#

in wsl i was getting like 2.3 it/s in sdxl so thi sseems about right

broken grail Aug 16, 2023, 3:01 AM

#

does ipexrun work for you guys?

#

i have an identical setup to disty, pretty sure, and it's barely working on my machine

#

kinda stinks

coral mulch Aug 16, 2023, 3:01 AM

#

I have disty's working on my side.

broken grail Aug 16, 2023, 3:01 AM

#

idk if the bifrost card is any different or if I need to update some microcode or something

coral mulch Aug 16, 2023, 3:02 AM

#

broken grail Aug 16, 2023, 3:02 AM

#

wow you've got all the vram savers on

#

I haven't tried much of sdxl yet

coral mulch Aug 16, 2023, 3:02 AM

#

Indeed

broken grail Aug 16, 2023, 3:02 AM

#

what's the typical performance penalty with that suite?

coral mulch Aug 16, 2023, 3:02 AM

#

I didn't show that either huh

#

I'm sacrificing speed for sheer image generation

broken grail Aug 16, 2023, 3:03 AM

#

also: anyone notice any significant difference between bf16/fp16?

coral mulch Aug 16, 2023, 3:03 AM

#

So I just tested

#

I can do Batch size 24 on Sequential

restive parcel Aug 16, 2023, 3:04 AM

#

broken grail also: anyone notice any significant difference between bf16/fp16?

for me, it can be the difference between getting an image and not getting one DinaKEK but I don't know when exactly is right for which one. My setup seems cursed though, I can't get good faces on any model

coral mulch Aug 16, 2023, 3:04 AM

#

The higher the image batch size you can get, it seems you get closer to actual image gen performance

#

However it IS slower than Model CPU offload

#

Ope, it's lower than that.

#

16 seconds.

broken grail Aug 16, 2023, 3:05 AM

#

restive parcel for me, it can be the difference between getting an image and not getting one <:...

haha same here, on occasion

broken grail Aug 16, 2023, 3:05 AM

#

coral mulch The higher the image batch size you can get, it seems you get closer to actual i...

this makes sense, all the cpu/gpu shuffling costs are amortized over the batch

coral mulch Aug 16, 2023, 3:05 AM

#

Then the real question is

#

What's the best batch size to generation speed?

broken grail Aug 16, 2023, 3:07 AM

#

hm

#

I would test it but keep getting numerical instability past batch size 2

restive parcel Aug 16, 2023, 3:07 AM

#

broken grail haha same here, on occasion

I haven't been able to get anything on my current linux setup yet. SDXL has broken faces and sd1.5 models no longer work at all

broken grail Aug 16, 2023, 3:07 AM

#

anyway I'd imagine you'd get discontinuities whenever you need to kick on another vram saver

broken grail Aug 16, 2023, 3:07 AM

#

restive parcel I haven't been able to get anything on my current linux setup yet. SDXL has brok...

what's your distro?

coral mulch Aug 16, 2023, 3:08 AM

#

It did it.

#

#

1280x720 images.

#

Zero prompt with just negatives generates some interesting outcomes.

broken grail Aug 16, 2023, 3:09 AM

#

why so gray?

#

oh zero prompt

#

I would occasionally get really gray results with second pass

#

kinda had a faded look. pretty cool

#

annoying though

restive parcel Aug 16, 2023, 3:10 AM

#

broken grail what's your distro?

Ubuntu 23 this time. I killed a couple of others before it

broken grail Aug 16, 2023, 3:12 AM

#

hm

#

try disabling ipexrun if you haven't already

#

what flavor errors are you getting

coral mulch Aug 16, 2023, 3:14 AM

#

im still on 22.04LTS

broken grail Aug 16, 2023, 3:15 AM

#

anyway i gotta go to sleep

#

hopefully i can get my litany of errors sorted out in the coming days

#

i was nagging disty on github since they were the only other person I knew about running sd on arc but now I've found this servers so things should be smoother

coral mulch Aug 16, 2023, 3:16 AM

#

#

I love how despite not having any prompts

#

Somehow it still puts together a coherent image on it's own

#

This model blows 1.5 out of the water

novel sphinx Aug 16, 2023, 3:17 AM

#

okay correction, only the 1.5 base model seems to work properly with the openvino implementation, whilst other models will work and generate an image, im assuming the openvino compilation pipeline messes the models up as other models just output complete garbage

coral mulch Aug 16, 2023, 3:19 AM

#

https://cdn.discordapp.com/attachments/204342863947890689/1140830765802852353/00241-A_fallout_new_vegas_desert_ranger_Pixel_Art.jpg

#

Ye it seems a batch size of 4 is perfect

#

3.8s/IT

keen marsh Aug 16, 2023, 3:48 AM

#

novel sphinx openvino is fast and this is pretty easy to configure the guide in the wiki for ...

A1111 works with open vino?

coral mulch Aug 16, 2023, 3:57 AM

#

Just a lil' question

#

What do you have to do to get ComfyUI opearting on Arc?

keen marsh Aug 16, 2023, 3:59 AM

#

coral mulch What do you have to do to get ComfyUI opearting on Arc?

They have official support I believe,https://github.com/comfyanonymous/ComfyUI/discussions/476 ran it in native windows, its slower than sdnext

GitHub

Intel Arc Graphics Thread · comfyanonymous ComfyUI · Discussion #476

ComfyUI now supports Intel Arc Graphics. (#409) Since the installation tutorial for Intel Arc Graphics is quite long, I'll write it here first. Chinese version / 中文版: HERE Intel Extension for P...

#

Also, sdxl didn't work for me, but I didn't really know what i was doing. 1.5 worked fine though

coral mulch Aug 16, 2023, 4:00 AM

#

I'm trying to get the SD.Next ComfyUI Extension to work lmao since it's literally just ComfyUI

keen marsh Aug 16, 2023, 4:03 AM

#

https://github.com/openvinotoolkit/stable-diffusion-webui/

GitHub

GitHub - openvinotoolkit/stable-diffusion-webui: Stable Diffusion w...

Stable Diffusion web UI. Contribute to openvinotoolkit/stable-diffusion-webui development by creating an account on GitHub.

#

I wonder if this could work for sdnext as well. Not a fan of automatic since it's never natice support for their platform, its always a fork that may never get maintained etc.

novel sphinx Aug 16, 2023, 4:10 AM

#

the openvino appartently doesnt work with other scripys and such so i dont think it would work with sd.next as its heavily modified

#

this will likely change in the future as development continues but its nice to have an easy to use webui version using openvino which is the fastest on arc by far

ember orchid Aug 16, 2023, 4:31 AM

#

We have a thread A1111 for Arc on here
#1141164275990278206 message

coral mulch Aug 16, 2023, 4:47 AM

#

oh hey it owrks

#

SDXL ComfyUI extension does indeed work

restive parcel Aug 16, 2023, 5:06 AM

#

nice

keen marsh Aug 16, 2023, 5:12 AM

#

coral mulch SDXL ComfyUI extension does indeed work

How is the speed for you?

#

Also, man a few months ago i dont think i imagined so many options for arc gpus. Coming along fast imo

coral mulch Aug 16, 2023, 5:23 AM

#

@keen marsh Literally the same it seems. At least in terms of normal non-offload running, 1 IT a second basically (with LORA)

#

Then again this IS just the extension

#

It's using all the same packages my main venv is using

coral mulch Aug 16, 2023, 5:56 AM

#

It uses A LOT of VRAM though it seems

#

It's nowhere near as optimized

#

Yeah nvm I can barely run it lol

#

It runs for the first two images then explodes

#

Well at least I got a lil' taste of ComfyUI, and I don't really like it tbh.

#

🤷‍♂️

keen marsh Aug 16, 2023, 6:08 AM

#

Yeah not s big fan either, it was slower for me with sd1.5 too. Probably not bad when you get over the learning curve though, seen people make very fine too ed iterations pretty quickly with it.

proper cradle Aug 16, 2023, 7:54 AM

#

keen marsh I wonder if this could work for sdnext as well. Not a fan of automatic since it...

SDNext has openvino_fx compiler as an option in the compute settings

#

But It's slower than no compile at all on my end

#

And it uses more VRAM

proper cradle Aug 16, 2023, 8:00 AM

#

broken grail also: anyone notice any significant difference between bf16/fp16?

Yes, this started to happen after PyTorch 2.
BF16 is faster on original backend.
FP16 is faster on diffusers backend.

chrome bone Aug 16, 2023, 8:03 AM

#

i think its simply because of precision conversion somewhere in the pipeline, bf16/fp16 shdnt affect speed per se

#

just guessing though

broken grail Aug 16, 2023, 11:08 AM

#

proper cradle Yes, this started to happen after PyTorch 2. BF16 is faster on original backend....

Thanks

keen marsh Aug 16, 2023, 3:03 PM

#

proper cradle But It's slower than no compile at all on my end

interesting, I wonder if it will fair better in windows? Or if an environment for openvino needs to be created? I don't know enough and just hack my way through things, but I will start looking into these things soon.

#

if 11it/s is possible, maaaan lol

broken grail Aug 16, 2023, 3:04 PM

#

I noticed compile was slower for one off generations but would speed up with larger batches and consecutive runs

broken grail Aug 16, 2023, 3:04 PM

#

keen marsh if 11it/s is possible, maaaan lol

Definitely possible

keen marsh Aug 16, 2023, 3:04 PM

#

I will check out this a1111 fork and look into sdnext's openvino backend on my system maybe today.

broken grail Aug 16, 2023, 3:05 PM

#

https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html

SD WebUI Benchmark Data

SD WebUI Benchmark Data; Author: Vladimir Mandic

#

search a770

keen marsh Aug 16, 2023, 3:05 PM

#

broken grail search a770

so that's with openvino backend on sdnext?

broken grail Aug 16, 2023, 3:06 PM

#

no

keen marsh Aug 16, 2023, 3:06 PM

#

or does that benchmark work on a1111?

broken grail Aug 16, 2023, 3:06 PM

#

that was ipex

#

in sdnext

#

invokeai optimizations

keen marsh Aug 16, 2023, 3:06 PM

#

i've only ever gotten 6/it's, but I do have a750, maybe it's not possible on it

proper cradle Aug 16, 2023, 3:06 PM

#

Diffusers is way faster than original backend

#

OpenVINO WebUI uses diffusers too

keen marsh Aug 16, 2023, 3:07 PM

#

ahhh, okay. I never tried that in linux

broken grail Aug 16, 2023, 3:07 PM

#

my 11it/s was without diffusers iirc

#

or maybe not

#

idk

#

don't remember

keen marsh Aug 16, 2023, 3:07 PM

#

my native windows ipex is a percentage slower than linux right now, the self compiled one with AOT anyway

proper cradle Aug 16, 2023, 3:07 PM

#

broken grail my 11it/s was without diffusers iirc

You can't use the ortiginal backend in OpenVINO SD WebUI

broken grail Aug 16, 2023, 3:07 PM

#

whatever sdnext defaults to

proper cradle Aug 16, 2023, 3:07 PM

#

broken grail whatever sdnext defaults to

Original backend

broken grail Aug 16, 2023, 3:07 PM

#

proper cradle You can't use the ortiginal backend in OpenVINO SD WebUI

that was with ipex like a month ago

proper cradle Aug 16, 2023, 3:08 PM

#

I get 8.3 it/s at 512x512 on original backend with FP16

novel sphinx Aug 16, 2023, 3:09 PM

#

Native windows with openvino a1111 fork sd1.5 512x512 is 11.2 it/s

broken grail Aug 16, 2023, 3:09 PM

#

proper cradle I get 8.3 it/s at 512x512 on original backend with FP16

was that with or without batching

proper cradle Aug 16, 2023, 3:09 PM

#

Without

broken grail Aug 16, 2023, 3:09 PM

#

I was using compilation warmup and batching

#

That's why

#

Anyway diffusers/vino is faster than original/ipex nowadays?

keen marsh Aug 16, 2023, 3:10 PM

#

novel sphinx Native windows with openvino a1111 fork sd1.5 512x512 is 11.2 it/s

I was reading that models don't work right though? Do you have to convert them? or have you tried?

broken grail Aug 16, 2023, 3:10 PM

#

I wonder if zen kernel is causing me trouble

#

gotta get ipexrun fixed

novel sphinx Aug 16, 2023, 3:11 PM

#

yeah they dont seem to work right, the fork automatically converts them when it does its model compile, you get generations but the outputs dont match what the model is for

proper cradle Aug 16, 2023, 3:11 PM

#

proper cradle You can't use the ortiginal backend in OpenVINO SD WebUI

OpenVINO SD WebUI is entirely different thing

#

OpenVINO actually slows things down in SDNext

keen marsh Aug 16, 2023, 3:12 PM

#

broken grail Anyway diffusers/vino is faster than original/ipex nowadays?

seems to be

novel sphinx Aug 16, 2023, 3:12 PM

#

For example an anime model will still output real life images more like what the 1.5 base model would produce

proper cradle Aug 16, 2023, 3:12 PM

#

proper cradle OpenVINO actually slows things down in SDNext

8.5 down to 8

novel sphinx Aug 16, 2023, 3:12 PM

#

Albeit it seems like if a model is say trained more for nsfw content that seems to stick just not the models desired style

keen marsh Aug 16, 2023, 3:13 PM

#

novel sphinx yeah they dont seem to work right, the fork automatically converts them when it ...

I wonder if there is an issue with the conversion process? Doesn't seem like it should change anything

broken grail Aug 16, 2023, 3:13 PM

#

novel sphinx For example an anime model will still output real life images more like what the...

sounds like the wrong VAE is being used

keen marsh Aug 16, 2023, 3:13 PM

#

Not sure if that fork is maintained , which is why i like SDNEXt tbh

broken grail Aug 16, 2023, 3:13 PM

#

maybe it's not loading an embedded vae

keen marsh Aug 16, 2023, 3:14 PM

#

Vae shouldn't effect the style that much? mostly color output

broken grail Aug 16, 2023, 3:14 PM

#

ehh

keen marsh Aug 16, 2023, 3:14 PM

#

Also, make sure you use clip skip

novel sphinx Aug 16, 2023, 3:14 PM

#

i also notice far better results using dpm++ 2m karas vs euler a using the openvino fork. This vae thing is possible. I haven't done alot of extensive testing yet but it is nice to have a solution thay works natively on windows

broken grail Aug 16, 2023, 3:14 PM

#

fine details are all vae

novel sphinx Aug 16, 2023, 3:14 PM

#

Is clip skip an extension?

broken grail Aug 16, 2023, 3:14 PM

#

it's a setting

novel sphinx Aug 16, 2023, 3:14 PM

#

Okay

keen marsh Aug 16, 2023, 3:15 PM

#

No, you can set it in the options.

broken grail Aug 16, 2023, 3:15 PM

#

you discard the last n layers of CLIP

keen marsh Aug 16, 2023, 3:15 PM

#

Most anime models need clip skip 2, most realistic models need 1.

broken grail Aug 16, 2023, 3:15 PM

#

essentially it weakens guidance

novel sphinx Aug 16, 2023, 3:15 PM

#

Do note that openvino fork disables all other scripts other than the openvino acceleration script

broken grail Aug 16, 2023, 3:15 PM

#

broken grail essentially it weakens guidance

sort of like "blurring" the meanings of the words

keen marsh Aug 16, 2023, 3:15 PM

#

I find that it doesn't make THAT big a difference though, just changes the image the style is usually the same. Sometimes stuff like "masterpiece" can make frames around the image in some models lol

#

Could be different in openvino though

chrome bone Aug 16, 2023, 3:18 PM

#

it should have a big impact on image generated, not just colors (though thats probably what you see in practice). VAE decoders is just a neural network that convert latent image (that humans cannot comprehend) back to pixel space

keen marsh Aug 16, 2023, 3:19 PM

#

It's worth a shot. Vae might need to be converted to openvino as well right? Not sure if the fork does that

proper cradle Aug 16, 2023, 3:19 PM

#

keen marsh It's worth a shot. Vae might need to be converted to openvino as well right? N...

It does

#

It will run on the CPU otherwise

chrome bone Aug 16, 2023, 3:20 PM

#

thats the case a few months back (and honestly i think still is), you cannot just use custom models without conversion

novel sphinx Aug 16, 2023, 3:20 PM

#

So you think i should set clip skip to 2? It defaults to 1

keen marsh Aug 16, 2023, 3:20 PM

#

novel sphinx So you think i should set clip skip to 2? It defaults to 1

typically the model page will tell you which to use

#

You can also add it to the main page in the options so you don't have to go to settings all the time.

#

sdnext already does that for you btw

novel sphinx Aug 16, 2023, 4:05 PM

#

yeah setting clip skip and such seems to not change anything, definitely should not be getting like real life photorealistic images from conterfeit but here we are

proper cradle Aug 16, 2023, 4:57 PM

#

broken grail Aug 16, 2023, 5:04 PM

#

https://tenor.com/view/impressive-very-nice-patrick-bateman-gif-15247511

Tenor

proper cradle Aug 16, 2023, 5:08 PM

#

Beats RTX 3070 at batch size 16

#

Pretty close to a RTX 3070 Ti

keen marsh Aug 16, 2023, 6:29 PM

#

novel sphinx yeah setting clip skip and such seems to not change anything, definitely should ...

Does it have a baked vae?

novel sphinx Aug 16, 2023, 7:05 PM

#

Im not sure but using seperate vaes have no effect on the image so they're being ignored

keen marsh Aug 16, 2023, 7:15 PM

#

I had an issue where certain vaes didnt seem to work in diffusers, also some sanplers made garbled output

coral mulch Aug 16, 2023, 8:20 PM

#

Any news on High-res fixes yet for Vladmantic?

#

I'm kinda itching for it.

pastel geode Aug 16, 2023, 8:23 PM

#

So I finally decided to try https://www.technopat.net/sosyal/konu/using-stable-diffusion-webui-with-intel-arc-gpus.2593077/
on a clean Ubuntu wsl but it appears that it doesn't let me **load my weights. **. Is there a way to resolve this?

Technopat Sosyal

Guide: Using Stable Diffusion WebUI with Intel ARC GPU's

In this guide, we will install and use Stable Diffusion WebUI SD.Next with Intel ARC GPU's.
Intel PyTorch Library doesn't have native support for Windows so we have to use Native Linux or Linux via WSL.

Setup WSL on Windows:
Follow these instructions to setup Linux environment in Windows, then...

ember orchid Aug 16, 2023, 8:28 PM

#

A1111 OpenVino solution already has a fix for "Restore Faces" update soon

pastel geode Aug 16, 2023, 8:28 PM

#

pastel geode So I finally decided to try https://www.technopat.net/sosyal/konu/using-stable-d...

I wonder if its because I have both my igpu and dgpu
#1084296011675082843 message

proper cradle Aug 16, 2023, 8:28 PM

#

pastel geode I wonder if its because I have both my igpu and dgpu https://discord.com/channe...

Disable iGPU

coral mulch Aug 16, 2023, 8:29 PM

#

ember orchid A1111 OpenVino solution already has a fix for "Restore Faces" update soon

But does it support SDXL.

proper cradle Aug 16, 2023, 8:29 PM

#

Or you can try xpu_VISIBLE_DEVICES env variable

coral mulch Aug 16, 2023, 8:29 PM

#

🤔

pastel geode Aug 16, 2023, 8:29 PM

#

once I do that, do I just run ./webui.sh --use-ipex?

ember orchid Aug 16, 2023, 8:30 PM

#

FYI I fixed my issue with the A1111 OpenVINO solution by reinstallling my driver, disabling my RTX and reinstalling

proper cradle Aug 16, 2023, 8:30 PM

#

pastel geode once I do that, do I just run `./webui.sh --use-ipex`?

Try xpu_VISIBLE_DEVICES=1 ./webui.sh --use-ipex

#

Try 1 or 0

proper cradle Aug 16, 2023, 8:31 PM

#

proper cradle Try `xpu_VISIBLE_DEVICES=1 ./webui.sh --use-ipex`

This should hide the iGPU from IPEX

pastel geode Aug 16, 2023, 8:31 PM

#

proper cradle This should hide the iGPU from IPEX

task manager says Arc is my gpu 1 so i should put 0 instead right?

proper cradle Aug 16, 2023, 8:32 PM

#

pastel geode task manager says Arc is my gpu 1 so i should put 0 instead right?

iGPU is 2 or 0?

pastel geode Aug 16, 2023, 8:32 PM

#

0

coral mulch Aug 16, 2023, 8:32 PM

#

The number is the GPU ID.

proper cradle Aug 16, 2023, 8:32 PM

#

use 1

coral mulch Aug 16, 2023, 8:32 PM

#

Use 1.

broken grail Aug 16, 2023, 8:35 PM

#

coral mulch Any news on High-res fixes yet for Vladmantic?

wdym? second pass?

pastel geode Aug 16, 2023, 8:36 PM

#

proper cradle Aug 16, 2023, 8:36 PM

#

ipexrun things

broken grail Aug 16, 2023, 8:36 PM

#

hey that's my error

#

I think

proper cradle Aug 16, 2023, 8:36 PM

#

don't use --use-ipex to disable ipexrun

coral mulch Aug 16, 2023, 8:36 PM

#

broken grail wdym? second pass?

Old SD (2.1 and prior) had a fix for higher resolutions to maintain coherency.

#

It was broken in SDXL.

broken grail Aug 16, 2023, 8:37 PM

#

oh you mean SDXL second pass

#

ok

coral mulch Aug 16, 2023, 8:37 PM

#

...

#

No, I really don't.

proper cradle Aug 16, 2023, 8:37 PM

#

coral mulch Any news on High-res fixes yet for Vladmantic?

Use Img2Img

broken grail Aug 16, 2023, 8:37 PM

#

"hires fix"?

proper cradle Aug 16, 2023, 8:37 PM

#

It's the exact same thing

broken grail Aug 16, 2023, 8:37 PM

#

proper cradle It's the exact same thing

I thought hires fix upscaled in latent space

#

thus saving a round trip through the VAE

proper cradle Aug 16, 2023, 8:38 PM

#

broken grail I thought hires fix upscaled in latent space

If you select any upscaler with hires, latent upscaling goes out the window

#

And latent upscaling was generally bad in my experience

pastel geode Aug 16, 2023, 8:39 PM

#

without ipex

proper cradle Aug 16, 2023, 8:39 PM

#

Try disaling iGPU from the BIOS

pastel geode Aug 16, 2023, 8:40 PM

#

Okay, and what command should i run after?

broken grail Aug 16, 2023, 8:40 PM

#

proper cradle If you select any upscaler with hires, latent upscaling goes out the window

oh I would always use latent upscaling with 512x512 base images; it would work decently well, but unreliably with frequent NaNs and unstable outputs

proper cradle Aug 16, 2023, 8:40 PM

#

pastel geode Okay, and what command should i run after?

same ./webui.sh

coral mulch Aug 16, 2023, 8:40 PM

#

grid-137631324620-20A_knight_raising_his_sword_in_the_air.png

#

Would it even be remotely possible to generate a 4096x4096 image on SDXL without artifacting or duplicating?

broken grail Aug 16, 2023, 8:42 PM

#

with lots of manual intervention absolutely

proper cradle Aug 16, 2023, 8:42 PM

#

coral mulch Would it even be remotely possible to generate a 4096x4096 image on SDXL without...

Nope. 2048x2048 maybe but 4096x4096 is too much for SDXL:

broken grail Aug 16, 2023, 8:42 PM

#

my trick for super duper resolution stuff is to generate at multiple "scales"

#

and stick things together

proper cradle Aug 16, 2023, 8:42 PM

#

proper cradle Nope. 2048x2048 maybe but 4096x4096 is too much for SDXL:

This is a direct 4096x4096

broken grail Aug 16, 2023, 8:42 PM

#

granted it's inconsistent and only works well for niche things

#

unless you mean directly

proper cradle Aug 16, 2023, 8:42 PM

#

if you manage to get a decent 2048x2048 image, you can upscale it

#

Generating at 1920x1080 and upscaling to 3840x2160 works well:

coral mulch Aug 16, 2023, 8:44 PM

#

Would you second-pass upscale it?

#

Or just go into extras

proper cradle Aug 16, 2023, 8:45 PM

#

Img2Img

coral mulch Aug 16, 2023, 8:45 PM

#

Bruh.

proper cradle Aug 16, 2023, 8:45 PM

#

Re-generate

#

Upscaling from extras isn't good

#

2048x3072 with Lora:

pastel geode Aug 16, 2023, 8:46 PM

#

pastel geode without ipex

its normal to have that warning since this is my first time setting it up right?

proper cradle Aug 16, 2023, 8:46 PM

#

Selected model not found?

pastel geode Aug 16, 2023, 8:47 PM

#

yep

#

checkpoint*

proper cradle Aug 16, 2023, 8:47 PM

#

proper cradle Selected model not found?

This is normal for the first setup since it will look for a model.ckpt

broken grail Aug 16, 2023, 8:51 PM

#

also as a tip when upscaling via img to img it's often beneficial to include more close-up related things in your prompt, since you're essentially running the model on small areas at a time

#

the extreme case of this is to upscale first via simple interpolation, than inpaint areas one by one to add more detail

#

this process could be carried out forever, in theory, especially with ControlNet to keep the model in line

#

but it's hilariously labor intensive

proper cradle Aug 16, 2023, 8:51 PM

#

proper cradle 2048x3072 with Lora:

This is without Tiled Upscale

broken grail Aug 16, 2023, 8:52 PM

#

oh you're just directly tossing it in?

#

hot damn

proper cradle Aug 16, 2023, 8:52 PM

#

Yep Img2Img it in one go

broken grail Aug 16, 2023, 8:52 PM

#

what's the vram limited resolution on that one?

#

I'm guessing that's with all the vram savers on?

proper cradle Aug 16, 2023, 8:53 PM

#

With Attention Slicing and VAE Tiling and Model Shuffling, A770 16GB is VRAM limited to 4096x4096

broken grail Aug 16, 2023, 8:53 PM

#

nice

proper cradle Aug 16, 2023, 8:54 PM

#

Only VAE decode is Tiled

#

--lowvram and --medvram (aka cpu offloading) is disabled

ember orchid Aug 16, 2023, 9:02 PM

#

coral mulch > But does it support SDXL.

Not at this time. IPEX WSL/Linux is the path for SDXL on Arc

coral mulch Aug 16, 2023, 9:08 PM

#

Which is what I have currently set up.

#

👍

#

Nvm I answered it myself.

#

I'm an idiot lmao

coral mulch Aug 16, 2023, 9:30 PM

#

@proper cradle When resizing, do you just use Resize Fixed

proper cradle Aug 16, 2023, 9:34 PM

#

coral mulch Aug 16, 2023, 9:35 PM

#

Thank you.

#

grid-221158638020-20A_blue_glowing_sword_in_a_pedestal_in_a_magical_forest_pixel_art.png

#

grid-3368830020-20A_blue_glowing_sword_in_a_pedestal_in_a_magical_forest_pixel_art.png

broken grail Aug 16, 2023, 10:15 PM

#

well I got ipexrun working

#

it was model compile

coral mulch Aug 16, 2023, 10:20 PM

#

1791730116_-_A_blue_glowing_sword_in_a_pedestal_in_a_magical_forest_voxel_style._lora_VoxelXL_v1_1.0_.jpg

#

Yeah, img2img makes a HUGE difference in quality.

#

I am very pleased with the outputs.

#

82056437920-20A_skeletal_hand_coming_out_the_ground.png

proper cradle Aug 17, 2023, 5:46 AM

#

Pushed the Windows fix by @paper horizon, can someone test it?

#

Also is it detecting OneAPI if you don't use --use-ipex?

novel sphinx Aug 17, 2023, 6:37 AM

#

I remember previously when ipex for windows first release and i tried it was detecting oneapi without the --use-ipex

novel sphinx Aug 17, 2023, 7:47 AM

#

OSError: [WinError 126] The specified module could not be found. Error loading
"C:\Users\KingOfMemes\automatic\venv\lib\site-packages\torch\lib\backend_with_compiler.dll" or one of its dependencies.

#

detects oneapi without --use-ipex tho so the audodetection works fine, i have tried launching with an without --use-ipex and done --reinstall just to make sure but still no luck on windows at the current moment

#

03:48:15-367904 INFO Installing package: torch==2.0.0a0 torchvision==0.15.1 intel_extension_for_pytorch==2.0.110+gitba7f6c1 openvino==2023.1.0.dev20230728 -f
https://developer.intel.com/ipex-whl-stable-xpu
03:48:19-672083 ERROR Error running pip: install --upgrade torch==2.0.0a0 torchvision==0.15.1 intel_extension_for_pytorch==2.0.110+gitba7f6c1
openvino==2023.1.0.dev20230728 -f https://developer.intel.com/ipex-whl-stable-xpu

#

this happens during the install as well

proper cradle Aug 17, 2023, 8:40 AM

#

novel sphinx detects oneapi without --use-ipex tho so the audodetection works fine, i have tr...

Do you see this warning?
Incompatible torch version {installed_torch_ver} for ipex windows, reinstalling to {ipex_torch_ver}

proper cradle Aug 17, 2023, 9:01 AM

#

novel sphinx 03:48:15-367904 INFO Installing package: torch==2.0.0a0 torchvision==0.15.1 ...

Removed Torchvision from this

paper horizon Aug 17, 2023, 11:47 AM

#

novel sphinx OSError: [WinError 126] The specified module could not be found. Error loading "...

try conda install libuv

#

you have to do that in a conda environment as ipex tutorial says

grave condor Aug 17, 2023, 12:44 PM

#

I managed to get it working without an conda env and just installed it into my system pip

#

but I am using the VSCode oneAPI env setup, which might rely on conda under the hood

#

I would consider conda a dependency

novel sphinx Aug 17, 2023, 1:00 PM

#

Yes i saw the incompatible torch version and then it reinstalled

paper horizon Aug 17, 2023, 1:06 PM

#

it's okay to not use conda as long as uv.dll is in your library path. conda install libuv just simplifies things

keen marsh Aug 17, 2023, 1:31 PM

#

Anybody gotten sequential offload to work on native windows in sd.next?

#

I do have torch, ipex and torchvison compiled from source. I guess i could upload the wheels.

#

I want to tey and compile the specific git# that intel used to see if speed in native increases or if aot just makes it slower, the it takes hours with aot.

chrome bone Aug 17, 2023, 1:37 PM

#

you can just git checkout at specific commit

#

i doubt its working well currently.. theres no reason to not upload a functioning prebuilt wheel file otherwise

keen marsh Aug 17, 2023, 1:40 PM

#

you edit the compile.bat with the get# where it has "xpu-master" also xpu-master adds a file call that doesn't exist in pytorch but is easily fixed with a simple comment line. I use Vipitis's .bat file still

#

I also edited a compile file for just Ipex

#

I have the wheels uploaded in this thread somewhere, but those you have to edit one file. If you compile from the xpu-2.X it works without the need to edit, and that's where the git# they use is from. It's from way back on july 25th (my birthday btw lol)

#

for sd.next this isntalled for me without any error in windows if you want to use the prebuilt wheels " torch==2.0.0a0 torchvision==0.15.2a0 intel_extension_for_pytorch==2.0.110+gitba7f6c1 -f https://developer.intel.com/ipex-whl-stable-xpu "

novel sphinx Aug 17, 2023, 1:45 PM

#

yes it all installs fine, always did tbh, still havent gotten it to actually launch within windows

#

OSError: [WinError 126] The specified module could not be found. Error loading
"C:\Users\KingOfMemes\automatic\venv\lib\site-packages\torch\lib\backend_with_compiler.dll" or one of its dependencies.

#

this is the error i now receive

keen marsh Aug 17, 2023, 1:46 PM

#

sorry edited

keen marsh Aug 17, 2023, 1:46 PM

#

novel sphinx this is the error i now receive

hold on that may be where you need to edit a file.

#

I just woke up so i am slow right now give me a min to check

#

yeah, that is the file that doesn't exist in pytorch do this

#

2.)Locate the init.py file in your intel extension for pytorch folder pip

"your_python_directory\Lib\site-packages\intel_extension_for_pytorch\ init.py"

3.) Comment out line 100

#from . import _inductor

#

should work after that

#

It is in xpu-master for some reason, but it is not in the xpu 2.X branch

#

If you compile from the git hash or the specific xpu2.x branch it doesn't exist

novel sphinx Aug 17, 2023, 1:54 PM

#

im not seing that in my file

keen marsh Aug 17, 2023, 1:55 PM

#

https://github.com/intel/intel-extension-for-pytorch/blob/xpu-master/intel_extension_for_pytorch/__init__.py

GitHub

intel-extension-for-pytorch/intel_extension_for_pytorch/__init__.py...

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform - intel/intel-extension-for-pytorch

novel sphinx Aug 17, 2023, 1:56 PM

#

yeah my version already doenst have that line

#

from the prebuilt wheel

keen marsh Aug 17, 2023, 1:57 PM

#

hmm...everything running from oneapi environment? Call all variables etc "C:\Program Files (x86)\Intel\oneAPI\setvars.bat"
"C:\Program Files (x86)\Intel\oneAPI\mkl\2023.2.0\env\vars.bat"
"C:\Program Files (x86)\Intel\oneAPI\compiler\2023.2.0\env\vars.bat"

novel sphinx Aug 17, 2023, 1:57 PM

#

correct

keen marsh Aug 17, 2023, 1:57 PM

#

Running in Conda?

paper horizon Aug 17, 2023, 1:58 PM

#

novel sphinx OSError: [WinError 126] The specified module could not be found. Error loading "...

run in conda env and do conda install libuv

#

launch SD.Next by python launch.py --use-ipex

keen marsh Aug 17, 2023, 1:58 PM

#

Forgot to mention, conda is necessary for me

#

I actually had to replace my python with it, as RC posted in this thread

paper horizon Aug 17, 2023, 1:58 PM

#

I've checked the dependencies of torch and it requires libuv

keen marsh Aug 17, 2023, 2:00 PM

#

you can also try this wheel, but you do need to edit that line out https://drive.google.com/file/d/1UnlMNvzqiqHW9aAiXv56u2M-EDn1JJwr/view?usp=sharing

Google Docs

intel_extension_for_pytorch-2.0.110+git9fccbf1-cp310-cp310-win_amd6...

paper horizon Aug 17, 2023, 2:00 PM

#

miniconda3\envs\{env_name}\Library\bin\uv.dll

keen marsh Aug 17, 2023, 2:00 PM

#

then copy the folder to your VENV in automatic

#

that wheel also does not need 15minutes to start, it is compiled with AOT so starts right away. But it is 2it/s slower than normal in original backend for some reason

#

Which is why I may try and compile from that git# but I don't really feel like spending another day on it lol

paper horizon Aug 17, 2023, 2:02 PM

#

keen marsh that wheel also does not need 15minutes to start, it is compiled with AOT so sta...

wow, you compiled the AOT wheels yourself! How long does it take on your platform?

keen marsh Aug 17, 2023, 2:02 PM

#

lol, took 4-6 hours

#

does 90% then when it gets to cmake cpu goes to 15% and it takes HOURS

paper horizon Aug 17, 2023, 2:03 PM

#

may I know your CPU and RAM?

keen marsh Aug 17, 2023, 2:03 PM

#

5600, 32gb of 3200

proper cradle Aug 17, 2023, 2:04 PM

#

keen marsh does 90% then when it gets to cmake cpu goes to 15% and it takes HOURS

What is the GPU usage when that happens? Probably compiling on the GPU.

keen marsh Aug 17, 2023, 2:04 PM

#

I have another wheel you don't need to edit a file, but too lazy to upload tbh lol

novel sphinx Aug 17, 2023, 2:04 PM

#

even with libuv and lauching from conda still get same error

keen marsh Aug 17, 2023, 2:04 PM

#

proper cradle What is the GPU usage when that happens? Probably compiling on the GPU.

I didn't really check, but it definitely wasn't high.

keen marsh Aug 17, 2023, 2:05 PM

#

novel sphinx even with libuv and lauching from conda still get same error

Yeah, it's a pain tbh. I deleted python 3 and added miniconda to path and that's when it worked. Obviously this shouldn't be the case. Vipitis got it running though

#

I don't use Python for anything else so It didn't matter to me.

paper horizon Aug 17, 2023, 2:06 PM

#

novel sphinx even with libuv and lauching from conda still get same error

what about python -c "import torch"

keen marsh Aug 17, 2023, 2:06 PM

#

I think the conda python and system python need to be the exact same or something

#

there is also some reference to conda in one of the compile.bat files I think

#

I could only get python 3.10.6 when conda is like 3.10.12 so that may be why

proper cradle Aug 17, 2023, 2:07 PM

#

What this returns when you run this in the webui env?
pip show torch

novel sphinx Aug 17, 2023, 2:08 PM

#

Name: torch
Version: 2.0.0a0+gitc6a572f
Summary: Tensors and Dynamic neural networks in Python with strong GPU acceleration
Home-page: https://pytorch.org/
Author: PyTorch Team
Author-email: [email protected]
License: BSD-3
Location: c:\users\kingofmemes\automatic\venv\lib\site-packages
Requires: filelock, jinja2, networkx, sympy, typing-extensions
Required-by: accelerate, basicsr, clean-fid, clip, clip-interrogator, compel, facexlib, gfpgan, invisible-watermark, kornia, lpips, open-clip-torch, pytorch-lightning, realesrgan, timm, tomesd, torchdiffeq, torchmetrics, torchsde, torchvision

PyTorch

paper horizon Aug 17, 2023, 2:09 PM

#

did you launch sd.next with webui.bat or python launch.py? Try the latter and use python from your conda env with libuv

novel sphinx Aug 17, 2023, 2:09 PM

#

i have tried that same dll error

paper horizon Aug 17, 2023, 2:09 PM

#

(maybe delete the venv directory too

novel sphinx Aug 17, 2023, 2:09 PM

#

and verified libuv is installed with conda list

paper horizon Aug 17, 2023, 2:09 PM

#

https://github.com/vladmandic/automatic/discussions/2023

GitHub

Steps to run SD.Next on native windows with Intel Arc GPU · vladman...

Preparations Install Intel GPU Driver, Build Tool for VS 2022 and oneAPI Base Toolkit by following IPEX installation guide (Windows) Install Miniconda (or Anaconda) Download Miniconda3 - Python 3.1...

#

just recorded my steps

novel sphinx Aug 17, 2023, 2:11 PM

#

i will try full reinstalling through conda and seing what happens

keen marsh Aug 17, 2023, 2:11 PM

#

paper horizon just recorded my steps

is your official python 3.10.12?

paper horizon Aug 17, 2023, 2:11 PM

#

novel sphinx and verified libuv is installed with conda list

then maybe try https://github.com/lucasg/Dependencies to analyze what dependencies are actually missing for miniconda3\envs\sdnext\Lib\site-packages\torch\lib\torch_cpu.dll

GitHub

GitHub - lucasg/Dependencies: A rewrite of the old legacy software ...

A rewrite of the old legacy software "depends.exe" in C# for Windows devs to troubleshoot dll load dependencies issues. - GitHub - lucasg/Dependencies: A rewrite of the old legacy...

#

Python 3.10.12```

keen marsh Aug 17, 2023, 2:12 PM

#

Also, how do you update python 3 to the latest version in windows without installing 3.11

paper horizon Aug 17, 2023, 2:12 PM

#

yes

#

conda create -n {env_name} python=3.10

keen marsh Aug 17, 2023, 2:12 PM

#

I could not for the life of me figure out how to update from 3.10.6, theys topped uploading install files

paper horizon Aug 17, 2023, 2:12 PM

#

it gave me 3.10.12

keen marsh Aug 17, 2023, 2:12 PM

#

I mean your python outside of the venv

#

My theory is that they need to be the same to work, as I couldn't get conda to work with ipex while python was installed in my path

paper horizon Aug 17, 2023, 2:13 PM

#

I didn't install another python other than from miniconda

keen marsh Aug 17, 2023, 2:13 PM

#

Now i can run it without conda as well btw.

#

so no python3 on your system already?

paper horizon Aug 17, 2023, 2:14 PM

#

yes

keen marsh Aug 17, 2023, 2:14 PM

#

Okay, yeah that's the same for me then. Only Vipitis seems to have gotten it to work with python 3 installed

#

although I never tried 3.11

paper horizon Aug 17, 2023, 2:16 PM

#

someone tried 3.11, and IIRC, SD.Next doesn't support 3.11

#

lol

proper cradle Aug 17, 2023, 2:18 PM

#

paper horizon someone tried 3.11, and IIRC, SD.Next doesn't support 3.11

Upscalers and rembg won't work, everything else should work fine.

#

These are the errors you will get:

📎 message.txt

keen marsh Aug 17, 2023, 2:19 PM

#

Also, have you gotten sequential cpu offload to work in windows? @paper horizon

#

It did work with native wheel IIRC, but it doesn't with my prebuilt one for some reason. It may have never worked though and I am misremembering

paper horizon Aug 17, 2023, 2:21 PM

#

keen marsh Also, have you gotten sequential cpu offload to work in windows? <@3516904355944...

haven't tried it yet

novel sphinx Aug 17, 2023, 2:34 PM

#

ok installing and configuring thru conda has been successful thus far

broken grail Aug 17, 2023, 2:51 PM

#

out of curiosity, has anyone gotten any kind of training working on 1.5 or XL?

#

recently

#

I think disty had a patch for inversion for 1.x torch, but it would crash after a few iterations for me

novel sphinx Aug 17, 2023, 2:57 PM

#

it is not inferencing on windows

#

WARNING Torch FP16 test failed: Forcing FP32 operations: Tensor on device meta is not on the expected
device xpu:0!

#

got this

#

hitting generate doesnt throw any errors but literally nothing happens

#

no activity on cpu or gpu

#

and my igpu is disabled so thats not the issue

#

C:\Users\KingOfMemes\anaconda3\envs\sdnext\lib\site-packages\numba\np\ufunc\parallel.py:371: NumbaWarning: The TBB threading layer requires TBB version 2021 update 6 or later i.e., TBB_INTERFACE_VERSION >= 12060. Found TBB_INTERFACE_VERSION = 12020. The TBB threading layer is disabled. this also happens when launching

grave condor Aug 17, 2023, 3:00 PM

#

in taks manager, the GPU activity is hidden under the "compute" graph

novel sphinx Aug 17, 2023, 3:00 PM

#

oh wait i lied this must be long aot thing people have talked about i see some progress happening now

keen marsh Aug 17, 2023, 3:02 PM

#

The prebuilt wheels will take about 10-15 minutes on the first inference. You have to restart the ui each time you change diffusers i think.

novel sphinx Aug 17, 2023, 3:02 PM

#

original backend does not work on windows for me i get api errors diffusers seems to be working

#

i see okay

#

guess ill just let it warm up

keen marsh Aug 17, 2023, 3:02 PM

#

novel sphinx original backend does not work on windows for me i get api errors diffusers seem...

You hqve to complete restart to use it I think

keen marsh Aug 17, 2023, 3:03 PM

#

novel sphinx guess ill just let it warm up

The wheel i posted doesn't have that problem btw(long first generation) just edit that file and drop it into vent after install.

grave condor Aug 17, 2023, 3:04 PM

#

I kinda want to try compiling with AOT for python 3.9 but don't really want to spend 6 hours... and my CPU is even older

#

did you modify the script to just build ipex and use the troch prebuilt wheel instead?

proper cradle Aug 17, 2023, 3:35 PM

#

novel sphinx WARNING Torch FP16 test failed: Forcing FP32 operations: Tensor on device meta ...

What this outputs?

import torch
import intel_extension_for_pytorch as ipex

def test_fp16():
    x = torch.tensor([[1.5,.0,.0,.0]]).to("xpu").half()
    layerNorm = torch.nn.LayerNorm(4, eps=0.00001, elementwise_affine=True, dtype=torch.float16, device="xpu")
    _y = layerNorm(x)
    return True
        
if test_fp16():
    print("Pass")

keen marsh Aug 17, 2023, 3:39 PM

#

grave condor did you modify the script to just build ipex and use the troch prebuilt wheel in...

you can use the torch prebuilt, first time I compiled all but subsequent times I compiled just ipex. Torch and torchvision don't take that long to compile, maybe an hour all together if that.

#

I suggest modifying to compile ipex from here https://github.com/intel/intel-extension-for-pytorch/tree/release/xpu/2.0.110 then no need to edit files. The git# they compiled in the prebuilt is there as well.

GitHub

GitHub - intel/intel-extension-for-pytorch at release/xpu/2.0.110

A Python package for extending the official PyTorch that can easily obtain performance on Intel platform - GitHub - intel/intel-extension-for-pytorch at release/xpu/2.0.110

#

Also, you exchange slow startup for slower inference speed, but it's still fast enough IMO. Especially with diffusers

novel sphinx Aug 17, 2023, 3:51 PM

#

Well, i got an image to generate after the long startup

#

Was close to 10 it/s

#

Then it crashed and locked my whole pc up

grave condor Aug 17, 2023, 3:54 PM

#

I believe you can call torch.compile or torch.jit.trace to get better inference performance down the road

#

but I need it to just quickly work for developing this stuff

keen marsh Aug 17, 2023, 3:55 PM

#

well, google restricted my wheel file for whatever reason

#

Just going to delete it I guess, not worth the review. Not sure many used it anyway.

grave condor Aug 17, 2023, 3:57 PM

#

I will just give it a try and let it run for a while

keen marsh Aug 17, 2023, 4:00 PM

#

I sent it for review, just in case google thinks I'm hacking people or something lol.

grave condor Aug 17, 2023, 4:20 PM

#

keen marsh I suggest modifying to compile ipex from here https://github.com/intel/intel-ext...

is that in like 63 of the script?

chrome bone Aug 17, 2023, 4:21 PM

#

i think he means use this branch instead of xpu master

grave condor Aug 17, 2023, 4:23 PM

#

I got a CMake Warning: Manually-specified variables were not used by the project: and then it lists a few things as well as the USE_AOT_DEVICES which I added. I hope that's just a warning that remains from defaults they used. Don't want to add up with the same JIT variant in two hours

chrome bone Aug 17, 2023, 4:23 PM

#

it is bound to happen

#

DoggoHeart

grave condor Aug 17, 2023, 4:24 PM

#

do we know if there is a trick to check?

#

is there like ipex.aot_enabled()?

chrome bone Aug 17, 2023, 4:24 PM

#

i..dk. the only one who got it to compile is aaron

keen marsh Aug 17, 2023, 4:24 PM

#

grave condor is that in like 63 of the script?

set "VER_IPEX=v2.0.110+xpu" or the specific git#

grave condor Aug 17, 2023, 4:25 PM

#

at the top?

keen marsh Aug 17, 2023, 4:25 PM

#

yes

#

how i got it to compile aot was like this "compile_bundle2.bat 1 2 ats-m150 " (bundle 2 is my edited bat for just ipex)

#

this is using the bat you edited earlier btw

#

also, I did this in conda in the oneapi environment etc

#

outside of conda it failed

#

changed it to .txt just incase there is some sorta issue uploading .bat files

📎 compile_bundle2.txt

#

I recommend trying the specific git# they used tbh, I'm hoping it is faster as I'm not sure if AOT makes it a bit slower or they changed something in the code since then

#

Also it pulls a lot more warnings than when using xpu-master, but it works the same in the end

grave condor Aug 17, 2023, 4:33 PM

#

I now hope my compilation finishes successfully but I will look at the changes.

keen marsh Aug 17, 2023, 4:33 PM

#

No doubt, if you used xpu-master then you will need to edit the init.py file and comment out that line that pulls the error

grave condor Aug 17, 2023, 4:34 PM

#

30 minutes in and it's 488/1049 so an hour seems reasonable

keen marsh Aug 17, 2023, 4:34 PM

#

Really not sure what it's trying to pull from pytorch, but it doesn't exist

#

Oh, if you are using AOT, it will hit 1047 and take about 4 hours from there lol

#

They acknowledged this on the github as well, and say they are trying to fix it.

#

If the cmake exe is running it is still compiling, cpu was around 15% at that point

grave condor Aug 17, 2023, 4:35 PM

#

keen marsh set "VER_IPEX=v2.0.110+xpu" or the specific git#

that's the case with the script on the release branch already.

#

so I am hopeful

keen marsh Aug 17, 2023, 4:36 PM

#

oh nice, I haven't checked since I compiled

grave condor Aug 17, 2023, 4:36 PM

#

they fixed some stuff in the file inside the release branch

#

I grabbed the script from GitHub today so I should have all the fixes

keen marsh Aug 17, 2023, 4:38 PM

#

yeah I see it now, this should work

#

No need to edit anything afterward

grave condor Aug 17, 2023, 4:39 PM

#

I didn't throw out my old ipex version. But there is a force_reinstall. As long as I get the wheel files it should be good

keen marsh Aug 17, 2023, 4:40 PM

#

You can install the prebuilt wheel over it fine, the wheel you compile will be inside the Dist folder

grave condor Aug 17, 2023, 4:46 PM

#

I got some many warnings by now haha

keen marsh Aug 17, 2023, 4:48 PM

#

yeah, lol

grave condor Aug 17, 2023, 5:35 PM

#

90 minutes and I am on 775

#

I need that i9 in my next build

chrome bone Aug 17, 2023, 5:40 PM

#

arrow lake next year

#

built on 20a node

#

would be a shame if you decided to go 14900k

grave condor Aug 17, 2023, 5:44 PM

#

yeah, it doesn't sound like the smartest decision

#

but I have waited long enough and there will always be a next gen.

grave condor Aug 17, 2023, 6:11 PM

#

got to 1047 in around 2 hours

keen marsh Aug 17, 2023, 6:46 PM

#

oof, I would plan to add a couple hours from my estimate. Took maybe an hour or less for me to get there, don't think it took that long.

grave condor Aug 17, 2023, 9:38 PM

#

Successfully installed intel-extension-for-pytorch-2.0.110+git509a378
it took in total around 5 hours 30 minutes. and the step 1047 was reach after 2

#

let's hope this works

grave condor Aug 17, 2023, 9:59 PM

#

it does work, still had a short wait on first inference but it is fast enough to be useful without any specifc tweaking. Well worth the 6 hours.

tall grove Aug 17, 2023, 10:14 PM

#

This wheel file would work for anyone?

#

If so weird they haven't done aot version yet

keen marsh Aug 17, 2023, 10:18 PM

#

Probably don't want to compile for 6 hours, it's slower too so they may be trying to figure that out as well as decrease the compile time.

grave condor Aug 17, 2023, 10:34 PM

#

The wheel I have is for python39

#

seems like the last two hours is just ocloc.exe running

keen marsh Aug 18, 2023, 10:21 PM

#

apparently there are controlnet models that work with sdxl but only in comfyui right now

broken grail Aug 19, 2023, 5:42 PM

#

I can't seem to get img2img upscale on diffusers working

#

It just makes the images noisier (?)

#

hmm, might only be occuring at higher resolutions

#

are these related to VRAM usage? is there some sort of soft cap I'm hitting that drops quality?

#

FWIW

#

hmm, got it working at 1.9 scale...bet this is just a 1024 issue again

#

wonder why 1024 res is so cursed

restive parcel Aug 19, 2023, 7:54 PM

#

especially since that's the exact resolution its meant to work best on

proper cradle Aug 19, 2023, 9:07 PM

#

broken grail It just makes the images noisier (?)

What is you denoise strength?

#

Too low and it will be noisy

#

Too high and it will change the image too much

broken grail Aug 19, 2023, 9:15 PM

#

no, it gets more noisy

#

if you watch the interim images it goes from noisy to noiser

proper cradle Aug 19, 2023, 9:15 PM

#

Base res?

broken grail Aug 19, 2023, 9:15 PM

#

uh

#

I think it was like 1024x1280

#

probably 1024 issue

#

proper cradle Aug 19, 2023, 9:16 PM

#

So hires is 2048x2560?

#

1024 curse shouldn't hapen at 2048

broken grail Aug 19, 2023, 9:16 PM

#

no sorry base was 512x 640

proper cradle Aug 19, 2023, 9:17 PM

#

Yep, probably 1024 curse

broken grail Aug 19, 2023, 9:17 PM

#

that's so odd

proper cradle Aug 19, 2023, 9:17 PM

#

try 1080

broken grail Aug 19, 2023, 9:17 PM

#

I did 1.9x scaling and that fixed it

#

I wonder what the cause of that 1024 bug is

proper cradle Aug 19, 2023, 9:17 PM

#

broken grail FWIW

Also enabling both move base options will save 6 GB VRAM without any performance loss

proper cradle Aug 19, 2023, 9:18 PM

#

proper cradle Also enabling both move base options will save 6 GB VRAM without any performance...

If you ran out of memory when VAE decoding*

#

Or using refiner

broken grail Aug 19, 2023, 9:18 PM

#

Sure

#

I take it latent upscale is also broken on diffusers right

proper cradle Aug 19, 2023, 9:19 PM

#

Also hires is working on the dev branch

broken grail Aug 19, 2023, 9:19 PM

#

sweet

broken grail Aug 23, 2023, 2:32 PM

#

there's no ControlNet on diffusers, right? What would it take to get working?

grave condor Aug 23, 2023, 3:00 PM

#

there is StableDiffusionControlNetPipeline directly in diffusers. I used it today for a project

keen marsh Aug 23, 2023, 3:03 PM

#

broken grail there's no ControlNet on diffusers, right? What would it take to get working?

There is I believe, just not for sdxl or at least not in the webui version yet.

broken grail Aug 23, 2023, 3:05 PM

#

grave condor there is StableDiffusionControlNetPipeline directly in diffusers. I used it toda...

nice

grave condor Aug 23, 2023, 3:07 PM

#

there was a ControlNet for XL chapter in the docs https://huggingface.co/docs/diffusers/api/pipelines/controlnet_sdxl

ControlNet with Stable Diffusion XL

keen marsh Aug 23, 2023, 3:17 PM

#

you can use controlnet is comfyUI btw. Both controlnet and controlnet loras

proper cradle Aug 23, 2023, 3:26 PM

#

broken grail there's no ControlNet on diffusers, right? What would it take to get working?

Needs a new UI work

coral mulch Aug 23, 2023, 8:32 PM

#

keen marsh you can use controlnet is comfyUI btw. Both controlnet and controlnet loras

Is ComfyUI as performant as vladmantic yet?

keen marsh Aug 23, 2023, 9:50 PM

#

coral mulch Is ComfyUI as performant as vladmantic yet?

I don't think so, I haven't used it but the one time. Don't think they have any of the ipex optimizations it's just barebones support

#

only tried in native windows though

open sundial Aug 24, 2023, 9:59 AM

#

coral mulch Is ComfyUI as performant as vladmantic yet?

No idea how much performance you can get out of Vlad, but I'm generating 4k resolution images in about 180 seconds using an RTX 3080 10GB on ComfyUI

#

And if I wanted to make it 8K resolution, it would take ~800 seconds

#

Does Vlad produce 4-8k imagery without VRAM errors?

#

on 10GB VRAM?

#

https://civitai.com/models/133287?modelVersionId=146688

GigerCraftXL - v9.6 BF16 (RTX Graphics) | Stable Diffusion LoRA | C...

Happy Halloween! Almost... Make sure you select the version for your graphics card at the top. FP16 for GTX cards and older and BF16 for RTX and ne...

#

Also Enjoy my new LORA for horror style 😄

keen marsh Aug 24, 2023, 2:58 PM

#

open sundial No idea how much performance you can get out of Vlad, but I'm generating 4k reso...

Thats Nvidia, Arc needs optimization

proper cradle Aug 24, 2023, 3:09 PM

#

open sundial Does Vlad produce 4-8k imagery without VRAM errors?

A770 16 GB can generate direct 4096x4096 without --medvram or --lowvram

#

https://github.com/vladmandic/automatic/wiki/SD-XL#vram-optimization

GitHub

SD XL

SD.Next: Advanced Implementation of Stable Diffusion - vladmandic/automatic

open sundial Aug 25, 2023, 6:32 PM

#

proper cradle A770 16 GB can generate direct 4096x4096 without --medvram or --lowvram

Nice. But it should be able too with 16GB of VRAM.

#

Would be nice to see how it runs on the lower VRAM cards

proper cradle Aug 25, 2023, 6:33 PM

#

SDXL 1024x1024 can run on 2GB GPUs with --lowvram

open sundial Aug 25, 2023, 6:35 PM

#

Fair. That's pretty decent. I'm on a 10GB VRAM + 16GB of system ram and producing 4k images using only Tiled VAE, A larger than average Page/Swap file and fp16 precision.

#

I've produced 8k and larger, but it just take 10 minutes plus

proper cradle Aug 25, 2023, 6:36 PM

#

proper cradle SDXL 1024x1024 can run on 2GB GPUs with --lowvram

1 GB with FP16 and 2 GB with FP32. 2 GB GPUs generally doesn't support FP16.

open sundial Aug 25, 2023, 6:37 PM

#

That is unfortunate considering that FP16 is only useful for low end GPU's really

#

I mean, if it weren't for the lag that it causes my system, I'd still be using the fp32 vae myself

proper cradle Aug 25, 2023, 6:38 PM

#

open sundial I mean, if it weren't for the lag that it causes my system, I'd still be using t...

BF16 runs fine

#

Or use this:
https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/blob/main/sdxl_vae.safetensors

sdxl_vae.safetensors · madebyollin/sdxl-vae-fp16-fix at main

open sundial Aug 25, 2023, 6:38 PM

#

Not if you have a card pre-RTX

#

Bf16 is RTX only

#

For nvidia anyway

proper cradle Aug 25, 2023, 6:39 PM

#

Intel ARC defaults to BF16 on SDNext

open sundial Aug 25, 2023, 6:39 PM

#

Awesome, because BF16 is the most optimum for SDXL

proper cradle Aug 25, 2023, 6:39 PM

#

BF16 generally runs faster on ARC than FP16

open sundial Aug 25, 2023, 6:39 PM

#

Interesting.

#

I released a BF16 LORA yesterday, then found that all GTX users need fp16 lol

#

so I had to put out 2 versions lol

#

Has anyone done any Arc A380 testing then?

With StableSwarm now a thing, the opportunity to generate multiple batches of images, per graphics card, simultaneously is now a thing.

#

Does anyone have any experience using StableSwarm with multiple GPU brands?

proper cradle Aug 25, 2023, 6:53 PM

#

Splitting batches to multiple GPUs were already a thing?

proper cradle Aug 25, 2023, 6:53 PM

#

open sundial Does anyone have any experience using StableSwarm with multiple GPU brands?

Probably possible with a proper server to connect to seperate APIs.

open sundial Aug 25, 2023, 6:54 PM

#

proper cradle Splitting batches to multiple GPUs were already a thing?

Of course it was already possible, but with the new StableSwarm API, all of the features of ComfyUI can be utilized across multiple GPU's remotely acrosss a network or over the internet.

#

And it's just more accessible in general than older methods

#

I have an A380 as a secondary and I'm just checking what options I have for potential workflow improvements

onyx moth Aug 26, 2023, 12:39 AM

#

hello, I tried OpenVINO for a while and it's not quite there yet, comfy looks like it still has some issues with it too. can someone direct me to a solid guide for sd.next? I want to run 1.5 and eventually sdxl. thanks!

#

also if theres some easy to follow documentation on what it can and cant do I would love to see it. Intel Arc A770 16GB

#

would I just follow the instruction on the pinned post in this thread? edit: it looks like this is the way

keen marsh Aug 26, 2023, 1:11 AM

#

onyx moth would I just follow the instruction on the pinned post in this thread? edit: it ...

If you mean openvino support, I think you just git clone sd.next, then run webui.bat --open-vino and i believe it will set everything for you, just don't change it from fp32.

onyx moth Aug 26, 2023, 1:23 AM

#

keen marsh If you mean openvino support, I think you just git clone sd.next, then run webui...

I keep hearing ipex works better so I'm trying to go with that

proper cradle Aug 26, 2023, 1:24 AM

#

https://github.com/vladmandic/automatic/wiki/Intel-ARC

GitHub

Intel ARC

SD.Next: Advanced Implementation of Stable Diffusion - vladmandic/automatic

coral mulch Aug 26, 2023, 1:24 AM

#

proper cradle https://github.com/vladmandic/automatic/wiki/Intel-ARC

Should be pinned.

keen marsh Aug 26, 2023, 1:24 AM

#

If you mean native windows, then it's way more complicated as you have to also compile it yourself. If you want to use IPEX go for wsl2, the install is a lot mroe involved than openvino

#

If you don't compile, you have to wait 10-15 minutes before your first generation, but after that it is pretty fast

#

https://intel.github.io/intel-extension-for-pytorch/xpu/latest/tutorials/installation.html

onyx moth Aug 26, 2023, 1:27 AM

#

I just went through this install: https://www.technopat.net/sosyal/konu/using-stable-diffusion-webui-with-intel-arc-gpus.2593077/

Technopat Sosyal

Guide: Using Stable Diffusion WebUI with Intel ARC GPU's

In this guide, we will install and use Stable Diffusion WebUI SD.Next with Intel ARC GPU's.
Intel PyTorch Library doesn't have native support for Windows so we have to use Native Linux or Linux via WSL.

Setup WSL on Windows:
Follow these instructions to setup Linux environment in Windows, then...

#

Its the same right?

proper cradle Aug 26, 2023, 1:28 AM

#

onyx moth I just went through this install: https://www.technopat.net/sosyal/konu/using-st...

Same but forgot to update ons thing

#

Run;

sudo apt install libjemalloc-dev

onyx moth Aug 26, 2023, 1:29 AM

#

rn_image_picker_lib_temp_f3db12c8-42e4-4e96-8a8d-f16e82e42492.jpg

#

This is where I'm at can I run that after it's done

#

Thanks man.

#

📎 message.txt

#

I ran libjemalloc, did I miss something?

grave condor Aug 26, 2023, 1:44 AM

#

I am writing a small controlnet app and trying to run it on my A750. But it is really slow. What is the trick to speed up inference with the different models. or which file do I need to look at the find the solution.
As this will be hosted it needs to be device agnostic.

onyx moth Aug 26, 2023, 2:00 AM

#

was I supposed to install libjemalloc in automatic folder? I did it after going cd

keen marsh Aug 26, 2023, 2:09 AM

#

onyx moth

it's best to use webui.py to run in the venv, not sure if that's causing your error though

pastel geode Aug 26, 2023, 2:24 AM

#

onyx moth

I got this error before.
./shared/source/os_interface/os_interface.h
Do you happen to have your igpu enabled? Check to see if you have 2 gpus in task manager. I managed to fix mine after disabling igpu multimonitor on my asus motherboard.
After that, try running it again. If it doesnt work, Reinstall.
#1084296011675082843 message

onyx moth Aug 26, 2023, 2:25 AM

#

oh, this? gpu 0

pastel geode Aug 26, 2023, 2:26 AM

#

onyx moth oh, this? gpu 0

yea

#SDNext WebUI on Intel ARC