ComfyUI for Intel Arc using IPEX | Intel Insiders Community | Page 4

karmic jasper Nov 1, 2024, 4:42 PM

#

cause i drop that in comfy and nothing loads

#

tried another one, that also brings nothing into comfyui, i don't get it

#

yeah, none of the json files bring anything in. not sure whats wrong

#

still getting this

reef ivy Nov 1, 2024, 4:57 PM

#

karmic jasper tried another one, that also brings nothing into comfyui, i don't get it

try zooming out and scrolling. The workflows seem to be in a different place or hit reset view

karmic jasper Nov 1, 2024, 5:02 PM

#

reef ivy try zooming out and scrolling. The workflows seem to be in a different place or...

yeah its not that

#

it's nowhere

reef ivy Nov 1, 2024, 5:02 PM

#

karmic jasper still getting this

do you have all 3 encoder files?

#

Also make sure you select everything and make sure it loads, sometimes the workflow has the wrong name for files or you have them in a different place or different name etc

reef ivy Nov 1, 2024, 5:16 PM

#

karmic jasper yeah its not that

I have no idea, you can try downloading the cogvideox nodes in manager first maybe. Everything loads fine for me. Also what browser are you using? Sometimes that can have issues with webui's. I am on firefox, and have used chrome/edge(though not with cogvideo yet)

#

Oh, and make sure comfy is up to date

karmic jasper Nov 1, 2024, 7:13 PM

#

reef ivy I have no idea, you can try downloading the cogvideox nodes in manager first may...

I'm also on FF

karmic jasper Nov 1, 2024, 7:13 PM

#

reef ivy Oh, and make sure comfy is up to date

It should be.

brisk blade Nov 1, 2024, 7:38 PM

#

Curious. How are you running this? native linux install? Windows? WSL?

reef ivy Nov 1, 2024, 7:56 PM

#

I am in native windows, but the 4gb issue is with arc in general.

#

The hikacks bypass it somehow(have to ask disty about that)

earnest grotto Nov 1, 2024, 8:01 PM

#

brisk blade Curious. How are you running this? native linux install? Windows? WSL?

It's a hardware limitation
It (shouldn't be) the end of the world, it should always be possible to work around in software, but it's still not great

earnest grotto Nov 1, 2024, 8:01 PM

#

reef ivy The hikacks bypass it somehow(have to ask disty about that)

Split things up into smaller chunks

reef ivy Nov 1, 2024, 8:02 PM

#

karmic jasper It should be.

Do other workflow jsons work proper? How are you downloading and loading into the UI?

brisk blade Nov 1, 2024, 8:04 PM

#

earnest grotto It's a hardware limitation It (shouldn't be) the end of the world, it should alw...

ah ok my bad

karmic jasper Nov 1, 2024, 8:17 PM

#

reef ivy Do other workflow jsons work proper? How are you downloading and loading into t...

Yes, the flux one I saved works fine

reef ivy Nov 1, 2024, 9:04 PM

#

karmic jasper Yes, the flux one I saved works fine

did you install the cogvideoxwrapper nodes? you can do it in the manager, or git pull. I honestly have absolutely no clue why these would work different for you. Even if you don't have them installed they should show up red.

karmic jasper Nov 1, 2024, 9:07 PM

#

reef ivy did you install the cogvideoxwrapper nodes? you can do it in the manager, or gi...

i'll try to install them

karmic jasper Nov 1, 2024, 9:40 PM

#

reef ivy did you install the cogvideoxwrapper nodes? you can do it in the manager, or gi...

the json files still don't work

reef ivy Nov 1, 2024, 9:41 PM

#

I have no clue, only thing with me is they aren't centered so you have to zoom out compared to other workdlows. If comfy is updated I have no idea.

karmic jasper Nov 1, 2024, 9:41 PM

#

yeah, i zoomed out and everything

#

lets see if i can update comfy

#

nope, its latest version

reef ivy Nov 1, 2024, 11:27 PM

#

Try and git pull in the comfy folder

reef ivy Nov 1, 2024, 11:31 PM

#

karmic jasper yeah, i zoomed out and everything

try and drag and drop this image, it should give you the fun workflow for img2vid

karmic jasper Nov 1, 2024, 11:32 PM

#

reef ivy try and drag and drop this image, it should give you the fun workflow for img2vi...

will do, running an llm atm.

karmic jasper Nov 1, 2024, 11:36 PM

#

reef ivy try and drag and drop this image, it should give you the fun workflow for img2vi...

yup, this one worked. Told me im missing two nodes

#

Gotta generate an image to test it with

karmic jasper Nov 2, 2024, 12:02 AM

#

ok, it seems to be working!

#

Thank you mate, much appreciated

reef ivy Nov 2, 2024, 12:30 AM

#

no doubt, glad it worked.

karmic jasper Nov 2, 2024, 12:51 AM

#

reef ivy no doubt, glad it worked.

however im getting gifs... and if i pick a video format i get an error.

#

something about format

reef ivy Nov 2, 2024, 12:53 AM

#

karmic jasper however im getting gifs... and if i pick a video format i get an error.

You need to install the requirements for the video nodes, i had the same issue and it wouldn't install automatically. You have to activate env and go to the folder and pip install -r requirements. I will give you the name in a minute

#

This effects all the video output bot just cog video

#

start your venv manually and then navigate here ...ComfyUI\custom_nodes\ComfyUI-VideoHelperSuite and then pip install -r requirements.txt make sure your in that folder you dont' want to install the default comfy requirements

#

This should install the needed encoder in the venv, I tried updating the node through comfy but this was the only way it worked for me

native jackal Nov 2, 2024, 8:53 AM

#

Hi Guys, I just bought a laptop with Intel Core Ultra 7 155H CPU with 32gb RAM. I just generated an image on SD3.5 medium model with the example workflow provided by stability AI and it took 1 hours and 12 minutes to generate! Is there a way to accelerate this to more reasonable waiting times (like 10 minutes)? I appreciate the help!

karmic jasper Nov 2, 2024, 8:55 AM

#

native jackal Hi Guys, I just bought a laptop with Intel Core Ultra 7 155H CPU with 32gb RAM. ...

What you really need for any AI application nowadays is dedicated graphics cards. So I don't think there is much you can do

native jackal Nov 2, 2024, 8:57 AM

#

I was afraid this would be the answer 😂 Anyway thank you for quick reply 👍

earnest grotto Nov 2, 2024, 9:01 AM

#

I don't think 1 hour is quite right regardless 🤔

earnest grotto Nov 2, 2024, 9:02 AM

#

native jackal Hi Guys, I just bought a laptop with Intel Core Ultra 7 155H CPU with 32gb RAM. ...

did you specifically install ipex or did you just clone comfyui and run it?

native jackal Nov 2, 2024, 9:43 AM

#

I installed ipex and then followed the instructions on the first post of this channel but got this error when try to pip install -r requirements-ipex-ultra.txt. Any help is highly appreciated 🙏

#

earnest grotto Nov 2, 2024, 10:03 AM

#

native jackal I installed ipex and then followed the instructions on the first post of this ch...

What version of python are you using

native jackal Nov 2, 2024, 10:24 AM

#

3.11.0

reef ivy Nov 2, 2024, 3:59 PM

#

native jackal 3.11.0

you need 3.10

nocturne fjord Nov 2, 2024, 8:19 PM

#

Hello, has anyone managed to get comyui's Flux Train to work with a 770. I get the following error at the start of the workout "RuntimeError: FP64 data type is unsupported on current platform."

#

Loading: ComfyUI-Inspire-Pack (V1.6)

Total VRAM 15931 MB, total RAM 32127 MB
pytorch version: 2.3.1+cxx11.abi
Set vram state to: NORMAL_VRAM
Device: xpu

Loading: ComfyUI-Manager (V2.51.9)

ComfyUI Revision: 2804 [cc9cf6d1] | Released on '2024-10-31'

#

Python main.py --auto-launch --bf16-unet --disable-ipex-optimize --fast with hijack

reef ivy Nov 2, 2024, 8:29 PM

#

Only pytorch version that supports arc is 2.5 or ipex afaik

#

And fp64 isn't supported natively, have the same issue trying to use frame interpolation ATM.

nocturne fjord Nov 2, 2024, 8:40 PM

#

I guess it's not a good idea to install pytorch 2.5 because it's much too recent?

reef ivy Nov 2, 2024, 9:34 PM

#

Only issue I have heard is that it is slower

#

But it supports intel natively

#

I am stll on ipex though, as speed is important to me lol

rocky sedge Nov 2, 2024, 10:39 PM

#

i am using comfyUi with stability matrix

#

and currently using ultra 7 258v with intel arc

#

Can anyone suggest ways to make better use of resources?

reef ivy Nov 3, 2024, 12:19 AM

#

Found a workflow for extended videos in cogvideox-fun models, used the 5b-fun since 2b is kinda nonsense so far. Real low res but I may try an video upscale node. With fun models you can have a start and end image, not sure how to do it with the regular and this work flow is crazy looking lol.

#

Here is workflow, you will get some errors but the answer is in one of the comments https://civitai.com/models/792021/cogvideox-image-to-video-with-video-extension-x7-low-vram

#

Honestly, pretty cool for a video on an a750 intel arc gpu that couldn't even generate images over a year ago. Progress is crazy.

earnest grotto Nov 3, 2024, 1:21 AM

#

nocturne fjord Hello, has anyone managed to get comyui's Flux Train to work with a 770. I get t...

Yes (not this specific trainer), however I couldn't get any loras to work, ones I trained or others

#

Has this changed with 2.5?

earnest grotto Nov 3, 2024, 1:51 AM

#

Cuz when I tested with earlier 2.5, it was still borked

reef ivy Nov 3, 2024, 3:58 AM

#

You don't have the t5 model loaded , you have the vit L loaded twice

reef ivy Nov 3, 2024, 4:24 AM

#

The cpu offload option gave me errors like it wasn't supported. You got the 5b gguf to work? They also gave me errors, I did just update maybe I will try again. Also haven't actually tried the gguf clip models are they faster with same precision?

earnest grotto Nov 3, 2024, 7:00 PM

#

So, no one else either can get flux loras working?

reef ivy Nov 3, 2024, 8:22 PM

#

earnest grotto So, no one else either can get flux loras working?

They work for me? I use gguf models though.

earnest grotto Nov 3, 2024, 8:23 PM

#

reef ivy They work for me? I use gguf models though.

Regular (fp8) loras work on GGUF models?

reef ivy Nov 3, 2024, 8:23 PM

#

Tried detail lora's and turbo lora's

#

So far yeah, but I think not all loras work

earnest grotto Nov 3, 2024, 8:24 PM

#

Do you wanna test out a lora for me? Are you using 2.5?

reef ivy Nov 3, 2024, 8:24 PM

#

No, I am on Ipex.

#

Here is a workflow I am building, it has a lora loader. I think it should show the lora I am using. https://civitai.com/models/876388/flux1-turbo-alpha

#

I can try the lora, but I am using Ipex

#

Also, is there a way to update comfyui without having to re-add the ipex hijacks each time?

#

I keep forgetting then get the 4gb error lol

earnest grotto Nov 3, 2024, 8:30 PM

#

reef ivy Also, is there a way to update comfyui without having to re-add the ipex hijacks...

My script can do that, but it will also re-create its conda environment every time too
otherwise, no, you gotta keep re-adding it

reef ivy Nov 3, 2024, 8:34 PM

#

I tried a bunch of git stash commands and it "worked" but completely broke the file lol.

earnest grotto Nov 3, 2024, 8:34 PM

#

reef ivy I can try the lora, but I am using Ipex

https://mega.nz/file/HMo0WIjB#RfUIoBHrv7YCCaWjHlNsQEeJnO0M2L_ERuJ-BnU2Wyo
https://mega.nz/file/eMBlGIIZ#sXLgBLG9RvFJ2I_tzhoU71Y5YSa10udOIrIJxfWJFvs
It's a character lora. Shouldn't matter which one. Example prompt, "Anime style drawing of Wann kneeling on a reflective floor"

earnest grotto Nov 3, 2024, 8:35 PM

#

reef ivy I tried a bunch of git stash commands and it "worked" but completely broke the f...

just plain git stash is enough, no extra stuff, in case you want to actually save the changes you did (which you probably don't)
otherwise git restore ., then pull, then re-edit

reef ivy Nov 3, 2024, 8:39 PM

#

Should lora be full strength?

earnest grotto Nov 3, 2024, 8:40 PM

#

yes

#

the expected result is not a pure black image

reef ivy Nov 3, 2024, 8:42 PM

#

earnest grotto Nov 3, 2024, 8:42 PM

#

so it works

#

nice

#

i'll go plop my hacks on github later

reef ivy Nov 3, 2024, 8:43 PM

#

Maybe issue with pytorch 2.5?

earnest grotto Nov 3, 2024, 8:44 PM

#

I had tried with both 2.5 and 2.3, and it was pure black

#

May be something fixed with windows drivers now, idk

#

actually, training might still not work on windows, I'll have to try and see

reef ivy Nov 3, 2024, 9:34 PM

#

earnest grotto May be something fixed with windows drivers now, idk

Oh it could be I am still on older drivers because of the vram issue, I am on 5971. But have you tried linux?

earnest grotto Nov 3, 2024, 9:35 PM

#

This was both on linux and windows. I only train on linux, I don't use wsl and training on windows natively hasn't worked for a while

#

actually i may have only tried linux, i don't remember anymore, i'll try again

reef ivy Nov 3, 2024, 10:07 PM

#

Strange, it could be the driver I guess.

#

Nice! How many steps did you use? Just a heads up you can get decent quality with like 10 steps.

reef ivy Nov 3, 2024, 10:35 PM

#

bf16 is the only one that works, and the fp8 converter thing doesn't work either so no speed boosts for us at all(at least none that worked for me). I need to try the cpu offload again, it also gave errors last I tried.

wicked fulcrum Nov 4, 2024, 2:47 PM

#

All we're looking at ComfyUI as an optional backend to AI Playground.
With this we'd commit workflows that integrate well with AI Playground and provide value added features.

I'm checking if this community would be interested in submitting workflows for us to test and review for this.

If interest, I'll create a ComfyUI workflow thread here, for shared workflows.

earnest grotto Nov 4, 2024, 11:21 PM

#

What about workflows that need specific custom nodes?

#

And/or potentially even models

#

Specifically:
Upscaling with upscale models (tons of those, different results), like realesrgan-x4plus, the nodes for this are in by default but no model; good for textures and anime
Upscaling with SUPIR, needs nodes and model, good for normal realistic images
Inpainting with powerpaint, needs nodes and model, uses a non-inpainting SD1.5 model, does object removal much better than regular inpainting models, can be better than regular models besides that but that stuck with me

wicked fulcrum Nov 5, 2024, 1:56 AM

#

earnest grotto What about workflows that need specific custom nodes?

Custom Nodes and models are Ok.

#

We'll probably look at creating a manifest for a workflow, everything needed for it, with a user controlled option to download and install.

The harder part is input types images, masks etc. If AI Playground already has the input then we can map. If it doesn't then that would be harder to implement

reef ivy Nov 5, 2024, 3:18 AM

#

What would be cool is to have a toggle to view the nodes and edit etc. If you ever did music there is a program called reason where you can toggle and view everything like a hardware setup and mess with the connections then flip back to ui.

earnest grotto Nov 5, 2024, 3:29 PM

#

@reef ivy Do you use Linux/WSL and if yes, would you like to test out training a flux lora through comfyui on your 8gb gpu?

#

xpu-smi is telling me my vram usage is consistently below 8gb. It is only getting polled something like every 20 seconds but it certainly got me thinking

#

i kinda don't think there's a vram equivalent/argument for ulimit to test myself 🤔

reef ivy Nov 5, 2024, 6:39 PM

#

I have wsl but haven't used it in almost a year, also no clue how to train

reef ivy Nov 5, 2024, 8:53 PM

#

https://blog.comfy.org/mochi-1/ Mochi in comfy, there is a 9gb model under low ram solutions and a fp8 clip, someone ran it on a 3060 12gb.

Comfy Org Blog

Run Mochi in ComfyUI with consumer GPU

We are excited to announce that ComfyUI now has optimized support for Genmo’s latest model, Mochi! This integration brings state-of-the-art video generation capabilities to the ComfyUI community, even if you're working with consumer-grade GPUs.

The weights and architecture for Mochi 1 (480P) are open and available, and Mochi 1

reef ivy Nov 5, 2024, 10:47 PM

#

Seems like it will take at least an hour to genetate at 30 steps, but seems to be working so far. Not sure I want to wait that long

reef ivy Nov 6, 2024, 1:18 AM

#

it works

#

Takes like an hour on a750 with default settings, I ran it i with less steps and lower res and it sped up a little, took like 17 minutes but lower res the output was bad.

#

make sure you update comfy

#

and use the workflow they provide, should be able to click and drag tha photo

#

I'm not sure, I update manually and have an old script that doesn't have that option.

#

Just have to re-edit the file for hijacks each time

earnest grotto Nov 6, 2024, 1:22 AM

#

It does, just re-run it

#

Also updates the custom nodes it installs, if they're already installed

#

It doesn't update edited files, i should go fix that

reef ivy Nov 6, 2024, 1:24 AM

#

If you get it working let me know how fast it goes, it's real slow on a750. And i think I read it's also slow on amd, but not sure what they were using.

#

so far less frames doesn't seem to speed up generation much if at all, and lower res only does to a point. So it's much diffrent from cogvideo

#

might let it sit for an hour and see how good it is, but don't feel like it now

#

Are you running their workflow? or the one from mochiwrapper? I am using the one they posted

#

I am using the scaled version

earnest grotto Nov 6, 2024, 1:44 AM

#

  if bias: bias = bias.to('cpu')
  o = torch._scaled_mm(inn.to('cpu'), w.to('cpu'), out_dtype=input.dtype, bias=bias, scale_a=scale_input, scale_b=scale_weight).to(inn.device)

there, quick dirty hack

#

might not work since with fp8 i know things might also not be implemented for cpu

reef ivy Nov 6, 2024, 1:45 AM

#

I just changed this to 5 steps because it's so slow but here is the workflow, it is default 30.

#

you should also be able to drag that video I posted earlier after downloading it #1193952640225267802 message

#

that resolution is super low though lol

#

also, it's much faster now. Maybe I was having some issues when i tried, sometimes comfy goes slow until I re start it.

#

oom'd at vae, need to use tiled probably

earnest grotto Nov 6, 2024, 1:49 AM

#

yeah some bot here deletes things

#

no idea why

reef ivy Nov 6, 2024, 1:50 AM

#

If a piece of code is too long it auto deletes, probably for security reasons.

earnest grotto Nov 6, 2024, 1:50 AM

#

rip dan

earnest grotto Nov 6, 2024, 1:50 AM

#

reef ivy If a piece of code is too long it auto deletes, probably for security reasons.

it's short

#

banned

reef ivy Nov 6, 2024, 1:51 AM

#

It deletes with like more than a couple lines

reef ivy Nov 6, 2024, 1:51 AM

#

earnest grotto banned

wtf, wow

earnest grotto Nov 6, 2024, 1:51 AM

#

I've posted longer that didn't get deleted

reef ivy Nov 6, 2024, 1:52 AM

#

It's been deleting almost everything for me, if it's longer than like 2 lines maybe even one line sometimes

earnest grotto Nov 6, 2024, 1:52 AM

#

What he was posting was fairly short, and I think sometimes he posted my thing copypasted without the code block?

reef ivy Nov 6, 2024, 1:52 AM

#

even with code block it deletes for me sometimes

#

it's a recent issue

earnest grotto Nov 6, 2024, 1:53 AM

#

reef ivy it's a recent issue

nah, i've had it happen for quite a while

reef ivy Nov 6, 2024, 1:53 AM

#

Who should we contact to get him unbanned?

earnest grotto Nov 6, 2024, 1:54 AM

#

no idea who specifically

#

204342691964780546

#

IDK how i'd turn that into a mention, dammit discord

somber trellis Nov 6, 2024, 1:55 AM

#

Wow.

earnest grotto Nov 6, 2024, 1:55 AM

#

Well that was resolved fast

reef ivy Nov 6, 2024, 1:55 AM

#

They deleted all your posts too it seems?

earnest grotto Nov 6, 2024, 1:55 AM

#

normal for a ban

somber trellis Nov 6, 2024, 1:56 AM

#

It was three lines, 277-280 of ops.py

#

Was I to replace that with your code.

earnest grotto Nov 6, 2024, 1:56 AM

#

either way, replace the 2nd line you posted which does have bias, with

  o = torch._scaled_mm(inn.to('cpu'), w.to('cpu'), out_dtype=input.dtype, bias=bias.to('cpu'), scale_a=scale_input, scale_b=scale_weight).to(inn.device)

#

and 4th one which doesn't have bias with

  o = torch._scaled_mm(inn.to('cpu'), w.to('cpu'), out_dtype=input.dtype, scale_a=scale_input, scale_b=scale_weight).to(inn.device)

#

try again

#

might still not work, stuff is unimplemented for fp8

somber trellis Nov 6, 2024, 1:59 AM

#

#

I should've just posted an image.

#

There, that look correct?

earnest grotto Nov 6, 2024, 1:59 AM

#

i wonder, do spambots post code that often? because I have not seen any in a different server I'm in that gets spambots fairly often

earnest grotto Nov 6, 2024, 2:00 AM

#

somber trellis There, that look correct?

yea

#

might also be giga slow though 🤔

somber trellis Nov 6, 2024, 2:02 AM

#

17.80it/s

#

on bf16

#

set to fp8 weights via the 'load diffusion model' node.

#

Because I remembered

#

#

upbeat crow Nov 6, 2024, 2:06 AM

#

If anyone gets banned in the future please just contact a mod, we can fix the issue immediately. sorry for auto mod.

reef ivy Nov 6, 2024, 2:08 AM

#

somber trellis

wow, that is pretty fast tbh. I can't get the vae to work, tiled vae isn't working. Gonna try the mochi decode node

reef ivy Nov 6, 2024, 2:08 AM

#

upbeat crow If anyone gets banned in the future please just contact a mod, we can fix the is...

Do you know what's up with the code strings? Seems like it auto deletes and bans people now, should we just not post it like that anymore?

upbeat crow Nov 6, 2024, 2:11 AM

#

reef ivy Do you know what's up with the code strings? Seems like it auto deletes and ban...

to my knowledge nothing was changed, let me reach out to the admins and double check things. Just keep posting now, if anything happens just ping any mod. I wanna make sure you guys have freedom to post in here

reef ivy Nov 6, 2024, 2:12 AM

#

okay, thanks a lot. Appreciate it

reef ivy Nov 6, 2024, 3:36 AM

#

can't get the vae to work, tiled vae gives errors and the mochiwrapper nodes refuse to install. Guess low res is all i can do

somber trellis Nov 6, 2024, 3:52 AM

#

reef ivy can't get the vae to work, tiled vae gives errors and the mochiwrapper nodes ref...

the vae decodes with the mochi node is causing me problems using it with the gguf variant

#

https://github.com/kijai/ComfyUI-MochiWrapper

#

https://huggingface.co/Kijai/Mochi_preview_comfy/tree/main

reef ivy Nov 6, 2024, 4:54 AM

#

I cant get it to even install at all, will try another day.

somber trellis Nov 12, 2024, 1:50 AM

#

If anyone else can get cogvideo 5b working that'd be cool.

#

I wish we could run mochi but that's 24gb vram

#

🤷‍♂️

reef ivy Nov 12, 2024, 2:47 AM

#

somber trellis If anyone else can get cogvideo 5b working that'd be cool.

Can get the fun models to work since I can lower the resolution and frames etc

#

5b regular can only run at a set res and frames

sour depot Nov 13, 2024, 1:02 PM

#

,,,sam loader(facedetailer) dont work on ipex 2.3.110 but work fine with 2.1.40, any fix?

📎 4384a4478d8ef372.txt

upbeat crow Nov 13, 2024, 1:32 PM

#

Just checking in, have issues cleared up??

#

In this chat?

earnest grotto Nov 13, 2024, 1:49 PM

#

upbeat crow Just checking in, have issues cleared up??

For me, it was happening mostly when posting blocks of code
I haven't done that recently and I don't have any code to post right now, so, 🤷
If it happens again, I'll DM you the offending code

upbeat crow Nov 13, 2024, 2:44 PM

#

We changed limits in this channel only, it should help but still curious

somber trellis Nov 14, 2024, 7:24 PM

#

@earnest grotto The latest IPEX version has issues with Florence2 and reading filepaths

#

the florence2node by kijai

#

Works fine on the previous version.

earnest grotto Nov 14, 2024, 7:28 PM

#

ipex doesn't read files

#

what do you mean

somber trellis Nov 14, 2024, 7:29 PM

#

I'll show you the error.

#

It's probably some dependency ipex uses, but I don't know why.

#

📎 message.txt

devout tangle Nov 14, 2024, 7:38 PM

#

How do i install comfy and llms using pytorch 2.5.1 instead of ipex

#

Is it better than ipex?

civic charm Nov 14, 2024, 7:39 PM

#

somber trellis It's probably some dependency ipex uses, but I don't know why.

IPEX hijacks transformers and that hijack fails

#

Replace import ipex with this:

try:
    import transformers # ipex hijacks transformers and makes it unable to load a model
    backup_get_class_from_dynamic_module = transformers.dynamic_module_utils.get_class_from_dynamic_module
    import intel_extension_for_pytorch as ipex
    ipex.llm.utils._get_class_from_dynamic_module = backup_get_class_from_dynamic_module
    transformers.dynamic_module_utils.get_class_from_dynamic_module = backup_get_class_from_dynamic_module
except Exception:
    pass

somber trellis Nov 14, 2024, 7:41 PM

#

civic charm Replace import ipex with this: ``` try: import transformers # ipex hijacks t...

Where do you want me to put this, just in case I'm an idiot.

civic charm Nov 14, 2024, 7:41 PM

#

find the import intel_extension_for_pytorch in model management

somber trellis Nov 14, 2024, 7:41 PM

#

oh ye

#

👍

somber trellis Nov 14, 2024, 7:44 PM

#

civic charm find the `import intel_extension_for_pytorch` in model management

What about ipex_to_cuda? That's the same location where that's imported too is it not?

civic charm Nov 14, 2024, 7:44 PM

#

after this: transformers.dynamic_module_utils.get_class_from_dynamic_module = backup_get_class_from_dynamic_module

somber trellis Nov 14, 2024, 7:45 PM

#

    import transformers # ipex hijacks transformers and makes it unable to load a model
    backup_get_class_from_dynamic_module = transformers.dynamic_module_utils.get_class_from_dynamic_module
    import intel_extension_for_pytorch as ipex
    ipex.llm.utils._get_class_from_dynamic_module = backup_get_class_from_dynamic_module
    transformers.dynamic_module_utils.get_class_from_dynamic_module = backup_get_class_from_dynamic_module
    from ipex_to_cuda import ipex_init
    ipex_init()
    xpu_available = True
except Exception:
    pass````

civic charm Nov 14, 2024, 7:45 PM

#

yep

somber trellis Nov 14, 2024, 7:45 PM

#

👍

#

works now

civic charm Nov 14, 2024, 8:33 PM

#

devout tangle Is it better than ipex?

It doesn't have random corruptions like ipex but it is significantly slower

somber trellis Nov 14, 2024, 9:26 PM

#

#

Now I can do big funny auto-I2V cog workflow

#

slo

#

w

earnest grotto Nov 14, 2024, 10:12 PM

#

do any of these local video models produce decent results
the online stuff has been pretty disappointing to look at
show us your results

reef ivy Nov 15, 2024, 12:23 AM

#

earnest grotto do any of these local video models produce decent results the online stuff has b...

Mochi is the best but if you expect them to compete with the paid models then no. Mochi seems pretty close though, and runs on local affordable gpus so that is something.

reef ivy Nov 15, 2024, 12:47 AM

#

civic charm It doesn't have random corruptions like ipex but it is significantly slower

are the hijacks still needed with it?

civic charm Nov 15, 2024, 12:47 AM

#

reef ivy are the hijacks still needed with it?

Yes

#

4 GB issue is a hardware issue

#

Alchemist is a 32 bit architecture

uncut bronze Nov 15, 2024, 5:22 PM

#

Is it still recomanded to use IPEX as in the original post explained or is there a better Method to get ComfyUI to run by now?

earnest grotto Nov 15, 2024, 5:27 PM

#

civic charm It doesn't have random corruptions like ipex but it is significantly slower

@uncut bronze ^

earnest grotto Nov 15, 2024, 5:28 PM

#

uncut bronze Is it still recomanded to use IPEX as in the original post explained or is there...

I've made a python script, which you can just run and it will install ComfyUI with IPEX for you, apply Disty's hijacks, and optionally download some custom nodes or some models

#

Seems pretty good?

uncut bronze Nov 15, 2024, 5:29 PM

#

It actualy does. I have Comfy running with Ipex allready though. But I would love it for a second install to try the hijacks

#

I had some issues with bf16 and the fooocus nodes and with torch audio

uncut bronze Nov 15, 2024, 5:39 PM

#

earnest grotto Seems pretty good?

Okay I have now clou how to use Disty's hijack. Do where do I have to put the Hiijak. And do I need to run different requirements for them or update pytorch or anything?

earnest grotto Nov 15, 2024, 5:41 PM

#

You git clone the hijacks repo in comfyui's comfy folder, find where intel_extension_for_pytorch is imported in model_management.py and edit that so it also does from ipex_to_cuda import ipex_init and ipex_init() right afterwards

uncut bronze Nov 15, 2024, 5:54 PM

#

Thanks, Uff, with oneAPI installation and all thats a lot more complicated 😄

earnest grotto Nov 15, 2024, 6:36 PM

#

windows or linux

reef ivy Nov 15, 2024, 6:37 PM

#

earnest grotto I've made a python script, which you can just run and it will install ComfyUI wi...

did you see this #1193952640225267802 message not sure if this should be the new edit or if it just works for that llm node.

earnest grotto Nov 15, 2024, 6:37 PM

#

oh, i saw it but forgot

#

I'll edit the script later

reef ivy Nov 15, 2024, 6:37 PM

#

Cool, seems like this will work with pytorch as well? If not using ipex

earnest grotto Nov 16, 2024, 3:35 AM

#

reef ivy Cool, seems like this will work with pytorch as well? If not using ipex

I've updated it. 2.5 might explicitly need some basekit component or whatever on windows, I'll see if there's some stuff floating around so it won't be needed like with 2.3
For now, the script will still only install 2.3

devout tangle Nov 16, 2024, 6:59 AM

#

earnest grotto I've made a python script, which you can just run and it will install ComfyUI wi...

Where can i find the script

devout tangle Nov 16, 2024, 7:18 AM

#

Do you have a script for linux

earnest grotto Nov 16, 2024, 1:41 PM

#

@devout tangle ^
Windows only for now.

devout tangle Nov 16, 2024, 6:59 PM

#

earnest grotto <@600609864871575573> ^ Windows only for now.

Ok, i got it to work, thx, i have another question though, after like 4-5 images my 50gb of ram gets fill up with cache and pc starts to hang and lag or even just unresponsive at all, how to to solve this problem

earnest grotto Nov 16, 2024, 7:42 PM

#

devout tangle Ok, i got it to work, thx, i have another question though, after like 4-5 image...

restart pc
I think this is just a windows issue, thought i thought this was fixed already

devout tangle Nov 16, 2024, 7:42 PM

#

earnest grotto restart pc I think this is just a windows issue, thought i thought this was fixe...

Its on linux

earnest grotto Nov 16, 2024, 7:42 PM

#

restart whatever is using the ram

#

what kernel

devout tangle Nov 16, 2024, 7:43 PM

#

I didnt install out of tree gpu driver, just regular that came with distro

devout tangle Nov 16, 2024, 7:43 PM

#

earnest grotto restart pc I think this is just a windows issue, thought i thought this was fixe...

Kernel 6.11.7

swift aurora Nov 16, 2024, 7:43 PM

#

anyone know a relatively painless fix for the numpy-problem?

earnest grotto Nov 16, 2024, 7:43 PM

#

swift aurora anyone know a relatively painless fix for the numpy-problem?

post the error

#

probably you want numpy==1.26.4

earnest grotto Nov 16, 2024, 7:46 PM

#

devout tangle Kernel 6.11.7

try 6.5, idk

devout tangle Nov 16, 2024, 7:46 PM

#

earnest grotto restart whatever is using the ram

I have to restart comfyui then? It takes all the ram

earnest grotto Nov 16, 2024, 7:55 PM

#

yes

devout tangle Nov 16, 2024, 8:10 PM

#

It's not convenient, it must be a big of some sort

earnest grotto Nov 16, 2024, 8:12 PM

#

it is a bug yes

#

are you launching with --lowvram, --highvram, etc.?

civic charm Nov 16, 2024, 9:04 PM

#

devout tangle Ok, i got it to work, thx, i have another question though, after like 4-5 image...

Instal tcmalloc and use start the webui like this:

LD_PRELOAD=/usr/lib/libtcmalloc.so.4 rest_of_the_command

swift aurora Nov 18, 2024, 7:47 AM

#

Suggestion; If possible, add pre-requisite part to the pinned script-post mentioning that you need to have anaconda/forge/miniconda installed. Just for clarity's sake

honest hull Nov 22, 2024, 7:04 PM

#

you have iGPU enabled?

somber trellis Nov 22, 2024, 7:05 PM

#

honest hull you have iGPU enabled?

Disabled.

#

Also, I'd like to mention that the flux toolkit loras (depth and canny) work on arc, but will not load if the main model is loaded in fp8 dtype with a seperate lora. It results in a black screen.

honest hull Nov 22, 2024, 7:06 PM

#

tried it with --bf16-unet ?

somber trellis Nov 22, 2024, 7:06 PM

#

...You are asking common-sense questions, Li.

#

Yes.

#

But it's always good to make sure.

#

🤷‍♂️

honest hull Nov 22, 2024, 7:07 PM

#

haha my bad.. been trying to help troubleshooting non-stop these days..

somber trellis Nov 22, 2024, 7:07 PM

#

The flux fill fp8 model works great

#

so inpainting and outpainting on arc is no problem

#

I assume the main flux canny and depth models (non-lora) would work but nobody has converted them to FP8 yet

#

and I don't want to install two 23 gigabyte files lmao

honest hull Nov 22, 2024, 7:09 PM

#

one thing that we could experiment is to use the gguf model

#

instead of FP8

somber trellis Nov 22, 2024, 7:09 PM

#

GGUF works fine

#

It's just slower

#

But more accurate

#

higher precision than base fp8

somber trellis Nov 22, 2024, 7:35 PM

#

So I made my own fp8 versions and they work

#

📎 mem_eff_fp8_convert.py

#

This script I found off of Kijai's stuff is nice to have.

honest hull Nov 22, 2024, 10:19 PM

#

somber trellis It's just slower

for gguf models.. you can improve the speed by adding --reserve-vram 5.0 to comfyUI launch arg

somber trellis Nov 23, 2024, 1:30 AM

#

honest hull for gguf models.. you can improve the speed by adding --reserve-vram 5.0 to comf...

Why would I do that on an a770?

#

That'd literally cripple the bandwidth I have.

#

🤷‍♂️

earnest grotto Nov 23, 2024, 3:30 AM

#

try and see

#

i know lowvram improved my speed, and running the t5 off the cpu could be even faster than shuffling it around

reef ivy Nov 23, 2024, 3:59 AM

#

It will still use more than 5gb vram if it needs to (At least it seems to sometimes), it just makes the gguf models go faster. (on a750 anyway)

earnest grotto Nov 23, 2024, 4:01 AM

#

it attempts to leave that much vram to the rest of the system, not to reserve it for comfyui itself

honest hull Nov 23, 2024, 4:46 AM

#

yeah i can even get fp16 gguf model running with decent speed when launching with reserve vram 4.0

#

without it, it keeps swapping with DRAM for smaller chunks and taking much longer time

#

for NV users they seem to have similar memory management techniques. for example on a 4060 8GB you would see only 7GB out of 8GB is being used for Flux.1 Q8 running, and can notice the increase in system RAM usage while inferencing.

earnest grotto Nov 23, 2024, 6:07 AM

#

What's up with the larger than usual amount of mysteriously vanishing messages now

reef ivy Nov 23, 2024, 3:17 PM

#

Nvidia added a cpu offload option to the drivers a while back, it can be toggled on/off in drivers also a comfyui command for i think.

honest hull Nov 23, 2024, 6:32 PM

#

earnest grotto What's up with the larger than usual amount of mysteriously vanishing messages n...

Larger than 4GB memory allocation error? that was a while back in ipex 2.0 era

somber trellis Nov 23, 2024, 8:10 PM

#

earnest grotto What's up with the larger than usual amount of mysteriously vanishing messages n...

That was me asking you about InstantIR

#

But I removed my messages since they were at like 2 AM for me

earnest grotto Nov 24, 2024, 12:59 AM

#

honest hull Larger than 4GB memory allocation error? that was a while back in ipex 2.0 era

i mean messages in this chat, and others

somber trellis Nov 24, 2024, 12:59 AM

#

yes i know i said it here lmao

#

https://jy-joy.github.io/InstantIR/

earnest grotto Nov 24, 2024, 12:59 AM

#

saw something in digital-art, went to see, nothing
the rvc thread now had a notification, but nothing

somber trellis Nov 24, 2024, 1:00 AM

#

https://github.com/smthemex/ComfyUI_InstantIR_Wrapper

#

This node errors out 🤷‍♂️ I want to use this over SUPIR since it's better overall at image restoration

#

📎 UzqbVybf.txt

earnest grotto Nov 24, 2024, 1:02 AM

#

lower the resolution, say what happens

#

use all vram-reducing options you can

somber trellis Nov 24, 2024, 1:11 AM

#

at a resolution of around 480x240 it still failed

#

🤷‍♂️

earnest grotto Nov 24, 2024, 1:12 AM

#

use the lowest possible resolution
I have no idea

somber trellis Nov 24, 2024, 1:12 AM

#

0.1 megapixel scale causes a tensor a and b mismatch due to it literally being too small

#

rip

red niche Nov 24, 2024, 1:33 AM

#

RuntimeError: Numpy is not available?

#

using numpy==1.26.4

#

@earnest grottotor

earnest grotto Nov 24, 2024, 1:40 AM

#

red niche RuntimeError: Numpy is not available?

Show the whole error, show a pip list

red niche Nov 24, 2024, 1:42 AM

#

📎 c.txt

#

earnest grotto Nov 24, 2024, 1:48 AM

#

that's a pretty old ipex

red niche Nov 24, 2024, 1:51 AM

#

intel_extension_for_pytorch @ https://github.com/Nuullll/intel-extension-for-pytorch/releases/download/v2.1.10%2Bxpu/intel_extension_for_pytorch-2.1.10+xpu-cp310-cp310-win_amd64.whl

that is what came from the requirements

earnest grotto Nov 24, 2024, 1:58 AM

#

Install using my script

#

.

red niche Nov 24, 2024, 2:01 AM

#

should I remove all previous modules?

earnest grotto Nov 24, 2024, 2:01 AM

#

Put the script in some random folder. Run it. It will create everything for you.

#

Whatever you had previously won't matter.

red niche Nov 24, 2024, 2:05 AM

#

nice script

#

feels very welcoming to use

#

for people like me

earnest grotto Nov 24, 2024, 2:06 AM

#

that's the point yes

somber trellis Nov 24, 2024, 9:03 PM

#

So I tried out GGUF models with reservevram enabled

#

They equal the speed of FP8 models

#

but are actually more accurate to fp16

#

🤷‍♂️

somber trellis Nov 25, 2024, 4:24 PM

#

ltxvideo works with comfycore base nodes, but not nodes from ltxvideo-comfyui

#

Faster than flux too btw, at 2.5S/IT.

#

sly trench Nov 25, 2024, 10:26 PM

#

I got this error when running ComfyUI. I used a-One-fan's setup file. Can anyone give me some advice? Thanks so much!

!!! Exception during processing !!! The program was built for 1 devices
Build program log for 'Intel(R) Arc(TM) 140V GPU (16GB)':
-11 (PI_ERROR_BUILD_PROGRAM_FAILURE)

earnest grotto Nov 26, 2024, 3:49 AM

#

sly trench I got this error when running ComfyUI. I used a-One-fan's setup file. Can anyone...

When installing, did it say it installed for an integrated GPU?

#

You can run it again, show what it says

reef ivy Nov 26, 2024, 6:04 AM

#

You may need to disable your IGPU when using arc for AI, not 100% sure though there might be workarounds now.

earnest grotto Nov 26, 2024, 6:07 AM

#

The issue here is pretty likely that I didn't expect Intel to have a dedicated GPU called "140V"

#

And it probably installs the igpu version of ipex

#

which is also partially why i made it spit the name of the GPU and if it decided it's dedicated or integrated back to the user when installing

reef ivy Nov 26, 2024, 6:12 AM

#

oh that's battlmag-- i mean xe2 mobile. Not sure what that runs on as far as ipex

earnest grotto Nov 26, 2024, 6:23 AM

#

wait, so it is integrated

earnest grotto Nov 26, 2024, 6:23 AM

#

sly trench I got this error when running ComfyUI. I used a-One-fan's setup file. Can anyone...

That GPU does not have VRAM.

honest hull Nov 26, 2024, 9:10 AM

#

140V is the Core Ultra series 2 integrated GPU

#

need to replace the ipex wheels with

#

conda install libuv python -m pip install numpy==1.26.4 torch==2.3.1.post0+cxx11.abi torchvision==0.18.1.post0+cxx11.abi torchaudio==2.3.1.post0+cxx11.abi intel-extension-for-pytorch==2.3.110.post0+xpu --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/lnl/us/

earnest grotto Nov 26, 2024, 9:55 AM

#

honest hull 140V is the Core Ultra series 2 integrated GPU

Do they have something special in the name to identify them, or is there a list of lnl igpus vs mtl ones, vs desktop arc

#

I get the GPU by basically looking at powershell's Get-WmiObject Win32_VideoController, which is fairly descriptive but I don't think gives info on generation and such

#

maybe it does but i'm on windows rn

sly trench Nov 26, 2024, 3:34 PM

#

honest hull 140V is the Core Ultra series 2 integrated GPU

You’re absolutely right. I’m running this on a laptop with Core Ultra 7 258V (32GB RAM) and no dGPU.

I installed IPEX 2.3.110 following Intel’s instructions. Then I ran pip install -r requirements.txt in ComfyUI directory.

Server initially reported missing modules like opencv-python, which I installed individually. After resolving those, everything worked perfectly without any errors.

I haven’t tried hijacks yet. I’m newbie and not sure what it can do.

sly trench Nov 26, 2024, 3:42 PM

#

earnest grotto When installing, did it say it installed for an integrated GPU?

a-One-fan installer detects my device as “possibly integrated GPU.”

earnest grotto Nov 26, 2024, 3:42 PM

#

I am the One fan in question

#

I can just hardcode a check for "140V" right now but i want to see if there's a better way to do it

sly trench Nov 26, 2024, 3:49 PM

#

earnest grotto I can just hardcode a check for "140V" right now but i want to see if there's a ...

Thank you. Looking forward to an update for a-One-fan to support the Core Ultra Series 2 from you.

honest hull Nov 26, 2024, 7:41 PM

#

earnest grotto Do they have something special in the name to identify them, or is there a list ...

there is nothing special in the naming.. the extra index url downloads the wheels compiled with lnl as the AOT target.. which is for the Core Ultra series 2 iGPUs

#

--extra-index-url https://pytorch-extension.intel.com/release-whl/stable/lnl/us <--- lnl/us at the end instead of xpu/us or mtl/us

#

multi-AOT for more devices should be WIP. once that's done then we can use the same wheel for different devices

#

technically it should still work if you use other AOT wheels, it would take a long time for the first image generation to compile kernels for that device.. tho

earnest grotto Nov 26, 2024, 7:44 PM

#

I mean in the name string of the GPU, at least the one that powershell command spits out, so i can identify which to download for
as i'm pretty sure that's not the only lunar lake igpu

honest hull Nov 26, 2024, 7:44 PM

#

oh

#

AFAIK,

#

MTL = "Intel(R) Arc(TM) Graphics"
LNL = "Intel(R) Arc(TM) 1**V GPU"
ACM Arc = "Intel(R) Arc(TM) A*** Graphics"

#

maybe using regex to filter "Intel(R) Arc(TM) A" and " Intel(R) Arc(TM) 1** "

earnest grotto Nov 26, 2024, 7:52 PM

#

hmm, what about something like the A60

#

2 or 3 digits? 🤔

honest hull Nov 26, 2024, 7:54 PM

#

A60 uses the same wheel as A770

#

so if name starts with Intel(R) Arc(TM) A then download the A770 wheels???

earnest grotto Nov 26, 2024, 7:55 PM

#

Hmm, I guess I'll do that, thanks

earnest grotto Nov 28, 2024, 6:47 AM

#

welp, that was odd, my SSD decided it should load a model for ~700 seconds

#

Oh well

#

@sly trench Download the script again. Should say 0.0.7p now. Run again. Should work now.

#

https://media.discordapp.net/attachments/945357545005019166/1013932684969525328/smoldanc4.gif

earnest grotto Nov 28, 2024, 9:35 AM

#

damn, from 8s/it to 14s/it, 2.5 man

earnest grotto Nov 28, 2024, 12:36 PM

#

damn... something in 2.3 causes the flux trainer i'm using to save zeroed out loras, but it works with 2.5

earnest grotto Nov 28, 2024, 1:10 PM

#

i'm turning into shadow the hedgehog here

civic charm Nov 28, 2024, 1:30 PM

#

Probably because of BF16's rounding to zero issue thanks to its lack of precision

#

And random IPEX corruptions doesn't help either

#

Cache the CLIP embeds on 2.5

#

Do not run CLIP on IPEX

sly trench Nov 28, 2024, 4:22 PM

#

earnest grotto https://media.discordapp.net/attachments/945357545005019166/1013932684969525328/...

Thank you. I will try the new script on LNL laptop tomorrow, please wait for me to report the result.
But now I tried with MTL laptop (Core Ultra 7 155H) and got this error. I also tried version 0.0.6 but still got the same error.

#

📎 can_NOT_allocate_memory_block_with_size_larger_than_4GB.txt

#

The script identified the device as a Meteor Lake iGPU.

#

And I have no dGPU

earnest grotto Nov 28, 2024, 4:33 PM

#

sly trench

How big of an image did you try to make

sly trench Nov 28, 2024, 4:35 PM

#

earnest grotto How big of an image did you try to make

512x512

earnest grotto Nov 28, 2024, 4:36 PM

#

sly trench 512x512

Show the nodes

sly trench Nov 28, 2024, 4:38 PM

#

You mean this?
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\websocket_image_save.py
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI_TiledKSampler
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI_IPAdapter_plus
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI-GGUF
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI_ExtraModels
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\rgthree-comfy
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI-KJNodes
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\comfyui_controlnet_aux
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\comfyui-inpaint-nodes
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\comfyui-tooling-nodes
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI-BrushNet
C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\ComfyUI-SUPIR

#

I'm using it as Local Server for Krita AI Diffusion.

earnest grotto Nov 28, 2024, 4:40 PM

#

sly trench You mean this? C:\Users\Lenovo\Comfy_Intel\ComfyUI\custom_nodes\websocket_image_...

This

sly trench Nov 28, 2024, 4:47 PM

#

earnest grotto This

I'm not using ComfyUI on webUI.
I'm using it as Local Server for Krita AI Diffusion plugin.
Yesterday when I ran it on my LNL laptop everything worked perfectly fine.
But I can't get it to run on the MTL laptop in any way.

#

Have you ever encountered this error? What can I do to fix it.

#

Thank you very much.

earnest grotto Nov 28, 2024, 5:02 PM

#

Yes. I will look into it in a bit, I'm on linux rn

sly trench Nov 28, 2024, 5:04 PM

#

earnest grotto Yes. I will look into it in a bit, I'm on linux rn

I'll be waiting for good news from you. Thank you again for your help 😊

earnest grotto Nov 28, 2024, 5:51 PM

#

civic charm Cache the CLIP embeds on 2.5

on some more testing, the lora is full of zeroes during training and remains so after backward-ing, and after whatever else
same embeds for 2.3 and 2.5, works with bf16 on 2.5
I remember now, the 2.5 build I had done didn't have whatever change they did with the attention to slow everything down, it was running at the same speed as 2.3

#

gonna try 2.6

earnest grotto Nov 28, 2024, 6:30 PM

#

welp, from 8 @ 2.3, to 14 @ 2.5, to 11.5 @ 2.6

#

I guess they improved a bit with 2.6

#

not quite 8 but oh well

nocturne fjord Nov 28, 2024, 7:22 PM

#

Hi, who managed to get bitsandbytes working on the arc gpu?

earnest grotto Nov 28, 2024, 7:24 PM

#

nocturne fjord Hi, who managed to get bitsandbytes working on the arc gpu?

what do you need bitsandbytes for

nocturne fjord Nov 28, 2024, 7:25 PM

#

earnest grotto what do you need bitsandbytes for

To improve training speed

#

in kohya

#

their installation instruction is not very clear https://huggingface.co/docs/bitsandbytes/main/en/installation?platform=Intel+CPU%2BGPU#multi-backend

Installation Guide

#

They ask for v2.4.0+ (ipex) but xpu version is not available

earnest grotto Nov 28, 2024, 7:38 PM

#

I think it's safe to say no one has bitsandbytes working.
they've probably just copypasted the cpu ipex because they plan to support gpus as well, but haven't gotten that to work yet?
2.5 ipex looks to be in the works. if we assume the guide is actually true anyways you will likely need to build it yourself, which is gonna take a while.
You can then get the multi-backend bnb and test, but it says some things are not supported so I'd really expect whatever is needed for training to be among them

#

There are other ways to get faster training. What python version are you using? What model are you training? Windows or Linux?

nocturne fjord Nov 28, 2024, 7:39 PM

#

I am using version 2.6.1dev+XPU

earnest grotto Nov 28, 2024, 7:40 PM

#

what model are you training?

nocturne fjord Nov 28, 2024, 7:40 PM

#

Flux dev fp8

#

512x512

earnest grotto Nov 28, 2024, 7:41 PM

#

windows? linux?

nocturne fjord Nov 28, 2024, 7:41 PM

#

windows with https://github.com/cocktailpeanut/fluxgym (fork of kohya)

GitHub

GitHub - cocktailpeanut/fluxgym: Dead simple FLUX LoRA training UI ...

Dead simple FLUX LoRA training UI with LOW VRAM support - cocktailpeanut/fluxgym

earnest grotto Nov 28, 2024, 7:42 PM

#

linux will be faster

nocturne fjord Nov 28, 2024, 7:43 PM

#

I tried without much success on Linux ubuntu 24

earnest grotto Nov 28, 2024, 7:43 PM

#

how many s/it are you getting on windows and with what gpu

nocturne fjord Nov 28, 2024, 7:43 PM

#

A770 7s/its

earnest grotto Nov 28, 2024, 7:44 PM

#

hmm, that's pretty fast

nocturne fjord Nov 28, 2024, 7:45 PM

#

In fact I am also looking to train with a higher resolution.

#

I am limited to 512x512

earnest grotto Nov 28, 2024, 7:46 PM

#

nocturne fjord I am limited to 512x512

why?

nocturne fjord Nov 28, 2024, 7:46 PM

#

I think bitsandbytes can save me some memory

#

I need to enter images of width 512x1024

#

train*

earnest grotto Nov 28, 2024, 7:48 PM

#

Ah, you're not splitting the model or fluxgym doesn't do that

#

Is that it?

nocturne fjord Nov 28, 2024, 7:49 PM

#

I tried the --split option but it requires more than 64 gb ram

earnest grotto Nov 28, 2024, 7:50 PM

#

Probably a fluxgym issue

nocturne fjord Nov 28, 2024, 7:50 PM

#

I think the best solution is to run simpletuner another script with more optimizations

#

They use Quanto

#

But I couldn't get it to work.

earnest grotto Nov 28, 2024, 7:55 PM

#

If you build a version of pytorch 2.5 with xpu support from before they did whatever they did with slowing down attention, it'd probably be faster

#

i don't have such a build, and no gaurantees, it's a gamble

earnest grotto Nov 28, 2024, 7:57 PM

#

nocturne fjord I tried the --split option but it requires more than 64 gb ram

I train the model split with a comfyUI wrapper for kohya. the newer version of it does eat ram like crazy but the older one doesn't
splitting the model will slow things down, so, if you want higher resolution you'd end up trading off speed even with 512x512

nocturne fjord Nov 28, 2024, 7:59 PM

#

Ok, I will continue to experiment and give feedback.

sharp goblet Nov 28, 2024, 9:45 PM

#

Why this happen? Conda is whitelisted in firewall, port is unused, it has admin perms, thx for the help in advance and sorry if this isn't the correct channel

earnest grotto Nov 29, 2024, 6:24 AM

#

sharp goblet Why this happen? Conda is whitelisted in firewall, port is unused, it has admin ...

You already have comfyui running

warm radish Nov 29, 2024, 9:42 AM

#

What changed with Pytorch 2.5 having XPU support? Is IPEX optional? ( I see pytorch detects an xpu device without it installed). It is unclear whether some ipex versions are tied to certain torch versions.

earnest grotto Nov 29, 2024, 9:48 AM

#

warm radish What changed with Pytorch 2.5 having XPU support? Is IPEX optional? ( I see pyto...

IPEX is not necessary
IPEX is always tied to a specific torch version
Performance is much worse (Pytorch foundation issue)
Some bugs fixed?

#

2.6 looks to improve on that performance regression

warm radish Nov 29, 2024, 9:55 AM

#

earnest grotto IPEX is not necessary IPEX is always tied to a specific torch version Performanc...

thanks. For now I am just testing on a Meteor Lake iGPU, I am new to Arc. good to know IPEX is just for perf optimization.

earnest grotto Nov 29, 2024, 9:55 AM

#

No. The performance regression with 2.5 is for every GPU.

#

Intel or otherwise

#

Except supposedly, H100s?

warm radish Nov 29, 2024, 9:56 AM

#

earnest grotto IPEX is not necessary IPEX is always tied to a specific torch version Performanc...

Where is the iGPU max memory set? Is it BIOS? I see it OOMs with >4G requests even if it says total capacity is 28G

#

torch.OutOfMemoryError: XPU out of memory. Tried to allocate 5.59 GiB. GPU 0 has a total capacity of 28.66 GiB.

earnest grotto Nov 29, 2024, 9:56 AM

#

Did you install ComfyUI with my script, which will apply disty's hijacks?

#

If yes, I have no idea why currently

warm radish Nov 29, 2024, 9:57 AM

#

no, I am just trying plain pytorch, just setting up to see the basics work at all

#

do you know if there are other discords/channels where such xpu stack related discussions are held (not necessarily in gen-ai context)?

#

I heard there is one, this was the first hit for intel-discord

earnest grotto Nov 29, 2024, 9:58 AM

#

no idea

sharp goblet Nov 29, 2024, 2:38 PM

#

earnest grotto You already have comfyui running

Nop, when I change from wifi to simdata it works, I don't know what could be causing not working on wifi

earnest grotto Nov 29, 2024, 2:56 PM

#

warm radish Where is the iGPU max memory set? Is it BIOS? I see it OOMs with >4G requests ev...

You seem like the more active mtl user here
Open ComfyUI/comfy/ipex_to_cuda/hijacks.py
Go to line 7, device_supports_fp64 = torch.xpu.has_fp64_dtype() if hasattr(torch.xpu, "has_fp64_dtype") else torch.xpu.get_device_properties("xpu").has_fp64
below it, add
print(f"\n\n\nfp64: {device_supports_fp64}\n\n\n")
run comfyui, you will see a lot of black space and in the middle of it "fp64: True" or "fp64: False", show which it is

OR

alternatively you can activate the conda environment, go into python and do what that does

#

preferably you'd do the latter as I'd probably want to try out a few other commands

civic charm Nov 29, 2024, 3:13 PM

#

There are these too:

#

If you still want to use attention slicing, use the IPEX_FORCE_ATTENTION_SLICE=1 env var

#

Xe2 should support 64 bit

#

So it shoudln't have 4GB issues and FP64 issues anymore

earnest grotto Nov 29, 2024, 3:15 PM

#

ah, this is making me realize
with the fp64 emulation that's shaping up, we're probably gonna reach a point where alchemist can do the fp64 data type but can't allocate more than 4gb?

civic charm Nov 29, 2024, 3:16 PM

#

earnest grotto ah, this is making me realize with the fp64 emulation that's shaping up, we're p...

Probably

#

Pytorch 2.5's FP64 emulation causes exactly this (manually enabled)

#

So have to use force attention slicing env var as well

earnest grotto Nov 29, 2024, 3:19 PM

#

civic charm If you still want to use attention slicing, use the `IPEX_FORCE_ATTENTION_SLICE=...

I'll probably make it set that

tribal hare Nov 30, 2024, 12:22 PM

#

I was using ComfyUI without any issues and generated several images. After stopping the server, I tried to restart it, but this happened.😭

earnest grotto Nov 30, 2024, 12:32 PM

#

tribal hare I was using ComfyUI without any issues and generated several images. After stopp...

Something with your environment is broken

earnest grotto Nov 30, 2024, 12:33 PM

#

earnest grotto You seem like the more active mtl user here Open ComfyUI/comfy/ipex_to_cuda/hija...

@sly trench
Open ComfyUI/comfy/ipex_to_cuda/hijacks.py
Go to line 7, which is device_supports_fp64 = torch.xpu.has_fp64_dtype() if hasattr(torch.xpu, "has_fp64_dtype") else torch.xpu.get_device_properties("xpu").has_fp64
make a new line below it, add
print(f"\n\n\nfp64: {device_supports_fp64}\n\n\n")
run comfyui, you will see a lot of blank space and in the middle of it "fp64: True" or "fp64: False", show which it is
then undo the new line and text you added

tribal hare Nov 30, 2024, 1:10 PM

#

earnest grotto Something with your environment is broken

The issue has been resolved by uninstalling and reinstalling PyTorch 😄 👍

somber trellis Dec 1, 2024, 1:05 AM

#

just realized the non-comfyui vanilla nodes for ltxvideo work in arc, just not with --lowvram enabled.

#

having --reservevram 4.0 does just entirely mitigate the issue

#

and using https://github.com/SeanScripts/ComfyUI-Unload-Model in case certain things doesnt unload is good too

reef ivy Dec 1, 2024, 5:01 PM

#

Comfy should go to lowvram automatically if needed I think it's just slower than enabeling it by default, maybe --reservevram speeds up the process where you don't need to enable it manually.

#

I want to update my drivers but the vram issue still isn't fixed I don't think. The reserve vram command helps in comfy but not sure there is alternatives for other applications.

reef ivy Dec 1, 2024, 7:53 PM

#

LTX in comfyui, 50 steps with a default prompt/workflow. There are some tricks to getting more movement and better video. It's super fast, 2.77s/it there abouts. There are some quants but they ran much slower for me (probably due to --reserve-vram)

#

This was img2video, generated the image with flux with the default prompt.

#

might try my darth vader and compare to cogvideox lol

#

https://github.com/sandner-art/ai-research/tree/main/LTXV-Video used motion fix workflow, lowered the resolution to 768x512(recommended by devs).

GitHub

ai-research/LTXV-Video at main · sandner-art/ai-research

Settings for AI Training. Contribute to sandner-art/ai-research development by creating an account on GitHub.

#

Tips for better output https://www.reddit.com/r/StableDiffusion/comments/1h26okm/ltxvideo_tips_for_optimal_outputs_summary/

From the StableDiffusion community on Reddit: LTX-Video Tips for Op...

Explore this post and more from the StableDiffusion community

sly trench Dec 2, 2024, 8:00 AM

#

earnest grotto <@455658122552279040> Open ComfyUI/comfy/ipex_to_cuda/hijacks.py Go to line 7, ...

Hi @earnest grotto . It showed fp64:True

#

Additionally, your 0.0.7 version working fine on LNL laptop

earnest grotto Dec 2, 2024, 8:08 AM

#

sly trench Hi <@311915623179485186> . It showed fp64:True

This is with meteor lake, right?

sly trench Dec 2, 2024, 8:12 AM

#

earnest grotto This is with meteor lake, right?

Yes. it showed fp64:True on MTL laptop

#

I have both MTL and LNL

rustic sonnet Dec 2, 2024, 12:16 PM

#

MTL has native FP64 support

earnest grotto Dec 2, 2024, 12:34 PM

#

rustic sonnet MTL has native FP64 support

datatype sure, but apparently it can't allocate more than 4gb

rustic sonnet Dec 2, 2024, 12:36 PM

#

earnest grotto datatype sure, but apparently it can't allocate more than 4gb

That's because it doesn't support int64

earnest grotto Dec 2, 2024, 12:41 PM

#

ah
seems like an odd choice of what to support

earnest grotto Dec 2, 2024, 1:05 PM

#

man, hopefully i don't get some really weird 700 second long load time again

#

nice, I didn't
wonder why my ssd decided to do that that one time

#

@sly trench Ok, I've updated the script, download it again, put it where you put it last time and run it again, it updates when you do that, faster than reinstalling. Regardless of that, it should work now

reef ivy Dec 2, 2024, 5:44 PM

#

I wonder if it would be possible to get the 4gb fix committed to comfy? Its annoying to redo it each update tbh

reef ivy Dec 2, 2024, 5:44 PM

#

rustic sonnet That's because it doesn't support int64

😓

earnest grotto Dec 2, 2024, 6:16 PM

#

reef ivy I wonder if it would be possible to get the 4gb fix committed to comfy? Its anno...

It's probably best for a workaround like that to be included on intel's side, as a part of pytorch now

#

Especially now that some fp64 emulation has made it to 2.5

reef ivy Dec 3, 2024, 4:34 AM

#

Could having multiple conda environments in windows slow stuff down even if they aren't both running.edit Seems I had to reinstall libuv/ipex files into conda env, even the quant models are faster now.

honest hull Dec 3, 2024, 3:57 PM

#

https://youtu.be/Dl81n3ib53Y?si=exgnqEZvaHYQ0iQU

YouTube

Intel Gaming

Meet the Intel Arc B-Series | Intel Gaming

Introducing the Intel Arc B-Series GPU family, offering high performance for modern gaming at 1080p and 1440p resolutions, complete with the latest AI upscaling and ray tracing capabilities.

Increase responsiveness and more with Intel Arc gaming technologies, with Intel Xe Super Sampling (XeSS) technology boosting visual quality and performan...

▶ Play video

#

https://youtu.be/cYPZye1MC6U?si=HtqTcODVubBhnuhP

YouTube

Intel Technology

Deep Dive: Running AI on Intel Arc GPUs

Ever wanted to explore the latest generative AI tech but were intimidated by how to get started? Intel makes it easy no matter what level of expertise you’re at. From beginner to enthusiast, learn how Intel enables anyone to use generative AI through Intel’s own AI Playground, from text-to-image creation to customizable chatbots.

Join Bob Duff...

▶ Play video

reef ivy Dec 3, 2024, 6:36 PM

#

honestly, very cool. I hope we will be able to import are own workflows and edit them in comfy. (seems like it from the demo).

reef ivy Dec 3, 2024, 7:13 PM

#

latest cogvideox updates broke all my workflows for it. No clue how to fix it

reef ivy Dec 3, 2024, 8:09 PM

#

finally figured out what was wrong, but now all outputs are garbage. If anybody tries out cogvideox again let me know if you can get decent output.

honest hull Dec 3, 2024, 8:26 PM

#

reef ivy honestly, very cool. I hope we will be able to import are own workflows and edi...

that’s the goal

reef ivy Dec 3, 2024, 10:02 PM

#

There is something majorly wrong with the latest ipex in windows in terms of speed, its often 5x slower but sometimes its the same speed. I realized I had installed an older ipex which was why it was fast, but its not compatible with florence 2 so I upgraded and now speed is ridiculously slow half the time. Is this issue known? I know i reported it many months ago, i could update drivers but haven't since there is a memory issue in the latest ones.

#

for instance, same prompt and settings for ltx video. last ipex version it's 2.55s/it all the time

earnest grotto Dec 3, 2024, 10:07 PM

#

reef ivy There is something majorly wrong with the latest ipex in windows in terms of spe...

ipex, or pytorch 2.5

reef ivy Dec 3, 2024, 10:07 PM

#

ipex, latest version 2.3xx

#

I am on older drivers, but it was a problem months ago as well. There is a memrory issue for a750 now in latest drivers so haven't updated.

#

also, ipex hasn't updated since then either

#

frustrating since some new stuff like the florence VLM won't work on the older version

earnest grotto Dec 3, 2024, 10:10 PM

#

there is a 2.5 ipex in progress
if you're using my script, I poked into adding 2.5 and with it 2.1.40 but gave up somewhat halfway through when I couldn't get 2.5 working on windows without the basekit, I can fully add 2.1.40 as an option

reef ivy Dec 3, 2024, 10:11 PM

#

I here pytorch is still slower, probably about what I am getting in 2.3

#

I could try it and see I guess

#

yeah, now it's back to 11s/it lol it's wierd

#

2.1.4 is fast, but it isn't compatible with all the newer stuff

#

Now it's back to 2.44s/it during the same generation? it's like it's changing randomly.

earnest grotto Dec 3, 2024, 10:17 PM

#

2.5 is slower and also has bugfixes

#

e.g. if you want stable cascade, that works on 2.5

#

doesn't on 2.3

#

2.6 is somewhat faster than 2.5, haven't tested too much

#

i think it's still slower than 2.3, just not almost-2x-slower

reef ivy Dec 3, 2024, 10:22 PM

#

I may try 2.6 out, I think ipex 2.3 is just bugged for a750 in windows. I may also try new drivers, but last time it wouldn't even run flux without oom.

#

I can get the whls off github? Don't think i've ever installed straight pytorch tbh lol

earnest grotto Dec 3, 2024, 10:24 PM

#

reef ivy I can get the whls off github? Don't think i've ever installed straight pytorch...

https://pytorch.org/docs/stable/notes/get_start_xpu.html#platform-windows
2.6 is nightlies

#

of course, since it's nightlies, it's probably possible that you get one where someone did a whoopsie and everything is completely broken

reef ivy Dec 3, 2024, 10:27 PM

#

earnest grotto of course, since it's nightlies, it's probably possible that you get one where s...

lol, yeah. Appreciate it I will give it a shot. What version is the preview build on? 2.5?

earnest grotto Dec 3, 2024, 10:27 PM

#

yeah 2.5 is the preview

reef ivy Dec 3, 2024, 10:27 PM

#

for the 4gb fix, what should I alter?

#

just take out import ipex?

earnest grotto Dec 3, 2024, 10:29 PM

#

for the time being though, you will need the basekit (or rather, a part of it?) to run them, as per the link above in that article explaining requirements
https://www.intel.com/content/www/us/en/developer/articles/tool/pytorch-prerequisites-for-intel-gpu/2-5.html
(Scroll down for windows)

Intel

PyTorch Prerequisites for Intel® GPUs

These prerequisites let you compile and build PyTorch 2.5 on Linux systems with optimizations for Intel® GPUs.

reef ivy Dec 3, 2024, 10:30 PM

#

maybe that's this

earnest grotto Dec 3, 2024, 10:30 PM

#

most likely

honest hull Dec 3, 2024, 10:55 PM

#

it could also be driver/windows issue.. I noticed that when shared GPU memory is being used by > 200MB.. it drops perf by like 4x.. but on comfy you could try adding —reserve-vram 6.0

honest hull Dec 3, 2024, 10:58 PM

#

reef ivy Now it's back to 2.44s/it during the same generation? it's like it's changing r...

if you open task manager, look at buttom of the GPU tab where it says “Shared GPU Memory”, whenever that is being used >0.2-0.3GB, perf drops..

earnest grotto Dec 3, 2024, 11:02 PM

#

reef ivy just take out import ipex?

comfyui and disty's hijacks are made to work with xpu support in 2.5
you should move init_ipex above any ipex importing, still in the try-catch block
I'd assume the transformers load issue isn't present in 2.5 since there's no ipex

civic charm Dec 3, 2024, 11:10 PM

#

civic charm Replace import ipex with this: ``` try: import transformers # ipex hijacks t...

For Florence 2

reef ivy Dec 3, 2024, 11:52 PM

#

honest hull it could also be driver/windows issue.. I noticed that when shared GPU memory is...

I have it set to 4.0 but I could try 6.0 and see if it helps.

reef ivy Dec 3, 2024, 11:52 PM

#

earnest grotto most likely

couldn't get either version of pytorch to work at all, installed one api, called environment kept getting different errors. The procedure entry point is the main one.

reef ivy Dec 4, 2024, 12:22 AM

#

honest hull it could also be driver/windows issue.. I noticed that when shared GPU memory is...

So far 6.0 seems to be more stable, it is a little slower but consistent now. Thank you

#

But ipex2.3 is a regression from 2.1.4, in windows with comfyui on a750 anyway

wicked fulcrum Dec 4, 2024, 3:59 PM

#

reef ivy There is something majorly wrong with the latest ipex in windows in terms of spe...

Im seeing something similar. When running loading and unloading models that take you to the edge of memory.
It might be a memory leak and that driver fix. Remember that driver fix that fixed SDXL in A750. It looks like that driver starts to swap memory when under max memory. This shared memory mode interrupts and slows compute.
Getting to max memory seems to happen as you load and unload models. 10-20% of memory never clears and causes a memory bottleneck

reef ivy Dec 4, 2024, 5:25 PM

#

So far, I've only experience it with 2.3 ipex, 2.1.4 doesn't seem to suffer from it. At least for Comfyui, I've found that the sweet spot seems to be --reserve-vram 5.0 this keeps most of the speed and stays consistent. At 4.0 it goes randomly slow and faster etc. At 6.0 the speed decreases more with no other benefit. I have only tested LTX video and Flux so far though. Maybe the memory fix they are working on will fix it

reef ivy Dec 4, 2024, 7:49 PM

#

Can Ipex and IpexLLM be run from the same Env? or would they conflict with eachother? going to install Ollama to run LLM's in comfy but not sure if I should run it in the same Env or not.

#

Also is there any issue with running multiple conda environments? Would resources get stuck or slow down etc.

wicked fulcrum Dec 4, 2024, 8:20 PM

#

reef ivy Can Ipex and IpexLLM be run from the same Env? or would they conflict with eacho...

Currently AI Playground is running IPEX and IPEX-LLM in the same environment. We are finding that adding in additional Frameworks and APIs (ie Llama.cpp) may require we have separate environment or at least server instance as oneAPI DLLs may be inconsistent across these projects

reef ivy Dec 4, 2024, 8:25 PM

#

Okay thanks. I was thinking that is what ai playground was doing. But wasn't sure. Also seems ipexllm wants 3.11 while ipex wants 3.10

#

Gonna try and just install ollama in its own environment abd see how it goes with comfy, probably be easier when ai playground intergrates comfy

wicked fulcrum Dec 4, 2024, 8:44 PM

#

I believe AI Playground is on 3.10.11 @honest hull can check me on that

honest hull Dec 4, 2024, 9:47 PM

#

we are on 3.11

#

https://github.com/intel/AI-Playground/blob/dev/service/requirements-arc.txt

GitHub

AI-Playground/service/requirements-arc.txt at dev · intel/AI-Playgr...

AI PC starter app for doing AI image creation, image stylizing, and chatbot on a PC powered by an Intel® Arc™ GPU. - intel/AI-Playground

lucid lily Dec 5, 2024, 12:22 PM

#

Hello everyone, I recently purchased an A770 graphics card, and I want to run comfyui on it. I followed the pinned instructions to install comfyui and run it, but many newer features are not available. I found that Intel® Extension for PyTorch* v2.3.110+xpu was released a few months ago, and the requirements.txt in the pinned tutorial is somewhat outdated. Is there an updated version of the requirements?

earnest grotto Dec 5, 2024, 1:34 PM

#

@lucid lily Use this script ^

reef ivy Dec 5, 2024, 6:59 PM

#

lucid lily Hello everyone, I recently purchased an A770 graphics card, and I want to run co...

There are multiple pins for different wayst to install, Vik's script is the ideal way, especially for an a770. The other way is still there because of issues with speed on a750 and under cards in windows. Only thing that won't work with the older ipex that I found is Florence2 from my (limited) testing.

lucid lily Dec 6, 2024, 8:42 AM

#

@earnest grottoThank you. I used the script you provided for the installation, and everything went relatively smoothly. However, after completing the installation, when I launch ComfyUI using the shortcut, I receive the following error:
Error loading "J:\Comfy_Intel\cenv\lib\site-packages\torch\lib\torch_xpu_ops_aten.dll" or one of its dependencies.
I checked and found that this file is not problematic, but I'm not sure why it failed to load successfully.

lucid lily Dec 6, 2024, 1:38 PM

#

@earnest grottoUm... I tried reinstalling from scratch, and this time it started without any errors. It's working now, very strange.😅

earnest grotto Dec 6, 2024, 1:52 PM

#

hmm

#

bet something didn't install properly

rough crystal Dec 7, 2024, 2:44 PM

#

I installed AI playground through official website. Everything's work as expected

reef ivy Dec 7, 2024, 3:59 PM

#

Can triton work on intel, and on in intel in windows? https://purz.notion.site/Get-Windows-Triton-working-for-Mochi-6a0c055e21c84cfba7f1dd628e624e97 Also, the Tencent video models apparently works on 8gb now runs faster with sageattention but I think you need triton for it.

Purz on Notion

Get Windows Triton working for Mochi | Notion

Step 1 - Modify your Visual Studio Build Tools 2022 (Or Install It)

civic charm Dec 7, 2024, 6:31 PM

#

reef ivy Can triton work on intel, and on in intel in windows? https://purz.notion.site/G...

Triton works on Intel with PyTorch 2.6

#

Triton itself doesn't work on Windows

reef ivy Dec 7, 2024, 7:49 PM

#

Seems they got it working for windows in that post? But it needs cuda stuff

reef ivy Dec 7, 2024, 11:08 PM

#

Anybody on a770 tried the hunyuan model?

somber trellis Dec 8, 2024, 1:02 AM

#

Me. I can't get it to work effectively at all, and at lower resolutions gives me tensor errors 🤷

#

LTXVideo works with PAG.

earnest grotto Dec 8, 2024, 1:21 AM

#

They don't want people in europe using their models so I'm not trying out another okish image generator 😔

somber trellis Dec 8, 2024, 1:42 AM

#

earnest grotto They don't want people in europe using their models so I'm not trying out anothe...

hunyuan isn't an ok-ish model tho

#

it's an actually competitive one

#

and it can run on a 3090

earnest grotto Dec 8, 2024, 1:42 AM

#

that's ok-ish to me

somber trellis Dec 8, 2024, 1:42 AM

#

16gb

#

of vram

#

we cant tho

#

for some reason

#

ChippySad

#

funny ah 32-bit architecture

earnest grotto Dec 8, 2024, 1:42 AM

#

i think the images the current best models like flux generate, are ok-ish

#

nevermind that each has its own drawbacks

somber trellis Dec 8, 2024, 1:43 AM

#

do you not tinker with realism or guidance models

#

there are ways to pull out details

#

like uh...

#

https://github.com/logtd/ComfyUI-Fluxtapoz

#

https://github.com/Clybius/ComfyUI-Latent-Modifiers

earnest grotto Dec 8, 2024, 1:45 AM

#

somber trellis https://github.com/logtd/ComfyUI-Fluxtapoz

this looks to be doing in effect what can be done with flux.1 tools

somber trellis Dec 8, 2024, 1:45 AM

#

well fluxtapoz has PAG for flux

#

and SEG (Smoothed Energy Guidance)

#

they can help improve prompt adherence, combined with stuff like perpnegguider

#

which is designed to help further follow prompting

earnest grotto Dec 8, 2024, 1:47 AM

#

somber trellis https://github.com/Clybius/ComfyUI-Latent-Modifiers

This isn't magic that will make things way better
rescaling cfg is already in comfy by default, think others might be too

somber trellis Dec 8, 2024, 1:48 AM

#

They are just nodes that make it easier to manipulate. More UI friendly.

#

After all, ComfyUI is down to the bones a visual scripting language of sorts.

earnest grotto Dec 8, 2024, 1:49 AM

#

somber trellis they can help improve prompt adherence, combined with stuff like perpnegguider

Using that with flux is going to kill generation times, unless you're using schnell but then you're kinda defeating your own point, schnell is worse

#

I'm kinda getting tired of bad fingers at this point honestly

somber trellis Dec 8, 2024, 1:49 AM

#

Yes, it's slower.

#

I kinda wish we just had better models.

#

LTXVideo 0.9 is great for how small it is 🤷‍♂️

earnest grotto Dec 8, 2024, 1:51 AM

#

furthermore, in stacking a lot of the almost-placebo improvements like PAG, SAG, perpneg, AYS and so on, I've sometimes had cases where the model starts failing to denoise

somber trellis Dec 8, 2024, 1:52 AM

#

I've not gotten blackscreened images from that, but for some reason I do from the lora nodes when used with flux.

#

Including LoraLoaderModelOnly.

earnest grotto Dec 8, 2024, 1:52 AM

#

Not black. Just images with some leftover noise in them

somber trellis Dec 8, 2024, 1:52 AM

#

Oh.

earnest grotto Dec 8, 2024, 1:52 AM

#

in spots

somber trellis Dec 8, 2024, 1:53 AM

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-05_02-28-25-0041.webp

#

🤷‍♂️

#

I don't seem to have that issue

#

ComfyUI-dpmpp_2m-1.0-30-2024-12-07_15-05-13-0366.webp

#

This was one I did yesturday with all of the nodes

#

this is pixelwave flux btw

earnest grotto Dec 8, 2024, 1:54 AM

#

that was with SDXL, maybe it's less likely with flux but I'm hesitant to use flux much due to how slow it is and how biased it is towards real life photos with strong depth of field

somber trellis Dec 8, 2024, 1:55 AM

#

ComfyUI-dpmpp_2m-3.0-30-2024-12-07_17-03-19-0376.webp

#

you can definitely get past that problem with flux

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-05_01-59-51-0033.webp

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-05_17-29-29-0135.webp

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-06_03-22-39-0184.webp

earnest grotto Dec 8, 2024, 1:57 AM

#

somber trellis

I'd say you don't need flux for this

somber trellis Dec 8, 2024, 1:57 AM

#

Well you don't.

#

Any model above SDXL imo really isn't a requirement anymore

earnest grotto Dec 8, 2024, 1:57 AM

#

but, i don't want abstract shapes or eldritch non-people

somber trellis Dec 8, 2024, 1:58 AM

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-05_02-54-19-0049.webp

#

🤷‍♂️

#

ComfyUI-dpmpp_2m-1.0-4-2024-12-06_17-14-32-0333.webp

#

I wonder what you look for.

#

Do you want full artistry?

earnest grotto Dec 8, 2024, 1:59 AM

#

somber trellis Any model above SDXL imo really isn't a requirement anymore

SDXL and most of its finetunes still suffer from non-zero snr-ness, and that doesn't seem to be the case for anything after it, at least not to that extent, disty said SD3.5 (don't remember which?) wasn't actually trained with zero snr

#

it was one of the big things SAI were touting with the original SD3

#

I like the SDXL finetunes. But then, there's no SD3.5 finetunes, and I don't think flux finetuning has progressed much

somber trellis Dec 8, 2024, 2:02 AM

#

nvidia sana looks nice for how fast it is

#

if only it worked properly on arc

#

because it kept giving me terrible outputs with the wip node

earnest grotto Dec 8, 2024, 2:03 AM

#

It is not supposed to work properly on anything other than nvidia because nvidia explicitly want to forbid that

#

in their license

somber trellis Dec 8, 2024, 2:03 AM

#

Well someone is doing it

#

So I gotta hope they get it fully operational

#

#

https://github.com/NVlabs/Sana?tab=readme-ov-file#to-do-list

#

https://github.com/zmwv823/ComfyUI-Sana

#

thats the node i tried

reef ivy Dec 8, 2024, 2:10 AM

#

somber trellis we cant tho

Nvidia users are getting it down to 10, supposedly even 8

earnest grotto Dec 8, 2024, 2:18 AM

#

somber trellis Do you want full artistry?

Ideally, I'd want a good 3d model generator and then I can do as I please with the models, but that's still far off
For image generators, I want

Coherent, symmetric, 5-fingered 5-toed humans (none do this)
And by extension, other coherent things too. Guns? Tanks? Most of my attempts with controlnet, like putting in the effort to pose an openpose skeleton, just made me facepalm, finetune couldn't do crossed legs properly and even with controlnet just fused them together
action (can any do 2 people fighting? I think no)
Styles and some reference sheet for them
most anime finetunes do this indirectly. PonyXL author wanted to be malicious about this. Base Flux struggles. SD3.5 looks to do styles, but I don't want to be finding out what it can or can't do
In general, I'd like to see the actual captions used for 100-1000 random varied images in these models' datasets. Surely that's not too much to ask for?
Zero snr or something close enough (this means the model can generate very dark or bright images)
A fast inpainting model that's better than SD1.5
among other things

#

SD1.5 still feels like the best for inpainting

#

I haven't tried the new Flux controlnets

#

So, it might tick that box of not failing to follow them

reef ivy Dec 8, 2024, 2:24 AM

#

earnest grotto Ideally, I'd want a good 3d model generator and then I can do as I please with t...

https://www.reddit.com/r/StableDiffusion/comments/1h7xmin/trellis_is_amazing/

From the StableDiffusion community on Reddit: Trellis is Amazing.

Explore this post and more from the StableDiffusion community

earnest grotto Dec 8, 2024, 2:25 AM

#

reef ivy https://www.reddit.com/r/StableDiffusion/comments/1h7xmin/trellis_is_amazing/

#

I tried it

#

As a reference, the 2nd image is from a playstation 2 game.

somber trellis Dec 8, 2024, 2:25 AM

#

ok instant ps2-level models

#

g

#

g

reef ivy Dec 8, 2024, 2:26 AM

#

I'd say you'll get what you want within the next year or so. Although if it stays open by then who knows

somber trellis Dec 8, 2024, 2:26 AM

#

3d modelling will obsolete itself if we can use an inpainter to Draw a base model image t-pose

earnest grotto Dec 8, 2024, 2:26 AM

#

somber trellis ok instant ps2-level models

it looks better than previous 3d model generators, i'll give it that, it is good to see those are actually improving
But yeah, it's ps2-level... ish. Can be worse.

somber trellis Dec 8, 2024, 2:27 AM

#

A lot of these will revolutionize in 2025.

earnest grotto Dec 8, 2024, 2:27 AM

#

somber trellis Dec 8, 2024, 2:27 AM

#

I feel it.

#

Let's just hope uh

#

🤷‍♂️

#

ye

reef ivy Dec 8, 2024, 2:27 AM

#

Kijai updated cog video nodes and now almost nothing works right anymore. Been messing with ltx and can get decent results sometimes.

earnest grotto Dec 8, 2024, 2:28 AM

#

I'm not gonna dunk on it too much since the hf space errored out when trying to do textures over 1024^2 or simplify less than 0.95, but...

somber trellis Dec 8, 2024, 2:28 AM

#

LTX 0.9 btw, theyre gonna release the 1.0

#

which hopefully fixes the long-prompting issue

#

which they know

reef ivy Dec 8, 2024, 2:29 AM

#

Yeah, hopefully they train human movement into that one. Also the need to add noise to make the image worse so it animates

somber trellis Dec 8, 2024, 2:29 AM

#

but ltxvideo is already OP for video-to-video

reef ivy Dec 8, 2024, 2:29 AM

#

I haven't tried video to video yet

somber trellis Dec 8, 2024, 2:29 AM

#

darn

earnest grotto Dec 8, 2024, 2:29 AM

#

reef ivy Kijai updated cog video nodes and now almost nothing works right anymore. Been ...

The video models look decent but I'm still not a big fan of the fact that they're all fundamentally limited to short clips
With that in mind, those recent ai generated ads from coca cola and such get even more obvious

somber trellis Dec 8, 2024, 2:29 AM

#

thats videotovideo via ltxtricks

#

https://github.com/logtd/ComfyUI-LTXTricks

earnest grotto Dec 8, 2024, 2:30 AM

#

why hand not green >:(

somber trellis Dec 8, 2024, 2:30 AM

#

i dunno but its pretty close

reef ivy Dec 8, 2024, 2:30 AM

#

probably the denoise value

somber trellis Dec 8, 2024, 2:31 AM

#

the other two nodes in ltxtricks

#

are cool

#

one of them essentially does what an application you might know in the past did

reef ivy Dec 8, 2024, 2:31 AM

#

image+video to video looks really good from the example. Although it was pretty much a perfect one to one drawing

somber trellis Dec 8, 2024, 2:31 AM

#

a youtuber got famous for it

#

lol i cant remember that software

#

https://www.youtube.com/watch?v=SY3y6zNTiLs&themeRefresh=1

#

Ah, I remember.

#

Ebsynth.

#

https://www.youtube.com/watch?v=0RLtHuu5jV4

#

from 2019 lol

earnest grotto Dec 8, 2024, 2:36 AM

#

somber trellis https://www.youtube.com/watch?v=0RLtHuu5jV4

ah, i don't know that
or is that the thing that joel haver uses https://www.youtube.com/watch?v=c6MW-qdNoYA

YouTube

Joel Haver

Elden Ring from the NPC's Perspective

ELDEN RING out now! - https://www.bandainamcoent.com/games/elden-ring

Featuring @comedianalecrobbins @cerspence and Calvin LaVallee - https://www.youtube.com/channel/UC_DudUFOztlAHj5JlHXbuDQ

More Animations - https://www.youtube.com/playlist?list=PLKtIcOP0WvJDZemPYZZQSqotCgpps5DbX

Subscribe for weekly short films.

Support -
Patreon: https:...

▶ Play video

somber trellis Dec 8, 2024, 2:36 AM

#

it is the thing joel haver uses

#

its rotoscoping + ebsynth

#

he says so in the comments of the vid i posted

#

reef ivy Dec 8, 2024, 2:38 AM

#

That was an amazing video tbh, I only know of ebsynth from a1111 extension, never tried it though. I think animatediff replaced it

reef ivy Dec 8, 2024, 2:38 AM

#

somber trellis

if he posted that today, he'd get flamed to all hell lol.

somber trellis Dec 8, 2024, 2:39 AM

#

aaron

#

use the pag node from ltxtricks

#

it slows it down a bit but greatly helps consistency

#

Todays world sucks

#

reef ivy Dec 8, 2024, 2:43 AM

#

The stg nodes? Yeah, I use them. I was able get them to work with img2video as well. They help consitency but also slow down movement (at least wtih img2video). Longer steps also slow down movement, not sure if that is only with stg nodes though.

#

People use the Detail Daemon nodes as well, but for img2video I could not get them to work properly.

somber trellis Dec 8, 2024, 3:09 AM

#

ltxvideo can give some pretty interesting and atmospheric outputs tho

reef ivy Dec 8, 2024, 3:11 AM

#

I need to try some more txt2video stuff. Need to setup my llm to prompt for it though.

#

Running florence2 to ollama to ltx feels really cool tbh. The vram issue pops up sometimes with ipex 2.3 though

somber trellis Dec 8, 2024, 3:18 AM

#

reef ivy Running florence2 to ollama to ltx feels really cool tbh. The vram issue pops u...

I use wd-14 tagger's largest model

#

with underscores removed

#

fed that into api llm

#

thats what I use to img2img usually, that and combine it with flux depth/redux

reef ivy Dec 8, 2024, 3:38 AM

#

I need to check that out.

#

On another note cogvideox fun models just don't seem to work anymore with the new workflow for img2img. Either just black output or misty noise that vaguely looks like the image. Gguf still doesn't work for cogvideo either

somber trellis Dec 8, 2024, 3:45 AM

#

reef ivy Dec 9, 2024, 6:43 AM

#

https://www.reddit.com/r/StableDiffusion/comments/1h9d9xy/svdquant_now_has_comfyui_support/ new 4bit quant support in comfy apparently, not sure if it runs on intel or not

From the StableDiffusion community on Reddit

Explore this post and more from the StableDiffusion community

torpid moat Dec 9, 2024, 11:34 AM

#

has anyone tried to install sd3.5 large and succeeded, if so i need help

earnest grotto Dec 9, 2024, 12:07 PM

#

torpid moat has anyone tried to install sd3.5 large and succeeded, if so i need help

https://dontasktoask.com

Don't ask to ask, just ask

formal tusk Dec 9, 2024, 12:14 PM

#

Hi Bob and Community. 1st: Thanks to INTEL to all the effort they take to establish themself as an additional GPU developer and vendor; I also owned an ARC A770. So my Q directly to Bob:
Why does INTEL spent that less effort on also get all those nice tools and running smoothly on Linux, since all those big models are trained on it on big fives side?
It is a huge amount to take to get this GPU also running ComfyUI since there are much less instructions and hints to find to set it up on Debian as there are,e.g, for Windows.
And, please, no offense, but also e.g. the python version in those tools from INTEL is still on V3.10 and so some kind of outdated related to latest Debian dists.
So, u always have to look for some elder,e.g, pip wheels and stuff to get downloaded to get it nearly some kind of running/working.
Will there be a bit better instruction to get this running?
Regards.

reef ivy Dec 9, 2024, 3:42 PM

#

torpid moat has anyone tried to install sd3.5 large and succeeded, if so i need help

Depends on what you count as success. I can get the turbo model to work right, but the dev model outputs trash half the time. Apparently something to do with Ipex, should work if you use pytorch (but obviously that is still slower for now).

reef ivy Dec 9, 2024, 3:45 PM

#

formal tusk Hi Bob and Community. 1st: Thanks to INTEL to all the effort they take to establ...

You can use pytorch 2.5 or 2.6 natively now, but there is a speed issue that hasn't been fixed yet. (not just with intel). They probably haven't updated ipex since it is included natively now.

earnest grotto Dec 9, 2024, 9:18 PM

#

formal tusk Hi Bob and Community. 1st: Thanks to INTEL to all the effort they take to establ...

Many things do not work with the latest python version, including some webuis themselves, this is normal
It's assumed that if you're on linux, you can figure things out yourself, otherwise you wouldn't be using linux
You can get older python from the deadsnakes ppa, or use conda and make a conda environment with an older python
https://intel.github.io/intel-extension-for-pytorch/index.html#installation?platform=gpu&version=v2.3.110%2Bxpu&os=linux%2Fwsl2&package=conda
You'll likely want 2.3 for the best performance ^
Note that the pip instructions here are broken, at the very least there is no conda prefix with pip
OR, you can get 2.6 for more compatibility, less bugs https://pytorch.org/docs/stable/notes/get_start_xpu.html (nightly)

Welcome to Intel® Extension for PyTorch* Documentation!

This website introduces Intel® Extension for PyTorch*

civic charm Dec 9, 2024, 11:11 PM

#

Everything works fine with Python 3.11, it is just that the UIs are very stubborn in staying with Python 3.10

#

The only issue is Python 3.12

#

Not many things support 3.12 yet

honest hull Dec 10, 2024, 7:27 AM

#

many projects still use numpy < 2.0 too.. above pip installs would install latest numpy and it might also break projects

earnest grotto Dec 10, 2024, 11:57 AM

#

@formal tusk If you're still struggling with getting ComfyUI running, here's a linux version of the pinned script https://raw.githubusercontent.com/a-One-Fan/ComfyUI-Intel-Installer-Script/refs/heads/other_one/Setup_ComfyUI_Intel.py
Though I can't guarantee that it will just work

sly trench Dec 12, 2024, 5:27 AM

#

earnest grotto <@455658122552279040> Ok, I've updated the script, download it again, put it whe...

Hi @earnest grotto . I've reinstalled your 0.0.8 version on my MTL laptop.
But it occur this error. Any ideas?

📎 ComfyUI_MTL_error.txt

quartz kelp Dec 13, 2024, 4:37 PM

#

Now that the B580 is released I have a question about it, does it also have a 4GB memory block allocation limit?

reef ivy Dec 13, 2024, 5:05 PM

#

I don't think so, but someone will have to test. Xe2 shouldn't need it

civic charm Dec 13, 2024, 5:54 PM

#

IPEX 2.5 is listed on the intel repo

#

Seems like the US mirror is giving access denied error on every ipex version

#

CN mirror works but very slow

reef ivy Dec 13, 2024, 6:13 PM

#

nice, will try it out and see if it works. Maybe coincide with b580 launch

civic charm Dec 13, 2024, 6:15 PM

#

civic charm CN mirror works but **very** slow

This is the speed i am getting from the CN mirror rn (I have 1000 Mbps download)

reef ivy Dec 13, 2024, 6:22 PM

#

Haven't seen DSL in a long time lol

civic charm Dec 13, 2024, 6:37 PM

#

Now it is giving me the expected speeds to China

#

Probably was a caching issue

civic charm Dec 13, 2024, 7:25 PM

#

IPEX 2.5 is a little bit slower but completely acceptable for the accuracy improvements

reef ivy Dec 13, 2024, 7:29 PM

#

How much slower? on windows 2.3 is already slower than 2.1.4 for me, so it might be faster lol

civic charm Dec 13, 2024, 7:29 PM

#

I am using a custom model arch

#

Went from 2.6 s/it to 2.8 s/it

#

But now i am able to run CLIP on the GPU without corruptions

reef ivy Dec 13, 2024, 7:30 PM

#

Yeah, that's not bad at all. I am downloading now. I guess there is no need for a oneapi update?

#

Nice, so stable 3.5 should work now

civic charm Dec 13, 2024, 7:30 PM

#

Linux installs those from pip after ipex 2.3

#

idk what happens on the windows side

reef ivy Dec 13, 2024, 7:31 PM

#

I couldn't get it to pip install, so made a requirements file. Might not work then. Windows typically have to install oneapi afaik.

#

so far seems way slower in windows, but I do have a new driver update waiting which could improve things. Also issues with memory in latest drivers so could be faster on older one lol.

#

flux from about 5or6/sit's at 1024 to 12.9s/it and ltx from 2s/it to 82s/it lol.

#

gonna update the drivers and try again, then if that is worse still might try older driver, then go back to 2.3 or 2.1 if it's still terrible

earnest grotto Dec 13, 2024, 7:41 PM

#

requirements files are just stuff to install with pip, laid out in a file so you don't have to type them out
source control is more convenient, whatever else

reef ivy Dec 13, 2024, 7:42 PM

#

I tried to copy the pip install from the other one and input the new links but it wouldn't work for me, might have neede the entire url for each file like I did in requirements file though.

reef ivy Dec 13, 2024, 9:53 PM

#

ipex 2.5 is unusably slow in windows, at least with comfy ui

civic charm Dec 13, 2024, 10:05 PM

#

are you using ipex 2.5 or pytorch 2.5?

reef ivy Dec 13, 2024, 10:18 PM

#

ipex

#

My guess it's a compounding issue from 2.3 and current drivers with a750 memory allocation.

#

2.3 is slower than 2.1.4 but it can be mitigated with reserve-vram 6.0, nothing seems to help with ipex 2.5

#

can't view vram usage anymore wtih the new arc control thing, so maybe it's running on cpu or something? seemed too fast for that with flux though.

honest hull Dec 13, 2024, 11:41 PM

#

reef ivy can't view vram usage anymore wtih the new arc control thing, so maybe it's runn...

use xpu-smi to view vram usages

#

https://github.com/intel/xpumanager

GitHub

GitHub - intel/xpumanager

Contribute to intel/xpumanager development by creating an account on GitHub.

reef ivy Dec 13, 2024, 11:41 PM

#

I am on windows😭

honest hull Dec 13, 2024, 11:42 PM

#

works on windows 😉

reef ivy Dec 13, 2024, 11:42 PM

#

Ohh didn't know that, thanks!!

honest hull Dec 13, 2024, 11:43 PM

#

xpu-smi.exe dump to view the metrics available.. most of them are avaiable but some might report N/A as the tool is developed for Data Center GPUs

#

xpu-smi.exe dump -m 0,18 -d 0 should show you the GPU utilization as well as memory used in MB unit

honest hull Dec 13, 2024, 11:59 PM

#

civic charm Linux installs those from pip after ipex 2.3

same on windows.. starting with ipex 2.3, installation of the oneAPI base toolkit is no longer needed

#

when you pip install ipex it also installs the oneAPI dependencies. for ipex 2.5.110 it should be installing dpcpp-cpp-rt==2025.0.4 mkl-dpcpp==2025.0.4 etc

reef ivy Dec 14, 2024, 12:23 AM

#

It worked it was just like 100% slower in everything for me in windows. Probably a750 related

#

2.1.4 is the most stable and fastest but it is no longer compatible with alot of new stuff

somber trellis Dec 14, 2024, 2:24 AM

#

still on 2.3 myself

#

trying to get comfy_extramodels sana to work

#

#

I don't seem to be getting the greatest results.

#

Maybe there's a problem.

#

I've tried it at multiple CFGs, on euler simple

#

reef ivy Dec 14, 2024, 3:43 AM

#

Haven't tried that, but It's geared specifically for nvidia all together so maybe that's why. could also be the clip issue that 2.5 might fix

primal hatch Dec 14, 2024, 6:58 AM

#

Someone please give me the bat file for installing comfy ui

earnest grotto Dec 14, 2024, 7:05 AM

#

@primal hatch

silk umbra Dec 14, 2024, 12:49 PM

#

the script dosen't work with the b580

earnest grotto Dec 14, 2024, 12:51 PM

#

civic charm This is the speed i am getting from the CN mirror rn (I have 1000 Mbps download)

The official guide for 2.3 also seems to have switched to china? https://pytorch-extension.intel.com/installation?platform=gpu&version=v2.3.110%2Bxpu&os=linux%2Fwsl2&package=pip

silk umbra Dec 14, 2024, 12:52 PM

#

in fact i can't get comfyui to work on my b580 at all

#

is it too new?

earnest grotto Dec 14, 2024, 12:52 PM

#

silk umbra the script dosen't work with the b580

I don't know if installing ipex for 2.3/pytorch 2.6 manually would work on battlemage at all at the moment

#

it might indeed be too new

silk umbra Dec 14, 2024, 12:52 PM

#

ahhh so wait?

earnest grotto Dec 14, 2024, 12:53 PM

#

I'll edit the script to install the same thing for battlemage as it does for alchemist, but you'll have to find out if that works or not, and I lean towards no

silk umbra Dec 14, 2024, 12:53 PM

#

yeah

earnest grotto Dec 14, 2024, 12:54 PM

#

Also I guess I'll have to check if it can be downloaded from the us at all anymore or they've just moved to china since my script downloads from the US

#

I'll be uh, waiting on a windows update in the meantime though

#

So don't expect that in 10 minutes

silk umbra Dec 14, 2024, 12:55 PM

#

okie

civic charm Dec 14, 2024, 12:58 PM

#

earnest grotto The official guide for 2.3 also seems to have switched to china? <https://pytorc...

Saw that too

#

I hope US will be back up soon

#

CN connections are slow even if it works properly

silk umbra Dec 14, 2024, 1:30 PM

#

@earnest grottowait the ps1 file works

earnest grotto Dec 14, 2024, 1:30 PM

#

That's from when I decided to be very generous with detecting the GPU

#

That will install alchemist stuff

silk umbra Dec 14, 2024, 1:31 PM

#

ohhh.....

silk umbra Dec 14, 2024, 1:31 PM

#

earnest grotto That will install alchemist stuff

is there even new stuff for battlemage?

earnest grotto Dec 14, 2024, 1:32 PM

#

I don't know

#

Discord crashed due to windows updating

#

???

#

I assume that battlemage support will need to be added to pytorch, and IPEX, and I assume that it will not be retroactively added for 2.3 which is not as laggy as 2.5 or 2.6

silk umbra Dec 14, 2024, 1:39 PM

#

#

yeah the ps1 script didn't bring me much further

earnest grotto Dec 14, 2024, 1:43 PM

#

I don't think this is a battlemage issue

#

I'll check it out

silk umbra Dec 14, 2024, 1:44 PM

#

okie

earnest grotto Dec 14, 2024, 5:05 PM

#

@silk umbra @sly trench Updated

silk umbra Dec 14, 2024, 5:06 PM

#

nice

earnest grotto Dec 14, 2024, 5:09 PM

#

silk umbra nice

You can just download the new version and replace the old one with it. Run it again, it will not have the same error again but I'd still expect it to not work

silk umbra Dec 14, 2024, 5:10 PM

#

ye did it

earnest grotto Dec 14, 2024, 5:13 PM

#

If it doesn't work, you wanna follow some of my directions instead after that?

silk umbra Dec 14, 2024, 5:14 PM

#

@earnest grotto sure

#

WAIT

earnest grotto Dec 14, 2024, 5:14 PM

#

It's just loading

silk umbra Dec 14, 2024, 5:14 PM

#

IT WORKS

earnest grotto Dec 14, 2024, 5:14 PM

#

silk umbra IT WORKS

Did you generate an image?

silk umbra Dec 14, 2024, 5:14 PM

#

#

no not yet

#

i usually use comfy with swarmui

earnest grotto Dec 14, 2024, 5:15 PM

#

well then you don't know if it works

#

so generate one

silk umbra Dec 14, 2024, 5:16 PM

#

ok downloading flux rn

somber trellis Dec 14, 2024, 5:31 PM

#

Where do you get the whl for ipex 2.5.10?

#

Actually, should I even bother going from pytorch 2.3

tiny bolt Dec 14, 2024, 5:40 PM

#

not officially released

#

https://pytorch-extension.intel.com/release-whl/stable/xpu/cn/

#

find it amusing how these people discover it so quick

earnest grotto Dec 14, 2024, 5:41 PM

#

well, since the us one is down, it's probably worth taking a look into the cn one to see if there's anything new

earnest grotto Dec 14, 2024, 5:41 PM

#

sly trench Hi <@311915623179485186> . I've reinstalled your 0.0.8 version on my MTL laptop....

and it has been down for a while now

tiny bolt Dec 14, 2024, 5:42 PM

#

https://pytorch-extension.intel.com/release-whl/stable/xpu/us/intel-extension-for-pytorch/

#

it works for me tho

#

ah okay i see what you mean

earnest grotto Dec 14, 2024, 5:43 PM

#

cn might've sped up since disty posted though

#

it was slow but it wasn't too slow

somber trellis Dec 14, 2024, 5:47 PM

#

#

If it takes me half a minute to download it

#

i think its ok

somber trellis Dec 14, 2024, 6:19 PM

#

but it doesnt exactly work

#

🤷‍♂️

#

lol

silk umbra Dec 14, 2024, 6:39 PM

#

earnest grotto so generate one

i can't get it to gen

earnest grotto Dec 14, 2024, 6:40 PM

#

silk umbra i can't get it to gen

What does the command prompt say

silk umbra Dec 14, 2024, 6:40 PM

#

earnest grotto What does the command prompt say

Failed to validate prompt for output 9:

CheckpointLoaderSimple 4:
- Value not in list: ckpt_name: 'Flux\flux1-schnell-fp8.safetensors' not in []
  Output will be ignored
  invalid prompt: {'type': 'prompt_outputs_failed_validation', 'message': 'Prompt outputs failed validation', 'details': '', 'extra_info': {}}

earnest grotto Dec 14, 2024, 6:40 PM

#

Show your workflow

#

And please post screenshots, not copy text

silk umbra Dec 14, 2024, 6:41 PM

#

im using swarmui

earnest grotto Dec 14, 2024, 6:41 PM

#

Do it in comfyui then

silk umbra Dec 14, 2024, 6:41 PM

#

do you have a workflow i can try

#

idk comfy very well

earnest grotto Dec 14, 2024, 6:42 PM

#

Bottom of this post

#

Download the image. Drag-and-drop it onto comfyui in the browser

silk umbra Dec 14, 2024, 6:42 PM

#

oh ok

earnest grotto Dec 14, 2024, 6:42 PM

#

Did you download flux with my script or one of the various all-in-one variants

silk umbra Dec 14, 2024, 6:45 PM

#

earnest grotto Did you download flux with my script or one of the various all-in-one variants

off the web

earnest grotto Dec 14, 2024, 6:46 PM

#

silk umbra off the web

https://comfyanonymous.github.io/ComfyUI_examples/flux/

ComfyUI_examples

Flux Examples

Examples of ComfyUI workflows

#

Use the workflows from that then

silk umbra Dec 14, 2024, 6:58 PM

#

rip

#

@earnest grotto

earnest grotto Dec 14, 2024, 7:03 PM

#

Yeah that's what was expected

#

Your swarmui issue is unrelated

#

I'd bet in a week's time support will be here

silk umbra Dec 14, 2024, 7:08 PM

#

okie

reef ivy Dec 14, 2024, 7:36 PM

#

somber trellis but it doesnt exactly work

you download all the packages? Torch, Torchaudio/vision etc? It 100% worked for me in windows, however it was so slow it was pretty much useless.

#

Feel like speed and stability has regressed from each version after 2.1.4 tbh

#

could also be a skill issue on my part lol

somber trellis Dec 14, 2024, 7:38 PM

#

reef ivy you download all the packages? Torch, Torchaudio/vision etc? It 100% worked for...

It isn't just the torch+xpu packages I tried

#

I wanted to use it in conjunction with ipex 2.5.10

#

but idek if that works

#

also uh

#

phi-4 released

#

reef ivy Dec 14, 2024, 7:39 PM

#

Yeah, I did it with ipex 2.5. I just found the links for each and copied them to a requirements.txt file. Ipex likely still needs the special version of each

somber trellis Dec 14, 2024, 7:39 PM

#

reef ivy Dec 14, 2024, 7:40 PM

#

#

I wonder how true those benchmark results are tbh.

#

from user feedback seems on par with qwen2.5 14b

somber trellis Dec 14, 2024, 10:24 PM

#

when you realize you just had to restart your pc when properly re-installing ipex-llm ollama

#

now it workin like a charm with phi-4

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_18-53-20-0085.webp

reef ivy Dec 14, 2024, 10:41 PM

#

Will they release a 7b model?

somber trellis Dec 14, 2024, 10:43 PM

#

🤷‍♂️ No clue yet. This model's brand new.

reef ivy Dec 15, 2024, 12:47 AM

#

There is a new video2audio model, but apparently only works on 2.5.1 pytorch

#

https://github.com/kijai/ComfyUI-MMAudio

GitHub

GitHub - kijai/ComfyUI-MMAudio

Contribute to kijai/ComfyUI-MMAudio development by creating an account on GitHub.

somber trellis Dec 15, 2024, 2:06 AM

#

reef ivy Yeah, I did it with ipex 2.5. I just found the links for each and copied them t...

What version of torch+xpu is compatible with ipex 2.5.10?

#

If there was any real reason to swap to it, that is...

reef ivy Dec 15, 2024, 3:39 AM

#

somber trellis What version of torch+xpu is compatible with ipex 2.5.10?

I used the one for windows here https://pytorch-extension.intel.com/release-whl/stable/xpu/cn/torch/ For me no reason to swap because it was tooooo slow. Although you may need it for new stuff like that video2audio model I posted. Not sure if the ipex 2.5 is better than the regular torch2.5 that already has xpu natively though. I could never get the regular torch to work for me

somber trellis Dec 15, 2024, 4:20 AM

#

Yep, far slower indeed.

#

Ipex didn't seem to help much either, I guess everything's still in its testing phase

#

imma keep with 2.3

#

nothing seems better atm

#

ComfyUI-dpmpp_2m-3.5-30-2024-12-14_23-16-01-0105.webp

wicked fulcrum Dec 15, 2024, 6:13 AM

#

Sneak peek coming to AI Playground 2.0 with ComfyUI workflows and support for Flux.1

civic charm Dec 15, 2024, 10:35 AM

#

somber trellis What version of torch+xpu is compatible with ipex 2.5.10?

None

#

Use the torch from ipex

primal hatch Dec 15, 2024, 4:48 PM

#

wicked fulcrum Sneak peek coming to AI Playground 2.0 with ComfyUI workflows and support for Fl...

waiting for this 😍😍

reef ivy Dec 15, 2024, 5:33 PM

#

Intel should make some comfy nodes for llms, then I won't have to run it through ollama if using it in a workflow with ai playground

somber trellis Dec 15, 2024, 5:40 PM

#

reef ivy Intel should make some comfy nodes for llms, then I won't have to run it through...

could just use ipex llm ollama to run it with sycl

#

ok well 2.5.1 works on the a770 le no problem it's just half speed like aaron said

#

rip

earnest grotto Dec 15, 2024, 5:46 PM

#

pytorch 2.6 is faster while still having fixes that came with 2.5

#

if that is what you're looking for

#

otherwise can just stick with 2.3 for now

reef ivy Dec 15, 2024, 7:39 PM

#

somber trellis could just use ipex llm ollama to run it with sycl

I do but i'd like to run it all in the same env without needing to load up a server. It also causes some issues with vram for me as ipex2.3 has some speed and memory issues with a750 sometimes.

reef ivy Dec 15, 2024, 7:40 PM

#

earnest grotto pytorch 2.6 is faster while still having fixes that came with 2.5

I could not for the life of me get it to work with comfyui in windows. I called the oneapi files and it loaded but then errored out. Only ipex seems to work for me.

#

I may start up another wsl2 enviornment and see if I can get more speed in windows.

honest hull Dec 15, 2024, 8:36 PM

#

reef ivy I could not for the life of me get it to work with comfyui in windows. I called...

you shouldn't need to call oneapi vars for upstream Pytorch 2.6

#

just pip install torch torchvision with the extra url that points to pytorch.org

#

pip install --pre torch torchvision torchaudio --index-url https://download.pytorch.org/whl/nightly/xpu

reef ivy Dec 15, 2024, 9:08 PM

#

I will give it one more try with this install command, thanks.

honest hull Dec 15, 2024, 9:24 PM

#

reef ivy I will give it one more try with this install command, thanks.

would recommend testing it with a simple standalone script first and a fresh python env..

for example

create env and pip install torch torchvision using above link
pip install diffusers transformers accelerate
run below code snippet

from diffusers import AutoPipelineForText2Image, DEISMultistepScheduler
import torch

pipe = AutoPipelineForText2Image.from_pretrained('lykon/dreamshaper-8', torch_dtype=torch.float16, variant="fp16")
pipe.scheduler = DEISMultistepScheduler.from_config(pipe.scheduler.config)
pipe = pipe.to("xpu")

prompt = "portrait photo of muscular bearded guy in a worn mech suit, light bokeh, intricate, steel metal, elegant, sharp focus, soft lighting, vibrant colors"

generator = torch.manual_seed(33)

image = pipe(prompt, generator=generator, num_inference_steps=25).images[0]  
image.save("./image.png")

honest hull Dec 16, 2024, 12:30 AM

#

also add this to ComfyUI/comfy/model_management.py if you are using pytorch 2.6 nightly. it improves performance by about 1.5x .. The perf issue with upstream Pytorch seemed to be not yet fixed in 2.6 nightly either (happens on other vendors too)

reef ivy Dec 16, 2024, 1:54 AM

#

do the ipex hijacks still work in windows or am i doing something wrong? Get this error with latent upscale instead of oom Current platform can NOT allocate memory block with size larger than 4GB! Tried to allocate 4.25 GiB (GPU 0; 7.75 GiB total capacity; 4.19 GiB already allocated; 4.35 GiB reserved in total by PyTorch)

#

this is what I have now with the florence fix as well try: import transformers # ipex hijacks transformers and makes it unable to load a model backup_get_class_from_dynamic_module = transformers.dynamic_module_utils.get_class_from_dynamic_module import intel_extension_for_pytorch as ipex ipex.llm.utils._get_class_from_dynamic_module = backup_get_class_from_dynamic_module transformers.dynamic_module_utils.get_class_from_dynamic_module = backup_get_class_from_dynamic_module from ipex_to_cuda import ipex_init ipex_init() xpu_available = True except Exception: pass

civic charm Dec 16, 2024, 10:31 AM

#

reef ivy do the ipex hijacks still work in windows or am i doing something wrong? Get th...

If you are using ipex 2.5, update the hijacks too

lucid lily Dec 16, 2024, 12:18 PM

#

Some nodes of ComfyUI need fp64, but my arc a770 don't have it. Any solutions?

earnest grotto Dec 16, 2024, 12:28 PM

#

@lucid lily ^

#

Or you edit model_management.py as shown right above you

#

and you clone disty's hijacks into the comfy folder

#

https://github.com/Disty0/ipex_to_cuda

lucid lily Dec 16, 2024, 1:32 PM

#

add here?

earnest grotto Dec 16, 2024, 1:35 PM

#

You find the try-catch block that does import intel_extension_for_pytorch as ipex and you replace it with what you see above
I think the except might've had something else that isn't pass but not sure

lucid lily Dec 16, 2024, 1:44 PM

#

OK, I got it

#

still report same error😅

earnest grotto Dec 16, 2024, 2:05 PM

#

@lucid lily Please link the exact custom node and copypaste the full stack trace, and link any models if they need to be seperately downloaded

somber trellis Dec 16, 2024, 2:10 PM

#

honest hull also add this to ComfyUI/comfy/model_management.py if you are using pytorch 2.6 ...

I assume we just add the command disty mentions there to model_management.py?

earnest grotto Dec 16, 2024, 2:11 PM

#

probably after ipex_init()?

somber trellis Dec 16, 2024, 2:11 PM

#

well considering youre not using IPEX in 2.6.0

#

earnest grotto Dec 16, 2024, 2:11 PM

#

I want to know too, seems odd if that's for a different backend

somber trellis Dec 16, 2024, 2:12 PM

#

wouldnt you just import ipex_init() and thats it

#

then that command ig

#

    from ipex_to_cuda import ipex_init
    ipex_init()
    torch.backends.cuda.allow_fp16_bf16_reduction_math_sdp(True)
    _ = torch.xpu.device_count()
    xpu_available = torch.xpu.is_available()
except Exception:
    pass```

lucid lily Dec 16, 2024, 2:14 PM

#

@earnest grottoany instructions?🥲

earnest grotto Dec 16, 2024, 2:14 PM

#

earnest grotto <@1074965956230656111> Please link the exact custom node and copypaste the full ...

These are the instructions

#

Do this

lucid lily Dec 16, 2024, 2:17 PM

#

📎 message.txt

somber trellis Dec 16, 2024, 2:25 PM

#

well im on 2.6.0

#

i wanted to see if 2.6.0 would fix my issue with sana in comfyui outputting improper outputs

#

#

https://github.com/user-attachments/files/18027854/SanaV1.json

#

#

Same workflow on arc

#

on both 2.3.0 and 2.6.0

#

also torch 2.6 with that command makes the performance equal to 2.3

lucid lily Dec 16, 2024, 2:43 PM

#

@somber trelliscan you run any custom_node like semantic segmentation which use fp64 when using cuda with ARC?

earnest grotto Dec 16, 2024, 2:44 PM

#

@somber trellis you have some spare nvidia gpu? you wanna run some stuff for me with nvdiffrast, after some time?

earnest grotto Dec 16, 2024, 2:44 PM

#

lucid lily <@204342691964780546>can you run any custom_node like semantic segmentation whic...

No he can't.

#

Support for fp64 will come. It's already in an experimental state in 2.5/2.6
If you want things fixed now, do what I said

#

And I'll look into the custom node and I can put patching it into my installer script or tell you what to edit or whatever else

silk umbra Dec 16, 2024, 2:47 PM

#

wait did you guys get stable diffusion working on arc b580

lucid lily Dec 16, 2024, 2:50 PM

#

@earnest grottoright here

📎 message.txt

earnest grotto Dec 16, 2024, 2:51 PM

#

lucid lily

Yes I already saw it

earnest grotto Dec 16, 2024, 2:51 PM

#

earnest grotto <@1074965956230656111> Please link the exact custom node and copypaste the full ...

.

#

Link the custom node

#

Link any models if they need to be seperately downloaded

lucid lily Dec 16, 2024, 2:53 PM

#

https://github.com/Fannovel16/comfyui_controlnet_aux

GitHub

GitHub - Fannovel16/comfyui_controlnet_aux: ComfyUI's ControlNet Au...

ComfyUI's ControlNet Auxiliary Preprocessors. Contribute to Fannovel16/comfyui_controlnet_aux development by creating an account on GitHub.

#

Did I misunderstood?😅

#

I found any nodes which utilize Semantic Segmentation preprocessor in comfyui report fp64 error when using Arc

somber trellis Dec 16, 2024, 3:01 PM

#

earnest grotto <@204342691964780546> you have some spare nvidia gpu? you wanna run some stuff ...

no im on arc

#

i have a 1060 but its not installed lmao

#

Are there any other commands I should add to model_management other than torch.backends.cuda.allow_fp16_bf16_reduction_math_sdp(True)

lucid lily Dec 16, 2024, 3:16 PM

#

@earnest grottoI replaced any datatype float64/double in this nodes with float/float32, but still got same error report😅

somber trellis Dec 16, 2024, 3:18 PM

#

instantir works on torch 2.6+xpu at 4s/it

somber trellis Dec 16, 2024, 3:41 PM

#

lucid lily <@204342691964780546>can you run any custom_node like semantic segmentation whic...

I can run flux which uses fp64 datatypes in its mmdit py

#

Ipex-to-cuda fixes that issue for you in most cases but certain custom nodes you might need to modify it manually

earnest grotto Dec 16, 2024, 3:44 PM

#

the hijacks don't cover every scenario

reef ivy Dec 16, 2024, 6:18 PM

#

civic charm If you are using ipex 2.5, update the hijacks too

This was with 2.3, not sure what's going on with it. I am going to try 2.6 sometime today. I only get this error with latent upscale so far, but my code string is correct?

#

yeah, hijacks updated same error. Might just be something with the latent upscale nodes that don't get hijacked? Gonna try 2.6 later on fingers crossed it works this time

civic charm Dec 16, 2024, 6:35 PM

#

resolution?

#

You can't split it below 4GB after a point

reef ivy Dec 16, 2024, 8:15 PM

#

It was going from 768*512 to 2x that, I would have expected an oom rather than the 4gb message. Its an ltx vid though. I have been trying to find an upscale method that didn't take forever

earnest grotto Dec 16, 2024, 8:16 PM

#

You do oom

#

4.25+4.19>7.75

reef ivy Dec 16, 2024, 8:31 PM

#

But oom usually has another message not the 4gb, when I got that before I didn't have the hijacks working. Its strange

somber trellis Dec 16, 2024, 10:21 PM

#

not sure if i wanna use 2.3.0 or 2.6.0

#

or ipex 2.5.1

#

2.6.0/2.5.1 is not far from 2.3.0 with the cuda mem eff sdpa fix

reef ivy Dec 16, 2024, 10:25 PM

#

is there a way to get it to run with comfy? Getting not compiled with cuda errors

#

I got it to work in a fresh env by itself with the test script, but comfy won't recognize xpu or something

somber trellis Dec 16, 2024, 10:26 PM

#

with 2.6.0?

reef ivy Dec 16, 2024, 10:26 PM

#

Yeah

somber trellis Dec 16, 2024, 10:26 PM

#

In model_management.py you need to remove all ipex-related code for xpu

reef ivy Dec 16, 2024, 10:27 PM

#

okay thanks

somber trellis Dec 16, 2024, 10:27 PM

#

    from ipex_to_cuda import ipex_init
    ipex_init()
    torch.backends.cuda.allow_fp16_bf16_reduction_math_sdp(True)
    _ = torch.xpu.device_count()
    xpu_available = torch.xpu.is_available()
except Exception:
    pass```

#

for me i just use this

#

that third line helps with performance

reef ivy Dec 16, 2024, 10:27 PM

#

also, did you get a lot dependency errors?

somber trellis Dec 16, 2024, 10:27 PM

#

I got some 2025 oneapi dependency errors that I fixed by installing either their 2025.0.2 or 2025.0.1 counterparts

#

otherwise I have torch, torchaudio and torchvision for torch 2.6.0

reef ivy Dec 16, 2024, 10:28 PM

#

this is what I am getting.

#ComfyUI for Intel Arc using IPEX

Loading: ComfyUI-Inspire-Pack (V1.6)

Loading: ComfyUI-Manager (V2.51.9)

ComfyUI Revision: 2804 [cc9cf6d1] | Released on '2024-10-31'