ComfyUI Intel Arc performance | Intel Insiders Community | Page 1

quasi orbit Aug 2, 2025, 8:00 PM

#

I just started experimenting with Comfy and the Wan2.2 model.

I manage to get everything working and it's utilizing the GPU. But I get a feeling I'm getting very low performance compared to a similarly tiered Nvidia card.

Here's my specs

Intel Arc a750 8bg vram
Intel i7 13700k
32gb ram
AI running on a fast Samsung SSD 970 EVO Plus drive.
Running ComfyUI with Powershell in windows (not WSL2)

I tried running a Wan2.2 14B Q3 K M gguf model paried with LightX2V 14B 480p Lora model that fits in my GPU vram.

Tried running a test with 16fps and 120 length

It runs. But I'm getting 700-900s/i . Which feels very slow in comparison to someone who runs more or less the same workflow but is getting something like 20-30s/i on his Nvidia RTX 4060, also with 8gb vram (he was running 80 length).

Feels like something is off with this huge gap in performance?

zinc locust Aug 2, 2025, 8:59 PM

#

https://discord.com/channels/554824368740630529/1193952640225267802

scenic glacier Aug 3, 2025, 1:35 AM

#

They are likely using sage attention and torch.compile. you can get torch.compile working on intel but you have to do a little manual installation of stuff. Also rtx 4060 is a much faster gpu than an a750, it should be comparable to a 3060 without any nvidia optimizations

zinc locust Aug 3, 2025, 3:01 AM

#

sage attention and torch compile are not the difference between 900 seconds and 30 seconds

scenic glacier Aug 3, 2025, 3:56 AM

#

could be the vram stuff as well, --reserve-vram would likely help or using the block swaps in kijais nodes

zinc locust Aug 3, 2025, 4:03 AM

#

yes #1193952640225267802 message

scenic glacier Aug 3, 2025, 4:05 AM

#

I would like to mess with kijais nodes and the gguf models , haven't ever had a chance. The block swap lets you more directly control how much ram/vram to use.

quasi orbit Aug 5, 2025, 12:03 AM

#

Is there any new information on the B60 GPU's? Trying to find pricing, release dates etc. But can't find any. Might get a B60 dual GPU, depending on the pricing and availbility in europe.

scenic glacier Aug 5, 2025, 12:16 PM

#

Q3* so probably

outer perch Aug 10, 2025, 6:43 PM

#

I'm speaking out of turn here, as I've only recently picked up an A770 with the intent to try and get a better understanding of how AI technologies work. I already had an A380, but I am also anxiously waiting for the new pro cards. I imagine they're being held up by the software stack. IPEX works well but I think is incomplete. I can't get parallelism working between the two intel cards in my system. And now, if I'm reading right, engineering seems to be pushing AI Playground, while still having IPEX in active development supposedly.

It certainly feels like Intel is trying to attract the engineering/enterprise market rather than building a hobbyist user base. Like I get it, enterprise is where the money is at and Intel needs as much help as they can get right now. But I feel like it would be reasonable to develop parallel use cases.

And while it may come off as ungrateful, I'm just relaying observations and conjecture. I have no way of knowing what's actually going on and I could be entirely incorrect.

scenic glacier Aug 10, 2025, 9:22 PM

#

Honestly, I'm not even sure intel knows what it wants to do. This new philosophy of everything needs to be immediately profitable is not looking to good for anything new or innovative for intel. I'm glad we are still seemingly getting anything graphics related right now, but I have no clue what they will do in the future with their gaming/consumer graphics cards.

zinc locust Aug 11, 2025, 1:26 AM

#

outer perch I'm speaking out of turn here, as I've only recently picked up an A770 with the ...

IPEX works well but I think is incomplete
You don't need IPEX anymore.
I can't get parallelism working between the two intel cards in my system.
As this is a comfy thread - Comfy doesn't do multi-GPU by default. And image gen isn't super multi-GPU friendly. You'd need to start writing your own script, and then, you'd probably want T5/CLIP on the A380 and the rest on the A770 and to just do them in sequence like that.
engineering seems to be pushing AI Playground, while still having IPEX in active development supposedly
I don't understand what this statement is supposed to imply. AI Playground is completely unrelated to IPEX. It's a convenient frontend for ComfyUI, diffusers and llamacpp.
It certainly feels like Intel is trying to attract the engineering/enterprise market rather than building a hobbyist user base
Not sure how you got to this conclusion from what you said before but, every big tech company wants the enterprise market because it's more profitable

outer perch Aug 11, 2025, 1:49 AM

#

zinc locust > IPEX works well but I think is incomplete You don't need IPEX anymore. > I ca...

Valid. As I said, I'm trying to catch up in the area. Many people have been doing this for much longer than I have and the whole AI ecosystem moves very quickly. Discord is not something that I have used enough to be able to follow well yet.

#

I assume you're implying that llama.cpp has native support for ARC, so IPEX isn't needed anymore. I can't get multigpu processing working in any LLM system. I noticed today that Intel seems to be favoring vLLM which I haven't tried yet.

I understand this is a ComfyUI thread, and I apologize for speaking out of turn in it.

AI Playground is a front end for all of these systems is it not? And it's Windows only.

My comments come from various Intel statements that I've found. Perhaps this is more stream of consciousness, and feel free to ignore me. My point was simply a wishlist. I know development is hard. I just feel like Intel as a company would be more motivated to expand compatibility and documentation with a little more urgency.

zinc locust Aug 11, 2025, 1:55 AM

#

llama cpp does not use pytorch, nevermind ipex

#

py torch is for py thon

#

not c++

#

there's a bunch of llama cpp versions that use sycl, opencl and whatever else

scenic glacier Aug 11, 2025, 1:56 AM

#

it can use ipex-llm afaik, not sure if ai playground is using it though.

zinc locust Aug 11, 2025, 1:58 AM

#

also i might be wrong about llama cpp, i don't exactly remember if this is what ai playground was using, but it's something in the same vein. many other LLM inference servers, like vllm as you say, or ollama or others

#

ai playground installs a bunch of things for you and conveniently lets you use them. you can use comfyui, llama cpp, whatever else by yourself

scenic glacier Aug 11, 2025, 1:59 AM

#

you can try it here https://github.com/ipex-llm/ipex-llm/releases/tag/v2.3.0-nightly since ai playground is using pytorch it's likely not using this though

GitHub

Release 2.3.0 nightly build · ipex-llm/ipex-llm

Starting from version 2.3.0b20250611, the llama-cpp-ipex-llm Xeon portable zip supports AMX and TP.
Starting from version 2.3.0b20250724, the llama-cpp-ipex-llm portable zip supports llama-mtmd-cli...

zinc locust Aug 11, 2025, 1:59 AM

#

you can hack it to work on linux. but if you're on linux, you can also just get llama cpp yourself

outer perch Aug 11, 2025, 2:03 AM

#

Thanks @scenic glacier Aaron, I have tried that version. The container detects both of my installed cards, but prioritizes my 770 over my 380 and won't share vram between them which is what I was hoping for. I'm successfully running ollama-vulkan which is slow, but does what I've asked it to, which is use models that are a little larger.

@zinc locust True! But I'm not trying to run this in a command line with one model. I'm trying to use OpenWebUI because what interests me more is running models and comparing output. Maybe this is possible, but it's beyond my skill level.

#

On topic, I'm running SD.Next which is working well enough to play with. I wasn't aware that generative AI did not see benefits of multigpu processing, so that's neat.

zinc locust Aug 11, 2025, 2:05 AM

#

outer perch On topic, I'm running SD.Next which is working well enough to play with. I wasn'...

multi gpu for llms is a thing

outer perch Aug 11, 2025, 2:05 AM

#

sorry, I used generative ai wrong, I meant image gen

zinc locust Aug 11, 2025, 2:07 AM

#

also multi gpu for training image gen is a thing. just not very much for inferencing

outer perch Aug 11, 2025, 2:11 AM

#

That makes sense to me.

knotty gazelle Sep 23, 2025, 9:08 PM

#

Hello 🙂 i am returning to ComfyUI after a long break.

Successfully compiled pytorch 2.8.0 and intel_extensions. Is there anything i need more to add to increase Comfy performance? 🙂

scenic glacier Sep 23, 2025, 11:09 PM

#

Install with viks script and you should have everything you need for performance I think.

#ComfyUI Intel Arc performance