#Longer generation time for Qwen Edit 2251 after update

1 messages · Page 1 of 1 (latest)

grand helm
#

Greetings. After updating from yesterday's version of SwarmUI, for some unknown reason, Quen Image Edit's runtime increased from 1 minute 15 seconds to 3-4 minutes. Rolling back to the previous version ([992a4bed: LTXV Latent Upscale basic support]) helps. I'm attaching screenshots (Before 1 and 2, After 1 and 2) with logs.

abstract basalt
#

there's been some minor UI tweaks, some documentation, and some edge case handling on very specific configs that don't apply to you

toxic tinsel
grand helm
#

I understand and accept your arguments. It's just strange that yesterday, with identical settings, generations in Qwen took about 1 minute, and I could do them in batches while watching YouTube videos in the background. Today, it takes 3-5 minutes per generation, and during it, I can't even move the mouse. Everything is so slow. I haven't changed anything at all, haven't updated the system, drivers, or anything else. I'll continue experimenting, maybe I'll find the cause.
Weird shit actually.

grand helm
toxic tinsel
#

Kinda sucks tho because it breaks my normal Qwen/Z-Image combination

grand helm
toxic tinsel
grand helm
#

I can't find any info in generation history or in options

toxic tinsel
grand helm
#

So i should change them somehow and return back old ones or what?

grand helm
toxic tinsel
#

You select it here, in the UI or change the default

grand helm
#

Thanks i'll try!

grand helm
grand helm
#

I got error with this text encoder

toxic tinsel
#

It will do that if youre trying to use it with z-image because its not the right one for z-image

#

z-image uses qwn_3_4b

grand helm
#

I'm not use z-image it is pure qwen 2251 + lightning lora + anime2real lora 🙁

toxic tinsel
#

o

#

Maybe I had them mixed up, and qwen uses Qwen3-4B? I dont think so though

grand helm
#

It is better now, but still not ~1 minute generation time

toxic tinsel
#

Yeah, thats the fp8 model, which is the original one that is twice the size of the gguf

#

If the gguf text enc doesnt work, that may be something monkey needs to look at.

grand helm
#

Are all these text encoders compatible with Qwen Edit or regular Qwen?

toxic tinsel
grand helm
toxic tinsel
grand helm
#

I understand. But no luck with Qwen2.5-VL-7B-Instruct-Q6_K (same result like with Qwen2.5-VL-7B-Instruct-Q4_K_M).

#

@abstract basalt maybe do you have an idea why it is so oh mighty guru?

#

Omg it was 30 seconds yesterday!

grand helm
#

I figured out the problem, and I'm shocked. It turns out that if you don't specify the BASE application, the generation takes a long time. If you specify it, it takes 30 seconds. I don't know why there's such a huge difference; I'm not an expert on this matter.

chrome abyss
#

You realize that your first screenshots are cold gens ( first of the the session) and this one is a warm gen?

#

Cold always takes longer because the files have to be moved to ram and vram

grand helm
#

In my situation it doesn't matter. If you don't apply Base in Lora, the generation time will be slow, no matter the first, second, or hundredth. But if you install it, it will always be fast, and the VRAM problem will magically disappear.

chrome abyss
#

using an input image made it so that you are genning in a smaller resoultion

#

due to the option "smart image prompt resizing"

#

higher res = slower gen and more vram usage

#

scratch that

#

confused myself

#

every single slow gen you posted is a cold gen though

#

and every fast one a warm one

#

I know you said it doesn't make a difference, but you might've misunderstood

#

just test it, restart swarm do any gen and it will be slow, press the gen button again and it will be fast

grand helm
#

And if i'll disable @base for lora it will be 3-5 minutes

naive hamlet
chrome abyss
#

honestly not too sure what you mean by that, but disabling a lora will make the first gen after that take a bit longer

naive hamlet
#

and this fix doesn't make sense, but good for you if it works 😄

chrome abyss
#

can just say that every screenshot you posted has nothing out of the ordinary

grand helm
chrome abyss
#

on a cold gen the split between gen time and prep time isn't really reliable

#

always look what the sum is

#

and the sum on that one isn't out of the ordinary for you

grand helm
chrome abyss
#

can you point out the warm gen that took 3 minutes?

#

ah sorry

#

my fault

grand helm
#

I can disable @base and it will be even 5 minutes

#

The reason is unclear, but without @base the video memory simply gets clogged and that's it, generation takes much longer.

#

35 seconds, yahoo! 🙂

#

In warm it is 26 seconds even.

toxic tinsel
#

This is interesting.

#

Are you saying it generates faster if you assign the lora to base?

grand helm
#

Yes it is

#

Although this is not entirely true, generation occurs normally when there is BASE, but when there is not, the video memory is full and generation is much slower.

#

I am writing through an online translator, my English is not good enough.

#

Maybe it will work for WAN 2.2 too? Have to test later