#๐Ÿ†•๏ฝœsd3

1 messages ยท Page 83 of 1

radiant quiver
#

To generate

#

A island, thats filled with foliage and stylized trees, and stylized rocks

errant dust
#

free

real terrace
dusky thistle
real terrace
dusky thistle
#

i'm using random attention guidance

dusky thistle
real terrace
#

Impressive, and why are you using Cascade?

dusky thistle
dusky thistle
#

that and flux are the top two

#

there's so much unrealized potential with cascade that's bothered me for months... finally been writing the code that's been missing

#

these aren't upscales they're directly generated at these resolutions

#

and i'm hitting a peak of 11gb vram

#

2560x1536

real terrace
real terrace
dusky thistle
#

it's gone way past base cascade quality with this stuff

#

the potential was always there

#

i'll never understand why SAI didn't promote it more, they really achieved something great with it imo

#

it's sad it remains overlooked

#

good luck getting this image in a straight shot out of any other model

real terrace
dusky thistle
#

yeah they announced sd3 a week later like it was coming out in two weeks lol

#

so no one ever developed tools thinking it'd be pointless

#

been doing my best to fill in those gaps

#

got self-attention guidance implemented, along with random attention guidance, and tiled sampling now for stage B

#

finetuned stage B which helped a lot

young blade
dusky thistle
#

implemented a custom noise sampler designed specifically for stage B

real terrace
dusky thistle
#

i realyl ilke this one

#

really unique

real terrace
dusky thistle
alpine summit
dusky thistle
dusky thistle
vital crag
#

does anyone have a chatgpt prompt to take a main prompt and split it up into the 3 different text encoders for sd3?

dusky thistle
#

i'd ask this guy

vital crag
#

no thank u... i'll keep my head attached to my neck

dusky thistle
radiant quiver
dusky thistle
dusky thistle
radiant quiver
#

Ohk

dusky thistle
#

right now flux is the best if you're looking for prompt comprehension imo, and cascade is best for style and fidelity

radiant quiver
#

Ohk

#

Whats the best for making textures for game assets

dusky thistle
#

no idea

torn wharf
#

nf4 flux weights in forge! what's nf4?

hollow swift
#

Is flux lora only for FP8 dev?

dusky thistle
real terrace
torn wharf
#

this is forge forge

real terrace
torn wharf
alpine summit
sterile pendant
vital crag
#

Just looking for something simple for now. Maybe even just a detailed description of what the different encoders take in so I can engineer my own prompt. I kinda winged something rn but could be better

sterile pendant
#

Well you can always make a good system prompt that spits it all out in one reply, but you'll have to manually copy/paste the different prompts out

#

I know that it's definitely entirely doable because I've done it in the past. The JSON method would just save the carpal tunnel of copy/pasting over and over

#

And automate that portion, you could even have the node have pins for each entry

bitter hearth
#

Rob who made Robmix has a ChatGPT GPT that does this

#

it can give T5, Clip L and Clip G

#

personally I use a prompt someone here posted that was slightly modified from the auraflow paper

#

I think I changed the word count but that's about it

#

although I have finally started writing my own prompts because I now do different prompts for different Ksamplers instead of feeding all the same

sullen moss
bitter hearth
#

^ toad in mario should have been this

sullen moss
robust crater
#

girls

#

prompt

sage burrow
#

Uhm, nice try Schnell lol

alpine summit
sage burrow
#

Pro did the best job, however, a lineup of people holding hands, was doomed to much post processing needed lol. This is the after I removed some extraneious arms and fingers

alpine summit
dry wave
#

Flux trains amazingly

#

better than SDXL

#

I'm sure we will see a lot of cool Flux Loras in the future

alpine summit
rain current
brittle nexus
#

The NF4 forge's model is fast, very fast

sacred jewel
rain current
bitter hearth
alpine summit
rain current
rain current
alpine summit
bitter hearth
#

wow

#

(i) NF4 is significantly faster than FP8. For GPUs with 6GB/8GB VRAM, the speed-up is about 1.3x to 2.5x (pytorch 2.4, cuda 12.4) or about 1.3x to 4x (pytorch 2.1, cuda 12.1). I test 3070 ti laptop (8GB VRAM) just now, the FP8 is 8.3 seconds per iteration; NF4 is 2.15 seconds per iteration (in my case, 3.86x faster). This is because NF4 uses native bnb.matmul_4bit rather than torch.nn.functional.linear: casts are avoided and computation is done with many low-bit cuda tricks. (Update 1: bnb's speed-up is less salient on pytorch 2.4, cuda 12.4. Newer pytorch may used improved fp8 cast.) (Update 2: the above number is not benchmark - I just tested very few devices. Some other devices may have different performances.) (Update 3: I just tested more devices now and the speed-up is somewhat random but I always see speed-ups - I will give more reliable numbers later!)

steep widget
bitter hearth
rain current
#

It's the one I've downloaded; now I'm going to try it with the experimental node

bitter hearth
#

im assuming you use comfyui, do you need any specific nodes to run this or can you just load it up in the usual flux workflow?

#

waow b..

#

@rain current curious to know how you find it in terms of speed and quality, i think i will wait for further developments with schnell version of the nf4

rain current
#

Well, no idea, because I'm getting an error in KSampler, and I have comfyui updated

sacred jewel
bitter hearth
#

A ball monster

dusky thistle
icy drift
errant dust
dusky thistle
rain current
dull star
#

I've seen the nf4 posts but still

#

6-8GB people can finally run this model!!!

#

I really hope it doesn't influence quality that much

#

I used to have only 6GB too, so I know their pain

#

and NF4 may outperform FP8 (e4m3fn/e5m2) in numerical precision, and it does outperform e4m3fn/e5m2 in many (in fact, most) cases/benchmarks.
sounds magical

#

and apparently its faster too

bitter hearth
sacred jewel
bitter hearth
dull star
#

same, I might not install the comfyui node

#

I'll wait for an implementation

bitter hearth
#

incidentally i finally wiped out a1111 from my pc

#

never have i thought before i'd be so drawn into comfyui

#

i used to be pessimistic about the noodles

#

now i love it

sacred jewel
bitter hearth
#

and it all happened when i understood how the nodes work

sacred jewel
sacred jewel
dull star
icy drift
sacred jewel
rain current
#

But it doesn't work with the Loras

icy drift
#

2.5% speedup. (That's insignificant IMO.)

#

With BF8 (ignore the sign they're holding, just keeping prompt constant. It's BF8)

rain current
#

The same, slightly faster............ 2 seconds catwhaaa

icy drift
#

Honestly there's a really obvious difference. That's surprising.
The girls are much sadder with BF8 (seriously what???), and the lines are more expressive with NF4.

#

Trying to think up an experiment that could show a clear quality difference... ๐Ÿค”

torn wharf
#

Good morning. Are y'all still laughing at me behind my back today? Been a couple days since I got told everybody makes fun of my balls when I'm not around

#

Just checking if I'm still the trend

icy drift
#

We're talking about a new model format that's supposed to be faster and higher quality.

torn wharf
#

I found it last night actually, but I couldn't get it working on forge

icy drift
sacred jewel
sacred jewel
errant dust
icy drift
# errant dust WHere is the node located? I have it installed but Comfy menus dont show it

You have to install bitsandbytes, but I'm using a custom anaconda environment and I did that manually from the command line. Don't know how to do it on a normal install. ๐Ÿ˜•
(And I had your exact error, so I'm pretty sure this is the issue. If you scroll in your command prompt window after restarting, you should see a notice like "no model 'bitsandbytes'")

sage burrow
# icy drift With NF4

When your computer creates images at the speed of light, perhaps the difference isn't as obvious?
Lowram mode results anyone? lol

icy drift
#

Okay, some slight failures:
an anime girl writing on a chalkboard in a classroom, she is wearing a sailor school uniform, she has a red ribbon in her hair, there is a window on the left showing blue sky and a school football field, there is a bird sitting on the windowsill, there is a desk on the right, there is an apple on the desk, on the chalkboard is written the text "BF8", there is a nameplate on the desk that says "BF8",

errant dust
#

I cannot see it anywhere

icy drift
#

Okay, I imagine I can see slightly higher quality, but could be sample size margin of error.
Going to try locking the seed / using increment next.

errant dust
#

Ok, weird, it is not appearing

icy drift
errant dust
#

No dice. I just Git cloned it

#

and BnB is installed for sure

#

the search turns up nothing

sage burrow
sacred jewel
icy drift
#

Okay, I'm convinced that for this particular test, NF4 is unquestionably superior.
Will try different test design next.

BF8
Text correct on board: 3/4
Text correct on nameplate: 3/4
Desk and apple on right: 1/4
Football field visible: 3/4
Score: 10/16

NF4
Text correct on board: 4/4
Text correct on nameplate: 4/4
Desk and apple on right: 3/4
Football field visible: 4/4
Score: 15/16

#

I mean, that's a HUGE difference in quality.

sacred jewel
errant dust
#

Well, I cannot figure out why the node refuses to be recognized in the Custom Nodes, so I submitted an issue

icy drift
torn wharf
#

bug reports tend to need reproducibility

sacred jewel
torn wharf
# sacred jewel

are you using painting loras or just getting sweet paintings from flux raw

sacred jewel
#

Flux Dev btw full model fp16

torn wharf
# sacred jewel

i'm no farmer but i've seen jeremy clarkson's farm, and caleb would probably trash talk that size of a field.

sacred jewel
torn wharf
errant dust
# icy drift But I don't think it can be a Comfy issue, or it would affect me too...

Doesn't matter. I followed standard steps, so if there is a procedure issue, it should be clarified since it is still a ComfyUI node. I pip installed BnB, then I went to CLI in Custom Nodes folder and Git Cloned the new node code. It installed fine and shows up in the folder too. Yet every time I start ComfyUI (also updated BTW) it does not appear

sacred jewel
#

I do not mind finding good seeds. some may have forgotten that

torn wharf
sacred jewel
torn wharf
#

because it does matter. if only you can reproduce it on your machine, it's often not a project issue. bug reports really rely on reproducibility

icy drift
errant dust
mortal mesa
errant dust
pseudo owl
icy drift
errant dust
#

I did not install to Comfy's environment. But as I understand it, as there was an issue reported, it would affect it running

#

not the node being recognzied

torn wharf
icy drift
#

This time, BF8 won the score for the prompt-adherence metrics I picked, but it also produced visually inferior results. Not sure how to grade those...

BF8
canopy visible, contains "BF8": 4/4
trumpet in top left: 4/4
trumpet rubies: 3/4
anime top right: 4/4
anime green hair: 4/4
UFO bottom left: 4/4
UFO blue jets: 4/4
cat bottom right: 4/4
cat gray calico: 4/4
total score: 35/36

torn wharf
#

yeah. it works fine. it's not a project issue

icy drift
#

NF4
canopy visible, contains "NF4": 3/4
trumpet in top left: 4/4
trumpet rubies: 4/4
anime top right: 4/4
anime green hair: 4/4
UFO bottom left: 4/4
UFO blue jets: 4/4
cat bottom right: 3/4
cat gray calico: 4/4
total score: 34/36

Prompt `a glass window of a storefront, inside the window there are four products on display in a grid, in the top left of the display is a trumpet with ruby gemstone valves, in the top right is an anime figurine with green hair, in the bottom left is a miniature UFO spaceship with blue jet thruster flames, in the bottom right is a cat sleeping with gray and white calico stripes,

the photo is a view from the street, and there is a canopy over the storefront window that says "BF8"`

icy drift
# errant dust not the node being recognzied

The node does not show up at all in the search unless bitsandbytes is installed into Comfy's environment. That is the exact behavior I got.
If you scroll through your command prompt startup log, you will see "no module bitsandbytes".
There will not be a future update that fixes this problem for you, so I kindly recommend continuing to troubleshoot it.

torn wharf
torn wharf
errant dust
torn wharf
#

if its a portable install you want to use the python embedded pip

icy drift
torn wharf
#

no environment activation required afaik. just make sure you're running the embedded exe.

This is the confusing nonsense that makes me wonder why python is even a standard. can't wait till we migrate away from this ridiculous dependency hell situation

#

i found a limit of the network. it can't do mc hammer hammer dancing on ice skates

icy drift
icy drift
torn wharf
torn wharf
#

oops forgot to post this

#

same prompt. i think this expresses the problem perfectly. flux is a white guy trying to do the hammer dance

errant dust
#

And sure enough the issues display I am not the only one with this problem

torn wharf
alpine summit
errant dust
torn wharf
#

oh you blocked me. noted. will reciprocate

rain current
#

Add Friend

torn wharf
#

i dont think he's a friendly

rain current
#

2 or 3 times faster? It's the same for me! Maybe a couple of seconds less ๐Ÿค”

icy drift
alpine summit
torn wharf
#

i think on the newer cards it doesn't improve much. but it do uses less vram for me. i haven't done any quality testing but it doesn't feel worse

rain current
icy drift
icy drift
radiant ledge
rain current
#

What I don't understand is why some users gain so much speed...

icy drift
alpine summit
torn wharf
#

Can flux even do a basic hammer

#

No hammer dancing . No hammer sharks. What gives

rain current
#

I haven't seen any changes in VRAM either... ๐Ÿ˜…

icy drift
#

FP8 vs NF4 VRAM usage. NF4 uses 2.3GB more VRAM. ๐Ÿ˜•

#

Testing if increasing image size leads to greater speed increases.

#

NF4 at 1440x1440

rain current
icy drift
#

BF8 at 1440x1440

#

That's a 5% speedup.

#

Okay, I'm done looking for any noticeable difference.
Yes, NF4 is technically very slightly faster, and maybe technically very slightly higher quality.
There's no reason not to use it, so I'm going to go ahead and use it.

#

If your device has very low VRAM, maybe you will see better results.

untold ginkgo
#

what is low VRAM, 12GB?

icy drift
torn wharf
#

I'm not seeing very much gain on 16 for speed. 4080 is newer than other 16gb card too

errant dust
#

nvm, just got back and did not see yur previous post

errant dust
#

it uses MORE memory? That is in direct contradiction of what was 'promised'

sacred jewel
icy drift
#

Honestly I'm done testing it, and I'm sticking with using it.

sacred jewel
icy drift
#

Flux + ToonCrafter.

#

Also just realized ToonCrafter requires 512x320, which Flux can totally do. Gonna try that next.

rain current
torn wharf
#

oooooooo i got flux working in forge oo goodie

#

deleted the venv. always a good solution

#

new forge got all kinda memory controls too like where to swap and how much. llly changing the game again

icy drift
#

Should try some other movement maybe...

rain current
alpine summit
torn wharf
# rain current

lol how soon till people are flexin by wearing a gpu on a chain? with the plastic slot guards still on

#

imagin seeing a guy with a 5090 hanging from his neck. sure it's ridiculous but you know that guy got some kinda money then

rain current
#

If someone gives me a 5090, I'll wear it around my neck for an entire month

sage burrow
#

buys dead 3090 on CL for $50, sends to r_made ๐Ÿ˜‰

torn wharf
#

maybe like, some insulation between me an the card

#

just gotta sit with the desktop between my legs an the riser oughta reach

#

https://youtu.be/q5xvwPa3r7M?t=310 think they got like 10 risers chained here. should be good

Thanks to LG for sponsoring this video! Check out their Share the Art #LG gram campaign: https://www.lggramevent.com
https://youtu.be/6oKU9APBrwI
Learn more about LG gram laptops http://www.lg.com/us/laptops/

Can PCIe extensions cause a loss in GPU performance? We investigate!!

Tunnelbear sponsor link: Visit https://www.tunnelbear.com/LTT and ...

โ–ถ Play video
mortal mesa
#

on a 2080 the NF4 loader/checkpoint got me to SDXL speeds, insane. 5-7 min down to sub 45 seconds

alpine summit
errant dust
#

Schnell or Dev?

#

@icy drift Your tip solved the issue, thanks.

mortal mesa
bitter hearth
#

that's still quite a lot slower than SDXL, right?

#

45 seconds for 20 steps

mortal mesa
#

i mean i havent used SDXL in a while and i forget but ya when i stuffed it with Cnets IPadapters seems right for me, the point being i went from 5-7 minutes to 45 seconds on 20 steps of flux dev on a 2080 TI 11GB

bitter hearth
#

its still a good drop yeah

rain current
#

45 seconds is how long it takes for me with 20 steps on a 4060 Ti 16GB... And from BF8 to NF4, there's no difference in time. It seems like the improvement is more noticeable on older cards...

alpine summit
mortal mesa
errant dust
#

Here is my read now:

#

so 11GB or more would be a big diff per this

hollow swift
#
https://github.com/lllyasviel/stable-diffusion-webui-forge/releases/tag/latest
Press any key to continue . . .```

how can i install forge i got this problem
errant dust
#

First question: running from main Python or embedded?

#

(this is for Comfy use of the TF4 I assume)

torn wharf
#

its for forge. you can tell because the url to the forge project is there and he says "forge"

#

my first guess is AMD card

sacred jewel
torn wharf
#

or maybe i'm wrong. maybe "she says"

errant dust
sacred jewel
torn wharf
#

there's an 80s arnold schwarzenegger lora for flux d now. it looks good! it says it's 90s arnold but naw. that all looks like commando era arnold pre t2 but i dont think loras work with nf4 quite yet and i'm knee deep. ill test it later. https://civitai.com/models/638000/arnold-schwarzenegger-1990s-flux-lora?modelVersionId=713426

** - First attempt / Please don't expect perfection. Currently works on ComfyUI (make sure you're up to date or it may not). At the time of upload ...

errant dust
#

Depends on the rig I guess and your local setup.

#

Right now my GPU is at 100% load and 68 C

#

in the laptop

#

Some laptops are MUCH worse at heat management than others, that is not in doubt

torn wharf
alpine summit
errant dust
#

My previous ASUS had a cool system in which when you opened it, meaning the lid was up, the entire bottom would open to allow ventilation

torn wharf
#

my desktop pc has a cooling system called surface area

errant dust
#

THough back when I was training NNs for a living I had two rigs with dual 2080tis, and those required permanent A/C in their area

torn wharf
#

it works by being really big and spread apart so that more heat can escape into the air stream

errant dust
#

Electricity bills were brutal too, especially when you add that I live in Rio de Janeiro, a tropical city

torn wharf
#

i do have some issues with my desktop though. like, it's a rebuilt area 51 from dell but they use proprietary connectors for the case button. i CAN get the adapter and put it on there but i never have so i've been needing to power on my PC this way all the time

errant dust
#

so the a/c never let up. Even now in the middle of winter we had a 4-day spike of 90 F

#

(32 C)

torn wharf
#

bonus points if you know what im doin

errant dust
#

Though curiuosly enough, it isn't that much worse in summer

#

The average temp diff is 10 C from winter to summer

icy drift
#

The toes never come out looking very good. Always messy. ๐Ÿ˜•

torn wharf
errant dust
#

So my first image with NF4 was indeed faster, but in large part due to the fp8 Clip tensors. It made a really bizarre mistake, so i am now trying the NF4 with the fp16 CLip (slower) to see if this helps

icy drift
#

They're fine for a hobgoblin. No nails except on the big toe, which is gnarled. Also her heel looks more like a wrist.

torn wharf
#

i see the wrist ankle now

#

the the big toe is half thumb

icy drift
#

Trying to prompt for feet now to see if that helps.

torn wharf
#

pedicured feet?

real terrace
#

I tried this one and then the basic Schnell, I think the latter gave me better results.

#

this schnell-dev-merged generated good but sometimes images has this kind of scan lines or things like that

#

Or somekind of pixelated grain or something

errant dust
#

The proof is in the pudding but the only way you're going to be able to really know if this is the model itself or flux is to generate the same image with the same conditions and same seed and test with the different models

#

If you don't then you'll never know if it wasn't just the luck of the draw

icy drift
#

I don't have any reliable way to compare the overall quality of two models. (But Schnell is terrible.)

real terrace
icy drift
#

No luck. Still messed up feet.

real terrace
icy drift
#

But these hands are practically photo quality. Seriously don't think I have any complaints.

real terrace
errant dust
#

but YMMV

bitter hearth
#

Schnell is unusable to me

#

its fine for research purposes

errant dust
#

Dev-Schnell was fine

bitter hearth
#

ah I haven't tried that one

errant dust
#

keep in mind, all need fp16 Clip

#

fp8 for me is a systematic problem

bitter hearth
#

I don't like the baked in aesthetic of flux
so I am staying on SDXL

#

I try each task with RealVisXL, Juggernaut, Zavychroma and Helloworld
usually one of those four will be better than the others for whatever the task is

craggy crest
#

Burger vrs optimized burger

bitter hearth
#

wow that would be quite a burger

errant dust
#
  1. line-art cartoon of dogs playing rugby. They look serious and ferocious and are wearing uniforms. At the top in large dynamic letters are the words, "Don't mess with the All Blacks!"
#
  1. An ornate teacup is on a table in an old-style coffee shop. In the teacup is a frigate in a storm with very choppy waters as lightning strikes the mast. On the teacup you can read in elegant and ornate letters with ornate decorations, "Tempest in a Teacup".
#

I don't test anime art BTW as I find it all looks the same and is boring as all hell. I have some others that test prompt ability

#

here is a third

icy drift
#

I'm trying to do inpainting in ComfyUI, and I can't get a soft / blurry mask. I'm loading a mask image that's soft, but Comfy is sharpening it. Does anyone know how this works?

bitter hearth
#

I don't actually do any testing or benchmarking of models myself

#

cos usually the papers will show some benchmarks anyway

#

so I am happy to just assume their results are relatively right

#

typically its FID on MS Coco, Imagenet or CelebA

errant dust
#
  1. There is an ocean with the upper part above surface and the lower part below the surface. On top of the surface is a 17th century sailing ship on a calm ocean on a sunny day. Below the surface the water gets increasingly darker the deeper it gets and there are massive eldritch sea monsters.
errant dust
bitter hearth
#

its quite expensive to calculate a FID score
I don't really feel motivated

errant dust
#

I don't bother with scores. I read them and look, but in the end what matters is the results I get, not what their wonderful papers and demos and announcements show

real terrace
bitter hearth
#

ok that Schnell image is good

#

to be fair

errant dust
errant dust
#

NO, I got that part, but was wondering what setup yielded that exact result

bitter hearth
#

there are issues with FID score
not correlating perfectly with human preference
but its really not too bad

real terrace
craggy crest
bitter hearth
real terrace
bitter hearth
#

that one is not so great yeah

#

its true that sometimes it does well

errant dust
#

They can serve as a baseline to tell me it is worth checking out, but in the end it is what I get that matters

bitter hearth
#

its a bit like audio gear

errant dust
#

like you preffering XL

bitter hearth
#

you can optimise for your personal tastes if you want

#

I didn't particularly feel the desire to

errant dust
#

I just saw two reviews talking about how Flux Pro is clearly unbeatable, and the new god of image AIs. Sounds great and then you see the 'reviews' and see they made exactlt 4 images and tested ONLY photorealism of people. Myeah, ok, but what if I have other interests?

craggy crest
errant dust
#

so NF4 works excellent bur REALLY needs fp16 text Clip

icy drift
#

Inpainting with Flux. Just in case anyone was believing the "impossible" nonsense. ๐Ÿ™‚
Original render:

#

Inpainted (to get full art version):

errant dust
#

here are two NF4 with that Rugby prompt above: "line-art cartoon of dogs playing rugby. They look serious and ferocious and are wearing uniforms. At the top in large dynamic letters are the words, "Don't mess with the All Blacks!" The first is with the embedded fp8. The second is with fp16:

bitter hearth
#

its one of the reasons I like FID, because its 30,000 images

errant dust
#

The fact that the fp8 has no rugby ball at all... is a huge fail

#

The All Blacks are a super famous rugby team for anone who does not know the reference

bitter hearth
#

ooh ok

craggy crest
errant dust
#

dude, it had 30 steps

craggy crest
# errant dust dude, it had 30 steps

revised your prompt: cross section view of the ocean. On top of the surface is a 17th century sailing ship on a calm ocean on a sunny day. Below the surface the water gets increasingly darker. We can see a massive octopus lurking near the bottom of the image.

real terrace
errant dust
craggy crest
errant dust
#

see the images above. workflow is included

craggy crest
errant dust
#

the two rugby cartoons

real terrace
#

Oh I would love to tile upscale with flux (well the Schnell version), hope it can be done soon... and looking forward to test that inpaint

icy drift
errant dust
errant dust
#

Dall-E 3. Notice how the bottom and top seem to be utterly disconnected?

#

the weird white caps even make it seem like this is the beach wash coming in

#

not ugly art per se, but a real fail for the overall image

rain current
bitter hearth
#

Flux Pro/Dev and to a slightly lesser extent Midjourney are really good at getting the overall idea of what an image is meant to do as a whole yeah

craggy crest
#

@errant dust it's done to make it clear to the viewer that it's the top of the ocean - as if we're looking through a thick piece of glass. okay, i don't have CheckpointLoaderNF4 and it's not showing through missing custom nodes. where'd you get it?

bitter hearth
#

the big private SD3 is also very good

#

SD3 ultra

errant dust
bitter hearth
#

Midjourney is pretty baked though

#

I could never really subscribe to such a baked model

craggy crest
errant dust
#

I do. Are you using the embedded Python or your own?

#

it changes the isntructions

craggy crest
errant dust
#

ok, so open CMD and go to the ComfyUI folder

craggy crest
#

k, there

errant dust
#

now paste and enter this: ..\python_embeded\python.exe -s -m pip install -U bitsandbytes

#

without this, your node won't even appear even if installed

craggy crest
#

okay. ran, seems to have installed fine

errant dust
#

ok, if you have not installed the node, but have git installed, go to the custome_nodes folder, also in command line

craggy crest
#

your workflow loads now, give me a minute

errant dust
#

then you should be good to go

#

place the file in checkpoint not unet

#

refresh and my image should now show the workflow with no missing nodes

craggy crest
craggy crest
real terrace
torn wharf
#

one of my goto prompt exercises is to recreate movie scenes and flux is really good at it. i think i remembered why i've never really been a foot guy. one movie scene in particular ruined my world view forever. you might know it already before clicking it.

#

aw i forgot to spoiler it. oh well

proven pecan
#

Schnell

icy drift
#

(The one with the border is the original. The full art one is inpainted / outpainted.)
Using differential diffusion node, although I'm not sure it does anything to Flux. I would be surprised if it did.

torn wharf
#

smart to repost those since that gave more of a buffer to kathy bates ankle hobbling pic

#

too cute to contrast to that horror

dry wave
craggy crest
# errant dust It won't run with Loras BTW

i didn't try to run it with a lora, just out of your workflow. but since you don't seem happy with it, and it's throwing code errors for me, i'm not going to spend any more time on it. i don't have a need for it personally

errant dust
#

I only stated that fp8 clips are not good for it. The same was true for fp8 T5 for SD3 Medium mind you

#

which was much worse than fp16

#

woah

#

it fit the entire NF4 Dev model into my VRAM

#

My render time dropped from 20 mins to 2

#

using fp16 clip

#

ok, 3

#

and 30 steps

#

ok, that alone is a bit of a game change if results are not obviously worse or something

#

There ARE differences with identical seed but too little info to make any calls

#

So first of all, I ran your lovely sea monster with plain Dev as a reference. took 1836 seconds. Then NF4 Dev, took 216 seconds.

mortal mesa
#

i had the same experience on an 11 GB card, amazing

errant dust
#

Yes, but this is not even 11Gb, this is a laptop 4060 with 8

#

and believe me, even Schnell was not this fast with 4 steps

mortal mesa
#

ya, more amazing

errant dust
#

Schnell was Schlow

mortal mesa
#

ya same here just a tad faster

#

dev 20 steps was taking 5-7 minutes

errant dust
#

5-7 mins was my wet dream

#

I did enjoy a speed boost too by manually installing CUDA

#

12.6 from the Nvidia site

#

then upgrading Torch to make use of it

torn wharf
#

Flux pro? More like... FLUX SLOW

errant dust
#

heh, it is not even entirely loaded into VRAM, but compare these two lines:

Before:
Loading 1 new model
loaded in lowvram mode 5728.075

Now:
Loading 1 new model
loaded in lowvram mode 64.0
Prompt executed in 189.01 seconds

mortal mesa
#

i think fp16 T5 is noticeable also, trying this to simplify my files

craggy crest
torn wharf
#

oh sweet. meta in canada has boosted to llama 3.1 now

errant dust
# craggy crest i've never tried fp8 on SD3 medium. now i have to

Yes, while there is ample talk about it being worse, it is often hard to measure. That is, until I found a very simple way that showed up each and every time. Text. Same prompt and fp8 always botched the text more than fp16. Sometimes criminally so. Interestingly, Flux Dev is not above this issue either, and highlights how much worse fp8 is, and is likely impacting every other aspect of the output.

mortal mesa
naive socket
#

RuntimeError: Input type (float) and bias type (struct c10::Half) should be the same

errant dust
#

Ex: fp8 first and then fp16 next. Flux Dev. The phrase should be "Don't mess with the All Blacks!"

craggy crest
#

grab one of my workflows and play around with it

dull star
errant dust
#

ok, one thing is certain, NF4 for 8GB Vram cards is a total game changer

dull star
#

yeah im so happy that its getting democratized to lower end people

mortal mesa
#

caste is dictated by GPU Model

dull star
#

I used to have 6GB of vram at the infancy of (public?) Stable Diffusion (2022 august)

#

so even though I had enough vram to run it and all, I still struggled with a bunch of other AI stuff that were just simply out of my reach

#

so I know how it feels for them

#

I am so glad for nf4

errant dust
craggy crest
errant dust
#

ok, now I must try the Schnell version just to see how it looks

errant dust
craggy crest
errant dust
#

ahhhh... in that case

#

I assumed the original prompt

craggy crest
errant dust
#

Anyhow, NF4 Dev gave me this:

dull star
errant dust
craggy crest
craggy crest
# errant dust Ok, but how would you split them?

you give each of them what they are best at. clip_l is your background stuff, ambience, tiny details you don't notice unless you look for them but that make the image stand out, and clip_g is the black and white, just the facts mam "this is what the image is"

errant dust
#

Ok, let me first compare NF4 Schnell as apples to apples first

torn wharf
errant dust
#

Regardless, I will be very interested in seeing your Flux workflow

torn wharf
#

im playing with the new meta.ai model right now. it can do vegeta

dull star
#

nice

#

vegeter

craggy crest
# errant dust You mean T5 not g I assume

i mean exactly what I said. clip_g (sd3 medium) is your workhorse. you give it the simple, black and white, this is what the scene is. you give t5xxl the richly detailed, narritive prompt - and the text - and you give clip_l the fine subtle details, background, ambient stuff

torn wharf
errant dust
#

WIll that work with Flux?

#

meaning the three encoders?

#

well, will find out soon enough

torn wharf
#

same prompt in flux dev. Vegeta eating a carrot like bugs bunny

craggy crest
torn wharf
#

doesn't know vegeta i'm tellin u

dull star
#

that's funny cause it knows goku

torn wharf
#

vegeta would be mad about it

errant dust
#

oh boy. NF4 bombed really really bad. Let me try again with 6 steps

#

maybe the fp16 will fix this cluster f***

torn wharf
#

wanna know a fun dragonball fact? all the saiyan names are vegetable words. vegeta, kakarot, brolly, bardock, caulifa, cabba. its the anime veggie tales

craggy crest
errant dust
#

ok, NF4 Schnell is total no go

dull star
#

too distorted?

#

or low quality?

torn wharf
#

nf4 dev on forge

craggy crest
rain current
torn wharf
#

can't do broly either but this is good

errant dust
#

Seeing is believing. This is plain NF4 Dev and next is NF4 Schnell. The prompt is: "cross section view of the ocean. On top of the surface is a 17th century sailing ship on a calm ocean on a sunny day. Below the surface the water gets increasingly darker. We can see a massive octopus lurking near the bottom of the image."

torn wharf
craggy crest
errant dust
#

Oddly, the Schnell is ALSO same speed since the memory management is worse and bleeding more into my normal RAM

torn wharf
#

next to test is the dev/schnell blends in nf4

errant dust
#

here is one example:

craggy crest
errant dust
craggy crest
#

was starting to think it wasn't allowed to spell that word

bitter hearth
torn wharf
#

balls

acoustic kite
#

256x256 image made in 5 seconds with flux. this could be as fast as sdxl

#

gets text right every time with the new nf4 model

#

i get 3.3it/s on an rtx 4070ti super

#

5 seconds in total for a perfectly readable 256x256 image

#

another prompt, 5.5 seconds at 256x256

#

somehow text is perfectly fine

#

amazing

#

at 128x128 it struggles lol

acoustic kite
#

takes 4 seconds per image so it's not worth it at 128x128

#

but 256x256 takes 5 seconds at 3.3it/s with like 6gb vram used up i think

#

let me check again

#

not a lot of vram

#

ok nvm it uses 12.3 gb vram

#

but with that it takes 5 seconds and .5 seconds to load a new prompt

#

very very fast and no reduced quality @craggy crest

#

not even worth it to use fp16 or fp8 at this point

#

here is 512x256. 6.7 seconds at 16 steps

#

starting to break at 1024x128 lol

#

understandable

craggy crest
acoustic kite
errant dust
#

Yes, I am using the NF4 Dev at 1024 x 1024 as my baseline, since it is the usual size I work with, and my laptop typically chokes on the Dev model to the point that ~20 mins is not unusual. So let me show you Dev on my Teacup prompt (over 20 mins), and then NF4 just now at 1:47 flat.

#

Prompt: An ornate teacup is on a table in an old-style coffee shop. In the teacup is a frigate in a storm with very choppy waters as lightning strikes the mast. On the teacup you can read in elegant and ornate letters with ornate decorations, "Tempest in a Teacup".

#

Dev (normal) and NF4 Dev next:

#

There are micro-differences, sure, such as NF4 in this case is slightly warmer overall, but 'worse'? No way. In fact, In realized now the Dev was 50 steps, and NF4 was 30. So let me give you the 50 step NF4 to compare properly.

alpine summit
errant dust
#

Dev (normal, 1k, 50 steps) and NF4 Dev (1k, 50 steps):

acoustic kite
errant dust
#

and I was lying. Almost embarrassed to tell the truth. That 50 step Dev took my poor rig well over 45 mins. Really. The NF4? See for yourself:

alpine summit
errant dust
#

One curiosity I have noticed is that NF4 is affected much more by greater steps beyond 30 than the 'normal' Dev. In my tests, and I did quite a lot (in spite of the insane times) 50 steps was barely noticeably different than 30 steps. Not so for NF4 Dev.

#

To illustrate, here is Dev at 30 steps and then 50 (using fp16 Clip):

#

Now for NF4 Dev fp8 at 30 steps, 40 steps, and 50 steps:

#

Sorry, was missing a 40 for the fp16. Ok, and here is NF4 Dev fp16 Clip at 30 steps, then 40, then 50:

#

fp8 clip is the clear dodgy choice, and frankly I saw no speed benefit or memory. NF4 at 30 steps is already superb, and yes there are some changes with more steps, nothing that would have me second guessing the step count. And just for completeness sake, in this particular case, 35 steps seems to be the goldilocks point. So Dev (normal) 30 steps, NF4 Dev 35 steps (both with fp16 clip)

torn wharf
#

i wont make any definitive claims but i feel like nf4 model often produces more interesting details

vital crag
torn wharf
#

making a prompt for that purpose is not so hard. LLMs are really easy to direct. here's how i'd do one, and then adjust it as needed.

"We are going to make prompts for an image generation model. The model has 3 separate text encoders that work together, but understand text in 3 different ways. When i give you an idea, you will write 3 prompts for each text encoder. The first one is for Clip VIT-L and should have a list of style tags separated by commas. it should follow a pattern like: style, subject, actions, setting, detailing. The second one is for Clip VIT-G. It's the text input that affects the scene the most. Write a couple of sentences describing the scene and it's style in clear natural language. The third input is much more robust and understands a lot more descriptions. It uses the T5 encoder. The prompt for this one should be one or two paragraphs describing the image in detail with elaborate natural language and rich vocabulary. A bit of backstory to the scene can even help to harness the creative power of the model.

Now, given that information, i have an idea about a cat in the sunshine"

this should work in any LLM assistant available. i put it into meta.ai and got this result. I just made it up right now. i make new instructions all the time depending how i'm feeling. it's the same jist usually

vital crag
#

thank you

torn wharf
#

the image

sage burrow
torn wharf
#

is that what glif is, llm driven?

#

if i was to improve on that prompt, i'd give examples . maybe limit what kind of scene components are given to L

bitter hearth
#

glif is secretly comfy

#

but they took the noodles and put them in a cupboard in the back room to hide them

#

the noodles are still there though

brittle nexus
dusky thistle
#

5120x3072

bitter hearth
#

there's a node on github that is like deep shrink but more gradual

#

I wonder if it would boost Wรผrstchen

#

would be next level if it worked

#

so normal deep shrink is a 2x latent shrink at bock 3 for a duration of 0.3

#

this node is usually more like a 2x shrink at block 3 which then slowly grows back to normal size across the whole generation

torn wharf
#

playing with this forge version with flux support. no image to image seems. i can't make it work

sterile pendant
#

(had to search through my posts to remember who i was talking to about it all)

vital crag
bitter hearth
#

flux has both clips?

sterile pendant
#

no problem, i havent tried it out, but it looks like it should work. plus, all the different fields you can set to random

bitter hearth
#

not sure why but I thought it had one clip

sterile pendant
bitter hearth
#

ah ok

sterile pendant
#

i'll check the code

bitter hearth
#

on every model I always forget which one is the smart clip and which one is the tag-prompting clip

sterile pendant
real terrace
sage burrow
#

Anyone remember that old model called SD3?

torn wharf
sterile pendant
#

yeah, since they all use the same shit under the hood, minus G in this case

torn wharf
#

thats literally all you need to do to adapt it. ignore the G prompt

#

am i right that because bitsandbytes added support for qlora, that's why we're now able to load models in nf8?

sterile pendant
#

not sure, i'd have to look at their repo. havent looked at any recent advancements they've made lately. all i know is that fluxdev nf4 in comfy is a godsend. the quality is mostly equal to or just a hair lower than the fp8, but i get 2.5 sec/it now, down from 8 sec/it on this 8gb card. if i want, i can always save off the latents and resample with the slower fp8 version or something

real terrace
sterile pendant
#

i want to try a schnell version at nf4 to see how it does, saw some people uploaded some already on civit, but F making an account there to download them

sage burrow
# torn wharf is that what glif is, llm driven?

You can add LLMs to improve prompts, so in the case with this, it can be added as part of the workflow, so whenever someone types a prompt, all of the above also happens to be addded (so they don't have to type in that aspect every time).

errant dust
#

It's crap

#

The Schnell version

sterile pendant
#

figured

#

well maybe the dev-schnell merge version will do better

errant dust
#

You can scroll up I posted some comparisons but it's just crap

sage burrow
sage burrow
torn wharf
#

can i edit a glif after it becomes popular and eveyrone who visits it gets the new version? asking for a friend not because i want to make one that gets popular and then change it to make everything that's prompted into a butt prompt

sterile pendant
torn wharf
#

hmmm

sterile pendant
errant dust
#

No, I mean the Schnell NF4

sterile pendant
#

ahh alright

errant dust
#

As to speedups.... I went from 65s/it to 3.5

#

With NF4 Dev

sage burrow
# errant dust The Schnell version

Is the Dev version worth experimenting with? I seem to recall you mentioning something about waiting 45 mins for an image though! ๐Ÿ˜ฆ I don't remember if that was in relation to nf4 or not though

errant dust
#

You're completely correct that was what I reported and now I'm getting 2 minutes

#

With this new miracle

torn wharf
#

on my 4080 i'm getting 20 steps in 22 seconds

sage burrow
errant dust
#

Honestly nothing special. I keep my workflows in all the images I share here but basically it's just using nf4. Though I strongly recommend keeping the fp16 clip.

sage burrow
#

I tried install missing nodes, tried searching for it....

torn wharf
#

it sayus its on manager now

errant dust
sage burrow
torn wharf
#

comfyui is getting for loops now and thats super interesting to me

sage burrow
#

how many versions of flux will there be I wonder ๐Ÿ˜„

torn wharf
#

random thought

sage burrow
bitter hearth
#

its kinda hard to think of a use for them

torn wharf
#

loops mean that few nodes can be repeated many times

#

i can't come up with a reason to use them on a single images. but i feel potential

sterile pendant
torn wharf
#

there's these things i've seen, one button datasets they call themselves. it could make those more interesting and potentially use less custom nodes

sage burrow
#

either comfyui manager search feature sucks, or this isn't true

sterile pendant
#

for now, you can manually install it

bitter hearth
craggy crest
#

and apparently he's updated it again sometime today

errant dust
#

I'm not sure what that means. Load the vanilla Dev as NF4?

#

Never crossed my mind

craggy crest
sage burrow
dusky thistle
sage burrow
#

WHy would tht not be listed on the github page?

torn wharf
#

yeah he was saying that the speed increases are more random than first thought . more data needed

sterile pendant
sterile pendant
#

it should handle the rest and install the requirements for you

sage burrow
sterile pendant
#

there's more to it than that, delete the folder from your attempt and do it this way

#

like you have to manually update bitsandbytes within the comfy python venv

hallow lion
#

I have so many node conflicts theyll soon start a war amongts themselves

sage burrow
#

Has comfy manager always had that feature???!!!!!

sterile pendant
#

yeah, i always forget it's there lol

#

been there for ages too

sterile pendant
#

but otherwise, you have to do the usual git pull within the custom nodes folder, then from the python embed folder, you have to run python.exe -m pip install -r somepathhere\requirements.txt

#

venv=virtual environment

#

when working with python, almost all these apps use their own virtual environment so they dont f each other's dependencies up

#

but the comfy manager automates the installation and takes care of installing the dependencies to the comfyui venv with the install via git url option

torn wharf
#

Comfy manager is nice toolset

sterile pendant
torn wharf
#

Imagine just installing new software to the entire globe hahaha like hey everybody have a new finger today. it's an update.

bitter hearth
#

thanks for the finger I got it too

sage burrow
bitter hearth
#

now I can play a Cminor(11,13) chord with one hand

sterile pendant
bitter hearth
#

its scary yeah
that that happened

sage burrow
#

So how does one get around that? You all installed it somehow ๐Ÿ˜‰

sterile pendant
sage burrow
#

I really wanted to try the install from git feature, darnit lol... I was all raedy to use it for the last 10 custom nodes that it couldnt find lololol

sterile pendant
#

alright here's what you do, go into your comfyui custom_nodes folder
at the top of the window where the folder path is, click it to start typing in it and type cmd and press enter (opens a cmd prompt with the current folder path to save hassle)
then in the cmd prompt, run git pull https://github.com/comfyanonymous/ComfyUI_bitsandbytes_NF4.git
close the cmd prompt window
after that, go up two folder directories and you'll see python_embeded
go into that folder, type cmd in the folder path at the top again to open up a cmd window
run the command python.exe -m pip install -U bitsandbytes (it's the only thing in the requirements.txt file that it needs)

sterile pendant
sage burrow
#

I'll try going up a diff way lol

sterile pendant
#

do it the way i described using the file explorer

sage burrow
#

or I guess I should be typing dir each time?

#

I did

sterile pendant
#

if you do it the way i said, you won't need to do any cd nonsense

sage burrow
#

You prob mean go back to file explorer instead of trying to naviagate in dos or whatever it's called now

sterile pendant
#

like this, and then type cmd and press enter (ignore the folder in my image, i was doing something else, you should be in the custom nodes folder and then for the later step, the python embedded folder)

#

it will open up a cmd prompt with the current folder path for you, so you dont have to do the cd blah\blah\blah stuff

dusky thistle
sage burrow
#

got distracted after it suggested I upgrade pip, but I think I finally got it!

sterile pendant
#

awesome, and you can run the suggested command, just make sure to copy/paste it exactly, since it will handle installing it to the comfy embedded python venv

alpine summit
sage burrow
sage burrow
#

I'll re-read your instructions and try it again

mortal mesa
sterile pendant
acoustic kite
#

there is a nf4 schnell version available which is just crazy. 3 steps and upscaled 2x using 4 steps with sdxl lightning

hallow lion
#

Gracias.

acoustic kite
#

it doesn't text decently well

silk wyvern
sterile pendant
alpine summit
acoustic kite
#

i was wrong

sterile pendant
#

well schnell is always kind of slightly less reliable at text, but yeah that looks pretty good

bitter hearth
#

how low can schnell resolution go and still be okay?

acoustic kite
#

1.7 seconds

#

but idk if i would call the resolution bearable

alpine summit
sterile pendant
sterile pendant
sage burrow
#

it looks like I can run it from any directory

sterile pendant
#

yep, thats why i manually entered in the folders

torn wharf
#

the absolute path

#

god emporer rx808

sterile pendant
#

lmao, i hate cmd prompt stuff and i hate python

alpine summit
sterile pendant
#

though i can't remember if del will work without some kind of flag like -y or something, so you might have to input a Y if it asks you about deleting the folder

mortal mesa
sterile pendant
#

i know flux can supposedly handle all the way down to like 0.1 megapixels

#

and that's right around that mark

#

yeah, 115,200 pixels

mortal mesa
#

im going the other way though, just added a bunch of 2MP resolutions to a node to be able to lazy select them

alpine summit
sterile pendant
sage burrow
#

So after receiving this (reason I hate windows11) message, I opened up the command prompt, cd to downloads, typed in nf4.bat and it looked like it did all it was supposed to... However...

#

I also restarted and refreshed comfy

sterile pendant
#

maybe try deleting the node from the workflow and make sure to update comfyui. also, that smartscreen is just because it's a bat file and people can put malicious shit in them

#

it's possible the node has a different ID now or something under the hood and that workflow is trying to load an old version

sage burrow
#

Hmph, I read it before I ran it, windows should chill lol

sterile pendant
#

but the fact it's letting me do it at all is awesome with only 8gb vram

mortal mesa
#

with the NF4 node i had to force a bitsandbytes upgrade to get mine to work, seems i had an older version that didnt work, might be worth a peek

sterile pendant
#

yeah that's why i had them add the -U tag in there. the requirements file doesn't specify a version, but it needs the latest build i think

#

most comfy installs already have bnb in them if you mess around with things like llm nodes

#

ahh cool, someone did upload the dev-schnell merge as an nf4 version https://huggingface.co/silveroxides/flux1-nf4-weights/tree/main
it's the bitte-guidance version "Bitte is schnell with double blocks from dev. if name has guidance in it, it will also include guidance keys which has interesting effect." from the comments

#

it's the correct merge style like from the comfy blog post where it only swaps the double blocks

bitter hearth
#

SDXL does 512x512 nicely now

#

with res-adapter

mortal mesa
sterile pendant
#

not sure, probably some other blend of the two maybe. i know someone had done a bunch of mixes like 2:1, 10:1, 20:1 and stuff, so maybe it's one of those? dunno, would have to ask in the comments

#

well bitte-guidance works pretty well and does actually respond to changes in guidance

#

flux1-bitte-guidance-bnb-nf4, 8 steps, euler-simple, flux guidance 2/4/8:

mortal mesa
sterile pendant
#

another test with the flux1-bitte-guidance-bnb-nf4 version @ different guidances

sage burrow
#

Thanks to google I found out how to get around comfyui manager not letting me install from a github url ๐Ÿ™‚

sterile pendant
sage burrow
#

For once it wasn't windows being a control freak lol

sterile pendant
#

sweet, well i was worried the folder path might have been an issue before with the cmd prompt stuff

sage burrow
#

Flux nf4; 4 minutes w only 8gb gpu

sterile pendant
#

Like on this 8gb 2080, it's around 2.5 seconds per step

sage burrow
#

16GB RAM/GeForce RTX 4060

sterile pendant
#

ahh that's why, you're running out of system memory and having to use the pagefile from your drive

sage burrow
#

So good at txt!

sage burrow
#

What?! It has a voice to txt option?!!!!

dusky thistle
sage burrow
bitter hearth
#

I have given up on res-adapter

#

IDK if the comfy node is broken

#

I can only sometimes get it to work

#

its meant to make SDXL generate nicely at 500-1500 pixels

serene basin
#

CAR FLYING

#

CAR FLYING

bitter hearth
sage burrow
#

nf4/schnellDev

dusky thistle
rain current
mortal mesa
hollow swift
#

any help?

dusky thistle
#

home sweet home

mortal mesa
sage burrow
# hollow swift any help?

You can disable that, just be careful since there are viri in some sometimes.
Just Google the error (or scroll up here)

noble coyote
#

Flux.Schnell - prompt from a Bible passage (Ezekiel)

dusky thistle
frail shoal
#

Does NF4 work with rtx 2xxx cards ?

dusky thistle
bitter hearth
# dusky thistle

the fine lines on trees you are getting today are the best fine lines I have ever seen for AI image gen

#

maybe fine lines are a big strength of cascade arch

dusky thistle
#

i'm also using random attention guidance which i don't think anyone's done

bitter hearth
#

ah okay I see

dusky thistle
#

maybe cascade has some extra innate ability there but the base out of the box workflow, that kinda stuff looks like a mess usually

#

squiggly squishy lookng shit

noble coyote
#

8Gb VRAM RTX 2070

#

Flux.Schnell - waiting on nf4 ...

real terrace
noble coyote
#

I'm getting error with nf4 - Error(s) in loading state_dict for Flux:
size mismatch for img_in.weight: copying a param with shape torch.Size([98304, 1]) from checkpoint, the shape in current model is torch.Size([3072, 64]).

#

nf4 does not work with RTX2070 - shd use fp8 instead!

#

๐Ÿ˜ฆ

real terrace
#

I was checking the new Forge, it seems to work with that model nf4

sterile pendant
#

2.5 sec/it

noble coyote
#

I am getting this error RTX2070 and nf4

#

2.5 s/it is cool - using Schnell - I'm getting 10.33s/it - or 100 seconds/image

dull star
#

Sorry if this was posted already

#

But we got boring reality for flux!!!

hallow lion
#

Holy molly that was fast]

sterile pendant
#

oh he updated it to say "Now on the manager for easy installation. Make sure to select Channel:dev in the ComfyUI manager menu or install via git url."

alpine summit
dull star
sterile pendant
bitter hearth
sterile pendant
#

I think this is going to be my new default prompt. I love when the RNG gods give you terrible random prompts when using LLMs. Feels like winning the lottery.

bitter hearth
sterile pendant
#

from a 2080 FE 8gb

bitter hearth
#

really?

#

that's interesting

sterile pendant
#

yes

bitter hearth
#

cause the author mentioned otherwise

#

i read about that yesterday btw

sterile pendant
#

it might be that only the 2080 has the feature and maybe the 2070 and lower dont

dull star
#

isn't nf4 schnell bad?

#

dev is definitely good, but idk about schnell

bitter hearth
#

currently with flux schnell regular regular checkpoint i get max 40 seconds to generate an image

#

and if nf4 speeds that up even more that'd be exciting

bitter hearth
#

the reason im avoiding dev cause my system has 12gb vram and 32gb ram

#

also tested a dev model that takes about 2 min to generate ... not worth it

sterile pendant
#

they use the same vram and ram, just one does shit in 4 steps, the other needs many more

#

think of it like a regular sdxl model vs its lightning variant

bitter hearth
#

the resource use between those are vastly different

sterile pendant
#

no, they arent. you must not have the same fp versions of them. if they are both fp8, they both use the same resources

#

now you might have gotten one that was packed with the fp16 t5 encoder and maybe the other had an fp8 t5 encoder

bitter hearth
#

so you are saying fp8 version of dev and schnell both uses the same resources?

sterile pendant
#

yes, as long as the text encoders packed with them are the same precisions

bitter hearth
#

im using single checkpoint

sterile pendant
#

and the dev/schnell transformer can be fp16, fp8 or now nf4

#

assuming you're using something like dev fp8 + t5 fp8 + clipL + vae and schell fp8 + t5 fp8 + clipL + vae, they will use identical resources

bitter hearth
#

yes but when i ran dev model that took me much longer with 20 steps ... i mean 20 steps is not alot but that felt pretty long

#

so im guessing the dev model was straining my system

sterile pendant
#

oh right, yeah, one is a 4 step model and the other is meant more for like 20-40 steps

#

so for instance, i'm getting ~2.5 seconds / iteration on my pc. this means 4 steps takes 10 seconds and 40 steps would take 100 seconds

bitter hearth
#

100 seconds for one image is kinda long too

#

im not unhappy with schnell

sterile pendant
#

not compared to what it was before lol

noble coyote
sterile pendant
#

shit would take like 7 minutes

bitter hearth
sterile pendant
#

so a minute and a half is nothing for a high quality image

bitter hearth
sterile pendant
#

2.5 seconds per iteration. fp8 was like 8 seconds per iteration

#

and the quality is still great usually

bitter hearth
#

ohh, thought you meant that was the regular dev lol

noble coyote
#

Dev at 20 iterations = 40 minutes on my RTX2070 8Gb

bitter hearth
#

yeah so 100 seconds for me would be too long for my patience

sterile pendant
#

so it works fine for me

bitter hearth
#

lol i noticed i can't play youtube on my 2nd montior while running flux

#

or that lags my system

noble coyote
#

Schnell is 100 seconds/image

bitter hearth
sterile pendant
# bitter hearth or that lags my system

in the task manager, you can go to details, locate the python.exe in the list, right click, set afinity, and uncheck one of the first in the list. this will free up a thread to do other shit

bitter hearth
sterile pendant
#

disable hardware acceleration in your browser

bitter hearth
sterile pendant
#

since it's not used as much for diffusion

bitter hearth
#

i guess

sterile pendant
#

oh it is definitely the solution in your case

#

assuming your CPU isn't super old

bitter hearth
#

it is kinda old but not too old

#

its ryzen 5 seriess

sterile pendant
#

oh that's plenty enough

bitter hearth
#

Tried prompting only for the word, didn't work thomas

bitter hearth
noble coyote
#

NF4 Loader - I take it that the checkpoint must now go under checkpoints and not unets? Also, where do I now input weight_dtype?

noble coyote
#

And the dtype?

bitter hearth
#

dont use it

noble coyote
#

OK, thanks

bitter hearth
#

you dont need encoders with single checkpoint

#

all the encoders, vae, and clips are included in that checkpoint

bitter hearth
noble coyote
#

I have dev_bnb_fn4 loaded

bitter hearth
hallow lion
#

Now that Flux is out, when is the Capacitor model coming?

noble coyote
#

But its erroring

bitter hearth
noble coyote
#

This error with dev_fp8 Error occurred when executing SamplerCustomAdvanced:

4-bit quantization data type None is not implemented.

hallow lion
prisma rampart
#

testing flux lora training on a 4090, testing on 1/3 of my giger dataset since its not a style it knows.

#

still training, no vram for photoshop so shitty comparison straight from the trainer, progress from same seed "woman wearing biomechanical armor"

bitter hearth
hallow lion
#

I will rock flux Dev 80 at 512/512 ๐Ÿ˜„

noble coyote
#

I'm perhaps one RTX 2XXXX owner whose GPU will not work with nf4 ๐Ÿ˜ฆ

hallow lion
#

takes me 3 minutes for a 1024/1024 on flux dev 8fp

#

๐Ÿ˜„

bitter hearth
#

waow waow colors

bitter hearth
sterile pendant
#

schnell 4 step with the nf4 version is 10 seconds per image, i'm testing it out right now