When I try to run the FLUX1-dev-bnb-nf4 model with the new bitsandbytes_NF4 node, I get the following OOM error:
ComfyUI execution error: Allocation on device 0 would exceed allowed memory. (out of memory)
Currently allocated : 3.46 GiB
Requested : 13.50 MiB
Device limit : 4.00 GiB
Free (according to CUDA): 0 bytes
PyTorch limit (set by user-supplied memory fraction)
: 17179869184.00 GiB
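As a quick sanity check on those numbers, here is a rough sketch of the arithmetic in Python. The values are copied from the message above and the variable names are my own. It only shows that the allocated amount plus the request is still below the 4 GiB device limit, so the rest of the card must be held by something not counted under "Currently allocated". My assumption is that this is the CUDA context, cached blocks, or other processes such as the display, but I have not verified that.

```python
# Values copied by hand from the error message; names are illustrative only.
GIB = 1024 ** 3
MIB = 1024 ** 2

allocated = 3.46 * GIB      # "Currently allocated"
requested = 13.50 * MIB     # "Requested"
device_limit = 4.00 * GIB   # "Device limit"

# What the allocation would need in total if it succeeded.
needed = allocated + requested

# Memory on the card that is not counted as "allocated" in the message.
unaccounted = device_limit - allocated

print(f"allocated + requested        = {needed / GIB:.2f} GiB")
print(f"device limit minus allocated = {unaccounted / MIB:.0f} MiB")
print(f"request fits under the limit on paper: {needed < device_limit}")
```

So on paper the 13.50 MiB request should fit, yet CUDA reports 0 bytes free, which suggests the remaining memory is taken up outside of what the message counts as allocated.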
I can run the same model without issues in ForgeUI, and I also had no problems with the FP8 version in Swarm (aside from painfully slow generation times). As far as I can tell, the issue lies with the bitsandbytes_NF4 node itself.
I have also posted this in an existing open issue on the node's GitHub. For now, I think we just have to wait for an update or an alternative node.
If anyone has found an alternative solution, I would appreciate the information!