#HuggingFace ?
65 messages · Page 1 of 1 (latest)
You just click download on huggingface website and save the model in the models folder, simple as that
Hi Berni, thanks for answering. if there's like 163 models for example. will it compile them together ?
cause it says 1 of 163 - so I'd have to manually download all 163 and then stick them in the folder ?
Unfortunately not. Usually someone else repacks those annoying models into a single safetensor file. There is a official ComfyUI huggingface page that has a lot of these repacked models
Okay , thats what I thought. I did see some people using Qwen but looking at the list there has to be a compiled one somewhere lol
Yeah you have to google around a bit to find them, huggingfaces search box works pretty well.
Otherwise someone usually also uploads it to Civit (so that people can generate using their cloud generation)
https://civitai.com/models/1981702/qwen-edit-2509-new-version
Thank you Berni ! Appreciate you taking the t ime
Anytime. Hope you have a beefy GPU tho as Qwen is a pretty chonky model
I think its ok for now rtx 5070 ti and 128gigs of ram , not sure what it requires but we'll see !
16GB of VRAM on there is pretty good, won't fit the 8bit quant of it but something like a 5 bit quant should fit, not best for performance but it will run
thank you for the advice , ill do trial and error , still learning all this. its been fun though. I'd love to try the video makers. Any suggestions?
Wan 2.2
Tho you will need a pretty well optimised workflow to fit it into VRAM ( Qwen and Wan are the hardest to run, your card will easily handle anythyng older with great performance tho like Flux and back)
I dont think i can run flux its why i went with Stable diffusion
the 5k rtx series dont work with flux?
Flux 1 Dev in fp8 format is 11GB so it should fit no problems
excuse my lack of knowledge , whats fp8 ?
i thought it was do the pytorch and cuda versions that wouldnt work
8bit floating point format
Back when models were smaller we didn't care as much how much memory they take up so they mostly used FP16 back then. This means each parameter takes 16 bits to store (also known as 2 bytes)
AI Image models supported by SwarmUI are documented on the Model Support doc, and video models on the Video Model Support doc. These docs should be the first place to check when you're unsure how to use a model.
there's links for everything in the model docs
oh okay i understand so as long as the model is flux fp8 it wont turn into a blurry mess
8bit quant would be weird to ever use. fp8 is normally used (fp8 is not quantized)
(some people do use gguf q8 but it's a really odd choice to ever make)
Flux is a bit anoying to work with in general, it doesn't like certain settings and makes a blurry mess. Also needs a lot of steps to get a sharp result (hence slow) so you might want to look into some of the Lightning LoRAs to let you use much less steps
well I like SD more so far
flux seems to always make the same character faces no matter what or the same animal look. not sure if you know what i mean
Yeah SDXL is our workhorse right now. It makes amazing results every time
its like copy paste but different colors lol
Yeah the base Flux model is not very good, you want to be using some of the comunity made finetunes of it that are better trained for the particular style you want to generate. Main benifit of Flux is that it is a smarter model that can handle more complex scenes. It won't be any prettier than a good SDXL finetune model
if you use only SDXL , can you generate difference faces without downloading community models? example trying to make a couple , the guy and the girl will always look the same unless i use a community model even if you describe the faces down to a pixel
That's part of learning how to best use AI tools.
They say it takes no skill to write a prompt and make an image... true, but the result is just as uninspiring. It takes skill to generate nice results.
I'll keep reading guides on how to make better prompts then !
Expect to spend many hours experimenting with various ways of using AI tools to learn the craft, watching what others do with AI to pick up ideas etc...
Heck even prompting strategies can change hugely as you change between different models. Just don't be discuraged by this not being as easy as it looked at first, everyone has to start somewhere. Also it is not always about generating the perfect result in 1 shot, sometimes you might run the image trough multiple AI tools before getting to the final result
It's already been a steep learning curve. I'll keep poking at it everyday !
I heard Chroma was really good for prompts
But to be fair. for face swapping, QwenEdit works damn well (No native support for that in Swarm yet, so you need to dive into the ComfyUI noodles, but you can turn those noodles into a custom SwarmUI workflow)
I'm still rather timid about the workflows for comfy ui. I understand it's purpose but modifying and changing it I wouldn't know where to start
Yeah they are for advanced users. Best to first get familiar with the basics (prompting, collecting good models, LoRAs..etc) then onto the intermediate (ControlNets, regional prompts, segmentation prompts, wildcards, refiners..etc) and then finnaly drive into the deep end (like wrangling latest novel models in ComfyUI)
Civitai.com is the most valuable resource you have in the world of AI image generation. Not only is that where you get all the best models, but you can also see what other more experienced people are doing with those models so you can follow their methods
only one ive been able to use , huggingface is rather a mess for me and even tried getting it from ollama
but it was like how many times can I shoot myself in the foot to get 1 model
Huggingface is good for LLM models and folowing the latest state of the art stuff (where you need to know what you are doing)
with Qwen , is there a different model I should use than fp16 ?
Go register an account on Civit, you will find a lot of use out of it. Not only does an account let you see all the NSFW content, but it lets you bookmark and track your favorite models and if you like you can also post your AI creations on there to show them off to the world
You will not be able to run Qwen at FP16, that would need very expensive server GPU
As i said best to stick to SDXL based models for a start. It just works, most of the people making AI art use it as it can produce some very nice looking results
sounds good! I thought I'd give it a try for fun lol and yeah Civitai only thing I understand and know
Berni , once again! thank you for your time and patience. Much appreciated ! Off to further adventures with more knowledge at hand this time 🙂
Thanks for asking the question @sacred latch . As another new user to swarm (regular civit user), when something says get the model from huggingface, I go there and then just stare at all the files and folders with no idea what to do. Reading this thread was helpful.
For Civit users trying to run the latest tech the easy shortcut is so instead set the Civit search filter to only show "Workflows" this lets you find a workflow for the model you want to run. Offten the workflow description or a comment inside the Comfy workflow will tell you where to get models needed to actually run that workflow (potentialy also have a separate ComfyUI instance for testing them, especialy if you like to just blindly click "Install missing nodes" in the Comfy Manager)
only if you want to try out tech that is both (A) brand new and (B) non-standard. For example the new audio models.
For anything that's relatively more normal, eg any regular image or video model, Swarm works great
QwenEdit works great and super easily in Swarm
AI Image models supported by SwarmUI are documented on the Model Support doc, and video models on the Video Model Support doc. These docs should be the first place to check when you're unsure how to use a model.
Is there already native QwenEdit support in Swarm? (not sure how to feed it multiple input images trough the generate tab, or is it just single image from Init Image)
I made myself a custom Swarm workflow for QwenEdit and i like it a lot. (Tho with these more advanced models like editing and video it usually makes more sense to use comfy noodles anyway,)
Excellent summary table for models btw! 👍
Really gives you a good picture of what is out there ( I need to give HiDream and Hunyan a try, they didn't get the hyped up online so i never looked at them)
not sure how to feed it multiple input images trough the generate tab again, refer to the linked docs
multiple image support that was added in the 2509 model was even demonstrated in the announcement post #announcements message
HiDream is kinda irrelevant due to qwen and hunyuan outclassing it at a similar scale