lighter note... new flux lora looks like a huge success https://civitai.com/models/655633/the-beavis-and-butthead-loogie-lora
#🆕|sd3
1 messages · Page 89 of 1
Good song
That's actually good! Flux surprises me more and more each day lol
my forge produces nothing but grey boxes
think I like comfy better
forge did a lot of ground breaking work recently but i don't think it was ready for an actual release in this state.
my sd 1.5 turn out just fine, as you guessed I was trying with flux
we're in the #flux channel so i just assumed
😄
so has anyone used dreambooth with Flux or SD3? They aren't showing up in the dropdown list
even though I have refreshed and restarted plenty
dreambooth belongs to who?
Google says Google.... 😦
we had such discussions, but I haven't said that unets are transformers, I said that the sdxl unet consists of transformers
My NF4 Flux Weights are in Checkpoint folder
I clarified this because people often claim that sdxl has a convolutional unet architecture while pixart and flux have transformer architectures. But sdxl is also a transformer architecture. These terms are not exclusive.
because comfyui is a hacked piece of code. Many namings are off. The t5 text encoder for example is a subclass of CLIPModel which doesn't make much sense
they implemented comfy for sd 1.5 and hacked support for other models in there later - often without changing names
T5 loading into clip seems weird af too
the code is extremely messy - but to be fair: this technology is evolving super fast. Often there is just no time to write clean code if you want to support all new stuff as early as possible
I think it's in business leader's best interests to keep the technical terminology obtuse and innacurate. Making sure the community is not a well informed population seems like a Machiavellian approach. it's hard not to believe it's intentional.
Flux.Dev and Flux.Schnell are in my unet folder; Flux.NF4 is in my checkpoint folder
yeah but at the same time comfy dev / anon publically calls out other devs for having sloppy messy code too
or: it's a software project of a single guy who is too busy for Machiavellian approaches as he is too busy refactoring his code base
it's a business with investors now
yes, I agree and I found that always very strange behavior
flux sits nicely in my forge where every other checkpoint goes, models\stablediffusion... But the T5xxl goes in the Vae folder lol bizarre https://tensor.art/models/759856135286068673/FLUX-HYPER-TRAINED-DREAM-DIFFUSION-BY-DICE-V-1
t5 in vae?! this is getting preposterous.
Perhaps controlnet would work to bring elements into flush which aren't part of the original training?
so much terminology confusion. so many avoidable situations. hard to beleive it's just because they haven't gotten around to a refactor
running this now but t5 is in clip
4 bit quantisation data not implemented - oops
But that input is for selecting VAE, text encoder,... it's not that the T5 is going to the VAE folder....
With what data has this 'HYPER TRAINED' been trained?
more info here and more images to view the difference , Hope you all like it https://www.shakker.ai/modelinfo/bbe99aa7e86540a082f1e18b3af9131c?from=personal_page
Our hub provides members with exclusive access to an elite selection of AI image generation models, designed to produce superior quality images that stand out in any creative project
so why would google care if they support sd3 or flux with it?
Prompt = Pope Francis by jay ryder, in the style of surreal creatures, punctuated caricature, uhd image, humorous tableau, necronomicon illustrations, loose gestures, sparse
Prompt = I feel beginnings Sudden trembling Of silence Ordinary Over the hills The wind Sighs Deep inside itself Breathe deeply The world Waits
Prompt = john mancuso's mummy (powder) In the style of cyber punk surrealism, baroque portraiture, mirror, made of insects, kinetic art, unique framing and composition, trick of the eye paintings
Prompt = that girl in the kitchen is going crazy, in the style of humorous animal scenes, colorized, chrome-plated, caninecore
Is there a png with workflow available for this Checkpoint at all?
yes the links are in the model discription bro
I got the w/f 😄
dream <> fp8 <> guff.... I don't see any differences with the dev-fp8
Stormy
Comfy Workflow : https://openart.ai/workflows/maitruclam/comfyui-workflow-for-flux-simple/iuRdGnfzmTbOOzONIiVV
Vae : https://huggingface.co/black-forest-labs/FLUX.1-schnell/tree/main/vae
Clip download clip_l.safetensors and t5xxl_fp8_e4m3fn.safetensors from :
https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main
OpenArt
Created by: Lâm: It is a simple workflow of Flux AI on ComfyUI. EZ way, kust download this one and run like another checkpoint ;) https://civitai.com/models/628682/flux-1-checkpoint-easy-to-use Check out more detailed instructions here: https://maitruclam.com/flux-ai-la-gi/ Just 20GB and no more download alot of thing. it was a bug when i tried ...
work flow , vae, clip l, and your T5xxxl
just take those from the hugging face dont down load the lot
But is there any explanation for why the results are identical to dev-fp8? Or am I not understanding what this model is supposed to do....
what sort of machine is it being run on?
I don't understand the question
sorry bro im back, yes its the dev base i trained. Try the model with font and text the main training was needed there
Oh because that was just the go to folder for the image generation portion of the models where they didn't include text encoders or the vae.
Within comfyui, he actually renamed the load unet node to diffusion model loader or something like that. Wouldn't surprise me if he adds a new folder named it as well, but also allows comfyui to also check the legacy unet folder to prevent it from breaking old workflows and whatnot
Made it!!
10 minutes at 20 steps
8Gb VRAM, 64Gb RAM
This Flux Dream Diffusion seems to get the text right 90%+ of the time! When I use NF4, it gets 70% nearly right; and 10% exactly right!
Yes, it's true that unets are convolutional networks. Just because they have the some hacked in transformer operations does not make them transformer based architectures. The actual unet itself is not a transformer, with regards to the ML definitions of things. Dit models are though.
Pakistan?
Ouch!
He is getting cooked not cool im afraid in that weather
Lol, ac works good in the house
Was my truck temp earlier so it's always a bit warmer in it
Texas? Arizona?
I knew Admiral William Crowe (Bill) from Oklahoma
Joint Chairman of the Chiefs of Staff under Bill Clinton
UK Ambassador 1995
Think he was a senator years ago
He also appeared in an episode of Cheers (he was an actor before enlisting)
Born and raised here, just a big bearded redneck 😂
Who loves beer and bonfires
Cheers 😂
laughs in DND
flux training is slow if you do a 2k dataset (like over 24h) - but so damn worth it
even freaking kenkus are working 😄
Fix loras having a weak effect when applied on fp8. comfyanonymous committed 27 minutes ago
shit 🤣 now my lora looks completely different
just updated
*angry shakes fist
One not too bad one from my lora, though still missing very important aspects lol
Is this the long tube in aliens where he has to go do the thing?
So flux dev doesn't know about the tesla cybertruck. it makes weird tesla model s looking trucks. cool but not what elon was showing off x premium got flux as the backend. So flux pro is running a newer model already
That tunnel was even narrower. Oh speaking of that the new Alien movie it total meh
hmmmm
what was the prompt for that?
cause damn thats cool
"a horror painting of a teen girl wearing pajamas in a dark hallway at night. The girl is running toward the camera screaming. She has a look of fear on her face. Her pajamas are torn and tattered. She is bleeding from her arm. A hideous glowing tentacle monster zombie is behind her chasing her. The monster has a huge mouth with big teeth and is smiling. by H.P. Lovecraft"
according to their website. flux.dev is a distilled version of pro (unless I misread that) (and schnell an even further distilled one)
right. pro couldn't do cybertrucks at launch either. grok2 is using flux as it's image back end though and it knows the cybertruck. they must've either made them a custom model or updated the pro version
pro couldn't do cybertrucks at launch either
oh! O:
damn
if they're actually training new versions then thats impressive. cause that thing is expensive to train
s
they also could just refine their pro base
a more "vhs" looking one
xai has something like 8billion in funding secured so they can afford it
oh yeah. twitter/X is definitely using a custom model, as it includes all of the filtered content. like it can do all celebrities that are definitely not included by default
i was doing 70's tv show and gave it a graphic text overlay, and it made the graphic way past the screen margin, just like the good ol days. it has a nice retro aesthetic that's more proper
but they probably just ran a large enough finetune over it, with all (sfw) images ever posted on X 🤣
it just can't do asciii art, which kind of sucks, I want to do more 80s computers
need to make an ascii art lora. I wonder if Pro can do ascii art
SD3 at least could do that correctly. It sucked at most other things tho 😄
🤣 tried it with my lora
i know the perfect place for a dataset. ice.org
Any idea why flux loras on sd forge don´t appear?
Don't think they're supported yey
they are (with fp8 flux at least)
in forge or comfyui?
forge
Very questionable
At least they look better than this
wait until controlnet comes around
Not SD3 nor Flux but still want to show off my latest work. America's favorite blonde as dark magician girl
Also made the yu-gi-oh collectable card version
I'm basically gonna turn all the classic-era yu-gi-oh characters into photoreal. But I might wait until Flux finetunes + cnet come around since this took FOREVER with 1.5
didn't slim shady return with frosted tips an all and killed her chart numbers for a bit? Think ol slim is more fave.. but i guess he ded now
apples and oranges my friend
Unlimited power will only come after perfect consistency and the abiliy to move that perfect consistency perfectly consistently (at least for 5-10 seconds) comes. This si when Hollywood will fall.
Right ow it's a joke. Pile on as many controlnet sor IP adapters as you want consistency is not there.
It has to be down to the last mole and hair
Says here that Madonna holds the most #1 hits of any recording artist, an she's blonde too.
lmao, imgsys.org added the three flux versions (i've seen some pro ones on there, but it's not on the leaderboards yet. probably needs more votes to show) and already, they are the top by a landslide finally dethroning realvisxl 4.0
taylor swift has the highest monthly listener count on spotify at 100M+
in fact the only one to have reached 100M
pretty safe bet she's America's favorite. Sure as hell is mine
the skin detail is also not consistent. patience is the name of the game my friend. by end of year we're gonna have a killer finetune I'm sure
Yeah it won;t be long for sure
she's a natural brunette
We used to say in 10 years when we talk about a new advancement in graphics. With AI its mor elike in 6 months. XD
Anyone remember Silicon Graphics International? Those were the days 😛
so was marilyn
made the units that rareware used for DKC fkyeh i remember sgi
https://youtu.be/mMLHHxV0vWU peak sgi
Killer Instinct Arcade Introduction
Acquisition via MameUI32
Heli is a bit small...
why does it keep getting the heli scale wrong lol
i guess SGI stations were also on production of Jurassic Park and Starship Troopers, which imo still hold up. So those maybe peak
maybe the scale is right but she's an amazonian
Yes, the scale of things is the next big thing to tackle for the AI models. Even SORA messed this up big time in those early promo videos.
Because it's a... Little bird
[flux lora experiment]
Just went thourgh 50 imgs on the imgsys rankings... it isn't even close flux demolishes.
Well dooooooooh.
I just put up my 2nd Flux lora on civitai. It is better in the showing what it's supposed to way, but still sucks lol
Well either way thanks for developeing and furtherign the tehcnology for all of us.
I can't wait until all the training data is released for flux 🙂
SD3 also 😄
you assume there won't be something before before that
link?
Unfortunately I like to make images that are more than just portraits of hollow people looking at the camera ....
That's awesome... you put out fantastic images!
Bottom right kind of looks like my wife's Uncle
i just wung an image through ultimate upscaler in forge. it was easy. this was just 0.55 denoise over 40 steps and megapixel squares
think i'm about done with the subreddit community. i saw a post saying flux has no creative styles at all and can't do img2img work. i told them that i dont know i think that's maybe wrong , but they insisted and then reported me for harassment. reinstating my account was a mistake. it's like 4chan there now. a place where misinformation goes to thrive.
looking at this upscale i'm just reflecting on that from earlier, "can't do img2img" comments are baffling me
i dont think it's all 14 year olds. there are lots like me still hanging around there
pick any current social trend or hate train and reddit will be right on board. It's pathetic, they don't think for themselves
lurking in the shadows
sausage party! the new sequal series was dynomite
i can't really spot any seams on that upscale i did. remarkable.
less denoise should maintain more consistency too
It does the best img2img ever!
Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.
If you have any questions, feel free to ask us!
Your dashboard
Help
Support server
Other languages
en: help
ja: help Japanese
Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.
If you have any questions, feel free to ask us!
Your dashboard
Help
Support server
Other languages
en: help
ja: help Japanese
Flux
is 847 isnt it?
lol
Pope Francis says Good Morning! Flux@nf4
... trying to find the custom node which is 'breaking' ComfyUI since the new frontend ... ComfyUI works, but like swimming thru treacle!!!
Found it - and disabled - efficiency-nodes
I love the new front end
add node menu is less buggy for me
and can easily filter for node packs
managed to squeeze a bit more image quality out of SDXL
with SEG node
Flux@nf4
looks like the nf4 is not bad
Flux@nf4
ugly
Successfully ugly? 😄
ugly ugly
basic flux dev?
Enough woman face prompt
We all know these models can do that well
Time to game some bit 
Flux@nf4
Slowly trawling thru custom nodes - disabling three at a time - to stop ComfyUI "working like its swimming thru treacle!"
Flux@nf4
https://fluxpro.art/create Flux Pro Free? 😮
Yes, exclusively Flux.1 Dev for me... I get 20secs generations using simple scheduler. Beta scheduler can vary between 20 to 60 seconds per generation. Worth it for the extra quality
dpmpp_2s_ancestral now works with flux/sd3/aura:
https://github.com/comfyanonymous/ComfyUI/commit/2622c55aff9433d425a62e5f6c379cf22a42139e
just note that since it's a 2s sampler, it will take twice as long per step. but hell yeah, finally an ancestral sampler
For those like me who are really dumb, where would an ancestral sampler be better than a non ancestral one?
well it's a really complicated topic that doesn't really have any one-size-fits-all answer. the big key thing is that it opens the doors for a new set of options
Thanks... all I know about ancestrals is that they add noise back between steps? or something like that?
it's the in the name, the whole ancestral part. basically, yeah, they keep track of prior steps
You thought I was [that] smart? pfft... I thought ancestral meant they were older... LOLOL
lol...
meaning older than modern ones 😄
they were discovered by decoding hieroglyphics
👆
insert "i should call her" memes
city96 also added in k-quants for the ggufs now. so you can use quants like q4_k_s now, which is better than both q4_0 and q4_1 and only a hair bigger than q4_0
Hmm...
Mia Khalifa serving food in a restaurant wearing a protective revealing leather outfit
Aquascape
these are all amazing, I guess its flux dev ?
Thanks... yes, all Flux.1 Dev 👊
Why do you insult other people's work?
Oh rly?
Will check it out
It had been asked about in the community chat at Hugging
The question is at what price
yeah its a deal with the devil at that point
My testing had shown 5_0 to produce the best balance, and 4_1 to be the best balance (diffs are nitpicks)
okay thanks
haven't tried flux yet I'm just taking notes about it
The biggest strength of the GGUF is that it did not drop performance speed like a hammer from the sky if you exceeded the perfect 8GB limit
So you could produce images at say 1280 x 1280 and it would not become an instant turtle. NF4 is great but has that issue for 8GB cards. 1k is fast and 1280 is s snail
The fine details are really were you can see the effects of greater and greater quantizaton. I have an image and settings that helped me pinpoint this
ah yeah that's an issue because fine details are my main focus
this is why I always multiply my sigmas by 0.8
I literally have no idea what that would do or how to enforce that with Flux
New ComfyUI commit 7333216
YFG prolific and always hi-quality output!
sample_dpmpp_2s_ancestral_RF
we can get ancestral samplers for flow models now?
My 8Gb RTX 2070 has always worked well with NF4 - but maybe I am the exception?
Dunno
that HAS to take a while to render though like what, 3 minutes
render time depends on hardware
he literally said 8GB RTX 2070
ah ok didn't see
90 seconds is a typical 1024x1024 render - which I am happy with 🙂
90 seconds is fine yeah
Flux.Dev however - when I use LoRAs - can be upwards of 25 minutes
So Flux.Dev is only experimental with me
Not sure that Flux LoRAs are really necessary - prompting can almost produce the same results
current loras are probably pretty small datasets and fast training
compared to what may come later
Thanks for the kind words... you guys (and girls) are of course the biggest inspiration. I plagiarize often 😛
raiding the midjourney discord for IP adapter or control net inputs is a fun thing
... oh the prompts I've "borrowed" over the years ... ! 😄
I am NOT NEARLY as creative as the best around here... so yeah, I "borrow"
in my experience Florence2 can steal prompts pretty effectively
Does this have a Florence w/f embedded at all?
there's a node but I forgot its name
Perhaps you missed what I said specifically. It is fine if I do 1024 x 1024. but if I try larger image sizes like 1376 x 1376, it exceeds my VRAM and shifts to snail mode.
Thanks, its good to know!
If you use the gguf versions, you can even do 2048x2048 images without ooming and the time scales pretty linearly. Like if you got 4 sec/it at 1024^2, it will be like 18 sec/it at 2048^2. I have an 2080 8gb GPU btw
this is a lora just trained. Please try reproducing because I'm curious about the prompt
I should have added that the LoRAs I've been using can be done via prompting i.e. landscape, pop-art, art-nouveau. This looks is very sepcific and is probably better served via a LoRA
A regal figure adorned with intricate jewelry and a glowing crown sits on a throne amidst a futuristic cityscape. The scene is dominated by blue and golden hues, creating a dramatic contrast and highlighting the detailed architecture.
oh and i forgot to mention in my last post to you that i'm using the q8 gguf, not even the smaller ones since it only makes less than a one second/it difference between q4 and q8 for me
is gguf better/faster than nf4
yes, it's better. not sure about faster though, but it should be roughly equivalent
and it doesn't plague you with oom issues or spilling into the sysmem
even comfyanon recommends using it now and likely won't bother further updates for the nf4 format addon
from comfyanon's repo for the nf4 loader node "NOTE: This is very likely Deprecated in favor of GGUF which seems to give better results:"
Q8
yes, you need the node of his as well
i recommend testing the q8 and q4_k_s models to see if there's a massive speed difference between them, but for me at least, there isn't
like i'll get 3.3s/it with the q4 and 4s/it with the q8. q8 is basically 99.9% as accurate as the full fp16 model
and again, that's with 8gb vram and 32gb sysmem
Well, for me on the first run Q_5 K is very slow
that's normal, it has to load the model
it doesn't do it until it hits the ksampler step
for some odd reason
NO, I mean the plain rendering is crazy slow
what card?
With Comy and only 12GB of VRAM (64GB RAM) I get 2.6s/it with 4070 plain
Laptop 4060
have you updated the nodes today? i saw he put in a bunch of updates last night
I did actually since the first time it actually crashed without the update
i know he said something about the k quants potentially being slower "Most of them are coherent, though the speed may be slower compared to the legacy quants (for now)."
but for me, they haven't been that different. maybe try the 4ks version instead. i'd have to check the code to see if the version giving your problems has some extra expensive math in it
Yeah... we are talking plain Dev speeds practically
for Q_5 K, but I restarted
it was at 35s/it
Dev is just too big for us mere mortals
could be that annoying windows memory management issue as well, or if you don't have 32gb sysmem, it's doing a bunch of pagefile shuffling that's limited by your drive speed
and you're using the fp8 version of the T5, right?
I like this one. 🙂
Regardless, the normal non-K GGUFs did not have this issue at any of the sizes
Which LoRA?
trained it today
I'm not the only one waiting for @hexed dirge to get back on his publicly-available-LoRA-horse once more! His LoRAs are a cut above!
Ethereal Grace
Anyhow, a restart seems to have normalized the speed to 12s/it
Which is a bit slower than Q_5 plain, but not dreadully so. Maybe 25% slower
I will check image and then try Q_4 K
I have a plain Dev to compare to needless to say
Reading this prompt is seems promising: "In an abstract realm of dense, intricate patterns, vibrant colors twist and tangle, forming an intricate maze of chaotic beauty. Each thread of color and texture weaves through the next, creating a pulsating rhythm of energy and emotion. The tangled lines blur the boundaries between order and chaos, a surreal dance of endless possibilities."
but try it in sd3, it almost gives noise
Recent commits have jarred my ComfyUI workflows - I had a nice setup 2 hours ago at 3s/it - and then i updated!!! 12s/it
It's the one I use (dev-Q8_0). I've done quite a few tests and comparisons, and it's the one that works best for me. Plus, it works very well with Loras.
Combing thru custom nodes and disabling - its like using a nit-comb on your kiddie's hair!!! 🙂
I thought the quantized models were incompatible with LoRAs
they are compatible I believe but its just that most things didnt support it
... just d/loading Q8-GGUF - its on a very slow (or very busy) - link
Poetry
gguf q8 - 24steps
A pretty nice thing I found is just flux schnell seems to perform better at 512x512 res instead of 1024. Text on images is flawless and prompt following is slightly better as well. Same seed, same steps, 512 is left and 1024 is right
Prompt: "A monkey holding a sign that says "Well it seems like flux is better at 512 resolution then 1024?" and on the top right it says "Here is more text to prove that.""
Does GGUF go into Checkpoint or Unet?
unet or the new diffusion_models folder
guess comfyanon heard my words about the folder needing to be renamed yesterday
... don't have that folder ... yet
Ok, so the plain rendering of Q5 K took 6 mins. By means of comparison, NF4 or Q4_0 took roughly 2.5 mins. But plain Q5 was no speed demon so this is not unexpected. First analysis shows Q5 K to be 99.5% identical to pure Dev. You need a microscope to see the diff
update comfy and run it to make the folder
"Enable non blocking transfers in lowvram mode." oooh that might solve some issues https://github.com/comfyanonymous/ComfyUI/commit/73332160c8c9843876680fb04f037793c73d55b6
I updated to the new commit about 90 minutes ago - it broke my w/f - so I did a hard reset to the previous commit
git reset --hard 2622c55
So no diffuser_models folder ...
well how did it break your workflow?
Went from 3s/it to 12s/it
I've been having node clashes - been disabling over 100 of them - seems like IPAdapter-ComfyUI and efficiency-nodes-comfy are the culprits
The minute I disabled these two - the speed went back to normal
yeah that's probably the main problem then. well for now, you can test by saving your current workflow as a backup. then update comfy and make a blank basic workflow to test
But the recent commit knocked the speed back down
a lot of addons use monkeypatches, so they might f with the rest of the systems under the hood, if they aren't up to date
OK, Q8-GUFF had d/loaded ... pip install --upgrade gguf
at least on thing sd3 is a little better at than flux, grainy photo stuff
you installed the city96 gguf addon with the manager, right?
This lens flare is very realistic
No - via git clone
ahh, you're installing the requirements.txt then
and when you load the model, the node will be something like gguf, just search for that. everything else would be the same as the nf4 workflow, just replace that one node
you'll still use the same dualclip loader for the clipL and t5
and VAE
I'm using ae.sft
is the node
How did you revert to a previous Comfy?
git reset --hard 2622c55
That reverts?
Indeed it does
anyplace specific to do that?
🤩
nice to hear that you make Loras again!
thanks
ok, I asked because my main folder has three folders: Update, the Python and ComfyUI
OK, you have portable then
yes
I use system ComfyUI
I don't use the portable stuff, but that was the starting point
My 'reply' was:
C:\Users\silve\Applications\ComfyUI_SD3-Flux\ComfyUI>git reset --hard 2622c55
HEAD is now at 2622c55 Automatically use RF variant of dpmpp_2s_ancestral if RF model.
Sound right?
Yes its exact
it does say this was released today you know
Loading: ComfyUI-Manager (V2.48.6)
ComfyUI Revision: 2563 [2622c55a] | Released on '2024-08-18'
What do I do for a VAE now I am using the GGUF UnetLoader?
Use the standard Flux VAE
is there an fp16 dev that works with forge?
Sourced from where?
Flux
The GGUF UnetLoader has no VAE out ...
The one hiving off the checkpoint loader?
flux vae should be a separate file and you load it with Load VAE
Does anyone have a simple flux dev (not ns4) wprkflow which uses a lora?
I am getting 7.5 s/it - still its probably node conflicts - even after a back-commit
NF4 was 3s/it
What was the commit for yesterday?
2622c55
It was updated about 2 hours ago - sorry, 2622c55 was updated about 10 hours ago ...
WHich is why I asked about yesterday
And a new 7333216 commit is in place
Don't know about yesterday's commits - mebbe there's a list somewhere?
Q8-GGUF
5 minutes
Which is about 3.5 minutes slower than nf4
I heard that with gguf it can be possible to run multi-gpu, is that true?
I really think that flux upscaling kills the need for 4xultrasharp style upscalers
those are fast but the quality aspect lags
if you only have 8gb of vram, try using the q4 gguf. it should all fit into memory then
the speed differences between nf4 and q8 are probably due to your system's current state. maybe you have more browser tabs open this time.
I'm getting advisory warnings ... \ComfyUI\custom_nodes\ComfyUI-GGUF\dequant.py:8: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.clone().detach() or sourceTensor.clone().detach().requires_grad_(True), rather than torch.tensor(sourceTensor)
???
tha'ts more for the developer to see
OK
something pytorch changed and wants devs to know about
Q8-GGUF
That would be neat as then you could upscale and refine with sdxl or sd15 faster. 😮
no, it's complicated. prompt was something like:
{day|night} time, {flat colors|realistic lighting}, {front|back} view, (Perched on a sunlit hill, the intricate Spanish coastal town unfolds like a painting. Tourists wander narrow, winding streets, their footsteps echoing against brightly colored facades. Balconies adorned with vibrant flowers overlook the shimmering sea, while the scent of salt and citrus fills the air. Laughter drifts from bustling cafes, where locals and visitors savor the town’s timeless charm:0.32), (signature text "NOEDEL" in the bottom right corner:1.23)
so maybe a photorealistic prompt description together with "flat colors"
Love it. Is this with a LoRA?
yes
Private or publicly available?
private
Had to install forge to use flux dev nf4. Worth it. Very fast, i'm even upscaling.
I remember you don't like civitai for good reasons - but any chance you will publish your loras? Maybe on huggingface instead?
the artstyle is awesome. But also: this extreme fine details would probably not be possible with SDXL
maybe, yes. Now I'm testing them but can think about huggingface
the problem is that with civitai I got token for training
with HF no
ah, I see. i train my stuff on my graphic card. Fortunately, with all these optimizations that's possible even with Flux
well I have 12GB 4070
not suitable for training flux loras
hm :/ yeah, probably difficult
Could anyone got the flux dev q6 working on forge? The q8 .gguf work fine but the q6 continue giving me errors 
@heady patrol a simple wedding-ring with a broken metalik heart and a red gem as blood
#artisan-2 a simple wedding-ring with a broken metalik heart and a red gem as blood****
#😊|co-creators a simple wedding-ring with a broken metalik heart and a red gem as blood
"gabriel the angel of death comes down for your poor soul"
Death agreed long ago specifically not to interfere with the living, but from time to time he just can't resist to fool around a little bit...
This is amazing, from a LoRA I assume. Tell me, would you be willing to post on some site? Also, would you be interested in doing Frank Frazetta? I have a good number of his stuff digitalized. Though not labeled, which is fixable.
I mean his fully illustrated comics
not the 40 odd covers
At the moment I'm testing that Lora. After I should think about the best place to post. Also since i use buzz it costs 2K buzzs for training and anyone can.
I am unfamiliar with the service. Can you link?
I have a dinky 8GB laptop 4060 so.... training locally is not an option
"It fixed some perf issues but caused other issues that need to be debugged." might be relevant to the issues you were having and that you had to rollback for:
https://github.com/comfyanonymous/ComfyUI/commit/6730f3e1a362d5f3ed44f8541517b03356e7bf0e
True, but I wouldn't know where to begin to train on it
with that link.. Upload images and select flux.....
if you can just get to the level of linux knowledge to open up a working python environment on a fresh machine
then you can use diffusers
Yeah.... You could have simply said, "If you learn Chinese and Japanese and Aramaic, and pray to Buddha, it is easy"
Meaning Neon, not Andreac
I agree its a pain
no. It's just easy.
fighting servers
ah ok
also, not sure which country you are in
cloud is better value the closer to Sweden you are
not sure why but Swedish internet is insanely good and that goes for their datacenters too
@bitter hearth I generated hundreds of GB to train chess NNs for well over a year of nearly 24/7 macines on Vast.AI, so I am familliarwith the challenges of using it. It is a monster pain, and you'd need to have all the tools and whatnot well set and oiled
Brazil, and nothing here is easy or cheap in terms of that sort of thing
SOme is though. Plain internet is obtainable at decent rates
but the rest is ugh
Anyhow, I did resolve the speed issues with the new GGUF models on Comfy, by roling back to yesterday and setting --lowvram
ah I see
South America situation must be very different yeah
Don't even think of lumping all the countries here as one. I don't mean that out of some nationalistic statement, just that the economic and technical realities for each are as disparate as Europe to remote Africa
ah okay yeah I see
Anyhow, gguf Q4_K t (4.7s/it) is still slower than NF4 (3.5s/it), but is also better quality. Q5_K (6s/it) produced a near match of pure Dev, but the sample size of one needs at least a couple more to show this is a reality or a lucky coincidence. Will test Q6_K first
sounds good
I would love to have a frazetta lora. If you are willing to share the data I could train one.
my current approach of training frazetta is to train on sd 1.5 created images with frazetta in the prompt - but the sd 1.5 inages only roughly approximate his style
More than happy to, but likely need to breakdown the full page illustrations into single images.
sd ultimate upscale. i used the forge extension but its probably so good on comfyui too. i'm tellin you. this was barely any effort. just .. ugnn.. i just threw settings at it and this is easily the best results i've gotten from the extension
time to make a plan an get serious with this. flux is a seriously high quality tool that opens up so much potential
that's really high res yeah
just told it do 4x , 0.55 denoise, megapixel patches. i think i had seam fix on too , but it made the two versions, one without seam fix and one with and i can't tell which is supposed to be fixed
might've been 0.51 denoise
vintage photograph from a waterpark in my hood growing up. before and after flux ultimate upscale at 0.48 denoise
lol it added a ton of boulders to the walk way, but that's actually realistic for this place
crazy how well that works
https://imgsli.com/Mjg4OTE2 it's so baffling that people are running around reddit claiming flux can't do img2img
use the link. it shoudln't have embedded. it's a comparison site
with comfy any model can do inpainting
I don't understand why reddit makes things up
the idea of a diffusion or ret flow model that can't do img2img makes no sense
if it can't make an image from an image then how is it going to make an image from latent noise? 🙃
yeah forge should be set up to do inpainting now. img2img wasn't working earlier this week but it sure is today
Hard to pick the best approach.
the guy argued with me and said that he had talked to one of the flux creators on reddit and they told him that they didn't condition the model for img2img and it woudl always produce hazy outputs no matter what. i was like "uhhhh.. this sounds like \my dad works for microsoft\ " then he reported me for harassment
the whole sub reddit makes no sense
wow yeah that's bad
afaik the podcast that just came out is the ONLY public posts BFL have done outside of their website
i guess thye made 3 twitter posts too
what's the podcast?
https://a16z.com/podcast/the-researcher-to-founder-journey-and-the-power-of-open-models/ it shoudl be on the various pod networks too. play, apple, spotify, i dunno how half it works
its pretty good and worth a listen
ty
lower denoise and a more descriptive prompt is helping to upscale this image a lot.
5 min it'll be done. brb
Would you be willing to share best settings for Flux Training on CivitAI? I want to give it a shot... never done it before. Unless the defaults are fine...
Thank you 👊
Apparently Flux Pro is very pro-aged people since apparently this image is of a "YOUNG man":
All I can say is that if you are 25 and look like that, plastic surgery is an option
He has seen too many AI generated hands.
at lower denois it left a lot of the low quality artifacts, kept the crowds smilar, but also changed my favorite part the arbutus tree
trying forge's multidiffusion integrated now. its working good too!
Those sliding tubes are a proper Moebius strip.
You'd end up in the 4th dimension sliding thru those
thats a real park that closed when i was a kid. this is from before my time.
😮
though i ddi get to experience all fun before it died. those are legit
here's another angle from teh banana pool . these kinda waterparks were pretty common around canada
yeah screw it. i'm tossing the ultimate upscale script. multidiffusion is all you need. i used 0.41 denoising on this one and it barely touched the low quality artifacts of the original, but i can already tell it did a LOT better of a denoise and in a faster amount of time on account of less vae calls
prompt was simply:
Cadillac Escalade Interior
whats nice about flux is cars dont have some kind of concept interior where the passenger has some gunner seat with a steering wheel too. they're proper now
i was kinda thinking of that cool intro scene in the new ghost busters with the sweet ecto 1 gunner seat mod when i typed it
it is pre cool
that gunner seat was neat
INSANE how it just generates a high quality image from such a short prompt... just like that.
Meanwhile me on sd3 doing random crap
Feel the holiness of the cookie
You sure know how to get the most out of SD3.
Best model 
Not to mention its fine sense of irony. "average SMALLEST American car"
Hey latest AI models come with a humor module.
Yes, I saw when I asked for a young man
just get a GMC denali. they're teh same thing with less brand prestige tax
And much different interior. I prefer the Escalade dash... the new Electric one has an insane dashboard
you know you got an escalade .
actually the new chinese electric SUVs have been catching my eye and making me say wtf
the yangwang caught my eye first because, well, the name. the yangwang u8. wild looking
(real photo for sweet interior reference)
Real photo of the ballpocalypse that happened in a closed off country 
yeah thats it. boxxie
Can someone enlighten me on the proper way to use Flux LoRAs? I see some workflows call for ONLY the model pipeline and others have the full Model and CLIP in the pipeline...
ROTFL... this was with the Arnold LoRA LOOL
a guy with a hero mask
this what Flux.1 Dev knows about this car:
prompt
yangwang u8
Snake Pliskin in : Escape from Ballsylvania : Balls out
/generate a guy running
It's Sawnick the Hedge Dog.
i love how oddly specific we can get with prompts now with flux 😂
anyone remember april editions of game mags?
One thing that has become quite visible is that the text parser of Flux Pro can produce strikingly different output. At first I wrote this off as just RNG doing its merry dance, but no longer.
sonic after he browsed 4chan
More specifically, Pro's text parser produces many times far more interesting, if not always as precise output
I am attributing this to the parser since I have not seen anything in Dev and Pro that suggests vastly different quality image output per se
Sanics !
@remote holly qanics!
What's that on its mouth 
"The treasure is buried in the ....." <croaks>
he didn't wanna run alone
he probably did but she just invited herself. typical. amirite?
So to illustrate what I meant about Pro and Dev, here is an image by Pro (note that 2 out of 4 times it fails to produce any text at all), and the images are all in these lines:
To contrast, here is Dev (tried multiple images, but all are similar):
The prompt is: A captivating image featuring a young man seated at a desk, diligently working with a pen in his hand, his silhouette barely visible against the backdrop of a mesmerizing fantasy scene. The double exposure technique reveals a lush green forest, a valiant knight, and a female mage casting a spell, all seamlessly blended together. The words "Imagination and creativity are the mind's wings" are elegantly written in large bold letters on the side.
size was 3:2 or on Dev 1280 x 800
pro sacrifiiced prompt adhearance there. probably an aesthetics guidance thing. the dev gens had the double exposure technique down pat
and really, if we're talking about text, just generate the image and then use any editor from 1999 or newer to render text on it.
Blobstyle LoRA
trying to make the beastie boys but flux is like "Austin Butler as THE BEASTIE BOYS"
i'd watch it too
this is closer but even if i really lean into hard descriptions of each of them, zeroing in on one album in particular, can't do it . doesn't know beastie boys very well. thinks they're just generic boys but they're not. they're not generic boys at all!
this ones like timothe chalmamet stars as mike d
could he pull it off? from muad'dib to wonka to michael diamond?? i have hope
testing 2mp`(ggufq8_0)
It has officially been confirmed, Flux can infact create male anatomy! Needs a lora to do so though.
Prompt: "a comic drawing of a software engineer sitting at a desk with an exasperated angry look on his face. He is pounding his fist on the desk. There is a high end workstation PC on the desk. The monitor is displaying a pair of female breasts sticking out. He is shouting 'I was looking for Breadth first algorithms!'. A female employee is standing behind him looking at the PC monitor with a shocked look of horror on her face and she is fainting. She is shouting 'HR!!!' "
You can get two different speech bubbles with accurate text?! 🙂 🙂 🙂
Look what I found today!
It would probably be handy to perfect some SD3 images
https://huggingface.co/spaces/SkalskiP/FLUX.1-inpaint
Apparently! I tried it and it pretty much worked
I will add the prompt to the post
and I had to get a run gens to get a really good one of course, but only about 5 or 6 times
I was also curious if you had prompted the black censorship line or not, now I know 😄
Now I'm going to try ithat layout on some other themes!
Cx
my favorite of the upscales. keeps the most quality. has just enough jpeg to be believable.
Big tubes
#🆕|sd3 message the orignal pic is here. it was a real park

everyone doing flaming gpus. missing out on the stuffed pillow gpus. throwback to my first day with flux when i was tryin to break down the walls of expectations
if you think about it, the brain is just a lot of balls with tentacles
I've been using flux1dev_v10.safetensors this entire time. Is that the best one for low vram? I've seen so many come out that I've completely lost track now
nf4 doesn't help for testing my Dev loras, do not trying to use it currently
Finally getting around to training a Flux LoRA
brain balls
is that the spy dataset??
when flux gets ip adapter dataset building is going to take off
Yup... I just uploaded Epoch 5... totally used default settings from CivitAI. Not sure what GREAT settings should be
and with LoRA at 1.00 strength
has anyone else tried stuff like prompt editing on the newer models? it never seems to work like it did before with sd15 and sdxl. like [cat|dog] style prompts
heres a [horse:dog:0.6]
it seems to blow out the details when you shift the attention. here's a [horse\🐱0.5]
doing the alternating steps approach like [horse|cat] just completely blows out the subject
Nope I added that myself
if cfg is set to 0 it does nadda
i its probably really incompatible considering the t5 and parallel text network
I tried it a few weeks back and it didn't seem to make a difference. I should try again. What could be interesting is using it when using Claude or GPT as a prompt enhancer, it's likely that by now they would know what that means... Goign to try that I think
my best results were low guidance and 2cfg. i think its a case of the code and the architecture not lining up
POSTULATION. This is a ball.
No spine, no problem:
Friendly reminder that chiropractic "treatments" are pseudoscience
add the word 'rabid' to that
whoah the knit is great
this is so cool
must have chucked a lot of it in the training data I guess
I like the effects of this prompt a lot
not what i prompted at all but its pretty. trying different extensions on forge
looks nice
oic . i got one button prompt turned on still
Beksinski good
||Cacca||
I'm sure it can be done better - but here is one (barely) after about 20 tries SD3 (w/f in png)
with the new comfy ui execution inversion
what you can do is set it to retry until a vision model says its correct
do what?
Conditional loops
run the generation with different seeds
sorry I forgot to actually say what should happen in the loop 😄
Using Dice_AI's Platinum SD3 DD Checkpoint
Has his leg gone thru the bath? Or is the bath growing around his leg?! 😄
SD3 is great for a magic mushroom trip. LOL
AuraFlow0.3
This is just super gorgeous
i am sharing this here because i cant share it in general channel
dev or schnell?
and is it fp16, nf4 or a gguf quant?
ignore the second question if you don't know
Edit to add: it's mostly glif, flux, html, then save web page as image than flux superpowers
It is, then my remix of it lol
we rly need them to publish the flux paper
to find out what sorcery produced such amazing text
I want to know what they captioned the dataset with
It's via gliff, with Claude sonnet help etc etc
ah okay yeah
Just hit the remix button and it's all in there https://glif.app/@LadyLalita/glifs/cm00yuxa500026doktv9xznfu