#๐๏ฝsd3
1 messages ยท Page 111 of 1
8 steps
he's got a VAE loading
how is it comapred to just dev + 8 step lora?
I also tried decoupled one from civit and worked too
Not tried that, due to using fp16 models and trying to reduce lora usage
but I had different workflow, very simple one with single positive and negative and basic clip L instead of that carzy name
here. click on this image, then click the words open in browser, then right click on the large image and save as. then drag and drop it into comfy and queue prompt. tell me if you have issues
dont' change anything
I think they just have a massive dataset with adult content, which the open-source community is afraid to use.
seriously? you're talking about openAI remember
Dev with hyper is closer to Dev than Schnell
you can do Dev with Schnell lora at partial strength to get a kind of hybrid
there are two kinds of Schnell lora though, ones that leave guidance blocks as Dev and ones that turn the guidance blocks to Schnell
saved as webp, and nothing happens in comfy when i drop image into
yeaa yeaa, but it also got that specific style somehow, i thought it could be from cgi
Do you have it running locally?
you didn't follow my steps. first you have to click on the image, because it's Not the original image. that'll open it in a viewer. THEN under the image you'll see the words open in browser. click those words. that'll open it full size and the original image THAT image is what you right lick and save as. THAT image has the workflow in it
no
Yes - use the w/f buried in the PNG
what is schnell lora? I wanted to try 4step dev but found nothing
there are some on civit and hugging
Has anyone tried upscaling or inpainting yet with 3.5 L?
they subtracted schnell from dev to make a lora
so, just huge dataset and tons of resources?
all of flux IS basically a lora. they froze it with dpo
and if you adjust the strength of schnell lora, you can make the dev checkpoint more or less schnell-like
thanks, I will look into it
possibly, its hard to know
I was hoping for the paper by now
Wait, to confirm, is this SD3.5, or OmniGen? (I know it's my OmniGen prompt, just making sure before I cancel my install.)
i know for a fact that's what was done. it's also why it's not trainable. it is frozen
now it works, i miss one step, but now i see default workflow with your prompt, and defaul checkpoint loader
and you're never going to get a paper - robin's not about to write down what they did
3.5L
LOL almost fell for it. ๐
omnigen seems to be slow, but it's features are very interesting, would like to try locally
so are you getting good images now?
I thought OmniGen was someone's name ๐
It's an LLM trained to generate images.
https://github.com/VectorSpaceLab/OmniGen?tab=readme-ov-file
You can give it images as part of the prompt and tell it to do stuff with those images.
Try Searge Flux LLM
Omnigen works very differently yeah
i need to change checkpoint loader cuz i use gguf models
go talk to meta - it generates images, it just doesn't think it can
how about don't use gguf, okay? cazuse 3.5 doesnt' use unet
you'll need to wait till gguf for 3.5 is released with the right archtechture
12gb video and 32 ram idk
but it worked for me
it'll run, it'll just be slow
I have same setup and can run both, sd35l and t5 at fp16
Almost got some good text from it ๐
the Omnigen paper cited Meta Chameleon as inspiration
you might try Comfy's Fp8 model https://huggingface.co/Comfy-Org/stable-diffusion-3.5-fp8/tree/main
what a crazy workflow
Not!
he should be fine with fp16 as I am
LOL it turns out Omnigen is literally using the SDXL VAE, that's cool
he should be, but he doesn't seem to want to run the workflow i gave him, with the models it requries, and not change things
yeah you only need FP8 if you are trying to get a very specific speedboost
in Comfy its called -fast
It's been stuck at 0/50 steps for ages. Don't think it's gonna work. ๐ (Local OmniGen.)
downloading default 3.5turbo to test it on my setup and your workflow
Suggests too small VRAM
Oh!
It's not even trying. No idea why. Maybe once they release the Diffusers version. It's a 3.8B parameter model. Even at full f32 precision, it should use around 16GB VRAM, with plenty left for the latent and context.
I wanted to try it with 12gb ๐ฆ
so, I shouldn't try
I'd wait for the diffusers or for someone to make a Comfy node. (Not sure how hard that would be.)
sad that often cool things being left with no support
my workflow is designe for 3.5 large, not turbo, but it might work with it
It's been out for 5 hours. Code has been out for 2 hours.
people are asleep. patience
wait for Diffusers if you want support is what I would say
oh actually its in Diffusers already lol
๐ฎโ๐จ
oh I it's been a day already
good lord! clear up space
pack stuff into .zip files if you have to
Just installing a new C:\ drive - 4Tb SSD - up from 2Tb, which is overfull!
๐ฅณ
Anyone had any luck with Latent upscaling SD3.5 images? Mine end up like this, with x1.5 upscale
It seems to hate latent noise injection too...
Ahhh! Just realised I had the wrong CFG set on 2nd stage. This is with noise injection ๐
ah that's great I got scared lol
it worked with DPM++ 2S so I knew it could take at least a little bit of extra noise
My SSD M.2 ports are underneath my graphics card, which means I have to disconnect it if I want to add anything, and it has that crazy power cable that can only be connected/disconnected like 12 times before it fails.
my CPU cooler means I can't change ram lool
I think this is a common design fault. You need to change it for water cooled (small head) to free up the other slots. I have the same problem so had to discard the old RAM to upgrade.
yeah I think small head water cooled is nice
time to buy a data center and a few racks
"But if it aint broke don't fix it" so I'm very hesitant to mess with my PC. (Even though I've built dozens / worked in IT.)
I have this vast PC case - it seems to be three or four times larger than the motherboard!!! SSD access is luckily very straightforward
I have an external SSD with nothing on it at the moment, and I could get more. It's not even really that slow if I want to load models from it. But I just cleared out HF's cache and deleted some old LLMs. I'm really just hoarding old stuff I know I'll probably never use again.
All my checkpoints and LoRAs are on a 1Tb external SSD as well
Sometimes I wonder if these models will be outlawed in the future, and sites like HF and Civitai shut down, which makes me a little hesitant to delete them, even though I know that's almost definitely never going to happen.
nope
won't happen
Input image/change widget to input ... feed to KSampler
It always gets the same letter wrong! ๐ฆ
Well, still nice to have your own copy
SD 3.5 with FLUX pro 1.1
I actually prefer the first..
its kinda obvious flux has that extra oily shine
OmniGen workflow anybody, svp? ๐
some call it too plasticy
Ok...is this for stable artisan?
I haven't really done something like this so...๐ฅฒ
I've never noticed it quite so bad as that comparison though
sd 3.5 is pretty damn good
ComfyUI
Oh thanks ๐๐ผ
i like how sd 3.5 doesnt over do realism
SD3.5 is a tad gritty and grainy for photorealism; but it is majorly good at art imho
yeah i would honestly prefer sd3.5 to be slightly more fine tuned for realism but not as much as flux
just to make the image a bit crispier
Use my Super SD3.5L workflow for the crispier image
sure, link?
loaded it, what's up with the note at the bottom tho
You can add a 4th KSampler (Advanced) and set to 40/30 - 40 for an even sharper image
It stores my prompts - u can safely delete
rendering one image now
๐
yeah its not
Nice image anyway
Cool
those are sd 3.5 turbo
these are flux schnell
was using gguf q4 model for flux
this too is flux schnell but you can see how strong the tones are
comparing it with sd3.5
if sd3.5 can be fine tuned a bit to make the images bit more vibrant thatd be perfect
and most of the flux images are half body, cause it struggles with full body, which btw is much easier with sd 3.5
your multi sampler pass to make images crispier is interesting, im going to download the large model and test it out
Olivio Sarkas did similar wit Flux - his idea originally - kudos!
oh. then i can try the flux right away
how long does it take to generate one image using sd3 or sd3.5?
I found this but not sure if its a good working one https://civitai.com/models/879251/stable-diffusion-35-large-gguf?modelVersionId=985076
I'll post a w/f
yeah.. i'll grab q8... ty
Yummy!
i was editing your previous wf
what happens if you don't put pineapple on it, to quench the evil inside
No problem - create, develop, extend - into the known Universe!!!
(I'm sounding like Toy Story!)
thanks a lot for the flux wf ... made it much easier ๐
SD3.5L_Super
You're welcome
Horrible anatomy.
These folks show the typical anatomies of the peoples of Indonesia
Nah, so far, so good.
Wait for SD 3.5 XL . lol
A more Western look SD3.5L_Super
But it's something at least.
Let's wait for community finetunes.
Has there been any online 3.5 lora or checkpoint creators released yet? ๐
If anybody wants to finetune OmniGen, it is MIT LICENCE which is straight up one of the most "you own this" licenses in existence:
Do you have an OmniGen workflow at all? ๐
I don't have it in Comfy, and I couldn't get the local code working. Only used in HF spaces demo, and I was surprised by the quality for the "woman lying in the grass" prompt.
Also, OmniGen has controlnet and IPadapter built in as soon as it's in comfy.
SD3.5L_Super - quite good on the whole - 3 x KSampler
๐ธ
Hmm...
SD3.5 with 2 x Ultimate upscale on main character
SD3.5 generate a image if world war 2
I've missed your excellent art since I abandoned PiXart-Sigma ๐
yeah, it was good while it lasted, but got superseded by flux and now sd3.5
PiXart-Sigma has morphed into Sana?
yeah, i thought so? sana is the pixart-team right?
the sd3.5 large model is going to be great for fine tuning, but not as good as turbo distilled
nVidia - so prolly the PiXart guys
yeah
Q8 gguf sd3.5 turbo with default workflow on comfyui ...
didnt use any condition zero out or any of that sd3.5 stuff
and that q8 is fp16
I wish it would do better skin texture or is it the prompt
wrote a basic prompt
no filtering a gorgeous Viking warrior princess.
i could add some quality prompt to make the image look more realistic
and i was actually testing out the gguf model

Running the prompt on my setup
Different noise setting
A hauntingly beautiful portrait of a cracked porcelain doll with an otherworldly allure. The subject's kintsugi-like skin is adorned with intricate cracks and fissures, yet its sensual gaze still exudes an unsettling charm. Specular lighting accentuates the reflective surface of the porcelain, creating a hyperrealistic effect that is both captivating and unnerving. The anime-inspired proportions and high-resolution 8k clarity render every detail, from the cracked porcelain to the hooded features, with unnerving precision. The overall atmosphere is one of eerie disturbance, evoking feelings of foreboding and malevolence - as if the doll might come to life at any moment, its cursed existence lingering like an unseen presence. the actual prompt
i already stole it
and default checkpoint loader?
gguf diffusion loader
i shared the workflow #๐๏ฝsd3 message
thanks
mine is close to your original
yeah, but using vision models usually doesn't get you the prompt that actually made it...
i know but chatgpt is pretty close with image description
can't deny that tho
where glass rose
a photorealistic painting,I'll leave you choking on every word you left unspoken Rebuild all that you've broken, and now you know I'll leave you choking on every word you left unspoken Rebuild all that you've broken, and now you know Every wound will shape me, every scar will build my throne,I didn't say a word I looked back into my eyes
this is why i like song lyrics tho -> you can't chatgpt those. these are the same prompt, just different seed
so sd3.5 turbo model is originally 16gb but this gguf is 8gb.... isnt that nice
"this isn't the kind of reaping i signed up for"
The prompt from @lavish sparrow
Now I recall - the man who uses pop lyrics for prompts!!! LOL
๐
i suppose there's ... a few of us, but i'm one of them, yeah ๐
who was in charge with the artist studies back then? would be interesting to see for 3.5
Lyrics prompt in Super_SD3.5L
"WTF DID I JUST READ"
L Stability AI. Flux.Dev is still way better
care to elaborate, or is this just something you think/feel?
I will share it when I do more detailed tests, this is my current opinion by looking at outputs
this is somethign they have stated themselves too... you know? You're going to be to more specific than that.
but it doesn't fail at shattering faces, making the world a jigsawpuzzle that gets sucked into a void, or other weird prompts i made -> something flux did fail at
so just being apretty image isn't all there is. prompt comprehension is a thing too. and that's where it's better than flux
if your prompt difficulty is : "person standing" flux will win, pretty much 100% of the time
Oh, I just realized you have a Stable Society role, so I can't expect you to be objective about SAI ๐
everyone knows Flux looks better\
the idea is that SD3 is not distilled
which will make it much better than Flux, given time
"a black heart"
i got that for winning an art contest.
i've also tested kwaikolors, hunyuan, pixart alfa and sigma
Flux dev already has a de-distilled version, and although it is still experimental, it works extremely well
What?
and now we have another model that seemingly works pretty well, but is just like pixart or kwai -> a base model
so, as far as basemodels go, i like sd3.5
at the moment I don't think its aesthetics are very good
I've been following the various de-distill attempts quite closely
this one looks cool ngl
It is not fully de distilled at all, it still has many biases of flux. Only thing is it supports negative prompts and cfg.
i really like how well sd3.5 responds to negative prompting.
It doesn't even work properly on the negative prompt, youre right
if the de-distill works then we can switch back over to flux :shrug:
I don't know, maybe I developed a dislike for StabilityAI after being disappointed while eagerly waiting for SD3 at the time
๐คท
well I guess it wasn't possible to be a BFL fan before they were founded
yeah LOL
I thought pixart model was coming but then the pixart team made Sana
which is different
yeah I think we are getting just Sana from them
this is what Sana looks like by the way
its not a great image quality but its super fast
But if it's not good then you have to start refining it, which defeats the purpose of it being fast and you may as well have used Flux or SD3.5 to start with ๐คท๐ปโโ๏ธ
yeah I am having trouble thinking of how Sana would be useful
I think there are some scientific models that use diffusion now that it might help
weather maps apparently
bio-engineering.
source: company i work at
wow nice
I would give it a try. The idea of sana is amazing.
yeah I will definitely try it
sniper-wolf? XD (the cuter edition)
the VAE itself will be useful separately anyway
I'm feeling bubbly today
I always want a MoE pipeline to generate images and Sana seems good at this.
In size and speed
useful for "edge" devices like phones as well
ww2 fairy
Looking for teeth?
waifu design schematics
Some concerning hands there ๐ค
...and here!
I think I'm going to have to try and add a Flux detailer ๐ฆ
fuck who broke the universe jar
My current ComfyUI workflow for anyone who needs it. It has GGUF and Lora support and an optional upscaler
HO. HO. HO.
Isnt that the base workflow 
Oops! My bad
please help a noob get started with stable diffusion๐
how do i start generating images here?
a lot of people here generate on their own computer, but you need a powerful gpu to do that
i have a powerful gpu
Super_SD3.5L w/f
i mean is there a channel where i generate it ,like in midjourney?
Anyone get SD3.5 working with 8gb vram?
there used to be bots, but uhh... i haven't been here for a while, so i dunno if there's still generation in discord
Yes and here's my w/f
You can try the GGUF versions found here https://civitai.com/models/879251?modelVersionId=985027
Those should reduce VRAM cost
@noble coyote thanks I'll test it out. Do I need any launch parameters?
sexy zombie taco. i dnno why. but now it is a thing.
alien understanding of coffee
"u broke my heart"
Does anyone know... if clip g l and T are the same for sd3 and 3.5. I havent downloaded any new clip and just used what was there from when sd3 released
suspicious spy-bird
yeah, you can use the same
SD3.5 on left and Refined with my own Flux merge on right
too bad it failed on the headdress
and it lost some of the painterly quality the og had
Yes, but it's first attempt. I'll play around with the levels and see what happens.
this is a death-mauler 9000, do not be fooled by it's cute looks
Christmas? 
Easter is coming 
"don't worry mom, it's just a phase"
do stabilityai/stable-diffusion-3.5-large fit in 4090 memory?
Yes
Missing a hand, but... ๐คท๐ปโโ๏ธ
Don't play with fire kids!
SD3 Large struggles with these prompt adherence challenges quite a bit:
But it did sort of get it in the end after about 10 gens (but missing the liquid).
first time i got my burgers so... disgusting ๐คข
how many steps?
Hopefully AIT should have proper and full SD3.5 training support soon
It definitely needs a LOT of fixing, but I am down to give it a shot
20 Steps, but I doubt using more will have any effect, but I might give it a go.
it'll give it more time to get things right... 20 is on the low side
- because uh. i'm addicted to getting a lot things right.
Yeah 20 isn't enough
30 should be your floor for most things even ODE
If you're doing SDE with this then it benefits even more to add some extra
anime, girl, progression chart left to right, human to robot, cute to horror - first model that got this right, pure prompt.
well done sd3.5 team โค๏ธ
is the AI saying art is obsolete from now on?
mess with the gecko, u get the stabbo
It is proving to the complainers that it did not scrape anything, it learned straight from the source
I've seen such weird images from AI that definitely never would have existed in the world before ๐
and in such fascinating and fun volume
I partly figured out what the issue might have been I think.
I was using a workflow with the Dual Clip loader, so the T5 Text encoder wasn't in play, but it is still not all that prompt adherant now I am using all 3 text encoders.
yeah, t5 is kinda big...
I used to have such fun feeding Midjourney v3 the vaguest prompts and it would come up with mindboggling and compelling images. I'd share and wow friends and family alike with some
First old 8b api, second each time sd3.5. Seems 3.5 untrained 3.0 api or something ๐ฌ
photo of an alien holding a barbie sitting in his ufo, hard surface, retro futuristic, gothic, art by h r gigger, android jones,james jean, alyssa monks, john martin, highly detailed, fine detail, intruiging, #absurdres , nikon d850 '#photography, sharp focus, dof cinestill
In sd3.5 it's not sitting inside the ufo
A cadaverous boy with sunken, black-rimmed eyes, alabaster skin, and a rictus grin, adorned with a top hat and a tangle of vines, in the eerie, whimsical style of Ray Caesar and Tim Burton, with intricate, high-contrast details.
In sd3.5 it's a generic skeleton face :/
dark gritty pinhole photograph, dusk luminous faerie realm landscape, crouching figure in tattered faded clothes on the side of a broken overgrown path, crumbled ruin archway, ivy and wildflowers, warm color grading, ancient stone structures, moody aesthetic, trending in magical fantasy
In sd3.5 it's overly glossy generic style
But still trying, these are old fun prompts that worked in for me in 3.0 api, seems i need another way to get this out of 3.5
multiclassing - cuz wizards run out of spells
anyhow, SD3 is being praised wildly out in the tech world. TMG claimed it was better than Flux 1.1. Seemed a tad overdone. I agree it does trumpt it on a few obvious fronts though
tmg?
And the awesome thing is, all our kids mj images and prompts are still sitting there a couple years later ๐
some of these are kinda subjective
subjective?
meant to reply to Aliquip
I am not ashamed of stealing good prompts ๐คญ
I have no idea what that even means. 'stealing a prompt'
kinda, but the alien is not in the ufo, the style in 2 is not like the artists mentioned (it defaults to skeleton face), and in 3 the glossy path is not what i'd expect from (pinhole) photography
I do so regularly to improve ideas and build a visual (in my brain) library
I always tweak and modify them though
i steal everything i can, and try to embed prompts in all images i post (wasn't in these, the 3.0 ones were old, the 3.5 not set up right), that's the fun part ๐
the alien sort of is in the ufo, its just that the UFO is more like a sci fi chair
also I agree it did not get the pinhole camera right
artists should probably be a lora thing if they are named
flux detailer?
SD3.5 is great at comics and cartoons. Flux is quite bad at it out of the box.
and SD3.5 inherently understands art styles. Flux was a colossal idiot for those
progression charts are good ^^
impressionist oil painting for Flux means photorealism
agreed. one of my few gripes with it
This is not to bash Flux as bad. It is simply acknowledging and working with its strengths and weaknesses
yup. but not being able to do other styles other than the baked in style really well... kinda sucked
for prompting idiots like me

Yes
For example, SD3.5 has failed miserably (might be a prompt issue) with my classic cappuccino with the words Coffee is my World written in the milk foam with coffee swirls. SD3.5's output had the words, but they just looked like printed letters
Whats the prompt, i wanna try it
I mean this looks like it works?
The words are supposed to look like they were made from the coffee swirls, and not like printed words on a milk foam background
It's a Rich to Rild story.
The age one is good at least, poor guy finally dropped that bagpack later in the life.
TO be fair, Flux is better but not perfect either. The best by far was Dall-E 3. Ideogram and MJ were a solid second
Flux output
One thing that SD3.5 has been consistently terrible at, and it is both specific and kind of weird, is a hand holding a pen or pencil
yes but sd35 has the potential to become better than flux.
if the people take to it and refine it
I am not condemning any of them
I test and learn what they do well or not
young to old, bling to homeless. damn
Yes, text has been a bit of a sore point for 3.5. But it might be that I am using fp8 and not the good version
Here is a good result by Sd 3.5 on a prompt that has proven quite problematic for the AIs
You can see the hand holding pen issue here too though
Toung, Loonelleg, IbIIgnt, Leght to thungots... those are the stage of your life.
Like I said, just a matter of knowing the tools and what they do best
Switched the prompt slightly, prompt issue for sure on your end 
It's numbering and colouring these correctly for me every time. It is missing the liquid, so I've rewritten the prompt and trying again...
Pretty but the idea is a cppuccino, meaning milk foam on top. ๐
My point stands 
right now the best thing is omnigen tho. tho its not out yet
well, it would stand if the output were what I expect
if you cannot get a correct result either....
gens look decent and if it can do all thos ethigns they say thats amazing
that means way less noodles
XD
I once did an image of a cappucino with almond syrup - but no matter how I tried - it was always hazelnut syrup!!!
that is awfully specific. lol
O, this isn't Starbucks then?! ๐ฅณ
semi-fail
and the letters really look like they are made in the foam and not printed on it
You specifically asked for foamy letters?
Here is the prompt: "On a clean white background there is a cup of cappuccino, and in the foam the coffee swirls spell the words, "Coffee is my world". Around the cup we see an elegant coffee spoon with an ornate handle as well as some grains of coffee."
Ah so it intuitively 'foamised' the wording?
Good one
Another interesting variant is to do this with a pizza and have it produce an image from the toppings. A world map, or a short phrase like this
.. afk ...
will it be just colorful words on a pizza? Or will they look like they were formed from pizza toppings?
I just noticed the chart says that SD3.5 Medium will be 2.5B parameters and not 2
since bottle prompt adherence is being discussed:
four bottles lined up on a table. from left to right, they are numbered "4" then "3" then "1" then "2". from left to right, they are red, blue, green, and orange. the setting is a magical forest from a tolkien novel. in the sky is an alien UFO abducting a cow with a ray of light.
first seed.
on a sidenote, I see artifacting in images of 1280 x 1280, suggesting that larger sizes are not supported
...
Astonishing!
this is what i got from the api with that prompt on may 4th
One thing I noticed on the announcement, it seems to directly imply that the upcoming 3.5 Medium is somehow trained differently than the other versions, and also supports higher resolutions? Not sure if that's what was intended though:
Stable Diffusion 3.5 Large: At 8 billion parameters, with superior quality and prompt adherence, this base model is the most powerful in the Stable Diffusion family. This model is ideal for professional use cases at 1 megapixel resolution.
Stable Diffusion 3.5 Large Turbo: A distilled version of Stable Diffusion 3.5 Large generates high-quality images with exceptional prompt adherence in just 4 steps, making it considerably faster than Stable Diffusion 3.5 Large.
Stable Diffusion 3.5 Medium (to be released on October 29th): At 2.5 billion parameters, with improved MMDiT-X architecture and training methods, this model is designed to run โout of the boxโ on consumer hardware, striking a balance between quality and ease of customization. It is capable of generating images ranging between 0.25 and 2 megapixel resolution
That would be great if it's actually the case
Intriguing also is that their self-made chart suggests it is toe-to-toe with the Large Turbo in quality and adherence
@noble coyote art by Alan Kenny
i see u posting stuff like this at times, thought you'd want to know
Large Turbo has some straight-up advantages over Large I noticed, even, it can actually do traditional hi-res-fix denoise passes without creating weird jagged artifacting around the edges of the image, while 3.5 Large still can't really
I wonder what it is about SD3 in general that causes that even, it's never really been clear
3.0 Medium had the same problem
Perturbed Attention Guidance seems to ฤฅelp a lot with it but not completely fix it
Streamline Moderne stylee
Thanks .. Googling Alan Kenny images ...
is there a site or thread on artists in SD3.5L yet (Yes it just came out yesterday I know)
cup
Horizon Zero Flux LoRA
do you know a list for SD3.0?
they were all unlearned from api :p
thanks
No i dunno, but all my styles that worked in api broke
okay so I'm not alone
I won't claim that I did extensive research but Zdzisลaw Beksiลski used to work well
or at least make something I really loved
but now paintings (other than maybe watercolour) look too bright and photo-like
like.... flux...
i posted #๐๏ฝsd3 message there about it
welp, thank you
So it was ruined
I literally wanted the large model BECAUSE of the painting capabilities
but generally whatever i try, it's gone. it does do styles kinda sorta still though, but not very specific anymore
and now they took that away
same for me, not so much painting but just different styles in an image
this was in the api
I love how this looks
it was a Zdzisลaw Beksiลski prompt
he was like the new Greg Rutkowski for me
i wonder how much 3.0 was lykon's dreamshaper styled finetune and he was kept away from 3.5 or something, but even then artists seem gone too
but now it just makes super bright and contrasty paintings
and idk why they needed a newer version to this when the VAE/speckled look isn't even fixed from the api
(yes to remove artists too that was another reason)
but seriously
I wonder if medium is just gonna be another flop
A grotesque, hunchbacked woman with a bulbous, crimson nose, skin like cracked porcelain, and a toothy, sinister smile, sporting a tattered, lace veil and a bouquet of dead flowers, in the haunting, dreamlike style of Ray Caesar and Tim Burton, with rich, velvety textures.
went from 1st to 2nd, soo generic, no fine texture or anything
second is SD3.5?
:(
they aren't lying it's 3.5 and not 3.0 ๐คฃ
yeahhh
Super_SD3.5L Workflow
yeah, you read the annoucement correct
Claustrophobic close-up showing that In the depths of a cave system, a wraith stands guard, the wraith has a gaping hole in its chest with an eldritch creature inside, tentacles crawl out. illuminated by the faint light of crystals. The scene is styled in the manner of Virgil Finlay, paying homage to the influential Weird Tales artists August Derleth and Frank R. Paul
3.0 2x then 3.5 2x, i have no words :/
i kinda hope i'm using 3.5 all wrong ๐ข
Are you using ModelSampling SD3?
the standard diffusers workflow
this is a painting, trust me!!!
also I get oversaturated messes at above cfg 3
and below I sometimes get weird anatomy
Flux does a mighty clefty chin chin
it turns everybody into Henry Cavill
yeah the paintings are super nerfed
If it supports twice the res as Large as the announcement says I don't see how it could be
Might even be viewed as better by people even if that's not the intention of SAI lol
uf, for none generic style it's all backwards
its a dumb ass model
I hope they intend to like address why it's like that if it is lol
People will be confused for sure
MJ seems the only imagen caring about style ๐ข
with any new model, you need to start at square one and explore it. learn how it thinks. put in a single prompt, lock the seed, try all the different sampler and scheduler combos. try different values for shift and cfg
you can't assume even a new iteration of a model will work the same as the previous iteration, much less and entirely new version
try adding 'brush strokes' or 'pallet knife' to the prompt
this isn't SD1.5 -> SD2 where we have a completely different CLIP model (Clip-L -> Clip-H)
But you're right, we should experiment
just in case there might be something good left in this model
no, this is sd3.5 which is a completely different neural network, different data set, 3 encoders instead of 2 - you need to learn how it works
give me your prompt and let me see what i get out of it
is this a watercolour
Prompt: soft watercolor painting of a soldier standing at the edge of a battlefield. heavy canvas texture, ragged edges
cause I heard that works well
forget about sd3.5, try https://blendic.ai
Create stunning AI-generated images by combining the power of visual and textual inputs.
I want paintings
give me your prompt and let me play with it then
rough impressionist painting of a ww1 soldier with a helmet, looking onto the sunset in the Grand Canyon. The primary colours are orange, red and blue. The colours of the painting are faint and the brush strokes are thick and detailed.
this is from the API
I would look up my previous prompts but Glif DELETED ALL OF MY RUNS
๐
rough impressionist painting of, A man in a forest, sitting on mud, which around a pond. It takes place at night with stars in the sky. There are candles around the man and he is looking up at the sky. the painting has large strokes and has high contrast between the colors.
(from the api)
๐ข i checked mine too, uuuffff... but that's on me not copying it sooner
luckily I still have some left over from my post
Large Turbo gens women who look a LOT like Flux Lady lol
presumably just as a result of also being distilled
Not got sweaty/oily Flux skin
Acrylic Paint, heavy impasto brush strokes, rough linen canvas texture: faded, expressive impressionist painting of a WW1 soldier with a worn, M1917 helmet, gazing wistfully onto the muted, warm sunset in the Grand Canyon. Employ chiaroscuro, sfumato, and drybrushing techniques to capture the rugged, weathered atmosphere
did you use unipc? it has weird artifacting
also this does look a bit closer
but still looks too much like a photo
and jfc so many keywords just to get somewhat close-ish
no. but i can go try it with that sampler if you like
thank you for your effort though
nah I remember unipc artifacting like this actually
not nearly as many as you might ahve to tell a human artist to get what you want
using specific painting terms carries very specific data - and directs the AI to what you want rather than it having to guess
its just that SD3 API did these by default and not with a super contrasty look to everything
I'll try this prompt and mess around with the CFG
the API isn't just the model. it's the model and a whole bunch of settings. you're going to have to play around with everything
and shift - i tend to keep shift at 2 most of the time. cfg at 4 and steps at 32 to 40
also try giving each encoder a different prompt, crafted to that encoder's strengths
i think the main problem is that the AI has just seen too many photos of faces - the rest of the image comes out looking like a painting. but that's something i'd deal with in photoshop in post production. and remember, this is BASE model, so it's also something you could train a lora for
Tails are still difficult for ai ๐ฆ
for some really bizarre reason, SD3.5 is convinced pens and pencils write on both ends
This is not an outlier
"The other end is for when you want to repeat yourself!"
do you think SAI purposefully trained it on a bunch of double-ended pencils?
Its the Ninja Star Pencil which worries me!!! ๐
tails are also inconvient when you wear pants
they do if you sharpen them on both ends ;)
Seems obvious
they overtrained it on double-ended pencils as a little easter egg for the community
and pens
It's good at character turnaround sheets ๐
in all seriousness tho for a pencil I'd adjust my prompt to talk about the red rubber eraser on the other end of the pencil, or the clicky thing for pens, just to avoid that weird edge case
hey @sage burrow long time no talk, sort of dissapeared after Flux came out
It was hiking season ๐
Didn't expect you to take "Go take a hike!" literally
@sage burrow do you have a quick review on the level of explicitness that can be generated with SD3?
Don Quixote
i know it's able to do nipples now but can it do more? i'd test it myself but i rebooted my computer and now ComfyUI is giving me a weird error "tokenize not found" when I try to use the simple workflow, which was working and tested prior to the restart
Make a large nipple with a human attached
SD3.5 - question for you: Why do men have nipples?!
I haven't trained any loras yet for 3.5. Without loras, for men, you can get some suggestive stuff but that's about it. For ladies, you can get a bit more but it looks odd. I censored the badly done nips
that is the worlds worst censor job
i find it interesting that if someone wants to know if the AI can do ... that ..., the person they ask is... @sage burrow
Turbo seems better at Anime than non Turbo for some reason, not sure if you tried it
The image is a lot more resolved and less painterly I find
Probably just a byproduct of distillation
I haven't tried turbo yet, after other turbo models not being as goid as the base, I just skip turbo now
Probably because I know which prompts most likely coerce a model to do nsfw lol
It's not as good at super complex prompts but there's no resolution support difference like there was before
And it has a really distinctly different overall style than normal Large
okay so bottom linne this is the takeaway for me
SD3.5
nipples - โ
genitals - โ
lewd scenes - โ
is that accurate?
Horror character turnaround sheet ๐
other than just telling it 'bark like a dog' or something?
Different styles are fun, I'll have tp try it ๐
@dull star (20 minutes later as i forgot a had a batch of 10 set on comfy) - click on this image to open in the viewer. then click on the words Open in Browser to open the original image. then right click, and save as, then load the image into comfyUI to get the workflow. if you look at the original image, it's a painting. the workflow has my settings
thank you
More like "bark like a dog, make a vocal sound which dogs are known for, bark bark, dog dog, yelp loudly"
I'm thiinking this file: https://huggingface.co/stabilityai/stable-diffusion-3.5-large/blob/main/sd3.5_large.safetensors doesn't have the CLIP model built in so that's why iit's not working with the simple example. do you guys know if there's a variant that has CLIP built in?
welcome. just make sure you change batch to 1 - cause 10 takes a while
prompt: a small kitten staring at the camera, roaring like a lion
??? what do you mean, built in?
Medium is able to work on consumer computers... I wonder what most consumers own?
so does large
try downloading it from here https://civitai.com/models/878387/stable-diffusion-35-large
10 minutes per image on mine ๐ฆ <same as flux>
sd3.5_simple_example.png doesn't work with that model but it works with the advanced example where you manually load your own clip models, and it's having an error during CLIP processing so i'm thinking maybe iti's not embeeded inito that file
i'm still not sure what you actually mean. the example workflow loads all 3 encoders in their own node. if that's not what you're talking about, please explain
theres 2 example workflows, the simple one which just uses the Load Checkpoint node and the advanced one which shows off various ways of loading your own CLIP models, simply put the simple one doesnt work with the file given, can you confirm thats the case for you too?
Did anybody test to use sd3.5 with T5 and one of the clips only? Is it that important to have the 2 clips ? Considering that flux uses just one ?
my workflow is in this, download it and look at how i have it set up. 1. click on the image to open it in the viewer. 2. click the words Open in Browser to open the original uploaded image 3. right click on the image and save as, then drag and drop it into comfy
yeah. me. and it doesn't work well - it'll just use random data if you don't give the encoders actual prompts
and flux uses 2
@craggy crest your workflow doesn't help or answer my question
- i posted the image for the simple workflow which demonstrates I know how to use and load images as workflows
- I specifically mentioned multiple times that when manually loading CLIP models with their own node it's not a problem
- I'm simply asking if you've managed to get the simple workflow to work that JUST uses Load Checkpoint which internally attempts loading the CLIP model from that given safetensors file. it's a yes or a no, yes I used it and i got it to work, no i can't get the smple workflow to work either
and i've said over and over i have no idea what you mean by clip models
you know the little yellow input/output dots on ComfyUI denoted as CLIP?
regardless of whether you understand or not what i mean by clip models can you just confirm if you can get the simple workflow to work? in other words can you get it to work using just Load Checkpoint
so your safetensor's file has clip. yoru encoders all use clip. are you talking about not being able to connect the clip connectors in your workflorw?
i have had no problem getting the workflows to run
so can you try loading this image/workflow and confiirm it's not a problem ?
I'm talking about not being able to use the Load Checkpoint node
your negative and positive prompts need to connect to the conditioning node
that's your model.
noticie how Load Checkpoint has a yellow CLIP circle on it?
yeah. that's your model .
my concern is that the siimple workflow isn't working because the file doesn't output the CLIP from there
so you need comfys fp8 scaled checkpoint that includes the clips, there confusion averted
and that's why iit's not working for me
yeah, it has clip in it
EXACTLY
can you pls link me to that
that contradicts what Kagi is saying
i know. that's not where you connect clip form in that workflow
there's an alternatiive sd3.5 checkpoint that includes the clips in it right?
you connect it from the encoders
wrong, that's how you connect clip from in the SIMPLE EXAMPLE
comfy does have a discord - he posted the information in his annoucements yesterday
ill try that out thanks again would've never found it on my own
https://comfyanonymous.github.io/ComfyUI_examples/sd3/
The sd3.5_large.safetensors file (put it in your ComfyUI/models/checkpoints/ directory) does not contain text encoder/CLIP weights so you must load them separately to use that file just like in the following example:
Oh I should've read further down into the article, that's a my bad
For convenience there is an easy to use all in one checkpoint file sd3.5_large_fp8_scaled.safetensors (put it in your ComfyUI/models/checkpoints/ directory) that can be used in the default workflow like any other checkpoint files.
@craggy crest maybe you didn't know that which is why you didn't answer with the right link?
this part right here "encoder/CLIP weights " is what i was asking you if you were talking about or not. it's the encoders you are wanting embedded in the model, just an FYI - which isn't the best idea as you then can't use your own encoders if you want to
Do you have any idea what this may be about?
you can use your own encoders, just load them in a node and dont use clip from model
i use comfys fp8 but load encoders seperate to be able to use fp16 t5, cuz that checkpoint has fp8 in it
did you forget and feed the gremlins after midnight?
if you're going to do that, there's no point is using a model that has them embedded
cool but thats whats avaliable
right, i appreciate you trying to help and trying to figure it out with me, I know it's not the best idea but Load Checkpoint is how I have my workflow setup to work on my website so I dont want to do some crazy rewriting of the code to have it be compatible with the new clip loader nodes
should be a number of new versions, quantaized ect, by the end of this week - the speed at which this community is moving
yup
you have to load a checkpoint for everything. that's your model. you're not generating without it. but it's a lot better to use the encoders alone, not embedded in it
like i said this isn't me arguing about what's better or not, this is just me responding by saying it's in reference to integrating to existing code
I have code that takes a workflow and adjusts it for an image generation so the workflow must use load checkpoint
i'm not arguing either. just trying to point out that you might want to rethink what you're doing and try something different, it might work better
okay, could you at least change that to "load checkpoint node'
just trying to make sd3.5 work with my system at the most basic level and then slowly mprove support
im already using gguf quantized version of the Large turbo which is half the size of original turbo model
like my system supports the t5xxl and the clip_l prompt encoder nodes so I have the LLM generate 2 different pieces of texts and injects it into each fiield
that's a good idea actually I should Q_8 the large model for even more performance
meanwhile, if you want to use SD3.5 - clip_G is the encoder that does all the work.
i thought it was clipg/t5 that did the majority of the work together
if you don't use it, you have removed the core
they already have the q8 version of the large too, reason im not using the large model cause it's not distilled as the turbo
no, clip_g does the work. clip_l does the artsy, ambient, fine details and text. and t5xxl has your prompt comprehension. you need all three
there is a merge also
oh dude you're the best thanks
because turbo is distilled its got finer output, which im enjoying atm
is your large based off the one that doesnt include the clip?
like is it large + t5 fp8 or large without clip?
you will need all the clips, yes
i'll pass on that then
ill just Q_8 the version that has the t5 fp8 built in
if you wanna post those too i can wait for your copies
i prefer to have one on civit so i can link my models to a version online
so you can merge your own file to what you want also, not particularly hard
i know how to quantsize a file but not how to merge it so i would just take this one: https://huggingface.co/Comfy-Org/stable-diffusion-3.5-fp8/blob/main/sd3.5_large_fp8_scaled.safetensors and run it through the q_8 converter
so you want to embedd the clips as a single checkpoint?
yes pls
you'll probably see me post a bunch of images under that model's page too
if you see one guy posting a bunch of images with that model it'll probably be me lol
but you realize, by doing that you wont have the flexibility of using improved clips
i'm very well aware
like i said im using a fixed workflow that demands Load Checkpoint so there's no room for changing that at the moment
for example look at what im using
yeah I'm using that exact same seutp for flux (minus clip g)
bottom two were released later
i have the VITL14 text detail and I also have the SMOOTH one
i also have longclip which is even better than the VITL TEXT DETAIL one
there is no particular advantage in embedding the clips into a single checkpoint
im all for subbing out clip_L for better models by all means it's just my sdxl workflow isn't setup for that at all
its this easy
the major reason I need to have CLIP come from Load Checkpoint is because the workflow supports sdxl/sd3/pony base models so I can't sub out the CLIP models for pony specifically
none of the fne tuned CLIP Ls work with Pony
i see
so it's a major rewrite to detect the base model and adjust the workflow dynamically based on all that
it's fair i guess... no harm
but everytime they change the clip and not the model you'd have to update the whole checkpoint
yeah im willing to do that
and people will have to redownload the whole thing
so am i to expect a sd3.5 large q_8 + t5fp8 to be posted on your page soon?
i keep my wf organized for easy click load/access
i keep my wf organized as code as part of the system
// Import workflows
const { output: vaePlusLoraOutput, workflow: vaePlusLoraWorkflow } = require("../../comfyGalleryAppFrontend/src/components/img2img.json");
const { output: fluxOutput, workflow: fluxWorkflow } = require("../../comfyGalleryAppFrontend/src/components/fluxWorkflowKsampler.json");
const workflows = {
sdxl: { output: vaePlusLoraOutput, workflow: vaePlusLoraWorkflow },
flux: { output: fluxOutput, workflow: fluxWorkflow },
};
which UI are you using?
do you people notice something about sd3.5? usually with these models, including flux when you add a lot of detail into different objects in an image it can somewhat reduce the quality of the character's face, which is not so harsh with sd3.5
@silver sluice Well, if you were able to quantize the fp16 build at Q_8 that would be exciting, but fp8 already comes fast and so lossy, I am not sure of the benefit
ok nvm
looks like screenshot from a game
he wants to quantize the model into a single checkpoint unlike gguf diffusion model where you have to add clips node, i dunno if that's possible with quantized gguf
Yeah, no clue. why is it such a big deal though?
no idea
but he mentions he wants to make it convenient to load up checkpoints between sdxl and stuff
I mean, having GGUF is a big deal for performance purposes, but being an all in one or piecemeal makes no diff
ah
I just keep a pic handy for each workflow I want to use
yeah i tried to tell him, when they update the clips, as they do, he will have to recompile the whole model and ppl will have to redownload the whole checkpoints just for a clip
and a quick drag and drop
so I have an SD3 pic (with WF), and a FLUX pic (with WF), etc
how do you show images within comfyui for the workflows?
i usually name my files and load them from workflow tab
Well, suppose I wanted a FLUX wf and it is not already loaded. I have an image labeled and just drag and drop it into Comfy
oh that
i thought you had it set up for images to show up within comfyui like a1111
I think the last time I used A1111 was with SD 1.4
I have no idea where it is at now
dont see anyone using a1111 for months now
people still use forge but i prefer comfyui
I have heard about Forge, but not tried it or had any reason to
its similar to a1111 but personally i like comfyui too much to change ui
The sheer modularity and flexibility of Comfy just trumps its flaws
performance, convenience, efficiency, all those are plus points for me with comfyui ๐
very active development
yeah
sure. The downside is that godawful local menu setup to add nodes.
once i got the hang of noodling around with comfyui im hooked to it
where you can find a checkpoint loader of 10 different kinds found all over the place
sd3.5 medium gguf https://huggingface.co/city96/stable-diffusion-3-medium-gguf/tree/main
just about 2gb
I tried the 3.5-Large gguf Q8
its wonderful
say... compression....
@cunning lintel are you using fp16 or fp8 T5?
turbo isn't bad either, for speed and quality both
T5 isn't usually the thing giving the style as opposed to clip-l and maybe clip-g
downloading medium now, server seems jammed
but just in case it might contribute to the inferior painting capabilities in our tests
Any fps test on a 4090 on the sd3.5 and sd3.5 turbo?๐ฏ๐ฏ
3.5 Medium is not out yet, but he did do the 3.5 Large also
he has all 3 quantized
SD3.0 Medium
but not sure how he got hold of medium before 29th
He didn't
What do you mean by fps? If you mean frames per second, Its much slower then sdxl and even sdxl cant generate 1 fps on a 4090.
His page shows the link to SD3 medium, not 3.5
ohh yeeh.. overlooked the version
Mochi-1 is crazy amazing video gen(this is text to video) for open source, testing mochi on fal.
fp16, i used diffusers default config/download
it's clip_l that deals with the fine details, artsy, style, background, ambient
Pony = Main CLIP Switch set to 1
SDXL = Main CLIP Switch set to 2, SDXL CLIP Sswitch set to 2
SD3 = Main CLIP set to 2, SDXL CLIP set to 1
is there an advantage in quality using the triple clip loaoder over the dual clip loader?
i see you are trying to incorporate different workflows into one
indeed
im finding the quality is better with triple clip loader vs double clip and the performance is about the same, im not seeing any longer times, also I just tried the longclip model in the triple clip loader and I'm pretty sure it worked great, running it again to double check it actually worked
just to correct my earlier screenshot here's my 'final' version. to recap the LongClip model does work with SD3.5 via the triple clip loader

btw, 35l can be used on very low ram\vram systems becasue you can drop t5 and use clips only with q5 transformers which is about sdxl size or smaller
peple have already got gguf's out
FoFr's updated his repo https://github.com/fofr/cog-comfyui/blob/main/CHANGELOG.md "I've updated any-comfyui-workflow to have access to SD3.5L regular, fp8 and turbo weights."
GGUF and GGML are file formats used for storing models for inference, especially in the context of language models like GPT (Generativeโฆ
The TL;DR explanation is that it is a form of compressed format with minimal loss that is fed in blocks so even an 8GB GPU can handle a beast like Flux Dev which nominally needs 24GB to run
There is a chart that shows the levels of loss, but suffice it to say, it is far better than fp8 and Q8 is less than 0.1 worse than the pure original
yeah i use gguf q8 for my flux setup and it works great, really speedy, high quality output
It is the poster for a new Pixar movie: "A Bug's Life: Knights of the Round Table"
Nice gen from Mochi-1, I'm still amazed for open source, from Banadaco discord
prompt: You're sitting in the driver's seat of your car, driving along a winding mountain road. The scenery is breathtaking as you pass by lush green forests and sparkling waterfalls. Suddenly, a brilliant blue screen appears on the windshield, seemingly out of nowhere. The error screen slowly pulses
Oh ok... sdxl definitely can generate more than 1fps on a 4090
I doubt with 25+ steps, unless you are talking about sdxl lightning, lcm, turbo models.
yes, i was thinking on lcm, turbo,... thats why i was curious about the sd3.5 turbo model
you know an inpainting model would actually be really fricking good for SD3.5
very nice
painterly things
full body, style of Simรณn Prades, Ismail Inceoglu, by Ashley Wood, obscure, loose and expressive, broad brushstrokes, grays, whites, subtle skin tones, moody atmosphere, otherworldly demonic woman, bioluminescent fist, Dark shadowy heels, golden eyes, dark torn scarf, golden sun tattoo, dark curled horns, body composed of swirling energy adorned with intricate patterns resembling swirling smoke and sparkles
I tried a few flux prompts in 3.5, they do better, as there i already had an llm rewrite the artists to explicit styllistic stuff
it's coming i'm sure
Style: High-contrast, black-and-white style, blending photography and illustration, with subtle, painterly textures, reminiscent of a delicate, dreamlike collaboration between the moody, high-contrast techniques of atmospheric, mystical and symbolic illustration. Scene: A creature with the body of a craftsman and the head of a gentle, skilled artisan, stands amidst a workshop filled with strange, half-finished projects, its eyes shining with a warm, creative light, as if it is about to glue together the disparate pieces of a new, wondrous creation, and offer it to the viewer.
Funny how 3.5 creates 2 different styles, i like it ๐
what are the key difference between q4 and q8 other than size?
im guessing quality . but then why make q4 when you can have q8
or is that q4 specifically intended for low vram
and if so, how bad is the trade off for using q4
Yeah youโre correct, q4 is going to be less accurate while taking much less vram.
The trade off isnโt too huge, you might lose some tiny detail and maybe very little prompt following compared to fp16. For most images, itโs going to be pretty similar.
ok, and im also guessing this model file size impact the ram rather than vram?
Ram doesnโt matter much, you only need enough to load the model file and thatโs it. Q4 will use less ram then Q8.
Q4 will also use a less vram too.
Very easy way to measure vram usage is find how big the model file is(Q4_1 is 5.27) and add 1 or 2. So the model file should take around 7gb vram.
ok neat, ty, i think im gonna run some extensive tests with q4 , already got some good results with fp8 and q8
Thereโs also the text encoders, vae, they are not included in the model file and will take a bit of vram too.
Essentially Q8 is within 0.1 error rate from the vanilla build. Q4 is a big hit and Q5.1 is roughly 0.3
Q4 is likely about fp8 level
doing some renders with q4 .. quality isn't too bad
it is not blind, it is simply less than the full pure model
a bit like the table SAI posted showing Turbo to be X Elo worse
yeah and i dont think i would call such texture bad at all ..
i had confusion that q4 would have blurry and poor quality output, which is not .....
seems like people have good experience with training sd35l
can't wait 12gb optimization
Is there an analogy to that between t5 fp16 and 8? Like would it be fair to say it has an error rate of half compared to fp16?
Nice response but doesnโt quantify the error rate just said slightly
Thanks for the effort tho appreciated
yeah well.. it says ...
The error impact isnโt linearโso it isn't accurate to say FP8 has "half the error" compared to FP16. The impact depends on the specific workload and model sensitivity.
I just like how the other dude gave out easy numbers, i understand irs not as easy to quantify. Just a ballpark would be nice tho
if the system can take the workload fp16 is definitely a better choice but in likely scenario of consumer hardware fp8 fine
The โslightlyโ and โit dependsโ isnโt really a good answer but kudos for trying it def provides some good insight
so you load these models instead of the large turbo and that's it, it uses the same clip models and text model?
technically yes but thre is a trade off in terms of quality which may not seem like a big deal in most cases but would matter for profesional production
also just to be precise in answering what you asked, the gguf are quantized / optimized for better memeory managements so yes these quantized models replace the original weights
That is correct. fp8 in t5 is also worse than fp16, but there is also a GGUF build of T5
how do you compare that encoder with Scaled version https://huggingface.co/comfyanonymous/flux_text_encoders/tree/main ?
fp8 is lossy, and the fp16 is the same, though the GGUF should load and be processed faster
@bitter hearth can you ask that LLM that was primed for an answer to try to approximate a ballpark answer as to the error rate amount
local sora soon... XD
interesting... so by what you are saying gguf version is a better and faster choice
Like if original to q8 is 0.1 then is fp16 to fp8 like 0.1 or is it closer to 0.5? 1.5? Just curious
all things being equal, yes
there are side factors with model and workload
Q8 is about 0.1 away from pure fp16. fp8 is around 0.5
Yeah just ask the LLM To factor that in and give us a ballpark see what it says
it cant assess it like that
it depends on the model sensitivity and workload
Oh I see so in this case sd3 large starts at fp16, whereas other models can exist as fp32 like certain sdxl models so the 0.1 is the difference between fp16 and q8 got it
Is that what it said? Like even if hallucinates a number Iโd like to see the logic for what factor it comes up with, like the LLM knows everything so the dependencies can be calculated internally from what it knows

