#🆕|sd3

1 messages · Page 107 of 1

noble coyote
civic trail
sacred jewel
sacred jewel
sacred jewel
noble coyote
sacred jewel
sacred jewel
#

Mechanical Insect LoRA

sacred jewel
sacred jewel
#

Dreamlike surreal digital style LoRA

#

Surrealism style LoRA

sacred jewel
#

Soviet Robot Hunters LoRA

sacred jewel
sacred jewel
fleet meteor
#

Hey bro how its going? Do you have a workflow? for cogvideo img2vid? Most of the workflows I´ve found are way complicated and they use LLMs for captioning the source image (I only wanna do text to video , I can caption it earlier)

sacred jewel
bitter hearth
#

the facial expression on the rock LOL

dusky thistle
dusky thistle
sacred jewel
sacred jewel
#

rogue giant robot LoRA

still lance
#

Create an eerie, gothic world filled with whimsical, exaggerated characters who inhabit twisted, shadowy landscapes. The scene should blend dark, muted colors with vibrant accents, capturing a sense of fantasy and isolation, while evoking both beauty and unease.

alpine summit
sacred jewel
#

Dali Flux LoRA

alpine summit
sacred jewel
sacred jewel
muted dove
muted dove
bitter hearth
#

feather

muted dove
muted dove
noble coyote
sacred jewel
signal shuttle
noble coyote
sacred jewel
sacred jewel
noble coyote
bitter hearth
sacred jewel
sacred jewel
#

Futuristic Display LoRA

sacred jewel
sacred jewel
sacred jewel
dusky thistle
sacred jewel
dusky thistle
sacred jewel
dusky thistle
bitter hearth
hallow lion
vague topaz
#

果冻蛋糕

noble coyote
#

Flux1.Dev.fp8 in PortraitMaster

hallow lion
#

👍

muted dove
noble coyote
muted dove
noble coyote
muted dove
noble coyote
limpid thunderBOT
#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
sacred jewel
sacred jewel
sacred jewel
cunning lintel
#

Schnell

sacred jewel
sacred jewel
sacred jewel
bitter hearth
sacred jewel
sacred jewel
sacred jewel
sacred jewel
low stone
fathom wharf
#

diamond painting

sacred jewel
outer blade
#

extraterestre control society

sacred jewel
sacred jewel
sacred jewel
noble coyote
open vigil
cinder junco
# sacred jewel

Any way you can share your prompts? They don't appear to be embedded in your uploads.

cinder junco
#

I like a lot of your stuff, so it would be great if you could turn on metadata somehow, lol. I don't know what tool you use.

#

But if you can't or don't want to, you could start with just the one I linked.

pseudo owl
#

https://pyramid-flow.github.io

New CogVideoX competitor, based on sd3 and produces pretty good videos. Only 2b and produces results similar to 5b CogVideoX.

errant dust
#

"Flux models now support the NVIDIA TensorRT software development kit, which improves their performance up to 20%. Users can try Flux and other models with TensorRT in ComfyUI."

#

"The Flux models’ dev and schnell variants were downloaded more than 2 million times on HuggingFace in less than three weeks since their launch."

bitter hearth
#

yeah was a couple of weeks ago

#

its been a decent speed up

mortal kite
noble coyote
#

HOW do we get away from you?

#

I see your motif is a monkey - is that what we'll become?

pseudo junco
#

@fossil pagoda

noble coyote
noble coyote
noble coyote
bitter hearth
low stone
noble coyote
#

I wonder if TensorRT for Flux will work on my 8Gb VRAM RTX 2070?

low stone
sacred jewel
#

My Rene Magritte LoRA

cinder junco
sacred jewel
cunning lintel
#

(schnell+lora)

civic trail
bitter hearth
sacred jewel
sacred jewel
bitter hearth
#

someone should just do it once and put it on hugging probably

sacred jewel
dusky thistle
dusky thistle
sacred jewel
#

PokeZom LoRA

bitter hearth
dusky thistle
sacred jewel
dusky thistle
sacred jewel
low stone
# sacred jewel

this guy needs to release a version of it without the zombies. would be great for general stuff

low stone
#

ran the same prompt against the stability sd3 ultra api. Not sure what's up with the subject doubling.

bitter hearth
#

always a risk when an edge is 1344+

low stone
dusky thistle
bitter hearth
#

never quite found out what Ultra actually is

#

its substantially better than SD3 Large

#

it has more high frequency detail which implies something like noise injection, noisy stochastic sampling or an upscale followed by a downscale

real terrace
#

So any news in the Flux world? I mostly used Schnell on Comfy, as Dev takes a lot to generate. Couldn't ever run the nf4 versions

#

I have a RX 6700 and Ubuntu

#

I'm sad that I cannot really test stuff like with SD XL because the generation time, even the loading time of the model

bitter hearth
#

not sure cos AMD

real terrace
#

Yes I read some improvement for Nvidia aparently

bitter hearth
#

yeah its good, it was 2 weeks ago though

#

nvidia reported late

real terrace
#

Flux Dev generation time is Interstellar time generation for me, at least usually results are great but I cannot play much with it

#

At least Schnell sometimes delivers amazing results, but it seems like Dev always has some twist

bitter hearth
#

some people prefer Schnell designs

real terrace
#

And I kind of lost interest in generating in SD XL as I presume I would get more interesting stuff with Flux

real terrace
bitter hearth
#

it takes a long time for tooling to get made, over a year
at the moment SD 1.5 and SDXL are strongest models

#

cos they have the most tooling

real terrace
real terrace
#

At the same time, some people make great images with whatever model, so, I don't know, but I know Flux follow prompts better and that's amazing, I'm tired of just praying in SD XL

bitter hearth
#

tooling as in software stuff

real terrace
#

Really in SD XL and with ip-adapters and ControlNEt, the posibilities are endless

bitter hearth
#

yeah at the moment there is so much to explore

real terrace
#

there are tons and tons of Flux loras, at some point I wonder if they should be integrated or something

bitter hearth
#

someone on this server had a go

#

they make checkpoints called "mangled merge"

#

they did a big one for SDXL and a newer, smaller one for flux as an experiment

real terrace
#

actually I haven't tried much as usually in my xp Lora needs a lot of tries and set up until they work as intended, if they are any good in the first place... so because long generation times, I don't want just to frustrated

real terrace
bitter hearth
#

it did, apparently

#

the SDXL one is cool but he said a few times the latent space gets wacky

sacred jewel
bitter hearth
#

there was a good realism one on reddit today

#

turns out you can outpaint a flux image using SD 1.5 and it will continue the image
even though SD 1.5 could not create that image from scratch

low stone
noble coyote
#

Flux_GGUF

hallow lion
turbid grotto
#

any news about sd3? sadcat

noble coyote
#

Cleopatra?

real terrace
#

You can try to do it with SD XL

#

I think it will generate better

#

just some tries, not perfect but better lighting already?

sacred jewel
noble coyote
sacred jewel
#

Purz Face Projection LoRA

noble coyote
#

Purz Neon Sign LoRA

sacred jewel
sacred jewel
#

Purz Dried Flowers LoRA

sacred jewel
#

Purz Neon LoRA

bitter hearth
#

this example worked a little better
this is 100% SD1.5 though

#

and one from SDXL

hallow lion
bitter hearth
sacred jewel
#

Purz VHS Box LoRA

bitter hearth
#

wow it learnt the layout really accurately

sacred jewel
real terrace
#

There was a user that posted SD 1.5 and the quality was impressive but the composition was like that

bitter hearth
#

it does have a higher amount of small objects than sdxl yeah

real terrace
#

yeah like these ones

#

I wonder if it is some kind of upscale

#

I guess

real terrace
bitter hearth
#

which is a great model

#

and then tiled upscale

sacred jewel
sacred jewel
sacred jewel
noble coyote
#

Purz Neon Sign Lora and GGUF_Flux

noble coyote
#

[Are we at 'banal' yet?!] 🥳

noble coyote
noble coyote
noble coyote
sacred jewel
#

Very cool

muted dove
muted dove
noble coyote
dull star
sacred jewel
#

DonM Illustration Styles LoRA

pallid summit
#

Imagine/ gourmet platter of hot dogs

hallow lion
# fleet meteor

It tastes funny. something happened to the meat when it went through.

#

Yes I haven't taught craziness to AI. something is always lost.

sacred jewel
#

Comic Book Vintage LoRA

sacred jewel
hallow lion
#

Dirk Lasermaster.

noble coyote
turbid grotto
sacred jewel
fleet meteor
steel beacon
#

I have slowly been working on a full Trump deck. 12 down. Many to go.

sacred jewel
short thicket
sacred jewel
short thicket
real terrace
sacred jewel
sacred jewel
real terrace
muted dove
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
noble coyote
sacred jewel
noble coyote
mortal mesa
bitter hearth
#

squeak noises

autumn arrow
#

Can anyone tell what model this fake restaurant is using?

pseudo owl
#

Kinda amazing I can do all of this in 1-step with schnell, nothing else.

prompts: “4-image grid”, “the word “schnell” made out of cake”, “gta5”

bitter hearth
#

1 step? wow

pseudo owl
#

Quantized too lol

bitter hearth
#

I tried 2 step and it was great

#

need to try 1 step now

pseudo owl
#

Yeah with 1-step, however anatomy is messed up, 2 step helps very much in anatomy.

sacred jewel
sacred jewel
bitter hearth
#

the massive blur is hard for other models

bitter hearth
#

no step sounds nice, you just get the image lol

#

I noticed with flux pro 1.1, the blur is even more

#

maybe it takes a strong model to understand blur

zenith hemlock
#

my own flux lora, fluxdev

short thicket
#

Working on 2 new versions of Mangled Merge for Flux. 'Matrix' which focuses on realistic loras and 'Magic' for 2d loras. 'Matrix is almost done. This image is with an additional 228 loras on top of the original 230 from v0, 55 more loras to go for this one and then I am going to work on Magic. I started merging 4 loras at a time in sets of 3 and then smoothing with 2 versions of the Della merge method that I created a couple weeks ago.

bitter hearth
#

Mangled Merge is a really interesting project

short thicket
#

Thank you. I'm gonna try finetuning on top of it once these two versions are complete.

bitter hearth
#

do you find it very different to base dev now?

short thicket
#

I haven't tested just yet, I want to try and get all of these loras merged first. But one thing I am finding different from v0 is that cartoons or painterly images don't work anymore. I can get plastic 3d rendered looking things but not paintings or illustrations. Of course I could be wrong. that's just my initial findings based off of 1 seed.

#

with realism, you lose some aesthetics too.

bitter hearth
#

hmm okay

#

are there benefits as well as losses?

short thicket
#

yes. I'll show you a couple instances... Mangled merge on the left vs dev on the right.

bitter hearth
#

its done a good job lowering the blur yeah

short thicket
#

MM on left Dev on right. Keep in mind it's still WIP needs some more smoothing. But the prompt was "anime,cyberpunk, A young girl with large, apprehensive eyes stands amidst a cacophony of disembodied, bloodshot eyes. Render this in a gritty, expressionistic style, emphasizing jarring color contrasts and impasto brushstrokes. Employ a worm's-eye view to enhance the feeling of being watched. The girl's face and the surrounding eyes should be the focal points, illuminated by an unseen, sickly green light source. The background recedes into an indiscernible darkness."

So anime basically goes out the window in a lot of cases.

#

Same here MM left Dev right. Painterly images is still doable.

#

seems to add signatures for painterly things. I'm not using any negatives for these.

bitter hearth
#

yeah the changes are pretty big here

short thicket
# bitter hearth yeah the changes are pretty big here

Yeah. I'm finding the Della merge method pretty interesting too. I coded 2 different versions, 1 that follows the paper closely and another that works better with merging models patched with loras. I don't think either are the greatest at merging loras in but I'm finding they work great for smoothing overfit models.

sacred jewel
short thicket
sacred jewel
sacred jewel
spring yew
#

promt: make a logo

runic tusk
noble coyote
noble coyote
#

Flux with PortraitMaster

noble coyote
muted dove
noble coyote
#

Cool!

muted dove
upbeat lynx
#

Can you make a hairstyle changer?

sacred jewel
limpid thunderBOT
#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

sacred jewel
sacred jewel
steel beacon
#

Anybody know if there's a way to have eye occlusion on Reactor?

#

If my model has glowing red eyes, it'd be nice to have face swapping with those glowing eyes

sacred jewel
sacred jewel
noble coyote
#

PortraitMaster+Flux

#

Prompt: "An Art Nouveau style storybook page depicting the Whispering Woods scene. The page features elegant, flowing borders with organic floral and vine patterns. At the top, the title 'The Whispering Woods' is written in a beautifully ornate script, surrounded by decorative elements. The scene shows the shadowy forest with graceful trees, fireflies lighting a winding path, and curling roots, all framed by the decorative border. The colors are soft and magical, with deep greens, silvery moonlight, and glowing gold from the fireflies. The layout feels like a page from an enchanted storybook, with a mixture of imagery and text space."

sacred jewel
sacred jewel
short thicket
sacred jewel
sacred jewel
sacred jewel
short thicket
languid prism
#

if you are a branding specialist, please imagine a image branding picture for Midea Industry Group, the picture is about good lifes in the city, should contains the elements of home appliance, air conditioners, new energy cars, e-bike, energy storage products, people are enjoying happy life created by Midea, the picture should reflect the company value of nature and sustainability

runic tusk
#

No.

winter tusk
#

Generate high resolution render output from this

sacred jewel
muted dove
sacred jewel
muted dove
sacred jewel
muted dove
sacred jewel
noble coyote
sacred jewel
muted dove
#

Teachers suspected of 'fleecing' students of their buzz

sacred jewel
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
sacred jewel
noble coyote
proper umbra
#

I have been trying for days to achieve the same results as XY on my local installation. However, I am failing.

I know that they use Stable Diffusion XL but nothing else. I have a sample prompt here.

"Create a cartoonish illustration of a muscular, anthropomorphic rat. The rat should have exaggerated, bulging muscles and a confident expression, resembling a bodybuilder. It should be holding a red dumbbell, showcasing strength. The rat should have large ears and a cartoonish face with a smirk. Incorporate a bright and bold color scheme with black as the background to make the character stand out. At the bottom of the image, add the text "GYM RAT" in a large, bold, red font that looks hand-draw"

Attached is the comparison.

Can anyone help me how to get a similar result locally? I have tried it several times and the results were always better with stablediffusionweb.

#

my settings

noble coyote
pseudo owl
#

https://nvlabs.github.io/Sana/

yeah, I think stability is kinda cooked if above is good as shown.

Sana is 0.6b parameters, competitive to flux dev, undistilled, supports 4096 res natively. 1 sec for a single img on a decent gpu.

noble coyote
rapid pivot
#

@craggy crest look dm catlurk

dull star
#

uses Gemma2-2B instead of T5-XXL

#

I would personally not put sana 1.6B so high up on the GenEval score, but for how small it is it really makes up for it.

#

if its really good to train for, PurpleSmartAI/Astraliteheart might love this

#

Idk how far they are into training for AuraFlow

pseudo owl
# dull star

Yeah I think flux dev is still the best but its amazing for it's size, undistilled, 4k native generation, and incredibly fast.

low stone
short thicket
cunning lintel
#

if this sana model handles a variety of styles similarly to as pixart, it could be fun to use, the demo images have this very noisy grainy look though, time will tell.

bitter hearth
#

I agree with their scores, for what it's worth. Lumina can get pretty mushy, and Sigma feels a bit below SDXL to me

#

SD3 is only bad if there is a person in the prompt

cunning lintel
#

Guess we value different things :p

#

SD3m outputs are just awful to my eye, boring full frontal photo's of whatever i prompt with no atmosphere at all

sacred jewel
bitter hearth
#

SD3m is overtrained on photos a bit

#

which would artificially boost its benchmark scores a bit

#

just like generic photos of a room

#

so models that are overtrained on that style do really well in benchmarks even if they can't do other styles

cunning lintel
#

yeah, and geneval seems automated object detection

#

says nothing abput coherence, sadly also noticable in those sana images, that word "fast" on the cat's sign, the letter T is halfway out of the sign 😬

bitter hearth
#

I took a longer look through the paper

#

its gonna be good I think

#

competitive scores on all of ImageReward, FID on MJHQ-30K, DPG-Bench and GenEval

#

and its 100 times faster than Flux Dev

short thicket
fleet meteor
#

Do anyone know how to run flux with vae and clip l included (I have to run t5 separated) in comfyui?

#

What I mean is , is there a way to load only the T5 model?

mortal mesa
#

there is a load clip, not the clip vision one

short thicket
fleet meteor
noble coyote
bitter hearth
#

if you only have a full checkpoint file

noble coyote
muted dove
noble coyote
icy drift
noble coyote
icy drift
noble coyote
bitter hearth
#

to a good extent I agree though

noble coyote
#

The Weight Family are nearing the end of their incarceration!

noble coyote
icy drift
icy drift
noble coyote
#

Yes, tested way back when Flux was released

icy drift
noble coyote
#

The detail added was very very fine

icy drift
noble coyote
bitter hearth
#

for the most part I just leave stuff as default for prompting (so I use both Clip-L and T5)
and then I feed Florence 2 node to it
I'm not much of a prompter

noble coyote
#

Mousy-cide

icy drift
noble coyote
icy drift
noble coyote
noble coyote
icy drift
noble coyote
#

If you plan and plan your prompts, you can get workable images with SD3 - but it mangles human anatomy. But with Flux, it's so much easier

bitter hearth
noble coyote
icy drift
noble coyote
icy drift
# noble coyote

It's like asking an out-of-shape guy to take off his shirt though... Come on dude. 😨

bitter hearth
#

did you test T5 with full text, with tags for clip?

#

I feel like your tags for clip might have been too long for clip to handle maybe

icy drift
bitter hearth
#

I tried as well some of the new fancy clip or clip long fine tunes
but I never got better results from them

icy drift
#

T-5 only null-test. Prompt:
In this RAW photo, A furry anthropomorphic hamster live-action anime girl is holding a sign above her head. The sign says "CLIP-L FINE DETAIL". She is wearing a denim jacket and sneakers. She is standing in contapposto. Her silky, glossy brown and cream-colored fur is highly detailed and shining in the warm sunlight. She is smiling widely, with sparkling eyes. Her denim jacket has a rich fabric texture. Her shoe laces are untied. She has long, wavy, glossy brown hair flowing down her shoulders. She is standing in the park on a field of grass and wildflowers. Fluffy white clouds drift through the blue sky overhead. The photo is taken on an antique polaroid camera and is extremely highly detailed.

#

Doing clip now with same prompt, then will try with a few short tags, and just check detail changes.
(Same seeds / settings etc. Only varying prompt for test.)
(Also I'm using one of those fancy Clip-L versions.)

#

No significant detail gain from just pasting same prompt into clip.
Trying short tags next.

#

clip tags: RAW photo, extremely highly detailed fur, hair, fabric texture, eyes, grass, wildflowers, clouds, film grain

#

Very definite and obvious detail gain!!! 🥳

#

That's fantastic.

#

Wonder if I have time to test a difficult prompt and a portrait...

bitter hearth
#

wow nice

#

yeah this matches my experience

#

Clip-L with maybe 6-10 tags is nice

#

on models with Clip-G, sometimes Clip-G can be good with just 3-4 tags

icy drift
#

These textures are really something else. I love it. Out of time though. Gotta go.

bitter hearth
#

okay bye, thanks for tests

icy drift
#

Also these were all with the 8-step hyper lora.

bitter hearth
#

in my testing this one I put before maxed out T5 ```(Photo:1.3) of a street in a city. There are taxis and lamp posts. There are bins and plants. There is a garbage can and a drain.

#

unless you need very specific things

icy drift
bitter hearth
#

haha yeah

#

some people get really good results from prompting

sterile pendant
muted dove
bitter hearth
#

its the Pixart team apparently

muted dove
muted dove
wary portal
#

SD3 still hasn't released better weights than SD3 medium?

muted dove
#

They've gone silent.

wary portal
#

Well with that upcoming Sana, they are probably panicking.

muted dove
#

Not heard of that one

cunning lintel
#

The more i look at those images the worse they become, if all you need is speed, maybe people will see value as some gimmick to run a model on phone or something, but it looks not so great to me

muted dove
#

Well, the SD3 release wasn't anything to celebrate either 🤷🏻‍♂️
I'd like to know if Stability still exists as a company and if they intend to continue releasing models. It's been complete silence since Flux released.

cunning lintel
#

I'm pretty sure there's still the intend to release something as it was teased in their fine-tuning guide which is kinda sort of official communication and maybe more official than lykon's hints on twitter, but how well SAI's new thing works is anyone's guess

#

I really had high hopes for this new pixart / sana thing 😥 but it's just "fast"

muted dove
#

It's too small to be any good...0.6B

noble coyote
#

Isn't there a 1.6B version too?

noble coyote
noble coyote
wary portal
noble coyote
#

^..^<

zenith hemlock
zenith hemlock
pseudo owl
# cunning lintel Nah, they're laughing, sana looks less then stellar :/ https://nvlabs.github.io/...

Its much much faster then sdxl, sd3. Pixart, and 105x faster than dev at 4k.

It can natively generate 4k images natively with a speed 105x faster than flux, it’s undistilled too. It will use way less vram then flux dev.

Dev does seem clearly better but you can gen 10+ images way faster then a single flux gen and probably get a better img then dev.

I have to say though, humans might be an issue tho, they don’t show them in many poses.

pseudo owl
noble coyote
dry wave
pseudo owl
noble coyote
dry wave
#

all images I saw so far were full of errors. Much worse than sdxl

young leaf
#

help

dusky thistle
#

wonder if SAI is going to actually release 8B or just wait for it to become obsolete :/

pseudo owl
noble coyote
young blade
#

has anyone heard anything new from Flux team aside from their blackforest twitter account?

dusky thistle
hallow lion
dusky thistle
#

these are the settings

rapid pivot
#

SDXL is a god at generating images thomas all we hear from it is true

#

no base model compares to "sdxl" lmao

#

(meme)

#

first time I see sana stuff

#

looks very fast, interesting stuff

bitter hearth
#

if Sana really is 100 times faster than dev

#

then we could do tiled upscale up to 8k

#

and then downscale to 1024

#

in the time it takes dev to do the image normally at 1024

stoic turtle
#

so are we still waiting for sd3 something or is that whatever was delayed here now

#

checking in after a few months

stoic turtle
#

the weights i think they were

#

those here yet

mortal mesa
#

nothing new yet, nothing known, please hold restructuring

noble coyote
rapid pivot
noble coyote
mental bison
#

Any news about SD3.1?

radiant ledge
mental bison
dry wave
#

They use linear attention, which is faster than normal attention, but also leads to inferior results. To compensate for that they add convolutional nets.

#

yes its fast, but it suffers from all disadvantages of this architecture

#

it is also a bit funny because they act like "we have a DiT which is super fast and does not need position embeds - nobody achieved that yet!"

#

although SDXL is using exactly such an architecture: convolutional backbone, no position embeds

#

SDXL is using real attention however

#

and that's what you see in the results. Its superior to Sana

#

If you want super fast results use Paella. its a convolution only architecture

#

its bad as Sana but its super fast

dry wave
bitter hearth
#

it might be possible to bootstrap T5 to it like Ella anyway

#

I sort of don't like judging image models by their text generation ability

#

I know some people need that, but it feels like a more niche usecase

pseudo owl
# dry wave its bad as Sana but its super fast

Did you check the images in the project page or arvix? Sana base beats paella by a very large margin and even sdxl in a few cases while being much smaller.

Maybe you are comparing it to fine tuned sdxl models, base sdxl is not very great.

pseudo owl
dry wave
#

lol, whats the difference of fine tuned sdxl models to sdxl base?

#

its same model architecture

pseudo owl
#

No I’m not talking about architecture, I’m talking abt image quality.

dry wave
#

yes, if I make a new model and train it on better data I can beat other models that are trained on bad data

#

I know

#

and I say: their architecture is probably not good

#

that's all i care about 🤷 They won't beat any other good image model with that. i would rather wait for a next PixArt then

gusty trail
#

sana is the next pixart.

dry wave
#

there is only one author shared between both?

#

anyways, seems a step in the wrong direction to me

bitter hearth
#

I would have much rather had the exact opposite yeah
a model that is 100x slower than flux but with better image quality

dry wave
#

I would be also fine with a Flux light

#

what SD3 is supposed to be when it ever comes out

#

or some new and cool architecture like MAR

bitter hearth
#

the alleged samples of SD 3.5 2B looked great

dry wave
#

I don't trust anything SAI post anywhere ^^° I will just wait until its released

bitter hearth
#

yeah its hard to know

#

Hunyuan-DiT is slowly improving as well, they might yield a good model

#

the latest Hunyuan-DiT is not even that bad, so long as you do a tiled upscale or progressive upscale

#

it does need multiple passes

pseudo owl
dry wave
#

yeah, thats already ridiculous to me. Replacing attention with linear attention makes quality worse. Thats a good sign that linear attention is just not good for this task

#

using convolutional resnets can compensate that. Yes. We know. You can replace the complete network with conv resnets (see Paella). You will just not reach the quality of DiT

#

you see the problem in all their example images. They are full of errors and lack any global coherence

#

also their choice of making the VAE compression rate larger is a mistake in my opinion

#

generating images from latent using a l2 loss makes images blurry and low quality. To get around that you need either a GAN or a diffusion model. I think VAEs are usually trained with a GAN-like loss to mitgate this issue. But that has its own downside. You just get into trouble if you make the compression ratio too large

bitter hearth
#

linear attention seems to be better for very small things, smaller than the size needed for 512x512+ diffusion models

#

the scaling trends for linear attention seem to be okay for small compute amounts but it then falls off a cliff

#

which makes it useful for certain things but 512x512+ diffusion models are maybe too demanding

pseudo owl
bitter hearth
#

I found that SDXL and SD 1.5 VAE still does well if your final image is 4k+ resolution
i.e. tiled upscale

#

then the VAE limitations are minimised

#

at 1024x1024 scale, the SD3/Flux VAEs are much better yeah

gusty trail
bitter hearth
#

if the rest of the model was good I would feel okay with SDXL VAE

#

not ideal but would be okay

#

a bigger problem with SDXL was the lack of ZTSNR, which pretty much every new model fixes

turbid grotto
bitter hearth
#

its probably dropping rly soon

#

I think people will be happy with it when it drops (despite lots of memeing)

#

the previews looked good

turbid grotto
#

I wouldn't mind if they trained for a bit longer if there is still benefit

turbid grotto
bitter hearth
#

they were on this server so they should come up in the search

cunning lintel
#

user revealedinadream_70414 claimed to have sd3.5 images, so search that username

noble coyote
turbid grotto
noble coyote
noble coyote
noble coyote
rapid pivot
sage burrow
#

Is there a new sd3 yet?

turbid grotto
#

8 months ago...
Which model made this? It is Flux level hands

#

Also, what happened to model from the paper, did it just disappeared or what? It looked awesome

turbid grotto
bitter hearth
#

Ultra pipeline in the API in particular is very good yeah

lucid swift
dusky thistle
#

yea that's basically like 8 years ago in AI years

#

seems like the plan is to just let 8B become obsolete, then release it

#

it was obviously ready to release 6+ months ago

fleet meteor
#

8B seems kinda small today that we got flux 😁 @dusky thistle

#

But tbh it looks like the smaller the model the more efficient somehow, look at pixart for example, or sd3 (which was badly trained but for its size it was good in some aspects)

dusky thistle
#

8b is big enough, i think

#

what flux has really done for us is put to rest the idea that we need models to be tiny for them to be adopted by the masses

#

no one cares about how small your model is

#

they can just quantize it

#

one of the arguments against releasing great/big models was also that we wouldn't be able to train loras

#

well, we can drop precision there too and still get good results, we can swap blocks and finetune 12b on a freakin 4090 (which i'm doing, it works)

sacred jewel
sacred jewel
rapid pivot
muted dove
river vine
#

Please generate a 1900*1200 sized wallpaper showing an artis representation of a neuronal network.

muted dove
sacred jewel
muted dove
sacred jewel
noble coyote
#

Flux_GGUF and Magritte LoRA

noble coyote
civic trail
civic trail
sacred jewel
sacred jewel
sacred jewel
sacred jewel
rapid pivot
sacred jewel
sacred jewel
sacred jewel
short thicket
short thicket
sacred jewel
civic trail
civic trail
#

Come and get your Balloon...

short thicket
wary scroll
#

m

sacred jewel
bitter hearth
#

I like how there is always a camera lense hidden somewhere now

#

its like an easter egg

sacred jewel
sacred jewel
sacred jewel
sacred jewel
sacred jewel
dusky thistle
sacred jewel
dusky thistle
short thicket
sacred jewel
wet rose
short thicket
#

Mangled Merge Matrix is complete. Magic is coming along.

dusky thistle
muted dove
noble coyote
#

DecadeTW Auto Prompt

noble coyote
#

"Super-Flux_GGUF"

#

From an idea by Olivio Sarkas

short thicket
noble coyote
short thicket
noble coyote
short thicket