#🆕｜sd3 | Stable Diffusion | Page 131

devout schooner Apr 13, 2025, 1:49 AM

#

there's definite bleed of unrelated data in 3.5 Medium

errant dust Apr 13, 2025, 2:22 AM

#

not sure which is which, but the first image is clearly the better one

#

The second one looks like a cartoon

devout schooner Apr 13, 2025, 4:49 AM

#

errant dust The second one looks like a cartoon

I really think you're being intentionally disingenous at this point joeshrug
you keeps skirting around every single one of the actual points I'm making

#

unless you really believe that everything should randomly be super grey and dull and very clearly have elements of literal paintings mixed in even when that wasn't asked for or desired in any way

#

neither base is "perfect" by any means but the left side one (3.5 Medium) looks WAY worse than the right side one (3.0 Medium)
the 3.5 one is a poorly resolved mess that looks more like the inside of a piece of particle board wood than any kind of metal

bitter hearth Apr 13, 2025, 9:36 AM

#

its hard to describe but there is this thing like "scratchy details" that can come in a variety of models and SD 3.5 can get it quite easily
if you used SD 1.5 and SDXL with PAG then its basically the thing that goes away when you increase PAG

#

I kinda understand what you mean by cartoon because the colours went bolder on the one on the right and some people prefer muted colours

#

I used to have the same tastes I recently started using CIELCh colors and I appreciate bolder colours more now

errant dust Apr 13, 2025, 9:46 AM

#

devout schooner neither base is "perfect" by any means but the left side one (3.5 Medium) looks ...

No, it doesn't. And it is you who fails to understand over and over my point about light. I'm a photographer. Light is everything. Candle light, overcast, or sunlight will completely change the colors, the contrast and so on.

bitter hearth Apr 13, 2025, 9:59 AM

#

when they test people's personal taste about images what they find is that people's personal taste varies so so much

#

so IDK how much its worth debating matters of taste

#

its partly why I thought the big "battle" between SD 3.5 and Flux was a bit silly

#

if you preferred robust+clear object outlines and a stylized aesthetic then you would prefer Flux, if you were not bothered by some degenerated object outlines and you preferred a high detail natural aesthetic then you would prefer SD 3.5

#

there were like 100+ arguments about SD 3.5 vs Flux in the community but it just comes down to that taste difference really

devout schooner Apr 13, 2025, 11:29 PM

#

errant dust No, it doesn't. And it is you who fails to understand over and over my point abo...

I mean I strongly disagree but ok
if I scrounge up more examples I'll post them
this is IMO a very real problem SD 3.5 Medium (and to some extent Large) has even when the prompt mentions absolutely nothing about light
it aggressively wants to make everything dull and gray (and painterly), all the time
and make the entire image appear as though seen through inexplicable, excessive amounts of fog or smoke
in a way that is not at all desirable

devout schooner Apr 13, 2025, 11:32 PM

#

bitter hearth if you preferred robust+clear object outlines and a stylized aesthetic then you ...

putting everything we're talking about here aside I think it can't be stated enough that in fact Flux basically single-handedly invented and originated the "plastic skin CGI look"
for all realistic outputs
that all models since have done
stock Flux will never ever produce an image that looks like this 3.5 Medium one (meaning, just like, normal) under any circumstances
for any prompt like
a close-up portrait photograph of a young woman's face, focusing on her facial area from the eyebrows down to the upper lip. She is 18yo and has freckles. Her skin has a smooth, glossy texture from her makeup. She has dark brown eyes, framed by thick, dark, well-groomed eyebrows. Bold red eyeshadow extends from the inner corner of her eyes to the outer corners.

#

but revisionists want to insist nowadays that the way Flux looks has been how all models look
and that it's some kind of unavoidable problem
which obviously it is not

#

this is a big part of why I'm not particulaly amped about HiDream
needing to use the full 17 billion parameter version to even just get normal-looking realism is a bit silly

#

and API-only models have definitely taken into account what I'm saying
like Reve does not produce CGI-esque Fluxy realism
nor does Ideogram
and so on
it's just this recent stream of extremely similar-to-Flux open source ones that seem to be trained by people who think being utterly incapable of authentic realism is desirable

mortal mesa Apr 14, 2025, 12:21 AM

#

surface hasnt been scratched with HiDream yet, but ya its a chonker, still might have some pleasant surprises

bitter hearth Apr 14, 2025, 1:28 AM

#

```early JuggernautXL though lol

#

did that look super early on

#

I never rly found stock Flux useable, as I was saying the other day I never jumped from RealvisXL to flux until RealvisSchnell came out

#

and a bunch of photography+lighting loras

#

I also refined each flux image with RealvisXL or Realvis for SD 1.5

split bramble Apr 14, 2025, 1:36 AM

#

I don't know if I've seen RealvisSchnell.

bitter hearth Apr 14, 2025, 1:41 AM

#

best things always have zero hype
its like universal rule

#

there are better checkpoints these days though, at the time it was unrivaled IMO

#

it loses to properly de-distilled checkpoints as an example

devout schooner Apr 14, 2025, 2:18 AM

#

bitter hearth ```putting everything we're talking about here aside I think it can't be stated ...

Turbo or Lightning XL models always had that issue (as well as any merge that was ever trained on AI generated images erroneously captioned as "photos")
but not to the same extent as Flux

devout schooner Apr 14, 2025, 2:39 AM

#

mortal mesa surface hasnt been scratched with HiDream yet, but ya its a chonker, still might...

I really haven't formed an opinion on it
all I have discerned is that the Dev version often looks laughably bad even in comparison to stock Flux Dev on the same seed and prompt
that's the verbatim "Juggernaut Lady" prompt lol, HiDream on the left, Flux Dev on the right

#

HiDream looks more like Flux Schnell (at best) here

#

literally nobody wants shit that looks like this though
this is the number one complaint everyone has always had about Flux
and numerous loras were trained specifically to get rid of it (including by me lol)
like nobody is looking at this stuff and being like "yep, perfect"
I don't know who is convincing all these recent model trainers that this ridiculous not-actually-realistic-at-all bokehmaxxed style is desirable to anyone

#

like just make an open source model that looks normal but also doesn't have any amount of weird ass coherency and noise resolve issues
why is this so hard KEKW

mortal mesa Apr 14, 2025, 2:51 AM

#

data issue

devout schooner Apr 14, 2025, 2:53 AM

#

mortal mesa data issue

it's probably partially that but it's definitely also caused by people deciding that their model NEEDS to be 12B to 17B parameters for some unexplained, unclear reason
and then aggressively distilling and DPOing them down as a result of that to make them runnable for average users

#

there's no way any of these models NEED that many parameters
like the practical difference between SD 3.5 Medium and Full HiDream DEFINITELY doesn't amount to a "14.5 billion parameter" difference
not even remotely close

mortal mesa Apr 14, 2025, 3:01 AM

#

got alot of bang for the buck with quantity for a while now, and ya ide hope it shifts

devout schooner Apr 14, 2025, 3:12 AM

#

devout schooner I mean I strongly disagree but ok if I scrounge up more examples I'll post them ...

BTW using ClownsharkBatwing's exponential samplers doesn't necessarily help with the greyness problem in 3.5 Medium vs 3.0 Medium
but it does DRASTICALLY improve the resolve of lines, in general
much cleaner

devout schooner Apr 14, 2025, 3:55 AM

#

mortal mesa got alot of bang for the buck with quantity for a while now, and ya ide hope it ...

eh
I still just don't really get why people are so aggressively pushing HiDream
I even got into a truly bizarre argument with the simpletuner guy yesterday where he posited that LLAMA not being a "Chinese" text encoder was somehow a significant contributor to the reason
I'm still not even sure what exactly he really meant by that
or how that would plausibly be relevant to any average user lol

bitter hearth Apr 14, 2025, 8:05 AM

#

devout schooner Turbo or Lightning XL models always had that issue (as well as any merge that wa...

ah yeah turbo and lightning always seemed to lose some softness or blur ability

#

going from flux to flux turbo to schnell does the same

#

something about distillation makes you lose soft lighting and blur

dry wave Apr 14, 2025, 8:05 AM

#

HiDream could have been the model that replaced Flux (which is only available distilled).
But for me the model is way too parameter inefficient

#

I disagree with the claim you would not need many parameters, though.

#

I was sceptical myself first, but Flux is just so much smarter than smaller models

bitter hearth Apr 14, 2025, 8:06 AM

#

snapchat made a 0.3B image model that looks about as nice as any to me

#

oh it won't be as smart yeah

dry wave Apr 14, 2025, 8:07 AM

#

looking nice does not mean being smart

#

look at the ChatGPT model what you can reach with enough parameters

bitter hearth Apr 14, 2025, 8:07 AM

#

I know what you mean by Flux being smarter yeah
it fixes stuff during inpaints and upcales etc
that does seem to require more paramaters

dry wave Apr 14, 2025, 8:09 AM

#

I would like to have a new flux with stronger text encoder and more efficient parameter use (e.g. adaln parameter sharing)

bitter hearth Apr 14, 2025, 8:09 AM

#

I've been quite happy with tiny models for tiled upscale, like this one https://huggingface.co/cqyan/hybrid-sd-224m its SD 1.5 but squished to 224m so like a two-thirds size reduction

#

IDK if I am that bothered about parameter efficiency like
SVDquant Flux FP4 with 8-step turbo lora is rly fast on 5090 servers

#

so like without even mentioning B200s it can be fast on domestic 5090s even

dry wave Apr 14, 2025, 8:11 AM

#

it's not so much about speed but about using parameters where they help most

#

like one big jump from sd1.5 to sdxl was that they removed the transformers in the first block because - surprise - they haven't done anything, and put them into the middle block

bitter hearth Apr 14, 2025, 8:12 AM

#

yeah that first block in SD 1.5 is a big pain cos it makes SD 1.5 slower than SDXL at high res

#

which is crazy

#

when I use SD 1.5 I always use Modified Shifted Window Multi-head Self-Attention cos it fixes that issue
but then that makes it no longer work easily with torch.compile
its a mess

#

there is a "fix" which is to make custom CUDA kernel for tiled SD 1.5, which is kinda one of my current projects lol

turbid crane Apr 14, 2025, 11:53 AM

#

#🆕｜sd3 create a beautiful garden view from top

turbid grotto Apr 14, 2025, 6:36 PM

#

bitter hearth I've been quite happy with tiny models for tiled upscale, like this one ``https:...

hi, does it require external script to run? no comfy compatibility?

bitter hearth Apr 14, 2025, 6:41 PM

#

I asked comfy about supporting it but no dice

turbid grotto Apr 14, 2025, 6:43 PM

#

understood!

devout schooner Apr 14, 2025, 9:04 PM

#

dry wave I was sceptical myself first, but Flux is just so much smarter than smaller mode...

Flux Pro Ultra shows what it can really actually do well IMO
but the distilled versions (Dev / Schnell) are so aesthetically DPOed to hell that it kinda noticeably harms prompt adherence a lot of times I've found
even in comparison to like SD 3.5 Medium sometimes
which of course uses the same text encoder ultimately

devout schooner Apr 14, 2025, 9:05 PM

#

bitter hearth I know what you mean by Flux being smarter yeah it fixes stuff during inpaints a...

it does have very clean "resolve" yeah
I'm not sure that's related to parameter count entirely though

hallow lion Apr 14, 2025, 11:05 PM

#

So is this the hidream channel for the time being?

#

We have a new kid on the block

jagged gate Apr 15, 2025, 1:45 AM

#

queen edge Apr 15, 2025, 3:20 AM

#

Close-up view of a corner connection for stacked shelves (thin galvanized steel rectangular tube). The top surface of the lower shelf corner has small metal blocks welded to form a square locating pocket or fence. The upper shelf has a 10cm tall square spacer foot welded underneath its corner. This foot fits neatly inside the locating pocket/fence on the lower shelf. Show the 10cm separation created by the spacer foot. Detailed, metallic, industrial design, 3D render.

bitter hearth Apr 15, 2025, 6:56 AM

#

devout schooner it does have very clean "resolve" yeah I'm not sure that's related to parameter ...

parameter count and "abilities" are a super loose link yeah

#

but when I go from 0.2B pruned SD to 30B stepfun I do see an increase in "smartness"

#

even though they are fundamentally the same type of network

#

the 15,000% increase in parameter count gives some benefits

#

having said that, its amazing how well 0.2B keeps up with 30B

#

0.2B pruned SD is perfectly fine for tiled upscale and other tasks like that

dusky thistle Apr 16, 2025, 5:15 AM

#

safe creek Apr 16, 2025, 1:15 PM

#

Is the training set for SD3 or SDXL disclosed?

bitter hearth Apr 16, 2025, 1:47 PM

#

sadly no

dusky thistle Apr 16, 2025, 4:04 PM

#

raven fern Apr 17, 2025, 12:25 AM

#

@dusky thistle hahaha it's been a long while since i was kinda active on this server, im happy to see you are still doing some clownshark pics 🙂

#

did you try some with HiDream?

dusky thistle Apr 17, 2025, 12:26 AM

#

raven fern did you try some with HiDream?

yeah, just tested and confirmed res4lyf is working with it

raven fern Apr 17, 2025, 12:26 AM

#

😮

#

nice

dusky thistle Apr 17, 2025, 12:26 AM

#

gotta get all the attention masking working so i'll have to modify the model code

raven fern Apr 17, 2025, 12:27 AM

#

i actually never tried HiDream yet, will try it tonight, it seems most people use the Dev version?
should I go with that one?

dusky thistle Apr 17, 2025, 12:29 AM

#

no idea

#

i'm trying full atm

raven fern Apr 17, 2025, 12:33 AM

#

yea i will hopefully upgrade my PC this summer so I can enjoy all the good stuff instead of relying on quants or
compromises

icy drift Apr 17, 2025, 8:02 AM

#

raven fern i actually never tried HiDream yet, will try it tonight, it seems most people us...

I recommend full version. It's the only one with negative prompt, same size and works fine at 30 steps just like dev. Download the fp8 model from Comfy's HF then run in fp8_e4m3fn_fast if supported.

#

icy drift Apr 17, 2025, 8:20 AM

#

YES. Absolutely nailed it. Paws are so perfect. (This is the best of maybe 20; kept cancelling run halfway through based on preview.) Just realized the basketball is wrong though...

#

Huh. Looks like it almost never gets a basketball right.

#

Great at soccer balls. A little iffy overall.

icy drift Apr 17, 2025, 8:43 AM

#

It usually seems like just a slightly better version of Flux.
But then you give it a 12-constraint prompt like this and it's just like, "Yeah, I got all that." And it nails all 12 constraints every single time (with accurate hands). It really is way more powerful / intelligent than Flux, just not more knowledgeable.
A male elf with braided silver hair and green skin is wearing a purple toga, standing on the shell of a giant turtle. In his left hand, the elf is holding an intricately etched golden staff. In his right hand, the elf is holding a slice of meat lover's pizza. The scene is cinematic moodily lit under an overcast sky.

#

By default, it's super consistent seed-to-seed; always the same pose. But you can use the normal partial-denoise two-pass to get as much creative variation as you want.
I'm trying hires 2x now, at 2496 height which is > 2048 the original repo maxed out at.

#

Aww. Errored out. No >2048 resolution maybe?

#

LOL rifle spear cartoon? Hires 1.5x no obvious artifacting, but hard to tell because of the print texture. Gotta specify the style.
I'm amazed at the fingers and toes. I need to try a hands-and-feet prompt.

#

It probably hurts the performance that we can't do separate prompts for the different text encoders, but the ComfyUI native nodes definitely fixed the banding problem that the custom node had.

Hmm. That is some jaw-dropping texture detail, although the overall luma / chroma is wonky. Lemme see if I can fix it.

#

Is Flux actually better at architecture solidness / symmetrical objects?

#

#

Getting too close to the original pose again though.

raven fern Apr 17, 2025, 9:41 AM

#

@icy drift but the true test are tcg cards haha 🙂

icy drift Apr 17, 2025, 9:44 AM

#

raven fern <@944640559878930532> but the true test are tcg cards haha 🙂

I couldn't find any YuGiOh in its training data. 😕
Very good with text prompt adherence.

raven fern Apr 17, 2025, 9:45 AM

#

yea

icy drift Apr 17, 2025, 9:47 AM

#

Details on staff, necklace, and leather obviously AI. 2nd-pass needs 1.0 CFG or colors will blow out.

#

#

The preview shows this weird ping-pong effect like the model keeps trying to shake things up during generation. I haven't read the technical report if there is one, so I don't know how / if it differs from Flux.

#

That's one shiny card.

#

Printable center design no problem.

#

Got background art.

#

icy drift Apr 17, 2025, 10:30 AM

#

I give up. It just can't do reflections. It may have been trained on synthetic reflection data that was wrong.

#

It's the best teeth-brushing model yet though.

#

I think it's main real power is subject-inclusion constraint following. You can add tons of stuff into an image, and it will all show up.
It can't seem to do before/after same-person generations either. This model just has a really vague / fuzzy understanding of identities, materials, and structures.
Hmm. So why is it so good with hands (and presumably limbs from other peoples' tests)?

dry wave Apr 17, 2025, 10:51 AM

#

icy drift The preview shows this weird ping-pong effect like the model keeps trying to sha...

it's basically flux and then they added a lot of text encoders in a very inefficient way

sand osprey Apr 17, 2025, 3:09 PM

#

生成一只兔子的卡通形象

rustic bramble Apr 18, 2025, 4:17 AM

#

Anyone worked with mamba backbones for diffusion?

quartz hamlet Apr 18, 2025, 7:00 AM

#

High-detail map of North Africa, Morocco highlighted including Western Sahara, vintage parchment background, deep red and gold tones, cinematic lighting, ultra-realistic, no text, 16:9 aspect ratio

bitter hearth Apr 18, 2025, 8:17 AM

#

rustic bramble Anyone worked with mamba backbones for diffusion?

ye we have linearised dit also

#

which is even faster

dry wave Apr 18, 2025, 10:32 AM

#

and works really bad in my opinion

bitter hearth Apr 18, 2025, 10:35 AM

#

lol I remember you didn't like sana yeah

#

it makes an ok potato

dry wave Apr 18, 2025, 10:41 AM

#

I think linear attention is not so important/critical for image generation. Maybe I'm lacking imagination, but why would you want to scale up diffusion to millions of tokens? I can only see two scenarios: first, generating huge resolution images like 8k. But honestly, I think before you come up with a better attention mechanism, I would rather come up with a smarter upscaling technique. second, having a long conversation with multiple images (like omnigen or the current chatgpt). The latter might be interesting, but I think its currently more realistic to have a strong diffusion method that can be conditioned on one or very few images; that would be still possible with normal quadratic attention

#

(I also don't think that mamba and Co have a future in large language models to be honest. Might be wrong here, but my intuition is rather that memory models are the future)

bitter hearth Apr 18, 2025, 10:42 AM

#

oh generating to huge resolutions is like

#

the only thing I do lol

#

there is a project called CLEAR that nearly linearised Flux attention using a sliding window

#

it gives a 600% speed boost or so at 8k

#

flux is really nice at those higher resolutions especially if you can pass the 16k mark

#

its like getting a model from the future

#

the lighting goes so nice and soft

dry wave Apr 18, 2025, 10:44 AM

#

even than I would first generate in low res with quadratic attention and then upscale to higher res with something else

#

although sliding window is not "linear attention" to me xD But yeah, its almost linear in time I get that

bitter hearth Apr 18, 2025, 10:45 AM

#

dry wave even than I would first generate in low res with quadratic attention and then up...

oh I think it does do this

#

yeah it was not quite linear time

#

so if the resolution got high enough it would still start to scale in an unfriendly way

#

but it got pretty high

dry wave Apr 18, 2025, 10:47 AM

#

damned, its always hard to google for flux stuff xD

bitter hearth Apr 18, 2025, 10:48 AM

#

https://github.com/Huage001/CLEAR

GitHub

GitHub - Huage001/CLEAR: Official PyTorch implementation of paper "...

Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". - Huage001/CLEAR

dry wave Apr 18, 2025, 10:50 AM

#

this looks really cool

#

but the memory consumption makes it probably still hard to generate more than 2k with it X_x

bitter hearth Apr 18, 2025, 10:52 AM

#

yeah

#

gotta use those thailand h100s

dry wave Apr 18, 2025, 10:53 AM

#

"The reason for these phenomenons is that pre-trained
DiTs, such as FLUX, rely heavily on local features to man-
age token relationships. To validate this, we visualize atten-
tion maps in Fig. 4 and observe that most significant atten-
tion scores fall in the local area around each query."

#

that's exactly what I hate at these DiT architectures X_x

#

why did they not simply add both, sliding window attention and global attention, in a 90:10 ratio or so

bitter hearth Apr 18, 2025, 10:59 AM

#

if you want a more modern / smarter version
thunderkittens team did sliding tile attention kernel for hopper

violet escarp Apr 18, 2025, 3:09 PM

#

bitter hearth lol I remember you didn't like sana yeah

I think that's the conv in the model
linear attention is really bad

bitter hearth Apr 18, 2025, 3:21 PM

#

the quality hit is fairly big yh

rustic bramble Apr 18, 2025, 3:25 PM

#

bitter hearth ye we have linearised dit also

Hmmmmm Aren't transformers inherently quadratic? Is this recent work? I was reading about vision mamba/ jamba hybrid backbones from last year

bitter hearth Apr 18, 2025, 3:25 PM

#

there's been all sorts of attempts to stop that quadratic scaling

#

I don't think it would help for me to link to a specific paper you need to read the whole literature really

#

like including RNN and GRU

#

its a frontier-level topic so its not rly something that can be distilled down into a small summary

rustic bramble Apr 18, 2025, 3:27 PM

#

Hmm

#

I thought that mamba was the fastest sequence modeling framework...

bitter hearth Apr 18, 2025, 3:32 PM

#

fastest is a loaded term cos

#

its hard to compare apples to apples

#

remember there are also non-deep methods

#

naive bayes, XGBoost, random forest etc

#

or just like standard panel data multivariate regression difference-in-differences models

#

or something like a dynamic stochastic general equilibrium model where you are essentially parametrising partial differential equations

dry wave Apr 18, 2025, 4:08 PM

#

Mamba got a lot of attention when it was published, but it turned out it just doesn't work as good as transformers. Nobody is using mamba-only architectures anymore. However, there are mamba-transformer hybrids in use

#

but there are million ways of speeding up transformers.

#

methods like mamba try to find a completely new architecture to solve the problem. But there are also just tweaks and fixes that can speed up everything sufficiently

#

for example: most of the time you don't need to attend everything to everything. So instead of removing all your attention layers from the model and replacing them with, e.g. linear attention, mamba, whatever, you could also just replace half of them or 80% of them. You could also use a unet like architecture and use transformers only in the middle layer (where they are cheap) and use other methods like neighbourhood attention in all other layers. There are so many ways of speeding up things which are not explored yet

bitter hearth Apr 18, 2025, 4:33 PM

#

transformers keep getting faster and faster with 6d parallelism, kernel search methods, better distilling etc

#

so the need for mamba or linear attn is less and less

#

hybrids can be ok yeah

icy drift Apr 18, 2025, 4:56 PM

#

sullen moss Apr 18, 2025, 5:25 PM

#

icy drift

HiDream ?

icy drift Apr 18, 2025, 6:30 PM

#

sullen moss HiDream ?

With Flux. I can't get HiDream alone to give me clear images at all for some reason. I'm using the recommended settings, and the quality is just bad.

sullen moss Apr 18, 2025, 6:31 PM

#

#

icy drift Apr 18, 2025, 6:32 PM

#

sullen moss

How does it handle architecture? (E.g. banister railings.)

sullen moss Apr 18, 2025, 6:35 PM

#

icy drift How does it handle architecture? (E.g. banister railings.)

Give me a prompt to test.

icy drift Apr 18, 2025, 6:36 PM

#

sullen moss Give me a prompt to test.

Just finished typing up a prompt and testing now with new settings.
The photo is taken from the bottom of a winding staircase in the interior of a suburban home. At the top right of the image, a woman is standing at the top of the stairs, leaning over the railing and looking down. The scene is a brightly lit interior.

#

Look at her wonky fingers and optical-illusion infinite stairs with super crazy disjointed banisters everywhere. It always comes out like this.

#

Here it is cleaned up with Flux at 0.61 denoise. I should've used 0.67 for architecture, but it did its best to fix the mess. (Of course, Flux can't actually follow this prompt if you try it just straight up. It can't get the camera position right.)

sullen moss Apr 18, 2025, 6:40 PM

#

icy drift Apr 18, 2025, 6:41 PM

#

sullen moss

Yep. Look at her fingers and the crazy stairs.

sullen moss Apr 18, 2025, 6:42 PM

#

sora

icy drift Apr 18, 2025, 6:42 PM

#

sullen moss sora

Wow that's great prompt adherence (same architecture problems, but I remember it being crazy fast). What version of Sora is this? Where can I get it?

sullen moss Apr 18, 2025, 6:43 PM

#

https://sora.chatgpt.com/

Sora

Transform text and images into immersive videos. Animate stories, visualize ideas, and bring your concepts to life.

icy drift Apr 18, 2025, 6:43 PM

#

Oh LOL nevermind. I was thinking Sana. I don't care about closed source stuff.

bitter hearth Apr 18, 2025, 6:43 PM

#

need enterprise deal to get access to the good sora sadly

#

these are sana

#

love sana so much

icy drift Apr 18, 2025, 6:44 PM

#

Okay, then full critique on that trash. The banister sweeping in from the top-left suddenly ends mid-air, clipped off by the right banister. She has no hands.

icy drift Apr 18, 2025, 6:45 PM

#

bitter hearth love sana so much

Yeah, that's what I remember from Sana. Nowhere near HiDream or Flux sadly.

sullen moss Apr 18, 2025, 6:45 PM

#

bitter hearth these are sana

🤪

bitter hearth Apr 18, 2025, 6:45 PM

#

ye its not flux level at all

#

#

it can do text

#

but I could not get "the" or "jedi"

icy drift Apr 18, 2025, 6:46 PM

#

bitter hearth it can do text

Definitely. Nothing out there can match 4o's text right now.

bitter hearth Apr 18, 2025, 6:46 PM

#

is ok for sci fi

icy drift Apr 18, 2025, 6:46 PM

#

But Flux can do full book covers too, with title and author title, in whatever font you want, with amazing kerning and composition.

bitter hearth Apr 18, 2025, 6:47 PM

#

I think we need separate foundation models for text anyway really

#

cos it feels like a very specialist task

#

I like having separate models for stuff

#

I don't make big images often these days but when I used to, I would chain together like 20 different models, some as regions and some as upscale passes

sullen moss Apr 18, 2025, 7:15 PM

#

icy drift Apr 18, 2025, 7:42 PM

#

sullen moss

Flux?

sullen moss Apr 18, 2025, 7:43 PM

#

Dream

icy drift Apr 18, 2025, 7:43 PM

#

sullen moss Dream

How is the leather on her belt so consistent / solid looking? I get a blobby mess every time. 😦

icy drift Apr 18, 2025, 9:10 PM

#

knotty axle Apr 19, 2025, 12:34 PM

#

#🏞｜general-with-images pikachu

jaunty basin Apr 19, 2025, 8:30 PM

#

Comic-style wide shot of a dark alley, with the polar bear and the little man in the black hoodie locked in a tense face-off. The bear stands on two legs, wearing a red scarf and brown newsboy cap. The man is frozen in place as the bear’s massive shadow stretches over him, cast by stark moonlight. Mood: fatal finality. Use silhouetted framing with sharp light-dark contrast, emphasizing the bear’s dominance and the gravity of the moment. Let the moonlight carve out dramatic shapes in the alley, heightening the cinematic tension.

surreal nova Apr 20, 2025, 1:25 AM

#

Generate an image…. What the hell am I doing? How do I create images on here?

#

Or is this just a chat for people to talk about?

dry wave Apr 20, 2025, 11:13 AM

#

surreal nova Or is this just a chat for people to talk about?

yes

#

there is a bot for generating images in a different channel, but it's not for free. This discord is about open source models, so people generate the images on their own graphic cards.

lucid roost Apr 20, 2025, 5:44 PM

#

"how to draw a cartoon elephant, step-by-step guide, 6 panels, each step building progressively -- step 1: simple head shape -- step 2: add trunk and eyes -- step 3: connect body to head -- step 4: draw legs and tail -- step 5: complete the full body with outlines -- step 6: fully colored elephant in soft pastel colors, clean vector art, minimal background, kids tutorial style, high resolution"

polar coral Apr 21, 2025, 5:02 AM

#

"A normal-looking public charging port (at an airport/mall) with a hidden microchip inside. A close-up shot shows the chip's LED faintly glowing as soon as a user connects their phone, with a 'Data Transfer' animation flashing on the phone screen. In the background, binary code streams and a hacker's hand is partially visible."

digital valley Apr 21, 2025, 8:17 AM

#

dream

#

Dream

#

(8k game sprite), (front view), pixel art, office worker,
(stressed face), (messy hair), (glowing computer screen reflection on glasses),
(untucked shirt), (coffee stain on tie), (holding smartphone under desk),
(cubicle background), flat shading, muted colors,
(comedy elements: tiny cactus with "F**k Work" sign),
art by Scott Benson, inspired by "Don't Starve"

pale blade Apr 21, 2025, 11:57 AM

#

Нужно в первой иллюстрации добавить ещё одну колонку с персонажами слева от основной. В новой колонке должны быть пустые слоты для абордажников. Затем весь интерфейс нужно выровнять.

quick lava Apr 21, 2025, 1:49 PM

#

generate image

gusty wing Apr 22, 2025, 5:09 AM

#

#🏞｜general-with-images make a smart room light automation design --aspect 9:16

craggy crest Apr 22, 2025, 5:32 AM

#

craggy crest Apr 22, 2025, 5:51 PM

#

cinder junco Apr 23, 2025, 12:49 AM

#

icy drift Apr 23, 2025, 10:33 AM

#

In a futuristic white room, on the right an android is floating in a green tank, and on the left a woman is standing at a control panel. The green tank is a tall cylinder of aquarium glass filled with fluid. The android inside the tank is floating weightless above the floor, with head tilted back with eyes closed, and with arms spread. On the left, the woman standing at the control panel is facing left and looking down at the panel. Her hands are on the control panel. The woman at the control panel is wearing a white lab coat. The control panel is a blue and white futuristic LCD screen. The room is futuristic and white, with panels and lines. The scene is brightly lit.
Yep, that's some absolutely amazing composition and constraint following from the prompt. Blows Flux out of the water in that regard. It can't do the solid-looking architectural structures (like the wobbly lines on the vent overhead or the mangled lines on the screen / panel / her fingers), but the prompt adherence is just so useful. It just needs low-denoise hires with Flux to fix the mistakes.

crimson hull Apr 25, 2025, 1:14 AM

#

In a futuristic white room, on the right an android is floating in a green tank, and on the left a woman is standing at a control panel. The green tank is a tall cylinder of aquarium glass filled with fluid. The android inside the tank is floating weightless above the floor, with head tilted back with eyes closed, and with arms spread. On the left, the woman standing at the control panel is facing left and looking down at the panel. Her hands are on the control panel. The woman at the control panel is wearing a white lab coat. The control panel is a blue and white futuristic LCD screen. The room is futuristic and white, with panels and lines. The scene is brightly lit.

crimson hull Apr 25, 2025, 1:14 AM

#

cinder junco

wet kayak Apr 25, 2025, 5:36 AM

#

Hello @Team,

I’m currently working on generating a Toy Starter Pack-style image using the following API:
"https://api.stability.ai/v2beta/stable-image/generate/sd3"

However, I’m not getting results that align with the attached reference image. I’ve included the prompt in the appropriate section and tested with various "seed" values, but the output still doesn’t meet expectations.

Could you please advise:
If this is the correct API to use for achieving this specific style?
If there are particular parameters or configurations I should be adjusting to improve the results?

Your guidance would be greatly appreciated.

sage burrow Apr 28, 2025, 9:34 AM

#

wet kayak Hello @Team, I’m currently working on generating a Toy Starter Pack-style image...

I recommend gliff

craggy crest Apr 29, 2025, 4:32 AM

#

craggy crest Apr 29, 2025, 5:21 AM

#

tulip fern Apr 29, 2025, 12:22 PM

#

Can someone help me with this?

muted cargo Apr 29, 2025, 12:37 PM

#

tulip fern Can someone help me with this?

output resolution is too low.

#

sd3 is trained for 1024x1024

tulip fern Apr 29, 2025, 1:01 PM

#

muted cargo sd3 is trained for 1024x1024

Okay ill change

tulip fern Apr 29, 2025, 5:09 PM

#

muted cargo sd3 is trained for 1024x1024

Its still same

dry wave Apr 29, 2025, 5:12 PM

#

dunno what schedule type automatic means, but rectified flow works simply with the Euler Sampling method

tulip fern Apr 29, 2025, 5:14 PM

#

dry wave dunno what schedule type automatic means, but rectified flow works simply with t...

Should I change it to anything else?

dry wave Apr 29, 2025, 5:14 PM

#

Euler and normal are the settings in comfyui

tulip fern Apr 29, 2025, 5:21 PM

#

dry wave Euler and normal are the settings in comfyui

dry wave Apr 29, 2025, 5:26 PM

#

Sampling method: Euler
Schedule Type: Simple

tulip fern Apr 29, 2025, 5:36 PM

#

dry wave Sampling method: Euler Schedule Type: Simple

Nah its the same

#

I'm still gettting image same as previous

weary crystal Apr 29, 2025, 6:39 PM

#

tulip fern Can someone help me with this?

Is this auto1111? if so i am not sure if it supports sd3.5.

tulip fern Apr 29, 2025, 6:42 PM

#

weary crystal Is this auto1111? if so i am not sure if it supports sd3.5.

yes

icy drift Apr 29, 2025, 9:06 PM

#

HiDream E1 working! 🙂 (E1 only works with 768*768 as far as I know, so output is smaller. At 1024, it will shift the image. No problem to upscale / hires / low-denoise.)
Change the painting on the canvas into a tuna fish.

#

It did its best with my terrible sketch.
Change the rough sketch style to a clean lineart style with coloring and shading.

#

I had to reroll and modify the prompt 3 times, and it still missed one of the apples. Maybe if I had said "4 apples" instead.
Change the apples' materials from gold to transparent crystal.

icy drift Apr 29, 2025, 10:25 PM

#

I think Omnigen might still be better though.

bitter hearth Apr 29, 2025, 10:57 PM

#

omnigen is still good sometimes yeah

serene whale May 1, 2025, 9:21 AM

#

a beautiful and powerful mysterious sorceress, smile, sitting on a rock, lightning magic, hat, detailed leather clothing with gemstones, dress, castle background

dull star May 1, 2025, 2:49 PM

#

Mods?

#

Scam

#

I cant ping mods at once

#

they removed that option

#

uhhh @spark grove ig

lilac plinth May 3, 2025, 6:38 PM

#

Make moustache narrow and wide with slightly curved ends, Make eyes positioning in correct direction simultenously natural, Mir anees

craggy crest May 4, 2025, 2:19 AM

#

lilac plinth Make moustache narrow and wide with slightly curved ends, Make eyes positioning ...

you can't generate in this channel.

#

icy drift May 4, 2025, 8:38 AM

#

Make it a rainy day, photoreal style.
Nailed it in one! 🙂 New IC-Edit lora with Flux-Fill.

#

Give the kitten sunglasses, photoreal style.

#

Underwater.

weary crystal May 4, 2025, 8:59 AM

#

lilac plinth Make moustache narrow and wide with slightly curved ends, Make eyes positioning ...

tried hidream e1 with a slightly alternated prompt:
Editing Instruction: change the moustache into a short bushy Toothbrush moustache, wearing a jester hat. Target Image Description: young man
@icy drift will try the kitten image an your prompt next 🙂

weary crystal May 4, 2025, 9:20 AM

#

icy drift `Make it a rainy day, photoreal style.` Nailed it in one! 🙂 New IC-Edit lora wi...

mortal kite May 4, 2025, 12:03 PM

#

My harddrive can't handle all these models

dry wave May 4, 2025, 1:03 PM

#

I waited for a lora for flux-fill 😀 do you have a link?

tacit ermine May 5, 2025, 4:40 PM

#

Can anyone tell me how to modify parts of an image through prompt in stable diffusion

weary crystal May 5, 2025, 5:15 PM

#

tacit ermine Can anyone tell me how to modify parts of an image through prompt in stable diff...

Well different options. One of the oldest and trusted methods would be inpainting. So masking the region you want to change and prompt what you needed.
Most recent hidream e1 was released where you simple could write a prompt without masking which editing operations. But this might change more than only the specific region.

tacit ermine May 5, 2025, 5:17 PM

#

weary crystal Well different options. One of the oldest and trusted methods would be inpaintin...

Could you please tell me where to do on this server like changing the image with a prompt

#

I'm new to discord

weary crystal May 5, 2025, 5:21 PM

#

tacit ermine Could you please tell me where to do on this server like changing the image with...

Well this is a server mostly with people who use stable diffusions models either local or with the paid api / artisan channels.
So there is no free bot etc. just some help to install stable diffusions models either front ends etc. to run on own hardware. Talk about prompt optimization, share art, share news.

craggy crest May 6, 2025, 12:14 AM

#

tacit ermine Could you please tell me where to do on this server like changing the image with...

you can generate in the artisan channels. read the information in #artisan-faq

azure thorn May 7, 2025, 3:25 AM

#

Help me generate a pink butterfly with a black background.
Only pink and black images

craggy crest May 7, 2025, 5:24 AM

#

azure thorn Help me generate a pink butterfly with a black background. Only pink and black i...

read the information in #artisan-faq

craggy crest May 7, 2025, 10:05 PM

#

Hot dog!

deft locust May 8, 2025, 12:53 AM

#

Como genero una imagen?

#

Is this free?

craggy crest May 8, 2025, 7:12 AM

#

deft locust Is this free?

Well... if you run the model on your own machine, all you oay us electricty

lucid glade May 8, 2025, 9:24 AM

#

Hot dog!

final lynx May 8, 2025, 7:28 PM

#

can i run sd3 on a rx580?

stark plume May 8, 2025, 7:50 PM

#

Make a cartoon big mango with human eyes

real terrace May 9, 2025, 4:42 AM

#

final lynx can i run sd3 on a rx580?

I don't think so, I owned a rx480 and SD 1.5 and SD XL was kind of a deal and I had to run it on Linux...

#

if it works it would be really slow I guess

violet escarp May 9, 2025, 2:45 PM

#

@spark grove

#

it's one of those fake steam bots again

spark grove May 9, 2025, 2:46 PM

#

https://tenor.com/4nmA.gif

Tenor

sage burrow May 10, 2025, 11:34 AM

#

tacit ermine Can anyone tell me how to modify parts of an image through prompt in stable diff...

Use an the original image as an image reference. Then talk about the change over and over again in the prompt for the new image.

sage burrow May 10, 2025, 11:38 AM

#

final lynx can i run sd3 on a rx580?

I recommend a gguf version of the sd3. Or a lot of patience. large swap file too. I run it on a 4060

blazing spoke May 10, 2025, 3:35 PM

#

Make a realistic pakistan PIA Aeroplane

restive wigeon May 11, 2025, 4:08 PM

#

<dem_form> <img1_style> a biomechanical humanoid creature with tusks and extended tongue, bust portrait, in the exact rendering style of the second image, cinematic shadows, dark metallic skin, surreal alien armor, inspired by H.R. Giger, highly detailed, photoreal 3D style, atmospheric lighting, monochrome tones

craggy crest May 12, 2025, 7:15 AM

#

You can not generate imsges in this channel

craggy crest May 17, 2025, 3:00 AM

#

dry talon May 17, 2025, 6:55 AM

#

pig image

woeful prawn May 17, 2025, 7:34 AM

#

a 185cm high sexy man wears transparent sexy underwear under the sunshine

turbid grotto May 17, 2025, 12:27 PM

#

sadcat

jagged gate May 17, 2025, 2:05 PM

#

Hidream

rugged yacht May 17, 2025, 11:29 PM

#

generate cartoon image with girl

runic tusk May 18, 2025, 1:26 AM

#

No.

harsh fjord May 18, 2025, 8:09 AM

#

how to use this

runic tusk May 18, 2025, 12:55 PM

#

harsh fjord how to use this

You don't. This isn't a bot channel for image creation. There are paid services for that here.

remote holly May 20, 2025, 12:00 AM

#

sd3.5 doesn't deserve to be forgoten by community

cinder elk May 20, 2025, 7:18 AM

#

global automotive manufacturing facility, robotic assembly lines, quality control inspection, international collaboration, high-tech production environment, workers in clean uniforms, cinematic lighting, panoramic wide aspect ratio

sullen moss May 20, 2025, 2:52 PM

#

remote holly sd3.5 doesn't deserve to be forgoten by community

proven pecan May 21, 2025, 10:52 AM

#

remote holly sd3.5 doesn't deserve to be forgoten by community

Same, though it can be a struggle (tbh I have a love-hate rlation with it)

an_amateur_photo_of_a_42_year_old_gardener__she_has_red_long_hair__wears_a_purple_tank_top_and_wide_blue_trousers__normal_posture__she_s_concentrated_working_in_the_garden__the_backgrou_3439374009.png

jade minnow May 22, 2025, 8:55 AM

#

A very pretty Chinese girl with a smile on her face and a nice figure, wearing a purple dress

#

A very pretty Chinese girl with a smile on her face and a nice figure, wearing a purple dress

warped prairie May 23, 2025, 12:46 AM

#

A cheerful, illustrated poster featuring a variety of wild animals engaging in humorous and chaotic parenting moments, like a koala dropping its baby, a hamster eating its young, and a black bear sleeping through parenting duties, all surrounded by hand-drawn floral borders and playful typography that says “There Are Moms Way Worse Than You”, pastel color palette, children’s book illustration style, flat vector aesthetic, clean white background --ar 4:5 --v 6.0 --style raw --s 250

craggy crest May 23, 2025, 2:14 AM

#

#

#

frail shoal May 25, 2025, 2:45 PM

#

icy drift May 26, 2025, 11:50 AM

#

Settings made a big difference for Bagel. Times are on my 4090.
A trading card from a trading card game. The title of the card at the top says: "GOBLIN FIEND". The card art shows a green goblin with red eyes holding a knife. Under the art is a text panel. The text panel has the text: "The goblin fiend loves the taste of fried foods and does not like vegetables." At the bottom-right corner of the card is a number panel. The number panel has an icon of a sword, and the number 7. The card is polished and well designed, with highly detailed art and text in a clear, crisp font.

#

(Bagel with bad settings. Made the difference between SDXL accuracy vs. better than Flux Dev accuracy.)

weary crystal May 26, 2025, 11:56 AM

#

icy drift Settings made a big difference for Bagel. Times are on my 4090. ```A trading car...

About 30 seconds for Chroma....

#

Guess the amount of steps is more important these where made with 26

#

38 Seconds with Chroma v30 unlocked.

#

SD3.5 Medium 15.6 Seconds

icy drift May 26, 2025, 3:25 PM

#

Yeah the steps are more relevant unless you're using my same hardware...
My example gens were 50 steps each. The point was just to compare Bagel's speed to the speed of other models. I've never heard of Chroma, and that's some impressive prompt following. Easily on par with Flux. I'll check it out.

#

Oh chroma is just flux nvm. Just gives me solid black images in Comfy.

violet escarp May 26, 2025, 4:58 PM

#

it's modified from Flux schnell

#

it's a different arch. It prunes schnell from 12b to 8.9b and corrects a mistake related to padding tokens. You need a different workflow for it.

turbid grotto May 26, 2025, 9:19 PM

#

it also does not need clip l and still in 512px pretraining phase, so there is a potential

craggy crest May 27, 2025, 5:00 AM

#

this is the SD3 channel, should probably take the flux discussion to #💬｜general-chat

sand badger May 27, 2025, 3:15 PM

#

A cheerful, illustrated poster featuring a variety of wild animals engaging in humorous and chaotic parenting moments, like a koala dropping its baby, a hamster eating its young, and a black bear sleeping through parenting duties, all surrounded by hand-drawn floral borders and playful typography that says “There Are Moms Way Worse Than You”, pastel color palette, children’s book illustration style, flat vector aesthetic, clean white background --ar 4:5 --v 6.0 --style raw --s 250

#

A cheerful, illustrated poster featuring a variety of wild animals engaging in humorous and chaotic parenting moments, like a koala dropping its baby, a hamster eating its young, and a black bear sleeping through parenting duties, all surrounded by hand-drawn floral borders and playful typography that says “There Are Moms Way Worse Than You”, pastel color palette, children’s book illustration style, flat vector aesthetic, clean white background --ar 4:5 --v 6.0 --style raw --s 250 ¯_(ツ)_/¯

mental raptor May 27, 2025, 9:41 PM

#

where do you get stable diffusion, preferably a gui version?

craggy crest May 27, 2025, 11:22 PM

#

mental raptor where do you get stable diffusion, preferably a gui version?

https://civitai.com/models/878387/stable-diffusion-35-large - this is the model. you'll need to install something to run it in. i recommend you install https://github.com/mcmonkeyprojects/SwarmUI

GitHub

GitHub - mcmonkeyprojects/SwarmUI: SwarmUI (formerly StableSwarmUI)...

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility. - mcmonkeyprojects/Swa...

icy drift May 28, 2025, 10:39 AM

#

violet escarp it's a different arch. It prunes schnell from 12b to 8.9b and corrects a mistake...

Yeah I got it working with a workflow from Civitai, then experimented with a bunch of settings. For accuracy I'm sticking with HiDream+llama8. I use Flux for hi-res to fix HiDream's bad architecture, and my 6-step merge works better for that than chroma. It was still interesting to try though.

swift saffron May 28, 2025, 4:22 PM

#

An Asian man with a receding hairline and a round face, wearing a white shirt inside and a gray-blue jacket outside, holding a wooden dragon-headed cane. He is 190 cm tall and stares at the cane arrogantly. This is a 2D game concept map, with obvious strokes, transparent colors, delicate clothing materials, and shiny leather shoes. --cref https://s.mj.run/CxufHB5Qy1s --cw 100 --ar 9:16

rocky geode May 29, 2025, 12:54 PM

#

is it possible to run a flux dev with 22 gigs insted we only have 16 gpu ? (may be by vram ? got 64 of thems)

#

! thank for this speed answer !

weary crystal May 29, 2025, 12:55 PM

#

rocky geode is it possible to run a flux dev with 22 gigs insted we only have 16 gpu ? (may ...

pretty sure it is scam

#

don't click on the server

rocky geode May 29, 2025, 12:55 PM

#

ha ?

#

i dont click on

#

you have good moderation here =)

weary crystal May 29, 2025, 12:56 PM

#

Good for you. Lately scammer appear whenever questions are ask to join a discord and they need you login etc.

rocky geode May 29, 2025, 12:57 PM

#

i may not have to share things here ?¿

weary crystal May 29, 2025, 12:57 PM

#

Back to your question. There are some flux dev quantisation models with fp8 that are smaller. You will not be happy with the run time if the model will be pushed towards the ram

#

Well tech-support would be a good channel for these question. Scammers are everywhere 😦

rocky geode May 29, 2025, 12:57 PM

#

oh i'm ok with fp8 vae ones

#

♥ Lords thanks for your rescue ♥ i'll try to be carefull with what i ask !

weary crystal May 29, 2025, 12:59 PM

#

rocky geode ♥ Lords thanks for your rescue ♥ i'll try to be carefull with what i ask !

Ask whatever you want to know but if people DM you or send you links that sounds fishy.... it is fishy 🙂

rocky geode May 29, 2025, 1:00 PM

#

promess, i'll wireshark logs them ^^

icy drift May 29, 2025, 8:46 PM

#

Not quite dice-like, huh.

sullen moss May 29, 2025, 10:55 PM

#

Flux Kontext

#

#

#

#

#

#

icy drift May 30, 2025, 9:19 PM

#

Sure... I'll try the Kontext Dev version when available. I'll be BLOWN AWAY if it can do any of the following at all:

follow literally even one simple instruction like: "make his shoulders wider", or "make her look off to the right", or "make the dog cross its paws".
preserve outfits from references without changing sleeve length, adding panels, moving seams, changing button count, etc.
do simple / basic UI edits like, "add a sword icon in the bottom-right corner", or "make the title font at the top larger", or "add a raised border around the red button on the bottom right"
I am expecting the model to struggle immensely with anything other than the tasks listed in the paper, and I'm expecting it to need exact-word prompting to achieve those.

cunning lintel May 30, 2025, 10:25 PM

#

Tried the first, no joy on the pro tagged version

#

But who knows, maybe conrolnet pose will be added and you can twist and turn your char anyway you want too, that doesn't seem that big of a leap

cunning lintel May 30, 2025, 10:46 PM

#

Still, the way it preserves details is amazing, original photo "add a roswell alien riding the zebra" "make the alien hold one arm up, as in a greeting". And all gens actually keep looking like a photo, by far the biggest win for me, not the faux 3d render photo look so often seen.

#

So i thought style transfer would be really good too, nope, very hit or miss, hit for common digital art styles, but when using simple line doodles or charcoal stuff, style was pretty much ignored 😢

icy drift May 30, 2025, 10:50 PM

#

cunning lintel So i thought style transfer would be really good too, nope, very hit or miss, hi...

I was being all huffy to not get my hopes up. But my hopes were up anyway. Oh well.

cunning lintel May 30, 2025, 10:52 PM

#

maybe it's the prompting (might help to add a little style to the prompts), $.04/image is too much too experiment a lot for me

icy drift May 30, 2025, 10:53 PM

#

cunning lintel maybe it's the prompting (might help to add a little style to the prompts), $.04...

Yeah I'll wait until I can run it in Comfy. Sometimes with great prompting and lots of re-rolls you can find a useful thing that a new model can do, that no old models can do.

cunning lintel May 30, 2025, 11:05 PM

#

using this style create: a towering Lizardfolk mercenary whose scales are fused with veins of obsidian and reinforced with magi-tech plating. His eyes glow with internal arcane energy. He wears heavy brass pauldrons enchanted for durability and carries a powerful sonic disruptor gauntlet. He's gruff, pragmatic, and focused solely on the highest bidder.

#

it's something, maybe i just want too much :p

#

the first one was my first try and seeing it keep the effect of white border, image over it, i thought, wow, that's good

craggy crest May 31, 2025, 3:42 PM

#

rocky geode ! thank for this speed answer !

just get the gguf and run that

devout schooner May 31, 2025, 11:17 PM

#

I finally got around to actually releasing an SD 3.5 Medium lora after spending 80 years experimenting with training lol (actually it's a Dora, specifically, not that it matters)
for the art style of Tim Jacobus (guy who did all the original Goosebumps cover art)
https://civitai.com/models/1635408/stable-diffusion-35-medium-art-style-tim-jacobus

devout schooner May 31, 2025, 11:23 PM

#

devout schooner I finally got around to actually releasing an SD 3.5 Medium lora after spending ...

relevant training notes I guess if anyone cares (in Kohya-ian terms):

CAME optimizer
no text encoder training
0.0001 model learning rate
Cosine With Restarts Scheduler set to 3 restarts
"Noise Offset" at 0.2
Multires Noise Iterations and Multires Noise Discount disabled entirely
Dim 64, Alpha 32, Dora model type, "factor" set to 2
Batch Size 1 but with 5 gradient accumulation steps (as an alternative to a regular batch size of 5, which I imagine would have a similar effect)

#

also I think I generated all the samples with DPM++ 2S Ancestral SGM Uniform @ CFG 5.5, in Comfy

dry wave Jun 1, 2025, 12:45 AM

#

why do you even use noise offset?

#

it's a weird hacky technique full if potential errors that is not even necessary for rectified flow matching models

devout schooner Jun 1, 2025, 6:37 AM

#

dry wave why do you even use noise offset?

0.2 gives better results than nothing for SD 3.5 Medium
and better results than the standard 0.1 (or 0.03) people would typically use for SDXL

#

I can't speak to anything other than the results lol

#

the only "training guide" ever released for SD 3.5 anything was basically nonsense in my extensively tested opinion

#

anything other than Dora is basically useless
any normal Adam optimizer is basically useless
I've never gotten vaguely good results for SD 3.5 Medium with anything other than CAME and low-factor Doras joeshrug

#

it doesn't train anything remotely like any other model ever

modern rover Jun 3, 2025, 9:45 AM

#

create a icon like this photo

lucid jasper Jun 4, 2025, 8:03 AM

#

Draw a statue of an anime god : "raise the level alone" contrast photo with highlights, without background

opaque oyster Jun 4, 2025, 8:04 AM

#

#artisan-1 - PA realistic standing image of Lord Kalabhairava, the fierce form of Lord Shiva. He is depicted with a terrifying yet divine expression, with three eyes glowing like fire. His complexion is dark as a stormy night, adorned with garlands of skulls and serpents. He stands powerfully in a cremation ground, surrounded by blazing fires and spirits. He holds a trident, a drum (damaru), a noose, and a skull bowl in his four hands. His hair is matted and flies wildly, crowned with a crescent moon. His feet are adorned with golden anklets, and he wears tiger skin. A dog stands loyally beside him. The atmosphere is mystical, with storm clouds and divine light behind him, capturing the essence of time and death. Style: Hyper-realistic, high detail, divine and intimidating aura, traditional Hindu iconography.

weary oxide Jun 5, 2025, 5:44 AM

#

PA realistic standing image of Lord Kalabhairava, the fierce form of Lord Shiva. He is depicted with a terrifying yet divine expression, with three eyes glowing like fire. His complexion is dark as a stormy night, adorned with garlands of skulls and serpents. He stands powerfully in a cremation ground, surrounded by blazing fires and spirits. He holds a trident, a drum (damaru), a noose, and a skull bowl in his four hands. His hair is matted and flies wildly, crowned with a crescent moon. His feet are adorned with golden anklets, and he wears tiger skin. A dog stands loyally beside him. The atmosphere is mystical, with storm clouds and divine light behind him, capturing the essence of time and death. Style: Hyper-realistic, high detail, divine and intimidating aura, traditional Hindu iconography.

buoyant mesa Jun 5, 2025, 6:47 AM

#

devout schooner I finally got around to actually releasing an SD 3.5 Medium lora after spending ...

What tool did u use if I may ask?

dull locust Jun 5, 2025, 6:20 PM

#

prompt
"تیزر آموزشی هوش مصنوعی: چرخ‌دنده‌های مکانیکی کلاسیک (نماد سیستم‌های قدیمی) به آرامی به ساختارهای دیجیتالی تبدیل می‌شوند. ابتدا به لایه‌های نورانی یک شبکه عصبی ساده (3 لایه با نورهای آبی و سبز) تغییر شکل می‌دهند، سپس به یک معماری پیچیده Deep Learning (با صدها نور قرمز-زرد-آبی متصل) تکامل می‌یابند.

echo pond Jun 6, 2025, 10:12 AM

#

ebon bloom Jun 13, 2025, 12:24 PM

#

`prompt
主题公园景观，青柠水晶雕塑作为核心装置，透明玻璃温室中悬浮水滴形青柠树，弧形玻璃步道环绕浅绿色反光水池，现代极简风格建筑由玻璃与亚克力构成，阳光透过棱镜折射彩虹光斑，柔焦清新色调，等轴视角构图，by Nendo工作室 --ar 16:9 --v 6.0

dusky thistle Jun 16, 2025, 11:50 PM

#

#

SD35M

#

got a lot of upgrades to style transfer going

#

#

#

#

dusky thistle Jun 17, 2025, 12:25 AM

#

#

#

#

#

#

#

dusky thistle Jun 17, 2025, 1:15 AM

#

#

dusky thistle Jun 17, 2025, 1:33 AM

#

#

#

dusky thistle Jun 17, 2025, 5:43 AM

#

copper whale Jun 17, 2025, 10:23 AM

#

dusky thistle

i like this one, this kinda reminds me of the starry night of vincent van gogh, hope u generate more of thisss :>

lilac plank Jun 18, 2025, 8:11 PM

#

Realistic image of a porsche

craggy crest Jun 18, 2025, 9:09 PM

#

copper whale Jun 19, 2025, 2:10 PM

#

craggy crest

soo stunning...like a glowing fantasy world come to life broo

craggy crest Jun 19, 2025, 5:22 PM

#

copper whale soo stunning...like a glowing fantasy world come to life broo

thanks :) took me a few tries, i kept getting a mushroom top as a head

hasty plank Jun 20, 2025, 4:35 AM

#

Modern Style Bedroom Interior Scene with Contemporary Decorative Style, Bed, Cabinet, Nightstand, Table, Greenery

open heart Jun 20, 2025, 2:01 PM

#

a lone survivor senses danger lurking behind shattered glass.
Shot concept: Grit, suspense, and post-apocalyptic atmosphere.
#AIart #Cinematic #SurvivorScene #PostApocalyptic

devout schooner Jun 21, 2025, 3:32 AM

#

https://civitai.com/models/1701368 Did a multi-appearance realistic fantasy "hellhound" creature archetype Dora for SD 3.5 Medium

tardy niche Jun 22, 2025, 8:44 AM

#

bear

#

can someone tell me how to make images pls

runic tusk Jun 22, 2025, 1:49 PM

#

Nice scam.

urban arch Jun 22, 2025, 3:25 PM

#

Nah, the really good ones are the ones that actually sound real.

open heart Jun 22, 2025, 6:26 PM

#

🚨 Limited-Time Offer!
Heart of Steel by Blacke Marlin is now FREE on Kindle until June 27!
Enter a dieselpunk world of sky missions, sabotage, and dark secrets.
Don’t miss your chance to grab this gripping novella.
https://www.amazon.com/dp/B0FF3C5YX3?ref_=ast_author_mpb
#dieselpunk #scifi #kindle #freebook #fiction

Heart of Steel: A Dieselpunk War Adventure

copper whale Jun 24, 2025, 9:31 AM

#

devout schooner https://civitai.com/models/1701368 Did a multi-appearance realistic fantasy "hel...

awesome bro

devout schooner Jun 25, 2025, 4:44 PM

#

copper whale awesome bro

Thanks
I've got a big detailer one trained solely on hi res Flux Pro Ultra outputs I'm gonna release soon too
Stock Medium on Left, with Dora on right
Same seed / prompt / sampling settings / etc

simple flame Jun 26, 2025, 1:35 PM

#

create a friendly, cute, white and round robot assitant that resembles eve from wall e, deptic her from different angles

brisk brook Jun 26, 2025, 6:49 PM

#

Create a photorealistic and realistic image with a resolution of 3840x2160 in a cyberpunk style. In the foreground, depict a very beautiful, slender woman with a short haircut, who is half Asian and half Caucasian. She wears thin, tight-fitting clothing with the inscription "Xaero," through which the outlines of her nipples are visible. In the background, show a megacity with dark tones accented by blue, pink, and purple colors, and a cyberpunk-style sports car parked nearby. Please generate 3 different variations of this image. The image should have photographic realism, with detailed lighting, textures, and atmosphere typical of high-end cyberpunk visuals

cunning lintel Jun 26, 2025, 6:57 PM

#

cunning lintel > using this style create: a towering Lizardfolk mercenary whose scales are fuse...

So, with kontext dev out, tried the same images and prompt with dev (i used wavespeed online, haven't set it up locally).... uf, not the results i hope for.
Hopefully this ages like milk, and even next week it's shown kontext is amaaaazing

e5faa09a-7e51-436f-94d2-18b560767046-u1_55fc8c99-9a5c-4c74-a4ab-2dabcd07c432.png

153fdb89-9e55-4a53-b17f-623140c42dda-u1_ff61545f-7e22-4be1-aa5f-69d9b9479660.png

d25a8d1a-26c1-4c4a-8d4a-f14baa97b04d-u1_9a98c144-2e2b-4408-91e5-4fda5412b75d.png

ce09106d-0178-4dd9-9920-b6d1a0b77462-u2_45a3f415-9ea5-4e50-81b8-96dd7d6068ee.png

short thicket Jun 27, 2025, 12:38 AM

#

Works alright for me. Cheers.

copper whale Jun 27, 2025, 1:50 PM

#

cunning lintel So, with kontext dev out, tried the same images and prompt with dev (i used wave...

i think this is just fine bro maybe u have to explore more? i guess

rapid sparrow Jun 27, 2025, 2:01 PM

#

PA realistic standing image of Lord Kalabhairava, the fierce form of Lord Shiva. He is depicted with a terrifying yet divine expression, with three eyes glowing like fire. His complexion is dark as a stormy night, adorned with garlands of skulls and serpents. He stands powerfully in a cremation ground, surrounded by blazing fires and spirits. He holds a trident, a drum (damaru), a noose, and a skull bowl in his four hands. His hair is matted and flies wildly, crowned with a crescent moon. His feet are adorned with golden anklets, and he wears tiger skin. A dog stands loyally beside him. The atmosphere is mystical, with storm clouds and divine light behind him, capturing the essence of time and death. Style: Hyper-realistic, high detail, divine and intimidating aura, traditional Hindu iconography.

cunning lintel Jun 27, 2025, 10:08 PM

#

copper whale i think this is just fine bro maybe u have to explore more? i guess

The intend/prompt was to use the style of a source image (the same i used in the post i replied to), sadly the model hardly followed it, only the color scheme a bit. Other trickery might or might not wotk (i had some success adding stuff to real photo's) but style reference isn't something i managed to make the dev version do. (and it was what i looked most forward to 😞 )

fresh plover Jun 28, 2025, 6:51 PM

#

Create a 1990s realistic portrait featuring Mexican American singer Selena Quintanilla with long dark hair and bangs, she's wearing red lipstick she's smiling

spark quail Jun 30, 2025, 8:19 AM

#

PA realistic standing image of Lord Kalabhairava, the fierce form of Lord Shiva. He is depicted with a terrifying yet divine expression, with three eyes glowing like fire. His complexion is dark as a stormy night, adorned with garlands of skulls and serpents. He stands powerfully in a cremation ground, surrounded by blazing fires and spirits. He holds a trident, a drum (damaru), a noose, and a skull bowl in his four hands. His hair is matted and flies wildly, crowned with a crescent moon. His feet are adorned with golden anklets, and he wears tiger skin. A dog stands loyally beside him. The atmosphere is mystical, with storm clouds and divine light behind him, capturing the essence of time and death. Style: Hyper-realistic, high detail, divine and intimidating aura, traditional Hindu iconography.

muted cargo Jun 30, 2025, 9:03 AM

#

short name for a channel holding every informations related to this discord's channel bot.
🤖 : beep boop beep, there you go #artisan-faq

vast crag Jun 30, 2025, 9:55 AM

#

Chinese ink painting of the Red Cliffs battlefield at dusk, towering red cliffs with crashing waves (‘乱石穿空，惊涛拍岸’), ruined ancient fortifications in the distance, a young General Zhou Yu (周瑜) in silk headscarf and feather fan (‘羽扇纶巾’), standing beside Lady Xiao Qiao, romantic yet heroic atmosphere, misty river reflecting moonlight, fusion of historical grandeur and poetic melancholy, Song Dynasty landscape style.

muted cargo Jun 30, 2025, 12:09 PM

#

...

sly escarp Jun 30, 2025, 3:17 PM

#

“Are there open-source virtual try-on projects I could help with or test?”

raven elm Jul 1, 2025, 3:36 AM

#

kaleidoscope sucked into a kaleidochromic vortex --s 750 --v 7.0 --raw - Remix (Strong)

lyric iron Jul 1, 2025, 3:49 AM

#

Close-up professional corporate man headshot, modern business portrait. The subject's head and shoulders are tightly framed, filling most of the image. Focus is sharply on the face, particularly the eyes, with a shallow depth of field blurring the background.
Lighting: Three-point studio lighting setup optimized for a close-up. A soft, diffused key light directly or slightly to the side of the face to minimize harsh shadows. A fill light to subtly illuminate the shadow areas under the chin and nose. A hair light or rim light from behind to add a subtle highlight along the hair and separate the subject from the background.
Background: Smooth, solid, neutral dark gray or deep blue background, completely out of focus to ensure maximum attention on the subject.
Camera & Style: Simulated DSLR photography with a high-quality portrait lens (e.g., 85mm equivalent). The image should have ultra-detailed facial features, realistic skin texture (without excessive smoothing), and professional, neutral color grading suitable for business use. The overall feel should be confident, approachable, and trustworthy.

copper whale Jul 1, 2025, 11:02 AM

#

cunning lintel The intend/prompt was to use the style of a source image (the same i used in th...

it's fine bruhh just keep doung yo best and eventually things will follow thruuu

soft hamlet Jul 1, 2025, 3:01 PM

#

Create a hyper-realistic 8K resolution cinematic poster of Mobile Legends: Bang Bang featuring 5 characters: Layla (with her cannon), Dyrroth (in a fierce battle stance), Harley (casting a magic card), Esmeralda (with her cosmic scythe and flowing cloak), and Akai (spinning with his bamboo staff). The scene should be dark and dramatic, with intense rim lighting, glowing particle effects, lens flares, and smoke in the background. Position the characters in a powerful triangular composition on a fantasy battlefield with magic energy storms and ruins. Each hero must look dynamic and battle-ready, with ultra-detailed armor and realistic facial expressions. Add cinematic color grading and film grain for a movie-poster look. Include the title ‘Mobile Legends: Bang Bang’ in bold metallic lettering at the bottom center. Aspect Ratio: 16:9. Full movie-poster tone, highly detailed, epic fantasy style."

tardy prism Jul 2, 2025, 12:41 PM

#

dusky thistle

thats cool, whats that checkpoint?

dusky thistle Jul 2, 2025, 1:55 PM

#

Probably was zavy

deft grove Jul 3, 2025, 3:10 PM

#

sakura, white, pink --ar 9:16 --sref 2121577414

vernal yew Jul 5, 2025, 10:47 AM

#

#

Conver to anime

#

#🆕｜sd3

#

#🆕｜sd3

fresh plover Jul 6, 2025, 10:46 PM

#

A 1990s-style self-portrait of 27-year-old Jennifer Lopez, with long, dark wavy hair and soft bangs. She wears bold red lipstick and is styled like Mexican American Tejano singer Selena Quintanilla. The photo has a warm, vintage studio portrait vibe, with soft lighting and a nostalgic 90s glamour aesthetic.

surreal anvil Jul 7, 2025, 1:09 AM

#

Generate a black and white portrait of my face, shot from a close-up, overhead angle, with my head facing forward. I used a 35mm lens and 10.7K 4HD quality.

Proud expression, water droplets on my face. Background with deep black shadows: only my face is visible, and it looks ultra-sharp. Aspect ratio of 4:3, with a 1/5 depth of field effect.

copper whale Jul 7, 2025, 10:28 AM

#

vernal yew <#1230206273451069540>

tis coolll,,how u do this man?

runic tiger Jul 7, 2025, 10:35 AM

#

#▶｜stable-video-diffusion a divine digital painting of Lord Krishna as Radha Ramana sitting beneath a blooming kadamba tree on a carved stone bench, Radha resting gently against his shoulder, Krishna wearing a saffron‑yellow silk dhoti and peacock‑feather crown, softly playing the flute, Radha in a pastel pink and turquoise lehenga with jasmine garlands around her braid, lotus‑filled pond glimmering behind them, morning mist and golden rays filtering through leaves, peacocks and deer in the background, tranquil Vrindavan atmosphere, ultra‑detailed devotional art, cinematic soft lighting, peaceful romantic mood, high‑resolution

torpid marlinBOT Jul 7, 2025, 10:36 AM

#

how make ai photo
No data source is currently selected. Please choose a data source from the dashboard and try again.

sly escarp Jul 7, 2025, 2:49 PM

#

@civic latch yes I have the specifications/ details of the logo

dusky thistle Jul 8, 2025, 2:04 AM

#

#

#

#

#

#

#

#

#

#

#

#

#

#

dusky thistle Jul 8, 2025, 1:47 PM

#

#

#

#

#

#

#

#

#

#

dusky thistle Jul 8, 2025, 11:15 PM

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

dusky thistle Jul 9, 2025, 12:31 AM

#

dusky thistle Jul 9, 2025, 1:15 AM

#

dusky thistle Jul 9, 2025, 1:35 AM

#

#

#

dusky thistle Jul 9, 2025, 2:48 AM

#

#

#

#

#

#

#

#

#

#

#

#

#

#

#

dusky thistle Jul 9, 2025, 5:33 AM

#

tender oak Jul 10, 2025, 4:05 AM

#

golden retriever dog

dusky thistle Jul 10, 2025, 4:57 AM

#

tender oak golden retriever dog

Here is the image you requested.

#

Here is the image you requested.

#

Here is the image you requested.

#

Here is the image you requested.

dusky thistle Jul 10, 2025, 5:25 AM

#

#

#

#

#

#

dusky thistle Jul 10, 2025, 6:39 AM

#

dusky thistle Jul 10, 2025, 7:00 AM

#

dusky thistle Jul 10, 2025, 2:09 PM

#

#

#

raven elm Jul 11, 2025, 2:18 AM

#

house in the woods resemblance of a castle but more like a home

violet escarp Jul 11, 2025, 11:38 PM

#

@spark grove spammer scammer

spark grove Jul 11, 2025, 11:40 PM

#

violet escarp <@463931565643268108> spammer scammer

purged

violet escarp Jul 11, 2025, 11:57 PM

#

@spark grove again

craggy crest Jul 12, 2025, 5:18 PM

#

1: that prompt is too long and 2: you can't generate in this channel

dawn cargo Jul 13, 2025, 6:44 AM

#

Ultra-realistic photogrammetry 3-D globe named Gloxus, 16-k resolution earth texture, micro-topographic detail on every mountain ridge and river delta, continents carved from obsidian-black basalt with razor-sharp displacement maps, iridescent neon-cyan ocean currents swirling under a thin glass layer, holographic magenta circuit-veins mapping city lights across landmasses, subtle cyan grid lat/long lines hovering 2 mm above surface, cinematic rim-light from a cool white sun at 45°, micro-scratches and fingerprint smudges on glossy protective dome, shallow depth of field f/1.4, 32-bit HDR, octane render quality, ray-traced reflections, photoreal shadows, ultra-sharp 200 mm lens, clean black studio background, --ar 16:9 --cfg 12 --steps 40 --sampler DPM++ 2M Karras --vae kl-f8-anime2 --no text, watermark, logo, frame

cinder junco Jul 13, 2025, 10:44 AM

#

dusky thistle Here is the image you requested.

Shame on you. You’ve picked on this poor, helpless bot and now, somewhere on the Indian subcontinent, there is a web page where this image is captioned as an attractive young woman in a business suit. I hope you think about the suffering you have caused.

pulsar oak Jul 14, 2025, 2:59 PM

#

#🆕｜sd3 rooftop, anime style, recreate

panoramic-kuala-lumpur-skyline-view-concrete-observatory-deck-rooftop-sunset-asian-corporate-residential-lifestyle-financial-city-downtown-real-estate-product-display-mockup-empty-roof_269648-4272.jpeg

dusky thistle Jul 15, 2025, 7:35 AM

#

pulsar oak <#1230206273451069540> rooftop, anime style, recreate

Here is the image you requested.

copper whale Jul 16, 2025, 8:10 AM

#

dusky thistle Here is the image you requested.

u kidding man hahha this is hilarious!

languid terrace Jul 17, 2025, 6:24 AM

#

Anime-style third-year student with spiky hair, wearing a tank top and shorts, dramatically leaping towards a basketball hoop placed on a mountain summit, sweat droplets flying, exaggerated wind effects, vibrant sunset colors with pink and orange clouds, stylized rocky terrain, action comic book shading, inspired by 'Slam Dunk' artwork

raven elm Jul 17, 2025, 7:18 AM

#

spark quail Jul 19, 2025, 4:02 AM

#

#

how the heck do we counter-report someone

#

this king is one of the few keepin this channel alive

#

okay 😭

fresh yoke Jul 19, 2025, 9:25 PM

#

Hello im new here diffusionhand

regal vault Jul 20, 2025, 4:34 PM

#

Photorealistic full-body portrait, eye-level shot, sharp focus on subject: A beautiful, energetic 22-year-old Vietnamese woman, exuding confidence and strength. Her skin is glowing with perspiration, highlighting her active state. She is clad in sleek, form-fitting athletic wear (e.g., a tight sports top or tank top and high-waisted leggings) that accentuates her toned physique and prominent bust. The fabric, slightly damp with sweat, clings closely to her body, subtly emphasizing her natural contours and definition beneath. She is captured mid-movement or pausing in a modern, well-equipped gym, with blurred fitness equipment, bright mirrors, and a motivating atmosphere in the background. Her expression is focused and determined, yet radiating a youthful vitality. Natural gym lighting with subtle highlights on her skin and the sheen of sweat. Captured with exquisite detail and sharpness, showcasing natural tones in a realistic photographic style, akin to a professional shot on a high-end DSLR (e.g., Canon EOS R5 with a 70-200mm f/2.8 lens), ISO 400, 1/160s shutter speed, and f/3.2 aperture. Shallow depth of field, drawing all attention to the woman. True-to-life colors. Aspect ratio 9:16.

worldly gulch Jul 20, 2025, 6:02 PM

#

vorrei che rappresentassi una scritta "Il volo di Crà"; la immagino adaggiata sulla riva di una isola, leggermente lambita dal mere. I caratteri che la compongono vorrei che fossero come scolpiti su degli scogli e leggermente ricoperti di vegetazione.

raven elm Jul 22, 2025, 2:13 AM

#

cool girl

solar grail Jul 22, 2025, 2:40 PM

#

expand this

save_your_energy_for_a_backyard_fest_meat_claws_image.jpg

rigid marten Jul 23, 2025, 7:02 AM

#

raven elm cool girl

shesh, baddieee ❤️

open heart Jul 24, 2025, 7:08 AM

#

teal ingot Jul 24, 2025, 1:08 PM

#

Hi

raven elm Jul 25, 2025, 1:35 AM

#

Howl's Moving Castle

craggy crest Jul 25, 2025, 6:23 AM

#

@spark grove spammer alert

spice pine Jul 26, 2025, 2:12 AM

#

#🆕｜sd3ocean，beach

#

ocean，beach，young girl

dark plume Jul 28, 2025, 9:15 AM

#

I wana generate this kind of image some one help me

slate glacier Jul 29, 2025, 5:53 AM

#

Anyone use Stable Diffusion to segment?

craggy crest Jul 30, 2025, 5:28 PM

#

dark plume I wana generate this kind of image some one help me

start with "2d cel-shaded cartoon" as the first part of your prompt, and then go into detail what you want the cartoon to be

junior cloud Jul 31, 2025, 1:46 PM

#

dark plume I wana generate this kind of image some one help me

will this do?

5HXYH7CtNHlXQrk6xT4FgbJ0lqC33URm3emijOK1DTngqd6XLFykKnSB6H1uKHvurPfv3BJ115t6b1ehCa5GgKssKMcWiWmOcz6Cq55jky6BgBjRKj6yJegykJU1f8H9ncYUHa0iBMAAAAASUVORK5CYII.png

junior cloud Jul 31, 2025, 1:48 PM

#

dark plume I wana generate this kind of image some one help me

thick aurora Aug 1, 2025, 2:29 PM

#

guys, how can I run sd3.5 on Forge? I belive I'm doing something wrong, because don't generate image and tilt my Colab when I use it =/

errant dust Aug 1, 2025, 10:14 PM

#

So any opinions on Krea yet?

ruby prawn Aug 1, 2025, 10:32 PM

#

my first generation with generate/ultra

sullen moss Aug 1, 2025, 10:54 PM

#

errant dust So any opinions on Krea yet?

I recently tested it locally. I can't say I'm overly impressed with this model. It's very noisy, in my opinion. You could say it's just another fine-tuned model, nothing more.

errant dust Aug 2, 2025, 12:03 PM

#

https://www.krea.ai/blog/flux-krea-open-source-release

Krea Blog

Releasing Open Weights for FLUX.1 Krea

Krea announces the open release of FLUX.1 Krea

#

Krea took Flux Dev Raw and then did their own post-training. This blog entry details it

#

So calling it a finetune is not wrong, but it goes quite a bit deeper all the same

#

I will add that my initial images with Krea are very nice and are more intresting to me than vanilla Flux dev. Is it the best overall Flux? I haven't come to any conclusions. My other fav was/is Pixelwave. I tried otehr attempts but they were inevitably not very interesting. This is all without Loras of course

#

I sitll really love SD 3.5 FWIW. They each have their strengths and weaknesses. FOr actual text, Flux is in a class of its own for open local models.

#

SD 3.5 Large of course.

#

Of course, Flux's biggest strength is precisely its flexibillity to be finetuned or have Loras

radiant quiver Aug 3, 2025, 7:20 AM

#

hey is there a easy tutorial on how to train a lora

gritty turtle Aug 3, 2025, 12:23 PM

#

if anyone need help in lora training let me know

real terrace Aug 3, 2025, 6:04 PM

#

https://huggingface.co/kpsss34/Stable-Diffusion-3.5-Small-Preview1

kpsss34/Stable-Diffusion-3.5-Small-Preview1 · Hugging Face

torn marsh Aug 3, 2025, 7:51 PM

#

Flux-Krea

torn marsh Aug 3, 2025, 8:24 PM

#

diptych of two identical images as a split screen featuring the same character: a young woman from jrpg game. On the right she looks at viewer. On the left she's wearing a straw hat

dry wave Aug 3, 2025, 9:59 PM

#

errant dust So any opinions on Krea yet?

I find it MUCH better than Flux Dev

dry wave Aug 3, 2025, 10:00 PM

#

errant dust Krea took Flux Dev Raw and then did their own post-training. This blog entry det...

I think this is the important point. Flux Krea seems to be less dpo-ed and overfitted than Flux Dev

#

it has more issues with anatomy than Flux Dev, but on the other side its much more diverse in styles

#

did you tried to use author prompts with Flux Krea?
Flux Dev never responded on them. Flux Krea, however, can roughly imitate styles you name (similar quality as SDXL)

#

in Flux Dev everything always looks the same style. Flux Krea allows you to use different styles in your prompt without needing a Lora

#

For painterly stuff, I prefer PixelWave more than Krea, but Krea comes close. I think finetuning Krea will give better results than finetuning Flux Dev on styles

raven elm Aug 4, 2025, 3:10 AM

#

Old stone alley, mossy banyan, flower-filled balcony. Natural, vibrant, cinematic. Miyazaki style, 32K UHD.

long needle Aug 4, 2025, 8:58 AM

#

🔥

zinc delta Aug 5, 2025, 9:09 AM

#

hey is SD3 dead? no updates for almost a year?

muted cargo Aug 5, 2025, 9:48 AM

#

zinc delta hey is SD3 dead? no updates for almost a year?

https://stability.ai/news

dry wave Aug 5, 2025, 10:26 AM

#

zinc delta hey is SD3 dead? no updates for almost a year?

I would say the time where SD was the dominating open source solution for image gen are long over. Good news is, there are so many new models out on the market

#

there is the Flux ecosystem by the guys who initially developed SD. There is highdream, there is Wan (can generate videos and images) and there is now also Qwen image

zinc delta Aug 5, 2025, 10:48 AM

#

I know

#

I have still SD3 at my app

#

and today I refactored it

#

have more models etc

#

and removed it

#

it was the worst model honestly

twilit pollen Aug 5, 2025, 7:29 PM

#

Is there anyone looking for dev?

craggy crest Aug 5, 2025, 9:46 PM

#

zinc delta hey is SD3 dead? no updates for almost a year?

SD3 is now SD3.5 and it's not dead

errant dust Aug 6, 2025, 1:51 AM

#

Qwen image?

#

HiDream is nice but it is really sluggish

#

I am not at all convinced it is worth the effort

#

SD3.5 is super cool

#

But even Flux can be complained about in terms of updates. Krea is really just a different post-trained Flux

#

The commercials have not been idle either: Imagen, Mid7 and Ideogram have all been pulling ahead in some aspects

errant dust Aug 8, 2025, 2:52 AM

#

So miraculously two accounts spamming the exact same crap. Brilliant.

#

On a relevant note, I did generate some images with Qwen Image and it is quite good. Good text adherence too.

#

A lot slower than Flux, and too early to say whether the vanilla is an improvement or not over Flux Krea

devout schooner Aug 8, 2025, 7:26 AM

#

zinc delta hey is SD3 dead? no updates for almost a year?

a lot of people REALLY weren't happy about the recent "safety policy" update with regards to "core" models at least
especially in light of the fact that SD 3.5 was mostly uncensored
they didn't literally DPO-tune female nipples out of the model the way Flux did KEKW

errant dust Aug 8, 2025, 12:08 PM

#

I think he meant the SD3 in general and was not suggesting SD3.0 specifically

#

Anyhow, I did some testing, very light, of the new HiDream and Qween Image in terms of models with text

#

Qwen really I king of correct text, but it also sacrifices a lot to achieve it IMHO. The default imagery is much less inspired, and the fonts are downright boring. It never deviates or produces anything fun looking, which is likely solid if you are trying to put together some ad or banner. Hopefully it is more readily tunable and new tweaked models will emerge on Civitai

#

HiDream's text ability seems about on par with Flux and is definitely more intreesting visually. Albeit it... it botches long words a LOT

#

Here is an example of ultra correct Qwen:

#

#

I will point out that Qwen is by far the most accurate portrayer of chess pieces. It gets them right each and every time

#

Others, including Flux or SD35, can be a bit creative at times

#

FWIW, I ran the prompt multiple times with varying samplers and steps. Qwen really does not gain any improvement beyond 25 steps. You can see the occasional micro diff, but never anything warranting it to be called an improvement

#

#

This is hidream

#

\

#

another to illustrate. Flux is no better with this text

#

On the other hand, Qwen is incredibly strong at making logos

#

and not merely because of text accuracy

#

WHich is usually not a big deal since logos don't usually have major texts

#

I threw Qwen a bit of a curveball with the request for a logo for Chess & Tech, round, with a design based on a circuit board and.... Louis XIV

#

#

#

Not bad at all

hard lion Aug 8, 2025, 5:54 PM

#

A luminous ‘Digital Giant Tree’ stands at the heart of a futuristic city, its trunk entwined with flowing data chains forming a ‘2019-2025’ timeline. The canopy spreads into a massive ecological dome shaped like the number ‘6.’ AI drones perch on its branches like birds, while roots connect to an underground 5G network. The ground features transparent solar panels and dandelion-inspired smart streetlights. A river glows with quantum computing projections, and humans interact with nature in a holographic garden. Cyberpunk lighting blends with forest mist, rendered in a surrealist style

south wharf Aug 10, 2025, 11:12 AM

#

#

#

errant dust Aug 10, 2025, 1:27 PM

#

Is that supposed to be inspired from Ancient Rome or some other antique civilization?

raven elm Aug 11, 2025, 2:37 AM

#

silent hinge Aug 11, 2025, 9:01 AM

#

你好

brave apex Aug 11, 2025, 9:11 AM

#

silent hinge 你好

Hello!

silent hinge Aug 11, 2025, 9:15 AM

#

你好

craggy crest Aug 12, 2025, 2:39 AM

#

#

#

raven elm Aug 12, 2025, 3:46 AM

#

The cafe

zealous sierra Aug 16, 2025, 11:26 AM

#

SD 3.5 L, Dreamy Aesthetics

frail shoal Aug 16, 2025, 11:38 PM

#

frank haven Aug 17, 2025, 1:26 PM

#

/create: big red mouse

zealous sierra Aug 17, 2025, 2:37 PM

#

Neon Rev - Electric Denim Girl.

viral moon Aug 17, 2025, 3:41 PM

#

Am I able to use Stable Diffusion to make image to gif (while keeping the transparent background)?

craggy crest Aug 18, 2025, 4:51 AM

#

viral moon Am I able to use Stable Diffusion to make image to gif (while keeping the transp...

stable diffusion just creates the image. it'll depend on the interface you run it in whether you can have transparency or not, and export it out as a .gif with transparency

raven elm Aug 19, 2025, 3:15 AM

#

raven elm Aug 19, 2025, 3:32 AM

#

errant dust Aug 20, 2025, 10:36 PM

#

craggy crest stable diffusion just creates the image. it'll depend on the interface you run i...

There are plenty of free online tools to remove extraneous background to transparency. FOr most use cases they are perfectly fine. If you want or need really detailed work, then experience and a tool such as Photoshop of Affinity Designer are the way to go for now.

craggy crest Aug 20, 2025, 10:39 PM

#

errant dust There are plenty of free online tools to remove extraneous background to transpa...

i know, but @viral moon was asking if he could use stable diffusion to make transparent gif's

errant dust Aug 20, 2025, 10:40 PM

#

I had understood, but in case he felt stymied by its inability to do so, I was offering up solutions.

serene fiber Aug 21, 2025, 6:51 AM

#

#🆕｜sd3

dry wave Aug 21, 2025, 9:00 AM

#

stable diffusion cannot do transparency cause the vae has no alpha channel. So SubtleOne is right: you need an extra tool to remove the background

raven elm Aug 26, 2025, 12:56 AM

#

chilly igloo Aug 29, 2025, 7:44 AM

#

#🆕｜sd3 Manga style, black-and-white ink, dramatic contrast, cinematic angles. Sequential panels, consistent characters, tense horror-thriller mood. Silent library, frustrated writer, masked killers, surreal ending. Each page shows panels with continuous story flow.

Page 1 – Library
2 panels: vast empty modern library, tall shelves, rows of tables; closer view of books and dust in silence.

Page 2 – Writer
6 panels: close-up of man (30s) writing furiously; pen in hand; wide shot alone at table with books; messy scribbled handwriting; crumpled paper; shadowed angry face.

Page 3 – Intruders
4 panels: library doors open, masked men enter; close-up of cold eyes; killers moving between tables; man tapping desk, killer behind.

Page 4 – First Kill
3 panels: disruptive man tapping; killer grabs his hair; throat slashed, blood on table.

Page 5 – Girl
4 panels: young woman gasps; killer covers her mouth; “shhh” gesture at Keep Silence poster; silenced pistol shot, she collapses.

Page 6 – Writer’s Rage
3 panels: writer slams fist; killer behind with knife; suspenseful knife over him.

Page 7 – Break
4 panels: writer rips page; killers vanish, library empty; writer breathing heavy; fist smashing wall.

Page 8 – End
3 panels: glass door shatters; writer crushed under shards; close-up of shard with “Do Not Disturb, Keep Silence.”

craggy crest Aug 29, 2025, 10:53 PM

#

you can't generate in this channel AND you can't give the AI a script and expect it to create a movie or something

errant marsh Aug 31, 2025, 2:44 PM

#

girl

#

#🆕｜sd3 一个小女孩端着咖啡，微笑着面对着我

#

#😊｜co-creators white girl

upper rivet Aug 31, 2025, 3:13 PM

#

errant marsh <#1230206273451069540> 一个小女孩端着咖啡，微笑着面对着我

You can't create directly here on discord.
You can do that on your own stable diffusion env on your computer. 🙂

raven elm Sep 1, 2025, 1:42 AM

#

humble kelp Sep 2, 2025, 9:32 AM

#

??

charred vale Sep 4, 2025, 9:29 AM

#

一个小女孩端着咖啡，微笑着面对着我

upper rivet Sep 4, 2025, 11:30 AM

#

charred vale 一个小女孩端着咖啡，微笑着面对着我

You can't generate image directly here.

raven elm Sep 5, 2025, 5:49 AM

#

Kim jung gi style

scarlet canopy Sep 6, 2025, 1:59 AM

#

#🆕｜sd3 ccc

craggy crest Sep 6, 2025, 2:23 AM

#

scarlet canopy <#1230206273451069540> ccc

you can not generate in this channel

scarlet canopy Sep 6, 2025, 2:31 AM

#

What channel do I choose and how do I start writing the prompt because I tried # and I also tried /

scarlet canopy Sep 6, 2025, 2:32 AM

#

craggy crest you can not generate in this channel

What channel do I choose and how do I start writing the prompt because I tried # and I also tried /

craggy crest Sep 6, 2025, 2:34 AM

#

scarlet canopy What channel do I choose and how do I start writing the prompt because I tried #...

first of all, did you read the information in #artisan-faq

raven elm Sep 8, 2025, 2:01 AM

#

split bramble Sep 12, 2025, 6:26 PM

#

raven elm

rustic crown Sep 13, 2025, 6:26 AM

#

The terrifying office of China's cattle and horse employees

true jay Sep 13, 2025, 12:52 PM

#

Yo guys where to generate images
I’m new

runic tusk Sep 13, 2025, 2:07 PM

#

You don't. Unless you want to pay. Or you do it on your own computer.

solemn lintel Sep 17, 2025, 7:19 PM

#

#

#

raven elm Sep 18, 2025, 2:02 AM

#

faint rock Sep 18, 2025, 3:22 AM

#

true jay Yo guys where to generate images I’m new

i'll tell you,

narrow hawk Sep 18, 2025, 1:41 PM

#

raven elm

What type of prompt are you using? These are gorgeous

gleaming swift Sep 19, 2025, 3:36 AM

#

1

deft badge Sep 20, 2025, 11:40 AM

#

Where do you generate the images? Cloud or your Pcs?

livid rose Sep 20, 2025, 2:34 PM

#

I generate on my PC, then post the results.

drifting hull Sep 23, 2025, 7:54 AM

#

craggy crest you can't generate in this channel AND you can't give the AI a script and expect...

before it was possible to generate with discord right? can you tell me why its not possible now.

weary crystal Sep 23, 2025, 8:36 AM

#

drifting hull before it was possible to generate with discord right? can you tell me why its n...

I guess you mean generate for free. Otherwise the #artisan-faq artisan channels are still there. In the beginning and for testing purposes when new models appear there where some beta channels open and for free.
But without the purpose of beta testing giving away expensive GPU Calculation Time for free does not sound like a good business model

raven elm Sep 24, 2025, 1:46 AM

#

final fjord Sep 24, 2025, 4:08 AM

#

que sucede

raven elm Sep 25, 2025, 8:01 AM

#

bitter hearth Sep 27, 2025, 2:45 PM

#

errant dust Sep 28, 2025, 9:37 PM

#

Any thoughts on the monster new release?

#

https://huggingface.co/tencent/HunyuanImage-3.0

tencent/HunyuanImage-3.0 · Hugging Face

#

Probably impossible to run locally for now, but still the biggest OS image generator to date in terms of sheer size

#

It is MOE though, so maybe I will be wrong

#

Right now my fav local model is that new Flux out of the box. The big Qwen was ok, but didn't wow me

#

"The Largest Image Generation MoE Model: This is the largest open-source image generation Mixture of Experts (MoE) model to date. It features 64 experts and a total of 80 billion parameters, with 13 billion activated per token, significantly enhancing its capacity and performance."

#

"Our model can effectively process very long text inputs, enabling users to precisely control the finer details of generated images. Extended prompts allow for intricate elements to be accurately captured, making it ideal for complex projects requiring precision and creativity."

cunning lintel Sep 29, 2025, 1:00 PM

#

errant dust Any thoughts on the monster new release?

To me the new model is complicated 😵‍💫

#

In a way, it's what i hoped to see after SDXL, what SD3 and later models were supposed to be, it follows prompts and doesn't override styles with pre-baked crap

#

But it also has more errors, i tried it out on tencent site, let it create 4 gens, a few are always plain unusable bad, 3 arms like bad, but others are nice. And there's variety in outputs

errant dust Sep 29, 2025, 1:04 PM

#

Fal has it to test, but it is pay to use, which is fair, except I cannot imagine myself paying to use it when there are literally a number of free private ones such as Nano Banana, nevermind ones I can run on my own machine like Flux Krea or SD3.5 L.

cunning lintel Sep 29, 2025, 1:04 PM

#

it's free here https://hunyuan.tencent.com/modelSquare/home/play/d3d6bl42c3m83jodcrrg?modelId=289&from=open-source-image-zh-0 (sign up with email in the third tab)

腾讯研发的大语言模型

errant dust Sep 29, 2025, 1:05 PM

#

I went there and they wanted me to sign up with WeChat

#

which i do not have

#

and am certainly not going to install for this

cunning lintel Sep 29, 2025, 1:05 PM

#

there's three tabs at the top, the last is email

errant dust Sep 29, 2025, 1:06 PM

#

ok, so your impressions are that the results do not match the hype

cunning lintel Sep 29, 2025, 1:07 PM

#

cunning lintel Sep 29, 2025, 1:08 PM

#

errant dust ok, so your impressions are that the results do not match the hype

i like it, it's the first model where things look nice again after 3.0 / 3.5 🙂 But i'm a sucker for fine textures, half my prompts use the word etching or parchment

errant dust Sep 29, 2025, 1:09 PM

#

ok, I entered, let me try something simple, but offbeat

cunning lintel Sep 29, 2025, 1:10 PM

#

it's just well, it has issues (the foot became a paw, double trident, but it's also the first model where the bull and god is actually seemingly made of water)

#

Things like this i never managed with qwen and hardly with flux

#

and it understands "In a warm, sun-drenched Japanese classroom, a bright-eyed, cherry-blossom-haired schoolgirl named Sakura** playfully twists a lock of her hair between her lip and nose, creating a makeshift mustache that** makes her giggle uncontrollably, as her friends look on in amusement, by renowned anime artist, Hirohiko Araki."

errant dust Sep 29, 2025, 1:17 PM

#

well, for a comic rendition of a Gnoll with a sword, it actually did a decent job.

#

A powerfully built gnoll, resembling an upright hyena, covered in short brown fur spotted in darker brown. It wears a short kilt and a hardened leather apron adorned with metal links and spikes. The gnoll holds a short sword, ready for a fight, with a 3/4 body view, showing its full body. Rendered in a classic 80s comic book style with strong, defined linework, and detailed rendering of textures and shadows.

#

It is not really 80s commic book style, but nevertheless solid details

#

#

#

it also nailed the 3/4 body view

#

for the graphics assets of my game, I tend to use Nano for its repeatability in style as well as unlimited free use

#

(a significant deal)

#

though for the starting asset Flux Krea has been great too

#

This however was better to be honest

#

How many generations can you get? I assume there is a daily cap

#

or weekly...

cunning lintel Sep 29, 2025, 1:21 PM

#

I haven't really used imagegen a lot recently, I just never liked the look of new models. Flux with lora and sd3.5 were the last I actually enjoyed using. Hunyan 3 is exciting to use like those, it feels like a throwback to styles/textures in a good way

errant dust Sep 29, 2025, 1:22 PM

#

Nano is not Imagen

cunning lintel Sep 29, 2025, 1:22 PM

#

Flux krea was a disappointment to me, the real krea had nicer outputs

#

I actually never used nano, only imagen.

errant dust Sep 29, 2025, 1:24 PM

#

Try Nano. Aside from being the top rated text to image generator on LM Arean, it has unique editing abilities no one has

#

Editing with it is done via prompt, but let me show you what I mean by unique

#

Here is a plain jane image, not made by Nano

#

#

simple enough, sunrise, pirate ship, etc.

#

Now I tell Nano: change the image to a sunset

#

Just that, no masking or anything else

#

#

It's insane

cunning lintel Sep 29, 2025, 1:27 PM

#

Yup that's good, i kinda stopped being interested in this editting after flux-edit

#

the pro version was nice, the dev version abysmal

errant dust Sep 29, 2025, 1:28 PM

#

You can ask it to take a person, or even those cartoons, and tell it to raise the arm, have him turn his head to the right, and it will all be perfect

#

fur, ears, everything

#

as if telling the model to move around for the next photo

#

Anyhow, that is why Nano is overall king for now

#

overall, not necessarily in each individual thing

#

to be clear

cunning lintel Sep 29, 2025, 1:30 PM

#

afraid i tried it on wrong thing, i tried to transfer style, that didn't go well :p

#

but yeah, haven't looked back since... i understand it actually can generate images too, which is nice

errant dust Sep 29, 2025, 1:30 PM

#

So suppose I like Hunyuan's core image. I could use it as the starting point and then have Nano make thge modifiucations

#

Nano is top rated on LMArena as I mentioned

#

I assume you know what LMArena is

cunning lintel Sep 29, 2025, 1:32 PM

#

yup, aware of it 🙂

errant dust Sep 29, 2025, 1:33 PM

#

so what are the limits in Hunyuan public use? DO you know?

cunning lintel Sep 29, 2025, 1:34 PM

#

cunning lintel it's just well, it has issues (the foot became a paw, double trident, but it's a...

nano, for made out of water, hunyuan wins

cunning lintel Sep 29, 2025, 1:34 PM

#

errant dust so what are the limits in Hunyuan public use? DO you know?

Nope, i fully expect to hit a wall anytime soon

#

just a random prompt i remembered models struggled with to make look natural, hunyan does a good job

Atmospheric wide shot in a dense, ancient forest under dappled sunlight. Large, incredibly adhesive spiderwebs stretch high between gnarled trees, their thick, glistening strands shimmering as they catch the light. A wild deer (doe) is visibly ensnared, its body tangled in the sticky webbing. Nearby, a young woman struggles against the webs, her clothing and hair tightly bound, her face showing distress and the sticky strands clinging to her. Eerie shadows. Highly detailed dark fantasy illustration.

#

nana banana kinda iffy first not much web, when i asked made it more entangled i got zombie 🤡

#

Where is SD 4 (i guess never, new sai doesn't seem big on open models or even new model dev for individuals (as opposed to enterprises)) 😢 Maybe it's because what i've seen/used first, but the SAI models have that something special (style/textures i just call it) newer models just haven't captured. hunyuan kinda seems to have as well, but it's early days.

muted cargo Sep 29, 2025, 2:14 PM

#

Sincere question. What would you expect SD4 to be ? What do you expect from it ?

errant dust Sep 29, 2025, 3:09 PM

#

Expect? Or want?

#

For me, 4 things:

Easily trained for LoRAa. Flux has an iron grip on this right now, and it is a big deal IMHO.
Stronger text handling. It can handle 2-3 words ok most of the time, but it is now lagging quite a bit behind its peers.
A larger more powerful model.
And please tone down the nanny police. Efforts to control such things are not only wasted, since it is literally the first thing targeted by others for removal, but it invariably has detrimental effects on general image production. It need not overtly allow sexual content, but nor should it feel like a 1950s movie censorship board.

#

Just my 2 cents

#

I really like SD3.5 L FWIW, but I tend to use Flux Krea for more consistent results and style. I can ask SD for an 80s comic books style, and it will deliver, but even with plenty of details, it is all over the place in the results. It is why I mentioned LoRAs. Someone is bound to want something that it doesn't handle well, anyhow.

muted cargo Sep 29, 2025, 3:29 PM

#

I don't see 4) happening anytime soon for any model released by any company. This kind of usage is bad press for the large audience. Moreover the easier it is to do this kind of content, the easier it is to abuse it. Add to that all the legal issues and stuff going on such as ID restriction getting introduced in some countries for that kind of content... And yeah... They pretty much have to do that kind of policing.

#

Otherwise yeah you pretty much expect it to catch up with others.

dry wave Sep 29, 2025, 5:14 PM

#

I mean, Flux is more or less SD4. I wouldn't expect a new successor of Stable Diffusion as all people who developed SD are now developing Flux

#

I think the reason why basically all new models. including Flux, have big issues with styles is because they are using T5 or other text-based models instead of the CLIP as in SD, and because they are trained on synthetic captions

#

SD 1.5 and SDXL were trained on ALT tags, so the image captions often contained hints regarding the style

#

newer models use VLLMs to caption the image, but VLLMs usually don't capture stylistic nuances. They know the difference between a "cartoon" and a "photography", but they barely understand differences between certain art styles. When they generate captions, the captions focus on the content and not on the style. Models are trained on these captions and never learn how to describe these styles via prompt

#

thats probably the reason why models like Flux can easily learn (via lora) a lot of different styles, but its hard or even impossible to reach these styles via prompt engineering

#

at least thats my theory 🤷‍♂️

#

unfortunately, styles are also a thing all the big companies are not interested in. Styles are often associated with specific authors, and everyone fears copyright issues. Furthermore, if you want to make money with image generation, you want to target the advertisement industry. For this, you don't need art styles

cunning lintel Sep 29, 2025, 6:06 PM

#

muted cargo Sincere question. What would you expect SD4 to be ? What do you expect from it ?

I'm afraid the answer is like "a better horse", i know what we have know, what i like and don't like but no idea what's possible.

But the reason I mention SAI models is that compared to others their outputs always felt less artificial, more fine details and textures, instead of overly smoother AI look. (after SDXL, i feel 3.0 (the API version) did this still well, but in 3.5 it suffered a bit, some styles just became much worse or flat out impossible, it felt more like exploiting clip's knowledge as opposed to having the model actually trained ion them). Maybe SAI has a really really good data set, better than what other models have been trained on (maybe cause it's older it just has less synthetic data).

Anyway, what i would hope is ,much, much better prompt following (also when things are off the beaten path), but not at the cost of style or variety, like many recent models. So good prompt following, wide range of style and fine details/textures. "Promptable" by just by using references, both images (like ideogram) and codes/hashes (artists seem a no-go anyway), my dream would be throw some images to it, extract a style hash that's a merge of styles in those images, kinda like a lora but instant. And detailed as in make the creature in ref style a, the other creature in a blend of a and b, the background in style c. I suppose that's already beyond simple image-gen and close to current instruction models, just also for style not just subject please.

On top of that consistency, which again would probably mean an instruction like model, that allows consistent subjects and environments in various styles / perspectives / angles.

cunning lintel Sep 29, 2025, 6:10 PM

#

dry wave at least thats my theory 🤷‍♂️

That's been my thought as well, and after that it's often "why not use that info from captions to make the model learn styles in a better way, including a way to get it out of the model". Obviously it's no that simple 😉

errant dust Sep 29, 2025, 6:20 PM

#

It is a lot simpler than made out, or all the other image generators would struggle just as badly. As to the devs who made Flux, well, there are more than those devs in the world. Whether or not Stability will actually develop a new model is entirely up to them, but producing a subpar model, relative to the existing ecosphere, with a list of reasons why it is subpar isn't going to cut it. There is no shortage of competition nowadays.

dry wave Sep 29, 2025, 6:22 PM

#

I don't say SAI cannot make another model. My point is, that such a model would have the name "stable diffusion" but it would be made by entirely different development team. From my perspective, a truly successor of stable diffusion xl/3 would be another flux model

#

cause for me its the people who count, not the brand name. That's said, it doesn't matter much anyways. I'm also happy to work with models like Qwen. The days where all good open models for image generations came from SAI are long over anyways

#

(although, sadly, most of the newer models like Qwen were mostly just copy&pasted SD3 or Flux models )

cunning lintel Sep 29, 2025, 6:28 PM

#

I feel data matters too, and wouldn't be surprised if SAI liberally scraped the internet and/or used screencaps for their models where newer models simply scraped ideogram/dalle/flux/MJ, ie already not the most finely detailed ai-slob.

#

I think we'll never know what a new SAI model would be like, in the end it doesn't matter a whole lot where new models come from, though i have to admit it feels a bit iffy so much is from china, i'd ike some western biased models too :p

dry wave Sep 29, 2025, 6:38 PM

#

of course they scraped the internet. But I would be surprised if flux is not just using the same data

cunning lintel Sep 29, 2025, 6:50 PM

#

You sure would think so, same team and all 😉 And yet, flux seems to have less knowledge, but i've also read it's the result of preference optimization, or the distillation. SAI outputs just usually appear less AI to me. Then again, i've convinced myself flux is good with hands cause they trained tons of hands to the point where children got adult hands ;p who knows what othewr optimizations were done.

errant dust Sep 29, 2025, 9:34 PM

#

I think the massive inFLUX (pun intended) of Chinese open source models is centered around two things:

It is the best way to get non-Chinese to use them. After all, if these were some models locked behind some Chinese ChatGPT equivalent, the use would be a fraction of what it might be.
There is a massive US vs China war on the AI front, and their efforts are very much in the good graces of the CCP. If you look at just the number of papers publlished on AI, China is actually ahead of the US in the last 12 months.

#

I mean frankly, it is much the same with local LLMs

#

The best right now is hands down the Qwen models by Alibaba

#

it isn't even close anymore

#

In fact, just now Qwen3 80b MOE was just released and it is an absolute beast

#

The commercial models by the US are still king of the hill by a good margin though. ChatGPT5 is no.1, and then it is between Claude and Gemini. So the Open Source front is where they have the most chances to shine

fathom prawn Sep 30, 2025, 2:10 PM

#

I have a question what model or lora could be closes for this art style ?

AJfQ9KR8bp3cYe7rZ9lv1oPcxcyvyA0pPDsffxR_KadjUendJkgcVh_4sJRhgnBf_Y4L8h_Afuqp5YszColnpDENs1U6OU4DUUdIygHaOFh8fAws3UnhYek2jmQj7Ln48iNC5Sv5bY65G2dO6-mATGFBi70Du8NLPcNKsaK3szJOve1CKx62s1024.png

gaunt scarab Sep 30, 2025, 5:01 PM

#

A photorealistic masterpiece, shot on Arri Alexa, cinematic it, shaking his head furiously. The camera is handheld, with a slight, almost imperceptible shake, enhancing the raw emotion color grading. A bald dwarf who is an exact likeness of Cristiano Ronaldo is sitting on the floor of a bedroom of the scene. The visual style must match the provided reference images, focusing on gritty realism, deep shadows, and des. His face is a mask of visceral anguish and pure rage, with hyper-detailed, wet skin and realistic tears streaming down hisaturated colors.

#

a bald dwarf who is an exact likeness of Cristiano Ronaldo/imagine prompt: a cute pink chick with big muscles, wearing green kung fu clothes, 3d anime style, ultra realistic, cinematic lighting, standing in a dojo

vapid radish Sep 30, 2025, 7:23 PM

#

errant dust so what are the limits in Hunyuan public use? DO you know?

I'm not sure, I have made about 90 images for free with Hunyuan 3 today and I am still going.

#

I have been experimenting with upscaling Hunyuan 3 images with Qwen as I think 1024x1024 is way to low res.

#

#

#

full minnow Oct 2, 2025, 6:51 AM

#

想問有沒有人是用中國的模型會不會有被閹割之類的狀況呀
I'd like to ask if anyone has used the Chinese model and if they have been castrated or something like that.

full minnow Oct 2, 2025, 6:52 AM

#

full minnow 想問有沒有人是用中國的模型會不會有被閹割之類的狀況呀 I'd like to ask if anyone has used the Chinese model...

There are difficulties in generating R18 content.

stoic salmon Oct 6, 2025, 6:06 PM

#

Create a modern scientific laboratory scene with clean white counters, chemical storage cabinets, safety signage (like PPE reminders and hazard symbols), and realistic lab equipment such as microscopes, beakers, and fume hoods. Include subtle lighting and a slightly dramatic tone to suggest a challenge or escape room atmosphere. The layout should be modular and clear, suitable for overlaying interactive hotspots or puzzle elements.

faint vault Oct 8, 2025, 4:34 PM

#

create a future robot on a new earth

tranquil vector Oct 8, 2025, 7:29 PM

#

hi can i use SD3 to edit the color grading in a photo i upload?

devout schooner Oct 11, 2025, 9:50 PM

#

I really hope someone deep dives into what the heck happened with the SD 3 / 3.5 arch one day
this is SD 3.5 Large on the left and SD 3.5 Large Turbo on the right, same seed, same prompt
I never believed any of the issues even with the original SD 3.0 had anything at all to do with "censorship" but like rather
there's definitely some really weird deeper technical issue problem caused it to be the case that distilling 3.5 Large into 3.5 Large Turbo actually significantly IMPROVED the coherency and pixel resolve (and almost completely elimated the strange edge artfacting problem) as opposed to the opposite (and no one will deny this is the case if they actually do enough seed-to-seed comparisons between the two, I promise)
there's numerous questions to be asked here no one has ever answered to date joeshrug

dry wave Oct 12, 2025, 6:46 AM

#

turbo variants are often "better". Same happens for SDXL. I think they do some dpo with the distilling

unique sigil Oct 12, 2025, 11:22 AM

#

How to generate image?

raven elm Oct 13, 2025, 1:22 AM

#

potent inlet Oct 22, 2025, 11:34 PM

#

I installed SD 3.5 Large but run in error, I think my method is wrong, could some point to the correct way to download and install?

muted cargo Oct 23, 2025, 8:24 AM

#

potent inlet I installed SD 3.5 Large but run in error, I think my method is wrong, could som...

try asking in #🤝｜tech-support

potent inlet Oct 24, 2025, 5:17 AM

#

muted cargo try asking in <#1002602742667280404>

Ok thank you for your reply and sorry for the wrong post

languid blaze Oct 25, 2025, 8:36 AM

#

Photorealistic rendering of the letters why4e, make the letters readable but broken, like the wreckage of a spaceship, with a dark, gloomy space background, traces of a dying explosion

calm thistle Oct 25, 2025, 4:06 PM

#

/ generate promt: dark contrast noir photo realism with detective and ufo

#

photograph of [object], [details], [environment], professional photography, 50mm lens, f/1.8, natural lighting, high resolution, sharp focus, detailed texture

jagged gate Oct 25, 2025, 5:44 PM

#

jade lion Oct 27, 2025, 9:08 AM

#

full minnow There are difficulties in generating R18 content.

You have to look for nsfw variants then. Check encoders and so on. At least I found things for Wan but haven't found any practical use for it with my limitations.

severe prism Nov 1, 2025, 4:03 AM

#

𝙃𝙚𝙡𝙡𝙤

astral lantern Nov 5, 2025, 12:04 PM

#

animation

torpid marlinBOT Nov 8, 2025, 9:49 PM

#

how to generate images with propmts?
No data source is currently selected. Please choose a data source from the dashboard and try again.

willow oar Nov 9, 2025, 2:38 PM

#

paint a rabbit

summer ginkgo Nov 9, 2025, 4:25 PM

#

🐇

thorny stream Nov 9, 2025, 4:56 PM

#

Paint a crab

summer ginkgo Nov 9, 2025, 5:24 PM

#

🦀

warm hollow Nov 11, 2025, 3:53 AM

#

paint a rabbit

south creek Nov 13, 2025, 11:09 PM

#

Draw an orc fighter

bitter hearth Nov 14, 2025, 8:49 PM

#

draw me

chilly storm Nov 15, 2025, 1:37 AM

#

/genrate test

summer ginkgo Nov 15, 2025, 2:32 AM

#

[F]

noble sparrow Nov 17, 2025, 6:53 AM

#

how to create image with an existing image?

errant dust Nov 17, 2025, 12:32 PM

#

Probably use something like Nano Banana and tell it the modifications you would like

#

All is quiet on the image front I guess. for my own ends, for pure creation I think Hunyuan has the edge (though it is impossible to run locally, being much too big).

#

For editing Nano is best

#

at least for large transformations

livid rose Nov 18, 2025, 2:53 AM

#

@gilded stone Instead of supplying empty latents to the KSampler, use a vae encode node to convert your reference image to latents. Connect those latents to the Ksampler. Then lower the denoise to 0.5-0.8. Your original image will shine through, modified by your prompt.

quick pelican Nov 19, 2025, 1:56 AM

#

i've been doing some analysis/testing of sd3.5 large over the past months, it seems something really nasty happens at MMDiT block 35. best i can tell, block 35 has the strongest influence on making the speckled greebled texture that's common with outputs. maybe the growing values also have something to do with the poor quality? (i'm not studied enough in the math at play to make a full evaluation of it)

(attached img: this graph is a single step (0 of 24) of sd3.5 large under bf16 unet)
under fp16 unet, which is what comfyui runs as default with the sd3.5_large.safetensors file, l2 hits inf

quick pelican Nov 19, 2025, 6:43 AM

#

(@devout schooner, you mentioned you were interested in some analysis on sd3)

sacred shard Nov 25, 2025, 12:53 PM

#

quick pelican i've been doing some analysis/testing of sd3.5 large over the past months, it se...

For some reason, I'm not convinced yet that that is the primary or direct cause of the speckled visual effect on outputs.

If that is the case, and if it was easy to isolate this issue, we would likely have had a new custom model available by now that could counteract this issue. I agree, I've noticed a strange effect with that specific latent. However, it's difficult to say with confidence what the exact cause is, given the complexity of these algorithms.

sacred shard Nov 25, 2025, 5:26 PM

#

@quick pelican

quick pelican Nov 25, 2025, 6:37 PM

#

I partially drew that conclusion from using the skip layer guidance sd3 node, where 35 had the biggest reduction in that pattern, I'd have to post some examples of it.

I have other test situations where I've seen higher quality results, like using 1344sq resolution, or disabling bias on various modules. I can post some of those if you're interested

sacred shard Nov 26, 2025, 1:40 PM

#

quick pelican I partially drew that conclusion from using the skip layer guidance sd3 node, wh...

It's plausible that the cause is latent 35, all I'm saying is that I'm not yet convinced it's the most probable cause. Also, it makes sense to me that skipping the first and last stages of a diffusion process would cause quality improvements, although it should be worse on average given the greater noise present.

I'd be interested in learning more about this as I'm actively doing research in this area still, but we should really have a real conversation, at least for a moment if that would be fine with you.

#

I don't doubt it's involved, but I'd be curious to know what other tests would determine its primary cause, I suspect the cause could be in the algorithm itself rather than in a latent and the issue is only exacerbated by block 35.

sacred shard Nov 26, 2025, 3:46 PM

#

@quick pelican

quick pelican Nov 27, 2025, 2:40 PM

#

yeah, you can dm me or continue here. I've been busy/symptomatic lately, apologies for the delay in posting more info

hallow lion Nov 28, 2025, 4:14 PM

#

Imagine out of the blue stable diffusion drops SD4. XD

quick pelican Nov 28, 2025, 5:45 PM

#

there's certainly a new framework they could use
https://arxiv.org/abs/2510.02300
i wonder if it'd prove to be better than flow matching in large datasets/models

arXiv.org

Equilibrium Matching: Generative Modeling with Implicit Energy-Base...

We introduce Equilibrium Matching (EqM), a generative modeling framework built from an equilibrium dynamics perspective. EqM discards the non-equilibrium, time-conditional dynamics in traditional diffusion and flow-based generative models and instead learns the equilibrium gradient of an implicit energy landscape. Through this approach, we can a...

rustic bramble Dec 2, 2025, 8:29 PM

#

anyone know how to get u_t ^{theta} from flux api ?

#

or flux mini

#

im trying to setup post training experiments

upbeat girder Dec 3, 2025, 9:42 AM

#

#🏞｜general-with-images cat

bitter hearth Dec 4, 2025, 2:04 PM

#

hi

sacred shard Dec 4, 2025, 3:08 PM

#

rustic bramble anyone know how to get u_t ^{theta} from flux api ?

Are you asking about neural networks?

This is not the correct forum to ask that, you should go to a programming community or a dedicated Flux forum for better advice and expertise.

#

@rustic bramble In the past, in a conversation about machine learning, I found a lot of the people I was talking with couldn't give me a good explanation of how back-propagation works. What is your level of technical knowledge around neural nets?

#

It depends on what you're using. What library are you using? Are you using TensorFlow? Pytorch? Are you a PhD student or is this for your day job?

#

Are you using PyTorch? Tensorflow? Keras? Do you want U theta to be a weight in the net? A gradient?

rustic bramble Dec 4, 2025, 5:23 PM

#

sacred shard Are you using PyTorch? Tensorflow? Keras? Do you want U theta to be a weight in ...

by utheta i mean just the vectorfield representation of the ODE

#

and not score parametrization

sacred shard Dec 4, 2025, 7:32 PM

#

rustic bramble by utheta i mean just the vectorfield representation of the ODE

I don't really know what you're talking about. Are you working through a research paper or something?

I am familiar with the mathematics of gradient descent and back propagation in the context of neural net training, but I don't know what "vector field representation of the ODE" means.

#

It's also not that surprising that most people you talk about aren't able to give you technical details on how neural net training works. Neural net training is highly abstract and complicated, I've seen a lot of PhD students and people of that level struggle with it.

#

If you're talking about a system of autonomous ODEs, I have also studied that, but if you're taking about the use of ODEs in a neural network I am not familiar with that idea.

sacred shard Dec 5, 2025, 7:59 AM

#

@rustic bramble

dry wave Dec 5, 2025, 11:27 PM

#

lol, is he a bot?

dry wave Dec 5, 2025, 11:46 PM

#

Alexander. Sometimes it's hard to say if someone is a bot or just not a native speaker 😅

sacred shard Dec 6, 2025, 2:09 AM

#

dry wave Alexander. Sometimes it's hard to say if someone is a bot or just not a native s...

This is your own epistemic bias at work, my guy. And the reason you believe it sounds like AI nonsense is because you've trained your perception to flag certain lexical patterns as synthetic.

sacred shard Dec 6, 2025, 3:00 PM

#

@dry wave

#

@dry wave Yeah, you obviously pretend you didn't saw this dude.

muted cargo Dec 8, 2025, 11:53 PM

#

let's not go that way.

sacred shard Dec 11, 2025, 5:58 PM

#

@muted cargo Bruh

urban arch Dec 12, 2025, 4:48 PM

#

@Mods?

pure yacht Dec 19, 2025, 12:41 PM

#

#🆕｜sd3 cat

hallow lion Dec 23, 2025, 6:50 PM

#

oh noes... i remember when this place was so active lol

#

Emad do something! yeah, yeah i know he doesnt work there anymore

ancient folio Dec 24, 2025, 12:26 PM

#

catlurk

snow echo Dec 31, 2025, 7:07 AM

#

#🆕｜sd3 rat

hallow lion Jan 3, 2026, 3:01 AM

#

https://tenor.com/view/palla-deserto-desert-hot-gif-9654851992494180531

Tenor

calm parcel Jan 3, 2026, 2:37 PM

#

I'm trying sd3-Turbo on my AI Platform for simple picture generation. I want to keep it cost effective for after the rollout in April. I am getting wolves with three ears, two headed bunnys, dragons that are breathing fire not from the correct end. Is there a more ... responsive version?

calm parcel Jan 3, 2026, 3:22 PM

#

Okay ... all 20 pictures were complete failures. I'll switch to a different model, perhaps their flagship version, been around and tested longer. But this needs to not be a thing.

fierce heath Jan 7, 2026, 9:56 AM

#

serene lantern Jan 10, 2026, 4:35 PM

#

A woman reclines across a slab of warm stone near the edge of an abandoned quarry at night. Her upper body rests on one elbow, legs bent with grace. Her bare skin catches scattered beams of light that fall from a distant industrial lamp. A long piece of translucent fabric runs beneath her, catching subtle folds of shadow.

Technical Notes:

Lens: 50mm, aperture f/2.0

Lighting: Hard backlight + diffused fill from below

Camera Angle: Side-profile at ground level

Color Tone: Cool with amber accents

Atmosphere: Quietly mysterious, cinematic

barren rock Jan 10, 2026, 11:46 PM

#

Hi someone help me pls?

#

I dowloaded forge webuii but when i want to generat image i get a error like this

#

runic tusk Jan 11, 2026, 12:19 AM

#

Google exists:
https://www.google.com/search?q="runtimeerror%3A+cuda+error%3A+no+kernel+image+is+available+for+execution+on+the+device"&oq="runtimeerror%3A+cuda+error%3A+no+kernel+image+is+available+for+execution+on+the+device"&gs_lcrp=EgZjaHJvbWUyBggAEEUYOdIBCTIyNzk4ajBqN6gCALACAA&sourceid=chrome&ie=UTF-8

barren rock Jan 11, 2026, 12:20 AM

#

Yeah i know that. But i don’t know how to solve this problem

runic tusk Jan 11, 2026, 12:40 AM

#

Hence the use of Google to read and apply potential troubleshooting steps and solutions.

barren rock Jan 11, 2026, 12:46 AM

#

runic tusk Hence the use of Google to read and apply potential troubleshooting steps and so...

I’m not that smart as you i need more help

runic tusk Jan 11, 2026, 12:46 AM

#

You don't need to be smart, you need to read the links and do what they do to see if it works.

barren rock Jan 11, 2026, 12:49 AM

#

runic tusk You don't need to be smart, you need to read the links and do what they do to se...

Which one?

runic tusk Jan 11, 2026, 12:49 AM

#

Start with the first one, then go down from there if it doesn't work. This is how basic troubleshooting works. You try something, learn something, try a different thing if it doesn't work.

#

Literally everyone can do it.

#

Nobody is born with the solutions in their brain already.

#

I believe in you.

barren rock Jan 11, 2026, 12:53 AM

#

My English is bad, it would take a tons of time to read the links, my brain is already burning😩

calm lava Jan 11, 2026, 6:24 PM

#

I had this problem and decided to hire help from fiverr. Otherwise I would have been at it for months trying to troubleshoot. I watched the guy dealing with dozens and dozens of errors of various kinds. Had I not hired, I could have probably eventually gotten it working, but it was about getting it working in 6 hours vs 6 weeks or 6 months

#

6 hours because i had him install a whole bunch of things very specific. the basics would probably be a lot less

#

The downside is I didn't learn how myself, but I also work full time and have a bunch of other things going on so for me it's about time

summer ginkgo Jan 22, 2026, 6:46 PM

#

Try the pinned guides in #🤝｜tech-support

hallow lion Jan 26, 2026, 9:54 PM

#

barren rock

Use Forge Neo... Old forge man is no longer updating the stuff.

#

Aktivald a windozd bazdmeg! XD

cinder socket Feb 8, 2026, 1:11 PM

#

Hi! I train FLUX'SDXL Face lora for ur realistic AI influencers. Happy to share workflows, tips, and examples 😊
https://www.behance.net/gallery/243708697/Stable-AI-Influencer-Private-Flux-Face-LoRA

Behance

Natalie Mikie

Stable AI Influencer — Private Flux Face LoRA - Natalie Mikie

I create realistic AI influencers and stable AI identities.What I create:AI influencers for Instagram, TikTok, and X (Twitter)Digital personas for Patreon and OnlyFansLong-term AI characters for branding and monetizationRealistic AI faces for lifes...

shut talon Feb 14, 2026, 6:40 AM

#

a tree

ebon roost Feb 15, 2026, 6:29 AM

#

A dog

hallow lion Feb 17, 2026, 1:13 PM

#

Anyone else has the feeling that if sd3 was a good model it would have been very good?

lament rampart Feb 19, 2026, 5:58 PM

#

Hello guys please help me out with these images I have a very low PC I can't run local stable diffusion and I do not even know how to write this image high quality prompt please if anybody knows please help me out this it's sucks me from last two months please

long jasper Feb 28, 2026, 11:29 PM

#

lament rampart Hello guys please help me out with these images I have a very low PC I can't run...

Hi

I understand how frustrating that can be. If your PC can’t handle local Stable Diffusion, I can help you generate high-quality results using optimized workflows or cloud options no heavy setup needed on your side.

You can share the images or idea you have, and I’ll assist with professional prompts and output. Let’s get you unstuck.

tough viper Mar 1, 2026, 11:19 AM

#

marsh lintel Mar 11, 2026, 5:50 PM

#

livid rose Mar 13, 2026, 1:11 PM

#

#

#

north tide Mar 15, 2026, 12:50 PM

#

oh isnt cyberdoll tongue has like two halves?

#

or the lack of that line?

#

hmm.. cannot recall

marsh lintel Mar 16, 2026, 1:33 AM

#

livid rose Mar 17, 2026, 12:39 PM

#

#

#

marsh lintel Mar 17, 2026, 6:31 PM

#

eternal jasper Mar 18, 2026, 9:46 AM

#

blue moon

rugged bear Mar 19, 2026, 8:51 PM

#

Kann ich jetzt hier was erstellen

muted cargo Mar 20, 2026, 7:29 AM

#

rugged bear Kann ich jetzt hier was erstellen

#artisan-faq or use online services from reputable websites or create stuff locally

hallow lion Mar 22, 2026, 5:02 PM

#

moderate pls

cunning nebula Mar 23, 2026, 8:24 AM

#

#🏞｜general-with-images Cinematic, surrealist medium shot of a vintage 1970s cream-white sedan partially submerged in deep, dark teal water. Thousands of fresh, vibrant flowers—primarily ranunculus, dahlias, and baby's breath in shades of peach, soft pink, cream, and pops of orange—are overflowing from the car's windows and hood, floating out onto the rippling water surface. The lighting is moody and ethereal, featuring a soft misty glow from the background and shimmering reflections on the water. High-grain film photography style, 35mm lens, shot on Kodak Portra 400. Deep shadows, realistic water textures, melancholic yet beautiful atmosphere, hyper-detailed floral petals and rusted chrome accents

marsh lintel Mar 23, 2026, 3:51 PM

#

near sierra Mar 23, 2026, 7:36 PM

#

#🆕｜sd3 genearte an classic hyper realsitic image of burger eating a mna

#

hi

marsh lintel Mar 24, 2026, 12:57 PM

#

marsh lintel Mar 24, 2026, 4:41 PM

#

urban arch Mar 25, 2026, 3:39 PM

#

I don't know they're right.
I can't say they're wrong.

faint sand Mar 25, 2026, 8:58 PM

#

#🆕｜sd3 ⁠generate an human

summer ginkgo Mar 26, 2026, 1:07 PM

#

faint sand <#1230206273451069540> ⁠generate an human

faint sand Mar 26, 2026, 2:13 PM

#

Yo m’y bro how do you use sd?

muted cargo Mar 26, 2026, 2:39 PM

#

summer ginkgo

not gonna lie I thought you were one of those bots at first :p

muted cargo Mar 26, 2026, 2:40 PM

#

faint sand Yo m’y bro how do you use sd?

I mean ... what are you looking for exactly ? Can help on the details but it s hard to tell with such a vague question.
To use sd either you install it locally, you use this server #artisan-1 /2/3 channels or use one of the trusted online services to either generate stuff or rent a gpu to run your own client on it.

summer ginkgo Mar 26, 2026, 3:13 PM

#

muted cargo not gonna lie I thought you were one of those bots at first :p

Everyone does lol

urban arch Mar 26, 2026, 3:31 PM

#

It's what's for dinner.

faint sand Mar 27, 2026, 8:01 PM

#

Okay man thanks for the help 🙂

grand wyvern Mar 28, 2026, 8:20 PM

#

marsh lintel

stop bringing politics in this buddy

snow prairie Apr 2, 2026, 2:22 AM

#

grand wyvern stop bringing politics in this buddy

Is that a rule?

livid rose Apr 2, 2026, 12:47 PM

#

Of course it is, rule #7.

silent cedar Apr 2, 2026, 5:10 PM

#

Where bot?

tidal garnet Apr 3, 2026, 8:31 AM

#

Generate an anime picture, just a test.

#

:(

fair kettle Apr 14, 2026, 4:33 AM

#

/generate an human

summer ginkgo Apr 14, 2026, 1:30 PM

#

#

^ hooman

north rain Apr 14, 2026, 4:21 PM

#

/generate a bright brown alien face

#

dreamy palm Apr 16, 2026, 2:20 AM

#

⁠generate an human

warped lodge Apr 18, 2026, 5:28 AM

#

generate a bright brown alien face

#

#🏞｜general-with-images generate a bright brown alien face

#

/generate a bright brown alien face

north rain Apr 19, 2026, 12:10 PM

#

/generate blue alien head on white background --square

#

#

/generate alien landscape scribble, 8k, highly detailed, best quality -malformed hands

#

raven elm Apr 20, 2026, 7:48 AM

#

Found a solid free alternative: draw.freeforai.com. It's a web-based SaaS, so no login needed. Best part? Completely unlimited and free with no watermarks. Perfect for when you just want to whip up an image fast

potent prawn Apr 20, 2026, 10:27 AM

#

Dawn of the Divine Archer

south pendant Apr 21, 2026, 2:39 AM

#

https://vt.tiktok.com/ZS9Jkx4mp/

TikTok

‏TikTok · Ziraxo

شاهد الفيديو الذي أنشأه Ziraxo.

upper delta Apr 21, 2026, 1:54 PM

#

/generate alien landscape scribble, 8k, highly detailed, best quality -malformed hands

south pendant Apr 21, 2026, 6:56 PM

#

https://vt.tiktok.com/ZS9dJyjPX/

TikTok

‏TikTok · Ziraxo

شاهد الفيديو الذي أنشأه Ziraxo.

#

🙊

sturdy python Apr 27, 2026, 11:18 AM

#

can I take it for free jk 😝

urban arch Apr 27, 2026, 3:20 PM

#

That would certainly prove it's not spam. 😁

covert crown Apr 28, 2026, 1:45 AM

#

Generate a cartoon picture

summer ginkgo Apr 29, 2026, 2:18 AM

#

waxen sonnet May 5, 2026, 4:15 AM

#

：/imagine prompt: a cyberpunk cat with neon lights, wearing sunglasses, 8k, hyper-detailed --ar 16:9，

hallow zenith May 6, 2026, 5:11 PM

#

/generate imaginative photo of cryptoman dark blue on white background NeoBlessVerseGenerate

jolly drum May 6, 2026, 5:17 PM

#

hallow zenith /generate imaginative photo of cryptoman dark blue on white background NeoBless...

Doesn't work like that here clueless

hallow zenith May 6, 2026, 5:18 PM

#

/generate imaginative photo of man dark blue on white background NeoBlessVerseGenerate

hallow zenith May 6, 2026, 5:18 PM

#

jolly drum Doesn't work like that here<:clueless:894299439609565266>

And now

jolly drum May 6, 2026, 5:18 PM

#

hallow zenith And now

#artisan-faq

#

Most people generate locally here

#

I recommend using cloud services if you don't have a GPU such a civitAI or any of the comfyUI Integrated services thru API

#

Though those also cost money unfortunately

hallow zenith May 6, 2026, 5:22 PM

#

jolly drum I recommend using cloud services if you don't have a GPU such a civitAI or any o...

Appreciate it

soft lantern May 9, 2026, 9:55 PM

#

/generate image of donald trump dancing with benjamin netanyahu

jolly drum May 9, 2026, 10:07 PM

#

soft lantern /generate image of donald trump dancing with benjamin netanyahu

That doesn't work like that here

soft lantern May 9, 2026, 10:07 PM

#

i was joking lol

warm zealot May 16, 2026, 5:55 AM

#

正向提示词 (Positive Prompt):
A magnificent ancient oak tree, intricate bark textures, glowing moss and tiny bioluminescent mushrooms growing on the trunk, soft volumetric sunlight piercing through dense green leaves, cinematic sunbeams, floating dust particles and magical spores in the air, macro photography perspective, hyper-detailed, 8k resolution, photorealistic, masterpiece, depth of field, sharp focus on tree bark, rich textures, award-winning nature photography.

反向提示词 (Negative Prompt):
(worst quality, low quality:1.4), blurry, smooth texture, plastic look, deformed, out of focus, artificial, cartoon, drawing, illustration.