#🏞|general-with-images

fleet goblet
#

They have a channel for stable diffusion

#

👍

#

Found this image, wanted to test it with SV3D.

#

I like how every time he's half-turned you can slightly see a nose and mouth

tough pasture
#

I saw that they upgraded the capabilities for free ChatGPT users with search and GPT-4.
So I wanted to test if it had DALL-E 3. I asked if it could generate images, and it said yes. So I asked it for a green field with a giraffe.

ocean oyster
tough pasture
onyx isle
#

Rembrandt move on over... 😉

onyx isle
crisp stream
#

it can most likely run on 8GB, yet the question is how fast. With SD3 I'd recommend a 4080 (16GB) or 4090 (24GB) (even with SDXL)

nimble mason
marsh flare
#

generate an image of Sméagol

fleet goblet
fleet goblet
deft bison
mystic marlin
#

Dollar Tree Todd Howard after Xbox's Starfield DLC showcase.

thin compass
jovial tiger
nimble mason
#

luminosity channel shifted fractal noise

jovial tiger
#

kitchen sink sampler.

nimble mason
jovial tiger
nimble mason
#

luminosity dropped, and then shifted the cyan/red channel

mystic marlin
#

Might as well go for something really simple tonight. lol

stone trellis
#

Right in time for the release of SD3! A musical video made with SD and SD3 about the alleged superiority of Comfy! https://youtu.be/O3NzGSHjj4s

#aiart #stablediffusion #comfy #comfyui #stablevideodiffusion #stablevideo #imageai #videoai #aimusic #mistralai #udioai #blufftitler #parody #sarcastic
The Ballad of Comfy UI is a funny little video clip about the Comfy UI webui interface for Stable Diffusion, which is allegedly the superior interface !

All images and videos are AI made, gene...

#

All SD and SVD made, and the song "The Ballad of Comfy UI" is Udio & Mistral (with my assistance).

thin compass
#

\m/

nimble mason
late sorrel
#

nice stuff

thin compass
#

I did these last year, I don't think I need to say who these people are. 🙂

languid pebble
maiden thistle
#

A fresh light blue background, with hand-painted white clouds and water ripples, highlights the refreshing feeling. The slightly lower part of the center is a blue-and-white product map, surrounded by hand-painted pimples and disappearing effects, with some hand-painted plants. In the upper right corner is the red "618 Special" hand-painted label, and the bubble pattern adds vitality. At the top of the product map, write "Use salicylic acid to get rid of acne without trouble!" The overall style of the publicity copy is refreshing, natural and vivid.

crisp stream
snow pawn
#

Hello

#

/img

clever oar
#

hi

olive fjord
#

Hi everyone, is there some way to add the bot to my own server?

shut sinew
#

@deep gust

deep gust
shut sinew
#

And I did not cherry pick, this was the first and only one

wispy nest
#

what the fridge is this

random root
#

@shut sinew create an image with cat

clever oar
languid pebble
#

My first SD3 picture 😄

#

Not really used to swarm ...

#

Ahhh... disable regular Sampler 😄

shut narwhal
#

Good evening, can you please tell me why some checkpoints generate images of this type? Is something missing from the settings?

clever oar
#

mojo

#

is swarm the best way to use sd3?

languid pebble
clever oar
#

my sd audio trance cattitude

languid pebble
#

cool

clever oar
#

thx

#

I thought it would be loopy and monotonous, but it turned out dynamic

languid pebble
#

Bad luck most of my songs are German ...

clever oar
#

the problem is it's not fully free

languid pebble
#

Same with Suno ...

clever oar
#

can suno do sound only, without voice?

steel apex
languid pebble
clever oar
#

I wonder what video card is needed to run this locally

#

I didn't find an online version

languid pebble
#

Haven't searched for that yet ...

pallid ruin
clever oar
#

LeoB

#

HELLO

pallid ruin
#

Hello! 🤗

clever oar
#

😄

languid pebble
pallid ruin
#

And even cheaper if nvidia rushed to launch its 5000 series..... I'm still waiting 😡 😭

languid pebble
#

5000 could be a bit over 🙂

#

Using my 4090 in energy saver mode most of the time ....

clever oar
#

I have very expensive electricity

strange island
clever oar
#

yoga

languid pebble
#

There must be a guy behind her 😄

frozen nebula
#

Same prompt in SDXL vs SD3, tell me again that SD3 has better prompt understanding...

languid pebble
#

We should know the prompt 🙂

old pike
#

(and which one is which... lol)

topaz sinew
#

🤤

frozen nebula
# old pike (and which one is which... lol)

Sadly the good one is SDXL, the one on the right is SD3... now sure it might be a prompt building thing, the prompt was - "Full body shot of a werewolf queen ((powerful dominant female werewolf)) with her pack of werewolves ((partially transformed werewolf queen, chimera woman half human half wolf)), dense realistic fur, body hair, strong woman, macabre, odd, weird, amateur photography, vox pop, photojournalism, real image, highly detailed, (volumetric lighting) (photo:1.15) skin blemishes, (soft glow, hazy real camera image, real texture, grainy, gritty:1.1) blemishes, imperfections, interesting contrasting color palette, F/2.8, film grain"

#

But out of the gate SD3 is not great, it's 'slightly' better than SDXL was at photorealism, but it completely lacks all the hyped ability to make artistic shots imo

languid pebble
#

Maybe put the chimera thing in front ....

frozen nebula
topaz sinew
#

is this the base SDXL model or a finetuned one? I would compare the two base models.

languid pebble
#

Base models have never been really satisfying ...

frozen nebula
#

And the way it handles latent noise seems strange, a lot of the tricks i use with that for SDXL simply don't work for Sd3 as it sort of ignores base noise differences

topaz sinew
#

thinking is way way far from knowing. I would compare the base two models. I am not an expert at all. thank you for replying and testing it. 👍

languid pebble
frozen nebula
languid pebble
#

It's day 1 ... I'm amazed we can already use it locally ...

#

But some things are a bit strange ... the fingers for example

frozen nebula
#

Fine tunes will be the thing, but nobody knows how easy or hard they are to do yet

languid pebble
#

I'm also surprised that it's only 4GB 🙂

fleet goblet
#

Would anyone mind passing me a working SD3 workflow for comfyUI? Just a base one.

#

I need to see if Intel Arc is operational with the current model.

fleet goblet
#

VRAM usage is great, too.

languid pebble
#

Cool!

fleet goblet
scenic yew
languid pebble
fleet goblet
languid pebble
#

AFAIK there's no other at the moment ... ?

fleet goblet
#

Ye, all three are the same model with their clip encoders embedded. You just download the base and use the "TripleCLIPLoader" node in ComfyUI instead

#

Takes me a minute and a half to get an image.

languid pebble
#

Space for improvement ... 🙂

scenic yew
scenic yew
# frozen nebula Sadly the good one is SDXL, the one on the right is SD3... now sure it might be ...

you simply aren't prompting it right. it's way, way different than sdxl prompting, you need to use sentences. for example this (it's llm enhanced):

A stunning photorealistic depiction of a dominant female werewolf, half-human and half-chimeric, commands attention. Her chiseled physique, replete with textured skin and flowing locks of golden hair, radiates power and authority. Surrounded by a pack of strong, masculine werewolves, each one imbued with realistic vitality and movement, the scene pulses with life and energy.

frozen nebula
#

Having experimented more, the problem seems to be that the base workflow is just not very good in comfyui

#

If you throw the steps up to 80+, remove the weird negative conditioning, and drop the model sampling, it starts to look much better

scenic yew
#

80+ yikes, that will take forever no?

#

i just did it with 25 and changed absolutely nothing. and yeah it is still not as good as a realism trained model. This is 100% expected. Even in llms, a small fine tuned llm beats much larger llms in a specific task.

Sdxl and sd3 are somewhat similar in size and still sd3 puts up a pretty good performance

fleet goblet
#

It takes me a minute and a half to generate a single SD3 image.

#

I wonder how fast a 3060/4060 generates an SD3 image.

frozen nebula
scenic yew
#

oh yeah thats pretty impressive

frozen nebula
#

And removing your prompt just using my original word salad sdxl prompt, gets this

#

Which honestly is damn impressive, SDXL cant even get close to that out of the box

#

(base model of course)

#

I am 100% convinced most of the issues are sampler/steps/scheduler based

scenic yew
#

yeah i think my prompt made the werewolf a bit too human

alpine furnace
#

side landscape view of white castle countryside mansion with long futuristic glass car garage on the left, top pointed towers on the main house

frozen nebula
#

Here is the same prompt using the base workflow the huggingface repo suggests (base on left). I think the base workflows are removing a lot of the artistic elements and making it look a bit fake/digital. Anatomy seems better using the higher settings as well.

alpine furnace
#

side landscape view of white castle countryside mansion with long futuristic glass car garage on the left, top pointed towers on the main house

frozen nebula
fleet goblet
#

Keeping Clip_L and Clip_G text empty allows a solid colourfill background character image

#

which is nice

viral widget
#

SDXL on the right. Too bad SD3 can't combine stuff like this.....

#

but stuff like this definitely works way better in SD3 ! SDXL just mixes up everything

crisp stream
glossy herald
viral widget
fleet goblet
reef moth
jagged badger
#

New model may be "smarter" and finally getting words right (for most of the time) is ok, but still anime XL + alchemy is the killer for me

#

got this one from phoenix

#

luckily was able to fix it with killer combo

#

sorry, didn't mention it's leonardoAI models

reef moth
#

can't wait for some SD3 finetunes to come.
Out of the Box can work for some things though

fleet goblet
#

So I have SD3 fed into SV3D for model gen

#

What would be the best node to use to convert the multi-image output SV3D utilizes into a usable model?

crisp stream
#

Cascade (Invictus Redmond Checkpoint)

crisp stream
pure monolith
#

can sd3 do "a cow sitting on a frog", by default? Can someone test it?

crisp stream
crisp stream
pure monolith
#

@pallid ruin very close. but in most images iv seen, it looks more like the cow is behind the frog doing something questionable. pretty good though.

sinful lagoon
pure monolith
#

Maybe you didnt use enough steps. They are still in the stage of trying to figure it out. The frog looks ready though.

toxic trail
#

anyone need a job?

I need someone to do my images for my history channel on YouTube

Feel free to dm

faint silo
mild jay
#

ive been messing around with the pixart huggingface space just a tad bit. how do these 2 images look?

deft bison
stone trellis
stone trellis
deft bison
stone trellis
#

I don't understand why all new models still are trained in 1024x1024. 16:9 is more appealing and video AI is coming soon.

crisp stream
nimble mason
#

a woman lying in the grass: sd15 base, native 1024x1024 generation

crisp stream
crisp stream
#

I very much like Cascade even if it is less versatile in terms of compositions; for example, it's often repeating similar settings/perspectives, yet input images can help as well (not used with the recent ones here)

languid pebble
south temple
steep sleet
#

Climate-Change had melted the Polar-Icecaps; so water was more abundant everywhere - leading to people having to "live-out-of-a-bathtub" forever more ... SD3

royal monolith
languid pebble
languid pebble
# royal monolith and?

Looking at the pictures from the SD3 Creators, I would've never prompted like that and don't get results as good with my old prompting style ...

royal monolith
languid pebble
royal monolith
languid pebble
#

I think Comfy is a bit different than Stable Swarm.... Can take a picture for you ...

royal monolith
#

this is SD 3 in comfy, sampler set to DDIM, Scheduler set to DDIM_Uniform, ModelSamplingSD3 set to 1, all the rest of the workflow settings at default. Prompt is a 1.5 prompt and is: Fluffy snooty elegant Muskrat with lush tightly curled hair, wrinkled nose, crinkled eyes, brown eyes, Octane Render, by Jasmine Becket-Griffith, by Daniel Merriam, by artist "Patrick Woodroffe"

#

the workflow i'm using is the comfy_example_workflows_sd3_medium_example_workflow_basic workflow available on the huggingface weights page

languid pebble
royal monolith
#

this workflow uses

river ruin
#

A single colored flowerpot decorated with similar lines

royal monolith
#

i can't use t5XXL_fp8 - my machine throws errors, i have to use 16

languid pebble
royal monolith
languid pebble
#

I never thought we would be able to create pictures from day 1 anyway. Now it needs the swarm intelligence.

royal monolith
#

same settings. another SD 1.5 prompt: wet shimmery hair, hyper-detailed, galaxy airbrush portrait; bee magic; mist, volumetric lighting

#

change the sampler to uni_pc and the scheduler to sgm_uniform

#

this model is a beast

languid pebble
#

For sure ... that you can have 3 different objects in a picture for example ...

#

Wouldn't have tried to prompt that in SDXL ... so need to relearn prompting for SD3

#

That's been my idea: SD1.5 or XL prompts work, but won't take you to best results ...

royal monolith
#

prompt: green cat on the left, red dog on the right, in the middle is a blue cube with a triangle on it

#

the only thing i did different between the images was change the value in the ModelSamplingSD3 node
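For context on what that value does: as far as I know, the number in the ModelSamplingSD3 node is a "shift" applied to the noise schedule, following the timestep-shifting formula from the SD3 paper. The sketch below only illustrates that formula under that assumption; it is not ComfyUI's code, and `shift_sigma` and `shifted_schedule` are hypothetical helper names.

```python
def shift_sigma(sigma: float, shift: float) -> float:
    """Remap a noise level in [0, 1] with a shift factor.

    shift == 1.0 leaves the noise level unchanged; values above 1.0
    push intermediate noise levels upward, so more of the sampling
    steps are spent at high noise.
    """
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


def shifted_schedule(steps: int, shift: float) -> list:
    """Evenly spaced noise levels from 1.0 down to 0.0, then shifted."""
    sigmas = [1.0 - i / steps for i in range(steps + 1)]
    return [shift_sigma(s, shift) for s in sigmas]


if __name__ == "__main__":
    # Compare the two values mentioned in this chat (1 and 1.2) with a larger one.
    for shift in (1.0, 1.2, 3.0):
        print(shift, [round(s, 3) for s in shifted_schedule(5, shift)])
```

The endpoints stay at 1.0 and 0.0 for any shift, so only the middle of the schedule moves, which would fit the observation that small changes to this one value alter the look of the result.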

languid pebble
#

Chance to get that with SDXL or 1.5? 🙂

royal monolith
#

prompt: breakfast, photo by Tjalf Sparnaay

languid pebble
#

My friends don't understand that I am a 1-Man-Army and want to get a MTV-Ready video ^^

dusky ingot
#

Leonardo and Mona Lisa on a date

languid pebble
#

He looks like a seaman 🙂

royal monolith
#

prompt: morning light, photo by Tjalf Sparnaay

#

prompt: still life, photo by Tjalf Sparnaay

oblique ingot
#

Can you try one by William Eggleston

royal monolith
oblique ingot
#

yeah a still life or food photography

royal monolith
#

k. just a second

#

still life, by photographer "william eggleston"

#

prompt: candy stripe, bobbles, bubblegum, bubbles, butterfly, Mark ryden, Jordan Grimmer <--- ModelSamplingSD3 node set to 1.2

#

same prompt, but ModelSamplingSD3 node set to 1

#

prompt: Illustration of an old village, winding cobblestone streets,gloomy, castle-like buildings, intricate carvings, gothic architecture, otherworldliness, enchantment, midnight, full moon, mist, volumetric lighting by illustrator "Joe Sinnott", by artist "Patrick Brown"

wispy nest
royal monolith
#

running it inside ComfyUI

wispy nest
#

im running a few basic samples too using comfyui, wish i had a workflow for facedetailer

royal monolith
wispy nest
#

thanks i actually need something basic that can incorporate face detailer for human faces

languid pebble
royal monolith
#

grab a copy of ReActor perhaps?

wispy nest
languid pebble
wispy nest
#

i tried loading up that workflow btw, but shows some glitches in red

#

did node updates too but fails at face detailer

languid pebble
#

Did you get the latest version? They said something about bugfixes ...

wispy nest
#

i might have to check that

#

think i tried the one that says official

#

the base model is pretty good for images other than human figures

languid pebble
wispy nest
#

honestly this is a better result than trained sdxl for this kind of images

languid pebble
wispy nest
#

cfg 3, could go lower i think

#

steps 25

languid pebble
wispy nest
#

shockingly weird anomalies with limbs

#

surprisingly faces look fairly good for a base model, but hands and legs are fu*d up

languid pebble
wispy nest
#

well yes, very unexpected

languid pebble
#

Need to catch some food. Have a wonderful day!

wispy nest
#

i'm gonna make a wild guess and say that sai wanted to censor nudity which is why we see face/portraits look good but when it comes to figures they messed it up big time

amber arrow
#

Freepik's new AI tool is really consistent with the character's likeness. And it's blazing fast too, considering the initial image quality. I wonder how they're achieving it.

grand sapphire
#

This is so cool

#

who would've thought eh? not me

grand sapphire
#

this is so cool

mental frost
#

prompt: still life, photo by Tjalf Sparnaay

languid pebble
grim girder
#

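Hayao Miyazaki's animated works, with their distinctive art style and healing stories. The camera slowly pushes in, revealing a quiet country courtyard. Sunlight filters through the gaps between the leaves and falls on an old wooden table. On the table sits a large piece of cut watermelon, its red flesh and black seeds looking especially tempting in the sunlight. An orange cat lies lazily beside it, occasionally looking up at the camera, its eyes showing contentment and laziness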

glossy herald
#

Showing off SD3 capabilities to a friend. Who can guess his name ?

glossy herald
#

Almost. You got 0 good letters. Nice try though x) well I know not everybody seems to like it, but I'm having quite a lot of fun with that model right now

shut sinew
#

Definitely John

orchid schooner
#

Roast'n?

languid pebble
#

Naruto?

glossy herald
#

thanks for this laugh

languid pebble
#

This is impossible ... BMW! sing

crisp stream
# wispy nest the base model is pretty good for images other than human figures

Not really, from what I can tell so far. What's missing by comparison, I feel, are interesting compositions/characters/poses/exaggerations/etc. They're not completely absent, but present only to a certain extent compared with prior versions (2.0/2.1 excluded).
It also at least seems to struggle with artistic styles, even when using general terms. So it's not only artist names that don't work as well (if they work at all).
I've only done a bit of testing, yet it looks like SD3 has a tendency to coat everything in a glossy/hyper-real finish combined with realistic depiction tendencies, lacking a more natural appearance.
8B could potentially be different, because the images I did see looked at least more interesting (compositionally as well), for example with a pleasant, here-and-there slightly painterly cinematic touch. I also saw a more illustrative, painterly example comparing a prompt in MJ and 8B. Then again, it's only been a few images I could look at so far, let alone play with here, so I couldn't tell exactly how it behaves and what is potentially possible in that regard, or in terms of natural appearances.

Here for demonstrating what I´m talking about:

abstract surreal expressionistic,expressionism,painted

Stable Cascade:

#

SD3 2B:

wispy nest
crisp stream
wispy nest
crisp stream
languid pebble
#

Leonardo has launched a pretty cool new model ... no idea on what it's based ...

wispy nest
#

and i can see why a lot of ppl are pissed

languid pebble
#

It's been almost impossible to redo those pictures they showed us ...

wispy nest
#

exactly

crisp stream
topaz musk
#

8b is only for enterprise

crisp stream
languid pebble
topaz musk
topaz musk
#

No. They clearly said it's for enterprise. And even most companies can't afford it.

#

Have a friend working in an org who said that spoke to stability team, and this was their stance

crisp stream
#

well, what to say 😄 Hoping for a proper 6B model then 😄

languid pebble
#

I like this one ...

#

Pretty creative ...

crisp stream
languid pebble
#

Money - the root of all evil ... ^^

crisp stream
languid pebble
crisp stream
#

🙂

#

btw have you checked on artistic styles with SD3?

#

@languid pebble

languid pebble
# crisp stream 🙂

Not really ... did 20 of my old prompts, 4 pictures each ... maybe 2 I want to keep ...

#

Looks like even Glif generated better pictures. But I am not really used to StableSwarm ...

#

And as I said ... old prompts aren't the best idea ...

crisp stream
# languid pebble It's a pretty simple phrase ... sure ...

And btw, at least in my eyes there are better-suited ways of generating income, like, let's say, selling a model for commercial use on a fair basis. When I suggested this elsewhere, the reply was roughly: "but then it will be pirated", which doesn't really matter when it's available for free for non-commercial use anyway. And even if it were behind a paywall, that's what every software company has to deal with, also in terms of possible protection (if required). Though in that case the open source background would be gone anyway.

languid pebble
#

Some nice details but what's the thing on the bottom right for example?

crisp stream
languid pebble
crisp stream
crisp stream
#

possibly even PS 🙂

languid pebble
#

Well it just doesn't make any sense ... so I don't want it ... SD3 can do inpainting?

#

I'm still relaxed but wondering about some things ...

crisp stream
languid pebble
crisp stream
crisp stream
#

yet can go for one at least in terms of AI 🙂

#

Could be a kendo stick as well 😄

languid pebble
#

I have no idea ... but looks like it's flying ^^

deft bison
crisp stream
#

Cascade

#

Cascade with input image:

#

Input Image (SDXL):

languid pebble
elfin cairn
#

many many images

crisp stream
languid pebble
crisp stream
vale marlin
#

Okay that's weird

royal monolith
#

go run that through lumalabs

vale marlin
royal monolith
#

but that should animate really well in LumaLabs

crisp stream
#

Cascade

royal monolith
vale marlin
#

that's my max

royal monolith
#

i'm still walking through the settings

#

never let your watercolor paper get wet

#

also, paint dries out, keep your brush wet

crisp stream
# royal monolith

hmmm, it doesn't feel really convincing with regard to the reference, to be honest. And btw I'm not saying it cannot produce any sort of flat appearance, yet it simply seems to add its style on top, basically either a photorealistic or the mentioned 3D-finish look. Like I said, when using input images in Cascade or SDXL (or even SD1.4 for that matter) they neutrally transport the original in that sense, while SD3 seems to add a finish, if you will

royal monolith
#

i'm not trying to make something that looks identical to your image, to me it's a watercolor painting with tempera paint on cardboard

#

there are a lot of images that i've posted with paper surfaces and paint on them

deft bison
vale marlin
#

that's it

#

i think it's weird

pure monolith
#

hmm, pretty close.

#

maybe less gruesome. that ladybug dont look too well. 6/10 for prompt adherence.

vale marlin
crisp stream
# royal monolith i'm not trying to make something that looks identical to your image, to me it's ...

yes, like I said, I'm not saying it cannot produce anything flat-looking, yet here it usually doesn't so far, unlike prior models. I'm currently testing some, and I think it's simply a lack of training material: a lot of stuff looks like it was merely a photo of the original (even though it could have other reasons), and I'm seeing a lot of those paintings on a canvas with a relief structure. Btw here you can see this added effect as well:

SDXL

#

btw I'm currently testing da Vinci/Bosch and the like, where it seems to be pretty flat indeed, which is why I think it might be the training material/captions

royal monolith
vale marlin
#

yeah we definitely don't know how to use SD3 perfectly

pallid ruin
vale marlin
crisp stream
pure monolith
#

@royal monolith you mean flat color and flat shading?

royal monolith
vale marlin
crisp stream
royal monolith
deft bison
crisp stream
royal monolith
# vale marlin wow

ROFL! well now, that could turn into anything. yeah. that function is critical to the look of the final result. don't change anything, lock the seed, and just start changing that one node one decimal point at a time.
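The "lock the seed and change one value at a time" approach can be written down as a small sweep. This is only a sketch: the dictionary keys and the seed value are made up for illustration and do not correspond to any real ComfyUI API; the sampler, scheduler, steps and cfg values are the ones mentioned elsewhere in this chat.

```python
def sweep_shift(base_settings: dict, start: float, stop: float, step: float = 0.1) -> list:
    """Return one settings dict per shift value; everything else stays fixed."""
    count = round((stop - start) / step)
    runs = []
    for i in range(count + 1):
        run = dict(base_settings)  # copy, so the seed and other settings are shared unchanged
        run["shift"] = round(start + i * step, 2)
        runs.append(run)
    return runs


if __name__ == "__main__":
    base = {
        "seed": 1234,  # hypothetical fixed seed
        "steps": 25,
        "cfg": 3.0,
        "sampler": "uni_pc",
        "scheduler": "sgm_uniform",
    }
    for run in sweep_shift(base, 1.0, 1.5):
        print(run)
```

Each resulting entry would be rendered once, so any difference between the images comes from the shift value alone.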

vale marlin
royal monolith
vale marlin
#

I've mostly heard that from Russians

blissful vale
#

Has anyone tried Luma AI? It's so interesting...

steel apex
wispy nest
#

sd3 ... human figure is out of the question.

royal monolith
#

@proud dagger i tried that prompt you suggested - it doesn't work very well

proud dagger
#

it's mostly a valid image for the prompt just not great anatomy

#

a regular quality issue vs the horrid demons being shared around on reddit

royal monolith
#

and what happens if i take it out of the ( ) and put it in [ ] instead

proud dagger
# royal monolith true 🙂 now i want to know what it is that adding :0.5 to that prompt is doing.

that downweights the prompt. My working theory is that something's broken in the guidance causing it to explode - on the older unet models we had explicitly separated cross-attention (text guided) and self-attention (unguided), often the cross attention made mistakes and self attention fixed them. SD3's arch is different, it doesn't have that separation, so I think downweighting the prompt is letting the model do the equivalent of empowering its self-attention to outrank the prompt guidance
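For anyone unsure what the `(text:0.5)` notation looks like to the software, here is a simplified sketch of reading `(text:weight)` spans out of a prompt. It is an illustration only: `parse_weighted_prompt` is a hypothetical helper, it ignores nested or doubled parentheses like `((...))` and the `[ ]` form, and it does not show how a UI then applies the weight to the text conditioning.

```python
import re

# Matches "(some text:1.15)" with no nested parentheses inside.
_WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_weighted_prompt(prompt: str) -> list:
    """Split a prompt into (text, weight) pairs; unweighted text gets 1.0."""
    parts = []
    pos = 0
    for match in _WEIGHTED.finditer(prompt):
        plain = prompt[pos:match.start()].strip(" ,")
        if plain:
            parts.append((plain, 1.0))
        parts.append((match.group(1).strip(), float(match.group(2))))
        pos = match.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, 1.0))
    return parts


if __name__ == "__main__":
    print(parse_weighted_prompt("a werewolf queen, (photo:1.15), (film grain:0.5)"))
```

A weight below 1.0, such as 0.5, marks that span as something the model should follow less strongly, which is the downweighting described above.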

proud dagger
royal monolith
#

what is it specifically affecting, why, and how

proud dagger
royal monolith
#

@proud dagger ummm

cold inlet
#

but that begs the question

#

who wins?

royal monolith
wispy nest
proud dagger
proud dagger
wispy nest
proud dagger
#

This SD3-Medium model is awesome at a lot of things but yeah has some severe shortcomings we didn't realize in advance

wispy nest
#

some of the active members of SAI were posting amazing pics on twitter btw. What happened?

proud dagger
#

(note that SAI staff posting might sometimes be the upcoming big 8B)

wispy nest
#

i hope you are right when you say big one is still coming. cause this 2B medium variant looks like it still needs rigorous training

royal monolith
wispy nest
#

you'd be surprised if you compared sdxl base with sd3 base

royal monolith
#

it still can't do elephants correctly either, and tails are still a huge issue - cat and dog tails wind up in strange positions or disconnected

tired basin
#

Laying on a car bonnet looks like the result of a car crash too

royal monolith
wispy nest
#

there is a funny plot twist ... i never thought i'd be using comfyui, but now that i've put some hours behind this, im starting to like it with reasonably decent control over it, this wouldn't have happened if it wasn't for sd3 lol

#

btw this is sdxl base 1.0 that i just rendered now, sd3 dont stand a chance next to it

wispy nest
#

data on img using comfy

royal monolith
wispy nest
#

prompt doesn't matter, im showing the differences in sdxl and sd3

royal monolith
tired basin
wispy nest
#

i chose both base models

tired basin
#

I say you as SAI not you personally*

wispy nest
#

you guys can have a healthy discussion about it and probably persuade a better decision making protocol, so far SAI has done some nice work w/ txt2img, i dont think people will stop using sd15 or sdxl

#

but this sd3 2b version needs to go back to the drawing board

royal monolith
royal monolith
royal monolith
proud dagger
wispy nest
#

this sd3 version is incapable of prompt coherence and would forcefully give you glitchy outputs

proud dagger
tired basin
wispy nest
proud dagger
royal monolith
proud dagger
#

2B is much faster to train so we wanted to get that pushed ahead first

wispy nest
#

some of the cooler images with sd3 but not human figure

royal monolith
#

i'd like to see what it creates

wispy nest
royal monolith
wispy nest
#

i hope you have eyes open to see the weirdness

royal monolith
#

prompt She waits in the waves, hair flowing in the sun, radiant and smiling softly

royal monolith
wispy nest
#

take a good look at the first one

#

if you dont see anything wrong i dunno what to tell you

royal monolith
wispy nest
#

which prompt ?

#

but why do you need me to run a prompt

tired basin
royal monolith
# wispy nest which prompt ?

give SDXL this prompt: a red cat on the left, a green dog on the right, in the middle is a blue cube with a pyramid on it
i'd like to see what it creates

wispy nest
#

i have views to share in defense of sd3 and ethical pov ... but sd3 2b hasn't been perfected as it is, also falling short in comparison with its predecessors

royal monolith
wispy nest
royal monolith
proud dagger
wispy nest
proud dagger
#

but it's awesome in a lot of ways that prior models were not as awesome at

royal monolith
wispy nest
royal monolith
wispy nest
#

simple prompts cant do it

#

but given you focus on the result its easily doable

royal monolith
#

that is prompt coherence

wispy nest
#

that's good for what it can do but why are you evading the fact how terrible it generates human form?

tired basin
#

To be fair even at a zero shot prompt, theres 10 billion seeds out there, sdxl will be capable of it, just may need longer to seed hunt. Doesnt mean it cant do it though.

Also its kind of moot as the main argument is not about prompt adherence and more about the fact that the woman on the grass has a bad case of leprosy

wispy nest
#

another sdxl base 1.0

royal monolith
wispy nest
#

just a lazy prompt for a portrait of a blonde woman

wispy nest
#

of all the human images i've generated with it, almost 90% of them were trash

royal monolith
wispy nest
#

i havent

#

and i dunno what you mean by watching this channel

#

i've been prompting

royal monolith
wispy nest
#

do you mean there are some rudimentaries to prompting on sd3?

royal monolith
wispy nest
#

im using flat tone

#

oil painting too

#

and its a base model

royal monolith
wispy nest
#

bring on the hands im not talking about portraits

#

i get good faces with sd3 but not hands and feet

royal monolith
wispy nest
#

i still dont understand how you miss the point

royal monolith
wispy nest
#

i have generated tons of nsfw with hands legs on sdxl and they came out great

#

you are defending a flaw as i can see

#

all im saying sd3 is terrible at human form

#

if you have no desire to correct that you are pushing it into a corner

royal monolith
# wispy nest you are defending a flaw as i can see

what I've watched you do is look at images and say how bad they look simply because you assume they are SD 3. but just like the anti-ai art haters, you really don't know what you're looking at or what created it.

wispy nest
#

you are making arguments in defense of something that needs rectification

#

are you fine with how sd3 is now?

#

that was my whole point, either you see it or you dont

hazy warren
#

My first SD3 image!!!!

wispy nest
hazy warren
wispy nest
#

let me take a crack at it too with sdxl base

#

no cherry picking

hazy warren
wispy nest
#

quality in terms of image tone is fine but the render itself is botched up

hazy warren
wispy nest
#

sure

hazy warren
#

idk why SD3 always wants to show breasts

wispy nest
#

but it's not related to your workflow in most situations, it's a glitch in the training data

hazy warren
hazy warren
wispy nest
#

ive used sdxl with tags and sentence too

hazy warren
#

I was pretty hyped about that one

wispy nest
#

i like tagging more

#

easy to get the ideas through

hazy warren
#

go on

wispy nest
#

and the models that are fine tuned listen well

hazy warren
wispy nest
royal monolith
#

To the point that a photo could be posted, and if you assumed it was sd3, you'd find issues

wispy nest
#

and if you are seriously arguing that sd3 is fine as it is .. your arguments are logically invalid

wispy nest
#

this is also sd3

#

i think she is a victim of a nuclear holocaust

languid pebble
#

Good morning!

late sorrel
#

hallo

#

guten tak

#

or is it tag? i never remember

languid pebble
late sorrel
#

ah 🙂

#

il try to remember it

languid pebble
#

Thanks 🙂 Trying to start every day with a good morning coffee picture 😄

late sorrel
#

i actually never noticed that haha

#

nice

languid pebble
#

It's more a twitter thingy ... and kinda competition. When will I run out of ideas? Created more than 150 and trying to create a big mosaic ...

wispy nest
#

using oil painting purposefully for the cool effect

languid pebble
#

I think we are getting closer to what we wanted to reach 🙂

sterile kiln
#

Hey guys, SD 3 works with A1111 ?

wispy nest
#

i feel inclined to think once the model is fine tuned there won't be a need for adetailer / face detailer

quiet current
languid pebble
wispy nest
late sorrel
wispy nest
#

lol

#

it took me almost a year to finally switch to comfy btw

languid pebble
wispy nest
late sorrel
#

a1111 is for kids and comfy is for grown men 🙂

languid pebble
#

I like to stay a kid 😛

late sorrel
#

hehe

wispy nest
#

they are both good, but the reason im enjoying comfy a lot now is cause i finally figured out how to manage all that wiring

late sorrel
#

its really not that hard

languid pebble
#

A1111 is good for quick results and an easy manual workflow ...

wispy nest
#

when i first tried comfyui the app didnt have a default template lol but now that it has a load button for the default workflow its a lot easier to get started

languid pebble
late sorrel
#

nice

wispy nest
languid pebble
#

Learning Comfy is always a benefit cause you learn to understand the A.I. more ...

#

Thanks 🙂

wispy nest
languid pebble
#

It's really a genius tool

wispy nest
languid pebble
#

I have only learned Turbo Pascal 😄

wispy nest
#

mid journey like effect with water color

languid pebble
#

BTW I've got the idea that SD3 likes to add signatures ...

wispy nest
#

altho i have text, watermark in negative but it still shows up

languid pebble
#

I have a slight suspicion that it might be parts of the prompt the A.I. no longer understands the way SDXL did

wispy nest
#

i was able to negate watermark in sdxl but when it comes to anime its hit and miss sometimes, but sd3 is a bit more persistent with watermarks

languid pebble
#

You could add "Signature" no idea whether it will help 🙂

wispy nest
#

yeah good point

#

clean...

#

little horror

sterile kiln
#

HAHA xD

languid pebble
#

Kim Dot Com?

#

Take the photo! I need to pee!!!

sterile kiln
#

but "mangecouilles" means something like "balls eater"

languid pebble
#

Kim could be more a sausage eater nowadays ... not sure ^^

languid pebble
steep sleet
#

These two are bewitchingly beautiful (SD3 into low-noise SDXL+LoRAs)

languid pebble
#

Some call it: Evolution ... ^^

steep sleet
languid pebble
#

6 is the new 5 ^^

sterile kiln
languid pebble
#

Arm looks a bit strange ...

#

Made by Frankenstein?

wispy nest
languid pebble
#

This didn't turn out as expected ...

halcyon junco
#

1

steep sleet
sterile kiln
grand sapphire
#

dayum, sd3 looking real good

wispy nest
#

If you are on the correct amount of drugs, that looks normal.

royal monolith
# wispy nest and if you are seriously arguing that sd3 is fine as it is .. your arguments are...

i told you what I was saying. you keep either ignoring it, or twisting it. Alex told you that the version of SD 3 you are using is unfinished, the company is well aware there are issues and are working on them. you ignored that, too. However i see that you've continued to try to learn the product and i'm hopeful that after working with it for a while longer you'll start to find the method that works for you to achieve the sort of results you wish to have

wispy nest
glossy herald
# royal monolith i told you what I was saying. you keep either ignoring it, or twisting it. Alex ...

even as it is, with the bias and other failings like anatomy, I think the prompt adherence and the text capabilities make up for it, and are worth investing time in understanding the model prompting system, while waiting for a 3.1 or for finetunes that do tackle those failings.
But, having been around for quite some time: every new model got this exact community response when it came out. Time helps, people getting better with the model helps, finetuning and other methods bringing quality back up help too. but it's hard to keep the community happy in those times, or to just keep a level head about what a base model is and isn't, when it has just come out.
Arguing isn't the best solution either, just keep on bringing some good pics, training on the new model and having fun as a community, this is the best any of us can do around

wispy nest
#

and i have no interest to converse for the sake of drama

glossy herald
#

I don't argue on the weak and strong points, I'm aware of those failings too. like I said, I just feel the strong points do win over the weak points, and are worth investing time into understanding the model more while it keeps on getting better.
I didn't answer you directly, so no interest in drama either don't worry 🙂 I even said to the person you were talking to that arguing was useless

fluid basin
wispy nest
glossy herald
# wispy nest im optimistic about training it, im also curious about 8b variant

the few things I tried, it seemed to me that training wasn't working correctly yet, or that I didn't understand something. I'll try again later on that front : this is the big thing that will bring 3.0 to the top. Right now, it fails on some key concepts for me too.
8b variant seems fun, but I won't be able to train it locally so I'm more interested in that 2b we currently have. I need to tinker more in my diffusers script :p

wispy nest
glossy herald
#

the thing I'm spending time on today, while the diffusers training gets sorted out, is the prompting. We do have 3 text encoders now, that do interpret the prompt differently, and this is quite new. Also, the model shift parameter. Understanding those new parameters, how to use those for real, what each of them does exactly, ... this can already be quite useful to get better 3.0 results, as well as train it better once we can do that properly

wispy nest
#

in worst case scenario and what im hearing from others, best bet might be to train 8b model if resources allow

glossy herald
#

In the medium/long term, 8b training will be better for sure. but even at first, I do feel that training 2b will be faster, and teach us more about how the model reacts, how the prompts should be built for optimal result, and then use those things we learned to train better on 8b

#

just from what I learned when I trained a lot 1.5 and 2.1, each model does react differently to the prompt format

wispy nest
#

yeah agree on your approach

#

the reason i mention 8b over 2b model is that there could be technical bumps/limits with 2b model

glossy herald
#

true. but are we sure the 8b weights will be public at some point ?

wispy nest
#

no idea honestly

glossy herald
#

I mean, I hope they will, but while I don't know, I intend to learn the most I can, and have fun with it. I do admit, I'm still a child having fun when talking about training, this is still so cool and addictive to me, even after so many models x)

wispy nest
#

this 2b base model is quite capable with various things, if human figure gets sorted then community will love using it

glossy herald
#

human anatomy needs to get sorted out, yep. this is the main criticism I hear and see about the model, and it's a problem

#

especially when we see it being so good at other points

wispy nest
glossy herald
#

asian bias is quite high too in my experience

wispy nest
#

the human figure...

#

i didnt see much of that bias tho

glossy herald
#

well, it may just be me, I didn't make enough pictures to really 100% confirm it, but if I prompt "a woman" and don't specify origin, I get an asian woman most of the time, in the samples I have

wispy nest
#

you mean with this sd3?

#

i got all these while playing with it... i didnt specify any ethnicity, no asian bias with these

glossy herald
#

I didn't use a "save" node, and mostly previewed my pictures, plus I didn't play a lot with anatomy except for that "girl laying on grass challenge" people seemed to run yesterday ^^ it was mostly my impression from my small sample of results. It's the problem with small samples though, not representative. I'm happy yours doesn't seem to present that bias though

wispy nest
#

oh well i have had terrible anomalies with human figures too, but no asian bias

#

this is the very first asian render with sd3 but on purpose.

#

i wonder if you have noticed this, we dont need much facial correction or at all with this model

#

and its a base model

thin fog
#

anyone know what the normal speed for generating images with sdxl is with a 4080, it takes around 45 seconds to generate 1 image and i want to know if thats normal?

wispy nest
thin fog
#

what does end at step mean

#

why are there 2 values

wispy nest
#

not sure, i dont use those in my workflow

thin fog
#

can i copy your workflow thing?

wispy nest
#

for sdxl?

thin fog
#

yeah

wispy nest
glossy herald
#

200 steps seem quite high !

thin fog
glossy herald
#

try torqx template, but I usually make pictures in 20 to 40 steps, depending on the model and resolution

wispy nest
#

yeah 20-40 is ideal

thin fog
#

oh ic

thin fog
wispy nest
#

check with missing nodes in case you dont have them already installed via comfy manager, but those are essentials

#

sd3

thin fog
# wispy nest

what are these ? am not able to generate with ur template

glossy herald
#

for example, this is a comparison done here for step numbers : #🍥|anime message
The more steps, the more refined an image gets, but it has diminishing returns

wispy nest
#

those are face detailer to correct facial defect, you should have them for general purpose, you can update them via comfy manager

wispy nest
royal monolith
glossy herald
#

I did a lot of other responses in a more helpful format, this one wasn't helpful I'll give you that. Got tired after a while of the same complaints when there are lots of other positives to use.
Yeah, always the same when anything releases, people for it, people against it, life

wispy nest
thin fog
wispy nest
thin fog
wispy nest
#

check with the ultralytics models

#

they are under Install Models once you click manager

#

get all the face models

#

im not in a good position to troubleshoot comfy plugins, i only picked up comfy yesterday when sd3 came out

#

but they should be easy to fix

thin fog
#

i got all the ultralytics models but for some reason it still has the red highlight

#

what does it mean tho

wispy nest
#

something missing

#

but do you get any error messages?

thin fog
#

huh weird now it doesnt have a red highlight

wispy nest
#

you would have had to restart after updates

thin fog
#

oh aight imma do that

wispy nest
thin fog
#

hm i try to generate landscape but it looks low quality, is the model not great at landscapes? or is my prompting just bad

wispy nest
#

looks washed out

thin fog
#

yeah

wispy nest
#

which checkpoint are you using?

thin fog
#

just using the normal sdxl

#

the base 1.0

#

should i get a different sdxl

wispy nest
#

ahh ok, base model is kind of less vibrant

thin fog
#

wth is pony

wispy nest
#

this was the result with sdxl base i was testing out to compare anomaly with sd3

thin fog
wispy nest
#

you should get one of fine tuned sdxl

orchid schooner
#

I got some good landscape results with base SDXL, maybe play with settings n prompts more?

thin fog
#

should i use the refiner?

orchid schooner
#

I only have 1 on me, could find more when I am home. It is very stylized

thin fog
orchid schooner
#

here's the 1

thin fog
orchid schooner
#

at least I thought it was good lol, but that's subjective

thin fog
#

yeah this looks kinda blurry still, ur images look sharper i think

limpid lichen
#

Those foreground wild flowers are crazy. i love that new vae

#

is that sd3?

snow cove
#

i dont expect a shit prompt to be good, i however expect the new SD3 to not do Cthulhu. And i expect the difference between two boys on the beach and two girls on the beach to not be so stark.

#

boys have meh hands. but girls have A FUCKING LEGFOOT

#

Can you PLEASE censor nsfw without making women look like aliens ? agony this is accidental misogyny on German level.

thin fog
#

which sd3 should i get?

#

should i download the 15.8 gb one

thin fog
limpid lichen
#

usually when i get many colors in wild flowers like that, they all smash together and gradient into one another

thin fog
#

well i used the stunning landscape prompt

#

i remember i did stunning landscape prompt before on the sdxl prerelease i think it was called clip drop and the output was completely different from what im getting now

#

hm weird they look painterly when i did it before it was more photographic

#

maybe its cuz of the face models

languid pebble
wispy nest
deft bison
#

the terrible cataclysms of the last days of humanity on Earth...

grand sapphire
#

@worthy sonnet haha

pallid ruin
# thin fog the base 1.0

Aaa 1.0 is not that good, I recommend you try leosamsHelloworldXL_helloworldXL7.0 , it's the best one I tried

sterile kiln
#

we agree that SD 3 is anti-celebrity, don't we?
It's absolutely stupid, pointless and very frustrating. Yet another reason to stay at SDXL

slate garden
#

beautiful blonde in slim bath suit with rainbow socks just off the beach

pallid ruin
crisp stream
languid pebble
#

No Justin Bieber? habby

royal monolith
sterile kiln
crisp stream
royal monolith
worthy sonnet
royal monolith
pallid ruin
#

I'm sure that with enough data it can be finetuned to "fix" everything it can't do, or at least many of the things it can't do

royal monolith
#

if you mean can people train SD 3 models and LoRAs, i know of two that are doing that right now

sterile kiln
#

Soon we'll have tools like Kohya_SS compatible with SD3, and that should be even better. But if checkpoints are so bad with anatomy, more than a few people will be disappointed 😏

royal monolith
pallid ruin
sterile kiln
worthy sonnet
sterile kiln
crisp stream
royal monolith
royal monolith
pallid ruin
pallid ruin
#

@meager geyser

crisp stream
crisp stream
royal monolith
royal monolith
mild jay
crisp stream
#

If you like, you could do it with this one:

royal monolith
crisp stream
#

or this, I don´t know:

crisp stream
crisp stream
royal monolith
#

🙂 i didn't prompt for anything, just used the image. it lets you either just do image to video, or text to video, or put a prompt in with an image to video

crisp stream
# royal monolith

This looks funny glitchy like if it was inflated 😄 Like it 🙂

#

Thank you very much for testing 🙂

royal monolith
#

🙂 free accounts are limited to 10 a day, but you could work up a really fun short over a few days if you wanted to

royal monolith
crisp stream
tough lark
#

democracies and other political regimes in the contemporary world

coarse girder
#

Hello

crisp stream
crisp stream
solid bridge
#

those are awesome @crisp stream !

crisp stream
crisp stream
royal monolith
crisp stream
royal monolith
crisp stream
royal monolith
#

i wanna see the movie you are creating

crisp stream
royal monolith
crisp stream
crisp stream
crisp stream
crisp stream
#

@solid bridge

crisp stream
#

Cheers 😀

royal monolith
#

someone's not going to sleep tonight

crisp stream
crisp stream
wispy nest
wispy nest
#

i wonder if thats sd3 tho

royal monolith
crisp stream
crisp stream
royal monolith
crisp stream
crisp stream
# wispy nest i wonder if thats sd3 tho

couldn't get anything like that from SD3 so far and haven't seen anything like it from SD3 either, which is why I'm staying on Cascade for now, in combination with the Invictus Redmond v1.1 checkpoint, in case you are interested

wispy nest
crisp stream
wispy nest
crisp stream
wispy nest
#

well the comparison graph on sdxl, cascade shows cascade runs faster and gives better outputs, something im tempted to setup on comfyui

crisp stream
wispy nest
#

are you able to create humans in photorealistic theme?

#

say something basic

crisp stream
#

gonna send the base model one as well, and for the standard way with the Redmond you can simply substitute the "load image" node with an empty latent

granite mesa
#

I've been staring at faces too long, does this look real?

crisp stream
wispy nest
wispy nest
#

downloading files .. im going for the lite 16 version since i just wanna run cascade for orientation

crisp stream
crisp stream
#

@wispy nest t2i workflow for the Redmond Checkpoint

wispy nest
# crisp stream

wont let me load the workflow, probably cause of copying the image from comfyui interface, might have to drag the actual file for it to work

crisp stream
#

i2i Workflow for the base Model (from the github examples)

wispy nest
#

ok no worries

#

im checking out the comfyuiworkflows

#

going to do something very basic... just for a feel of it

crisp stream
#

yet you won't find the Redmond Workflow there, that workflow is at Civit.ai or simply take the ones I posted 🙂

#

@wispy nest

nimble mason
crisp stream
nimble mason
#

managed to do a bit of training on sd3

crisp stream
#

Love Cascade 😄

wispy nest
#

not bad at all.. didnt use face detailer

nimble mason
#

cascade is indeed great

wispy nest
#

used oil painting for it

crisp stream
#

looks good for starters

wispy nest
#

im also impressed how little effort it took me to put this into work with comfyui

crisp stream
#

btw if anything is black and white and you don´t want it you can n-prompt colourless. Works pretty well, at least to some extent

wispy nest
#

btw i see the stageb and c go into unet folder, not checkpoint folder

crisp stream
wispy nest
#

hmm i see

#

didnt let me load c and b when i had them in checkpoint

#

i moved them into unet and it worked

#

stage a went into vae

#

i actually renamed the files appropriately where they go

cascade_textencoder.bf16
stage_a_vae
stage_b_lite_checkpoint_bf16
stage_c_lite_checkpoint_bf16

#

but checkpoint should be unet
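
A rough sketch of that folder layout as a small Python helper, in case it helps anyone else. The stage A to vae and stage B/C to unet rules are the ones from the messages above. The `clip` target for the text encoder, the `ComfyUI/models` root and the `.safetensors` extension are assumptions on my part, so adjust them to your install. It only plans the destinations and does not move anything.

```python
from pathlib import PurePosixPath
from typing import Dict, Iterable, Optional

# Marker in the file name -> ComfyUI model subfolder.
# stage_a/b/c rules come from the chat; "clip" for the text encoder is an assumption.
FOLDER_RULES = [
    ("stage_a", "vae"),
    ("stage_b", "unet"),
    ("stage_c", "unet"),
    ("textencoder", "clip"),  # assumption, not stated in the chat
]


def target_folder(filename: str) -> Optional[str]:
    """Return the model subfolder for a Cascade file, or None if unrecognised."""
    name = filename.lower()
    for marker, folder in FOLDER_RULES:
        if marker in name:
            return folder
    return None


def plan_moves(filenames: Iterable[str],
               models_root: str = "ComfyUI/models") -> Dict[str, str]:
    """Map each recognised file name to its planned destination path."""
    plan = {}
    for filename in filenames:
        folder = target_folder(filename)
        if folder is not None:
            plan[filename] = str(PurePosixPath(models_root) / folder / filename)
    return plan


if __name__ == "__main__":
    # Hypothetical file names based on the renamed files listed above.
    files = [
        "cascade_textencoder.bf16.safetensors",
        "stage_a_vae.safetensors",
        "stage_b_lite_checkpoint_bf16.safetensors",
        "stage_c_lite_checkpoint_bf16.safetensors",
    ]
    for src, dst in plan_moves(files).items():
        print(f"{src} -> {dst}")
```

Files the helper does not recognise are left out of the plan, so nothing gets filed in the wrong folder by accident.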

crisp stream
#

yeah well, don´t know. I didn´t put them in the unet folder because so far I couldn´t customize the unet folder path (got basically all checkpoints except for the Redmond one in checkpoints on an SSD while Comfy is installed on an HDD). Would have to move to the SSD at some point

crisp stream
#

anyway, main thing is you got them going

wispy nest
#

all functional

crisp stream
crisp stream
wispy nest
wispy nest
#

do you have that up on huggingface btw? cause civitai download speed sucks for me

crisp stream
wispy nest
#

ok, so im wondering right now i have 2 files going into unet folder, those are the models, but with redmond as a single checkpoint folder how does that work?

royal monolith
wispy nest
#

my speed with civitai tends to fluctuate

royal monolith
#

it's 6, almost 7 gig

wispy nest
#

usual sdxl file size

#

but with civitai throttling speed its inconvenient, huggingface on the other hand is very consistent

crisp stream
wispy nest
#

any idea how this preview section has 4 images cause when i create a preview image node i get one panel

crisp stream
wispy nest
#

ohhh nvm batch 4

#

yeah

crisp stream
#

🙂

wispy nest
#

cool stuff

crisp stream
wispy nest
crisp stream
wispy nest
crisp stream
nimble mason
royal monolith
# wispy nest ok ty

if you need the model still, i downloaded it and can stick it on my google drive

crisp stream
nimble mason
#

those facial expressions lol

crisp stream
royal monolith
crisp stream
languid pebble
#

Good morning 🙂

crisp stream
crisp stream
languid pebble
#

Was your cascade workflow included somewhere?

crisp stream
crisp stream
languid pebble
#

Thx ... something to play with 🙂

crisp stream
#

🙂

#

Very much a welcomed SD3 substitute here 😄

languid pebble
#

Hmmm... sure they are still included? Comfy don't wanna find a workflow ...

wispy nest
#

btw the redmond checkpoint needed to be in unet folder as well