#🆕｜sd3 | Stable Diffusion | Page 67

sterile pendant Jul 10, 2024, 2:17 PM

#

and it then becomes statistically signficant

scarlet wharf Jul 10, 2024, 2:17 PM

#

left is AYS kolors workflow, right original workflow by kijai. add AYS seem give more adherence on prompt

craggy crest Jul 10, 2024, 2:17 PM

#

sure. but not very useful for the individual person. the only thing that really is useful to them is whether the results they personally get are what they want or not

edgy kelp Jul 10, 2024, 2:18 PM

#

Reading this link with my tiny eyes it looked like "Im gay dot org"

alpine summit Jul 10, 2024, 2:19 PM

#

hallow lion Jul 10, 2024, 2:21 PM

#

Manual man, we need a manual. XD

sterile pendant Jul 10, 2024, 2:22 PM

#

craggy crest sure. but not very useful for the individual person. the only thing that really ...

play around on the site, you'll quickly see why it's such a powerful tool. it pits two random models against each other while only showing you the prompt. you vote L/R or tie, then it shows you afterwards which was which. the whole point is to test how flexible a model is. on the rankings page, you can click the stats button to see how a model fairs against other specific models. Oh and the prompts are random stuff people ask for (you can prompt for something as well i think, but ive never tried)

so basically, when you thrown a bunch of random prompts at a model, you get a much more realistic rating of how well the model handles various types of concepts vs the cherrypicked BS that most papers show where they have some ultra fine crafted prompt that happens to work really well with a model, since they know the data/captions that were used to train it.

alpine summit Jul 10, 2024, 2:22 PM

#

craggy crest Jul 10, 2024, 2:24 PM

#

sterile pendant play around on the site, you'll quickly see why it's such a powerful tool. it pi...

you do that on svd too - however, i'm pretty sure that while it's useful to other peopel, it's not as useful to them as just learning how to talk to the models correctly and generating themselves

sterile pendant Jul 10, 2024, 2:25 PM

#

you should see some of the random ass prompts people throw at it, it's actually kind of rare to see cliche waifus and stuff.

#

people are undereducated and the average reading/writing level is that of an eleven year old

low stone Jul 10, 2024, 2:25 PM

#

All of this is until 8b is out, so if you could get on that. 🙂

craggy crest Jul 10, 2024, 2:26 PM

#

sterile pendant people are undereducated and the average reading/writing level is that of an ele...

what does that have to do with working with the AI, learning how it thinks, so you can get what you want out of it every time? that's what is actually useful to someone.

craggy crest Jul 10, 2024, 2:27 PM

#

low stone All of this is until 8b is out, so if you could get on that. 🙂

you can expect 8b in about 5 years, longer if people continue to harp on it

low stone Jul 10, 2024, 2:27 PM

#

I'll also drop more sugar on you, I've been using your aam xl model a lot. It's really great as a general model as well, able to handle an impressive amount of concept beyond just "anime".

foggy cloak Jul 10, 2024, 2:28 PM

#

First impressions matter a lot, SD3 got off on the wrong foot

edgy kelp Jul 10, 2024, 2:28 PM

#

Imagine if they keep hyping it like the 2B and in the end it's not the same as they are using in the API but it's a neutered one

low stone Jul 10, 2024, 2:28 PM

#

craggy crest you can expect 8b in about 5 years, longer if people continue to harp on it

Hopefully in dog years. 🙂

craggy crest Jul 10, 2024, 2:29 PM

#

edgy kelp Imagine if they keep hyping it like the 2B and in the end it's not the same as t...

2b isn't neutered and it wasn't lobotomized

lavish osprey Jul 10, 2024, 2:29 PM

#

low stone I'll also drop more sugar on you, I've been using your aam xl model a lot. It's ...

Thanks, I'm happy you like it. There are also finetunes of that one now by other people

#

I like Neta Art

edgy kelp Jul 10, 2024, 2:29 PM

#

craggy crest 2b isn't neutered and it wasn't lobotomized

My message did not imply 2b is neutered

lavish osprey Jul 10, 2024, 2:30 PM

#

in general most models built on top of Animagine are pretty great

sterile pendant Jul 10, 2024, 2:31 PM

#

craggy crest what does that have to do with working with the AI, learning how it thinks, so y...

apply transductive reasoning here: dumb/undereducated people vs highly complicated tech stuff. i'm 99.999% positive that you don't actually fully understand what's happening under the hood. you likely have a gross overview understanding, but it's just the tip of the iceberg.

lavish osprey Jul 10, 2024, 2:31 PM

#

AAM XL in particular is built on top of both Animagine (indirectly) and Dreamshaper. Another reason why it's able to go Turbo without too much quality loss.

low stone Jul 10, 2024, 2:31 PM

#

lavish osprey in general most models built on top of Animagine are pretty great

Agreed, anime has a lot of dynamic camera and posing angles that break out of the standard symmetrical portrait looks.

sage burrow Jul 10, 2024, 2:31 PM

#

@lavish osprey I'm wondering if my glif (that I remixed from another) has anything built in somehow that isn't mentioned? I looked through it all, and it's only SD3 and Claude helping with prompting. But somehow the images come out better than just straight up SD3 on my own computer.
To see any settings, you can just hit remix. But more likely it's something glif has built in to just make everything better, OR, is claude the awesome and it's all about prompting?

craggy crest Jul 10, 2024, 2:32 PM

#

sterile pendant apply transductive reasoning here: dumb/undereducated people vs highly complicat...

i'm 999.99% positive that i understand exactly what is happening under the hood.

lavish osprey Jul 10, 2024, 2:32 PM

#

sage burrow <@180327464155742208> I'm wondering if my glif (that I remixed from another) ha...

Glif is using API, so SD3 Large, right? Or they allow to select SD3 Medium?

#

on your computer it's 100% sd3 medium, since we didn't release large yet.

bitter hearth Jul 10, 2024, 2:33 PM

#

Hello waow

craggy crest Jul 10, 2024, 2:33 PM

#

bitter hearth Hello <:waow:1017853838516035725>

meow

sterile pendant Jul 10, 2024, 2:34 PM

#

craggy crest i'm 999.99% positive that i understand exactly what is happening under the hood.

well i'm sure dunning-kruger would have some things to say, but i'm not going to be mean at this hour lol. regardless, i promise you're not prompting as ideally as you think you are, which means you don't actually understand "ai" as well as you think you do.

edgy kelp Jul 10, 2024, 2:35 PM

#

bitter hearth Hello <:waow:1017853838516035725>

Balls

craggy crest Jul 10, 2024, 2:35 PM

#

sterile pendant well i'm sure dunning-kruger would have some things to say, but i'm not going to...

i promise you that after spending more than 3000 hours meticulously prompting stable diffusion, changing as little as one character in a prompt, i know what i'm doing and that you do NOT know, though you think you do, my skill set. I also promise you that i'm a programmer and have probably got a deeper understanding of the code than you do.

craggy crest Jul 10, 2024, 2:36 PM

#

edgy kelp Balls

shiny

bitter hearth Jul 10, 2024, 2:36 PM

#

edgy kelp Balls

Pineapple pizza ball

uncut river Jul 10, 2024, 2:36 PM

#

promise promise
dogs that makes lots of sound
typically dont bite

edgy kelp Jul 10, 2024, 2:37 PM

#

bitter hearth Pineapple pizza ball

MFW I'm italian

craggy crest Jul 10, 2024, 2:37 PM

#

bitter hearth Pineapple pizza ball

i'm not sure if that's edible or not... where's the crust?

bitter hearth Jul 10, 2024, 2:37 PM

#

craggy crest i'm not sure if that's edible or not... where's the crust?

Inside is all crust

bitter hearth Jul 10, 2024, 2:38 PM

#

edgy kelp MFW I'm italian

Chocolate and MMs?

craggy crest Jul 10, 2024, 2:38 PM

#

bitter hearth Chocolate and MMs?

not on tomato sauce. On marshmellow, sure

edgy kelp Jul 10, 2024, 2:38 PM

#

bitter hearth Chocolate and MMs?

I'm fastly losing my health points

bitter hearth Jul 10, 2024, 2:39 PM

#

edgy kelp I'm fastly losing my health points

You'll become American after this one

edgy kelp Jul 10, 2024, 2:39 PM

#

bitter hearth You'll become American after this one

10 health points, however 200 diabeetus points (which is like the USA mana points)

sterile pendant Jul 10, 2024, 2:39 PM

#

craggy crest i promise you that after spending more than 3000 hours meticulously prompting st...

i know plenty of people that have done some kind of thing for a job for decades and are still not great at it. practice doesn't make perfect, perfect practice makes perfect. part of growing and getting better at something is accepting that you don't know it all and that there is always something to be learned. otherwise, it's just megalomania/narcissism mixed with dunning-kruger.

craggy crest Jul 10, 2024, 2:40 PM

#

sterile pendant i know plenty of people that have done some kind of thing for a job for decades ...

think what you like.

sterile pendant Jul 10, 2024, 2:40 PM

#

i will

craggy crest Jul 10, 2024, 2:40 PM

#

sterile pendant i will

and keep it to yourself

sterile pendant Jul 10, 2024, 2:40 PM

#

i wont

finite osprey Jul 10, 2024, 2:40 PM

#

sterile pendant i know plenty of people that have done some kind of thing for a job for decades ...

doing something wrong a LOT doesn't make you good 🙂

bitter hearth Jul 10, 2024, 2:40 PM

#

sterile pendant i wont

You will

edgy kelp Jul 10, 2024, 2:40 PM

#

Dunning Freddie Kruger effect

craggy crest Jul 10, 2024, 2:40 PM

#

bitter hearth Chocolate and MMs?

how about a fruit pizza sphere

bitter hearth Jul 10, 2024, 2:40 PM

#

I have no clue what you guys are on about, seems very dumb

edgy kelp Jul 10, 2024, 2:41 PM

#

bitter hearth I have no clue what you guys are on about, seems very dumb

Hilarious joke, dunno if you made it on purpose

bitter hearth Jul 10, 2024, 2:41 PM

#

Like the people I see in games arguing who is right thomas

bitter hearth Jul 10, 2024, 2:41 PM

#

edgy kelp Hilarious joke, dunno if you made it on purpose

Idk, I didn't read half of it

edgy kelp Jul 10, 2024, 2:42 PM

#

bitter hearth Idk, I didn't read half of it

I think we share the same braincell

finite osprey Jul 10, 2024, 2:42 PM

#

Sharing is caring

bitter hearth Jul 10, 2024, 2:43 PM

#

edgy kelp I think we share the same braincell

I share my balls half the time

craggy crest Jul 10, 2024, 2:43 PM

#

bitter hearth I share my balls half the time

now stick that inside a bubble

edgy kelp Jul 10, 2024, 2:43 PM

#

bitter hearth I share my balls half the time

Nice ball

#

Ball rate 10/10

bitter hearth Jul 10, 2024, 2:44 PM

#

craggy crest now stick that inside a bubble

#

thomas

edgy kelp Jul 10, 2024, 2:44 PM

#

Imagine doing this in the house of a vegan, they won't be able to eat

sterile pendant Jul 10, 2024, 2:45 PM

#

bitter hearth You will

nah, bros acting like he's dr. jenkins from starship troopers saying "its afraid..." like he can magically read an ai's mind or something and trying to hit people with the git gud spiel lol...

craggy crest Jul 10, 2024, 2:45 PM

#

sterile pendant nah, bros acting like he's dr. jenkins from starship troopers saying "its afraid...

i said learn how to talk to it. learn how it thinks. it is not that hard.

bitter hearth Jul 10, 2024, 2:45 PM

#

sterile pendant nah, bros acting like he's dr. jenkins from starship troopers saying "its afraid...

finite osprey Jul 10, 2024, 2:45 PM

#

bitter hearth

why not use the square ratio for this

sterile pendant Jul 10, 2024, 2:45 PM

#

craggy crest i said learn how to talk to it. learn how it thinks. it is not that hard.

you can't "talk" to something if you don't know how it "thinks"

craggy crest Jul 10, 2024, 2:46 PM

#

bitter hearth

what'd you do with the rest of the fish

craggy crest Jul 10, 2024, 2:46 PM

#

sterile pendant you can't "talk" to something if you don't know how it "thinks"

well we certainly know how you think. ignoring you now.

bitter hearth Jul 10, 2024, 2:46 PM

#

craggy crest what'd you do with the rest of the fish

Rolled away

craggy crest Jul 10, 2024, 2:46 PM

#

bitter hearth Rolled away

fish heads in floating spheres.

edgy kelp Jul 10, 2024, 2:47 PM

#

Fishy situation I reckon

craggy crest Jul 10, 2024, 2:47 PM

#

@bitter hearth

#

sterile pendant Jul 10, 2024, 2:48 PM

#

craggy crest well we certainly know how you think. ignoring you now.

yeah, logically and realistically... but alright man, i get it, your ego is hurt and at this point, you're probably taking this as cyberbullying or something. see how i'm "talking" to something because i can understand how it "thinks" inside?

bitter hearth Jul 10, 2024, 2:48 PM

#

#

#

waow

foggy cloak Jul 10, 2024, 2:52 PM

#

bitter hearth

How are you using SD3 with 4gb VRAM 😭

craggy crest Jul 10, 2024, 2:52 PM

#

sterile pendant yeah, logically and realistically... but alright man, i get it, your ego is hurt...

naw, just done trying to explain something to a person with a huge ego and lack of comprehension

edgy kelp Jul 10, 2024, 2:53 PM

#

foggy cloak How are you using SD3 with 4gb VRAM 😭

He's a liar (not a cat)

low stone Jul 10, 2024, 2:53 PM

#

craggy crest and keep it to yourself

There's a block option that alleviates you from certain unpleasantries. It solves a lot. 🙂

foggy cloak Jul 10, 2024, 2:53 PM

#

edgy kelp He's a liar (not a cat)

Ban that mf

finite osprey Jul 10, 2024, 2:53 PM

#

I just realized you can go negative in the positive prompt. Like I can write "None of them are outside." and it worked.
You all probably already knew that, but anyway

bitter hearth Jul 10, 2024, 2:53 PM

#

foggy cloak How are you using SD3 with 4gb VRAM 😭

By utilizing the powers of the internet im able to use the api and be lazy in my bed while my PC is crying in the corner

sage burrow Jul 10, 2024, 2:54 PM

#

lavish osprey Glif is using API, so SD3 Large, right? Or they allow to select SD3 Medium?

They don't say, so perhaps 😉

reef urchin Jul 10, 2024, 2:54 PM

#

~~Only their cat has 4gb of vram. They themselves have more than enough.~~

edgy kelp Jul 10, 2024, 2:54 PM

#

reef urchin ~~Only their cat has 4gb of vram. They themselves have more than enough.~~

Underrated joke

low stone Jul 10, 2024, 2:54 PM

#

bitter hearth I share my balls half the time

I'm always impressed by how perfectly spherical your balls are.

edgy kelp Jul 10, 2024, 2:55 PM

#

low stone I'm always impressed by how perfectly spherical your balls are.

Dude works out

lavish osprey Jul 10, 2024, 2:56 PM

#

sage burrow They don't say, so perhaps 😉

it's Large probably, They had that contract before Medium was released. Unless they switched to Mediun for the price

craggy crest Jul 10, 2024, 2:56 PM

#

low stone There's a block option that alleviates you from certain unpleasantries. It solve...

i know, the problem is that you then can't see what they are saying - and sometimes it's important that you do so

bitter hearth Jul 10, 2024, 2:56 PM

#

low stone Jul 10, 2024, 2:56 PM

#

craggy crest i know, the problem is that you then can't see what they are saying - and someti...

Does it really matter? Not really.

edgy kelp Jul 10, 2024, 2:56 PM

#

bitter hearth

Did you shave with the razor?

low stone Jul 10, 2024, 2:56 PM

#

edgy kelp Did you shave with the razor?

No razor can tame that

lavish osprey Jul 10, 2024, 2:57 PM

#

edgy kelp Jul 10, 2024, 2:57 PM

#

low stone No razor can tame that

That's the aftermath of razor shaving (some weeks later)

bitter hearth Jul 10, 2024, 2:57 PM

#

edgy kelp Did you shave with the razor?

That's on a cold day

low stone Jul 10, 2024, 2:57 PM

#

edgy kelp That's the aftermath of razor shaving (some weeks later)

Clearly some shards of razor have been left behind.

lavish osprey Jul 10, 2024, 2:57 PM

#

#

#

bitter hearth Jul 10, 2024, 2:58 PM

#

Booba

edgy kelp Jul 10, 2024, 2:58 PM

#

bitter hearth That's on a cold day

Can you output a ball lying on grass?

lavish osprey Jul 10, 2024, 2:58 PM

#

bitter hearth Jul 10, 2024, 2:59 PM

#

edgy kelp Can you output a ball lying on grass?

lavish osprey Jul 10, 2024, 2:59 PM

#

lavish osprey Jul 10, 2024, 3:00 PM

#

bitter hearth

is it bad that I can tell this is 100% fireworks sd3 8b?

bitter hearth Jul 10, 2024, 3:00 PM

#

thomas I dunno

craggy crest Jul 10, 2024, 3:00 PM

#

lavish osprey Jul 10, 2024, 3:01 PM

#

bitter hearth Jul 10, 2024, 3:01 PM

#

Try it on 2b
My prompt is "A shining metallic ball with 2 arms open lying on grass"

lavish osprey Jul 10, 2024, 3:01 PM

#

low stone Jul 10, 2024, 3:01 PM

#

lavish osprey Jul 10, 2024, 3:01 PM

#

#

craggy crest Jul 10, 2024, 3:02 PM

#

low stone

when you've spent too much time laying in the grass

edgy kelp Jul 10, 2024, 3:02 PM

#

bitter hearth

Science has gone too far

bitter hearth Jul 10, 2024, 3:02 PM

#

thomas

edgy kelp Jul 10, 2024, 3:02 PM

#

Damn

bitter hearth Jul 10, 2024, 3:03 PM

#

runs away

edgy kelp Jul 10, 2024, 3:04 PM

#

New York Times Opinion: Is This Too Suggestive For SAI?

low stone Jul 10, 2024, 3:04 PM

#

uncut river Jul 10, 2024, 3:06 PM

#

lol! does this count ... ?

bitter hearth Jul 10, 2024, 3:06 PM

#

uncut river lol! does this count ... ?

Clearly underhanded methods were used

edgy kelp Jul 10, 2024, 3:07 PM

#

Because hands are under

craggy crest Jul 10, 2024, 3:08 PM

#

uncut river lol! does this count ... ?

photograph of a woman leaning sideways on a grass wall?

edgy kelp Jul 10, 2024, 3:08 PM

#

That grass looks like a... yeah lol I was beat to that

low stone Jul 10, 2024, 3:08 PM

#

8b

craggy crest Jul 10, 2024, 3:08 PM

#

bitter hearth Jul 10, 2024, 3:08 PM

#

How sd learned hands: it grows them

uncut river Jul 10, 2024, 3:09 PM

#

the side view made me rotate the image, was more hoping for a front view with flat solid green wall of grass

edgy kelp Jul 10, 2024, 3:09 PM

#

bitter hearth How sd learned hands: it grows them

Extra fingers are, in fact, fruits

craggy crest Jul 10, 2024, 3:09 PM

#

uncut river the side view made me rotate the image, was more hoping for a front view with fl...

try adding "front view" to your prompt then

low stone Jul 10, 2024, 3:10 PM

#

edgy kelp Jul 10, 2024, 3:10 PM

#

Imagine a mummy during that covid toilet rolls shortage

bitter hearth Jul 10, 2024, 3:10 PM

#

low stone Jul 10, 2024, 3:12 PM

#

edgy kelp Imagine a mummy during that covid toilet rolls shortage

Kolors did a good job too. Not sure I can think of an idea for that one. Maybe in the aisle with a barren shelves with hands raised to the sky yelling why

edgy kelp Jul 10, 2024, 3:13 PM

#

low stone Kolors did a good job too. Not sure I can think of an idea for that one. Maybe i...

That's nice, maybe with a face mask would be more understandable though (I guess)

bitter hearth Jul 10, 2024, 3:14 PM

#

#

Mummy ball?

low stone Jul 10, 2024, 3:16 PM

#

Neither sd3 8b nor kolors is understanding "barren shelves". It's giving me the opposite. Maybe he's angry about too many choices.

edgy kelp Jul 10, 2024, 3:17 PM

#

100% sure OpenAI will make a special denoiser (or call it what you like) for AI generate images and train the next Dall-E on insane AI generated images but will be sure to edit every one with some automation in order to avoid artifacts

bitter hearth Jul 10, 2024, 3:17 PM

#

low stone Neither sd3 8b nor kolors is understanding "barren shelves". It's giving me the ...

Try empty shelves

sage burrow Jul 10, 2024, 3:17 PM

#

foggy cloak How are you using SD3 with 4gb VRAM 😭

Glif? Or, it costs a a subscription fee to change your username, could be an old one lol

bitter hearth Jul 10, 2024, 3:18 PM

#

sadcat

#

low stone Jul 10, 2024, 3:19 PM

#

bitter hearth Try empty shelves

Out of a lot of attempts, managed to get a single one with "empty" 🙂

edgy kelp Jul 10, 2024, 3:19 PM

#

bitter hearth

That's my local supermarket after I bought all the toilet paper (I have a hyperactive gut)

sage burrow Jul 10, 2024, 3:21 PM

#

low stone Neither sd3 8b nor kolors is understanding "barren shelves". It's giving me the ...

Ref image maybe?

low stone Jul 10, 2024, 3:22 PM

#

sage burrow Ref image maybe?

That exceeds my laziness quotient.

edgy kelp Jul 10, 2024, 3:23 PM

#

BRB gotta photograph myself in mummy bendages while I show despair in an empty supermarket (I have the budget)

craggy crest Jul 10, 2024, 3:24 PM

#

low stone Neither sd3 8b nor kolors is understanding "barren shelves". It's giving me the ...

try "empty shelves"

bitter hearth Jul 10, 2024, 3:25 PM

#

edgy kelp Jul 10, 2024, 3:25 PM

#

Empty shelves is a banned phrase from the SAI datasets because it's too naughty

low stone Jul 10, 2024, 3:27 PM

#

edgy kelp BRB gotta photograph myself in mummy bendages while I show despair in an empty s...

I smell a Lora. Or is that something else?

edgy kelp Jul 10, 2024, 3:27 PM

#

low stone I smell a Lora. Or is that something else?

No I was joking

mortal mesa Jul 10, 2024, 3:28 PM

#

govt intervention, cant show empty shelves

low stone Jul 10, 2024, 3:28 PM

#

edgy kelp No I was joking

I know, but a dedicated Lora of people shouting why at the skies in grocery stores would probably find a niche market on civit

bitter hearth Jul 10, 2024, 3:28 PM

#

edgy kelp Jul 10, 2024, 3:28 PM

#

low stone I know, but a dedicated Lora of people shouting why at the skies in grocery stor...

Gotta make a dataset with Dall-E 3 and then train a lora on it I guess

mortal mesa Jul 10, 2024, 3:29 PM

#

go to walmart and take pictures for your dataset

low stone Jul 10, 2024, 3:30 PM

#

edgy kelp Gotta make a dataset with Dall-E 3 and then train a lora on it I guess

Wow even dalle couldn't do it

edgy kelp Jul 10, 2024, 3:30 PM

#

mortal mesa go to walmart and take pictures for your dataset

I'd rather not pose kneeling down with a desperate expression to be photographed

mortal mesa Jul 10, 2024, 3:30 PM

#

2024-07-10-112215-SD3_sd3_medium.safetensors_00001_.png

bitter hearth Jul 10, 2024, 3:35 PM

#

uncut river Jul 10, 2024, 3:46 PM

#

so ... real life photo model for clothing branding, seems an easy task for ai.

custom desing NOEDEL brand outfits. not for sale yet!

alpine summit Jul 10, 2024, 3:53 PM

#

uncut river Jul 10, 2024, 3:59 PM

#

Off-grey stretchy combi outfit, top with half-long stretch skirt. Good for both parties or casual activities. Not for sale yet. Noedel brand.

#

sleeves not included

alpine summit Jul 10, 2024, 4:02 PM

#

uncut river Jul 10, 2024, 4:12 PM

#

Hey Goo Goo Gage, I hope you get banned for life

craggy crest Jul 10, 2024, 4:16 PM

#

y'all are such good friends

uncut river Jul 10, 2024, 4:17 PM

#

well crystalwizard, I certainly hope you missed that video which has been removed. was there for too long.

sage burrow Jul 10, 2024, 4:18 PM

#

So how come Ella over the various Claude or Ollama ones?

uncut river Jul 10, 2024, 4:21 PM

#

I don't care about the hands, but how can I make SD3 stop using double ll ? It's Noedel, not Noedell

craggy crest Jul 10, 2024, 4:24 PM

#

uncut river I don't care about the hands, but how can I make SD3 stop using double ll ? It's...

photoshop it!

uncut river Jul 10, 2024, 4:24 PM

#

no, sd3 should do it. back in the days using sd15 i first generated and image, then photoshopped the text over it, then do a refinement with img2img

#

SD3 should do text better, it's not like I'm asking for a full poem in correct layout

#

maybe I should ...

#

btw, this is my character Caitlin. Always wearing custom designed glasses, cuz she wants and can afford it.

craggy crest Jul 10, 2024, 4:29 PM

#

uncut river no, sd3 should do it. back in the days using sd15 i first generated and image, t...

8b, does. 2b - doesn't have that sort of enhanced ability.

#

uncut river Jul 10, 2024, 4:32 PM

#

I just want a working imitation of M$ Word Art (tm) inside the SD3 medium model. perhaps too much to ask for a 2003 technology?

#

😄

#

sorry, that was bordering trolling. Nevermind, I switched to generating realistic images of Caitlin anyway!

bitter hearth Jul 10, 2024, 4:34 PM

#

uncut river I just want a working imitation of M$ Word Art (tm) inside the SD3 medium model....

like ascii ?

uncut river Jul 10, 2024, 4:35 PM

#

lol, no. ascii is 1963 tech

bitter hearth Jul 10, 2024, 4:35 PM

#

thomas

bitter hearth Jul 10, 2024, 4:35 PM

#

uncut river I don't care about the hands, but how can I make SD3 stop using double ll ? It's...

give me the prompt you used there

mortal mesa Jul 10, 2024, 4:38 PM

#

uncut river I just want a working imitation of M$ Word Art (tm) inside the SD3 medium model....

have you ever looked at Comfyroll studio nodes

uncut river Jul 10, 2024, 4:39 PM

#

no, what is that?

mortal mesa Jul 10, 2024, 4:40 PM

#

lots of graphics and text nodes, its quite good

#

https://civitai.com/models/183551/comfyui-comfyroll-custom-nodes https://github.com/Suzie1/ComfyUI_Comfyroll_CustomNodes

uncut river Jul 10, 2024, 4:42 PM

#

Caitlin, expensive glasses for each day or outfit. Cute looks but a calculated cold character, who loves her work and does not like standing still. Her study in behavior management really helps her at the facility.

devout schooner Jul 10, 2024, 4:50 PM

#

For all the talk of censorship in SD3, I find it draws women who are randomly straight up naked (but disturbingly without nipples or like anything other than continuous skin downstairs) quite often, even when I didn't prompt for it at all. Not gonna post any examples cause they're all creepy, looks like burn victims or something lol

mortal mesa Jul 10, 2024, 4:52 PM

#

yes

uncut river Jul 10, 2024, 4:53 PM

#

damn, it seems to be hard to get a real photo style with purple eyes from sd3

uncut river Jul 10, 2024, 4:58 PM

#

devout schooner For all the talk of censorship in SD3, I find it draws women who are randomly st...

yes, it seems there is still some nsfw hidden in the model, but indeed somewhat masked/mangled

#

it seems they have not removed all nsfw from the dataset and trained from scratch, but instead evolved existing models based which do include nsfw images in the dataset. I think sd3 is just trained not to show the more explicit stuff that is hidden within it.

mortal mesa Jul 10, 2024, 5:05 PM

#

if nipple reroute to smooth plastic

bitter hearth Jul 10, 2024, 5:06 PM

#

does anyone have a guide to CFG and Steps numbers for SD3

uncut river Jul 10, 2024, 5:06 PM

#

maybe it's just me, but I think sd3 makes better sexy images when putting stuff like (naked, nude, explicit:1.1) in the NEG prompt

#

yes, go for low CFG

bitter hearth Jul 10, 2024, 5:06 PM

#

there was a complex discussion on here a few days ago about negative prompts
they might not do much

uncut river Jul 10, 2024, 5:07 PM

#

like between 3.8 and maybe up to 6 typically around 4.0 to 4.4 or maybe just keep it on 4.375

bitter hearth Jul 10, 2024, 5:07 PM

#

I am not sure as I am just now barely starting to test SD3

#

ah okay thanks

#

yeah 2 was too low and 7 too high

uncut river Jul 10, 2024, 5:07 PM

#

and if you want to go crazy, don't let the usual limits stop you

#

btw, I only know sd3 medium running locally

#

im not sure, but I think the architecture and prompting for the larger models differ

bitter hearth Jul 10, 2024, 5:08 PM

#

I am using a huggingface space

#

but the only problem is it doesn't say the sampler

#

I might have to do it properly in comfy to know what the actual full settings are

uncut river Jul 10, 2024, 5:08 PM

#

hm, i think samplers are overrated

#

I just use Euler for about everything

bitter hearth Jul 10, 2024, 5:11 PM

#

its mostly that they either converge or not

#

and need a different amount of steps

mortal mesa Jul 10, 2024, 5:15 PM

#

many samplers dont ever converge by design, kinda why i like DPM_Adaptive it picks out how many steps it needs by itself to converge

uncut river Jul 10, 2024, 5:16 PM

#

Caitlin back at work, as happy as she can get! (she loves serious work...). though maybe sd3 turned her a bit asian. Oh well, it's all about behavior for Caitlin, the good looks are just her trademark.

bitter hearth Jul 10, 2024, 5:16 PM

#

I've always used DPM++ 2M Karras, DPM++2S a Karras or DPM++ SDE 2M Karras

#

cos that gets you an ancestral one and an SDE one

#

as well as just DPM++ 2M Karras

#

Interesting

uncut river Jul 10, 2024, 5:20 PM

#

very similar prompt in SD15 - RealisticVisionV60B1 - for reference

#

you don't see it, but there was color bleeding all over, so had to cherry pick this one. SD15 might be able to give nice(r) results, but at the cost of some rejects

craggy crest Jul 10, 2024, 5:23 PM

#

prompt: hdr photograph, head and shoulder shot, a man,1960s hippy

bitter hearth Jul 10, 2024, 5:27 PM

#

bitter hearth Jul 10, 2024, 5:29 PM

#

craggy crest prompt: hdr photograph, head and shoulder shot, a man,1960s hippy

hdr photograph, head and shoulder shot, a man, cyberpunk 2077 hippy

craggy crest Jul 10, 2024, 5:29 PM

#

bitter hearth hdr photograph, head and shoulder shot, a man, cyberpunk 2077 hippy

i just thought it was funny that when i asked for a 1960s hippy, i got a jim croce look alike

uncut river Jul 10, 2024, 5:32 PM

#

craggy crest prompt: hdr photograph, head and shoulder shot, a man,1960s hippy

makes me think of Frank Zappa

craggy crest Jul 10, 2024, 5:33 PM

#

uncut river makes me think of Frank Zappa

yeah, sort of a mix between them I think

muted dove Jul 10, 2024, 6:28 PM

#

craggy crest prompt: hdr photograph, head and shoulder shot, a man,1960s hippy

https://tenor.com/view/adam-driver-snl-kylo-ren-thumbs-up-okay-gif-16158853

Tenor

bitter hearth Jul 10, 2024, 6:42 PM

#

sadly the bartender triggered the anatomy problem

#

ok so putting man in the negative is the way to go

torn wharf Jul 10, 2024, 6:54 PM

#

my friend challenged me to train this barbie bimbo instagram model on sd3 and said it was impossible and could't be done. i'd had difficulty getting people's likenesses but i thought i could show him up. so i think i trained sd3 to do this megan millions girl using pics scrapped off her instagram. it does a lot of selfies mostly but it works.

#

||https://ibb.co/album/F5cDXw|| may be slightly pg13. these are all from sd3 with my trained lora. she's not my type but i love giving a good ol "in yo face" to haters.

#

if i could do this with 100 images over 100 epochs. minute per epoch. i believe in fine tuners

edgy kelp Jul 10, 2024, 7:02 PM

#

torn wharf if i could do this with 100 images over 100 epochs. minute per epoch. i believe...

Results are good, but damn a lora on 100 images for a single subject...

bitter hearth Jul 10, 2024, 7:04 PM

#

#

#

lavish osprey Jul 10, 2024, 7:14 PM

#

uncut river I don't care about the hands, but how can I make SD3 stop using double ll ? It's...

negative works

low stone Jul 10, 2024, 7:15 PM

#

bitter hearth

bitter hearth Jul 10, 2024, 7:19 PM

#

#

uncut river Jul 10, 2024, 7:21 PM

#

#

torn wharf Jul 10, 2024, 7:28 PM

#

edgy kelp Results are good, but damn a lora on 100 images for a single subject...

maybe that's where i went wrong. other expert lora trainers are using 80000 alpha

edgy kelp Jul 10, 2024, 7:29 PM

#

torn wharf maybe that's where i went wrong. other expert lora trainers are using 80000 alp...

No I didn't mean you are doing it wrong, if it's working it looks like an issue of the architecture of the model. I'm used to a max of 20 images for the best likeness

torn wharf Jul 10, 2024, 7:30 PM

#

i was joking a bit lol. i have no idea what i'm doing. i'll try with less images.

Those results are VERY cherry picked i should say too. All the underlying model problems are still there

bitter hearth Jul 10, 2024, 7:30 PM

#

depends on the character or whatever

#

if you need more or less images thomas

edgy kelp Jul 10, 2024, 7:34 PM

#

I think the average instagram model should be easy enough to train on 10 images, you'd have more difficult time with videogame aliens characters

craggy crest Jul 10, 2024, 7:35 PM

#

torn wharf i was joking a bit lol. i have no idea what i'm doing. i'll try with less imag...

i think you could just prompt that character without training anything and get it without too much trouble

desert garnet Jul 10, 2024, 7:35 PM

#

edgy kelp I think the average instagram model should be easy enough to train on 10 images,...

yea its easier to train on photorealistic than on highly stylized imgs like 2d

edgy kelp Jul 10, 2024, 7:35 PM

#

Well... if you have to train nudity on SD3 maybe you might need some thousands of images for a lora, as Cat with 99999 gb vram implied

bitter hearth Jul 10, 2024, 7:36 PM

#

DO NOT make us use our vram

#

thats just asking for it

torn wharf Jul 10, 2024, 7:36 PM

#

sd3 training is more efficient or maybe i'm just stupid. I can barely do batches of 2 on sdxl but on sd3 i can do batches of 10

#

wth room to spare

edgy kelp Jul 10, 2024, 7:36 PM

#

Do not make me come here and use my VRAM, yeah?!

bitter hearth Jul 10, 2024, 7:36 PM

#

🫣

edgy kelp Jul 10, 2024, 7:37 PM

#

torn wharf sd3 training is more efficient or maybe i'm just stupid. I can barely do batche...

Me: clueless

torn wharf Jul 10, 2024, 7:37 PM

#

like, totally

edgy kelp Jul 10, 2024, 7:38 PM

#

I have no idea either

torn wharf Jul 10, 2024, 7:38 PM

#

https://tenor.com/view/rollin-homies-batman-robin-head-nod-gif-5440088

Tenor

batman and robin homies

▶ Play video

edgy kelp Jul 10, 2024, 7:39 PM

#

BALLS

bitter hearth Jul 10, 2024, 7:43 PM

#

#

torn wharf Jul 10, 2024, 7:48 PM

#

bitter hearth

https://www.youtube.com/watch?v=EZNFo5lL4iw has this energy to it

YouTube

OfficialDynamiteHack

Dynamite Hack - Boyz In The Hood

The Official HD Video straight from the Band. From the album SUPERFAST.

▶ Play video

bitter hearth Jul 10, 2024, 7:48 PM

#

oh goodness!

#

ok so it can make a bartender correctly
but only if the bartender is batman saying hello

craggy crest Jul 10, 2024, 7:52 PM

#

bitter hearth ok so it can make a bartender correctly but only if the bartender is batman sayi...

thosea are some interesting chairs

bitter hearth Jul 10, 2024, 7:52 PM

#

lol yeah

#

art deco in the prompt does that

#

if you cherry pick you get much better results, this one is fantastic

#

its very inconsistent

torn wharf Jul 10, 2024, 7:57 PM

#

https://tenor.com/view/scary-movie-strong-hand-gif-5122498

Tenor

bitter hearth Jul 10, 2024, 8:01 PM

#

#

#

#

#

hello

#

#

#

hmm the image quality goes up if you stop prompting for super heroes

#

same prompts but with bartender instead of batman or superman

#

#

#

#

#

#

the fashion

craggy crest Jul 10, 2024, 9:08 PM

#

bitter hearth art deco in the prompt does that

add rococo to your prompt

odd basalt Jul 10, 2024, 9:08 PM

#

hazy kestrel Jul 10, 2024, 9:39 PM

#

#

placid swallow Jul 10, 2024, 10:04 PM

#

can someone DM me the 3.1 2b plz

bitter hearth Jul 10, 2024, 10:35 PM

#

placid swallow can someone DM me the 3.1 2b plz

you have to prompt for it

#

if you cant I think its a skill issue

#

placid swallow Jul 10, 2024, 10:38 PM

#

no way jose I copied all the skillz i needed from the hugginface

craggy crest Jul 10, 2024, 11:12 PM

#

placid swallow no way jose I copied all the skillz i needed from the hugginface

huggingface called, they want their skills back

sage burrow Jul 10, 2024, 11:14 PM

#

A narwal anthro 😉

open__a_narwal_furry_hot_sexy_muscular_man_adopt_by_ladylalitafantasyart_dhr5qyb-fullview.jpg

bitter hearth Jul 10, 2024, 11:29 PM

#

hazy kestrel Jul 10, 2024, 11:40 PM

#

bitter hearth Jul 11, 2024, 12:51 AM

#

funky

#

"inverted colors" in the prompt

#

alpine summit Jul 11, 2024, 12:55 AM

#

hallow lion Jul 11, 2024, 1:11 AM

#

those are some magnificent cats

hallow lion Jul 11, 2024, 1:13 AM

#

bitter hearth

hey its the sd3 wizard form the promo banner, he's back with a gooder license and the promise of an even gooder model to fix and rectify the universe. SD3.5 will be great just like SD1.5 just like poetry- it will rhyme.

regal hollow Jul 11, 2024, 1:41 AM

#

how to use?

hallow lion Jul 11, 2024, 1:42 AM

#

thats a very vague question...

#

if u mean sd3 then u got the API and/or comfyui but youll have to dig a bit for the models since civitia banned them

regal hollow Jul 11, 2024, 1:44 AM

#

i want to use like a midjouney in discord

hallow lion Jul 11, 2024, 1:44 AM

#

then go to the midjourney channel

regal hollow Jul 11, 2024, 1:44 AM

#

i cant?

#

nono i want use sd. but use direction is like a discord usable

hallow lion Jul 11, 2024, 1:45 AM

#

well there is Artisan

regal hollow Jul 11, 2024, 1:45 AM

#

thank u

finite fractal Jul 11, 2024, 1:51 AM

#

At dusk, a muscular man riding a bicycle at 120 KM/H on the highway, dramatic lighting, intense motion blur, dynamic pose, cinematic atmosphere, high-speed action, detailed muscles, realistic style

sacred jewel Jul 11, 2024, 2:13 AM

#

low stone

That is the 8B dollar question.

bitter hearth Jul 11, 2024, 2:34 AM

#

I am bored... so its time for more balls

#

torn wharf Jul 11, 2024, 2:40 AM

#

balls!

bitter hearth Jul 11, 2024, 4:07 AM

#

No one balling sadcat

#

hazy kestrel Jul 11, 2024, 4:22 AM

#

bitter hearth Jul 11, 2024, 4:31 AM

#

waow

placid belfry Jul 11, 2024, 5:14 AM

#

made with the medium opensource version

bitter hearth Jul 11, 2024, 5:22 AM

#

hazy kestrel Jul 11, 2024, 5:35 AM

#

bitter hearth Jul 11, 2024, 5:48 AM

#

sullen moss Jul 11, 2024, 5:48 AM

#

hazy kestrel

What about to lay down? 😉

bitter hearth Jul 11, 2024, 5:49 AM

#

hazy kestrel Jul 11, 2024, 5:50 AM

#

bitter hearth Jul 11, 2024, 5:56 AM

#

#

#

mortal mesa Jul 11, 2024, 6:02 AM

#

2024-07-11-015441-SD3_sd3_medium.safetensors_00001_.png

mortal mesa Jul 11, 2024, 6:17 AM

#

2024-07-11-021027-SD3_sd3_medium.safetensors_00001_.png

mortal mesa Jul 11, 2024, 6:43 AM

#

2024-07-11-023009-SD3_sd3_medium.safetensors_00001_.png

edgy kelp Jul 11, 2024, 6:51 AM

#

bitter hearth

Neat, I woke up to some balls!

bitter hearth Jul 11, 2024, 6:58 AM

#

#

#

#

#

sage burrow Jul 11, 2024, 7:21 AM

#

There seems to be 3 versions of sd3 for dl. 1 w/o clips, the 10gb one w clips, and the 15gb one w clips. Does the w/o clips version require less vram? Does the 10gb one require less vram than the 15gb one?

muted dove Jul 11, 2024, 7:28 AM

#

The largest includes clips and the T5. You can load everything individually, or all together. The different sizes just give you flexibility on which parts you want to use/load.

bitter hearth Jul 11, 2024, 7:48 AM

#

I like SD3 now

#

didn't really at first

#

but making stormtroopers invade 17th Century France has been fun

sage burrow Jul 11, 2024, 7:53 AM

#

muted dove The largest includes clips and the T5. You can load everything individually, or ...

Thank you 🙂
I have the 10gb one, tempted to get the 15gb instead. They both have the clips.

#

I love it when software "shopoing" is free 😄

sterile pendant Jul 11, 2024, 7:56 AM

#

sage burrow Thank you 🙂 I have the 10gb one, tempted to get the 15gb instead. They both hav...

The most flexible way is to get the smallest sd3 model(doesn't have text encoders) and to then download the three individual encoders. Within comfyui, you can then use a single, double or triple clip loader to pick which ones you want

#

And it will take up the same amount of storage space as if you downloaded the largest sd3 checkpoint that contains all three encoders with it

static cedar Jul 11, 2024, 7:59 AM

#

one tree during winter, reflection from lake, all white --v 6.0

sterile pendant Jul 11, 2024, 7:59 AM

#

oh and dont worry about the fp16 version of the t5, it's mostly pointless. you could run a million A/B blind tests and would likely see them both within margin of error of each other, in terms of voting

#

if you have the ram/vram for it, sure, go for it

bitter hearth Jul 11, 2024, 8:05 AM

#

sage burrow Jul 11, 2024, 8:08 AM

#

sterile pendant if you have the ram/vram for it, sure, go for it

I can barely run sd3 to begin with lol (only 8gb vram)

sage burrow Jul 11, 2024, 8:10 AM

#

bitter hearth but making stormtroopers invade 17th Century France has been fun

It knows quite a few painters from that period as well, I'm case you ever want to get really specific

bitter hearth Jul 11, 2024, 8:10 AM

#

ah okay that's good

#

midjourney is the best at knowing artists I think

#

edgy kelp Jul 11, 2024, 8:17 AM

#

Dall-E 3 very likely knows even more, but the prompt expansion might fugg them up... and not to talk about the filters lol

bitter hearth Jul 11, 2024, 8:30 AM

#

#

#

I don't know anything about anime but this is my sci fi anime attempt

#

#

not even sure which anime that style comes from

limpid thunderBOT Jul 11, 2024, 9:35 AM

#

Thank you for using comcom analytics.
"comcom analytics" supports all community managers (moderators and server owners) by stats, visualization, and analytics.

If you have any questions, feel free to ask us!
Your dashboard
Help
Support server

Other languages
en: help
ja: help Japanese

sterile pendant Jul 11, 2024, 9:56 AM

#

sage burrow I can barely run sd3 to begin with lol (only 8gb vram)

I run SD3 on 8gb just fine, still using an old RTX 2080 FE on this PC

hallow lion Jul 11, 2024, 9:57 AM

#

the one thing emad delivered on: it is PC resource friendly.

#

Thanks emad. Say thanks kids.

sterile pendant Jul 11, 2024, 10:01 AM

#

well i'd also give a HUGE shoutout to comfyanon and all the others that work on comfyui. that's where the real performance comes from. his system for automatically handling model offloading at various steps and stages of the process, is what keeps the vram usage down.

hallow lion Jul 11, 2024, 10:04 AM

#

yes comfy is amazing, it is as fast as foooocus but without the quality loss

cobalt moon Jul 11, 2024, 10:04 AM

#

6GB also run just fine

#

although may be a little bit slower by community standard

#

( 40 sec per 28 steps 1024x1024 )

hallow lion Jul 11, 2024, 10:05 AM

#

i started with 4GB now that was slower than standards

#

took about 2-3 mins for 512

cobalt moon Jul 11, 2024, 10:05 AM

#

hallow lion took about 2-3 mins for 512

that is fast at my standard

hallow lion Jul 11, 2024, 10:05 AM

#

4-5 for sdxl 1024

sterile pendant Jul 11, 2024, 10:05 AM

#

cobalt moon ( 40 sec per 28 steps 1024x1024 )

nah thats about average if you're using an older gen card. my 2080 spits out 35 step 1024x1024 sdxl images like like 30 seconds i think

cobalt moon Jul 11, 2024, 10:05 AM

#

lmao

hallow lion Jul 11, 2024, 10:06 AM

#

🤔

cobalt moon Jul 11, 2024, 10:06 AM

#

it is a Laptop 4050 so yeah

sterile pendant Jul 11, 2024, 10:08 AM

#

if you know specific resolutions you like to use, i highly advise going the tensorRT route for experimenting. like if you don't need to use cnets, loras, ipa, etc, you can get like 75% reductions in generation time. it's great for exploring (they might be able to work some of those features? not sure, never tested to actually see)

#

like for me, the only resolutions i use are 1024/1024, 1152/896 and 1344/768, or reversed

bitter hearth Jul 11, 2024, 10:44 AM

#

I like ultrawide

#

more than I even use square LOL

odd basalt Jul 11, 2024, 11:04 AM

#

sterile pendant Jul 11, 2024, 11:15 AM

#

bitter hearth I like ultrawide

yeah, but the problem is that most models are not trained for extreme aspect ratios like that. sure, they'll work sometimes, but think of the dataset used to train the models. there is likely VERY little content involving things like 21:9 aspect ratios. 16:9 would is likely the widest they naturally want to go, which aligns pretty well with 1344x768(still roughly 1 megapixel, so well within a typical model pixel range, and also, both numbers are divisible by 64/32/16/8 evenly). i use 1344/768 a lot because if you do a NN latent upscale by 1.5, it puts it just a hair of 1080p and is super easy to crop/resize to size

deft briar Jul 11, 2024, 11:15 AM

#

Create a highly realistic and dynamic image of the Indian cricket team celebrating their victorious moment after winning the Champions Trophy. The scene should capture the exhilaration and joy of the players as they celebrate on the cricket field. Use vivid colors and sharp details to portray the players in their blue uniforms, some holding the trophy high, others embracing, and some jumping in joy. Include elements like confetti raining down, fireworks in the sky, and a jubilant crowd in the background. The expressions on the players' faces should reflect pure happiness, pride, and excitement. Ensure the setting is a well-lit stadium, with bright floodlights, a lush green pitch, and the Champions Trophy prominently displayed. The image should evoke a sense of triumph and national pride, making the viewers feel the energy and emotion of this historic win.

Specific Details:

Players' Emotions: Capture various emotions like shouting with joy, tears of happiness, and players lifting each other in celebration.
Team Unity: Show the players in a close group, arms around each other, symbolizing team spirit and camaraderie.
Trophy Display: Ensure the Champions Trophy is clearly visible, being held by the team captain or a group of players, reflecting the significance of the win.
Background Elements: Include a cheering crowd, waving Indian flags, and banners with congratulatory messages, adding to the festive atmosphere.
Action Shots: Some players could be shown spraying champagne or doing victory laps around the field.

proven cipher Jul 11, 2024, 11:42 AM

#

when i try to create an image with SD3 1368px * 2048px it just fills top and bottom with nonsense and process a square. Is there any workaround ? SDXL works fine in comparison.

#

signal shuttle Jul 11, 2024, 11:55 AM

#

proven cipher when i try to create an image with SD3 1368px * 2048px it just fills top and bo...

That resolution isn't supported in SD3, try creating a 1:1 image or a 16:9 or a 9:16 image

proven cipher Jul 11, 2024, 12:01 PM

#

signal shuttle That resolution isn't supported in SD3, try creating a 1:1 image or a 16:9 or a ...

9:16 i have the same issue 🤔

sage burrow Jul 11, 2024, 12:01 PM

#

sterile pendant I run SD3 on 8gb just fine, still using an old RTX 2080 FE on this PC

Nvidia 2060 here. It runs fine with simple workflows, but not with any fancy ones.

How is 2080 old?

signal shuttle Jul 11, 2024, 12:03 PM

#

proven cipher 9:16 i have the same issue 🤔

how much is your cfg?, and what sampler + scheduler are you using?

proven cipher Jul 11, 2024, 12:03 PM

#

signal shuttle how much is your cfg?, and what sampler + scheduler are you using?

cfg 7, sampler dpm2, sheduler normal

signal shuttle Jul 11, 2024, 12:04 PM

#

proven cipher cfg 7, sampler dpm2, sheduler normal

try lowering the cfg to 3.5 or 4 and use heunpp2 with simple or normal and set your steps to 28 or 30

sage burrow Jul 11, 2024, 12:04 PM

#

cobalt moon although may be a little bit slower by community standard

None of that fancy triple prompt workflow stuff though 😦 lololol

signal shuttle Jul 11, 2024, 12:05 PM

#

proven cipher cfg 7, sampler dpm2, sheduler normal

SD3 works best with low cfgs, higher cfgs make the images look burned

sage burrow Jul 11, 2024, 12:05 PM

#

hallow lion i started with 4GB now that was slower than standards

I need that fb care reaction lolololol

low stone Jul 11, 2024, 12:11 PM

#

signal shuttle SD3 works best with low cfgs, higher cfgs make the images look burned

Sadly sd3 2b looked burned a lot of the time even at 4 cfg. Rather frustrating.

signal shuttle Jul 11, 2024, 12:12 PM

#

low stone Sadly sd3 2b looked burned a lot of the time even at 4 cfg. Rather frustrating.

Weird, i never have a problem with burned images with SD3 2B

low stone Jul 11, 2024, 12:16 PM

#

signal shuttle Weird, i never have a problem with burned images with SD3 2B

#

This is with cfg 4. Seems harsh lighting.

signal shuttle Jul 11, 2024, 12:17 PM

#

low stone

are you using heunpp2 with normal or simple?

edgy kelp Jul 11, 2024, 12:17 PM

#

I generally think that's an issue of prominent synthetic data in the pretraining

low stone Jul 11, 2024, 12:18 PM

#

Euler. Heunpp2 is massively slower and doesn't yield better quality for me. I, doing 50 steps though which did make a difference.

#

I'm using comfy's workflow that he published.

#

It the regular one

bitter hearth Jul 11, 2024, 12:18 PM

#

sterile pendant yeah, but the problem is that most models are not trained for extreme aspect rat...

they don't say how many images of each aspect ratio were in the training data, but this is the table from the SDXL paper and it has 16 9 in the form of the resolution of 1536 x 640 in particular. It apparently has some images at 2048 x 512 which is crazily wide.

low stone Jul 11, 2024, 12:18 PM

#

Not the regular one.

#

proven cipher Jul 11, 2024, 12:23 PM

#

signal shuttle try lowering the cfg to 3.5 or 4 and use heunpp2 with simple or normal and set y...

still questionable output oO

#

#

with sdxl i got a lot better results

low stone Jul 11, 2024, 12:25 PM

#

proven cipher still questionable output oO

Can you paste the prompt? I'll try on mine

proven cipher Jul 11, 2024, 12:25 PM

#

here are more (questionable) SD3 outputs

cobalt moon Jul 11, 2024, 12:26 PM

#

today I just try SD3

#

on my another 4050 laptop

proven cipher Jul 11, 2024, 12:26 PM

#

low stone Can you paste the prompt? I'll try on mine

tried just a simple prompt:
positive: top view, Music Festival, anime style, key visual, vibrant, studio anime, highly detailed
negative: photo, deformed, black and white, realism, disfigured, low contrast

signal shuttle Jul 11, 2024, 12:28 PM

#

proven cipher tried just a simple prompt: positive: top view, Music Festival, anime style, key...

Negative prompts don't do much in SD3, using ...................... or aaaaaaaaaaaa has the same effect on the image

low stone Jul 11, 2024, 12:29 PM

#

proven cipher tried just a simple prompt: positive: top view, Music Festival, anime style, key...

#

Based on comfy's workflow, I've found the optimal res to be 1024x1280 or 1024x1344. I don't use anything else anymore.

proven cipher Jul 11, 2024, 12:30 PM

#

i need something like 1368 x 2048 as smartphone backgrounds

#

i will try with 1024x1344

low stone Jul 11, 2024, 12:31 PM

#

Yeah, if you need higher, you'll have to use upscaling methods. Sd3 won't full had resolutions directly.

proven cipher Jul 11, 2024, 12:36 PM

#

low stone Yeah, if you need higher, you'll have to use upscaling methods. Sd3 won't full h...

can you tell me your settings, or show / send your workflow.

#

i get heavily nonsense with SD3, idk xD (same prompt as told above)

low stone Jul 11, 2024, 12:43 PM

#

proven cipher can you tell me your settings, or show / send your workflow.

https://comfyanonymous.github.io/ComfyUI_examples/sd3/

ComfyUI_examples

SD3 Examples

Examples of ComfyUI workflows

#

I use the workflow on the woman picture on that page, at 50 steps

hallow lion Jul 11, 2024, 12:44 PM

#

sage burrow I need that fb care reaction lolololol

give it to the cat with 4GB vram, he needs help

edgy kelp Jul 11, 2024, 12:45 PM

#

Do not make me come here and use my VRAM, yeah?

lucid swift Jul 11, 2024, 12:46 PM

#

sage burrow Jul 11, 2024, 12:53 PM

#

hallow lion give it to the cat with 4GB vram, he needs help

I've seen Cat's images, no way they are from a bgb system only lololol

edgy kelp Jul 11, 2024, 12:55 PM

#

Dude's Ballz (TM) are amazing, I guess it's not a neutered cat

mortal kite Jul 11, 2024, 1:12 PM

#

I have to say tho that all the low vram users I think limit the model. Can we really get an excellent model if it has o fit in 4GB?

bitter hearth Jul 11, 2024, 1:17 PM

#

I put mask in the prompt to avoid face issues
and it decided to do a lace mask

#

its quite clever at adapting to the theme

#

when I put friendly witch in the prompt it added flowers and a vase to also make the background more friendly

#

hmm same prompt but now they have covid mask

sage burrow Jul 11, 2024, 1:32 PM

#

mortal kite I have to say tho that all the low vram users I think limit the model. Can we re...

4 I gon't think is actually possible. Even 8 is pushing it (no advanced workflows). 12 is recommended.
I'm still surprised someone managed with only 6gb vram!

hallow lion Jul 11, 2024, 1:37 PM

#

sage burrow I've seen Cat's images, no way they are from a bgb system only lololol

He uses the API

#

Allegedly. Meow.

sage burrow Jul 11, 2024, 1:41 PM

#

The catch with the API (and glif and huggingface) is that we won't be able to add checkpoints and loras to it when they come out I've been eyeing up some cloud systems!

#

via glif SD3 large with CLaude helping

glif-lady-lalita-s-version-of-sd3-text2image-ladylalita-ptt51dfowgwef4c3bhi4ysfv.jpg

#

A werewolf sphinx lol

#

brb, Imma post this on some conspiracy groups 😉

#

urban arch Jul 11, 2024, 2:11 PM

#

sage burrow via glif SD3 large with CLaude helping

Prince Thun, of the Lion Men.

sage burrow Jul 11, 2024, 2:14 PM

#

A Ctulu centaur lol

sterile pendant Jul 11, 2024, 3:02 PM

#

sage burrow Nvidia 2060 here. It runs fine with simple workflows, but not with any fancy one...

the 2080 came out almost six years ago and was a top of the line card that cost in the 700-800 dollar range. now, it's roughly on par with a 4060 that costs less than half that and that draws less than half the power. in the tech world, six years is a long time.

sterile pendant Jul 11, 2024, 3:04 PM

#

bitter hearth they don't say how many images of each aspect ratio were in the training data, b...

yeah that is pretty wide, but again, they are likely a very tiny percentage of the actual dataset. wouldn't surprise me if all of the data beyond maybe 16:9 or 9:16 made up less than a few percent. this is why a lot of models will list recommended ranges and warn you not to stray too far beyond the recommendations. and when you do, you end up with the weird duplicated bodies and stuff like that

foggy cloak Jul 11, 2024, 3:27 PM

#

sterile pendant the 2080 came out almost six years ago and was a top of the line card that cost ...

2080 is ancient in AI terms, it’s still a decent card mind you but saying it’s not dated is a stretch

bitter hearth Jul 11, 2024, 3:40 PM

#

sterile pendant yeah that is pretty wide, but again, they are likely a very tiny percentage of t...

yeah its likely only a few percent of the actual dataset

#

I tend to use hidiffusion or deep shrink for generations like that

sterile pendant Jul 11, 2024, 3:55 PM

#

foggy cloak 2080 is ancient in AI terms, it’s still a decent card mind you but saying it’s n...

Yeah it's still decent for 1080p gaming and other 3d related stuff, but Nvidia is two whole generations ahead, about to be three when the 50xx series launches.

#

Point was that an entry level card of the current generation is on par with a top of the line card from the 20xx gen

#

But it also meant I got a ton of usage out of it before needing to upgrade again, so there's that

mortal mesa Jul 11, 2024, 3:59 PM

#

mmm its better than some, i rock a 2080 TI

#

waiting on 50 series pricing, when i get up off the floor ill probably get a used 3090

sage burrow Jul 11, 2024, 4:31 PM

#

Hmmm, cloud computing options seem to be less than monthly computer payments, for a computer which will be obsolete in 2 years, hmmm

torn wharf Jul 11, 2024, 4:34 PM

#

not obsolete. the 2080 is 6 years old and still useable. just ancient

#

obsolete has a specific meaning

#

lots of old tech stil has uses

mortal mesa Jul 11, 2024, 4:36 PM

#

landfills!

torn wharf Jul 11, 2024, 4:37 PM

#

e waste is a useless thing

bitter hearth Jul 11, 2024, 4:38 PM

#

Give me that useless and obsolete card sadcat

mortal mesa Jul 11, 2024, 4:38 PM

#

who doesnt want more land

sterile pendant Jul 11, 2024, 4:38 PM

#

torn wharf lots of old tech stil has uses

Yeah exactly. But I'm holding out for probably a 5070. I'm not THAT big of an AI junkie, but I still game quite a bit. I don't care about shit like 4k 360fps style gaming though, so the xx70 model cards are more than enough

silver sluice Jul 11, 2024, 4:38 PM

#

proven cipher with sdxl i got a lot better results

you should turn this into a where's waldo

sterile pendant Jul 11, 2024, 4:39 PM

#

I'm assuming the 5070 will have 16gb vram, so it's good enough

torn wharf Jul 11, 2024, 4:40 PM

#

i game tons so i'm always getting a new gpu. AI was the cause of me switching from AMD to Nvidia. I probably should've switched around the 1080 generation instead. Nvidia really started to shine so hard then.

mortal mesa Jul 11, 2024, 4:40 PM

#

2024-07-11-122939-SD3_sd3_medium.safetensors_00001_.png

sterile pendant Jul 11, 2024, 4:41 PM

#

I have a 7900xt in our other PC, it's dawgpoop for AI related stuff, or at least it was the last time I tried it last fall

torn wharf Jul 11, 2024, 4:41 PM

#

new card every couple of years but i try to find purpose for my old cards and don't just landfill them. at my old house i'd have them on my wall but they were ugly. i might make a knolling case for them next

mortal mesa Jul 11, 2024, 4:42 PM

#

thanks i learned a new word today

torn wharf Jul 11, 2024, 4:42 PM

#

was actually going to get the 7900 on launch week but it was apaper launch in Canada. I couldn't find any places anywhere between edmonton and vancouver that got any in stock. amd fucked around that launch hard. that was the other major factor that made me switch to nvidia

torn wharf Jul 11, 2024, 4:43 PM

#

mortal mesa thanks i learned a new word today

i actually learned that when sd2 came out and someone had a knolling case lora for it that was beauty

#

https://civitai.com/models/1203/knollingcase-embeddings-sd-v2-0 mb it was an embedding

knollingcase-embeddings-sd-v2-0 - kc16 v1 4000 | Stable Diffusion E...

Originally posted to HuggingFace by ProGamerGov The embeddings in this repository were trained for the 768px Stable Diffusion v2.0 model. The embed...

mortal mesa Jul 11, 2024, 4:44 PM

#

nice

#

sterile pendant Jul 11, 2024, 4:49 PM

#

Bruh the term knolling case is so redundant... Almost every case you'll ever see or uses 90deg angles, even if there are curves, the base is flat or the object being held is kept perpendicular to the surface the case is on

low stone Jul 11, 2024, 4:53 PM

#

Sd3 2b. I asked for inside the brain of Cookie Monster.

sage burrow Jul 11, 2024, 4:56 PM

#

Never ladfill computers! in my city there's this for eg., as well as another~~ one~~about ten or so which donates them to people to broke to buy their own computer. Probably similar in every city https://www.rebootcanada.ca/#:~:text=Supporting reBOOT Canada is simple,virtually anywhere in the country.

reBOOT Canada

Brendan Quigley

Home - reBOOT Canada

reBOOT Canada provides computer equipment, training and technical support to charities, non-profits and people with limited access to technology.

bitter hearth Jul 11, 2024, 4:56 PM

#

sage burrow Never ladfill computers! in my city there's this for eg., as well as another~~ ...

waow

desert garnet Jul 11, 2024, 4:57 PM

#

sage burrow Never ladfill computers! in my city there's this for eg., as well as another~~ ...

i will gladly accept your computer

mortal mesa Jul 11, 2024, 4:57 PM

#

gotta save room for the wind turbines, they ar HUGE

sage burrow Jul 11, 2024, 4:57 PM

#

desert garnet i will gladly accept your computer

I gave away some from the 90's earlier this year, are you sure? 😉

bitter hearth Jul 11, 2024, 4:58 PM

#

Becky is one step ahead of me sadcat

desert garnet Jul 11, 2024, 4:58 PM

#

sage burrow I gave away some from the 90's earlier this year, are you sure? 😉

https://tenor.com/view/free-money-meme-fast-gif-gif-16935040457953192433

Tenor

sage burrow Jul 11, 2024, 4:58 PM

#

bitter hearth <:waow:1017853838516035725>

Don't get too excited, the tech they give away is generally 6 years old at least. Also people have to prove that they really well below the poverty line.

mortal mesa Jul 11, 2024, 4:58 PM

#

used to pick computers on garbage days and Frankenstein them into something, dont see much on the curbs like that now

bitter hearth Jul 11, 2024, 4:59 PM

#

sage burrow Don't get too excited, the tech they give away is generally 6 years old at least...

My gpu is 6 years old sadcat

torn wharf Jul 11, 2024, 4:59 PM

#

sage burrow Never ladfill computers! in my city there's this for eg., as well as another~~ ...

when i was a kid and couldn't afford a pc that could play quake, something like $4000. I had a 486 given to me that kind of ran windows 3.1. it had a 1 speed cdrom and the discs had these jewel cases i had to put them into before popping that whole case into the drive.

I played the shit out of lemmings on that beast and learned so much

sage burrow Jul 11, 2024, 4:59 PM

#

bitter hearth My gpu is 6 years old <:sadcat:1130568570712109176>

and none with GPUs I'm pretty sure lol

bitter hearth Jul 11, 2024, 4:59 PM

#

I think.. it might be from 2019

#

thomas

bitter hearth Jul 11, 2024, 5:00 PM

#

torn wharf when i was a kid and couldn't afford a pc that could play quake, something like ...

thomas

sacred jewel Jul 11, 2024, 5:01 PM

#

low stone Sd3 2b. I asked for inside the brain of Cookie Monster.

sage burrow Jul 11, 2024, 5:01 PM

#

bitter hearth I think.. it might be from 2019

I have a dell workstation from 2018, and it has no GPU, darnit!

bitter hearth Jul 11, 2024, 5:05 PM

#

edgy kelp Jul 11, 2024, 5:06 PM

#

bitter hearth

BALLS!

bitter hearth Jul 11, 2024, 5:07 PM

#

edgy kelp BALLS!

Eat my balls

#

waow

edgy kelp Jul 11, 2024, 5:07 PM

#

Haha

low stone Jul 11, 2024, 5:08 PM

#

#

A colorful, swirling vortex of half-eaten cookies and crumbs inside a fuzzy blue brain. Chaotic synapses shaped like chocolate chips firing erratically. Dark, shadowy corners filled with forgotten vegetables. Frenetic thought bubbles containing jumbled letters spelling "COOKIE" in various fonts. Tiny, worried-looking Sesame Street characters trying to navigate through the cookie debris. Flashing neon signs reading "EAT" and "MORE" scattered throughout the brain tissue. A distant, echoing laugh track playing in the background. Cracked mirrors reflecting distorted images of cookies and milk. Pulsing veins carrying streams of cookie dough instead of blood.

bitter hearth Jul 11, 2024, 5:13 PM

#

#

wide pagoda Jul 11, 2024, 5:13 PM

#

low stone A colorful, swirling vortex of half-eaten cookies and crumbs inside a fuzzy blue...

Is that an ai generated prompt?

bitter hearth Jul 11, 2024, 5:13 PM

#

I always use GPT-4o for prompts its pretty good

#

his prompt kinda looks AI

edgy kelp Jul 11, 2024, 5:15 PM

#

I think SD3 should work best with AI generated prompts as its training captions were made with CogVLM

low stone Jul 11, 2024, 5:19 PM

#

wide pagoda Is that an ai generated prompt?

It's Claude 3.5 sonnet. I was using gpt4o but I just started using Claude and it's SO much better for creative prompts.

mortal mesa Jul 11, 2024, 5:19 PM

#

2024-07-11-130726-SD3_sd3_medium.safetensors_00001_.png

bitter hearth Jul 11, 2024, 5:23 PM

#

AI prompts, what's next ? AI images?? sadcat

edgy kelp Jul 11, 2024, 5:25 PM

#

Ah darn, these new things made by THE DEVIL, in my times we had Dall-E 3!

torn wharf Jul 11, 2024, 5:31 PM

#

edgy kelp I think SD3 should work best with AI generated prompts as its training captions ...

it doesn't inherit some greater compatibility with the model that created the captions. cogvlm generates natural language captions. the inherent compatibility is then natural language prompts.

#

for a lot of people this means using an LLM

rich tartan Jul 11, 2024, 5:31 PM

#

How to use?

kindred pumice Jul 11, 2024, 5:39 PM

#

Hi

edgy kelp Jul 11, 2024, 5:41 PM

#

torn wharf for a lot of people this means using an LLM

Not sure, natural language made by LLMs is not really natural

torn wharf Jul 11, 2024, 5:43 PM

#

i believe you're stuck in magical thinking and don't have any evidence to support your hypothesis. cogvlm captions don't lead a model to understand LLMs better than people prompts.

You have a skewed understanding of what natural language is.

edgy kelp Jul 11, 2024, 5:46 PM

#

I mean I'm not entirely sure, there are many systems that recognize "synthetic" sentences and long forms, but that's another thing, when I said it works best with AI generated prompt I didn't mean necessarily in contrast to prompts written by humans but rather "it's the intended way"

torn wharf Jul 11, 2024, 5:47 PM

#

natural language was intended. not a stronger compatibility to LLMs. LLMs can produce tag style prompts too

winged grail Jul 11, 2024, 5:48 PM

#

hey im having problems installing ReActor, it's not showing up when i installed it. Any way you guys could help me out?

torn wharf Jul 11, 2024, 5:52 PM

#

winged grail hey im having problems installing ReActor, it's not showing up when i installed ...

consle probably says an error, something about no insightface installation. this is tricky on windows and i often elect to use a precompiled version of insight face. precompiled are often not ideal since that's how viruses can easily spread, but in this case insightface is popular so it's kind of easy to find a reliable one. https://github.com/Gourieff/sd-webui-reactor?tab=readme-ov-file#viii-for-windows-users-if-you-still-cannot-build-insightface-for-some-reasons-or-just-dont-want-to-install-visual-studio-or-vs-c-build-tools---do-the-following

the docs have a wicked troubleshooting section

GitHub

GitHub - Gourieff/sd-webui-reactor: Fast and Simple Face Swap Exten...

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro) - Gourieff/sd-webui-reactor

bitter hearth Jul 11, 2024, 5:53 PM

#

I actually think OpenAI did the right thing by providing an LLM that was finetuned to prompt their model
I wish every publisher of a diffusion model did that

torn wharf Jul 11, 2024, 5:54 PM

#

After seeing what omost can do, i'm ceratin that LLMs to prompt the model is the future. But thats for reasons outside of "cogvlm captions mean it understands LLMs better than something human written". I mean, LLMs were trained on human material to begin with

winged grail Jul 11, 2024, 5:55 PM

#

yeah its the insightface

torn wharf Jul 11, 2024, 5:55 PM

#

winged grail yeah its the insightface

if you don't use a precompiled version, the install requires all sorts of visual studio build tools and gets hairy to set up.

bitter hearth Jul 11, 2024, 5:58 PM

#

the original cogvlm is not too smart though
its not like GPT-4o
there are patterns in its captions

#

like repeated mistakes etc

winged grail Jul 11, 2024, 6:00 PM

#

@torn wharf i sent u a dm

bitter hearth Jul 11, 2024, 6:01 PM

#

#

waow

#

#

torn wharf Jul 11, 2024, 6:26 PM

#

bitter hearth

you see that ninja scrolls is getting a remastered theatrical release?

bitter hearth Jul 11, 2024, 6:26 PM

#

I'm afraid I know nothing at all about anime

sage burrow Jul 11, 2024, 6:27 PM

#

I was in best buy today picking up plugs. Stopped by the computer department for fun. I looked at the specs of the ones on display and said out loud "is that ir?!"

The person working there explained that the second any high end computers are released they are bought immediately. There's a limited amount if gpus apparently.

This was in Vancouver, Canada, a rather large city.

Not sure how true that us but Nvidia owner us buying a few more yachts I think lol

bitter hearth Jul 11, 2024, 6:27 PM

#

all I know about anime I learnt from looking at stable diffusion generations lol

#

yeah nvidia is most valued stock right now

sharp stag Jul 11, 2024, 6:28 PM

#

Hey, need someone's help with gen. that has stable installed on a local machine

Long story short im away at work and will be home in like a month. Need someone with "stable" running locally (non XL, can be makeayo). Had been bored and wrote a prompt in my spare time. Wanted to see how well it performs, but it needs to been locally since its long/weighted and i wont get the same results with cloud based gen. (like never). Side note its nsfw(nothing hardcore or lolicon). I would send both positive and negative privately. Thank you

torn wharf Jul 11, 2024, 6:28 PM

#

bitter hearth I'm afraid I know nothing at all about anime

ninja scroll from 1993. it's legendary like akira. back when it was all hand drawn and they still took detail to unseen heights. I dont know anime too much but i respect animation as a whole.

torn wharf Jul 11, 2024, 6:29 PM

#

sage burrow I was in best buy today picking up plugs. Stopped by the computer department for...

am on vancouver island

#

no fires yet thats later in the year

sage burrow Jul 11, 2024, 6:30 PM

#

torn wharf am on vancouver island

Very awesome 😉

torn wharf Jul 11, 2024, 6:32 PM

#

balls

sage burrow Jul 11, 2024, 6:33 PM

#

sage burrow Very awesome 😉

Where'd you buy your system? Prob newegg or Dell or something

torn wharf Jul 11, 2024, 6:35 PM

#

sort of built mine. Bought a second hand alienware area 51 pc and i've replaced most of the core components at this point. went from amd threadripper to an intel alderlake

#

this one was a 3 paragraph long prompt instead of one i wrote. its okay too i guess. still balls.

#

ice cold

sacred jewel Jul 11, 2024, 7:01 PM

#

low stone Sd3 2b. I asked for inside the brain of Cookie Monster.

mortal mesa Jul 11, 2024, 7:02 PM

#

2024-07-11-145403-SD3_sd3_medium.safetensors_00001_.png

sacred jewel Jul 11, 2024, 7:05 PM

#

#

#

torn wharf Jul 11, 2024, 7:14 PM

#

mortal mesa Jul 11, 2024, 7:31 PM

#

hazy kestrel Jul 11, 2024, 7:37 PM

#

low stone Jul 11, 2024, 7:42 PM

#

hazy kestrel Jul 11, 2024, 7:43 PM

#

low stone Jul 11, 2024, 7:49 PM

#

#

bitter hearth Jul 11, 2024, 8:23 PM

#

#

#

top is SD3 and bottom is Kolors

#

not a fair comparison since SD3 a base model

#

but I hope it can get to Kolor's level

low stone Jul 11, 2024, 8:27 PM

#

bitter hearth

bitter hearth Jul 11, 2024, 8:27 PM

#

wow nice

#

looks festive

#

this is sd3

#

sometimes it does great

#

low stone Jul 11, 2024, 8:30 PM

#

Kolors is excellent. Sadly it doesn't know Cookie Monster or Elmo particularly well

low stone Jul 11, 2024, 8:30 PM

#

bitter hearth

Is that Death Star fully operational?

gusty trail Jul 11, 2024, 8:31 PM

#

low stone Kolors is excellent. Sadly it doesn't know Cookie Monster or Elmo particularly w...

Just train a lora

low stone Jul 11, 2024, 8:31 PM

#

gusty trail Just train a lora

Did they bring out training stuff for it yet?

gusty trail Jul 11, 2024, 8:32 PM

#

I managed to train it with my own script. It just replace the encoder to glm

bitter hearth Jul 11, 2024, 8:34 PM

#

low stone Is that Death Star fully operational?

thomas

gusty trail Jul 11, 2024, 8:34 PM

#

https://civitai.com/models/571029/kolors-cotton-doll-lora-trained Here is the lora. Use it with lora loader. But you need this plugin first. ComfyUI-Kolors-MZ

Kolors Cotton Doll Lora Trained - v1.0 | Stable Diffusion LoRA | Ci...

First? trained kolors lora. Trained with my custom training script. Repo: https://github.com/lrzjason/T2ITrainer Prodigy, repeats 10, rank 32, capt...

bitter hearth Jul 11, 2024, 8:39 PM

#

wow thanks this is awesome

#

will try to make some kolors loras

#

what is kolors

#

in dumb cat terms sadcat

edgy kelp Jul 11, 2024, 8:43 PM

#

Also in BALLS terms please

edgy kelp Jul 11, 2024, 8:44 PM

#

low stone Kolors is excellent. Sadly it doesn't know Cookie Monster or Elmo particularly w...

T5 moment

bitter hearth Jul 11, 2024, 8:44 PM

#

edgy kelp Also in BALLS terms please

#

anything becomes cool if you add 4k background on it lmao

edgy kelp Jul 11, 2024, 8:45 PM

#

So many balls to witness

#

I'm going emotional

bitter hearth Jul 11, 2024, 8:47 PM

#

you might not be able to handle this one @edgy kelp

edgy kelp Jul 11, 2024, 8:47 PM

#

🥲

bitter hearth Jul 11, 2024, 8:48 PM

#

sadcat not round enough

edgy kelp Jul 11, 2024, 8:49 PM

#

Oh no, unballed

#

Sad stuff

bitter hearth Jul 11, 2024, 8:51 PM

#

ball factory waow

edgy kelp Jul 11, 2024, 8:51 PM

#

I work there 🫡

#

Notice also that the emoticons are BALLS

bitter hearth Jul 11, 2024, 8:51 PM

#

true, everyone loves using balls pretty much

edgy kelp Jul 11, 2024, 8:52 PM

#

thomas If-you-know-what-I-mean

runic tusk Jul 11, 2024, 9:02 PM

#

#

bitter hearth Jul 11, 2024, 9:19 PM

#

sadcat beans are horrible

mortal kite Jul 11, 2024, 9:20 PM

#

sage burrow 4 I gon't think is actually possible. Even 8 is pushing it (no advanced workflo...

I always run Foooocus with SDXL on 6Gb before I got my 3090

bitter hearth Jul 11, 2024, 9:26 PM

#

mortal kite I always run Foooocus with SDXL on 6Gb before I got my 3090

were you able to hires

torn wharf Jul 11, 2024, 9:39 PM

#

bean corn

vapid radish Jul 11, 2024, 9:51 PM

#

#

pseudo owl Jul 11, 2024, 10:07 PM

#

bitter hearth were you able to hires

should be possible, you just need to enable sliced vae decoding and vae tiling. i have no idea how to do that with fooocus or comfy but in diffusers its pretty easy.

runic tusk Jul 11, 2024, 10:20 PM

#

torn wharf bean corn

How about having some delicious bean popsicles to cool off after that?

craggy crest Jul 11, 2024, 10:25 PM

#

pseudo owl should be possible, you just need to enable sliced vae decoding and vae tiling. ...

with comfy you'd probably have to write a node - i don't think there is one that exists yet

bitter hearth Jul 11, 2024, 10:28 PM

#

#

delicious toothpaste makes you smile after eating the pizza right

runic tusk Jul 11, 2024, 10:34 PM

#

@bitter hearth

low stone Jul 11, 2024, 10:37 PM

#

#

now with cat meatballs

bitter hearth Jul 11, 2024, 10:39 PM

#

low stone Jul 11, 2024, 10:40 PM

#

hazy kestrel Jul 11, 2024, 10:42 PM

#

low stone Jul 11, 2024, 10:49 PM

#

bitter hearth Jul 11, 2024, 10:49 PM

#

#

thomas

sacred jewel Jul 11, 2024, 11:13 PM

#

#

#

#

#

#

#

#

#

#

#

torn wharf Jul 11, 2024, 11:22 PM

#

strange grotto Jul 11, 2024, 11:30 PM

#

https://tensor.art/models/744985347894651153/SD3-universal-beta_a

SD3-universal - beta_α | Stable Diffusion Model - Checkpoint

150 runs, 3 stars, 0 downloads. SD3-univeral beta αThe model fine-tuning training was completed on SD3medium-2B, using a dataset of approximately 50,000 real...

#

anyone tried this?

#

sd3m finetune

#

trained on 50000+image

sacred jewel Jul 11, 2024, 11:32 PM

#

bitter hearth Jul 11, 2024, 11:32 PM

#

We could run it real quick

sacred jewel Jul 11, 2024, 11:33 PM

#

strange grotto https://tensor.art/models/744985347894651153/SD3-universal-beta_a

Where can we download it locally? Otherwise, I am out 😉

bitter hearth Jul 11, 2024, 11:33 PM

#

"An ancient castle draped in ivy, looks even more majestic under the setting sun"

#

Not sure it is worth running

#

Should be free on Tensor

sacred jewel Jul 11, 2024, 11:34 PM

#

sacred jewel Jul 11, 2024, 11:35 PM

#

bitter hearth "An ancient castle draped in ivy, looks even more majestic under the setting sun...

pseudo owl Jul 11, 2024, 11:38 PM

#

bitter hearth Not sure it is worth running

Hmm yeah it’s a lot more different than the examples, I think probably a sampler issue?

bitter hearth Jul 11, 2024, 11:57 PM

#

pseudo owl Hmm yeah it’s a lot more different than the examples, I think probably a sampler...

I am not sure. That is a good question.

sage burrow Jul 12, 2024, 12:03 AM

#

strange grotto https://tensor.art/models/744985347894651153/SD3-universal-beta_a

Can't sell the images on DA, damn

torn wharf Jul 12, 2024, 12:04 AM

#

brace yourself

sage burrow Jul 12, 2024, 12:06 AM

#

torn wharf brace yourself

Someone needs to create these in rl, then sell them on food trucks! 🙂

torn wharf Jul 12, 2024, 12:07 AM

#

sage burrow Someone needs to create these in rl, then sell them on food trucks! 🙂

customers would be like "how are we supposed to eat these" and you'd just shout "BALLS" then closeup and go to a different location.

#

so then eveyrone in line would be like "why would you ask!?"

#

balls

bitter hearth Jul 12, 2024, 12:26 AM

#

I come back and so many balls

torn wharf Jul 12, 2024, 12:45 AM

#

https://suno.com/song/cb2cff79-afd0-4cf2-8d5d-a7a3e05365c8 song about dem balls

Ballin’ All Day by @mulishmodulation688 | Suno

90s hiphop funky song. Listen and make your own with Suno.

▶ Play video

brittle nexus Jul 12, 2024, 1:00 AM

#

#

#

brittle nexus Jul 12, 2024, 1:12 AM

#

torn wharf balls

bitter hearth Jul 12, 2024, 1:23 AM

#

#

low stone Jul 12, 2024, 2:28 AM

#

#

auraflow

#

low stone Jul 12, 2024, 2:49 AM

#

hallow lion Jul 12, 2024, 3:02 AM

#

so many balls

hallow lion Jul 12, 2024, 3:03 AM

#

torn wharf brace yourself

I don't mind pizza being a ball but the pineapples are a disgrace.

sage burrow Jul 12, 2024, 3:07 AM

#

at least he didn't mix pineapple and anchovies 😉

torn wharf Jul 12, 2024, 3:11 AM

#

sounds kinda good

fallen pier Jul 12, 2024, 4:19 AM

#

alpine summit Jul 12, 2024, 4:49 AM

#

#

#

alpine summit Jul 12, 2024, 5:21 AM

#

#

junior peak Jul 12, 2024, 6:08 AM

#

alpine summit Jul 12, 2024, 6:13 AM

#

#

#

#

#

bitter hearth Jul 12, 2024, 6:52 AM

#

#

#

cobalt moon Jul 12, 2024, 7:51 AM

#

hmm, one 25steps 1024x1024 required me 20 minutes on my 2GB VRAM setup

#

it is painful ofc

hallow lion Jul 12, 2024, 8:12 AM

#

You are patient sensai.

bitter hearth Jul 12, 2024, 8:33 AM

#

why would you not use lightning in that situation? LOL

brittle dragon Jul 12, 2024, 8:51 AM

#

A futuristic, abstract representation of a human brain fused with circuit boards and digital neurons, set against a deep space background. The brain should be partially translucent, revealing pulsing energy and data streams within. Incorporate vibrant, electric blue and purple hues to represent cognitive activity. Add subtle, glowing lines connecting various parts of the brain, symbolizing neural networks. The overall shape should resemble the letter "C" for CogniZone. The style should be sleek, high-tech, and slightly ethereal, conveying the concept of advanced artificial intelligence and cognitive computing.

alpine summit Jul 12, 2024, 9:33 AM

#

alpine summit Jul 12, 2024, 10:09 AM

#

low stone Jul 12, 2024, 11:09 AM

#

#

#

dull star Jul 12, 2024, 11:15 AM

#

is this auraflow

low stone Jul 12, 2024, 11:16 AM

#

low stone Jul 12, 2024, 11:16 AM

#

dull star is this auraflow

What? I have no idea what you're talking about. 🙂

dull star Jul 12, 2024, 11:18 AM

#

oh, I understand

#

I made a mistake

#

this is absolutely sd3

muted dove Jul 12, 2024, 11:28 AM

#

low stone What? I have no idea what you're talking about. 🙂

https://huggingface.co/fal/AuraFlow

fal/AuraFlow · Hugging Face

low stone Jul 12, 2024, 11:32 AM

#

muted dove https://huggingface.co/fal/AuraFlow

muted dove Jul 12, 2024, 11:33 AM

#

Not bad for an early beta with regular updates promised.

fleet meteor Jul 12, 2024, 11:34 AM

#

low stone

Lmao nice

low stone Jul 12, 2024, 11:38 AM

#

fleet meteor Lmao nice

I haven't been able to find anything it can't do.

bitter hearth Jul 12, 2024, 11:46 AM

#

guys the heun ones seem good

#

much better than dpm ones

#

ok I did more trials

#

euler heun heunpp2 dpmpp_2m uni_pc uni_pc_bh2 were good

#

with

#

sgm_uniform or simple

#

however ddim_uniform gave more "baked" results

#

which sometimes was fun

#

heun heunpp2 were better for realistic people than euler overall

dull star Jul 12, 2024, 12:07 PM

#

sometimes images look like something out of ideogram

low stone Jul 12, 2024, 12:14 PM

#

dull star sometimes images look like something out of ideogram

I'm pretty sure that's the majority of what it was trained on.

muted dove Jul 12, 2024, 12:15 PM

#

low stone I haven't been able to find anything it can't do.

Try guns 😉

low stone Jul 12, 2024, 12:15 PM

#

Compared to Kolors which was midjourney, so it often has a very over stylized look to it (I still love it too)

muted dove Jul 12, 2024, 12:16 PM

#

I'm surprised nobody has tried "girl laying in the grass" yet 😄

low stone Jul 12, 2024, 12:18 PM

#

muted dove Try guns 😉

muted dove Jul 12, 2024, 12:19 PM

#

low stone

Aura? Maybe it's just revolvers it doens't like...? #✨｜sdxl message

low stone Jul 12, 2024, 12:20 PM

#

muted dove I'm surprised nobody has tried "girl laying in the grass" yet 😄

muted dove Jul 12, 2024, 12:22 PM

#

https://tenor.com/view/chuck-norris-approve-approval-nice-not-bad-gif-8061938

Tenor

low stone Jul 12, 2024, 12:23 PM

#

muted dove Aura? Maybe it's just revolvers it doens't like...? https://discord.com/channels...

Yeah it could use more training on revolvers.

edgy kelp Jul 12, 2024, 12:27 PM

#

AuraFlow's text encoder is a "pile-t5-xl", I assume that being trained on The Pile dataset it can understand and learn NSFW, for anyone interested

low stone Jul 12, 2024, 12:35 PM

#

edgy kelp AuraFlow's text encoder is a "pile-t5-xl", I assume that being trained on The Pi...

There's no question it's not censored.

edgy kelp Jul 12, 2024, 12:36 PM

#

low stone There's no question it's not censored.

🤷‍♂️ I didn't test it enough to tell, but the "issue" would be also what the transformer was trained on and for how long

fleet meteor Jul 12, 2024, 12:36 PM

#

low stone I haven't been able to find anything it can't do.

Any idea how much vram it uses? I wanna try it 🥺

low stone Jul 12, 2024, 12:39 PM

#

fleet meteor Any idea how much vram it uses? I wanna try it 🥺

#

the model itself is 16 gigs. This is what Lykon means when he talks about the 8b sd3 being larger than most people can deal with.

#

that said, it's all one big file right now. I don't know what the possibililties are in terms of breaking that out into image and text encoder components at some point.

edgy kelp Jul 12, 2024, 12:42 PM

#

AuraFlow model has 6,8B Transformer but has ONLY the big text encoder (T5), I think if you use the 8B SD3 with only the clips you'd have a different "scale" of GPU use

low stone Jul 12, 2024, 12:42 PM

#

edgy kelp AuraFlow model has 6,8B Transformer but has ONLY the big text encoder (T5), I th...

The idea of using the clip encoders instead of the t5 seems like a horrible waste.

cobalt moon Jul 12, 2024, 12:43 PM

#

why not use both

#

thomas

low stone Jul 12, 2024, 12:43 PM

#

Because clip is awful.

#

When you have a t5 llm there, there's no reason to use anything else.

#

The clip is only when you're trying to shoehorn it into small cards.

edgy kelp Jul 12, 2024, 12:44 PM

#

I think if you don't use the clips you won't be able to use 99% of the Loras though

#

Not sure though

low stone Jul 12, 2024, 12:44 PM

#

If the Lora's are trained on t5 then it's not a problem

edgy kelp Jul 12, 2024, 12:44 PM

#

That's what I meant

#

I think most people won't train loras on the T5

#

But I have no idea haha

#

We'll see

low stone Jul 12, 2024, 12:48 PM

#

edgy kelp I think most people won't train loras on the T5

Why not? We're already doing it with pixart.

#

There's always going to be a market for small card technology, but there's little progress if we keep holding onto outdated stuff.

alpine summit Jul 12, 2024, 12:50 PM

#

cunning lintel Jul 12, 2024, 1:09 PM

#

low stone Because clip is awful.

Still not convinced of that, models with clip seem to know more, both styles/artists and obscure characters. Of course might just be the training of new models on synthetic stuff combined with current vlms being pretty bad at describing style and artists being stripped. Still feels like a regression sadly.

strange grotto Jul 12, 2024, 1:09 PM

#

#

aura

fleet meteor Jul 12, 2024, 1:11 PM

#

strange grotto

Finally!

low stone Jul 12, 2024, 1:13 PM

#

cunning lintel Still not convinced of that, models with clip seem to know more, both styles/art...

I'm certainly curious to find out. I get the impression that most of the latest models aren't particularly trained on artist styles because of copyright issues. I think cascade was the last high quality model that still has them all.

cunning lintel Jul 12, 2024, 1:15 PM

#

Yeah, i hope things like IP-adapter will allow for style transfer in the future

sterile pendant Jul 12, 2024, 1:36 PM

#

cunning lintel Still not convinced of that, models with clip seem to know more, both styles/art...

and you're not going to get them again from a major company's public release, due to the wild wild west hay-days being already being over with. lawsuits and threats of lawsuits galore, shut that shit down fast. from here on out, newer models are only going to contain what they are legally allowed to contain and won't be able to include things like named people and artists, without their consent. so basically, it will mostly just be a bunch of copyrightless datasets and any artists that are okay with their stuff being used will have to opt IN and not opt OUT now. so if you want stuff like that again, people will have to risk potentially being sued to train loras/models with stuff they want, until governments step in and regulate that part as well(won't be long, two years tops for pretty much all modern countries).

dull star Jul 12, 2024, 1:39 PM

#

low stone

oh hell yeah I can run it offline too?

#

hope it improves

verbal epoch Jul 12, 2024, 1:44 PM

#

#

AuraFlow

mortal kite Jul 12, 2024, 2:03 PM

#

low stone the model itself is 16 gigs. This is what Lykon means when he talks about the 8b...

They could have given a 4b though

craggy crest Jul 12, 2024, 2:11 PM

#

mortal kite They could have given a 4b though

no they couldn't

mortal kite Jul 12, 2024, 2:12 PM

#

craggy crest no they couldn't

fine, whatever

#

tired of every single thing on the internet being an argument

craggy crest Jul 12, 2024, 2:12 PM

#

cunning lintel Still not convinced of that, models with clip seem to know more, both styles/art...

just learn how to create loras (et. al.) and train a fine tune of what you want to use yourself.

craggy crest Jul 12, 2024, 2:13 PM

#

mortal kite tired of every single thing on the internet being an argument

then don't repeat somethign that's been beaten to death, on the forum it's been beaten to death on.

mortal kite Jul 12, 2024, 2:13 PM

#

ugh bye

low stone Jul 12, 2024, 2:21 PM

#

mortal kite They could have given a 4b though

I say this sincerely, but I think this is where sd3 medium will shine. They apparently considered a 4b but it was decided it wouldn't benefit enough people, so 2b and 8b are the ones they're focusing on. I think once sd3 medium is "fixed" as per their press release, it'll be really great.

alpine summit Jul 12, 2024, 2:22 PM

#

cunning lintel Jul 12, 2024, 2:37 PM

#

low stone I say this sincerely, but I think this is where sd3 medium will shine. They appa...

When SD3 2b works, it still gives the crispest cleanest images by far. So if it starts to gen images in a much wider and dynamic range, it can be be amazing. If 2b had been any good, all those new models would hardly gain traction.

low stone Jul 12, 2024, 2:39 PM

#

cunning lintel When SD3 2b works, it still gives the crispest cleanest images by far. So if it ...

agreed. claude expanded prompts (even better than gpt4o) has really made the current sd3 2b shine.

#

cunning lintel Jul 12, 2024, 2:40 PM

#

Noticed the same, claude is way more creative

low stone Jul 12, 2024, 2:41 PM

#

gpt4o just kind of expands slightly on what you type. whereas claude adds all sorts of elements including text banners and signs that really enhance things.

#

which sd3/aura are both really good at.

craggy crest Jul 12, 2024, 2:41 PM

#

cunning lintel When SD3 2b works, it still gives the crispest cleanest images by far. So if it ...

it's the only version of Stable i've used since it released. and i've generated a LOT of images. it works every time as long as you learn how to use it

cunning lintel Jul 12, 2024, 2:49 PM

#

craggy crest it's the only version of Stable i've used since it released. and i've generated ...

Sure sure, that's why SAI says it's a beta model and they're fixing it, cause it works every time now 🤡

craggy crest Jul 12, 2024, 2:51 PM

#

cunning lintel Sure sure, that's why SAI says it's a beta model and they're fixing it, cause it...

shrug. think what you like. no generative AI model out there turns out perfect results every time, but if you learn how to use it, you can get the results you want, (the first time you generate, not the 100th time) EVERY single time. 2b is no different 🧌

mortal mesa Jul 12, 2024, 2:52 PM

#

dalle ect, no that i like it

low stone Jul 12, 2024, 2:52 PM

#

craggy crest shrug. think what you like. no generative AI model out there turns out perfect r...

Pulls off crystalwizard's mask in scooby doo episode "It was old man Lykon all along!"

#

🙂

mortal mesa Jul 12, 2024, 2:53 PM

#

its Ella

craggy crest Jul 12, 2024, 2:54 PM

#

low stone Pulls off crystalwizard's mask in scooby doo episode "It was old man Lykon all a...

heh

mortal mesa Jul 12, 2024, 2:54 PM

#

ok let me navigate around the glaring issues of SD3 and make some stuff

craggy crest Jul 12, 2024, 2:55 PM

#

mortal mesa ok let me navigate around the glaring issues of SD3 and make some stuff

points you at this channel and suggest you scroll through and look at all the really good stuff made by 2b first

mortal mesa Jul 12, 2024, 2:56 PM

#

i can just look at stuff on my computer, no scrolling

#

2024-07-11-175533-SD3_sd3_medium.safetensors_00001_.png

#

orange you glad im here

alpine summit Jul 12, 2024, 3:04 PM

#

low stone Jul 12, 2024, 3:05 PM

#

mortal mesa

i heard you liked oranges and put oranges in your oranges

mortal mesa Jul 12, 2024, 3:05 PM

#

low stone i heard you liked oranges and put oranges in your oranges

i remove this one orange

2024-07-11-155100-SD3_sd3_medium.safetensors_00001_.png

low stone Jul 12, 2024, 3:21 PM

#

mortal mesa i remove this one orange

off with his rind!

mortal mesa Jul 12, 2024, 3:54 PM

#

2024-07-12-114141-SD3_sd3_medium.safetensors_00001_.png

low stone Jul 12, 2024, 3:58 PM

#

lavish osprey Jul 12, 2024, 4:10 PM

#

Pixart 800m uses kv compression. It's essentially like a much larger architecture, but compression has drawbacks

mortal mesa Jul 12, 2024, 4:13 PM

#

2024-07-12-120641-SD3_sd3_medium.safetensors_00001_.png

mortal mesa Jul 12, 2024, 4:40 PM

#

2024-07-12-123114-SD3_sd3_medium.safetensors_00001_.png

frail shoal Jul 12, 2024, 4:42 PM

#

mortal mesa

prompt ?

bitter hearth Jul 12, 2024, 4:47 PM

#

Blue girl, city background
Surely

#

mortal mesa Jul 12, 2024, 4:51 PM

#

frail shoal prompt ?

A dreamlike, ethereal portrait of Stability AI, its digital form dissolving into swirling clouds of iridescent gas. Glowing blue lines pulse through its translucent body, as if infused with an otherworldly energy. Its face, a blend of human and machine features, appears serene yet intense, with eyes that seem to hold the secrets of the universe within their depths. The surrounding environment is distorted, with buildings and landscapes warped into impossible shapes, reflecting the AI's ability to manipulate reality itself.

torn wharf Jul 12, 2024, 4:53 PM

#

mortal mesa

dudde did you teleport the bread?

mortal mesa Jul 12, 2024, 4:54 PM

#

lol pretty much

bitter hearth Jul 12, 2024, 4:58 PM

#

Rawr

torn wharf Jul 12, 2024, 5:01 PM

#

monstrous covid

frail shoal Jul 12, 2024, 5:25 PM

#

mortal mesa

frail shoal Jul 12, 2024, 5:31 PM

#

mortal mesa ```A dreamlike, ethereal portrait of Stability AI, its digital form dissolving i...

#

mortal mesa Jul 12, 2024, 5:39 PM

#

are those SD3? your getting cool mirrored patterns, i like it

#

Loads Sacred Geometry LoRA

frail shoal Jul 12, 2024, 5:42 PM

#

mortal mesa are those SD3? your getting cool mirrored patterns, i like it

no this datavoid finetune of pixart sigma

#

#

low stone Jul 12, 2024, 6:06 PM

#

mortal mesa ```A dreamlike, ethereal portrait of Stability AI, its digital form dissolving i...

#

could get it to do all the text quite right.

bitter wadi Jul 12, 2024, 6:15 PM

#

Wen SD3 ? cheems

mortal mesa Jul 12, 2024, 6:19 PM

#

2024-07-12-141103-SD3_sd3_medium.safetensors_00001_.png

hazy kestrel Jul 12, 2024, 6:22 PM

#

hallow lion Jul 12, 2024, 6:26 PM

#

Oh noes... Emad. you OK?

low stone Jul 12, 2024, 6:30 PM

#

torn wharf Jul 12, 2024, 6:33 PM

#

low stone

looks like a madball face

mortal mesa Jul 12, 2024, 6:33 PM

#

Prompt challenge, wordsmith this into something usable: text of "always coming from take me down" reflecting the text of "never going to give you up"

dull star Jul 12, 2024, 6:35 PM

#

that sounds impossible

mortal mesa Jul 12, 2024, 6:36 PM

#

yes 100's of gens later

#

i mean maybe not but ya haha

dull star Jul 12, 2024, 6:38 PM

#

https://www.reddit.com/r/StableDiffusion/comments/1e1ktdh/auraflow_sure_does_like_making_the_ideogram/ @low stone

From the StableDiffusion community on Reddit: AuraFlow sure does li...

Explore this post and more from the StableDiffusion community

#

wanting an offline ideogram comes at a price

torn wharf Jul 12, 2024, 6:42 PM

#

mortal mesa Prompt challenge, wordsmith this into something usable: text of "always coming f...

the scene is at night. A still lake fills the image and the shore is in the distance. 3d text exists physically over the water, reading "always coming from take me down" and the reflection of the text in the water reads "never going to give you up" closest i could get with this prompt

torn wharf Jul 12, 2024, 6:43 PM

#

dull star https://www.reddit.com/r/StableDiffusion/comments/1e1ktdh/auraflow_sure_does_lik...

lol they probably mined ideogram for images and didn't bother to filter their dataset at all

hallow lion Jul 12, 2024, 6:44 PM

#

Maybe not safe.

dull star Jul 12, 2024, 6:45 PM

#

torn wharf lol they probably mined ideogram for images and didn't bother to filter their da...

yeah its funny

#

idk how they didn't think of this oversight

#

but hey, its a free model that's in like 0.1 state

#

an actual free model with apache 2 license

#

I didn't test too much but 2 character facial expressions work better

like I ask for the person on the left to be scared or crying and the one on the right to be shouting and angry, it gets it right

#

torn wharf Jul 12, 2024, 6:47 PM

#

likely they used the prompts to make ideagram images as the captions, so in training those unrelated captions will learn that cat, when that cat has nothing to do with whats being captioned.

dull star Jul 12, 2024, 6:47 PM

#

this is sooooo ideogram its painful

#

but I still like how it looks though

hallow lion Jul 12, 2024, 6:48 PM

#

ideogram suez SAI coz why not XD

torn wharf Jul 12, 2024, 6:48 PM

#

SAI didn't make it

dull star Jul 12, 2024, 6:48 PM

#

SAI making an apache 2 model?

#

https://tenor.com/view/thanos-impossible-marvel-shocked-gif-15104180

Tenor

#

even older models are at least openrail++

torn wharf Jul 12, 2024, 6:49 PM

#

it was made by the guy who brought loras to image models

dull star Jul 12, 2024, 6:49 PM

#

yeah its wild

torn wharf Jul 12, 2024, 6:49 PM

#

as far as i could tell, ideagram's terms don't limit using images as base model training

hallow lion Jul 12, 2024, 6:51 PM

#

we went out took photos of real world stuff and trained our model on that.

mortal mesa Jul 12, 2024, 6:56 PM

#

the guy that implemented lora wanted to try to train a mmdit from scratch, it worked they released - the story

dull star Jul 12, 2024, 6:56 PM

#

its crazy

torn wharf Jul 12, 2024, 6:57 PM

#

its' not just stability's mmdit architecture though. they modified it to be more efficient

mortal mesa Jul 12, 2024, 7:11 PM

#

lucid swift Jul 12, 2024, 7:13 PM

#

dull star wanting an offline ideogram comes at a price

they will remove that from the next training batch and its gone lol

dull star Jul 12, 2024, 7:22 PM

#

hopefully

torn wharf Jul 12, 2024, 7:33 PM

#

yeah it's v.01 and that's an obvious training data fix

cunning lintel Jul 12, 2024, 7:33 PM

#

I saw the cat thing, tooo sad

torn wharf Jul 12, 2024, 7:33 PM

#

oh i mean v0.1

cunning lintel Jul 12, 2024, 7:33 PM

#

But pretty hard to get and even if you manage it's only sometimes :p

torn wharf Jul 12, 2024, 7:44 PM

#

i am looking into running it, but it's nearly 7b parameters and i don't think comfy is optimized to use it in 16gb of vram

cunning lintel Jul 12, 2024, 7:46 PM

#

torn wharf its' not just stability's mmdit architecture though. they modified it to be more...

And they use the old sdxl vae, which sadly shows :/

torn wharf Jul 12, 2024, 7:48 PM

#

it's a rushed out v0.1 release yeah. They can change that. I'm sure stability had many versions of sd3 and other models that would've counted as 0.1 versions, but never released them. This guy released his and it's the ugly side of the development process that many people aren't used to

mortal mesa Jul 12, 2024, 7:49 PM

#

the we made something that works stage

cunning lintel Jul 12, 2024, 7:50 PM

#

fall even released a 16 channel vae, it's surely coming

mortal mesa Jul 12, 2024, 7:50 PM

#

i made some bad cats on a 2080 TI or i think it all fallback to CPU

torn wharf Jul 12, 2024, 7:51 PM

#

mortal mesa the we made something that works stage

minimal viable product . on the blog it says the intention is to kickstart community engagement

mortal mesa Jul 12, 2024, 7:52 PM

#

torn wharf minimal viable product . on the blog it says the intention is to kickstart comm...

im cool with it, i dont care if its not like starbucks, i dont even like starbucks

cunning lintel Jul 12, 2024, 7:53 PM

#

Funny when SAI seemed to be close to falling apart and rushed 2b, it seemed open generative AI for images was going to be a black hole, now new models are planned or released in various places 🎉 (and SAI seems back in business)

torn wharf Jul 12, 2024, 7:54 PM

#

starbucks isn't even that great at coffee anymore. least the ones in my podunk hillbilly region.

torn wharf Jul 12, 2024, 7:55 PM

#

cunning lintel Funny when SAI seemed to be close to falling apart and rushed 2b, it seemed open...

i dont think they were ever falling apart. that was just yellow journalism sensationalism from people farming engagement and loving that scheudencfreud. stability's got legs for days. they wouldn't have attracted sean parker if they were falling apart

#

probably a little bit of social engineering from competitors

cunning lintel Jul 12, 2024, 7:56 PM

#

i feel if all was fine 2b would never been released as it was, but will never know

torn wharf Jul 12, 2024, 7:56 PM

#

there's a wide spectrum of diverse situations between "falling apart" and "all is fine"

mortal mesa Jul 12, 2024, 7:57 PM

#

they started pumping expectations too early

torn wharf Jul 12, 2024, 7:57 PM

#

spectrums?! too woke tooo woke

hallow lion Jul 12, 2024, 7:58 PM

#

They were trembling...

torn wharf Jul 12, 2024, 7:58 PM

#

seems like an over dramatic metaphor. they were hung over after emad partied too hard

hallow lion Jul 12, 2024, 7:58 PM

#

it was a solid scale 7 on the richter scale

#

lol

#

emad is SAI SAI is Emad, he is the mascot

torn wharf Jul 12, 2024, 7:59 PM

#

hangovers take a couple days to get over

hallow lion Jul 12, 2024, 8:00 PM

#

i hope they will recover whatever the future their contribution to open ai was awesome

#

hypemad

#

Emad was the vision man tho

#

without vision its hard

hallow lion Jul 12, 2024, 8:01 PM

#

torn wharf hangovers take a couple days to get over

It's a serious Friday if you need a couple of days to recover XD

#

You weekended the wrong way

cunning lintel Jul 12, 2024, 8:02 PM

#

hallow lion without vision its hard

well, his vision is now creating magic ai money in the blockchain to fund true open ai, call me skeptical

hallow lion Jul 12, 2024, 8:02 PM

#

its like Disney without Lucas.

#

Maybe Sai will end up like that o without Emad

#

just hopeless atmepts at cahs grabbign bot no real vision or integrity

#

i hope not tho but ye

mortal mesa Jul 12, 2024, 8:03 PM

#

they will be joining the WEF soon

hallow lion Jul 12, 2024, 8:03 PM

#

Someone email Musk

#

Musk is the eternal whimsical clueless billionaire who can save us all coz pockets deep enough

#

Help me Papa Elon, you're my only hope.

mortal mesa Jul 12, 2024, 8:06 PM

#

is trust and safety trustworthy and safe if they hide things like that

#

.1 shift 6 CFG, were glitching

hallow lion Jul 12, 2024, 8:08 PM

#

glitch art - never fails.

lavish osprey Jul 12, 2024, 9:40 PM

#

dull star wanting an offline ideogram comes at a price

No idea what's going on with model creators training on Ideogram outputs lately (isn't pixart also trained on Ideogram data?)

Not only it borderline violates ToS (sure, you can claim you downloaded them from HF, but come on), but no way in hell an Ideogram image is better than a real photo or art, and its prompt following and text accuracy is never going to better than what a VLM can caption from a real image (that can also caption very small text).

And forget doing that with a 16ch vae. Sure, pixart and auraflow use SDXL vae, but even that is good enough to not be a bottleneck compared to synth data training that hasn't been at least refined.

mortal mesa Jul 12, 2024, 9:42 PM

#

FOMO

lavish osprey Jul 12, 2024, 9:42 PM

#

that being said, this is the #sd3 channel, please keep it on topic if you can 😄

craggy crest Jul 12, 2024, 9:46 PM

#

lavish osprey No idea what's going on with model creators training on Ideogram outputs lately ...

fad?

lavish osprey Jul 12, 2024, 9:53 PM

#

it's probably just very cheap, since you only have to scrape, no need to (re)caption (which is very expensive on large data)

#

MJ/Ideogram images come from prompts, so you can get free image+caption (again, violating the ToS)

#

there are a bunch of datasets like that on HF

#

making models is very expensive. I wonder why some companies ask you to pay for commercial use over a revenue threshold (or not, like Kolors)

craggy crest Jul 12, 2024, 9:58 PM

#

maybe. they seem to have hidden their TOS page

brittle nexus Jul 12, 2024, 10:01 PM

#

hot dawn Jul 12, 2024, 10:04 PM

#

against my expectations, the non-attention parameters of SD3 seem to be more important for training than the attention blocks, despite that I froze them earlier in training it still seems most of the learning is there. It's possible that whatever censorship they did was semi hardcoded in the conditional pathway to remap the embeddings away from certain areas, though I doubt it since characters and styles are also primarily learned in the non-attention blocks.
if there any vram savings to be made for finetuning SD3, it might be best to stick to training the non-attention parameters, which are probably much smaller too

#

potentially training with those non-attention components frozen from the start would give different results, perhaps the easier training went there but it may not be ideal

#

"roman soldiers in formation, covered in mud and blood" did get much worse with the non-attentions only, possibly because there were only a few sketchy examples of roman soldiers in the dataset, and most of the learning has gone into those components, good and bad

lavish osprey Jul 12, 2024, 10:17 PM

#

hot dawn against my expectations, the non-attention parameters of SD3 seem to be more imp...

It's possible that whatever censorship they did was semi hardcoded in the conditional pathway to remap the embeddings away from certain areas
That's just a silly rumor.
AuraFlow has the same issue, right? It's in the "llm nature" of the architecture. At 8b params it scales very well, 2b is an attempt of having the same tech run on local hardware.
(Also not sure why Aura fails it too, since it's almost 7b params)

hot dawn Jul 12, 2024, 10:18 PM

#

lavish osprey > It's possible that whatever censorship they did was semi hardcoded in the cond...

I was just assuming there was some since there was a screenshot of somebody from SAI saying that "something" was done to the weights before release by the trust and safety team

#

it seemed a bit iffy since I did see your post about how the issues with the grass pose were there earlier

lavish osprey Jul 12, 2024, 10:18 PM

#

yeah but that just covered up nudity, did nothing to anatomy

#

if anything, some biased filter on the dataset might have amplified the issue

#

but nothing done in SFT under my watch had any effect on that

hot dawn Jul 12, 2024, 10:19 PM

#

there's not much nudity in my dataset, but the ability to generate nude people seems to have been learned in the non-attention blocks as well

lavish osprey Jul 12, 2024, 10:20 PM

#

also 8b was trained on the same "filtered" dataset and has no issues with this use case

#

can make it flawlessly

#

you can try it yourself on API

hot dawn Jul 12, 2024, 10:21 PM

#

I suspect 2B can learn that pose if focused in the dataset. I've always found that lying poses are the hardest to finetune and didn't assume it was a conspiracy by SAI that they didn't work well in the base model

lavish osprey Jul 12, 2024, 10:21 PM

#

(even if I still think that WHEN sd3m manages to get this prompt right, it's the best)

runic tusk Jul 12, 2024, 10:21 PM

#

She's got a little something in her "pocket".

lavish osprey Jul 12, 2024, 10:21 PM

#

the hands are wrong, but look at the details. Native gen, no upscaling

#

SFT on sd3m was done very nicely

#

too bad the base wasn't perfect

hot dawn Jul 12, 2024, 10:22 PM

#

I didn't caption text in my dataset so I'm surprised this wasn't broken 😅

brittle nexus Jul 12, 2024, 10:23 PM

#

Part of the problem was including feet as sex organs

lavish osprey Jul 12, 2024, 10:23 PM

#

but this at least ensures that simple pictures come out amazing and that the model is a very (very) good refiner

hot dawn Jul 12, 2024, 10:23 PM

#

there's no women laying on the ground in the grass in my dataset, the model just worked it out from general pose examples

lavish osprey Jul 12, 2024, 10:24 PM

#

hot dawn "roman soldiers in formation, covered in mud and blood" did get much worse with ...

also please label "sd3m" when referring to medium 😄

hot dawn Jul 12, 2024, 10:24 PM

#

lavish osprey also please label "sd3m" when referring to medium 😄

fair enough

lavish osprey Jul 12, 2024, 10:24 PM

#

it's gonna be a mess when we release Large 😄

mortal mesa Jul 12, 2024, 10:27 PM

#

give it a name and refer to it like that, Gigantor

hot dawn Jul 12, 2024, 10:27 PM

#

using base SD3M as a refiner

hot dawn Jul 12, 2024, 10:28 PM

#

mortal mesa give it a name and refer to it like that, Gigantor

SD3XL 😅

lavish osprey Jul 12, 2024, 10:28 PM

#

hot dawn using base SD3M as a refiner

lol it actually fixed the text. Maybe a bit too much denoise anyway

bitter hearth Jul 12, 2024, 10:28 PM

#

No balls in sight sadcat

lavish osprey Jul 12, 2024, 10:28 PM

#

I usually use 0.15 denoise with sd3m

#

but to fix text you need to go harder

#

0.15 denoise on sd3m is equal to about 0.4 on sdxl

hot dawn Jul 12, 2024, 10:29 PM

#

that was switching back to the base model from step 10 of 28

brittle nexus Jul 12, 2024, 10:29 PM

#

hot dawn using base SD3M as a refiner

The censored feet

lavish osprey Jul 12, 2024, 10:29 PM

#

hot dawn that was switching back to the base model from step 10 of 28

with steps I use 28 total and start from 20

hot dawn Jul 12, 2024, 10:30 PM

#

using a negative prompt for the first stage

lusty oyster Jul 12, 2024, 10:30 PM

#

Just asking for a piece of advice should I use SDXL or SD3

#

For better images

bitter hearth Jul 12, 2024, 10:30 PM

#

Yes

hot dawn Jul 12, 2024, 10:30 PM

#

depends what you're creating, SD3 is amazing at a lot of stuff, but bad at people from the waist down

lavish osprey Jul 12, 2024, 10:30 PM

#

brittle nexus The censored feet

it's just bad at doing them, like with hands (feet are a little bit easier but also rare in datasets)

lavish osprey Jul 12, 2024, 10:31 PM

#

lusty oyster Just asking for a piece of advice should I use SDXL or SD3

use SD3 Large for most things that are not anime

#

use a SDXL finetune for the rest

#

SDXL turbo finetunes are also crazy fast

lusty oyster Jul 12, 2024, 10:32 PM

#

lavish osprey use SD3 Large for most things that are not anime

Where do you get SD3 large?

lavish osprey Jul 12, 2024, 10:32 PM

#

4 steps of dreamshaper xl turbo-lightning for gen + 3 for upscaling + 0.15 denoising (or 8/28 steps) of sd3m for refining

brittle nexus Jul 12, 2024, 10:32 PM

#

lavish osprey it's just bad at doing them, like with hands (feet are a little bit easier but a...

API literary censor feet with blurred images. There's no feet images because they were ripped off in 2b

bitter hearth Jul 12, 2024, 10:32 PM

#

Lykon do you know if prompting for "8" or "eight" is better or something

lavish osprey Jul 12, 2024, 10:32 PM

#

lusty oyster Where do you get SD3 large?

only API for now. Still working on it

bitter hearth Jul 12, 2024, 10:33 PM

#

hot dawn Jul 12, 2024, 10:33 PM

#

Lykon adding to my wishlist of stuff it would be great to know from SAI, it's not clear if the first timestep (1000) should be finetuned or not. I think in SD1.x it wasn't which led to greyness problems, but after that I never really kept up

lavish osprey Jul 12, 2024, 10:33 PM

#

bitter hearth Lykon do you know if prompting for "8" or "eight" is better or something

depends what you're trying to do.
Counting past 4 is always gonna be hard and random for models

#

models can't count sequentially

#

they count "at a glance"

#

and even humans have issues past 4

bitter hearth Jul 12, 2024, 10:34 PM

#

waow

lavish osprey Jul 12, 2024, 10:34 PM

#

by the way, this is also one of the reasons why hands are a hard problem

#

and why a huge ass 8b model is better at them compared by small-brain 2b cousin

hot dawn Jul 12, 2024, 10:34 PM

#

lavish osprey by the way, this is also one of the reasons why hands are a hard problem

I was really hoping it was the 4 channel VAE but it seems not 😦

lavish osprey Jul 12, 2024, 10:35 PM

#

nah

#

hands are just very hard in general. Too many visual permutations

hot dawn Jul 12, 2024, 10:36 PM

#

with attention / non-attentions. Think I've found the cause of my sometimes all-white images. Potentially merging layer by layer could find the problem

lavish osprey Jul 12, 2024, 10:36 PM

#

these are all "valid" hands

#

but they all look very different

hot dawn Jul 12, 2024, 10:36 PM

#

yeah I've been paying more attention to hands im photos and have realized how utterly insane hands are

#

so often a finger just can't be seen due to being bent the right way

lavish osprey Jul 12, 2024, 10:37 PM

#

most models that are decent at hands basically overfit on small data and fewer hand positions

#

or take the shortcut of using only anime style or only realistic style

#

(or are huge ass like 8b)

hot dawn Jul 12, 2024, 10:37 PM

#

well SD1.x was fighting against the VAE not even being able to encode and decode hands below a certain size threshold without introducing new lines etc, so I was optimistic a more powerful VAE would lead to a huge improvement