#🆕｜sd3 | Stable Diffusion | Page 3

cobalt moon May 22, 2024, 2:03 PM

#

of course you won't say we gonna train 16-chanel VAE and YOLO

#

at least it is an attempt

dull star May 22, 2024, 2:03 PM

#

hell yeah you will

#

you will wait 30x longer

#

and so what

#

/s

cunning lintel May 22, 2024, 2:04 PM

#

Bit unfair to compare this effort to SAI, they used a tried architecture from SAI, then threw some data at it. It's fun, would be nice to see it grow, but it is not even in the same league of work needed

cobalt moon May 22, 2024, 2:04 PM

#

dull star hell yeah you will

Simo is pretty much an exception ( like why he want to train the model at the first place ) sooooo... thomas

dull star May 22, 2024, 2:05 PM

#

cunning lintel Bit unfair to compare this effort to SAI, they used a tried architecture from SA...

hell yeah I will compare smaller models with less training data (0.6B vs 8B), its absolutely fair

cunning lintel May 22, 2024, 2:06 PM

#

One party ~~writes~~ researches a recipe, the other party uses the recipe, yet it's all the same work 👀

low stone May 22, 2024, 2:08 PM

#

I know it's been sdxl refined, but I keep being impressed at the composition that hunyuan puts out. The blue ball vs red hat stuff is not the best, but 1 to 1.5 subjects on the screen has way better composition than sdxl and different than ella and pixart so it compliments it well for prompt running across multiple models.

dull star May 22, 2024, 2:13 PM

#

that's very nice

#

I hope we'll get safetensor weights so I can try it as well

#

there's a comfyui plugin so you can run it there with no complex setup

#

its the same plugin that lets you use pixart-sigma and other stuff

cunning lintel May 22, 2024, 2:15 PM

#

How many custom nodes is you comfyui? Checked them all? That's a much larger attack vector than this set of pickled weights that's been out for some time and no red flags have been raised

#

Just use it if you want to try 🙂

dull star May 22, 2024, 2:16 PM

#

oh yeah right

#

they can just update anything anytime and I won't really know

bitter hearth May 22, 2024, 3:27 PM

#

whats that guy responding to ?

#

I can't see cause twitter requires you to have an account to even see anything, just garbage

#

agony

hallow lion May 22, 2024, 3:29 PM

#

sd def feels more accomplished than sdxl

bitter hearth May 22, 2024, 3:32 PM

#

#

have a shrek

lavish sparrow May 22, 2024, 3:33 PM

#

not sd3

bitter hearth May 22, 2024, 3:34 PM

#

lavish sparrow not sd3

banned thomas

dusky thistle May 22, 2024, 3:34 PM

#

https://www.reddit.com/r/StableDiffusion/comments/1ctirfz/psa_stabilityai_released_official_sdxl_update/

wonder if this is really what was said

From the StableDiffusion community on Reddit: [PSA] Stability.AI re...

Explore this post and more from the StableDiffusion community

#

does this mean it's just going to take a while and they are going to release them? or the plan may have shifted away from an open release

bitter hearth May 22, 2024, 3:35 PM

#

bruh

#

How many times have they said they will release it open source and all

#

agony

#

sadcat dies

raven fern May 22, 2024, 3:50 PM

#

sadcat happemad

dull star May 22, 2024, 3:54 PM

#

bitter hearth I can't see cause twitter requires you to have an account to even see anything, ...

its just the guy himself replying to his own tweet

little quarry May 22, 2024, 4:49 PM

#

happemad

#

Two more weeks??

dull star May 22, 2024, 4:58 PM

#

only two more weeks after two weeks

#

🙏

woven dock May 22, 2024, 5:13 PM

#

hypemad

woven dock May 22, 2024, 5:13 PM

#

cobalt moon of course you won't say we gonna train 16-chanel VAE and YOLO

someone should build a 32 channel vae

muted dove May 22, 2024, 5:18 PM

#

https://tenor.com/view/skull-fall-skeleton-disappointed-gif-15613679342612459378

Tenor

#

Just another 2 weeks

woven dock May 22, 2024, 5:22 PM

#

another day, another 2 weeks

dull star May 22, 2024, 5:26 PM

#

woven dock someone should build a 32 channel vae

at that point its overkill tbh

#

16 channel imo is a sweet spot for finer things like text

woven dock May 22, 2024, 5:27 PM

#

Midjourney's vae is around that size

dull star May 22, 2024, 5:27 PM

#

damn

hallow lion May 22, 2024, 5:30 PM

#

hang in there Emad just fix the hands first then drop the Weights.

raven fern May 22, 2024, 5:42 PM

#

muted dove https://tenor.com/view/skull-fall-skeleton-disappointed-gif-15613679342612459378

🤣

dull star May 22, 2024, 5:46 PM

#

emad will fix the release date

#

thomas

hallow lion May 22, 2024, 5:48 PM

#

Emad is the new Gaben.

#

But he CAN count to three.

teal fossil May 22, 2024, 6:02 PM

#

hallow lion Emad is the new Gaben.

SD3 confirmed?

hallow lion May 22, 2024, 6:03 PM

#

You can buy bootleg Russian SD3 weight copies.

#

Axel Rose leaked them on the net too.

#

Someone make "In 2 weeks" t shirts.

raven fern May 22, 2024, 6:34 PM

#

that moment when half life 3 releases before sd3

hallow lion May 22, 2024, 7:00 PM

#

Another day, another two weeks.

icy drift May 22, 2024, 7:43 PM

#

bitter hearth I can't see cause twitter requires you to have an account to even see anything, ...

Get an account. All the important business news stuff happens there. It's more reliable than any newspaper at least.

teal fossil May 22, 2024, 8:00 PM

#

icy drift Get an account. All the important business news stuff happens there. It's more r...

Which absolutely sucks. That site is a cesspit.

low stone May 22, 2024, 9:01 PM

#

teal fossil Which absolutely sucks. That site is a cesspit.

no it's not. you're just looking at the wrong stuff. i have my account rather well setup with ai related stuff and announcements and it's great.

#

took a little while to tell it what I didn't want and it stopped showing me that stuff

icy drift May 22, 2024, 9:02 PM

#

teal fossil Which absolutely sucks. That site is a cesspit.

Then you're following the wrong people.

low stone May 22, 2024, 9:02 PM

#

the answer is, everything is on there, good, bad, everything. you just have to curate your feed.

dull star May 22, 2024, 9:02 PM

#

its okay to have an account to keep up with news and crap

teal fossil May 22, 2024, 9:02 PM

#

low stone took a little while to tell it what I didn't want and it stopped showing me that...

And that shouldn't be necassary tbh. Musk ran it into the ground - and it was not great even before he took over.

low stone May 22, 2024, 9:02 PM

#

eh? literally the whole world is on there but they should know what you want?

#

once I followed all the right academic and ai people, and said no to the political stuff, I don't get anything stupid in my "for you" feed anymore.

icy drift May 22, 2024, 9:04 PM

#

teal fossil And that shouldn't be necassary tbh. Musk ran it into the ground - and it was no...

Stop whining. It was privately owned before, and it still is. Musk doesn't control what anyone posts there, and it's where news happens.
I don't really care though. Not like anyone's forcing you.

dull star May 22, 2024, 9:08 PM

#

waow

#

can we just get SD3 2B

teal fossil May 22, 2024, 9:10 PM

#

icy drift Stop whining. It was privately owned before, and it still is. Musk doesn't contr...

Stop protecting that shithole. shrugs

Not like anyone is forcing to try and sugarcoat what it is.

teal fossil May 22, 2024, 9:10 PM

#

dull star can we just get SD3 2B

2B and an 8B "Beta" with v0.9 weights. 👼

dull star May 22, 2024, 9:10 PM

#

3B?!

#

thomas

teal fossil May 22, 2024, 9:11 PM

#

dull star 3B?!

Just a typo. xD

dull star May 22, 2024, 9:11 PM

#

ah

#

minor spelling mistake

raven fern May 22, 2024, 9:18 PM

#

2B and 9S models (no one gonna get this reference)

frozen lynx May 22, 2024, 9:19 PM

#

guys what to do while waiting for sd3?

raven fern May 22, 2024, 9:20 PM

#

keep generating waifus and husbandos

cunning lintel May 22, 2024, 9:31 PM

#

rotund ibex May 22, 2024, 9:38 PM

#

https://tenor.com/view/no-donkeys-shrek-gif-16041065

Tenor

teal fossil May 22, 2024, 9:59 PM

#

raven fern 2B and 9S models (no one gonna get this reference)

Jokes on you I played that for the first time ever a few days ago.

cinder junco May 22, 2024, 10:19 PM

#

raven fern 2B and 9S models (no one gonna get this reference)

People already use SD as a “2B model”. No need to create another one! /s

raven fern May 22, 2024, 10:24 PM

#

haha

#

but man.. every time i get a ping from this server, i kinda hope it's from the announcements channel, but it turns out it's some random people... smh :3

#

but i still love ya

low stone May 22, 2024, 10:38 PM

#

frozen lynx guys what to do while waiting for sd3?

use all the new open source t5 based models

cinder junco May 22, 2024, 10:43 PM

#

I don’t really understand the logic of choosing T5 for the text encoder. Wouldn’t a newer llm (e.g. llama 3) or even a reduced parameter count T5 using new distillation methods be better?

raven fern May 22, 2024, 10:49 PM

#

hopefully the t5 module is somehow plug and play and we eventually can replace it

rain current May 22, 2024, 11:00 PM

#

bitter hearth May 22, 2024, 11:02 PM

#

bitter hearth May 22, 2024, 11:03 PM

#

raven fern hopefully the t5 module is somehow plug and play and we eventually can replace i...

from what I know you can or not use it and you can put it in your ram so it doesnt take vram from gpu

dull star May 22, 2024, 11:04 PM

#

I wonder if I'll switch over to invokeAI if that get SD3 support, just so img2img and regional stuff will be easier to do

raven fern May 22, 2024, 11:05 PM

#

bitter hearth from what I know you can or not use it and you can put it in your ram so it does...

true

dull star May 22, 2024, 11:05 PM

#

yeah idk about replacing it with other weights that do not match the size

#

as they literally just replace the bits of the weights of T5 with zeros in the MMDiT or whatever to make it not use T5

raven fern May 22, 2024, 11:06 PM

#

we are still in the dark when it comes to how this is actually all structured code wise :3

#

depending how their pipeline actually works

dull star May 22, 2024, 11:07 PM

#

no wait

#

I got it wrong

bitter hearth May 22, 2024, 11:07 PM

#

raven fern May 22, 2024, 11:08 PM

#

the community will find ways to fix stuff anyway

dull star May 22, 2024, 11:08 PM

#

they replace the T5 EMBEDDINGS with zeros, to make the conditioning

#

still, the T5 model they are using is going to perform the best

raven fern May 22, 2024, 11:10 PM

#

im also curious about the edit model, i hope it fixes the artifacts and deformations from cosxl

dull star May 22, 2024, 11:12 PM

#

I'm glad that inpainting will be a thing out of the box

raven fern May 22, 2024, 11:19 PM

#

bitter hearth

that's me right now lol, but without the glasses tho

#

and i also shaved

dull star May 22, 2024, 11:20 PM

#

god SD3 with highresfix would be a treat 🙏

raven fern May 22, 2024, 11:20 PM

#

OOM 🙂

dull star May 22, 2024, 11:21 PM

#

not for me

#

and if its gonna be 2B, then the base res is gonna be 512

raven fern May 22, 2024, 11:21 PM

#

yea i cant wait to upgrade my pc

dull star May 22, 2024, 11:21 PM

#

yeah its easy, just simply buy a new pc

#

duh

#

thomas

raven fern May 22, 2024, 11:21 PM

#

i mean im just waiting for 5090

bitter hearth May 22, 2024, 11:22 PM

#

raven fern that's me right now lol, but without the glasses tho

this is you

raven fern May 22, 2024, 11:22 PM

#

and if they dont give us at least 32GB vram, im gonna kill the nvidia dude... jk of course...

bitter hearth May 22, 2024, 11:22 PM

#

(in minecraft)

raven fern May 22, 2024, 11:22 PM

#

kek

#

i really dont like to have a beard personally, just always itchy, so i try to shave as much as possible, but sometimes lazy as hell

#

and technically my eyesight is not great, but i just dont want to wear glasses

bitter hearth May 23, 2024, 12:07 AM

#

Just see better

#

?

#

profit

low stone May 23, 2024, 12:14 AM

#

cinder junco I don’t really understand the logic of choosing T5 for the text encoder. Wouldn’...

Hunyuan uses a 15 gig version of llama2. Would be neat if they swap that out for 3

violet escarp May 23, 2024, 1:24 AM

#

cinder junco I don’t really understand the logic of choosing T5 for the text encoder. Wouldn’...

quantization is better than distillation

cinder junco May 23, 2024, 1:45 AM

#

violet escarp quantization is better than distillation

Interesting. Do you have a source for that? Is the rationale something like distillation (reducing number of weights so you can keep higher weight precision for a given model file size) effectively reducing the model's breadth of knowledge while retaining the accuracy of the knowledge it has, while quantization makes the knowledge more approximate but maintains its breadth?

violet escarp May 23, 2024, 2:02 AM

#

https://arxiv.org/pdf/2212.09720

#

I might have misremembered the paper, but quantization is still one of the best ways to reduce vram requirements

#

I guess you could distill and quantize

sterile pendant May 23, 2024, 4:10 AM

#

Yes in almost every case with llms, this is the answer. More parameters will almost always be better, assuming it's at least Q3 or higher. Like a 13B at Q4 will outperform a 7B at Q8. The 7B Q8 will be close to 7GB in size and have a perplexity like 5.9, the 13B Q4 will be around 7gb in size as well and have a perplexity of like 5.3 (lower is better)

#

But you can also distill the model as well to make room for more relevant data. If you're using a model for writing English novels, you probably don't need the model to contain a shitload of data about math and science

#

So in the same file size, you can stuff more relevant data in the model if you need to(or just shrink the overall filesize to play nicer with multimodal setups like when doing stable diffusion so you don't need 256 terabytes of ram for swapping 3000 models in and out of the GPU lol)

noble coyote May 23, 2024, 6:40 AM

#

Wombo Dream i2i into SDXL+LoRA+PAG Advanced NOT SD3

cobalt moon May 23, 2024, 8:53 AM

#

do you even know how's the performance of LLaMa 3 8B Q4 + SDXL

#

seriously even some of the SSD can't handle it

fossil pagoda May 23, 2024, 12:14 PM

#

low stone Hunyuan uses a 15 gig version of llama2. Would be neat if they swap that out for...

Im playing with it right now, seems like a lot of fun so far

teal fossil May 23, 2024, 12:44 PM

#

cobalt moon do you even know how's the performance of LLaMa 3 8B Q4 + SDXL

What do you mean? Running them in parallel or is there an SDXL version of Ella now?

low stone May 23, 2024, 1:19 PM

#

noble coyote Wombo Dream i2i into SDXL+LoRA+PAG Advanced NOT SD3

What's wombo dream? Edit: oh ok it's another service

low stone May 23, 2024, 2:01 PM

#

quartz mulch May 23, 2024, 2:48 PM

#

When I finally got it to work.

lavish sparrow May 23, 2024, 3:14 PM

#

teal fossil What do you mean? Running them in parallel or is there an SDXL version of Ella n...

I run them as a single workflow in comfy on my personal computer, so i'm not sure what the problem would be running that combo

#

however, you can't use the LLM directly as a tokenisation step 😦

teal fossil May 23, 2024, 3:25 PM

#

lavish sparrow however, you can't use the LLM directly as a tokenisation step 😦

So you are using it as a prompt enhancer? Can you show me the workflow?

lavish sparrow May 23, 2024, 3:25 PM

#

teal fossil So you are using it as a prompt enhancer? Can you show me the workflow?

yes, that's correct

#

but depending on model, you might have to change output, i'm using ollama as backend

lavish sparrow May 23, 2024, 3:26 PM

#

teal fossil So you are using it as a prompt enhancer? Can you show me the workflow?

this is the summary xD

#

switched from llama3 to phi medium, so gonna have to find my bearings again a bit

teal fossil May 23, 2024, 3:27 PM

#

lavish sparrow switched from llama3 to phi medium, so gonna have to find my bearings again a bi...

Thanks.

lavish sparrow May 23, 2024, 3:27 PM

#

teal fossil Thanks.

https://github.com/stavsap/comfyui-ollama

GitHub

GitHub - stavsap/comfyui-ollama

Contribute to stavsap/comfyui-ollama development by creating an account on GitHub.

#

this is the node i'm using to use ollama

#

important is that you get the model instruct template right!!!!

#

for this workflow, the system prompt MUST be used, or you'll be getting gibberish that you cannot use

#

right now i'm trying to find a better system promtp

#

and then stuff like this happens

#

@teal fossil workflow is inside the image

#

#

abstract nymph May 23, 2024, 4:10 PM

#

still no news? goodness

hallow lion May 23, 2024, 4:11 PM

#

Two weeks.

abstract nymph May 23, 2024, 4:12 PM

#

hallow lion Two weeks.

where you hear that?

hallow lion May 23, 2024, 4:15 PM

#

Echoes in the chambers of the mind.

#

Mind of Emad.

little quarry May 23, 2024, 4:18 PM

#

Two weeks

woven dock May 23, 2024, 4:35 PM

#

wrong server bub

drifting oak May 23, 2024, 4:42 PM

#

This honestly pisses me off, and it's only the people who don't know how AI works or they've just never used AI before, AI has been with us for a whole while it only got popular with t2i and llms

dull star May 23, 2024, 4:44 PM

#

decades ago: There is no such things as a digital artist thomas

drifting oak May 23, 2024, 4:50 PM

#

Lol even VFX, a vfx artist uses AI tech to track footage so they can add CGI stuff later on, but they wouldn't know that bcoz they're npcs

hallow lion May 23, 2024, 5:12 PM

#

Ludites man, get off your PC.

#

Use analogue only. Kodak Agfa family moments. Go develop that shit!

#

Better yet only cave art is true art, only uses natural raw materials, paint me some bisons.

icy drift May 23, 2024, 5:58 PM

#

abstract nymph where you hear that?

Two weeks is common knowledge by now, just scroll back up to see how common. Sources do not need to be cited for common knowledge.

dull star May 23, 2024, 5:58 PM

#

its only two weeks away on every single morning

#

thomas

icy drift May 23, 2024, 5:59 PM

#

hallow lion Better yet only cave art is true art, only uses natural raw materials, paint me ...

Humans are obviously unnatural, just like AI. Only stuff made by non-humans is really art.
Hmm... But life seems pretty unnatural compared to the other stuff in the solar system. Maybe only stuff made by non-living things is really art?

rotund ibex May 23, 2024, 6:19 PM

#

https://tenor.com/view/huh-gif-23918002

Tenor

low stone May 23, 2024, 6:24 PM

#

autumn arrow May 23, 2024, 6:39 PM

#

#

Had the hands fixed afterwards

#

But text is SD3

low stone May 23, 2024, 6:52 PM

#

abstract nymph May 23, 2024, 7:15 PM

#

icy drift Two weeks is common knowledge by now, just scroll back up to see how common. Sou...

ah, I see :D

#

you never know eh?

dreamy sundial May 23, 2024, 10:31 PM

#

autumn arrow

https://tenor.com/view/olitalia-olitaliapolska-oliwa-olive-oil-mozzarella-gif-12444952

Tenor

#

olive oil better

low stone May 23, 2024, 11:00 PM

#

low stone May 23, 2024, 11:16 PM

#

half tundra May 23, 2024, 11:39 PM

#

desenho kids

hallow lion May 24, 2024, 1:16 AM

#

How heavy are the weights? 50kg? 500? 5000?

#

Can we even lift the weights after Emad drops them? Are we worthy and capable?

hallow lion May 24, 2024, 2:01 AM

#

Time to sleep.

#

Maybe by the time I wake up the SD3 weights will drop.

teal fossil May 24, 2024, 2:21 AM

#

hallow lion Maybe by the time I wake up the SD3 weights will drop.

2 weeks.

hallow lion May 24, 2024, 2:25 AM

#

teal fossil 2 weeks.

OK. So I'll sleep for two weeks. See you then.

hallow lion May 24, 2024, 2:33 AM

#

dreamy sundial https://tenor.com/view/olitalia-olitaliapolska-oliwa-olive-oil-mozzarella-gif-12...

The oil flow stop and turn into savages.

remote holly May 24, 2024, 3:07 AM

#

Does sd3 released ?

low stone May 24, 2024, 3:54 AM

#

night condor May 24, 2024, 4:02 AM

#

Feu

teal fossil May 24, 2024, 5:14 AM

#

remote holly Does sd3 released ?

2 weeks.

raven fern May 24, 2024, 5:19 AM

#

2 weeks plus tax

quartz mulch May 24, 2024, 6:08 AM

#

I'd be soooo all over it dropping. IF I could even think about running it locally without my gpu sprouting legs and getting the hell out of my country.

#

even if it drops, I'll still have to use the api.

noble coyote May 24, 2024, 6:21 AM

#

2 epochs and an era™

noble coyote May 24, 2024, 6:26 AM

#

lavish sparrow right now i'm trying to find a better system promtp

System Prompt? Who or what generates that? 🙂

lavish sparrow May 24, 2024, 6:26 AM

#

noble coyote System Prompt? Who or what generates that? 🙂

it's a way to initiate your LLM

#

telling it what it should do

#

else it wwould just respond with a default answer

noble coyote May 24, 2024, 6:28 AM

#

OK, I have a ChatGPT4 account - will that link to this node?

lavish sparrow May 24, 2024, 6:30 AM

#

noble coyote OK, I have a ChatGPT4 account - will that link to this node?

no, this is ollama specific

#

i dunno if you can do a system prompt on gpt4

noble coyote May 24, 2024, 6:31 AM

#

lavish sparrow no, this is ollama specific

I'll try 🙂

noble coyote May 24, 2024, 7:33 AM

#

lavish sparrow no, this is ollama specific

http://localhost:11434/v1/chat/completions

quartz mulch May 24, 2024, 7:56 AM

#

In general no. ChatGPT does not offer api access. You need a playground account for that. A separate thing entirely.

#

But you can use openrouter. It's what I do, and the charges are exactly the same as in playground. But offers many, many more models.

noble coyote May 24, 2024, 7:57 AM

#

I do have a PG a/c - so I have plenty of options...

quartz mulch May 24, 2024, 7:58 AM

#

But chatgpt seems to be entirely non-functional for me for the past 5 minutes or so. API calls work though.

noble coyote May 24, 2024, 8:16 AM

#

lavish sparrow no, this is ollama specific

I have d/loaded Llama 2 Chat 7B Q4 into Jan.ai - can I link ComfyUI-Ollama locally to this?

remote holly May 24, 2024, 8:55 AM

#

teal fossil 2 weeks.

Realy ?

lavish sparrow May 24, 2024, 9:09 AM

#

noble coyote I have d/loaded Llama 2 Chat 7B Q4 into Jan.ai - can I link ComfyUI-Ollama local...

https://huggingface.co/NikolayKozloff/Meta-Llama-3-8B-Instruct-bf16-correct-pre-tokenizer-and-EOS-token-Q8_0-Q6_k-Q4_K_M-GGUF this or https://huggingface.co/bartowski/Phi-3-medium-4k-instruct-GGUF are pretty much the best light models imho

#

llama2 is pretty much outdated at this point 😦

teal fossil May 24, 2024, 9:13 AM

#

remote holly Realy ?

Every time someone asks for the release it's "2 weeks".
It's a Meme at this point.

sterile pendant May 24, 2024, 9:22 AM

#

lavish sparrow https://huggingface.co/NikolayKozloff/Meta-Llama-3-8B-Instruct-bf16-correct-pre-...

Yeah llama3 instruct or other dpo fine tunes of llama3 are super powerful for their 8b sizes. You can easily fit them in 8gb of vram with 8k context using Q4 or Q5 quants(within 1% of Q8's perplexity). They are on par with llama2 80b models.

lavish sparrow May 24, 2024, 9:37 AM

#

sterile pendant Yeah llama3 instruct or other dpo fine tunes of llama3 are super powerful for th...

phi3-medium (14b) is even more amazing in smarts -> it's super neutered tho

remote holly May 24, 2024, 9:48 AM

#

teal fossil Every time someone asks for the release it's "2 weeks". It's a Meme at this poin...

Ha lol

#

Usually I'm a patient person but the SD3 demos impressed me so much that I can't wait to have the open source weights to test, but I guess releasing a model takes time

frigid saffron May 24, 2024, 10:00 AM

#

sd3 open means midjourney in danger, i strongly believe

remote holly May 24, 2024, 10:04 AM

#

Same

#

A midjourney quality open , fully finetunable , is the best thing about sd3

hallow lion May 24, 2024, 10:55 AM

#

remote holly Usually I'm a patient person but the SD3 demos impressed me so much that I can't...

2 weeks.

desert garnet May 24, 2024, 11:17 AM

#

hallow lion 2 weeks.

2 solar weeks 🌞

hallow lion May 24, 2024, 11:20 AM

#

What happens is that Stability AI GPUs are so strong that time bends around them.

#

We are in a time loop of two weeks created by this phenomena.

remote holly May 24, 2024, 11:32 AM

#

hallow lion 2 weeks.

Maybe 32 may

#

Or 42

desert garnet May 24, 2024, 11:33 AM

#

but CTO said may,he dont lie

remote holly May 24, 2024, 11:34 AM

#

Wich will be released before between sd3 and gta6 ?

desert garnet May 24, 2024, 11:34 AM

#

manhunt 3

remote holly May 24, 2024, 11:43 AM

#

Alf life 3

dull star May 24, 2024, 12:08 PM

#

remote holly Maybe 32 may

yeah it'll come out at the end of this month, probably on may 32nd

#

thomas

#

hope 2B comes at the end of the month fr

#

8B and others need cooking though

#

2B is a perfect candidate for accessibility and training

hallow lion May 24, 2024, 12:09 PM

#

Whats the diff between 2B and 8B?

dull star May 24, 2024, 12:09 PM

#

quality probably for example

#

I am scared that prompt adherence too

hallow lion May 24, 2024, 12:10 PM

#

and Size

dull star May 24, 2024, 12:10 PM

#

a little but sure, but I don't know if it's gonna be massively worse than 8B in prompt adherence

#

if it's gonna be on the level of the others such as pixart-sigma I might as well wait for 8B or something

#

all I could do is generate paintings as they are in the style I like

#

finetunes of 2B would be fine though

desert garnet May 24, 2024, 12:11 PM

#

hallow lion Whats the diff between 2B and 8B?

they are the same,both unreleased

dull star May 24, 2024, 12:13 PM

#

kek

#

2B is closer to a full train as its a smaller parameter model, so we might even get better quality with 2B than what we see on the API

#

I just wonder how GOOD 8B would be at its fullest potential

#

like how llama3 8B was trained for 15T tokens and its wonderful

#

we won't get anything close to that, but I'm still willing to wait if stability still has the opportunity to train further as much as they can

gusty gale May 24, 2024, 1:47 PM

#

I feel like the time complexity of SD3 releasing is [ O(n^∞)+ (2 weeks) ]

#

it releases in two weeks, at any given time

pliant osprey May 24, 2024, 1:54 PM

#

sd3 not supporting unet is a problem with no support for controlnet ootb

remote holly May 24, 2024, 1:54 PM

#

gusty gale I feel like the time complexity of SD3 releasing is [ O(n^∞)+ (2 weeks) ]

No the sd3 release is O(TREE(exp(n)) + 2weeks)) lol

gusty gale May 24, 2024, 1:56 PM

#

remote holly No the sd3 release is O(TREE(exp(n)) + 2weeks)) lol

got it, see you then

remote holly May 24, 2024, 1:57 PM

#

"i will catch the turtle maybe in june "

dull star May 24, 2024, 1:59 PM

#

pliant osprey sd3 not supporting unet is a problem with no support for controlnet ootb

they will launch with controlnets

remote holly May 24, 2024, 1:59 PM

#

dull star we won't get anything close to that, but I'm still willing to wait if stability ...

Relsease the sd3 2B now is a good idea

dull star May 24, 2024, 2:01 PM

#

they said that the will release the smaller models first

noble coyote May 24, 2024, 2:11 PM

#

lavish sparrow no, this is ollama specific

Do I install it inside ComfyUI/Custom_Nodes? Or as a standalone?

lavish sparrow May 24, 2024, 2:11 PM

#

noble coyote Do I install it inside ComfyUI/Custom_Nodes? Or as a standalone?

ollama is a standalone program

noble coyote May 24, 2024, 2:27 PM

#

lavish sparrow ollama is a standalone program

Stuck at "[WinError 10061] No connection could be made because the target machine actively refused it!"

#

I have a fresh API Key and a positive balance $£

lavish sparrow May 24, 2024, 2:31 PM

#

noble coyote Stuck at "[WinError 10061] No connection could be made because the target machin...

ollama is local?

noble coyote May 24, 2024, 2:33 PM

#

I have d/loaded it into Custom_nodes and selected the Ollama_Vision Node in comfyUI

#

#

OK, I got the LLama Server running in the background - says Payload Too Large!!! 😄

lavish sparrow May 24, 2024, 2:46 PM

#

noble coyote OK, I got the LLama Server running in the background - says Payload Too Large!!!...

you got this one?

noble coyote May 24, 2024, 2:55 PM

#

#

noble coyote May 24, 2024, 3:45 PM

#

SD3@ClipDrop

Editorial_Style_PhotoThree-quarters_Front_AngleModern_AbstractSports_CarFront_HeadlightGlossy_M_1.png

Editorial_Style_PhotoThree-quarters_Front_AngleModern_AbstractSports_CarFront_HeadlightGlossy_M_2.png

Editorial_Style_PhotoFisheye_Lens_PerspectiveSteampunkCustom_Monster_TruckMassive_TyresRusty_Me_2.png

Editorial_Style_PhotoIsometric_Back_AngleNoir_AestheticsLuxury_SedanTail_LightSmoky_GlassComfo_2.png

Editorial_Style_PhotoLow_AngleUrban_ChicCoupeSide_MirrorMatte_PaintCompact_Body_ShapeVibrant_1.png

Editorial_Style_PhotoLow_AngleUrban_ChicCoupeSide_MirrorMatte_PaintCompact_Body_ShapeVibrant_2.png

Editorial_Style_PhotoLow_AngleUrban_ChicCoupeSide_MirrorMatte_PaintCompact_Body_ShapeVibrant_3.png

Editorial_Style_PhotoLow_AngleUrban_ChicCoupeSide_MirrorMatte_PaintCompact_Body_ShapeVibrant_.png

Editorial_Style_PhotoSide_ViewClassic_EleganceVintage_SaloonGrilleLeather_and_Polished_ChromeC.png

Editorial_Style_PhotoStraight_FrontPop_ArtConvertibleTrunkRetro_FabricElaborate_TrimsCandy_Co_3.png

#

In_a_vibrant_Latin_jazz_dance_club_infused_with_the_bold_colors_of_Fauvism_and_the_geometric_eleganc_2.png

In_a_vibrant_Latin_jazz_dance_club_infused_with_the_bold_colors_of_Fauvism_and_the_geometric_eleganc_3.png

In_a_vibrant_Latin_jazz_dance_club_infused_with_the_bold_colors_of_Fauvism_and_the_geometric_eleganc.png

In_the_realm_of_artistic_expression_the_beauty_envisioned_by_Rauschenberg_Diana_Ejaita_Basquiat_2.png

In_the_realm_of_artistic_expression_the_beauty_envisioned_by_Rauschenberg_Diana_Ejaita_Basquiat_.png

Editorial_Style_PhotoThree-quarters_Front_AngleModern_AbstractSports_CarFront_HeadlightGlossy_M_3.png

Editorial_Style_PhotoThree-quarters_Front_AngleModern_AbstractSports_CarFront_HeadlightGlossy_M.png

Editorial_Style_PhotoTop_Down_PerspectiveSurrealismOff-RoaderWheelsRough_Tyre_TreadIntricate_G_2.png

Editorial_Style_PhotoTop_Down_PerspectiveSurrealismOff-RoaderWheelsRough_Tyre_TreadIntricate_G_3.png

In_a_vibrant_Latin_jazz_dance_club_infused_with_the_bold_colors_of_Fauvism_and_the_geometric_eleganc_1.png

robust junco May 24, 2024, 4:45 PM

#

icy drift Get an account. All the important business news stuff happens there. It's more r...

funk twitter 😉

dull star May 24, 2024, 4:46 PM

#

YESSSSS

#

this would be amazing

#

bunch of variants to choose from

#

the community will find which one's the best for each model size

raven fern May 24, 2024, 6:13 PM

#

We might release some variants.
We might release some
We might release
We might
sadcat

bitter hearth May 24, 2024, 6:15 PM

#

sd3 2030 confirmed

cursive mist May 24, 2024, 6:18 PM

#

2030 AD?

pale aurora May 24, 2024, 6:20 PM

#

trying to use the stability api and I can't even gen a picture of a woman in a shirt. Just comes back blurred. You can clearly see that she's NOT naked

jolly drum May 24, 2024, 6:20 PM

#

pale aurora trying to use the stability api and I can't even gen a picture of a woman in a s...

whats the prompt though

raven fern May 24, 2024, 6:20 PM

#

enhance.... enhance....

#

that second pic tho.. there is something sus going on on the bottom part of the pic LOL

cunning lintel May 24, 2024, 6:21 PM

#

dull star YESSSSS

||not sd3||

jolly drum May 24, 2024, 6:21 PM

#

raven fern that second pic tho.. there is something sus going on on the bottom part of the ...

idk but it looks like a snail to me lmao

raven fern May 24, 2024, 6:21 PM

#

kek

raven fern May 24, 2024, 6:22 PM

#

cunning lintel ||not sd3||

the hell is that face tho creepy

pale aurora May 24, 2024, 6:22 PM

#

raven fern that second pic tho.. there is something sus going on on the bottom part of the ...

You know, you might be right on the second one. I'm trying to get a pic of a woman sitting on a sundae wearing a shirt that says 'cat'. Don't ask me why, I just wanted to test the api for prompt adherence.

low stone May 24, 2024, 6:22 PM

#

pale aurora May 24, 2024, 6:23 PM

#

I also have 'nude' as a negative

raven fern May 24, 2024, 6:24 PM

#

that straight up looks like one of those videos when you start MGS4, its like some weird tv channels @low stone

bitter hearth May 24, 2024, 6:24 PM

#

y'all think we will ever get a truly photorealistic model?

low stone May 24, 2024, 6:25 PM

#

pale aurora trying to use the stability api and I can't even gen a picture of a woman in a s...

I don't know what you're talking about, I was able to generate nudes all over the place: #🆕｜sd3 message

raven fern May 24, 2024, 6:25 PM

#

i mean some sdxl models are very good with photorealistic, assuming you prompt correctly

cunning lintel May 24, 2024, 6:28 PM

#

pale aurora I also have 'nude' as a negative

that's not how the filter works, lots of false positives. my pet theory is anatomy is so outrageously bad, they decided just to blur most gens that are somewhat horrific :p

pale aurora May 24, 2024, 6:28 PM

#

low stone I don't know what you're talking about, I was able to generate nudes all over th...

maybe they've cranked the settings down?

raven fern May 24, 2024, 6:28 PM

#

so you telling me we cant generate horror?

cunning lintel May 24, 2024, 6:29 PM

#

actually, it works better than females 🙂

#

No, in all seriousness, that filter is bizarre, but it is what it is. The good thing is that there's a lot of prompts to try for the first time once you can use sd3 local 😂

bitter hearth May 24, 2024, 6:31 PM

#

raven fern i mean some sdxl models are very good with photorealistic, assuming you prompt c...

sure but they always lack in skin texture, always looking glossy and the dreaded "ai face"

raven fern May 24, 2024, 6:31 PM

#

well to answer your question, one day for sure

#

i dont see why not

bitter hearth May 24, 2024, 6:32 PM

#

i hope so, looking at ai photos all day makes it so easy to tell. Kinda ruins it

raven fern May 24, 2024, 6:33 PM

#

cunning lintel No, in all seriousness, that filter is bizarre, but it is what it is. The good t...

but wait, is that filter because of sd3 by itself or is it more because the pics are shown on discord here and its blocked on discord? i have no idea cause im not using the api lol

cunning lintel May 24, 2024, 6:33 PM

#

it's at the api level

raven fern May 24, 2024, 6:33 PM

#

ah

#

so basically you get them blurred on comfy for example from the api right?

cunning lintel May 24, 2024, 6:34 PM

#

yup

raven fern May 24, 2024, 6:34 PM

#

well that sucks

cunning lintel May 24, 2024, 6:34 PM

#

i think the api is the only source, so it's the same everywhere

raven fern May 24, 2024, 6:34 PM

#

yea just have to wait for the weights :3

#

i mean can't they just release at least one? like heck even give us the small 2B model to play with

#

sigh sadcat

cunning lintel May 24, 2024, 6:37 PM

#

If the model isn't good yet, the reactions will be "is this it???" and no one will remember it's the limited 2b model. Don't think SAI can win whatever they do at this point

#

Though some openness/updates would be much appreciated

pale aurora May 24, 2024, 6:39 PM

#

sdxl wasn't really all that great when it released either. The community made it good

raven fern May 24, 2024, 6:40 PM

#

i would be happy even with a text update that says, it will release in June or something

#

just give us some info

pale aurora May 24, 2024, 6:47 PM

#

bitter hearth sure but they always lack in skin texture, always looking glossy and the dreaded...

I don't get poor skin texture or ai face with sdxl

bitter hearth May 24, 2024, 6:47 PM

#

pale aurora I don't get poor skin texture or ai face with sdxl

send some samples

pale aurora May 24, 2024, 6:48 PM

#

one moment

raven fern May 24, 2024, 7:00 PM

#

take your time

bitter hearth May 24, 2024, 7:03 PM

#

no problem, for me the best i've got out of sdxl is

pale aurora May 24, 2024, 7:04 PM

#

I mean, that's pretty good?

bitter hearth May 24, 2024, 7:05 PM

#

maybe, it's hard to tell after a while if it's real enough

pale aurora May 24, 2024, 7:05 PM

#

aside from the iris, I wouldn't know it was ai, and only because I'm looking for it

weary crystal May 24, 2024, 7:13 PM

#

bitter hearth maybe, it's hard to tell after a while if it's real enough

pale aurora May 24, 2024, 7:13 PM

#

bitter hearth send some samples

pale aurora May 24, 2024, 7:13 PM

#

weary crystal

1st one is not bad, second looks like ai to me

weary crystal May 24, 2024, 7:14 PM

#

pale aurora 1st one is not bad, second looks like ai to me

yeah just the first two generated...

pale aurora May 24, 2024, 7:15 PM

#

weary crystal yeah just the first two generated...

the skin is a little too glossy and the depth of field looks unnatural

pale aurora May 24, 2024, 7:16 PM

#

weary crystal yeah just the first two generated...

that being said, I mean, I put mine through a few different detailing steps, and I render them with two different checkpoints

weary crystal May 24, 2024, 7:17 PM

#

pale aurora that being said, I mean, I put mine through a few different detailing steps, and...

Yeah but the skin is very very smooth without any little imperfection

pale aurora May 24, 2024, 7:18 PM

#

weary crystal Yeah but the skin is very very smooth without any little imperfection

it's meant to look like a model shoot, touched up

low stone May 24, 2024, 7:21 PM

#

raven fern i would be happy even with a text update that says, it will release in June or s...

So I've been watching Lykon's tweets as they respond to the onslaught of SD3 when posts. One just now said "Sooner than you expect" which was then deleted. 🙂

pale aurora May 24, 2024, 7:31 PM

#

low stone So I've been watching Lykon's tweets as they respond to the onslaught of SD3 whe...

interesting....

pale aurora May 24, 2024, 7:32 PM

#

weary crystal Yeah but the skin is very very smooth without any little imperfection

I can get more textured and blemishy too

raven fern May 24, 2024, 7:38 PM

#

low stone So I've been watching Lykon's tweets as they respond to the onslaught of SD3 whe...

huh... interesting...

#

is there any possibility before June? :3

low stone May 24, 2024, 7:39 PM

#

not this weekend becuase it's a holiday... so that's the only sad part i'm focusing on. 🙂

low inlet May 24, 2024, 7:52 PM

#

Hello Guys

#

can someone join diffusers 4 vc to talk ?

dull star May 24, 2024, 7:56 PM

#

low stone So I've been watching Lykon's tweets as they respond to the onslaught of SD3 whe...

wtf I remember that one too

#

lmao

#

guess he got contacted that hte model is not 2 weeks away, but like 2 months

#

😔

#

Everything is soon if you personally have access to it

pale aurora May 24, 2024, 8:01 PM

#

dull star guess he got contacted that hte model is not 2 weeks away, but like 2 months

man, the morale is pretty low around stability, which sucks. I really hope we continue to see open models because giving everything to openai/msft/etc sucks

dusky thistle May 24, 2024, 8:01 PM

#

low stone So I've been watching Lykon's tweets as they respond to the onslaught of SD3 whe...

that's a lot better than the other recent one where he said something like "as far as i know, the plan is still to release the weights"

dusky thistle May 24, 2024, 8:02 PM

#

pale aurora man, the morale is pretty low around stability, which sucks. I really hope we c...

chinese companies seem pretty intent on continuing to work on such things

pale aurora May 24, 2024, 8:02 PM

#

dusky thistle chinese companies seem pretty intent on continuing to work on such things

those chinese companies censor the shit out of the model

dusky thistle May 24, 2024, 8:02 PM

#

they do

#

i tried generating flags with hunyuan

#

won't post em here, but it was a test for how carefully they've checked their surely massive data set

#

they have def gone through every image one by one

#

flags that are easily confused: holland, russia, france. nailed em all consistently

#

britain, USA, australia: nailed all those consistently

pale aurora May 24, 2024, 8:03 PM

#

pixart sigma might as well be sd2 for generating anything more than fully clothed people

dusky thistle May 24, 2024, 8:04 PM

#

zero trouble with their own of course, couldn't do taiwan, and had absolutely no concept of what a nazi flag was or the confederate flag. none.

pale aurora May 24, 2024, 8:04 PM

#

'flag of taiwan' probably gets you extradited

dusky thistle May 24, 2024, 8:04 PM

#

both of those latter symbols are fn everywhere in all kinds of random images

#

historic stuff, garbage online, photos of random political events

#

the fact those are not in their model at all is honestly kinda shocking/impressive

pale aurora May 24, 2024, 8:05 PM

#

dusky thistle the fact those are not in their model at all is honestly kinda shocking/impressi...

... expected you mean? You gotta remember that the ccp oversees all of that stuff

dull star May 24, 2024, 8:05 PM

#

SD3 8B (or 2B if the prompt adherence is at least better than pixart and other open ones) is something that would be amazing

#

I do believe it coming out

#

just not soon

#

pixart-sigma isn't even close to what SD3 8B can do

dusky thistle May 24, 2024, 8:06 PM

#

pale aurora ... expected you mean? You gotta remember that the ccp oversees all of that stu...

nah, i'm not surprised they tried

#

what's impressive is that it is completely not in the data set at all

#

that's a ton of labor

dull star May 24, 2024, 8:06 PM

#

SD3 8B is like 70% of the way there to ideogram level of adherence, and pixart is like 40% or worse, idk how to say it

#

it also all boils down to motion/graphic elements and text working in SD3, whilst not in Pixart-Sigma

#

but pixart-sigma, for a 0.6B model is still extremely impressive

#

for complex compositions its waaaaay better than something like SDXL

#

but it doesn't meet what I want to do, which SD3 8B gets really really close

cunning lintel May 24, 2024, 8:08 PM

#

pale aurora pixart sigma might as well be sd2 for generating anything more than fully clothe...

try harder

pale aurora May 24, 2024, 8:09 PM

#

oh yeahhh forgot about ideogram. That model is great

dull star May 24, 2024, 8:10 PM

#

ideogram is amazing, I just wish there was something like it offline

#

SD3 8B gets quite close, and I suppose finetunes would be 99% of the way there tbh

cunning lintel May 24, 2024, 8:10 PM

#

pixart sigma does some nudity by accident quite often. maybe the prompts were filtered, but the images it was trained on less so it seems

dull star May 24, 2024, 8:11 PM

#

2B or 4B being able to be finetuned offline, I would make a bunch of models that would boost the motion graphic(?) element capabilities

dull star May 24, 2024, 8:12 PM

#

cunning lintel pixart sigma does some nudity by accident quite often. maybe the prompts were fi...

so did SD3 to one person here

#

it got through the censorship very rarely

#

I don't know how it happened

#

#🆕｜sd3 message

cunning lintel May 24, 2024, 8:19 PM

#

oh i rember that... it feels as if filter has some sort of cumulative scoring system. woman: 1pt, photorealistic: 1pts, upper body skin: 2pts, breast structure: 1pts. 3 is out. That image evaded it as well, totally covered upper body 😂

dreamy sundial May 24, 2024, 8:30 PM

#

#

raw sd3 output

#

#

faint breach May 24, 2024, 8:41 PM

#

cunning lintel oh i rember that... it feels as if filter has some sort of cumulative scoring sy...

doubtful it's that fine grained. It's just a classifier model thats been trained to recognize the nsfw'nish of an image and give it a score

dull star May 24, 2024, 8:58 PM

#

same guess

#

its not even well trained

#

I wonder if they are just using the laion nsfw detector or whatever

severe phoenix May 24, 2024, 9:00 PM

#

Playground is soon about to drop their v3 model which looks like it might be up their with sd3. they didnt specifically say this but they sound pretty confident about its ability to render better faces and prompt adherence.

low stone May 24, 2024, 9:19 PM

#

severe phoenix Playground is soon about to drop their v3 model which looks like it might be up ...

That's exciting. Any idea about clip/language model side of things?

dull star May 24, 2024, 9:33 PM

#

I want SD3 so bad man

testing-out-sd3-on-my-own-ideas-i-tried-not-to-cherry-pick-v0-uhs06438cavc1.webp

#

these images were from a month ago

#

testing-out-sd3-on-my-own-ideas-i-tried-not-to-cherry-pick-v0-do8hsnfacavc1.webp

#

the meme potential

#

testing-out-sd3-on-my-own-ideas-i-tried-not-to-cherry-pick-v0-d1lek8sxbavc1.webp

cunning lintel May 24, 2024, 10:03 PM

#

||A surreal, dreamlike portrait of a brunette, with a mesmerizing, infinite zoom effect, where a circular section of her face is magnified, revealing the intricate texture of her skin, with tiny, industrious construction workers, no larger than a grain of rice, busily at work, filling the pores,||

low stone May 24, 2024, 10:09 PM

#

cunning lintel ||A surreal, dreamlike portrait of a brunette, with a mesmerizing, infinite zoom...

That's a neat prompt

cunning lintel May 24, 2024, 10:15 PM

#

The effect is 10+, but for prompt understanding, well, i had something else in mind 😂

raven fern May 24, 2024, 10:20 PM

#

dull star

haha from san andreas

low stone May 24, 2024, 10:23 PM

#

cunning lintel The effect is 10+, but for prompt understanding, well, i had something else in m...

Supposedly the term is "inset map" but sd3 nor the local stuff seems to understand that

cunning lintel May 24, 2024, 10:26 PM

#

inset map, makes sense , obviously still too much off the beaten path

cunning lintel May 24, 2024, 11:24 PM

#

In a dreamlike, 8K photo, a whimsical, furry, futuristic white cyber owl with purple streaks in its fur, sits in a mystical Valdivian forest, surrounded by bioluminescent foliage, with a tiny garden gnome stepping into a mushroom house. The aurora borealis swirls above, casting an ethereal glow on a secret arctic vault, where a cute, velvet-skinned goblin with whiskers poses.

#

A stunning, ultra-realistic portrait of a black Barbie doll stands in a secret arctic vault, surrounded by towering ice mountains, and a kaleidoscope of colors. In the background, a cute, velvet-skinned goblin with whiskers poses, surrounded by clockwork machinery and glowing orbs.

#

In a surreal, 8K photo, a futuristic, cyberpunk cityscape unfolds, with a vibrant, candy village, where a furry, futuristic white cyber dog rides a dragon-zebra chimera. A woman barbarian rides a majestic, agitated dragon-zebra chimera, through a dense, mystical forest of Schwarzwald.

#

A mesmerizing, 8K portrait of a brunette, with a mesmerizing, infinite zoom effect, reveals the intricate, labyrinthine texture of her skin, where tiny, industrious construction workers, no larger than a grain of rice, busily toil within the pores, building minute skyscrapers and suspension bridges. In the background, a surreal, fantasy cityscape unfolds, with towering ice mountains, and a vibrant, candy village, where a furry, futuristic white cyber dog with purple streaks in its fur, wearing a dogtag saying "soon", rides a dragon-zebra chimera through the streets.

#

A stunning, ultra-realistic portrait of a black Barbie doll, dressed in intricately detailed clothes and jewelry, stands in a secret arctic vault, surrounded by towering ice mountains, and a kaleidoscope of colors, swirling with abstract, Dalí-esque patterns. In the background, a cute, velvet-skinned goblin with whiskers poses, surrounded by clockwork machinery and glowing, iridescent orbs, as the aurora borealis swirls above.

#

In a surreal, 8K photo, a vibrant, fantasy confectionery wonderland unfolds, with a whimsical, furry, futuristic white cyber owl perched on a mushroom house, surrounded by biscuits, and a flowing chocolate river. A woman barbarian rides a majestic, agitated dragon-zebra chimera, through a dense, mystical forest.

dull star May 24, 2024, 11:55 PM

#

https://github.com/Lucky-Lance/TerDiT

GitHub

GitHub - Lucky-Lance/TerDiT

Contribute to Lucky-Lance/TerDiT development by creating an account on GitHub.

#

chat?

#

@low stone

#

@faint breach

#

4.2B model

#

#

97s inference time tho

#

but still, 3GB vram requirement for 4.2B

#

faint breach May 25, 2024, 12:01 AM

#

pretty neat. always great to see new models that can run on cheap hardware

#

especially since stability is going to release sd3 and be done with image models

dull star May 25, 2024, 12:05 AM

#

we need to see how this affects bigger resolutions and accurately captioned datasets (prompt adherence)

#

currently its just 256x256

severe phoenix May 25, 2024, 12:09 AM

#

low stone That's exciting. Any idea about clip/language model side of things?

nope, they havent revealed any technical things, not even images. which honestly i kinda like...they said its about 30% done, so i'm guessing 2 more weeks or months who knows lool . i just wish these companies would just keep tthings quiet and drop things when its actually finished. this whole waiting thing is beginning to get tidious and abit annoying. you're not selling a movie or sthng geez.

severe phoenix May 25, 2024, 12:11 AM

#

faint breach especially since stability is going to release sd3 and be done with image models

ehh what? they packing up after sd3?

faint breach May 25, 2024, 12:16 AM

#

severe phoenix ehh what? they packing up after sd3?

before emad quit he said sd3 would be the last image model they ever made

#

That's entirely fine though. It's sad to see Stability stop in this field, but it's fine. There is enough research and resources available that others can more easily keep going.

lucid swift May 25, 2024, 12:17 AM

#

faint breach That's entirely fine though. It's sad to see Stability stop in this field, but i...

but why would they stop

#

i think ther streght is in image models

faint breach May 25, 2024, 12:17 AM

#

theres more money in LLMs

lucid swift May 25, 2024, 12:18 AM

#

sure but the open models are so good and you compete with the gpus and engeneres and money of meta

#

but for open image models is almost no competion

faint breach May 25, 2024, 12:19 AM

#

how? lol. Meta's business is advertising. They give away all their models for free. How do you compete with that as an exclusively model building company?

#

it's not a popularity contest. Stability is an actual business that needs revenue

hallow lion May 25, 2024, 12:23 AM

#

dull star

Turbodong is that you?

real terrace May 25, 2024, 12:47 AM

#

cunning lintel ||A surreal, dreamlike portrait of a brunette, with a mesmerizing, infinite zoom...

Oh I want that kind of gen

raven fern May 25, 2024, 12:50 AM

#

faint breach how? lol. Meta's business is advertising. They give away all their models for f...

is Meta gonna start releasing their own image gen models?

lucid swift May 25, 2024, 12:52 AM

#

faint breach how? lol. Meta's business is advertising. They give away all their models for f...

thats what i am saying you cant compete with them. tahts why doing image models is smarter becasue meta does not relese them

real terrace May 25, 2024, 12:54 AM

#

*Congratulations you've been added to the Stable Diffusion 3 early preview waitlist!

You'll be notified by email with an invite to our Discord server when you've been granted access to the preview. *

nothing since that 😕

low stone May 25, 2024, 1:19 AM

#

cunning lintel > In a surreal, 8K photo, a vibrant, fantasy confectionery wonderland unfolds, w...

#

Jeez that is the most ridiculous prompt

low stone May 25, 2024, 1:20 AM

#

dull star https://github.com/Lucky-Lance/TerDiT

My checkpoints folder just exploded.

raven fern May 25, 2024, 1:23 AM

#

is there a way to try that in comfy?

low stone May 25, 2024, 1:24 AM

#

raven fern is there a way to try that in comfy?

they just got hunyuan going in comfy the other day, so this would also probably take a bit.

raven fern May 25, 2024, 1:25 AM

#

nice, i like to try new toys 🙂

cobalt moon May 25, 2024, 1:36 AM

#

me who only have 2GB VRAM :

hallow lion May 25, 2024, 1:37 AM

#

Maybe the cat with 4GB of Vram can help you instead.

jolly swan May 25, 2024, 2:45 AM

#

Once again I have to remind that there is no Pony team, it's just me.

I would be happy to train on whatever (which has the right license) but it's a tricky question. Good data can make meh architecture shine but will still be inferior to good model with good data, hence I am still optimistic about SD3 option. If it does not happen for whatever reason and I need to find back up - honestly another XL version with improved data is probably fine.

woven dock May 25, 2024, 2:46 AM

#

dull star but still, 3GB vram requirement for 4.2B

Is that loaded in int4?

#

1.1gb is quite small for 4.2b in fp8

woven dock May 25, 2024, 2:48 AM

#

jolly swan Once again I have to remind that there is no Pony team, it's just me. I would b...

Hey astralite, I've got a question:

When you train (or fine-tune) pony, what kind of hardware/gpus do you train on?

#

Like what's the vram requirements

jolly swan May 25, 2024, 2:50 AM

#

woven dock Hey astralite, I've got a question: When you train (or fine-tune) pony, what ki...

v6 was trained on 3x a100 with 80GB VRAM for 3 months

woven dock May 25, 2024, 2:51 AM

#

Does the vram usage scale up as more images are dumped into the dataset?

jolly swan May 25, 2024, 2:52 AM

#

Not the memory, you need model weights + (batch size * per image vram) so more VRAM means more images per iteration (but it also does not scale linearly)

#

Generally you just need enough to fit weights and 8+ batch

#

So 40GB is most likely enought for XL finetuning

#

I remmeber 3090/4090s not having enough to use Adam (which you want to use)

muted dove May 25, 2024, 5:53 AM

#

jolly swan Once again I have to remind that there is no Pony team, it's just me. I would b...

Would you not consider using Cascade?

violet escarp May 25, 2024, 6:00 AM

#

severe phoenix ehh what? they packing up after sd3?

All of the researchers are gone. There is nobody left to develop a new model.

#

and Stability really needs money

dusky thistle May 25, 2024, 6:10 AM

#

jolly swan v6 was trained on 3x a100 with 80GB VRAM for 3 months

impressive

jolly swan May 25, 2024, 6:13 AM

#

muted dove Would you not consider using Cascade?

It's non commercial. So, no.

vale furnace May 25, 2024, 6:38 AM

#

gm

muted dove May 25, 2024, 6:38 AM

#

jolly swan It's non commercial. So, no.

Pony,or Cascade?

#

You're still allowed to fine tune it 😃

#

Please 🙏

jolly swan May 25, 2024, 7:15 AM

#

muted dove You're still allowed to fine tune it 😃

I can fine tune it but can't use it commercially, making Pony is super expensive and running inference for it also costs money, I am already operating at a loss so making it even worth is not a great option.

muted dove May 25, 2024, 7:16 AM

#

jolly swan I can fine tune it but can't use it commercially, making Pony is super expensive...

The reality of our enjoyment, a shame 😞

#

Worth trying to raise funds from the community? I suppose it all hinges on the SD3 release anyway.

jolly swan May 25, 2024, 7:17 AM

#

I would prefer to figure out functional economy rather than using some one time large events like kickstarter

lunar rivet May 25, 2024, 7:19 AM

#

iirc redmondai sponsored a cascade finetune once upon a time, maybe they'd be up for another

woeful spindle May 25, 2024, 8:06 AM

#

jolly swan v6 was trained on 3x a100 with 80GB VRAM for 3 months

oh boy

#

that's like $15k

#

would it be more expensive if you rented 9xA100 for a month?

#

or 270xA100 for a day🤔

jolly swan May 25, 2024, 9:17 AM

#

woeful spindle that's like $15k

we actually bought the hardware so a bit more expensive

sterile pendant May 25, 2024, 10:25 AM

#

woeful spindle would it be more expensive if you rented 9xA100 for a month?

Oh I misread this, I see that they said they have 3 a100s and it took 3 months.~~ Without accounting for power supply efficiency, A100s pull around 300w at max load plus some minor CPU and other peripheral usage. So let's round up and say 3000w for the 9, you now have to offset two space heater's worth of heat with HVAC for a whole month(~10k btu which is a large 120v window unit that would pretty much have to run nonstop).~~ Meh it's too early in the morning to pencil all this shit out, but these are some of the things to consider lol. Basically, ponyboi would have a massive power bill, but you'd have to do the math to see which would be cheaper. Him buying all the equipment+his power bill vs renting a server to do it in a much shorter amount of time. Plus, he makes multiple models with the hardware he invested in.

#

Oh and also, a lot of data centers have rules against using them for nsfw stuff

rain current May 25, 2024, 10:46 AM

#

sigma 2k - superhands....

cunning lintel May 25, 2024, 10:49 AM

#

rain current sigma 2k - superhands....

Just use a hand detailer 🤣

rain current May 25, 2024, 10:50 AM

#

I think it wouldn't work, it will detect potatoes instead of hands

sullen moss May 25, 2024, 11:19 AM

#

#

😂

#

Two weeks mf

low stone May 25, 2024, 1:04 PM

#

@cunning lintel since you were posting complicated prompts yesterday. (Some sd3 pics, some not) / people traveling through a tube network over a futuristic city. Robot Santa. Man is friend with a robot. One eyed women. Doctors with Cthulhu heads. Scientists with space ship shipping businesses.

#

Another: large throne room where a scientist with Cthulhu head sits on his throne made of lightsabers. Frantic and panicking crowds beneath him as he decrees his next royal orders to begin the end.

peak kettle May 25, 2024, 1:19 PM

#

jolly swan v6 was trained on 3x a100 with 80GB VRAM for 3 months

Hi, A quick question. When you 3 months of training, Is it with the final dataset or Iterating by finetuning the dataset every training? Sorry if that a stupid question.

low stone May 25, 2024, 1:46 PM

#

noble coyote May 25, 2024, 3:27 PM

#

SD3@ClipDrop

Insanely_detailed_cinematic_face_portrait_photography_of_a_majestic_beautiful_fierce_yin_and_yang_sy_1.png

Insanely_detailed_cinematic_face_portrait_photography_of_a_majestic_beautiful_fierce_yin_and_yang_sy.png

#

SD3@ClipDrop

fat_man_bathing_hat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_6.png

fat_man_bathing_hat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_4.png

fat_man_bathing_hat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_2.png

#

SD3@ClipDrop

fat_man_bathing_hat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_19.png

fat_man_bathing_hat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_16.png

fat_man_bathinghat_large_bath_tub_on_wheels_umbrella_blue_sky_clouds_rain_drops_sea_shells_l_1.png

cunning lintel May 25, 2024, 3:36 PM

#

low stone <@284358521519341568> since you were posting complicated prompts yesterday. (Som...

yeah, mixed some prompts together, poor sd3 trying to make sense of it :p

low stone May 25, 2024, 3:36 PM

#

@cunning lintel hunyuan at 2.40:1 aspect ratio

raven fern May 25, 2024, 3:37 PM

#

wide

cunning lintel May 25, 2024, 3:40 PM

#

i looked, looked again, but well, of course cthulhu has a third leg with all those tentacles!

low stone May 25, 2024, 3:40 PM

#

even gods need sensible footwear

sick cedar May 25, 2024, 4:38 PM

#

dull star https://github.com/Lucky-Lance/TerDiT

I assume there is no infulstructure made for this yet? So we may need to wait for a extention/nodes and such in order to make it usable?

hallow lion May 25, 2024, 5:10 PM

#

rain current sigma 2k - superhands....

no hand detailer can fix that 😄

faint breach May 25, 2024, 5:34 PM

#

muted dove You're still allowed to fine tune it 😃

one could argue that if finetuning cascade sends a ton of donations to their patreon or other service, they've commercialized it and are liable af. Anyone trying to earn from their AI work won't touch it

#

non commercial research only license really kills a model. also shutting down the official channel for it does too

muted dove May 25, 2024, 5:55 PM

#

Yep...Gits! 🤣

teal fossil May 25, 2024, 6:13 PM

#

jolly swan Once again I have to remind that there is no Pony team, it's just me. I would b...

How about CosXL as a backup plan? It's supposed to be an update for base SDXL.

autumn arrow May 25, 2024, 6:28 PM

#

How do I buy more artisan credits?

#

The site just bumps me back to discord

little quarry May 25, 2024, 6:37 PM

#

https://tenor.com/view/2week-countdown-14days-more-waiting-gif-13525316

Tenor

jolly swan May 25, 2024, 6:43 PM

#

teal fossil How about CosXL as a backup plan? It's supposed to be an update for base SDXL.

Non commercial

teal fossil May 25, 2024, 6:44 PM

#

jolly swan Non commercial

... Why the heck are they doing that... sighs

dull star May 25, 2024, 6:56 PM

#

same with SD3, but with SD3 you can buy a license

#

I forgot if you can also do that to cosxl

teal fossil May 25, 2024, 7:27 PM

#

dull star I forgot if you can also do that to cosxl

Apparently not.

sterile pendant May 25, 2024, 7:31 PM

#

teal fossil ... Why the heck are they doing that... *sighs*

It's a research experiment, much like cascade. They may eventually decide to do some kind of spin-offs with them down the line or sell them

dusky thistle May 25, 2024, 8:55 PM

#

i think it's bizarre that a company with a cash crisis has so many models without a commercial license

#

you'd think you'd at least leave the door open to a conversation

rain palm May 25, 2024, 9:25 PM

#

little quarry https://tenor.com/view/2week-countdown-14days-more-waiting-gif-13525316

huh, 2 weeks left?

little quarry May 25, 2024, 9:26 PM

#

Two more weeks after another 2 weeks

#

Rumors of the rumored SD3

rain palm May 25, 2024, 9:26 PM

#

ah, reminds me of Nintendo's announcement announcement announcement.

#

so, which dataset is SD3 based on?

dull star May 25, 2024, 9:37 PM

#

my friend's friend who was a stability employee 6.53 years ago heard from my uncle that he heard from emad that it will come out tomorrow (this is legit)

dull star May 25, 2024, 9:39 PM

#

rain palm so, which dataset is SD3 based on?

no idea, but I bet its partially laion, trained on 512px first, then on 1024px for 8B and later on the smaller models
the dataset was captioned 50/50 by CogVLM (detailed accurate prompts) and the raw captions

#

they truncated the prompt length to like 72 or whatever because of clip and I don't know if this is real (I heard from a random discord user who heard from a random discord user who heard from a twitter user who was claiming to be a stability employee but it was actually my uncle all along), but they might ditch clip and use T5 only and continue training with non-truncated prompts or whatever idk what was really told, I bet I'm wrong, it could just be clip being ditched and that already heavily improved the text adherence and the prompts were never truncated or something I don't know, it doesn't even matter how hard you try you will never know the truth cause we are never given it.

rain palm May 25, 2024, 9:43 PM

#

sounds like a lot of room for error 😛

dull star May 25, 2024, 9:45 PM

#

a lot of room for misinformation spread around as fact because stability tries their best not to inform us, so we make up random shit constantly and get proven wrong

#

I need to read the paper again

#

random screenshot go!!!! (this must mean something idk)

rain palm May 25, 2024, 9:51 PM

#

i'm not an astrophysicist

dull star May 25, 2024, 9:51 PM

#

its simple rocket science, what do you not understand thomas

#

T5 has 512 context length for sure

#

but I don't know if the cogvlm prompts were actually shortened to 77 tokens or not

#

and if they were, does it sabotage the prompt adherence and make the T5 context length less important as it was never trained on prompts longer than 77

#

waow

twin tulip May 25, 2024, 9:53 PM

#

dull star random screenshot go!!!! (this must mean something idk)

that snippet shows they only used 75 tokens for the "c" vector (t5 embedding)

dull star May 25, 2024, 9:54 PM

#

that makes sense

#

so they were shortened

twin tulip May 25, 2024, 9:54 PM

#

77x4096, 77-2 = 75 due to start and end token

#

75 is still a decent length at least

dull star May 25, 2024, 9:56 PM

#

like I'm not gonna use up 512 anytime soon, but like idk ~200 would have been a little more useful or something (longclip has 248)

#

https://github.com/beichenzbc/Long-CLIP

twin tulip May 25, 2024, 9:57 PM

#

the issue is the vram required for cross attention goes up substantially as you increase either/or resolution and embedding size

#

there are several hacks out there that try to deal with it, like localized or sparse attention, or the chunking of the token blocks, they have drawbacks though

dull star May 25, 2024, 9:58 PM

#

I see..

#

#

this is 310 CLIP tokens

#

and 360 gpt2 tokens (idk about T5 tokenizer)

twin tulip May 25, 2024, 10:02 PM

#

chunking seems the be the most popular, which is likely what that is

dull star May 25, 2024, 10:03 PM

#

chunking?

#

also

teal fossil May 25, 2024, 10:05 PM

#

dull star also

Afaik T5 gets the short end of the stick bc they are also still using Clip G & L.

dull star May 25, 2024, 10:05 PM

#

nothing changes the context length of T5 obviously, but idk how the shortened prompts in the dataset

teal fossil May 25, 2024, 10:05 PM

#

And they are testing (apparently) if they can use T5 only instead.

dull star May 25, 2024, 10:05 PM

#

teal fossil Afaik T5 gets the short end of the stick bc they are also still using Clip G & L...

I would destroy clip-L and never let it touch SD3 again and then decrease the strength of clip-G so that its used for styling, just in case T5 makes everything too photoreal

teal fossil May 25, 2024, 10:05 PM

#

Btw T5XXL is not CogVLM, but good for captioning.

teal fossil May 25, 2024, 10:06 PM

#

dull star I would destroy clip-L and never let it touch SD3 again and then decrease the st...

Well, tags still have their purpose. I like to combine tags and semi-natural language.

dull star May 25, 2024, 10:06 PM

#

I would use tags for styling if it helps

#

otherwise I would just use T5 becuase of natural prompting

twin tulip May 25, 2024, 10:09 PM

#

dull star chunking?

splits the prompt into 75/77 long segments

#

T5 has a very long embedding dimension, I iamgine thats why it was used, there's more data there

#

4096 vs 768 or 1024 or 1280 or whatever of common clip models

#

but its not a VLM and wasn't trained on any cross entropy loss with a VIT or anything, its just an encoder/decoder model, like something you'd use for language translation

#

could've just as easily used Llama3 or something else

dull star May 25, 2024, 10:12 PM

#

isn't that decoder only? or is that not an issue?

#

I saw lavi-bridge, which could use decoder only models such as llama2

twin tulip May 25, 2024, 10:27 PM

#

I guess i'd have to noodle on the impact of using a decoder only network, but you can get the features from whatever model I suppose and use that

twin tulip May 25, 2024, 11:05 PM

#

I think a lot of the vlms are just VIT tacked onto (often encoder only) llms with adapters

dreamy sundial May 25, 2024, 11:30 PM

#

#

dry wave May 25, 2024, 11:42 PM

#

twin tulip I think a lot of the vlms are just VIT tacked onto (often encoder only) llms wit...

decoder-only has the disadvantage that the information flow moves to the last tokens

#

in encoder architectures every token gets context information from any other token

#

in decoder only models you have a causal mask, so every token only gets information from the past tokens

#

so "a cat with blue fur" in clip or t5 would have the information about blue fur in the cat token

twin tulip May 25, 2024, 11:44 PM

#

yeah makes sense

dry wave May 25, 2024, 11:44 PM

#

in llama3 in contrast the token "cat" has no further information while the token "fur" contains this information

twin tulip May 25, 2024, 11:45 PM

#

right due to causal mask

dreamy sundial May 25, 2024, 11:45 PM

#

dry wave May 25, 2024, 11:45 PM

#

I would imagine that this makes the cross attention more difficult because the last token contains all the information instead of having all tokens equally

dreamy sundial May 25, 2024, 11:45 PM

#

dreamy sundial

zoom in for more detail

dry wave May 25, 2024, 11:46 PM

#

doesn't mean it wouldn't be possible with llama3, but I guess that's why they prefer decoder architectures like t5

twin tulip May 25, 2024, 11:46 PM

#

yeah just as is would probably not be as efficient

#

I think the vlms are using full self attention on the image tokens prior to the attachment to the llm part

dull star May 26, 2024, 12:14 AM

#

I wonder if in the future we'll get large parameter (lets say, 12B or larger) ternary diffusion transformer models

#

idk which companies would be willing to test it further

#

https://github.com/Lucky-Lance/TerDiT

#

at smaller parameter sizes, it has a massive FID/quality penalty, but it starts to climb back up the larger the parameter size is, whilst retaining low VRAM requirements

#

only problem, the inference time is terrible at larger parameter sizes

#

but damn, the small checkpoint size and only 3GB vram required for a 4.2B model

sick cedar May 26, 2024, 12:40 AM

#

teal fossil And they are testing (apparently) if they can use T5 only instead.

How is that possible? Without Clip?

twin tulip May 26, 2024, 12:48 AM

#

just uses the embeddings from the text model, its a different embedding space but in theory still has contextual meaning

raven fern May 26, 2024, 1:02 AM

#

dull star https://github.com/Lucky-Lance/TerDiT

is there any current way to try TerDit in comfy?

cinder walrus May 26, 2024, 1:45 AM

#

What's best way to use SD3 on a phone? Can use via api with Comfy etc but when I'm on the go what's the best solution currently?

#

Ideally outside of discord and not the stability assistant because it's trash

low stone May 26, 2024, 3:50 AM

#

dull star idk which companies would be willing to test it further

I'd love to try out that large-dit tha they mention in that article. 20 gig image model... we just need to find a hugging face demo of it.

white current May 26, 2024, 8:06 AM

#

SD3 Open Source Weights, when ?

muted dove May 26, 2024, 8:30 AM

#

white current SD3 Open Source Weights, when ?

2 weeks

noble coyote May 26, 2024, 9:51 AM

#

cinder walrus What's best way to use SD3 on a phone? Can use via api with Comfy etc but when I...

Clipdrop.co

verbal epoch May 26, 2024, 9:58 AM

#

Guys SD3 release in May?

dusky thistle May 26, 2024, 10:06 AM

#

verbal epoch Guys SD3 release in May?

yep, only a year to go

finite hollow May 26, 2024, 10:11 AM

#

🙂 we had this with 2.0 already 🙂 month was correct, just not the year 😉

abstract nymph May 26, 2024, 12:55 PM

#

tbh would just be nice to see some communication eugh

cobalt moon May 26, 2024, 1:11 PM

#

abstract nymph tbh would just be nice to see some communication eugh

you can visit Civitai Discord server for communication though

#

here is mostly for art sharing or super-technical discussion

low stone May 26, 2024, 1:26 PM

#

#

hyper detailed, photorealistic, myriad witnesses, frantic crowds panicking, surreal aerated landscape, inter dimensional planetary robotic networking

honest cedar May 26, 2024, 1:36 PM

#

Hi, what's the difference in quality between SD3 and SD3 Turbo?

turbid grotto May 26, 2024, 1:53 PM

#

Is that new? what could this mean?
https://www.linkedin.com/posts/stability-ai_build-microsoftstabilityai-aipartnerships-activity-7199080680226516995-6zPM

Stability AI on LinkedIn: #build #microsoftstabilityai #aipartnersh...

In case you missed the Microsoft Azure announcement at #BUILD yesterday.

Coming soon…… ⏱️

#MicrosoftStabilityAI
#AIPartnerships
#ModelsAsAService

low stone May 26, 2024, 1:54 PM

#

honest cedar Hi, what's the difference in quality between SD3 and SD3 Turbo?

Any tests we'd run now aren't on the final model versions so we'd have to wait for release to know that

low stone May 26, 2024, 1:55 PM

#

turbid grotto Is that new? what could this mean? https://www.linkedin.com/posts/stability-ai_...

Azure has these kinds of models like mistral and OpenAI available in azure as a resource you can provision in your resource group. Looks like stabilities stuff will be available too.

turbid grotto May 26, 2024, 1:57 PM

#

low stone Azure has these kinds of models like mistral and OpenAI available in azure as a ...

oh thanks for explanation

honest cedar May 26, 2024, 2:08 PM

#

low stone Any tests we'd run now aren't on the final model versions so we'd have to wait f...

Hmm and based on what you have seen until now? I'm wondering if it's worth the effort to set up the API to test SD3 Turbo (as it is now).

muted dove May 26, 2024, 2:10 PM

#

honest cedar Hmm and based on what you have seen until now? I'm wondering if it's worth the e...

The last I heard, the Turbo version is worse and not worth wasting credits on.

honest cedar May 26, 2024, 2:20 PM

#

muted dove The last I heard, the Turbo version is worse and not worth wasting credits on.

thanks for this

low stone May 26, 2024, 2:30 PM

#

honest cedar thanks for this

Yeah I haven't tested it. I've only used the regular sd3 on the api. The main sd3 model is so fast via that I never think about speed and want the turbo. Obviously that could change locally.

hallow lion May 26, 2024, 2:30 PM

#

Emad is the hero we didn't know we needed and we didn't ask for.

low stone May 26, 2024, 2:39 PM

#

dull star May 26, 2024, 2:43 PM

#

low stone Azure has these kinds of models like mistral and OpenAI available in azure as a ...

hope this will generate a good amount of income for stability

#

if SD3 comes out, they'll make a super finetuned version of SD3 or SD3 Turbo, like with Core (sdxl turbo) and it will make it a competitive choice

#

especially if they optimize it for stuff like tensorRT, it will decrease the price of the credits

fiery wharf May 26, 2024, 2:53 PM

#

nice jokes you have there

dull star May 26, 2024, 2:53 PM

#

its really funny

low stone May 26, 2024, 2:58 PM

#

dull star hope this will generate a good amount of income for stability

Agreed. They were mentioning overfitting which I think I'm seeing in the results of the current api. Ultra stylized output from a base model is less than ideal.

dull star May 26, 2024, 2:58 PM

#

really?

#

I mean they don't give a seed option

hallow lion May 26, 2024, 2:58 PM

#

why dont they just put a commercial license on all of this -IF you make money . and for personal use free... why not?

#

sdxl sd15

#

all of it

dull star May 26, 2024, 2:59 PM

#

money

#

they wanna keep making models

hallow lion May 26, 2024, 2:59 PM

#

ask musk for money

dull star May 26, 2024, 2:59 PM

#

hmm

hallow lion May 26, 2024, 2:59 PM

#

a few billion is nothign for him and this is up hi salley

dull star May 26, 2024, 2:59 PM

#

if you say so

hallow lion May 26, 2024, 2:59 PM

#

he supports this sort of thing

dull star May 26, 2024, 3:00 PM

#

that would be good

#

well musk invested in openai or whatever

#

SD3 being commercial now is the most logical, as they have opted out a lot of artists

#

so to me it feels less morally incorrect

#

but I would still not sell ai art tbh

fiery wharf May 26, 2024, 3:07 PM

#

hallow lion he supports this sort of thing

but he already has an ai company,why would he buy a company thats full of debt

low stone May 26, 2024, 3:09 PM

#

dull star I mean they don't give a seed option

They do an actually give a seed as an option. I should try that and see how it goes.

dull star May 26, 2024, 3:10 PM

#

oh yeahhhh

#

I haven't been using the api for a long time 💀

hallow lion May 26, 2024, 3:12 PM

#

Hello papa musk, it's me Emad from stability AI. We hear open AI joined the evil empire and backstabbed you but if you are still into freeing AI for the masses we are doing the same and we won't backstab you because we are righteous dudes. So we need money coz we're nice. We have a long track record of putting out free stuff and we are commited to the cause. Drop me ugh us a call and we talk.

#

Thats all it takes a tweet

#

whatshisface sad billionaire didnt think theyd buy minecart when he tweeted about beign fed up with this world

#

but they did

hallow lion May 26, 2024, 3:13 PM

#

fiery wharf but he already has an ai company,why would he buy a company thats full of debt

Coz he is an eccentric billionaire.

#

he doesnt have to buy it, just help out or buy a share in the company... whatever rich people do

fiery wharf May 26, 2024, 3:23 PM

#

hallow lion he doesnt have to buy it, just help out or buy a share in the company... whateve...

SAI gonna be like that hobo on San Francisco asking for spare change,he swears on his mama life he gonna find a job in 2 weeks

honest cedar May 26, 2024, 3:25 PM

#

low stone Yeah I haven't tested it. I've only used the regular sd3 on the api. The main sd...

Sd3 turbo is cheaper, according to the pricing page, that's why I'm considering it

hallow lion May 26, 2024, 3:27 PM

#

fiery wharf SAI gonna be like that hobo on San Francisco asking for spare change,he swears o...

no because SAI has really a good track record.

#

they did amazing things

#

Emad is no hobo!

fiery wharf May 26, 2024, 3:29 PM

#

hallow lion Emad is no hobo!

true,he's been clean for years i swear!

hallow lion May 26, 2024, 3:32 PM

#

CIVITAI SAI comfui etc, incredible what this community did and its for free. It's better than paid products! Could you imagine what people could do if money wasn't a hindrance. If everyone could pour ALL their time and effort into what matters to them and their calling.

#

mind boggling potential

muted dove May 26, 2024, 3:34 PM

#

honest cedar Sd3 turbo is cheaper, according to the pricing page, that's why I'm considering ...

Maybe these help...
#🆕｜sd3 message
#🆕｜sd3 message
#🆕｜sd3 message

#

Earlier posts said how bad it was and not to touch it, so maybe it was improved after then 🤷🏻‍♂️

fiery wharf May 26, 2024, 3:37 PM

#

give them more money and the model will improve

#

two weeks to improve after you pay

hallow lion May 26, 2024, 3:39 PM

#

Whatever is going on with AI it's better than getting involved with anything crypto related... 🤮

#

But it is true based on my experience - NOTHING beats a good model.

#

No inpainting, no face detailing, no loras, no perturbed attention guidance.

fiery wharf May 26, 2024, 3:41 PM

#

hallow lion Whatever is going on with AI it's better than getting involved with anything cry...

well one thing is certain,bitcoin cash lasted more than SAI

noble coyote May 26, 2024, 3:42 PM

#

SD3@ClipDrop - prompt = Vibrant colours, Bold Brush Strokes, Strong Symbolic Imagery.
Deeply Personal, Reflective of Emotional and Physical Struggles.
Mexican Culture, Folklore, Surrealism.
Highly Emotional Depictions of Pain, Suffering, and the Human condition.
Symbolism of The Monkey and the Humming Bird, Symbols of Hope and Duality

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_7.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_6.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_4.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_5.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_3.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_2.png

Vibrant_colours_Bold_Brush_Strokes_Strong_Symbolic_Imagery._Deeply_Personal_Reflective_of_Emotion_1.png

#

This came from a question to ChatGPT4: extemporise the qualities of the art of Frida Kahlo.

hallow lion May 26, 2024, 3:43 PM

#

fiery wharf well one thing is certain,bitcoin cash lasted more than SAI

Well buttcoin is the only one that lasted out of what thousands of meme and shitcoins... not exactly stellar number.

noble coyote May 26, 2024, 3:44 PM

#

SD3@ClipDrop - prompt = photorealistic assassin’s creed cybernetic male assassin in an ivory
electrical-rococo elaborate robe by nexro xiii, light and mysterious, in
superhero pose, light and bright, mysterious, magnificent and cybernetic royal, warrior like, light and mysterious immense details, HD, cinematic lighting, cinematic, epic, photoreal by Riccardo Federici, Frank Frazettaby Bill Sienkiewicz and donato giancola and anders zorn, cinematic, dramatic lighting, rembrandt light

photorealistic_assassins_creed_cybernetic_male_assassin_in_an_ivory__electrical-rococo_elaborate_ro.png

photorealistic_assassins_creed_cybernetic_male_assassin_in_an_ivory__electrical-rococo_elaborate_ro_3.png

photorealistic_assassins_creed_cybernetic_male_assassin_in_an_ivory__electrical-rococo_elaborate_ro_2.png

photorealistic_assassins_creed_cybernetic_male_assassin_in_an_ivory__electrical-rococo_elaborate_ro_1.png

hallow lion May 26, 2024, 3:44 PM

#

and it doesnt work sure you ge tpaid in crypto ok cool well as soon as you need to buy stuff in YOUR location anythign like a shouse a car whatever your government and bank will find out because you ahve to covnert your crypto to real currency and the tax man comes

noble coyote May 26, 2024, 3:45 PM

#

SD3@ClipDrop prompt = Art Nouveau style, face by Anna Dittmann, snake eyes, snake young, large illuminati symbol in the boarder, Celtic knot with pine tree and pine cones, perfect eyes, A painting of a norwegian woman with flowers on her head, botanical art by Pierre-Joseph Redouté, vivid, blond hair, 1920s short dress, trending on deviantart, pop surrealism, detail

Art_Nouveau_style_face_by_Anna_Dittmann_snake_eyes_snake_young_large_illuminati_symbol_in_the_bo.png

Art_Nouveau_style_face_by_Anna_Dittmann_snake_eyes_snake_young_large_illuminati_symbol_in_the_bo_3.png

Art_Nouveau_style_face_by_Anna_Dittmann_snake_eyes_snake_young_large_illuminati_symbol_in_the_bo_2.png

Art_Nouveau_style_face_by_Anna_Dittmann_snake_eyes_snake_young_large_illuminati_symbol_in_the_bo_1.png

hallow lion May 26, 2024, 3:45 PM

#

These are nice but I want to see realism.

#

And good hands.

desert garnet May 26, 2024, 3:46 PM

#

those images are so cool i feel like burning my money rn

noble coyote May 26, 2024, 3:46 PM

#

SD3@ClipDrop prompt = antique damaged portrait war poster, devil,portrait, by Albert Bierstadt, by Andy Warhol, by Annibale Carracci, by Caravaggio Michelangelo Merisi, by Takashi Murakami, Spray Paint, Halfrear Lighting, Soft Lighting, Linen, Posterization

antique_damaged_portrait_war_poster_devilportrait_by_Albert_Bierstadt_by_Andy_Warhol_by_Annibal.png

antique_damaged_portrait_war_poster_devilportrait_by_Albert_Bierstadt_by_Andy_Warhol_by_Annibal_3.png

antique_damaged_portrait_war_poster_devilportrait_by_Albert_Bierstadt_by_Andy_Warhol_by_Annibal_2.png

antique_damaged_portrait_war_poster_devilportrait_by_Albert_Bierstadt_by_Andy_Warhol_by_Annibal_1.png

#

SD3@ClipDrop prompt = a realistic beautiful autumn queen, headshot, close up, night time, autumnal mood, venice carnival, grand guignol, wavy hairstyle, white hair, character concept art, created by victo ngai henri rousseau vladimir kush coles philips elizabeth catlett arief putra john currin alenka sottler itzchak tarkay anita inverarity maxfield parrish peregrine heathcoate tamara de lempicka mads berg isaac maimon iwona lifsches non binary heart connection/detailed modern art style 8k

a_realistic_beautiful_autumn_queen_headshot_close_up_night_time_autumnal_mood_venice_carnival_.png

a_realistic_beautiful_autumn_queen_headshot_close_up_night_time_autumnal_mood_venice_carnival_3.png

a_realistic_beautiful_autumn_queen_headshot_close_up_night_time_autumnal_mood_venice_carnival_2.png

a_realistic_beautiful_autumn_queen_headshot_close_up_night_time_autumnal_mood_venice_carnival_1.png

noble coyote May 26, 2024, 3:49 PM

#

hallow lion These are nice but I want to see realism.

They have excellent, excellent eyes and faces ... so far so good!

desert garnet May 26, 2024, 3:56 PM

#

best eyes i have seen,these folks i tell you,HUGE and BEST hands,they are very very great like our country

calm dew May 26, 2024, 5:17 PM

#

hello

low stone May 26, 2024, 5:28 PM

#

honest cedar Sd3 turbo is cheaper, according to the pricing page, that's why I'm considering ...

so I just tried their sd3-turbo model on their api. doesn't work. returns a 404 not found. sd3 works fine.

#

oh... nevermind, their url is the same i guess, i just have to pass that model in json

abstract nymph May 26, 2024, 5:35 PM

#

cobalt moon you can visit Civitai Discord server for communication though

for communication from stability ai concerning their model?

deep pebble May 26, 2024, 5:37 PM

#

SD3@ClipDrop prompt = masterpiece,best quality,fine_art_parody,realistic,real,solo,multiple_girls,alternate hair length,wet hair,tears,tsurime,white colored eyelashes,looking at viewer,red eyes,narrowed eyes,large breasts,crop top,gothic_lolita,tabi,cross-laced_footwear,demon horns,half middle_finger,smoking,

low stone May 26, 2024, 5:39 PM

#

honest cedar Sd3 turbo is cheaper, according to the pricing page, that's why I'm considering ...

here are 6 images from sd3-turbo instead of sd3. i know this is still the old version of the model, but the quality is WAY muddier. I'd never use this unless I was generating icons or thumbnails or some kind of very clean render artwork. anything stylized just is way too messy.

cunning lintel May 26, 2024, 5:50 PM

#

@honest cedar d3 turbo is in another league, but not in a good way

#

#

#

each time same prompt 2x sd3, 2x sd3 turbo

#

maybe those were a bit unfair, one more this time more suited for sd3 turbo's looks, it's usable for this kinda prompt (cartoon illustration of a woman in a hat holding a gun, digital art, fantasy art, steampunk, redhead, weird west, portrait of lady mechanika, cowgirl )

cunning lintel May 26, 2024, 6:22 PM

#

desert garnet best eyes i have seen,these folks i tell you,HUGE and BEST hands,they are very v...

best eyes i have seen,these folks i tell you,HUGE and BEST hands,they are very very great like our country

bitter hearth May 26, 2024, 6:45 PM

#

#

I put epic battle in the prompt

wild remnant May 26, 2024, 8:02 PM

#

#

cunning lintel May 26, 2024, 8:17 PM

#

A powerful agent, her eyes aglow with an unholy power, stands atop a ruined, gothic spire, as a stormy, apocalyptic landscape unfolds behind her, in the styles of Michael Garmash, Guy Denning, and Olive Cotton
Neg: boring, tranquil, wrong, low quality, photo

#

A resourceful operative, her eyes in a determined gaze, infiltrates a secret society's masquerade ball, surrounded by masked figures and candelabras, in the styles of Michael Garmash, Guy Denning, and Olive Cotton.
Neg: boring, tranquil, wrong, low quality, photo

#

A haunting portrait of a weary agent, her face deathly pale, surrounded by ritualistic symbols and forbidden knowledge, as candles flicker with an otherworldly energy, in the styles of Michael Garmash, Guy Denning, and Olive Cotton.
Neg: boring, tranquil, wrong, low quality, photo

wild remnant May 26, 2024, 8:31 PM

#

strange skiff May 26, 2024, 9:04 PM

#

1

wild remnant May 26, 2024, 10:36 PM

#

spice rain May 26, 2024, 10:58 PM

#

A photorealistic portrait of a 20-year-old South Korean girl radiates beauty with her long, flowing black hair, mesmerizing brown eyes, and captivating smile. She stands at 166 cm tall, with fair skin and a slim, D-cup figure reminiscent of Blackpink's Lisa. Dressed in a white shirt and deep blue jeans, she exudes elegance and charm. The portrait should be a full-body shot, 8k HDR, with high detailed features and a natural, approachable expression, illuminated by soft, golden-hour sunlight.

low stone May 27, 2024, 12:13 AM

#

cunning lintel maybe those were a bit unfair, one more this time more suited for sd3 turbo's lo...

Yeah, I made the content simpler and sd3 turbo definitely did better with it. Prompt: in the style of anime, chibi cute samurai playing video games at a Tokyo game shop, minimalistic

dull star May 27, 2024, 12:22 AM

#

I expect SD3 Turbo to look more stylistic and have less variety

shut kiln May 27, 2024, 2:32 AM

#

sheep standing on hind legs whereing a gas mask looks up in the sky away from a cell phone

drifting oak May 27, 2024, 6:07 AM

#

glif-stablediffusion-3-diffusionmastered-afzxcie5c8a83xoydo05c53z.jpg

noble coyote May 27, 2024, 6:34 AM

#

shut kiln sheep standing on hind legs whereing a gas mask looks up in the sky away from a ...

A sheep standing on its hind legs, wearing a gas mask, looking up in the sky, and away from a cell phone (no cell phone, sorry!)

noble coyote May 27, 2024, 6:39 AM

#

spice rain A photorealistic portrait of a 20-year-old South Korean girl radiates beauty wit...

frigid saffron May 27, 2024, 7:22 AM

#

Any update news for sd3 open ?

pine canopy May 27, 2024, 7:31 AM

#

A striking, azure Lamborghini, sleek and aerodynamic, thunders down a sun-kissed coastal road, its engine roar blending with the crashing waves and salty sea breeze. Majestic seagulls soar overhead, adding a dynamic element to the scene's exhilarating motion.

sullen moss May 27, 2024, 7:43 AM

#

frigid saffron Any update news for sd3 open ?

Sure. 2 weeks 😂

frigid saffron May 27, 2024, 7:44 AM

#

sullen moss Sure. 2 weeks 😂

another 2 weeks from today agony ?

sullen moss May 27, 2024, 7:49 AM

#

happemad

worthy hound May 27, 2024, 8:14 AM

#

https://tenor.com/tibv4emJK3h.gif

Tenor

gusty trail May 27, 2024, 9:54 AM

#

sd3 2weeks edition

cerulean orchid May 27, 2024, 10:18 AM

#

A dimly lit alleyway in Mumbai, with shadows looming ominously in the background, setting the tone for the dark and gritty atmosphere of the film --ar 16:9

#

#🆕｜sd3 A dimly lit alleyway in Mumbai, with shadows looming ominously in the background, setting the tone for the dark and gritty atmosphere of the film.

potent idol May 27, 2024, 11:08 AM

#

#🆕｜sd3 How to generate?

frozen patrol May 27, 2024, 12:18 PM

#

futuristic headphone advertising

delicate hollow May 27, 2024, 12:42 PM

#

where can i generate images

wild remnant May 27, 2024, 12:54 PM

#

hallow lion May 27, 2024, 2:45 PM

#

Two weeks? Maybe one?

#

who cares I don;t even know if I cna run this on my machine. I am scared. What ifts 25 gigs for the model and it takes 8 minutes to generate one pic

woeful spindle May 27, 2024, 3:24 PM

#

hallow lion who cares I don;t even know if I cna run this on my machine. I am scared. What i...

they explicity said that it can run on a 24 gig rtx 4090 without memory overflow and 4090 can generate 1 1024×1024 image in 30 secs

#

I dont remember step count but it was about 30-45 iirc

woeful spindle May 27, 2024, 3:25 PM

#

woeful spindle they explicity said that it can run on a 24 gig rtx 4090 without memory overflow...

And that's (probably) with T5

hallow lion May 27, 2024, 3:40 PM

#

sullen moss <:happemad:1012407616565149706>

well 24 gigs vram is a lot

#

diffusionhand

#

theres a cat here with 4 (send help) what is he gonna do?

sullen moss May 27, 2024, 3:42 PM

#

5090 with 32 gig soon

primal summit May 27, 2024, 4:13 PM

#

800 million model Can it produce good images?

river umbra May 27, 2024, 4:15 PM

#

Do you know how to prompt image here

noble coyote May 27, 2024, 4:24 PM

#

SD3@ClipDrop

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_3.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_4.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_5.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_7.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_15.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_20.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_23.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_25.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_26.png

A_hedgehog_is_holding_a_sign_which_says__We_should_be_kind__While_there_is_still_time__The_hedgeh_27.png

#

SD3@ClipDrop

The_Mower__The_all-terrain_vehicle_stalled_twice_kneeling_I_found__A_hedgehog_jammed_up_against_2.png

The_Mower__The_all-terrain_vehicle_stalled_twice_kneeling_I_found__A_hedgehog_jammed_up_against_3.png

The_Mower__The_all-terrain_vehicle_stalled_twice_kneeling_I_found__A_hedgehog_jammed_up_against.png

The_Mower__The_mower_stalled_twice_the_man_kneeling_he_found__A_hedgehog_jammed_up_against_the_2.png

The_Mower__The_mower_stalled_twice_the_man_kneeling_he_found__A_hedgehog_jammed_up_against_the_3.png

The_Mower__The_mower_stalled_twice_the_man_kneeling_he_found__A_hedgehog_jammed_up_against_the_.png

#

SD3@ClipDrop

teal fossil May 27, 2024, 5:24 PM

#

Btw guys with TagGui you can already pretty easily play around with captioning images with T5-Xxl and it's not bad.

prisma cypress May 27, 2024, 7:58 PM

#

uhmmm does anyone know when with SD3 model be available on hugging face

gusty trail May 27, 2024, 8:11 PM

#

2 more weeks (until it released

woeful spindle May 27, 2024, 8:12 PM

#

there's gonna be a time where it's exactly 2 weeks before SD3's release

#

probably

fiery dawn May 27, 2024, 8:13 PM

#

woeful spindle there's gonna be a time where it's exactly 2 weeks before SD3's release

When SD4 API will be released.

#

Only then we will get SD3 checkpoints.

#

Until then it will be "two more weeks" gaslighting like Emad did a month ago.

woeful spindle May 27, 2024, 8:16 PM

#

Hope stability wont go bankrupt by then

#

Someone's gotta pay that gpu rent

fiery dawn May 27, 2024, 8:17 PM

#

woeful spindle Hope stability wont go bankrupt by then

Impossible. I'm sure their API's are used by the corporations that guarantee majority of the costs.

woeful spindle May 27, 2024, 8:18 PM

#

They have limited time until Google's imagen 3 and GPT-4o's image generation roll out

#

Those two have all the things SD3 promises

errant dust May 27, 2024, 8:45 PM

#

woeful spindle They have limited time until Google's imagen 3 and GPT-4o's image generation rol...

There are no planned updates of Dall-E 3 in the works that I have heard. And frankly I find Copilot's implementation less of a pain that ChatGPT's, since the latter has all manner of censorship that Copilot does not.

#

If you look at the paper on SD3, they consider the biggest rival to be Ideogram

past flame May 27, 2024, 8:52 PM

#

Plot twist: It'll be unstable

woeful spindle May 27, 2024, 9:23 PM

#

errant dust There are no planned updates of Dall-E 3 in the works that I have heard. And fra...

https://openai.com/index/hello-gpt-4o/

#

They show some examples under "explorations of capabilities" title

#

New model has near-perfect text generation

#

those images in these examples are definitely cherrypicked

#

but it is still impressive

#

I don't think they call it Dall-E 4

acoustic kite May 27, 2024, 9:28 PM

#

woeful spindle New model has near-perfect text generation

only problem is that it hasn't started to roll out yet

woeful spindle May 27, 2024, 9:29 PM

#

#

I just generated that with gpt4o

dull star May 27, 2024, 9:29 PM

#

not bad

woeful spindle May 27, 2024, 9:29 PM

#

it's not readable

#

but not looking like the old models

dull star May 27, 2024, 9:30 PM

#

hwo did you generate with it

woeful spindle May 27, 2024, 9:30 PM

#

they've definitely rolled it out

sullen moss May 27, 2024, 9:30 PM

#

Well, SORA can also generate images. Paradoxically, to advance DALL-E, they just need to remove the filters 😂

errant dust May 27, 2024, 9:32 PM

#

woeful spindle https://openai.com/index/hello-gpt-4o/

Maybe it is for a future rollout. I am a subscriber, have GPT4o, and can tell you it is no different than GPT4 for image generation that I can tell. Not even for text.

woeful spindle May 27, 2024, 9:33 PM

#

woeful spindle they've definitely rolled it out

if we take the fact that they generated an enormous number of images just to cherrypick the best one to show it as an example on the website into account, they probably have rolled it out

errant dust May 27, 2024, 9:33 PM

#

For text accuracy it is miles behind Ideogram

woeful spindle May 27, 2024, 9:33 PM

#

errant dust For text accuracy it is miles behind Ideogram

I've tried it few times

raven fern May 27, 2024, 9:33 PM

#

soontm happemad

woeful spindle May 27, 2024, 9:34 PM

#

I dont know if it's just me but it looked like too cartoon-ish

dull star May 27, 2024, 9:34 PM

#

ideogram is still the goat imo

#

I hope finetuned SD3 can get close

errant dust May 27, 2024, 9:34 PM

#

I am not here to wax poetic on ideogram on all things, since they all have their weaknesses and strengths. Ideogram as well, but if you want text, Ideogram is king.

#

As to looking cartoonish, the text, it is a matter of knowing how to engineer the prompt. This is still a factor today.

dull star May 27, 2024, 9:36 PM

#

thankfully SD3 is good enough for Movie Titles

raven fern May 27, 2024, 9:36 PM

#

lol

woeful spindle May 27, 2024, 9:36 PM

#

errant dust As to looking cartoonish, the text, it is a matter of knowing how to engineer th...

true

raven fern May 27, 2024, 9:37 PM

#

dull star thankfully SD3 is good enough for Movie Titles

it's missing the credits at the bottom

errant dust May 27, 2024, 9:38 PM

#

There are things for example where Midjourney can do things none of them can yet. But SD traditionally can compete soon enough once you get specialized Loras, so my comment is on the vanilla experience

#

SD3 overall is super exciting, don't get me wrong. I'm just underwhelmed for now at the cost. $19 for 300 images in a month? RLY? I can imagine spending the 9 bucks for a test run of 130 odd images, but never for a regular experience.

#

You can get 60 fast for free with Copilot (Dall-E 3) per day, and more if you can wait a bit, and same for Ideogram

dull star May 27, 2024, 10:00 PM

#

they need to optimize it first for tensorRT and stuff, and you have to consider that they need the money
but yeah its very expensive for what they offer

errant dust May 27, 2024, 10:02 PM

#

They can need the money and will have me nodding my head in sympathy, but that doesn't mean the end user/consumer is going to opt for paying more for less

cunning lintel May 27, 2024, 10:23 PM

#

I guess SAI can still lean on the stable diffusion brand and its promise of (local) generation with superior tooling. The main reason I'm interested in SD3 is the promise of much better tooling combined with state of the art generation. It's why i'm now paying a little for playing with SD3, curiosity to see how well it performs. But if the tooling (control nets, inpainting (read on twitter there was no such thing as SD3 inpainting yet, ouch), style transfer, regional prompting, customizable guidance, weighted/mixed prompts) turns out to not be there, it'll be more and more waning interest in SD3 for me. If it's just texttoimage, might as well use something that seems more capable.
But I'm no pro or heavy user, whether I pay or not, is of no consequence to SAI, they should cater to heavy professional use, but I'm afraid for those the tooling story is much the same if all you need is texttoimage stock-footage en masse, there's plenty other options and at the current price point SD3 is not competitive at all.

lucid swift May 27, 2024, 11:00 PM

#

woeful spindle

can you generate other stuff maby you realy have a new model

dull star May 27, 2024, 11:03 PM

#

cunning lintel I guess SAI can still lean on the stable diffusion brand and its promise of (loc...

I only know of controlnet and fine tuning that were promised

#

and even then, we don't know which

#

openpose, canny and depth would be more than enough

lucid swift May 27, 2024, 11:06 PM

#

"A first person view of a robot typewriting the following journal entries:

yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?

the text is large, legible and clear. the robot's hands type on the typewriter."

#

idogram seems worse for this example

hallow lion May 28, 2024, 2:48 AM

#

raven fern <:soontm:1004416662373675119> <:happemad:1012407616565149706>

Two weeks! diffusionhand

low stone May 28, 2024, 4:37 AM

#

svd with an sd3 raw image

#

noble coyote May 28, 2024, 5:58 AM

#

errant dust There are things for example where Midjourney can do things none of them can yet...

SD3@Clipdrop.co - $10/month for 10 prompts/24hrs (4 pictures each prompt) = 1 ,200 pictures/month for $10

woeful spindle May 28, 2024, 6:31 AM

#

noble coyote SD3@Clipdrop.co - $10/month for 10 prompts/24hrs (4 pictures each prompt) = 1 ,2...

I mean you can technically open up a new account with a temporary email and generate 3 sd3 images for free (dont do that pls sai needs money)

noble coyote May 28, 2024, 8:23 AM

#

woeful spindle I mean you can technically open up a new account with a temporary email and gene...

1200 images/month much much better than ArtySan's 150!

cursive mist May 28, 2024, 8:32 AM

#

I dreamt I was in the place of Fry from futurama, and when I asked Bender if SD3 was finally out, he replied “two more weeks”.

noble coyote May 28, 2024, 9:05 AM

#

cursive mist I dreamt I was in the place of Fry from futurama, and when I asked Bender if SD3...

"If you know how long is a piece of string?!" then that will = the weight for the waits.

icy drift May 28, 2024, 9:21 AM

#

Remember February when everyone was like, "Don't need Stable Cascade since we're getting SD3 in two weeks"... and then proceeded to waste half a year completely ignoring the model that can literally generate 4K images in 30 seconds because SD3 would be better.

icy drift May 28, 2024, 9:40 AM

#

My desktop background is still made by SC, because I only update it when we get the weights for a new model. (And no, CosXL and Hunyuan DiT don't count at all for subtle and ethereal reasons.)

primal summit May 28, 2024, 10:27 AM

#

icy drift Remember February when everyone was like, "Don't need Stable Cascade since we're...

Indeed, Stable Cascade has great potential, but has been overlooked despite its strength

icy drift May 28, 2024, 10:30 AM

#

Maybe once SD3 releases and people realize how much RAM it costs to get large renders out of it, SC will suddenly look more interesting. I could just be wishing though...

primal summit May 28, 2024, 10:36 AM

#

icy drift Maybe once SD3 releases and people realize how much RAM it costs to get large re...

I doubt that. Because the 2 billion model is as powerful as the SDXL, and there is also the 800 million model, which is smaller in size than the SD1.5 and is most likely much better than it.
Maybe if sd3 isn't released they might look around for sc

icy drift May 28, 2024, 10:38 AM

#

primal summit I doubt that. Because the 2 billion model is as powerful as the SDXL, and there ...

Hmm. Well, on the other hand, if it turns out to be better than Stable Cascade, I will have zero complaints. Just at the moment, I feel very frustrated that open-source AI has basically been dragging its feet because no one wants to work on anything that will be obsolete when SD3 releases.

primal summit May 28, 2024, 10:41 AM

#

icy drift Hmm. Well, on the other hand, if it turns out to be better than Stable Cascade, ...

I agree with you . But perhaps it was planned to delay SD3 from the beginning, as it does not make sense that they released SC first, and most likely it was for a promotional purpose.

#

Looks like we'll be stuck with the huge SDXL model

icy drift May 28, 2024, 10:42 AM

#

primal summit I agree with you . But perhaps it was planned to delay SD3 from the beginning, a...

I'm not sure what that means, but I get the sense SAI has lost control of a lot of things since about the end of last year.

icy drift May 28, 2024, 10:42 AM

#

primal summit Looks like we'll be stuck with the huge SDXL model

What's that? Just normal SDXL?

primal summit May 28, 2024, 10:43 AM

#

icy drift What's that? Just normal SDXL?

?

icy drift May 28, 2024, 10:43 AM

#

primal summit ?

What's the "huge SDXL model" we're stuck with? Do you just mean to say that SDXL is huge compared to 1.5? Or did someone do a mergekit / mixture-of-experts or something that I don't know about?

primal summit May 28, 2024, 10:44 AM

#

SDXL has 6 billion parameters so it consumes a lot of resources

icy drift May 28, 2024, 10:45 AM

#

primal summit SDXL has 6 billion parameters so it consumes a lot of resources

Oh you just meant compared to 1.5.

primal summit May 28, 2024, 10:45 AM

#

icy drift Oh you just meant compared to 1.5.

yes

#

But imagine if SD3 was released in the form of 2 billion and it is as powerful as SDXL. Life will be easier

icy drift May 28, 2024, 10:46 AM

#

primal summit But imagine if SD3 was released in the form of 2 billion and it is as powerful a...

What size is 1.5?

primal summit May 28, 2024, 10:47 AM

#

There is an 8 billion model, but I think 90 percent or more will not be able to operate it or will find it not worth the effort.

primal summit May 28, 2024, 10:48 AM

#

icy drift What size is 1.5?

I think 1 billion or 1.5 billion. I forgot, but in this range

#

But the parameters are not everything, but also the structure of the model and the text encoder play the most important role

icy drift May 28, 2024, 10:49 AM

#

Perplexity

#

Sounds about right.

#

What about the VAE though.

primal summit May 28, 2024, 10:52 AM

#

icy drift What about the VAE though.

VAE I don't know but it is only a decoder and plays a very minor role

icy drift May 28, 2024, 10:52 AM

#

primal summit VAE I don't know but it is only a decoder and plays a very minor role

VAE is where the VRAM crashes always happen. 4K unet is just fine.

primal summit May 28, 2024, 10:53 AM

#

icy drift VAE is where the VRAM crashes always happen. 4K unet is just fine.

Yes, I agree, but there is a tiling method, but it takes longer

icy drift May 28, 2024, 10:54 AM

#

Perplexity fail oh well.

#

I'm sure the info is out there somewhere.

edgy kelp May 28, 2024, 10:54 AM

#

primal summit SDXL has 6 billion parameters so it consumes a lot of resources

SDXL is suppsed to be 3,6B, excluding the text encoders obviously

#

My guess is that a lot of people will use the smallest SD3, just as most people are still using SD1.5 instead of XL

icy drift May 28, 2024, 10:57 AM

#

A model the size of SD1.5 with the power of SDXL would be a huge step-up. Wouldn't be really useful until all the controlnets etc were trained though.

#

I bet that will only take 2-3 months.

#

After SD3 releases in 2 weeks.

edgy kelp May 28, 2024, 10:58 AM

#

Devs said that 800M SD3 will be more powerful than base 1.5 (despite 1.5 has slightly more than 800m parameters), that's good enough already for to play around with it

icy drift May 28, 2024, 10:59 AM

#

I'm working on a painting app / trying to use AI to do useful work, so I'm really not interested unless it can do useful stuff. For now, that 100% requires multiple controlnets.
https://github.com/QuintessentialForms/ParrotLUX

GitHub

GitHub - QuintessentialForms/ParrotLUX: Painting App for Open-Sourc...

Painting App for Open-Source AI. Contribute to QuintessentialForms/ParrotLUX development by creating an account on GitHub.

edgy kelp May 28, 2024, 11:01 AM

#

Either way you'd need to wait the controlnets, but having the smallest SD3 to work with should anyway cost you less electricty and power... which sounds convenient

hardy merlin May 28, 2024, 11:04 AM

#

so any idea about sd3 release eta or something? I just join the channel and looking for some good news.

primal summit May 28, 2024, 11:05 AM

#

hardy merlin so any idea about sd3 release eta or something? I just join the channel and look...

Currently, everything is unknown to my knowledge

icy drift May 28, 2024, 11:05 AM

#

Two weeks from [insert current day here].

primal summit May 28, 2024, 11:06 AM

#

icy drift Two weeks from [insert current day here].

? Did they announce anything?

dull star May 28, 2024, 11:10 AM

#

edgy kelp Devs said that 800M SD3 will be more powerful than base 1.5 (despite 1.5 has sli...

I think so

#

same with 2B beating SDXL, despite it being smaller than SDXL (3.5B)

#

and 8B is just undertrained 😔

icy drift May 28, 2024, 11:12 AM

#

You can actually just calculate the release date using this handy javascript function.

var SD3 = new Date();
SD3.setDate( SD3.getDate() + 14 );
console.log( SD3.toISOString() );

low inlet May 28, 2024, 11:16 AM

#

dull star and 8B is just undertrained 😔

How much B is in the API right now sd3 ?

#

is it the 8b or the 3.5b ?

dull star May 28, 2024, 11:17 AM

#

I think the API uses 8B

icy drift May 28, 2024, 11:17 AM

#

I think I remember hearing it was an early train of the 8b model at one point. Without T5.

dull star May 28, 2024, 11:17 AM

#

Well we can't train T5 though can we

low inlet May 28, 2024, 11:17 AM

#

because i'm wondering is there is any chance that the one at the api gonna get any better ?

dull star May 28, 2024, 11:17 AM

#

isn't it frozen

low inlet May 28, 2024, 11:17 AM

#

or that's it's limits ?

#

because it can't render hands or legs correctly most of the time

icy drift May 28, 2024, 11:18 AM

#

SD only gets amazing once the community fine-tunes it.

low inlet May 28, 2024, 11:18 AM

#

true

icy drift May 28, 2024, 11:18 AM

#

And then you need specialized models and controlnets to get production-quality usable stuff.

dull star May 28, 2024, 11:18 AM

#

low inlet because i'm wondering is there is any chance that the one at the api gonna get a...

if they train it further

#

I know that finetunes will help

icy drift May 28, 2024, 11:21 AM

#

low inlet because i'm wondering is there is any chance that the one at the api gonna get a...

It might be hard to notice if it had gotten better. Quality always depends on your prompts.

dull star May 28, 2024, 11:21 AM

#

I wish 8B Loras will be possible with 24GB, but I'm doubtful

icy drift May 28, 2024, 11:21 AM

#

dull star I wish 8B Loras will be possible with 24GB, but I'm doubtful

Isn't that just a fine-tune?

dull star May 28, 2024, 11:22 AM

#

yes Loras are finetunes

#

like modular finetunes, you can use it with models

#

and its less VRAM intensive, etc

icy drift May 28, 2024, 11:22 AM

#

dull star yes Loras are finetunes

I thought loras were lower-rank. Fewer parameters than a full model fine-tune.

dull star May 28, 2024, 11:22 AM

#

yes

#

but even that will require a lot of VRAM if we would try 8B

low inlet May 28, 2024, 11:29 AM

#

icy drift It might be hard to notice if it had gotten better. Quality always depends on yo...

not just my prompts it depends on the encoder

#

because 1.5 1.6 and 2 is different encoder not a lot of people using sd 2

#

it's sd 1.5 and sdxl

sterile pendant May 28, 2024, 11:30 AM

#

icy drift I thought loras were lower-rank. Fewer parameters than a full model fine-tune.

They are, with most in the 8-128 dim range. Still takes a shitload of vram to train them. Doras look promising. Saw that even with 16 8 dim on them, they are on par or better than loras with way higher dimensional sizes. Like as good as a 128. If that's the case, then it lessens the vram training requirements

#

Even still, if you're working with images in the 1024² range, then for an 8B model, it's probably going to take 32-48gb vram to train them with even just the clip encoders and no t5

#

correction, even as low as 8 dimensions. here's a decent writeup about it all that i saw the other day: https://sebastianraschka.com/blog/2024/lora-dora.html

Sebastian Raschka, PhD

Improving LoRA: Implementing Weight-Decomposed Low-Rank Adap...

Low-rank adaptation (LoRA) is a machine learning technique that modifies a pretrained model (for example, an LLM or vision transformer) to better suit a spec...

dry wave May 28, 2024, 12:12 PM

#

just because the model is 8b doesn't mean you have to train all 8b parameters

#

in sdxl you often achieve same results when only training the cross attention layers than when training both. Similarly, you don't have to train the down-layers of the unet in sdxl

#

usually our loras are several times too large for no reason

#

similarly, you can train only a part of the 8b model and you will be fine

dull star May 28, 2024, 12:27 PM

#

well I hope that will work for 24GB and less

dry wave May 28, 2024, 12:34 PM

#

I would say that's just a question of proper gradient checkpointing. We just have to wait for someone implementing efficient training

finite hollow May 28, 2024, 12:51 PM

#

do you guys use stable cascade at all ?

dull star May 28, 2024, 12:53 PM

#

used it for a little while

#

its good, but doesn't match what I want

#

and the results are smooth

#

as long as you aren't looking for super photorealistic images, the model makes nice and clean images

finite hollow May 28, 2024, 12:55 PM

#

can you show some of the nicer pictures you made with it?

dull star May 28, 2024, 1:02 PM

#

uhhh

#

didn't really save any I think

muted dove May 28, 2024, 1:08 PM

#

finite hollow can you show some of the nicer pictures you made with it?

These were all made using Cascade

#

finite hollow May 28, 2024, 1:11 PM

#

the white cloth girl portrait is nice

#

the rest has a strong elder-scrolls touch

#

dull star May 28, 2024, 1:13 PM

#

yeah these look SUPER clean

finite hollow May 28, 2024, 1:13 PM

#

sterile pendant May 28, 2024, 1:16 PM

#

dry wave just because the model is 8b doesn't mean you have to train all 8b parameters

It takes ~12gb vram to train an sdxl lora with 128 ranks with both clip encoders+unet in koyha or one trainer. I'm not even talking about dreambooth training, that takes far more vram. So to train a rank 128 lora on 8b model, it's going to be faaaaaar more.

dull star May 28, 2024, 1:23 PM

#

guess I'll have to train 0.0000000000001 ranks then

#

😔

muted dove May 28, 2024, 2:11 PM

#

finite hollow can you show some of the nicer pictures you made with it?

Also Cascade

sterile pendant May 28, 2024, 2:27 PM

#

dull star guess I'll have to train 0.0000000000001 ranks then

Well that's why I brought up the Dora thing. Also since sd3 doesn't use a unet, it could potentially take far more or far less vram to lora train per "billion model parameters" if that makes sense. It's likely also going to take some time for people to get the tooling up and running for it as well

#

I haven't looked into that aspect much yet, so I can't give you an educated guess on how demanding the training will or won't be. Just using sdxl training as a reference since I've trained dozens of loras for it

dull star May 28, 2024, 2:30 PM

#

sterile pendant Well that's why I brought up the Dora thing. Also since sd3 doesn't use a unet, ...

I wonder if we can use Qlora, since its more transformer based

#

then again, how will the quantization go then

low stone May 28, 2024, 2:33 PM

#

muted dove These were all made using Cascade

Apparently sd3 does a great job with these miniature scenes as well. 🙂

muted dove May 28, 2024, 2:34 PM

#

low stone Apparently sd3 does a great job with these miniature scenes as well. 🙂

A good WS there too!

dry wave May 28, 2024, 2:42 PM

#

sterile pendant It takes ~12gb vram to train an sdxl lora with 128 ranks with both clip encoders...

you don't need dim 128

#

neither for sdxl

#

even less for sd3

#

think of it as you want to train the method a new concept. The size of the concept don't necessarily scale with model size

#

in particular, if you only have a few megabytes of training images, training a gigabyte lora is a rather dumb idea anyways

low stone May 28, 2024, 2:44 PM

#

muted dove A good WS there too!

I was thinking it's because it was sdxl refined, but this is sd3 raw and it indeed has him.

sterile pendant May 28, 2024, 3:09 PM

#

dry wave you don't need dim 128

Oh I don't train that high ever, it's just some superstitious thing a lot of "guides" lead people into thinking they need to use, so most people use it anyways. I normally just do 16 or 32, but it depends on what you're training and how you're training.

sterile pendant May 28, 2024, 3:10 PM

#

dry wave even less for sd3

We don't actually know yet, but if it's like training llms that the architecture is based around(dit), then it's still going to take some hefty resources to train correctly

#

Llms are a lot more forgiving than image based generation

rain palm May 28, 2024, 3:58 PM

#

little quarry https://tenor.com/view/2week-countdown-14days-more-waiting-gif-13525316

still 2 weeks ay?

little quarry May 28, 2024, 3:58 PM

#

2 more weeks

rain palm May 28, 2024, 3:58 PM

#

aight.

teal fossil May 28, 2024, 5:24 PM

#

So did anyone test the latest API (or whatever)? How does it compare?

dull star May 28, 2024, 5:29 PM

#

latest api?

#

did it change?

rain current May 28, 2024, 6:26 PM

#

icy drift You can actually just calculate the release date using this handy javascript fun...

agony

lucid swift May 28, 2024, 6:28 PM

#

icy drift Hmm. Well, on the other hand, if it turns out to be better than Stable Cascade, ...

i know of at least 2 promosing stable cascade projects

lucid swift May 28, 2024, 6:29 PM

#

primal summit SDXL has 6 billion parameters so it consumes a lot of resources

no it has not 6b parameters

icy drift May 28, 2024, 6:30 PM

#

lucid swift i know of at least 2 promosing stable cascade projects

Please share. I know it's been mostly ignored, but I managed to find at least one youtuber training loras.

lucid swift May 28, 2024, 6:30 PM

#

edgy kelp Devs said that 800M SD3 will be more powerful than base 1.5 (despite 1.5 has sli...

ther is also a stable cascade model that has 1b parameters and it will also be more powerfull then 1.5

icy drift May 28, 2024, 6:31 PM

#

lucid swift ther is also a stable cascade model that has 1b parameters and it will also be m...

Link? Also what SC projects?

lucid swift May 28, 2024, 6:32 PM

#

icy drift Link? Also what SC projects?

one group is making a furry model and the other group/person is making a anime model and both aver very big datasets. like 6m+ images

#

these two are from the anime finetune but its still not done.

icy drift May 28, 2024, 6:34 PM

#

Oh finetunes. I could use an anime finetune if there was a lineart controlnet.

finite hollow May 28, 2024, 6:35 PM

#

lucid swift May 28, 2024, 6:36 PM

#

icy drift Oh finetunes. I could use an anime finetune if there was a lineart controlnet.

it has something similar to lineart. where it sees the edges of the image

icy drift May 28, 2024, 6:37 PM

#

lucid swift it has something similar to lineart. where it sees the edges of the image

You can't draw a canny map.

dreamy sundial May 28, 2024, 7:21 PM

#

raven fern May 28, 2024, 7:22 PM

#

finite hollow

she is sitting in the middle of the car with a seat belt on, where is that seat belt connected to from the middle? LOL

finite hollow May 28, 2024, 7:40 PM

#

its one of the new mercedes 🙂

raven fern May 28, 2024, 8:03 PM

#

kek

finite hollow May 28, 2024, 8:25 PM

#

lucid swift May 28, 2024, 8:27 PM

#

icy drift You can't draw a canny map.

https://huggingface.co/Disty0/sotediffusion-wuerstchen3-alpha3

Disty0/sotediffusion-wuerstchen3-alpha3 · Hugging Face

low stone May 28, 2024, 8:44 PM

#

lucid swift https://huggingface.co/Disty0/sotediffusion-wuerstchen3-alpha3

anime looks amazing with the wider color gamut

hallow lion May 28, 2024, 8:45 PM

#

dull star well I hope that will work for 24GB and less

Make it 12 gigs or less 😄

finite hollow May 28, 2024, 8:47 PM

#

remote holly May 28, 2024, 8:48 PM

#

I am learning the word "soon" in any language since sd3 release day 1 :
in catalan : aviat

hallow lion May 28, 2024, 8:53 PM

#

in two weeks is what in catalan?

#

What a weird situation w ehave here. Cascade has been out for 6 months. We know its a huge imrpovement on SDXL. It works on current hardwares... I t makes huge images. It's also VERY fast...Yet here we are in two weeks... Waiting. Can you image what cascade would be in 6 months if it was embraced at leats half as much as sdxl...

#

What are waiting for anywya more than half of us wotn even be able to run this thing locally.

#

So sad.

#

People always want what they can't have and ignore what they have.

#

So of course they closed the channel even, nobody cared. this s all our fault

#

So lets wait then forever

#

for nothing

dull star May 28, 2024, 9:16 PM

#

hallow lion What a weird situation w ehave here. Cascade has been out for 6 months. We know ...

very fast? really?

cunning lintel May 28, 2024, 9:16 PM

#

cascade has one limitation similar to SDXL: prompt understanding. I think it's mostly the promise of a model that will soon be available and that improves on that aspect is what invalidates further work on cascade (and sdxl), the difference being that lots of efforts/research on sdxl started earlier and are only recently published.

hallow lion May 28, 2024, 9:17 PM

#

It iwas very fast

#

faster waaaaay faster than sigma

dull star May 28, 2024, 9:17 PM

#

I remember cascade having very average speed

hallow lion May 28, 2024, 9:17 PM

#

and same or better results

dull star May 28, 2024, 9:17 PM

#

than sigma? maybe

#

better results yes

#

just not smarter

hallow lion May 28, 2024, 9:17 PM

#

yes

#

so

dull star May 28, 2024, 9:17 PM

#

speed is questionable

cunning lintel May 28, 2024, 9:17 PM

#

But just as we now still see sd1.5 research published, i'm sure sdxl wil be there to stay for a long while

hallow lion May 28, 2024, 9:17 PM

#

wheres the controlnets? and why no one refines it

#

lol

#

whwres the cascade loras

dull star May 28, 2024, 9:18 PM

#

cunning lintel But just as we now still see sd1.5 research published, i'm sure sdxl wil be ther...

if 2B fails to replace

cunning lintel May 28, 2024, 9:18 PM

#

cascade is great for what it is

dull star May 28, 2024, 9:18 PM

#

if T5 seems to be too good to not use, and it will be a hassle to load/use, SDXL is staying

hallow lion May 28, 2024, 9:18 PM

#

Yes it is great hence its amazing how underrated it is

#

sdxl is good too yes

#

but cascade has greate rpotential

cinder junco May 28, 2024, 9:37 PM

#

I didn’t mess with it much, but got frustrated with cascade. It LOVES flat backgrounds. Like, it lacks creativity. If you ask for a subject, it won’t build a scene around it, just slap it on a blue or gray background.

remote holly May 28, 2024, 9:40 PM

#

hallow lion in two weeks is what in catalan?

I dont speaked catalan since long time , i am not sure but i think is "en dues setmanas"

remote holly May 28, 2024, 9:42 PM

#

hallow lion but cascade has greate rpotential

Cascade deserve more atencion and fine tunes because is hard to find a good prompt on base model

lucid swift May 28, 2024, 9:48 PM

#

remote holly Cascade deserve more atencion and fine tunes because is hard to find a good prom...

cascade understands prompts diffrently. its trained more for natural laguage and longer prompts

remote holly May 28, 2024, 9:57 PM

#

lucid swift cascade understands prompts diffrently. its trained more for natural laguage and...

ah, you teach me something, I didn't know this subtlety

lucid swift May 28, 2024, 9:58 PM

#

remote holly ah, you teach me something, I didn't know this subtlety

🤝

abstract nymph May 28, 2024, 10:00 PM

#

heyyyy uh, any news?

#

https://tenor.com/view/you-got-any-gif-26357631

Tenor

lucid swift May 28, 2024, 10:04 PM

#

remote holly ah, you teach me something, I didn't know this subtlety

stable cascade can do some good stuff.

remote holly May 28, 2024, 10:06 PM

#

I love the second

low stone May 28, 2024, 10:17 PM

#

Ultra-realistic 8K image of will smith in an exploded view. The components should be meticulously detailed and appear to float against a black background, highlighting their complexity and precision craftsmanship, hyper-realistic detail

cunning lintel May 28, 2024, 10:27 PM

#

oh i like, let's put will smith in an earlier prompt i borrowed

ultra-detailed photo of a shattered sculpture made of rose quartz depicting will smith, full body enlarged, ((pink glitter explosion)), side view, motion effects, ((shattering sculpture)), colored crystal particles floating as the sculpture breaks into many tiny pieces, studio lights, ultra sharp focus, high speed photo, Mschiffer art, soft colors,

#

giraffe confident expression, pixar style, expression