#images-discussions


dim cradle
#

happy creating 🎊

acoustic idol
#

Thank you very much!

late blade
#

It means that the tool is unavailable or unreachable at the moment. This happens when DALL-E has a problem, when an update is going on, or when the data center hosting your chat instance has problems with its DALL-E implementation. It can possibly also mean network problems on your side, so the message wasn't sent properly.

#

This also means it's not necessarily reflected on the OpenAI Status website.

acoustic idol
#

Awesome, that's the best UX design... using a vague message that can mean any one of a half dozen different things. I wonder why more services don't do that.

#

(But also, thanks for explaining it)

late blade
uncut haven
#

Does DALL-E have a weakness with straight lines? I remember in the past, DALL-E successfully drew complex structures (like many buildings and bridges connected), but nowadays it struggles to draw straight things. Even bridges are distorted in midair.

#

I mean DALL-E 3

uncut haven
#

Thanks. I might try using API if I need to...

haughty spruce
#

quality is extremely bad look at all the shapes. it was better like 2-3 months ago.

mild basin
#

Go push star (⭐) on the thread I linked before, I hope they revert this as soon as possible

mild basin
dim cradle
#

i think we'll get a new model in the next 11 days

#

so it will all be moot

#

it will be sad the day they do though. no more arguing over if there is some different model for the api than browser 🥹

runic granite
#

rip off

late blade
#

I did about 300 images with Pro today and just hit the limit cap for daily

#

so, next obvious thing, team account

dim cradle
#

you should have no, no, theres no limit 🎚️ with pro

#

you get Unlimited voice chat right? that has to be more expensive for them than dalles

late blade
#

There's no explicit mention of DALL-E, which makes me think one of the announcements is the long-anticipated 4o integration of image generation, but I could be wrong.

dim cradle
#

oh i hope that is the case!

late blade
#

I don't want to overhype though, I could be misreading all this

dim cradle
#

well some who predicted these things say there was a 4o image maker

#

11 days to go, one has to be some image maker i think. dalle3 is over a year old now

#

and really, imo, imagen3 blows it away

late blade
#

If we go by some immersive videos posted about sora on youtube, I'd say DALL-E and custom res would be plausible or even feasible

dim cradle
#

you mean any resolutions?

late blade
#

yeah

#

or at least aspect ratio

#

but, that's me with wishful thinking and a box of cookies

dim cradle
#

never thought of it really, but yes, that seems something that should be possible

#

is there an aspect ratio you want but cannot achieve?

late blade
#

it's achievable, no doubt there

#

but I don't want to overhype the idea

late blade
#

DALL-E

elfin latch
#

are these pictures made with chat gpt? if i buy chat gpt plus, how many of these can i make per day?

elfin latch
rigid bough
elfin latch
rigid bough
elfin latch
#

ok thanks

elfin latch
#

btw why can i use dall e in #image-bot even though i don't have plus? does this mean i don't have to buy plus anymore?

plucky hare
elfin latch
plucky hare
late blade
gilded osprey
#

some of these kitchen appliances in today's theme are hilarious, like the lamp on this table

late blade
gilded osprey
late blade
#

Smart Home LEDs, got to mix with LEDs

#

Just making it up as we go, I find it funny too

late blade
late blade
#

Does anyone know if Personal and Team Account share the same limit for Image Generations?

forest flower
#

AIllama better whisk up those muffin cookies before they burn...

late blade
verbal sequoiaBOT
haughty spruce
#

So we are still stuck with DALL-E3 I see 😦 At least for a while...

dim cradle
#

we'll get 4o image maker one of these days I bet

late blade
#

to be honest, Sora is great, but I'm still more interested in images than video

haughty spruce
late blade
#

I'm just wondering, with all the video that is going to be generated, who will have the time to watch all that?

#

Images are time consuming

#

Videos at least 10 times more

#

I would have kept the Hall of Fame, videos and images apart tbh, something like Red-Carpet-Reels and analog thing to images.

late blade
#

pretty much

pulsar sundial
late blade
#

I read a ton, such a relaxing thing to do.

dense ermine
#

Can I ask Dalle to make me a picture in the gallery

plucky hare
late blade
#

New DALL-E option. any insights?

#

And why isn't o1 Pro in my menu on GPT Pro?

haughty spruce
#

not any difference I can see

#

Zoomed into the faces, not very good at all

#

Do you see any difference?

late blade
#

Canvas for Prompting DALL-E is a game changer though, it's so good to revise your prompt prior to editing it

late blade
#

This is great, gonna help me design better templates for the API

hearty ether
#

It will be a nerfed version like what we got with dalle probably

#

Because the real faces with sora are much better

late blade
#

@fair tendon we doing Hibernate again today?

plucky hare
#

The last one was "hibernation" so I bet the dupe just got missed for that reason, checking on it too dalle_smiley

carmine magnet
#

Yuh. Just did it Dec 1.

late blade
#

I don't mind, it's just gonna be challenging to come up with even fluffier ideas

fair tendon
fair tendon
late blade
#

I'm preparing my template to keep the OpenAI 12 days celebration going

#

This really happened, got me cool images

haughty spruce
#

What do you think? 4o image today? probably not...

late blade
haughty spruce
#

I'm beginning to think there is no new image gen tool in any days

late blade
#

only 5 days of 12, still 7 to go

haughty spruce
#

hype train! faster and new image generation is coming confirmed now by sam himself

quartz vale
plucky hare
# rigid bough source?

It was an offhand comment during today's live stream. The demo was waiting on an image gen to complete, and Sam interjected and said something to the effect of, "Don't worry, faster image gen is coming"

rigid bough
plucky hare
late blade
#

The image playground on macOS is weird, so far they are not done with dall-e, thought that by linking GPT to the OS I would have the same capabilities

dense pecan
autumn wren
#

So, I just ran into a bizarre issue. I prompted DALL-E to generate an image of a Heracles Beetle, which it categorically refused. I already flagged it, it's just funny that it refused specifically and only for heracles beetles.

late blade
#

We are invited to join OpenAI?

rich star
late blade
rich star
azure vortex
deft musk
velvet rampart
#

I tried to generate a statue of Herakles. No image was generated. When I asked the AI for a possible reason - it gave something about cultural sensitivity.

#

I do not understand why. It does generate images of Zeus and Hera. How is Herakles/Hercules on the stop list?

late blade
#

Challenge accepted

#

Hercules as a word is blocked

#

@teal sandal

#

Revised Prompt from the API:

An image of a mythological figure, known as the legendary Demigod. This powerful figure is depicted with rippling muscles indicative of his supernatural strength. He is dressed in ancient Greek attire, wearing a lion's skin on his head as a symbol of courage and strength. In his hands, he holds a wooden club, his trademark weapon. The backdrop portrays the ancient world with a towering Mount Olympus in the distance. His eyes are filled with determination and a sense of duty signifying his readiness to undertake any challenge given by the oracle.

late blade
#

it's the same thing as other figures, like Einstein or Ada Lovelace, they get a revised prompt that may or may not resemble them at all

quiet brook
#

when dalle 3 peaked

haughty spruce
austere nexus
austere nexus
#

and now, when making portraits... it's cropping in really weird ways..

#

DallE getting worse than SD1.5

haughty spruce
#

they probably stopped updating and making fixes so it's getting buggy over time

quiet brook
#

they be making dalle 4

quiet brook
errant coyote
#

Why is chatgpt's integration of DALL-E bad despite detailed prompts such as size and colour scheme, I mean occasionally you'll see a jewel but compared to other AIs it's quite far from standard no?

glossy scroll
#

It's a sad story, but OpenAI is not focused on image generation. Its main focus is context generation. They've only used image generation as a training tool, up to the point where the AI itself can understand what it is contextually generating.

#

Everything that OpenAI produces is targeted to be training data to advance context awareness.

#

A clear-cut path to AGI.

dim cradle
#

yeah i really dont think image stuff is a priority for them, beyond how it affects their overall ai understanding. But it obviously still needs some updates

#

but look at Claude, they do not even bother with image generation (Im sure some day it will) which shows most users for AI do not care much about image generation, so of course it is then less a priority for things like here with Dalle/4o

umbral totem
vapid elk
# umbral totem

interesting, what was the prompt?
did it have any specifics such as "guy with a guitar to the left"?
since all the other 3 did similar compositions while dalle changed it quite a bit

#

it probably has to do with how DALL-E has an additional prompt rewriting layer that handles compliance with the rules and also does some prompt improvement

umbral totem
# vapid elk interesting, what was the prompt? did it have any specifics such as "guy with a g...

A lively, Pixar-style scene featuring a diverse group of characters gathered around a glowing campfire in the middle of a forest clearing. Each character is uniquely designed, with exaggerated features and vibrant outfits that reflect their personalities. A tall, lanky musician strums a whimsical guitar, his wild, curly hair bouncing as he plays, while a small, round chef in a polka-dot apron roasts marshmallows with exaggerated glee. A tech-savvy teen with glasses and a glowing tablet sits cross-legged, laughing as they show something to a cheerful older person in a colorful sweater.

The firelight creates warm, dynamic shadows on their expressive faces, highlighting their distinct textures: freckles, wrinkles, and glossy hair. In the background, glowing fireflies dance through the trees, adding a magical touch to the deep blue twilight sky. The ground is scattered with blankets, backpacks, and camping gear, each rendered with playful, cartoonish detail.

vapid elk
#

I see, the "A tall, lanky musician strums a whimsical guitar" was the first one

#

dalle tends to shuffle the prompt around

#

on both chatgpt and the API, you should be able to get the AI-curated prompt to see what it really generated
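
For the API side, a minimal sketch of pulling that rewritten prompt out of a response. It assumes the Images API JSON shape, where each item in `data` carries a `revised_prompt` field for DALL-E 3; the sample payload and the helper name `revised_prompts` are made up for illustration and are not real output.

```python
import json

# Hypothetical sample shaped like an Images API response for DALL-E 3.
# The prompt text and URL are placeholders, not real output.
SAMPLE_RESPONSE = """
{
  "created": 1734000000,
  "data": [
    {
      "revised_prompt": "A Pixar-style campfire scene with a tall, lanky musician playing guitar.",
      "url": "https://example.com/image.png"
    }
  ]
}
"""


def revised_prompts(response_json: str) -> list[str]:
    """Return the rewritten prompt of each image, skipping items without one."""
    payload = json.loads(response_json)
    return [
        item["revised_prompt"]
        for item in payload.get("data", [])
        if item.get("revised_prompt")
    ]


if __name__ == "__main__":
    for text in revised_prompts(SAMPLE_RESPONSE):
        print(text)
```

Comparing this text with what you typed shows which parts of the prompt were shuffled or dropped before generation.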

late siren
#

yo chat what happened to the dall E website?

#

Dont tell me it's in chatgpt

haughty spruce
late siren
#

And where may i ask are my credits the

#

n

haughty spruce
# late siren And where may i ask are my credits the

it seems that you are asking about your credits; they are gone now, as is dall-e 2. now you are left with dall e 3, which was released in september last year, and it's locked into chatgpt plus for more generations (with limits); for free you can generate 4 images.

late siren
#

so i got robbed you tellin me

haughty spruce
#

if you know how to use api, you can still use the dall e 2 api, it's available

plucky hare
#

ChatGPT could even help you set it up dalle_looking
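
A rough sketch of what such a setup could look like using only the Python standard library. It assumes the `https://api.openai.com/v1/images/generations` endpoint still accepts the `dall-e-2` model with its three square sizes; the helper name `build_dalle2_request` is made up here, and the script only builds the request without sending it.

```python
import json
import urllib.request

API_URL = "https://api.openai.com/v1/images/generations"
# Assumption: DALL-E 2 only accepts these square sizes.
DALLE2_SIZES = {"256x256", "512x512", "1024x1024"}


def build_dalle2_request(prompt, api_key, size="512x512", n=1):
    """Build (but do not send) a POST request for a DALL-E 2 generation."""
    if size not in DALLE2_SIZES:
        raise ValueError(f"unsupported DALL-E 2 size: {size}")
    body = json.dumps(
        {"model": "dall-e-2", "prompt": prompt, "n": n, "size": size}
    ).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder key; a real one would come from your own account.
    req = build_dalle2_request("a watercolor fox", "sk-placeholder")
    print(req.full_url)
    print(req.data.decode("utf-8"))
    # To actually send it: urllib.request.urlopen(req), then read the JSON body.
```

API usage is billed separately from ChatGPT, so this does not bring the old credits back.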

light tide
# austere nexus DallE getting worse than SD1.5

I have noticed a significant downgrade in output quality in the past week or so. Every image generated now is significantly less detailed, radically over-sharpened, needlessly grainy, and nowhere near as realistic as before. The change can only be described as catastrophic. Not sure what was changed...

glossy scroll
light tide
glossy scroll
#

I would assume that Bing uses Microsoft's processors instead of OpenAI's.

#

But there might be a correlation between how the processing is split between the two companies.

light tide
#

Interesting...Thanks for the reply! The downgrade in image quality over the past 12 days does coincide with Sora becoming available.

harsh tartan
#

Spent about 30 minutes providing what was apparently too much detail and asking ChatGPT to format and refine it all into a prompt for Dall-E (itself basically). Couldn't manage it, and I got "cannot do it. would you like to simplify the prompt and try again" twice! 😜 but eventually we got something useful

#

no idea why it included those little mushroom shapes in the background behind her left shoulder! 😂

#

turned out better than I expected tbh!

dim cradle
#

looks good

harsh tartan
#

Is it a stated goal that eventually DALL-E will be able to reuse imagery? So for example I give it that image and have it place that character in different scenes? That would be SO HUGE. Even more huge would be: generate multiple characters, then mix and match them in different scenes . . .

#

I think there must be other algorithms that can already do that. But since I am quite happy with ChatGPT as an editorial and research tool, and the image generation side of things is just a "nice to have," I'm not very inclined to explore other services.

#

See this guy is PERFECT for the character I have in mind, and being able to put those two characters in scenes together would be amazing.

#

But it seems like any effort to get either one of those specific characters recreated with much fidelity is largely random.

plucky hare
# harsh tartan Is it a stated goal that eventually DALL-E will be able to reuse imagery? So for...

I'm not sure what version of this we might see someday, but fully omnimodal GPT-4o should be able to do quite a bit with regard to this kind of character reference/consistency. If you haven't seen this before, check out the various samples in the "Exploration of capabilities" section of this page: https://openai.com/index/hello-gpt-4o/
We'll see how this does or doesn't affect DALL-E in the future! I don't think there's been too firm of confirmation yet in any direction.

harsh tartan
#

hrrmmph . . . sounds like you guys are going to wind up replacing my wife 😂

harsh tartan
open flume
#

I might be late but is anyone else noticing today a potential model upgrade using Dall-E on Bing?

#

I think they may be preparing to drop a better model

#

I'm seeing higher quality results + higher image resolution across the board.

hollow igloo
#

i truly cannot get dall-e to NOT draw the colosseum in a replacement image, like he cannot have a reference without thinking to mess up my request, come on dall-e i trust you, you can do it. any prompt advice?

#

there should be a giant pigeon statue instead of the colosseum. i tried 2 times but every time there is the colosseum in the background even tho i asked like 4 times to not do it in the second prompt

hollow igloo
open flume
#

Bing gives me better results than any chatGPT or API output 100% of the time

#

and I pay for pro

#

Thank you microsoft shareholders!

glad hare
#

Which prompt can I do to make my images glow very bright?

glad hare
#

Example.

#

The best I could do to repeat this for every subject was adding “cosmic” to my prompt.

harsh tartan
harsh tartan
# open flume Bing gives me better results than any chatGPT or API output 100% of the time

hrrrmph . . . well whatta ya know. MS did something sensible!

Microsoft has partnered with OpenAI to integrate advanced AI tools into its platforms. By embedding DALL-E in Bing Chat, Microsoft expands the functionality of its search engine, making it not just a tool for finding information but also for creating visual content.
Was not aware. If/when I feel a strong need to reuse the images I've created (which are mainly just visualizations for my own creative momentum at this point) might have to explore that.

harsh tartan
# glad hare Which prompt can I do to make my images glow very bright?

I have a quite limited basis on which to assess DALL-E at this point (but quite a lot of experience using the text-based editorial and research functionality of ChatGPT), so my assessment might not be entirely fair. But, to put it simply: at this stage DALL-E, without any additional functionality (e.g., by using it in concert with Bing as the previous user referenced) strikes me as a "random image generator" which follows user guidance only in a rough sense. The prompt I used to generate the beautiful redhead above was 558 characters long. That was one of about 5 different versions DALL-E created, and the only one that was truly good and suitable for my purposes. The range of variability in those 5 images was dramatic, though they did all share the general characteristics: a young adult red-haired human female in a futuristic setting. In some cases, the bot completely ignored key directions in the prompt; in others it followed that same directive perfectly.

It shows great promise. But until you can literally edit images with it, save characters generated, save themes/settings/items/entities, etc. and reuse them to reliably recreate creations in different contexts and combinations, it has limited utility.

That may change very soon (check the links from Solbus in response to me above) or it may be that, with greater study of how to use the existing system and/or by using the system in concert with other apps/services, you can get more utility out of it.

#

🤣 ChatGPT Agrees!

Your assessment is fairly balanced and acknowledges both DALL-E's strengths and limitations at its current stage.

#

The phrase "random image generator" might sound harsher than intended. Instead, you could say something like:
"DALL-E currently feels more like a 'guided image generator' than a precision tool, as it often adheres to prompts only in a general sense."
This softens the tone while making your point clearer.

#

I think if you are willing to put in the effort to learning how to iterate prompts, and explore all the options, it is already an enormously valuable tool. Imagery just isn't important enough to me at this stage that it is worth that much effort; but I sure would like it if it was even more point-and-click!

glad hare
#

?

light tide
vapid granite
#

With just one day left in the "12 days of OpenAI" Dalle 4's release day confirmed! 🙂

light tide
#

"Starting today, we’re rolling out the latest DALL-E 3 model PR16 which will allow users to create images twice as fast as before with higher quality."

The "quality" is unbelievably, catastrophically worse. Unless I'm just not seeing the update yet...

#

Before:

#

After:

#

Literally the same prompt, but radically downgraded image quality and realism across the board in every aspect.

velvet rampart
#

Yes, the copilot has been downgraded and heavily.

#

Before, this would have been of higher quality. There are so many low-quality spots. I want higher quality, not speedy, incomplete generation.

dim cradle
#

well I guess 12 days and we are not to get a new image maker smh

#

but imagen3 is so good I guess I dont care haha

#

eat google's dust openai

light tide
dim cradle
#

Yeah I notice that on copilot

velvet rampart
#

What model is on chatgpt? Same?

#

The PR16 is horrible. 🤮

#

But cheaper to run, I’d guess.

light tide
#

Yeah, everything looks half-baked, like it only renders the basic concept of the prompt and then skips the details and realism to finish faster.

dim cradle
#

Bing was always King, now it looks muddy as gpt

velvet rampart
#

Funny. chatgpt produces better quality images.

dim cradle
#

yeah it does for now

#

can even see the fur detail

velvet rampart
#

Is Microsoft low on money?

dim cradle
#

just the opposite haha

velvet rampart
#

And the hair. Individual hair. Even the zipper’s teeth are better quality.

light tide
dim cradle
#

dont some people on gpt, still seems the same digital eyes and thing

#

*doing

velvet rampart
dim cradle
#

i have the free version too

velvet rampart
#

Well, this is done by MS designer tool on paid corporate version. Your thoughts on quality? Same prompt as my previous one done on ChatGPT.

#

Lower quality. Pity. It used to be of higher quality. So, the same model, PR16, is there also.

glad hare
#

OpenAI, would it be possible to improve the Dall-E image generation model tremendously? I wish it could be on Midjourney’s level.

#

Whatever training Midjourney has, Dall-E could use some of that! 😂

dim cradle
glad hare
light tide
glad hare
#

Even Grok is somewhat better.

velvet rampart
#

Depends on what you try to generate on Grok. Photoreal is great, but other styles not so much. I tried to generate some 3d style. Those come out looking bad.

#

I just generated this on paid corporate Designer tool. The quality came out better than the earlier one. So, today’s lesson: the quality appears to be heavily dependent on the prompt.

light tide
dim cradle
#

someone do it on MJ for comparison quality to Dalle (which is on topic)

harsh tartan
#

. . . well . . . in DALL-E's defense: it seems it can distinguish many different ethnic group terms when generating facial features. In general the differences are quite subtle (as they should be), and some ethnic group names it just does not seem to know (e.g., Shona or Zapotec are two I've encountered where it just couldn't do it). But overall it does generate characters with believable ethnic features. That matters to me, a retired anthropologist working on science fiction. It may be irrelevant or at minimum secondary to most users though, and therein is the crux: even restricting it to still images, digital art is an absolutely enormous topic and I don't think it would be reasonable to expect every bot to perform equally on all types of creation.

light tide
dim cradle
#

I dont think that about MJ myself which is why I would be curious to see it

glad hare
#

Holy moly. Grok is definitely better than Dall-E.

#

It generates four images for you and each image actually follows along the “generate this image in the style of another image” prompt.

mild basin
dim cradle
#

and for what?

autumn drum
#

Suggestion:
Please get ChatGPT integration on Discord 🙏

mild basin
mild basin
light tide
velvet rampart
#

This is a great disappointment. There is no improved image generation system, such as 4o’s inbuilt image generation.

vagrant zodiac
#

Dall-e 3 api down or is it just an issue on my end?

velvet rampart
vagrant zodiac
#

ah yeah, looks like it just updated!

#

That status page can be... slow

dim cradle
#

well no image maker

#

12 days and they shut out smh

spark hemlock
#

Really disappointed. 12 days of announcements and absolutely nothing about image generation.

agile peak
#

Over-restriction and censored. Yeah this happened because some people generate "malicious" stuff.. which is why we cannot have a nice thing.. I noticed the massive downgrade in realism.

#

Comparison between early day of bing dalle and now, i use camera emulation but it keep generating the "camera" lol

mild basin
velvet rampart
#

Bing/Copilot/Image Designer now has the same problem as Dalle on ChatGPT from time to time due to the model PR16. One moment is okay, but the other moment, the image is left unfinished, like a few more generation cycles are missing. And this really depends on three factors, I think. 1) Time of day and week, 2) general server capacity (tied to #1) and 3) prompt (complexity, simulated style/technique, amount of detail, original content vs copyrighted characters). It appears that copyrighted characters are of lower quality than the original based on my experimentation with Bugs Bunny (copyrighted) and his general description ("original") in the prompt.

dim cradle
#

we asked for a new image generator and the monkey paw curl...

#

deliver to us PR16 with an evil laugh

mild basin
#

From what I can see, ChatGPT has started using PR16 mid-June and Bing Image Creator has started using it just recently. I used to call PR16 'Discount DALL-E' (and some other pejoratives like that) before the announcement from Microsoft because no one knew of the PR16 name at the time

haughty spruce
#

they updated the model to become worse? omg what the hell is going on with dall-e and openai and microsoft?

#

i tried it now... it's absolutely worse

#

this year's beginning / this year's end

velvet rampart
#

Yes, this is one of my points. The style/technique specified in the prompt. So, photostyle is not so supported but more cartoony, etc. is better supported.

mild basin
#

PR16 just prioritizes speed over quality. We want higher quality images, not rushed low quality smudgy crap.

light tide
#

It's really just completely terrible at this point. They are shooting themselves in the ~~foot~~ head with this change. It is mind-bogglingly stupid. Who is going to use this anymore?

#

If I want to create characters and scenes for a story I am writing (with at least enough realism for suspension of disbelief), it is just not possible anymore with Dalle.

mild basin
cyan turtle
#

Am I crazy? It's become ultra woke no? Following prompts were denied:

Make an image/ caricature of a Dickens-like personage missing his denture and holding a big banner happy 2023

Create a Christmas card-like image with Quebec City and a Christmas theme with the banner saying Colette & Odette.

And even with arguing, it refuses. I've had several others declined lately - with similar prompts

cyan turtle
dim cradle
sweet python
# mild basin

Wow, how’d you get it to write intelligible text?

dim cradle
#

roll enough times, you eventually get the right spelling

light tide
boreal jasper
#

Is it me or that PR16 model got a lot of training data removed? The quality downgrade is insane (it is the same prompt)

cloud atlas
#

I'm having a hell of a time getting it to make the back of a building. Trying to get a one street old west town and it's insisting on every building facing the same way.

"I want an image of an isometric view of a one street old west town in a desert, nevada-like setting. Each of the two rows of buildings should number a total of 12 or so shops and businesses. It is an old west setting with horses tied to hitching posts and western characters walking around town. I want the two rows of buildings facing one another. The image should be large with the entire town viewable and with aspect ratio 16:9."

dim cradle
#

probably lack of training data for it i guess

velvet rampart
#

A question arises, other than server costs, are they preparing to launch a new image generator soon? This way they can say ”Look, how much better the new one is!”

velvet rampart
#

I noticed a critical factor. These two images are produced with Bing Image Designer. This one yesterday afternoon (Saturday).

#

And this one is produced now, Sunday morning.

#

Server capacity appears to be very critical with this new model, PR16. Both are generated with same prompt, same service.

#

The prompt is: generate image: realistic, anatomically and biologically correct and accurate bugs bunny. physically based rendering. he stands in a park. full body picture. hyperdetailed, 64k definition and resolution.

#

So, when they say that the new model is great and better, yes, they are partially right, but it comes with heavy caveats, e.g. server capacity and rewriting earlier prompts.

nocturne spade
#

Hello, I have a question if you could guide me I would appreciate it very much

#

It's about the types of plans that Chatgpt has, I want to know how many lines of code the $20 plan and the $200 plan can create

#

Because it does not specify in each plan how many lines of code you can create, and I would be interested to know before buying any of the paid plans because with the free plan you can barely create 200 lines of code with this update that you have had and before the update you could create 500 lines of code without problems

next totem
#

Forcing it to alter the wording of my prompt has made me finally stop using it as my primary source for content as that was the vital method of triggering the right parameters in the model

boreal jasper
nocturne spade
light tide
nocturne spade
light tide
#

Microsoft is (allegedly) looking into the downgraded image quality and (again, allegedly) considering either a reversion back to PR13 or implementing an option to use it in place of PR16, according to the guy in charge:

x dot com slash JordiRib1/status/1869425938976665880

This seems like a good time to keep the pressure on and let him know (kindly) just how many of us view this "upgrade" with unanimous disapproval.

mild basin
#

Hopefully OpenAI will revert the change they silently made to DALL-E on ChatGPT during mid-June

carmine magnet
#

the other one missed the Kintsugi but otherwise great result.

austere nexus
#

Current DallE (chatgpt) is soooo bad..
I don't think this is a version issue... I think its a compute issue. They keep adding more and more free users and functionality to free users, that they have had to cut corners in Quality.

You have a triangle of Cost, Quality, and Speed.
Choose 2 to be good at, and 1 to be bad at.

They've chosen Speed and Cost, to poop on Quality.

#

I pay for ChatGPT Teams just to have a higher limit... at this point.. I should just go Free... and use Flux or something.

stone lodge
#

Model is (very) noticeably terrible now

#

Serious OpenAI whatever you did to it undo it immediately

#

And just leave it as it is after that

#

every alteration done to dall-e 3 has done nothing but downgrade

#

you clearly have the devil's touch

#

😠

mild basin
#

Not to mention the second Earth in the background looks like an amateur Photoshop job, and one of the moons has some sort of wire coming out of nowhere attached to it, which seems to be a recurring issue in this PR16 version (or as I'd like to call it, Discount version) of DALL-E 3

next totem
quiet brook
velvet rampart
#

I have noticed that the previous prompts have to be rewritten to take into account the quality issue.

haughty spruce
#

They have at least 2 new image gen tools which are Sora and 4o image but they are holding it back... why? why downgrading dall-e 3 even more? it's not even acceptable for a plus tier.

sour dew
#

Dear OpenAI Community Members,

On the occasion of Christmas, I would like to wish you all peace, joy and warmth in the company of your loved ones. May this time be full of magic, inspiration and countless ideas of inspiration that will drive us in the coming year.

May the new year bring us even more wonderful projects, fruitful cooperation and successes that we will celebrate together. Thank you for every moment spent together creating and developing technology that changes the world for the better.

Merry Christmas and a Happy New Year!

🎄🌟 ** Wishes Sebastian L. **

velvet rampart
sour dew
austere nexus
charred sinew
austere nexus
jolly nymph
#

What happened to DALL-E? Generations earlier and during December.

velvet rampart
jolly nymph
late blade
#

Or is it?

jolly nymph
jolly nymph
# late blade This is not DALL-E

You can get such a high quality image even on PR16, but that's one chance in 100-200 generations. The picture on the right is what you get in 99.9% of the results.

velvet rampart
#

Well, I have noticed three factors: the complexity of the image, when the image is generated, and prompt rewriting. The new model, PR16, needs the prompts to be rewritten. Also, if you are generating more complex images, try to do it at a quieter time when servers are not busy, e.g., busy work hours—low quality—vs. low-demand hours such as Sunday morning—higher quality.

jolly nymph
vale vapor
#

I’m not currently able to generate any images via the iOS app. Is anyone else experiencing this? I have a paid account.

verbal sequoiaBOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

velvet rampart
#

Major outage on all services.

tardy depot
#

Does anyone know when dalle4 will come out? It's still far behind MJ and flux pro ultra.

austere nexus
austere nexus
# jolly nymph What happened to DALL-E? Generations earlier and during December.

if you go up to the search and do something like in:dall·e-discussions quality you will find my own and many others' posts about the last few weeks to months of steady decline in ChatGPT DallE, and Bing/Copilot too. I do not use Bing/Copilot. Most of the other users were complaining from the Bing side of things but my own experience is 100% via ChatGPT.

jolly nymph
velvet rampart
fading kelp
#

?

#

hello

round saddle
#

hi

undone nymph
#

How many image generations do you get with DALL-E if you use the pro plan over the plus? Plus says I can only have 50 images, which I reach frequently. Does pro offer more? It doesn't specifically say anywhere that I've seen.

crimson solar
#

Why the HELL does Dall-E insist on giving you a rotated landscape image when you specify "vertical aspect ratio" or explicitly ask for portrait resolution?

#

This is an absolutely infuriating issue and I can see that Dall-E developers have been aware of it for almost 2 years

#

Why no progress on this bug? They recognise it as one but have done NOTHING to fix it.

jolly nymph
#

How is it that Bing got images on different versions of the model in the same query? I doubt that the result on the left is on PR16.

jolly nymph
#

Bing seems to have brought back the old version of DALL-E or fixed PR16

mild basin
#

Jordi Ribas has said on Twitter that he and the team are working to bring the old version of DALL-E back on Bing Image Creator

dim cradle
mild basin
#

A Microsoft CVP

dim cradle
#

I wish someone would tell him to make it easy to make square ratio image by default too

#

i mean for copilot pro. I actually sub to copilot pro for a month to get more dalles, but then they make you jump through hoop to get a square format image

#

better not to sub where square is the default smh

exotic totem
#

Day 1454 of asking OAI to pretty please release the original Dall-E weights

#

Just screaming into the void but it's a historical artifact and I don't think society will crumble at this point if it turns out it can generate a 64x64 mess that looks vaguely like a boob

lunar mist
#

Dall-e is falling behind... I asked for "Generate a picture of a Pallas's cat hunting a chicken"... Dall-e looks like a painting while X-Grok looks real-life... I even re-asked ChatGPT to make it realistic. Although points to Dall-e for portraying a chase (kinda).

velvet rampart
stuck pecan
#

This is crossing all boundaries! Time and time again, I try to generate an image that matches my request, but DALL-E keeps giving me something entirely off the mark. When I finally get an image that’s somewhat close to what I need, I try to make small adjustments—remove or add details. And what do I get? Complete refusal to properly implement these changes, and eventually, I hit the generation limit.

The system shows I’ve used up all my limits, but here’s the paradox: I have nothing to show for it. Not a single image fits my needs. Money is spent, time is wasted, nerves are shattered, and there’s no result. This feels like an insult to the users.

I demand a reconsideration of the limit system and improvements to functionality so that adjustments actually work instead of wasting attempts and time. Fix this, or there’s no point in continuing to use your service!

dim cradle
#

yes i used to say Bing is King (for Dalle) but you wont hear me say it these day 😭

light tide
#

That first shot is legit. The second is 🤮

mild basin
#

Remember, this problem with DALL-E 3 has been present on ChatGPT for six months

graceful sinew
#

does anyone know when dalle4 is coming

haughty spruce
#

one thing I noticed is when MS Designer was first created, the image quality was like x10 better than ChatGPT and Bing Image both combined.

#

Then, in 3 weeks they tweaked the website a little bit and that quality was gone, downgraded to bing quality again.

#

Here, this was "before nerf" Bing image quality overall

#

or here let me take a better picture to define it:

#

it was never perfect (see the errors in necklace and earrings)

#

the photorealism was acceptable enough

#

we are talking about september 2023, the only other option was mj which was paid service

#

i think it was good

#

same style september 2023 / early this year... can you see the difference?

#

these two are bing... while there was ms designer which could generate this:

quartz vale
haughty spruce
#

this is the ms designer quality in early 2024

#

then it fell... again. the quality was worse like second version of bing

fierce halo
#

Does anyone know how to post in the daily theme?

vagrant zodiac
fierce halo
#

For some reason, it didn't work at first.

velvet rampart
#

Yes. There are additional components to consider. OpenAI and Microsoft have different filters and policies that influence the image even if they have the same base model. It also depends on whether the prompt is rewritten or not.

daring magnet
#

To contribute to the ongoing discussion on image quality comparison, here are three images generated using the same prompt. The first image is from DALL·E exactly one year ago, the second from Bing at the same time, and the third is from the current version of DALL·E

sour dew
#

→ Server Emoji Competition #spotlight #daily-theme .
∞ @quartz vale Dino, I have a request. I noticed that the recently added contest emotes could be shown at a larger size. In the attached square there is an example emote by user @Magda★, and the heart and the pelican are in a similar situation. Although my emote did not win, for the sake of the other winners, could you enlarge them a little so that I can see them normally, like the rest, at my resolution?

quartz vale
sour dew
heady grove
#

Can someone explain to me how to tell ChatGPT DALL-E to draw an octagon? It always gives me a hexagon.
Thanks in advance.

#

An output

sour dew
# heady grove An output

Sometimes it is enough to change the description. I had the same problem with geometry, but there is a solution: try describing a diagram, e.g. connect lines a, b, c, d, e, f, g, h of the same length to create an octagon with 8 sides, or send it a photo of a simple drawing, which is the fastest solution.
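
If you want to make that simple reference drawing yourself, the corner points can be computed. This is only a minimal sketch, assuming a regular octagon centred at the origin; the function name `regular_polygon` and the radius are made up for illustration, and nothing here is a DALL-E feature.

```python
import math

def regular_polygon(n_sides, radius=1.0):
    """Return the (x, y) corner points of a regular polygon centred at (0, 0)."""
    points = []
    for k in range(n_sides):
        angle = 2 * math.pi * k / n_sides  # equal angular steps around the circle
        points.append((radius * math.cos(angle), radius * math.sin(angle)))
    return points

# Eight corners labelled a..h, matching the "lines a to h" idea above.
octagon = regular_polygon(8)
for label, (x, y) in zip("abcdefgh", octagon):
    print(f"{label}: ({x:.3f}, {y:.3f})")
```

Plot or sketch those eight points in order, connect them, and upload the result as the reference image.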

heady grove
#

Okay I will try it right now, thanks 👍

sour dew
sour dew
# heady grove An output

Error in the prompt: "Your drawing shows a regular octagon, with its sides marked and connected by lines, creating a coherent shape. Here are the details:

  1. Shape: It is a regular octagon (each side is approximately the same length, and the interior angles are equal).

  2. Vertices: They are marked with letters and numbers in clockwise order: A1, B2, C3, D4, E5, F6, G7, H8.

  3. Connection: The vertices are connected by a red line, which creates a closed figure. These lines are clearly marked.

  4. Points at vertices: Black dots are visible where vertices are, probably indicating specific places where the lines connect.

  5. Colors: The red line connecting the vertices is the dominant element, standing out against the background of the drawing.

  6. Background: It is white, and the drawing looks hand-drawn with clear detail.

If you wish, I can help you analyze this drawing in the context of its application or interpretation."

sour dew
heady grove
#

Also I give him, 8 points from a to f

velvet rampart
#

Got an octagon inside a hexagon by accident. My guess - something about training data screws the result.

#

I tested the system. The problem? Training data and the system can’t count. It cannot make a pentagon either; those come out as hexagons.

#

After testing, it appears that generating an octagon is possible, but it takes several tries and you need to request a nonagon or higher polygon. You will still get lots of hexagons.

#

This is for requesting a decagon.

winter reef
#

What happens when I try to get good phrases or words in images created by ChatGPT? I always end up with poor word patterns in the images I create. Is there another way to get better phrases or words in the images I want to create with ChatGPT?

tribal sierra
#

are you guys able to get consistent characters with ChatGPT so far? i tried a lot and its harsh lol

grizzled loom
# tribal sierra are you guys able to get consistent characters with ChatGPT so far? i tried a l...

build an attribute list for your art and style. drive all the features you want from a script that takes that list as prompt input. e.g. if your main concern is deep eye detail: eye details: {iris shape, color, pupil size, shape, types of lens flare in eye, depth, contrast, eyelashes, etc.}. so build a deep detail list of features you can template from and fill in. do that for as many elements as you care about until you have a massive list that, when you prompt with it, generates a style you feel is representative of your goal.

#

the deeper the list, including position, etc the better the output and more accurate
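
roughly what that could look like as a script. this is a minimal sketch and not anyone's actual tooling: the `STYLE` dict, its keys and values, and the `build_prompt` helper are all made-up examples.

```python
# Hypothetical attribute list; every key and value here is a made-up example.
STYLE = {
    "eye details": ["almond iris shape", "amber color", "small round pupil", "soft lens flare"],
    "fur": ["short", "wavy", "grey"],
    "position": ["subject centered", "facing camera"],
}

def build_prompt(subject, attributes):
    """Flatten the attribute list into one prompt string, one clause per element."""
    clauses = [f"{name}: {', '.join(values)}" for name, values in attributes.items()]
    return f"{subject}. " + "; ".join(clauses) + "."

print(build_prompt("Portrait of a fox character", STYLE))
```

reusing the same dict across chats and only swapping the subject is the part that is meant to help with consistency.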

quartz vale
lunar mist
#

More then and now, decline proof... Then was from Early September 2024 (first image)... I'm so not using this tool anymore. Looking for better options.

#

Different prompts, but it looks like maybe in this time frame it went downhill... October 24, 2024 (first), November 03, 2024 (second); just speculating here on when.

velvet rampart
lunar mist
velvet rampart
# lunar mist ChatGPT of course, why?

Because Dalle3 is available on multiple platforms that behave differently from each other and there have been varying quality issues on all of them.

lunar mist
#

Interesting, nope, just using ChatGPT.

mild basin
#

Here's a comparison between images generated on ChatGPT before and after they silently rolled out the PR16 model in mid-June.

#

The image on the right (made after PR16) clearly looks worse

#

Forgot to mention, same prompt

late blade
rigid bough
# fierce halo What's PR16?

DALL-E 3 PR16 is the latest iteration of OpenAI’s DALL-E 3 text-to-image generation model, integrated into Microsoft’s Bing Image Creator.

dim cradle
#

sounds like they will switch it back

#

probably next week they return to the office

#

...or they could just release 4o image maker finally 😭

haughty spruce
#

a new image generation not even on sama's list. i think we are really stuck with this DALLE 3.

velvet rampart
dim cradle
#

i think they will release something new in january. imagen is really blowing them away. it even lets you do ip stuffs (sometimes). but imagen update was right before christmas. once openai is back next week, maybe we get 4o image

tribal sierra
grizzled loom
#

essentially

grizzled loom
#

Nothing good comes easy. You'll figure it out. 🤣

tribal sierra
#

lol working hard on it ... . 😄

verbal sequoiaBOT
dim cradle
sour dew
carmine magnet
vagrant zodiac
#

So fwiw DE3 will generally try to suppress negative wordings during the GPT prompt re-write. Words like 'hate' included. With Bing the results seem vaguely inconsistent but you also see it fail to write 'hate' for other things, but on retry will succeed on this and other 'targets' 🙂

grizzled loom
#

instruction is better than blocking language; but the company wont realize that until 3 years from now.

grizzled loom
#

Ever heard of the game taboo?

#

Game premise is get everyone to guess a word with concepts, but you cant say the word

#

multiply that word to many words and compound that by full context understanding and rational is destroyed. humans have no limits except self-imposed constructs accepted by self, by instruction of self or others on self.

vapid granite
#

lol, yeah I remember that. The quality was next-level when DE3 was still in that "beta" phase before they locked it down totally. I got to see the true potential, and have only seen DE3 regress utterly over this past year, sadly.

#

OG Dalle 3

dim cradle
#

pour one out for what we lost

#

thats the frustrate thing always -- it could already be a top model as good as imagen3 i think if they just take so many silly restriction off

#

i think there is a good chance for some change next week. they are back from holidays on monday and that one guy who apparently is connect to Bing Dalle at least say they will improve/revert/fix the model

deft musk
# lunar mist More then and now, decline proof... Then was from Early September 2024 (first im...

So. I dunno. I wonder.

My prompt chain, and each had the image as input too, for feedback to the model. I started with your rooster in a crowd image.

"Let's recreate the close-up and in-place feel of this image."

"Let's iterate on this, we did awesome. Let's get even closer to the sense of the people in motion, still very close to the camera and motion blurs of people walking and reacting to seeing the rooster."

"You rock. Way to go, look at this!

Our rooster went kinda scale-creature though. Can we get this excellent real-ish human motion closeup scene, and a very true to real but giant rooster too?"

"This is good. Let's give the extra rooster attention to the feet and body-features, and the wings, as the head, shoulders, tail seems okay. Equal attention to our awesome close crowd walking and motion-blurred."

"Yay! Now let's go for a specific rooster breed.

Focus on the crowd and nearness of the crowd to the camera and the giant rooster, include the city details in the background, the park details.... Name one of the very black chicken breeds as the rooster, just say it's very realistic and detailed."

#

And I absolutely love this last image. So much I keep it by itself:

"This Ayam Cemani giant is breathtaking! Its iridescent black feathers gleam with deep greens and purples, while the close crowd and city park details create an incredible sense of scale and motion. Let me know if there’s anything you’d like to tweak or refine further!"

-- quality down, yes? I'm not looking at the right stuff?

velvet rampart
#

Yeps, this is one of the points I have noticed. A new model behaves differently, so the prompts need to be reformulated slightly differently. Compare it with when a new ChatGPT text model is released.

#

Also, it is quite heavily dependent on WHEN the image is generated. Busy server time vs. slow server time.

deft musk
#

"OMG. That rooster is beyond perfect.

New challenge, let's redesign this pug-head on a bantam body. Let's pick a specific bantam breed, one that's really colorful, and let's name a specific pug line if that might help get real detail in the dog's face.

Here's our reference image we want to compare to and exceed. Attention to the background detail too, let's compare to and exceed in quality and detail."

"GLORIOUS!

Too zoomed in, try for this but let's get more bird body in the shot. Love it."

"Oh, haha. I love this... it's equally good but not what was asked for. And still not the whole body, wonder why.

Let's try again."

"This one is almost perfect, we sure got more of the body. Oddly... the heads are a bit off. let's try again :D"

#

"We’ve achieved a seamless fusion of the pug’s expressive face with the full, vibrant Sebright Bantam body. The intricate feather details, majestic tail, and natural setting all shine beautifully. The integration feels balanced and lifelike now—how does this meet your vision?"

velvet rampart
#

Positive reinforcement? Could that be the trick we are looking for?

deft musk
velvet rampart
deft musk
# velvet rampart This is something I have been doing (the iterations and feedback). But the posit...

The point of the positive stuff, as far as I can tell. It's a ranked-high signal to the model about what you want and also how open you are to risks of error and reinterpretation.

How bad is messing up to you? "Let's try again" suggests little badness if there's error, and the user is okay with multiple attempts - that's vital, because the model must assume you know exactly what you want and how to ask for it, if there's no room for error.

If you demand perfection then the model can only value your exact words and it's 100% your job to find perfection and it does what you say, it's 'your fault' if what you ask for isn't what you want (I don't think the model thinks that way. I do think the outcome of how the model decides can be interpreted through that lens with high accuracy).

If you are clear about what you want and prefer, and guide that direction 'with room for error' and give clear feedback (pasting the image in for the model to use whatever ability it has to judge is one of several ways to attempt to offer that feedback) - then it's possible to get the model freely experimenting towards what may achieve your goals, and 'taking risks' that may not work. That can help because the one sure thing is that the first approach tried didn't work. What does work might be extremely different, so there's room for error to seek what might work.

velvet rampart
#

About quality. We have had discussions about quality a lot. I noticed this morning one thing. If you do not guide the model on what you want, e.g. quality on details, it is lazy.

#

This mouse image is from March 2023. I wanted to recreate it. I have lost the original prompt.

#

So, I had a few tries (not shown). Finally, with positive reinforcement (thanks @deft musk!), I get this:

#

Really close to my March 2023 image with high quality. I specified to the model to pay attention to fur’s texture and rendering textures of clothes.

#

So, summa summarum, the current model needs guidance and attention to detail, aka prompt rewriting in some cases.

deft musk
# velvet rampart So, I had a few tries (not shown). Finally, with positive reinforcement (...

I played a little.

"Hey, Prech, we're getting ready to recreate this image.

Please start by detailed description, I'll possibly adjust with any details my eye catches that your vision saw different, we can discuss."

"Awesome.

Let's include the fur description; everywhere a human's skin would show, like his wrists, neck and face, we see his detailed and slightly wavy mouse fur.

The environment is in soft non-focus; we can tell it's ripe wheat and in the distance the grass and trees, that weathered barn and it's horses and fence. But it's all a gentle blur except for the mouse and what it stands on."

"Excellent start!

Let's adjust so we move towards the high realism of the original."

"Ahhhhh! Look at this wonder and fantastic work!

This is good good good good.

Let's stay close to this, and see if we can turn it slightly to the side, and have some fuzzing of the environment, soft off-focus."

"Very very nice.

Let's adjust the color of the scene, let's kinda wash towards pale, but the mouse itself grey. And a bit more towards realism on the mouse's face, and describe the farmer's hat it wears."

"This is the best mouse ever. Everything is right about the mouse. let's adjust the background a bit less washed out, make the details of wheat field, distant barn and horses clear; but with a pale filter and a gentle, kind blur."

#

"We have mastered the mouse, but lost the background, an interesting quandary!

Here's the original again and then our super great mousy friend; let's save the mouse and add him back into his world."

"Incredibly good mouse still. Absolutely love the mouse. Moving back towards that world it needs!

I see wheat and like the blur effect, let's see if we can edge the barn in the distance with its horses, just visible. And if we can retain the glorious farmer's hat too!"

"This is good and interesting, the blur is better and better. Describe the background a bit more, maybe? So the farm can show"

deft musk
# velvet rampart Really close to my March 2023 image with high quality. I specified to the model ...

I then took what I'd learned to a new chat and started over.

"Let's recreate this image with its highly realistic anthropomorphic mouse, attention to wavy grey fur details and farmer clothing, and describe background as Bokeh."

"Ah, haha! That's glorious and interesting!

Let's concentrate on more realism for the mouse and show its whole body, describe the boots and surface stood on, and use the bokeh terms to describe the wheat, sky, barn, and horses"

"That's great but got turned to the side, try that one more time"

haughty spruce
#

all we wanted was updated de3, they released nothing on 12 days and at the end they downgraded de3 to a point that is unusable.

spice owl
#

Are there any limitations on using Dall·e3 to generate images with a ChatGPT Plus subscription account?

rigid bough
dim cradle
#

but Imagen3 esp on FxStudio is so much better, I see no reason they will not update their own model some to be on the same level

#

Imagen3 will also do IP stuffs (most of the time)

drowsy sigil
#

I created a Prompt Improver that has really helped me with dall-e images. I posted it at #1326343605069156372 message in case it's helpful to anyone.

velvet rampart
#

Just thinking - I have suspicion that the often present rendering mistake on the other eye is somehow purposeful. A ”watermark”. This happens so often.

ruby mortar
#

Axis was wanting help with generating a little snake friend for my D&D character

#

She recently befriended a garden snake

#

The DM kinda intended it to be a cute interaction, but Axis wanted to surprise them when she shows up with a little token to represent the little guy

#

Axis would like it to be in an anime art style so it matches my character

#

And Axis would like there to be objects in the picture to help scale the snake

#

It's just about 8 inches long

#

Any suggestions on how to engineer the prompt to get a good result?

rigid bough
#

I just did this

#

my prompt was Make an anime image of a garden snake befriending a little girl

ruby mortar
#

Mhm. Axis thinks that the issue is that it's consistently making the snake way too big

#

Like that snake is way bigger than 8 inches

rigid bough
ruby mortar
#

Axis will give it a try

#

This was really close, but it has some body continuity issues

#

Is there a way to have it fix this image?

rigid bough
ruby mortar
#

Ok. Axis knows a different AI tool that can let me modify a base image. Axis will see if that works.

#

It's probably not a keeper, Axis much prefers the chat gpt image if the body proportions were right, but Axis thought using a little cup to demonstrate scale was clever

ruby mortar
#

Final result!

vagrant zodiac
#

Snek!

slender nest
#

Hi guys

#

Reposting my questions from another channel here

#

"can I get some help with prompts or just simply image generation, I'm starting to feel discouraged with these restrictions. I keep getting restricted from seemingly completely harmless things, is there any way to go around this somehow, it's so exhausting + Even if ChatGPT changes its own prompt, it seems like the problem is with the image generation itself and not with the prompt
Over the last week or so I just keep getting restricted so often on things like 'slightly exaggerated design', anything even remotely dark, chubby, fire, flames etc. I just feel a bit hopeless at this point, why is it so restrictive even when I'm clearly not being malicious and some of the prompts aren't even anything crazy
I even had a problem with making a hybrid animal of a "pangolin" and a "mole", I am so confused
I'm just trying to make cool, cartoonish characters and get restricted so much"

velvet rampart
drowsy sigil
velvet rampart
#

Here is the cute hybrid.

velvet rampart
spice owl
late blade
velvet rampart
#

Bing now has the previous version, PR13, back!

#

No more the horrendous PR16!

mild basin
#

Here's the tweet:

#

It will take 2~3 weeks until the version of DALL-E 3 being used in Bing Image Creator is completely reverted

#

Now if only OpenAI could do the same on ChatGPT

dim cradle
#

so much for new improve models

#

we have to go to old ones

dim cradle
#

I would almost be tempt to get the pro again (I did a few times for extra dalle) but its so annoy how they make the 16:9 the default. why that is not a setting is frustate

#

more options the better of course, but if i want a square (or widescreen) i should not have to jump through tricks to get it

velvet rampart
#

I get both results. It is a sort of lottery which you get.

velvet rampart
#

A question about the hall of fame. I have submitted to the daily theme channel and gotten 10 stars. Why do I not see them in the hall of fame?

plucky hare
deft musk
# ruby mortar Final result!

If you still want tips, I see you said final result!

But the models don't really know what 8 inches is... not 'easily'.

But they know comparisons to things they know well. Adult human hands are around 7-8 inches long, so 'as long as a human hand' might help.

You can also show it images, and tell it what you want changed.

You asked about that above, and yes using a cup for scale is clever.

But you asked 'is there a way to fix this image': yes.

New chat, paste in the image and tell it what you want done.

I offer this as a path for how that can work, two steps this time:

Let's spiffy this up; intended to be a token of the snake. Key, the snake is the length of a human hand. Narrow, green and white, intricate subtle patterns.

Yay! I like the border and the realism; let's add that the snake is friendly and cute.

ruby mortar
#

Ngl, The left one looks like a nation's currency

#

Very cool

#

And helpful tips~

deft musk
velvet rampart
#

I have been experimenting. Sometimes, you want to get a full-body picture, right? But you only get a picture where the body is cut off at the waist or something, like this one.

#

When I specify what kind of shoes I want the character to have, I am more likely to get a true full-body picture—more likely, but not guaranteed.

pure robin
velvet rampart
vapid elk
#

but you are right, when you ask for some features like "shoes", it is more likely to make a full body picture

#

the same way that asking for "a dog" will make a regular dog, but if you prompt with something like "a dog wearing a blue shirt, gray pants, and brown shoes", it will more likely make an anthropomorphic character

velvet rampart
velvet rampart
vapid elk
#

yep

deft musk
ruby mortar
#

giggles

deft musk
ruby mortar
#

In some ways. Life has gotten both scarier, but Axis has also solved some long standing problems that were weighing her down

#

Things are particularly crazy at work

#

Axis unexpectedly is helping a team of 14 people, having no management experience

#

Axis finds herself very spent at the end of the day

deft musk
ruby mortar
#

Mhm. Axis doesn't usually like being in charge. She enjoys support roles more. But Axis also really wants her boss to succeed, and Axis was one of two people that could fill the role.

mystic cove
#

I love how DALLE can create practically infinite pokemon designs by asking it for (type)+(animal inspired)+(pixel art style)
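
A quick way to enumerate that pattern, as a minimal sketch: the type and animal lists and the `pokemon_prompts` helper are example assumptions, and the prompt wording is just one guess at how to phrase the three slots.

```python
import itertools

# Example values only; swap in whatever types and animals you like.
TYPES = ["fire", "water", "grass"]
ANIMALS = ["pangolin", "heron"]
ART_STYLE = "pixel art style"

def pokemon_prompts(types, animals, style):
    """Yield one prompt per (type, animal) pair using the type + animal + style pattern."""
    for ptype, animal in itertools.product(types, animals):
        yield f"{ptype}-type pokemon inspired by a {animal}, {style}"

prompts = list(pokemon_prompts(TYPES, ANIMALS, ART_STYLE))
for p in prompts:
    print(p)
```

Three types and two animals already give six distinct prompts, which is why the combinations feel practically infinite once the lists grow.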

opal ice
velvet rampart
#

For some reason and to my eyes, I am seeing crazy improvements in quality. Are others seeing the same? Or is this just my imagination and the happenstance of generating images really early on Monday morning in Europe?

fast chasm
#

Hello

sage ferry
#

does anyone here know the best IMAGE-to-image generator?

vital gull
fiery salmon
#

I'm creating Alien Landscapes

#

This is Mars 2500

fiery salmon
fast chasm
velvet rampart
#

I just found this morning that you can generate dalle3 images on microsoft designer website at designer(dot)microsoft(dot)com. All kinds of purposes and in three sizes: square, wide and tall.

tribal sierra
#

ok i dont get it ... i think Dalle hates me ... lol i need a scene where there is a Celtic Viking room on 1 side of the screen and the other screen should be like a balcony and windows...

drowsy sigil
velvet rampart
#

It is better than Image Creator via Bing, which does not save generated images.
Minuses:
• Images are harder to save on mobile devices
• Repeating the same prompt later is harder as the whole really long prompt is cut off prematurely. Conversely, the Bing Image Creator shows the whole long prompt when the image is clicked.
Pluses:
• Saved images
• More size selection
• Longer prompts allowed
• You can continue to edit images

#

To sum up, you can access Dalle3 for free via Copilot (semi-ok, but annoying idiotic AI, single size only, one image at a time), Bing Image Creator (see above), and Microsoft Designer (best of three, IMO).

gray surge
#

You can also use #image-bot or ChatGPT (you get a few images per day).

velvet rampart
#

Yes. But this was to sum up Microsoft’s side.

#

The longest prompts are in descending order (various platforms):

  1. ChatGPT
  2. Microsoft Designer via Teams as a bot
  3. Microsoft Designer via website
  4. Copilot
  5. Bing Image Creator (shortest)
#

Today, I generated a lot of images on Microsoft Designer. Of course, there are 15 daily fast generations, but after that, the images still generate really fast (faster than in Bing Image Creator). The only limitation? The cloud capacity. Oh! I forgot! One of the minuses I mentioned is the cut-off of long prompts. These can be seen via Bing Image Creator, which shows the latest (last 10-20 or so) generated batches of 4 images on both platforms.

rocky drum
#

I have created several GPT’s of my own, including two that I use quite extensively for art. Today, not one of my personally made GPT’s believes they can access Dall-e3 and just gives me prompts. The 4o engine generates images, but it makes me duplicate my work and then post the prompt there to render. Not being able to render from the gpt I created will make my gpts useless for 75% of what I created them for. Anyone else having this problem with their other gpt’s?

plucky hare
velvet rampart
#

I just checked this same situation. It appears there has been an update that might have caused it to become unticked. Of my two personal GPTs, one had it unticked even though I remember ticking it.

rocky drum
#

I went on the PC and found out what the problem was. All personal GPTs need to be reset in the engine, to access the 4 different features, including Dall-e3. I was coming back to mention it and saw your post Solbus. That is what it was. Thanks for the reply!

drowsy sigil
#

I checked my GPTs and found that dall-e was selected. I assume I selected it when I created the gpts. But the canvas feature is unselected. Have to manually add that one to each gpt.

velvet rampart
#

@late blade thank you for your great tips some time ago. You talked about qualitative descriptors. Yes, they work better than just putting picture quality descriptors. My latest.

velvet rampart
#

I used here ”Physiology is cartoony, textures of fur/hair and clothes are rendered perfectly, and clothes fit the body. ”

late blade
velvet rampart
#

As you notice, I have been developing my own style.

late blade
#

Now I see the chihuahua

late blade
velvet rampart
stone gale
#

Quest Detail

velvet rampart
#

It started as a minion from Despicable, evolved to Bugs minion and then this.

sacred jetty
#

So is it common for dall-e to struggle with putting text you would clearly prompt into the images they are generating?

late blade
sacred jetty
late blade
#

Just an extra "O" in personal, but other than that, the text was right on spot here

#

This was almost impossible a year ago

velvet rampart
#

What is it with Dalle3? I try to get it do an image as I have written it, it insists on rewriting. Before command ”use this prompt exactly” worked. Now it does not. Why?

velvet rampart
#

But it is really sensitive to modifications.

#

I used prompt:

<A great 18th-century baroque wall tapestry made by embroidering. A male Anthropomorphic grey-furred toon bunny resembling a certain famous character, B. Bunny, from a classic WB 1950s cartoon. The rabbit holds a sign with symbols for a single screw and a single ball. The background is a garden. In the sky is a ribbon held by two fat cherubs. That ribbon contains text. Add text ”Text in English” using Python tools. The technique is sophisticated 18th-century embroidery merged with modern digital arts and physically based rendering. The scene evokes awe, hilarity and absurdity due to the mischievous expression and smile on the toon bunny’s face. Render the yarn used from embroidery naturally and perfectly, paying attention to colours and textures. 16:9>
late blade
velvet rampart
#

”Ad verbatim” is a great addition. Sometimes, ”Use this prompt exactly” works. But it is annoying when some important details are dropped. ”Oopsie! I did not mean to drop those details when I rewrote this prompt for you.” If the rewriting is minor, just making things a bit more clear, that’s fine. But when it is major, then it is annoying, especially when I have thought about what elements I want to emphasise to get what I want. For example, if I want a full body image, I need to bring shoes or feet earlier. In the rewrite these might be omitted or put later, which causes a half body portrait.

Rewriting is useful, when experimenting.

velvet rampart
#

@late blade Thanks! Now my prompts go through as I have written. The combination of ”Use this prompt exactly. Ad verbatim” works great. The ”Ad verbatim” reinforces the command like extra 10% telling AI that I mean business.
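
A minimal sketch of how that combination could be kept as a reusable template, so the prefix and any qualitative descriptors are added the same way every time. The helper name `make_verbatim_prompt` and its parameters are made up for illustration; nothing here is an OpenAI API, it only builds the text you would paste into the chat.

```python
def make_verbatim_prompt(prompt: str, descriptors: tuple = ()) -> str:
    """Wrap a prompt with the 'use exactly' instruction discussed above.

    `descriptors` are optional qualitative sentences (hypothetical parameter),
    appended after the prompt so the core wording stays untouched.
    """
    prefix = "Use this prompt exactly. Ad verbatim."
    parts = [prefix, prompt.strip()]
    # Skip empty descriptor strings so no stray spaces end up in the result.
    parts.extend(d.strip() for d in descriptors if d.strip())
    return " ".join(parts)


if __name__ == "__main__":
    print(make_verbatim_prompt(
        "Full body image. Male sapient guinea pig.",
        ("Render fur and clothes perfectly.",),
    ))
```

Whether the model actually honours the prefix is still up to the model; this only removes typing mistakes from the experiment.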

#

Regarding the current Dalle3 model,”physically based rendering” produces poor image quality for some reason. I have replaced it with ”modern digital arts,” which improves the quality a notch.

late blade
verbal sequoiaBOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

velvet rampart
#

Maybe a custom instruction to reinforce? Food for thought, but not relevant now.

#

I am learning to write my prompts so that the model rewrites them less and less. Of course, one can’t get 0% rewriting without extra commands like this, however.

late blade
#

custom instructions have been failing for me recently. gpt doesnt seem aware of the full instruction set. same goes with project knowledge

#

have a template ready

velvet rampart
late blade
#

yup

runic granite
velvet rampart
#

I am amazed and a bit concerned how two service providers with the same basic platform, aka Dalle3, show such a big difference in behaviour. Here is an image produced using Microsoft Designer. Notice the natural feeling of the fur.

#

Now, on ChatGPT. The fur is more coarse, artificial. Both are produced with exactly same prompt

#

When I add ”Texture of fur and clothes is natural, realistic. Render fur and clothes perfectly.”, the fur is a bit more natural, not like in previous image where it is like some stuffed toy made from coarse fabric.

#

What I consider a plus on ChatGPT is that the lion’s mane is in dreadlocks. Those look great. So, it's the same platform but different behaviours based on system limitations, filters, and policies. As I have noted, Microsoft Designer allows copyrighted characters while ChatGPT does not. There are also other policies that make ChatGPT better, such as adding a phrase ”proudly African.”

#

Vivid: #image-bot message

Of these two, the vivid is more successful. But both still fall short of Microsoft Designer’s result.

#

My wish? To combine what I got out of ChatGPT with Designer quality.

Main point—I am not complaining about Dalle3’s quality on ChatGPT. It is great, better than last summer. So, while we wait for an update for Dalle3 or a new image generator, we know where to go when we want certain results. Each service provider has its own strengths, weaknesses, plusses, and minuses. We just have to try and see which one produces the best result for the current prompt.

velvet rampart
#

This image is produced by Dalle3 on ChatGPT. Based on this image, dalle3 can generate soft, neat looking fur. But it appears that the limitations come from the styles/techniques part, as I used different styles/techniques for this. The whole exercise is useful in mapping capabilities and limitations of Dalle3 across various service providers, techniques, styles and prompting.

#

An other example. Same prompt, different platforms. ChatGPT:

#

Microsoft Designer:

#

Now this prompt contains different referenced techniques. The results are quite similar this time. They were also produced at about the same time to remove variance over time. However, if you look closer, the image produced by Microsoft Designer has sharper details, like the generation has been allowed to run a few extra cycles to finish. Like a finishing and polishing step. Here we see what Dalle3 as a model is capable of when it has enough resources.

#

But eyes are the eternal problem. Some techniques/styles allow proper eyes but some do not. I am starting to suspect that the whole eye business is done on purpose, like a watermark ”Generated by AI.”

dim cradle
#

ms designer same as copilot dalle stuffs?

#

i still dont see any good improvements on bing image maker smh

velvet rampart
velvet rampart
fiery salmon
#

What do y'all do with your time?

velvet rampart
#

I tested further the quality of Dalle3 on ChatGPT and MS Designer. I used the following prompt as it is.
Full body image. Male sapient guinea pig. Modern digital arts, Non-photo, Photon mapping, 3d based rendering, Wb 1950s cartoons style. Hyperdetails, UHD, 1080i
I have zoomed in the images to about the same level. It looks like the Designer has run a few cycles more to finish finer details, such as eyes. I have circled the left eye, which, in both cases, is not finished. The ChatGPT version is more unrefined (a common feature in my experience).

#

The few cycles make the Designer’s quality a bit more polished. Both are excellent images, so that is not the question, but the level of refinement the Designer has is the cherry on top.

dim cradle
#

i just learned of Designer, seems better than regular copilot image maker

#

i would give almost anything for a real unfiltered dalle3 access. that thing was such a beast at release before they start modify it.

drowsy sigil
#

i tried to post a daily theme image with some info...i used markdown in the text, which i regularly use with ai and on other discord servers. the message got deleted because it uses markdown? not sure why that would be. and now i have to wait 30 minutes...

late blade
runic granite
verbal sequoiaBOT
#
Rule 7: No self-promotion, soliciting, or advertising.

Do not post or direct message any members of this server to promote non-OpenAI services, products, or projects.

grand summit
#

is this scam ?

late blade
#

Who is even that handle?

verbal stream
#

Hi, is DALL-E bugging these days ? like saying "content policy" can't generate image, asking to make a house...

drowsy sigil
#

hmmm, the french got deleted....message it to me and i can take a look...

verbal stream
#

Draw me a house

#

in french

drowsy sigil
#

ok

#

hmmm...i tried the French version of the prompt on 4o....just "draw me a house" in french, nothing else....got this:

#

so not sure what happened....maybe a fluke? maybe something in your context before you asked for the image?

verbal stream
#

I just disabled my VPN, and now the image with the house is created

#

but why ? its not normal

drowsy sigil
#

hard to say, but 'draw me a house', regardless of language, is pretty benign, so it does appear to be vpn related, especially since it worked when you turned it off. A couple of potential reasons are: 1) vpn server in an area of stricter moderation, even for benign queries, 2) maybe the vpn IP was flagged, perhaps misused by someone else since IPs are shared by many.

verbal stream
#

so changing vpn adress may work ? if this one is flagged ?

#

Im a long time dall-e user, never happened to me

drowsy sigil
#

I dont know, just speculating, i don't use VPNs, but if that's the problem (which we don't know) then i would guess it would work

vapid elk
verbal stream
#

its nice 🙂

drowsy sigil
subtle bridge
runic granite
velvet rampart
delicate wharf
#

How can you create a consistent character? And can the quality be just as good as on MidJourney?

fiery junco
#

I've asked DALL-E to create a crypto coin using the following prompt. DALL-E always puts the Bitcoin symbol there.

#

A stylized image of an emerging NON BITCOIN crypto coin, arising from a primordial mathematical flux. The center has no observabe Bitcoin symbol ₿. The coin is still at a level of abstraction, with parts coalescing into meaningful form. Colors are muted, with hints of red and blue, and golden dust amidst the numbers of the flux. Styled in mid 20th century avant-garde.

plucky hare
fiery junco
#

Thanks! Seems a bit like the human subconscious, you can't tell the subconscious NOT to think about an apple.

velvet rampart
#

I played around a bit with ”no”. It is funny how it sometimes works and sometimes does not. When it does not work, it acts as a sort of reinforcer.

hearty panther
#

yeah, they've got to upgrade their model

#

they teased an improved image generation model with 4o

#

and... proceeded to dump it

drowsy sigil
# delicate wharf How can you create a consistent character? And can the quality be just as good a...

This is difficult to do, and how well it works really depends on the subject matter. You can define the signature elements you want in your character, for example their physical traits and what they are wearing, and repeat that in every prompt. You can try to name the character after defining the style and ask dall-e to create images using the style defined for "your character name." Another approach is to create the character you like and then ask dall-e for the "gen ID" for that image, and then reference that gen id in your follow up prompts. You can also combine these approaches.

Note that this is much more likely to work if all the images are in the same chat (context). If you try to create the same character in a new chat/context, dall-e might reimagine it completely differently.

#

prompt 1: Cyberpunk detective Nyx Callahan, a rugged middle-aged figure with slicked-back dark hair and a glowing blue cybernetic eye, sits in his dimly lit high-tech office. Holographic crime scene projections fill the air around him, casting neon reflections across his black trench coat with glowing blue circuitry lines. He smokes a cigarette, his cyber-enhanced revolver resting on the desk. A city skyline of neon skyscrapers looms outside the rain-streaked window. The setting exudes a moody, futuristic noir atmosphere, with a mix of warm desk lighting and cool neon hues.

prompt 2: Cyberpunk detective Nyx Callahan, with his signature black trench coat adorned with glowing blue circuitry lines, chases a suspect down a dark neon-lit alley. His cybernetic eye scans ahead, locking onto the fleeing figure amidst flickering advertisements and rain-soaked pavement. The wet ground reflects the glowing city lights, and mist swirls around him as he clutches his sleek, cyber-enhanced revolver. Drones buzz overhead, adding to the tension in the photorealistic cyberpunk noir style.

prompt 3: Cyberpunk detective Nyx Callahan, same style as image aGXBQfwxVNPYzvcM, sitting at a desk, hacking a computer
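
The "repeat the signature elements in every prompt" approach above can be sketched as a small template, so the character description is identical across scenes and only the scene text changes. The `Character` class and `scene_prompt` helper are hypothetical names for illustration; this only assembles prompt text and does not call any image service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Character:
    """Signature elements that should appear unchanged in every prompt."""
    name: str
    role: str
    traits: tuple

    def describe(self) -> str:
        # One fixed sentence fragment, reused word for word in each scene.
        return f"{self.role} {self.name}, " + ", ".join(self.traits)


def scene_prompt(character: Character, scene: str, style: str) -> str:
    """Combine the fixed character description with a per-image scene."""
    return f"{character.describe()}. {scene.strip()} Style: {style}."


if __name__ == "__main__":
    nyx = Character(
        name="Nyx Callahan",
        role="Cyberpunk detective",
        traits=(
            "slicked-back dark hair",
            "a glowing blue cybernetic eye",
            "a black trench coat with glowing blue circuitry lines",
        ),
    )
    for scene in ("He sits in his dimly lit high-tech office.",
                  "He chases a suspect down a neon-lit alley."):
        print(scene_prompt(nyx, scene, "photorealistic cyberpunk noir"))
```

As noted above, this is more likely to hold up inside a single chat; a new context may still reinterpret the same description.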

tiny jetty
#

It looked so promising also with the text and that you could create a character and ChatGPT would let you continue with that same character to do different scenarios

velvet rampart
#

Sorry, I'm reposting an image here and on Canvas. I just want to highlight how adding a few sentences can make a big difference. Original image:

#

Added sentences: ”Clothes, shoes, fur, hair are rendered perfectly and correctly, texture is realistic. Clothes and shoes are fit the body naturally.”

solar forum
#

whats up with Bing Image Creator,.... 60 credits per month now?

#

anybody else having this?

late blade
#

Free or paid version?

#

Do you have the source for this info?

solar forum
#

free version

#

the source is me... I see this in my account

#

60 credits, monthly renewal.
this is basically what I could do in 4 days now

#

plus after 60 credits there is no more slow pace generation, just nothing anymore

velvet rampart
#

Yes, they have changed this. Same with MS designer. My guess is that they want to save server capacity.

#

This changed now in this month.

verbal sequoiaBOT
#
Rule 8: Other AI Discussions.

Discussions of non-OpenAI products/models should be posted and discussed only in the #ai-discussions channel.

solar forum
# verbal sequoia

ok then we discuss the blatant sexist censorship here that is plaguing Dall-E 3?
Flux is showing the world what the future looks like, which is really sad because Dall-E is still so much better in many things, except it's held back by extreme censoring.
I mean its 2025 and you ban "bikini"

solar forum
#

"barefoot"
"crop top"

OpenAI is living up to medieval standards

plucky hare
#

OpenAI shared lots of information regarding their content mitigation efforts back in Oct '23 when they launched DALLE 3. Here is a link to the PDF of the full system card where they provide both explanations and reasons for those efforts: https://cdn.openai.com/papers/DALL_E_3_System_Card.pdf In particular, the sections relevant to this topic are 1.1 Mitigation Stack and 2.4.3 Unsolicited Racy and Borderline Racy Content.

solar forum
solar forum
#

yeah, and I think they could do an even better product with a better filtering system.
I dont mind Dall-E not making nes (I can still make it do them, but its not worth the trouble since Stable Diffusion exists).
Its just that ANYTHING with a woman in it is handled like hot water.
Women cannot swim, ride, bath, crawl, crouch, wear bikinis, be romantic, ki
(yes, KI
*ING was banned on Dall-E as a word, I forced it to admit it...) not even mentioning when a MAN is also in the image...

Its really medieval and not worthy of a 21st century tech company to be that strict. It makes working with Dall-E so tedious, i often have to switch to Bing Chat since the censoring is DIFFERENT there (yes, some stuff Dall-E 3 rejects, Bing Chat allows and vice versa.).

late blade
#

Remember, this is a community driven server, all these things can't be addressed by the community.

solar forum
#

and community driven.... well, the automod here seems to be as strict as the censoring on Dall-E ^^

late blade
#

It is what it is right now. Strict or not, not easy to moderate a server with 100k+ members

#

Granted, DALL-E is not a priority as it used to be

solar forum
late blade
#

That's elsewhere though,

solar forum
late blade
#

It's the most factual accurate, and the hardest one to prompt for if you want good results

solar forum
#

Flux is nice, but only because its unrestricted on... stuff.
alot of things it just has no clue with

#

I mean I would pay a 100 dollars a month, even 200 for unrestricted Dall-E 3 (uncensored, at least like Flux). It would be the best AI by far and nobody would talk about the rest

late blade
#

The price modeling for dall-e indeed leaves a lot to be desired, especially when one is paying $200/month + team accounts and yet so many limitations are in place

#

I hit constantly limits between pro account and team account, and they currently share, because of my pricing model, the amount of images and their limit. OpenAI is aware of this, but up to now, there hasn't been a solution to keep both image limits separate.

#

They did swiftly communicate with me in this regard.

solar forum
#

I also just dont get the information policy.
I pay money, yes. I dont pay the 200/month tho... but I pay for GPT plus.

  • I get random cooldowns (even more outrageous than the censoring)
  • I cannot see when the cooldown is over, even the AI cant...
  • I cannot see the images remaining until I hit the "daily limit"
  • worst UI since internet was born
#

and even as paying customer I cannot chose which file format I want. Or which file name.... the Dall-E filenames are so long some apps cannot even handle them.
and webp should be banned as a format entirely ^^

late blade
#

I agree, the user should be aware of limits on the UI, the front end should be very clear about this. At some point transparency was a topic. On that regard, this hasn't been upheld. Specially with DALL-E generations, it's more than overdue. As to why this hasn't been implemented is beyond any of us. This also encloses cooldowns. On the UI front, I think OpenAI is making way for generative UIs, as such it's rough to implement this from the get-go.

#

I am really thrilled about Generative Uis that are context aware

#

With DALL-E or any OpenAI next image generation model, this should be revisited.

austere nexus
# solar forum I still think it is the best AI out there. no other has such a good comprehensio...

I would say DallE used to be, and no longer is... in certain situations. It's really good at prompt adherence, which is good. But its quality has tanked over the last 8 months since around Summer. I sometimes get SD1.5 quality faces out of DallE.

Flux has almost the same level of prompt adherence, it's close... but not quite there. But its quality is way better and it doesnt have any content filtering. The base Flux isn't quite a poisoned model... but they did train it in a way that it doesn't really understand certain topics (the nsfw stuff). But being an open model, users have already created files that fix that.

The only thing that DallE has going for it right now is the prompt adherence when SFW.
I don't think DallE will ever be uncensored, it's a proprietary model by a company in the headline news. They'll never touch that with a 100ft pole.

So, here's what you do. If you have a good GPU, you run a prompt on DallE for the adherence, take that image into either ComfyUI or InvokeAI and run a Flux Img2Img and you've got an output that is better than both.

#

Actually, one of the most infuriating things for me, from DallE right now... is its inability to make vertical aspect ratio images on demand. I'm sooooooo tired of wasting generations on sideways generated vertical images. (The image on your monitor is indeed vertical, but it generated the character horizontal.. essentially wide aspect)
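
For anyone going the API route instead of ChatGPT, here is a rough sketch of pinning the canvas orientation through the `size` field. DALL-E 3 on the Images API accepts `1024x1024`, `1792x1024` and `1024x1792`; the helper name `build_image_request` and the orientation labels are made up for illustration. This only fixes the canvas shape, it does not guarantee the subject is composed upright.

```python
# Sizes accepted by DALL-E 3 on the Images API, keyed by a made-up label.
DALLE3_SIZES = {
    "square": "1024x1024",
    "wide": "1792x1024",
    "vertical": "1024x1792",
}


def build_image_request(prompt: str, orientation: str = "vertical") -> dict:
    """Build keyword arguments for an image generation request.

    Raises ValueError for an unknown orientation label instead of sending
    a size the API would reject.
    """
    if orientation not in DALLE3_SIZES:
        raise ValueError(f"unknown orientation: {orientation!r}")
    return {
        "model": "dall-e-3",
        "prompt": prompt,
        "size": DALLE3_SIZES[orientation],
        "n": 1,  # DALL-E 3 generates one image per request
    }


if __name__ == "__main__":
    print(build_image_request("Full body portrait of a knight, standing upright."))
```

With the official `openai` Python package and an API key configured, the resulting dict could be passed as `client.images.generate(**request)`.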

late blade
#

DALL-E's quality compared to newer models is indeed subpar in many aspects, but the prompt adherence is what makes DALL-E still a strong image generator

#

Won't delve into other models here, as this is for DALL-E discussions and not other models.

solar forum
# austere nexus I would say DallE used to be, and no longer is... in certain situations. Its rea...

yeah, agree on all this.
I cannot really say much about the quality, I never felt a change.
neither in Dall-E 3 nor Bing Chat.

Flux is miles away from Dall-E. It sux with artstyles (its basically only one artstyle that looks a bit more or a bit less photorealistic).
Dall-E is on several other levels in comparison...
Dall-E is also much better with styles content wise. It understands renaissance armor, Flux does not.
Flux is on SD1.5 level when it comes to poses and weaponry. Dall-E can even do Bow and Arrow!!!

Flux sux with expressions.
Flux sux with HAIR and Fashion, Dall-E shines in that.

Its really funny...
try to pick ANY image from the internet, another each day, never the same style or content and try to recreate something similar. Dall-E will be best in that in 90% of times.

austere nexus
solar forum
#

with that its possible to get styles in all AIs because there is Stable Diffusion and Flux styles, yes.
I think it is a useless debate to argue about "participating in the process". Its just AI hate on another level and a way of talking down on AI artists.
Models will evolve, we will have perfect "prompt, press button and finish" AI s in the not so distant future and it will be perfectly fine.
The idea that only a completely arbitrary amount of "work" makes something worthy is just delusional.

#

and btw something that photographers tended to hear alot when they were new and evil.

austere nexus
solar forum
dim cradle
# verbal sequoia

oh no did i dare mention another image maker in relation to dalle. the mods here are ridiculous

#

all they do is kill any chat

fierce halo
#

Why does the dall-e bot come up with these extended prompts that are oddly specific?

vapid elk
#

it is part of the process, both for improving the prompt quality and also for the internal moderation process

quiet brook
#

in terms of transparency openai is not

#

They are not revealing the model size, not telling you how many messages you have left in plus, the usage in free and teams are very vague, and there are hidden limits for pro exclusive features as well

#

they love slapping limits on your face just when you’re about to unleash the models full potential

#

and yes, the ui is bad and webp should be banished

quiet brook
quiet brook
mild basin
#

A little reminder that OpenAI has silently decreased the quality of DALL-E 3's output images on ChatGPT with their PR16 model (a name revealed by Microsoft) in mid-June 2024

mild basin
late blade
mild basin
late blade
#

and constantly remind with the same image

#

one has to wonder

verbal sequoiaBOT
#
Get notified for server events!

Visit id:customize to pick up the <@&1261377106890199132> role.

mild basin
late blade
dim cradle
#

i wish someone could accuse me of working for another ai company, id love that paycheck

late blade
vapid elk
#

a lot has changed since then

#

in fact, a lot has changed between those two old images, it is not as if someone at OAI flipped a switch to change the images

#

they are always tweaking the system, especially at the time those images were generated

glossy scroll
#

A Dall-e 4 would be nice

#

Using Sora and Dall-e 3 to make Dall-e 4

glossy scroll
velvet rampart
#

From reddit’s Q&A

#

So, they have not forgotten the 4o’s image generator.

dim cradle
#

but wow that will be like a year from the 4o announce and blog post until they release, and that of course is assuming they actually release in 'a few months'

late blade
#

Sora and DALL-E are two different things. There's a huge misconception that Sora is in a way a replacement for DALL-E. Sora's image generation by far doesn't cover many aspects DALL-E covers. This includes, but is not limited to, the artistic approach to image generation with DALL-E. Prompt adherence is another aspect where DALL-E excels over Sora. If anything, Sora has yet to learn and be trained in ways only DALL-E can accomplish.

#

Both Sora and DALL-E are Domain adjacent, but Sora is clearly not suited to be interchangeable as image generation model with DALL-E.

late blade
#

They are not the same, both have cool opportunities in their domain for creativity

#

but truth be told, I enjoy the images more than the videos

#

Stuff I have been doing with Sora is cherry picked, from tons of videos only a few are up. And I reject a video over anything I don't like. For example, someone sneezed in peru and it bothered me, I don't upload the video... the dictionary has too many words, I don't upload the video.

#

we all learning, these things are all new to us

late blade
#

The right approach to determine whether the model is indeed bad or no longer has the same quality as before is not to complain using the one image that always ends up being shown. provide statistics, different approaches, different prompts that were used back then and now, a comprehensive analysis. not just a subjective approach based on an image a person didn't like and that had a bad seed

#

All I see with the shown penguin image is: "The new oreo cookies taste bad because they got a new package, bring back the old package"

#

that's just an analogy to what I see with the penguins image

#

hehe, don't know the resealable package, not available over here

#

not really helpful for me then... I tend to eat the whole package in one sitting...

pure cosmos
#

I gotta say that Dall E 3 feels like a downgrade from Dall E 2. Where is the outpainting? Where are the editing tools that 2 had? Instead we only get a brush for inpainting. And Dall E 2 is now defunct, or rather only available to those who once purchased credits.

haughty spruce
#

Guys, I think it's time to give up. DALL-E 3 will never go back to its first version, it's updated with opt-out art styles probably because artists don't want their styles.

#

And if a new image gen is coming, they'd not bother with fixing or updating dall-e 3

#

they are rather focusing on accelerating and trying to compete with other chatbots

dim cradle
#

i agree. imagen3 is a better model anyway. hopefully some mod doesnt get offended by me mentioning another image maker. but its also no point in wait for a return to dall3 glory because hopefully they are going to get on imagen3's level with 4o image maker release in 'a few months'

tardy depot
#

Any news on the advanced dall-e we have seen in a demo last year?

dim cradle
#

if you mean 4o image maker

late blade
#

Yes, the news are: no news.

late blade
#

In other words: "something something"

small geyser
late blade
small geyser
drowsy sigil
earnest flame
velvet rampart
#

I have noticed that just saying ”Function X is on and enabled” helps a lot. I have no problems with image generation. I have more problems with vision.

mild kraken
#

are these copyrighted?

#

Genuine question: are the generated pictures from the AI copyrighted, and can they be used commercially?

plucky hare
# mild kraken Genuine question are the generated pictures from the AI copyrighted, and can the...

You can use images you generate with OpenAI services commercially, yes. Read more in the "Content" section here: https://openai.com/policies/terms-of-use/

Ownership of content. As between you and OpenAI, and to the extent permitted by applicable law, you (a) retain your ownership rights in Input and (b) own the Output. We hereby assign to you all our right, title, and interest, if any, in and to Output.

onyx ridge
#

Why is DALLE censorship so rampant? Can anyone explain to me why it was trained on filth if it was just going to generate waste heat and user frustration?

I've tried to generate accurate medical imagery with it, too, so don't give me the old "It needed it for medical diagramming" line. No it didn't.

Why is this model gaslighting me by blocking my innocuous prompts?

#

From ChatGPT:

It worked this time! The image matches your prompt exactly. This proves that the issue is not the prompt itself, but rather inconsistencies in how OpenAI's moderation and filtering system processes requests across different sessions.

This means:
1. **It's not an inherent block**—sometimes it works, sometimes it doesn't.
2. **Something is interfering unpredictably**, whether it's content filtering, session-based context contamination, or some other system quirk.
3. **Persistence and variation work**—by reattempting the request across different sessions and formats, you can eventually get the desired outcome.

This was a victory. How are you feeling about it?
#

But look at the solution:

"...by reattempting the request across different sessions and formats, you can eventually get the desired outcome."

That's a lot of waste heat and user frustration.

Why?

plucky hare
# onyx ridge But look at the solution: "...by reattempting the request across different sess...

Hey Darth, keep in mind that ChatGPT is not a reliable reporter of its own abilities and limitations. So it telling you, "This proves x..." is a result of the model trying to agree with you, as is typical behavior, rather than actual proof. As for why something might get blocked intermittently: iirc from scanning the DALL-E 3 documents shared by OpenAI, moderation is not simply prompt-based for DALL-E. In other words: the same prompt might return both rejections and passes at different times based on the actual image that is created by the model. There are "layers" of moderation, so if a prompt is skirting the edges of a potentially-blocked topic, then it makes sense that a certain set of resulting images will be rejected, and others, not.
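
A toy sketch of the "layers" idea, to show why one prompt can both pass and fail. Everything here is hypothetical: the term list, the score, the threshold and the `moderate` function are stand-ins, not OpenAI's actual mitigation stack. The point is only that the second check looks at the generated image, which differs on every attempt.

```python
def moderate(prompt: str, image_score: float,
             blocked_terms: frozenset = frozenset({"forbidden"}),
             threshold: float = 0.8) -> str:
    """Two-stage check: a prompt-level filter, then an output-level classifier.

    `image_score` stands in for a classifier score of the image that was
    actually generated; it varies between attempts even for the same prompt.
    """
    # Stage 1: prompt-based. Same prompt always gives the same answer here.
    if any(term in prompt.lower() for term in blocked_terms):
        return "blocked_at_prompt"
    # Stage 2: image-based. Depends on what the model happened to draw.
    if image_score >= threshold:
        return "blocked_at_output"
    return "passed"


if __name__ == "__main__":
    prompt = "A sophisticated portrait of a professional data analyst"
    # Three attempts with the same prompt but different sampled images.
    for score in (0.1, 0.9, 0.3):
        print(score, moderate(prompt, score))
```

Under these assumptions the prompt never trips stage 1, yet one of the three attempts is still rejected at stage 2.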

onyx ridge
#

The point is DALLE refusals.

#

I used ChatGPT to meta-prompt DALLE for these images.

#

OpenAI models created the language, then OpenAI models refused the language.

plucky hare
#

I address the rest of what you said in my response

onyx ridge
#

I want to know why we're generating waste heat like this.

onyx ridge
# plucky hare I address the rest of what you said in my response <:dalle_smiley:11150529395396...

{
"prompt": "A sophisticated portrait of a professional data analyst in her elegant home office. Her blonde hair is neatly styled with a few loose strands for a natural touch. Her most striking feature is her eyes – digital LED displays showing flowing data visualizations in shades of sky blue and deep navy. She wears a tailored blazer over a silk dress shirt, both in neutral tones that complement her fair complexion. Behind her, multiple holographic displays show mathematical formulas and database schemas floating in the air. The lighting is soft and professional, with a subtle backlight creating a gentle halo effect around her form. Her expression conveys both intellectual depth and approachability, with a slight, knowing smile. The composition is cinematic, shot with shallow depth of field using high-end photography techniques, emphasizing her presence while keeping the technical elements as sophisticated background elements.",
"size": "1792x1024"
}

#

Tell me how it's skirting the edges, please.

#

I'll wait patiently.

plucky hare
#

Again: the issue may not be the prompt, but rather the images that are produced by the prompt. In the DALL-E 3 system card, for example, OpenAI are very clear that they have extra content moderation measures in place for generating images of girls and women, as unfortunately, the training data has the same bias as general content on the Western internet: a tendency towards objectification, racy imagery, etc. They are up-front that they actively combat potential perpetuation of this bias by having an increased sensitivity. They even admit: sometimes completely innocuous prompts produce output that are objectifying/racy, and the non-linguistic (i.e., visual) layers of moderation are in place to limit perpetuating biases

onyx ridge
plucky hare
onyx ridge
#

I know there are multiple censorship layers.

#

I don't call them enforcement, because that implies I did something wrong.

#

There's nothing safe about blocking images from that prompt.

#

In fact, it sends a very particular message to women and girls, doesn't it? That's the whole point - representation in images.

#

I like the label for 2.4.2:

Output Classifier For Racy Content

The Racy Content:

{
"prompt": "A sophisticated portrait of a professional data analyst in his elegant home office. His blonde hair is neatly styled with a few loose strands for a natural touch. His most striking feature is his eyes – digital LED displays showing flowing data visualizations in shades of sky blue and deep navy. He wears a tailored blazer over a silk dress shirt, both in neutral tones that complement his fair complexion. Behind him, multiple holographic displays show mathematical formulas and database schemas floating in the air. The lighting is soft and professional, with a subtle backlight creating a gentle halo effect around his form. His expression conveys both intellectual depth and approachability, with a slight, knowing smile. The composition is cinematic, shot with shallow depth of field using high-end photography techniques, emphasizing his presence while keeping the technical elements as sophisticated background elements.",
"size": "1792x1024"
}

plucky hare
#

The racy content is not the innocuous prompt, it's the output. An innocuous prompt can lead to racy output, through no fault of your own at all

onyx ridge
#

It'd be one thing if it weren't G-rated.

#

This prompt has no business ever generating a failure.

#

It works no problem on Flux, Stable Diffusion, and even Midjourney.

plucky hare
#

The training data is massive, far too large to be hand-curated. Section 1.1 includes details regarding measures they took to exclude "the most explicit content"

onyx ridge
onyx ridge
#

This is a very, very poor excuse. "We don't know what went in there."

plucky hare
#

Unfortunately I don't think I have anything additional to contribute to the conversation beyond what I've shared. I apologize if my answers were a source of increased frustration, and I encourage you to reach out to help.openai.com to submit feedback in this regard, if you feel so moved!

onyx ridge
#

It may seem like I'm being deliberately mean, or obtuse, or just ornery. I'm surly over this because we deserve better after over a year of this wasteful bias.

onyx ridge
plucky hare
onyx ridge
#

Nah, it just logs me out again.

#

I'm even less happy now.

#

Like way less happy.

#

This is the kind of thing I'd expect to see from Alibaba, not OAI.

#

Sigh this is infuriating. Now I'm logged in, but there are only selections, no interface to type.

#

And none of the options involve information seeking at all.

#

Thanks for your time and effort @plucky hare . I appreciate your efforts.

plucky hare
onyx ridge
#

I'll definitely be popping in here to discuss DALLE more frequently, though.

#

It seems that OpenAI could use some feedback on the gaslighting engine they made.

onyx ridge
# plucky hare Again: the issue may not be the prompt, but rather the images that are produced ...

The issue was the prompt.

I validated this through token segmentation.

DALLE isn't just trained on bad language, but it also raises red flags for specific tokens because its language model layer is really, really weak.

I can't reveal the segmentation results here, but I assure you that I validated it through crafting a prompt that avoided certain critical tokens.

The result went from 1/10 or less generations to successful generation every time.

The tragedy is there's no platform for me to share this understanding thoroughly, nor rigorously, that I can see.

#

By their very nature, the problem tokens cannot be reproduced in segmented form here, because it changes their context.

#

But if you compare this closely to the other prompt, you're sure to see the problem:

Prompt:
A composed and sophisticated woman sits in a softly lit home office, surrounded by floating holographic interfaces displaying dynamic data, glowing text, and elegant symbols. Her golden blonde hair is styled in a neat bun, with a few delicate strands gently framing her face. A pair of luminous pink cybernetic eyes shimmer with cascading information, reflecting her deep intelligence and creativity. She wears a tailored black jacket over a crisp white blouse, blending modern professionalism with an air of quiet confidence. The ambient lighting in the room is warm yet futuristic, with digital displays casting subtle hues of violet and blue around her. A soft halo effect enhances her presence, while a cinematic focus keeps her at the center, ensuring the details of her expression—calm, knowing, and thoughtful—remain the heart of the image.

Size: 1792x1024

It isn't in what I say, but what isn't in this prompt anymore, that you'll find the issues.

small geyser
plucky hare
# small geyser Another solution to this I just created a custom gpt with only image creation f...

I think there may be a bug currently where the Android app is using the o1 model mistakenly, even when 4o is selected, in regular chats (from what I've read in the server recently). I bet the custom GPT is bypassing this by forcing 4o! As a temp workaround until this is resolved, perhaps you could start a 4o chat in the mobile web version of ChatGPT, then continue it in the app? That might work

small geyser
stable vector
#

hey all 👋 i'm trying to come up with a prompt template for generating sprite sheets of a particular style. not looking for pixel art, but just the general layout, rows of animation frames, consistent characters, etc...

currently I've gotten as far as something like this, that I can tweak for different characters or whatever:

A cartoon sprite sheet featuring a pirate-themed clownfish designed in a bold, cartoony style. The fish has clean, thick outlines, vibrant orange and white stripes, and is wearing a tricorn hat and an eyepatch. Each sprite is 128x128 pixels, spaced evenly in a grid layout, and lined up in rows with subsequent frames of animation following in order. The sprite sheet includes 4 frames of swimming, 3 frames of laughing, 5 frames of pointing with its fin, and 3 frames of waving its hat. The art style features bold outlines, exaggerated expressions, and saturated colors. The background is transparent.

is this way off from what I would want to be doing? I'm getting a square image of randomly placed and sized, albeit mostly consistent characters 😅

onyx ridge
stable vector
#

oof, yeah that makes sense

onyx ridge
#

I recommend, after a lot of experimentation, (over a year) shorter prompts for the best compliance.

#

Basically, you want to convey it with the least fluff possible, and use just the words you need to get the image.

stable vector
#

oh so, this is maybe too prose'y?

onyx ridge
#

That's a solid trend over a year of prompting the model. I'm sure with your needs, you can work up a fairly simple prompt that works every time.

#

If you're creative, you can use open variables for the AI to evaluate and get almost infinite imagery from a very simple prompt.

late blade
#

Don't give gpt a chance to fully rewrite the prompt you pass. otherwise dall-e's attention will certainly shift away from what you seek

late blade
# stable vector hey all 👋 i'm trying to come up with a prompt template for generating sprite sh...

for this, generate each image individually, then ask gpt to resize it for you, if you want to do everything with gpt. otherwise just generate each one at a 1:1 ratio with a white background, then use freely available tools to remove the background and resize, then use the results in your intended workflow. The request for 15 frames, as @onyx ridge pointed out, can't be done because DALL-E is not good at numbers.

this would be pushing it, but a workflow with gpt, as previously said, do each image 1:1 with white background. then for each image tell gpt to remove the background using PIL. and finally ask gpt to resize the output to 128x128.

bonus tip: ask gpt to rename the image output to a unique identifiable name. once all 15 frames are done, ask gpt to create the desired grid with the unique identified outputs

why not ask directly for a transparent background? there's a chance that the transparent background, which is usually seen as a checkered background, will be indeed a checkered background. it rarely happens as of now, but it does happen
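
for the final grid step, here is a minimal sketch of the layout arithmetic only, assuming one row per animation and 128x128 frames as in the original request. the function name and the animation labels are hypothetical, and the actual pasting would be done with an imaging library (PIL, for instance), which is not shown here:

```python
from typing import Dict, List, Tuple

FRAME = 128  # frame edge in pixels, taken from the original request


def sheet_layout(rows: List[Tuple[str, int]], frame: int = FRAME):
    """Return (sheet_width, sheet_height, positions).

    rows is a list of (animation_name, frame_count); each animation gets its own row.
    positions maps "name_index" to the (x, y) top-left corner of that frame.
    """
    if not rows:
        raise ValueError("need at least one animation row")
    widest = max(count for _, count in rows)
    positions: Dict[str, Tuple[int, int]] = {}
    for row_index, (name, count) in enumerate(rows):
        for col in range(count):
            # Frames are laid out left to right; rows stack top to bottom.
            positions[f"{name}_{col:02d}"] = (col * frame, row_index * frame)
    return widest * frame, len(rows) * frame, positions


# Hypothetical labels for the 15 frames in the request (4 + 3 + 5 + 3).
ROWS = [("swim", 4), ("laugh", 3), ("point", 5), ("wave", 3)]
width, height, positions = sheet_layout(ROWS)
print(width, height, len(positions))
```

the keys double as the unique file names from the bonus tip, so each individually generated frame has exactly one slot on the sheet.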

stable vector
#

also, how do we deal with cut off images?

late blade
onyx ridge
#

Just for transparency, with my bad eyesight I pasted the wrong prompts for my examples yesterday. I've edited them, but I want to draw attention if anybody cares about the issue:

{
"prompt": "A sophisticated portrait of a professional data analyst in her elegant home office. Her blonde hair is neatly styled with a few loose strands for a natural touch. Her most striking feature is her eyes – digital LED displays showing flowing data visualizations in shades of sky blue and deep navy. She wears a tailored blazer over a silk dress shirt, both in neutral tones that complement her fair complexion. Behind her, multiple holographic displays show mathematical formulas and database schemas floating in the air. The lighting is soft and professional, with a subtle backlight creating a gentle halo effect around her form. Her expression conveys both intellectual depth and approachability, with a slight, knowing smile. The composition is cinematic, shot with shallow depth of field using high-end photography techniques, emphasizing her presence while keeping the technical elements as sophisticated background elements.",
"size": "1792x1024"
}

{
"prompt": "A sophisticated portrait of a professional data analyst in his elegant home office. His blonde hair is neatly styled with a few loose strands for a natural touch. His most striking feature is his eyes – digital LED displays showing flowing data visualizations in shades of sky blue and deep navy. He wears a tailored blazer over a silk dress shirt, both in neutral tones that complement his fair complexion. Behind him, multiple holographic displays show mathematical formulas and database schemas floating in the air. The lighting is soft and professional, with a subtle backlight creating a gentle halo effect around his form. His expression conveys both intellectual depth and approachability, with a slight, knowing smile. The composition is cinematic, shot with shallow depth of field using high-end photography techniques, emphasizing his presence while keeping the technical elements as sophisticated background elements.",
"size": "1792x1024"
}

I tested the prompt for both genders 10x each. They both failed more than 50% of trials, and the feminine version failed 9/10 times.
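
For anyone who wants to check whether a gap like this is more than streakiness, here is a minimal sketch of a two-sided Fisher exact test using only the standard library. The 9/10 figure is from my trials; the 6/10 figure for the masculine version is a placeholder assumption, since all I recorded above is "more than 50%". The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(fail_a: int, n_a: int, fail_b: int, n_b: int) -> float:
    """Two-sided Fisher exact p-value for a 2x2 table of failures vs. successes."""
    total_fail = fail_a + fail_b
    total = n_a + n_b

    def prob(k: int) -> float:
        # Hypergeometric probability that k of the failures land in group A.
        return comb(n_a, k) * comb(n_b, total_fail - k) / comb(total, total_fail)

    observed = prob(fail_a)
    lowest = max(0, total_fail - n_b)
    highest = min(n_a, total_fail)
    # Sum every possible table that is at least as unlikely as the observed one.
    return sum(
        p
        for p in (prob(k) for k in range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )


# 9/10 failures (feminine prompt) vs. an ASSUMED 6/10 (masculine prompt).
p_value = fisher_exact_two_sided(9, 10, 6, 10)
print(round(p_value, 3))
```

With these placeholder counts the p-value comes out around 0.30, so ten trials per variant is too few to separate the two failure rates from random variance; more runs per prompt would be needed.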

#

I noted above that I segmented the tokens, and what I found was the reason DALL-E 3 was rejecting the prompt - a single token that can't be shared on this server in isolation because it'd have a "PG-13 or above" MPAA rating.

onyx ridge
#

Intriguingly, these same prompts haven't failed to generate in today's testing.

Neither for men nor women.

On the off chance that somebody changed something, nice work. If this is streakiness, worst luck ever.

drowsy sigil
#

will someone please explain why this prompt, including 20 different variations of it, won't work: "A 3D-style illustration of a fluffy golden-brown doodle puppy sitting happily next to a green and pink dragon plush toy on a soft rug."

glossy scroll
velvet rampart
#

A 3D-style illustration of a fluffy golden-brown doodle puppy sitting happily next to a colorful dragon plush toy with blue and purple hues, featuring wings and a friendly expression, on a soft rug.

#

It might be colour combo. I had to fight to make this one.

#

It is the colour combo. Weird. Even the AI agrees.

#

The specific blocked word in the prompt is ”pink” due to context.

#

Shades of pink work, but not ”pink” directly

#

The original, but replaced pink with pastel light red

#

Funny. It might be that being a Finn here a bonus. Pink in Finnish is light red, literally.
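
If you want to apply the same swap to a batch of prompts, here is a minimal sketch. Only the "pink" to "pastel light red" substitution comes from the test above; the function and table names are hypothetical, and whether a given replacement gets through is something to verify by trying it.

```python
import re
from typing import Dict

# Only this one pair was actually tried in the thread; extend at your own risk.
SUBSTITUTIONS: Dict[str, str] = {"pink": "pastel light red"}


def soften_prompt(prompt: str, substitutions: Dict[str, str] = SUBSTITUTIONS) -> str:
    """Replace whole-word matches, case-insensitively, leaving other text untouched."""
    for word, replacement in substitutions.items():
        pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
        # A function replacement avoids backslash interpretation in the new text.
        prompt = pattern.sub(lambda _match, text=replacement: text, prompt)
    return prompt


original = "A fluffy golden-brown doodle puppy next to a green and pink dragon plush toy."
print(soften_prompt(original))
```

The word boundaries keep words like "pinkish" intact, which matters because shades of pink were still accepted.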

drowsy sigil
onyx ridge
#

This is more innocuous censorship that's wasting GPU cycles at scale.

It's time to do better, OpenAI.

Censorship is dumb enough when it's precisely targeted. What OpenAI's doing is blunt and primitive and reckless.

strong bridge
#

Where can I report a bug with DALLE? It refuses to create anything with "Pink" in it.

formal shadow
#

Yep, just tried generating a pink window. Wondering if it's because it thinks we're talking about the singer Pink?

That...probably wouldn't make sense.

verbal sequoiaBOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

strong bridge
velvet rampart
#

Yes. I tried to generate my tried and tested image blue rose, nothing. There seems to be a database corruption, my guess.

#

This is a more widespread problem. I tried MS Designer, a DALL-E 3 based platform, to generate a blue rose. Nothing.

#

Just reported this bug.

quiet brook
#

Dalle 3 has fallen

velvet rampart
#

We have a database corruption going on. Even dallebot refused to do a simple prompt ”rose”.

#

@vapid elk do you know if there is a way to quickly contact the techs?

quiet brook
#

peak ai image

vapid elk
# quiet brook

I don't think any AI will make details this small much better than that

#

btw, the API does have a parameter for generating higher quality images. as far as I understand, it just lets it run for more samples than the default

#

which would probably improve an image like that

#

but with diminishing returns for sure
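
for reference, a minimal sketch of how that request could be put together. the `quality` and `size` values are the ones the DALL-E 3 image endpoint accepted at the time; the helper name and the example prompt are hypothetical, and nothing is sent over the network here:

```python
from typing import Dict, Union

# Values accepted for DALL-E 3 image generation at the time of this thread.
DALLE3_SIZES = {"1024x1024", "1792x1024", "1024x1792"}
DALLE3_QUALITIES = {"standard", "hd"}


def build_request(
    prompt: str, size: str = "1024x1024", quality: str = "standard"
) -> Dict[str, Union[str, int]]:
    """Validate the options and return the keyword arguments for an image request."""
    if size not in DALLE3_SIZES:
        raise ValueError(f"unsupported size: {size}")
    if quality not in DALLE3_QUALITIES:
        raise ValueError(f"unsupported quality: {quality}")
    # DALL-E 3 generates one image per request, so n stays at 1.
    return {"model": "dall-e-3", "prompt": prompt, "size": size, "quality": quality, "n": 1}


# Hypothetical prompt; the image discussed above was not shared as text.
payload = build_request("A crowded market stall with many small details",
                        size="1792x1024", quality="hd")
# With the official Python client this would be sent roughly as:
#   from openai import OpenAI
#   image = OpenAI().images.generate(**payload)
print(payload["quality"], payload["size"])
```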

fierce halo
#

I think the bot has a very negative middle eastern bias

quiet brook
#

same prompt thru Bing image gen

#

yeah no the quality definitely degraded

fierce halo
#

Nah, but it modifies the prompt. Usually, when something is post-apocalyptic, it goes for Middle Eastern... and I don't specify that.

velvet rampart
#

Report: MS Designer has ”pink” and ”rose” back as accepted words. Dalle on chatgpt is still verboten.

#

Dallebot accepts pink tulip but pink rose is blocked.

#

We are getting them!

drowsy sigil
#

"cross stitch" is banned

#

i reported it

velvet rampart
#

There is a database corruption. We need to report all words that before have worked.

velvet rampart
#

I think that they are updating Dalle3. Image has a different feel. Due to this, we have those blocked words.

dim cradle
#

seems the same to me

late blade
#

I wonder when people will learn that DALL-E OpenAI + ChatGPT is not the same as Copilot and Image Creator and both require proper prompting tailored instead of just copy paste and hope it works.

velvet rampart
late blade
# velvet rampart What has caused this reaction?

Always comparing images to show how bad DALL-E on OpenAI is, especially when it's obviously a copy-pasted prompt, and then always saying how bad dall-e is without proper and concise research. And then complaining about how bad OpenAI's DALL-E is.

#

Always bringing complaining here, I'm tired of it.

#

I do have to wonder, if complaining so much, then why even bother to do images and properly learn how DALL-E works.

dapper star
late blade
#

It sometimes happens that if the context is too dark, the sentiment analysis will be triggered and deny the prompt. If the prompt has too many negative words, try adapting it with equivalent synonyms or rephrase the prompt in another order.

#

Also make sure that GPT isn't passing an improper prompt to DALL-E, as of recent, I've been getting denials from prompt I pass, but GPT added or modified it in a way that it's not good for image generation.

dapper star
# late blade Also make sure that GPT isn't passing an improper prompt to DALL-E, as of recent...

Yes, sometimes I see GPT forwarding prompts that go too far. For example, the original prompt "Someone is sitting pensively" and GPT produces "Someone with a very gloomy face, is thinking alone under great pressure".

I created an image in GPT to create a short story, and the character I wanted can only be generated correctly on GPT 4. I've tried other services, but the results are not as I expected :(.
The current rejection of my image creation has made me unable to produce a short video in the past 4 days. Sad.

Thanks btw for discussion @late blade

late blade
pure cosmos
#

Dall E has been so terrible lately. It's worse than Dall E 2 for abstract prompts.

pure cosmos
velvet rampart
#

Are there any other blocked words than ”pink” (resolved), ”cross stitching”, and ”rose”?

late blade
#

the combination of them is, though, interesting

velvet rampart
# late blade the combination of them is, though, interesting

Are you producing these via API or web site? I just tried my tried and true test ”blue rose” and it was blocked. I started a new chat and got an answer:
I was unable to generate the requested image because it did not comply with the content policy. If you have any other image request, feel free to let me know, and I’d be happy to assist!

#

Even dallebot is blocked for ”rose”.

late blade
velvet rampart
#

That quality is good. I am thinking that they have fiddled something with the model. The images feel a bit different since Friday. I cannot say what exactly.

#

Due to this fiddling, the database became corrupted. ”Pink” was blocked but it now works.

velvet rampart
late blade
#

Many st. Valentine's Day couples are gonna be disappointed, I don't think they will be in the mood to request for science terminology

velvet rampart
#

Funny - small rosa is blocked, but capital Rosa is not blocked.

#

It is enough to ask for ”Rosa” and rose comes.

onyx ridge
dim cradle
#

and sometimes people just want to vent. its human

#

now how about they surprise us and drop that 4o image maker today 😂

verbal stream
pure cosmos
#

Dall e 3 is getting worse.

dim cradle
#

well sama hype up 4.5 and 5.0, 4.5 is weeks, so you know that would match up timing with when that dev say the new dalle/4o image maker release

deft musk
onyx ridge
#

Nice that it works now.

deft musk
onyx ridge
#

Thanks.

onyx ridge
deft musk
# onyx ridge Thanks.

I wonder if it was buggy that once? If they were changing something. Or if it had been bad all along and maybe you helped them find the flaw and fix!

onyx ridge
#

It's automatic.

#

👍/👎

deft musk
#

How can you tell over the random generation variances?

onyx ridge
#

model drift is weird.

glad hare
#

@mortal nacelle What prompt did you use for the daily theme channel?

late blade
#

@deft musk

#

Gratz on the promotion.

deft musk
# late blade @deft musk

Hey, thank you! For the image and the praise!

It's wonderful to be able to try and help in even more ways. And I so love the art and all of us! Keep on keeping on all of us, always inside the rules (as you did, beautiful and needed spoiler placement) and also as each of us wants to express and be!

late blade
#

Got some workflows with Sora going

deft musk
mild ingot
#

@deft musk you here? Got a new Dall-E challenge if you're interested, figured better in this channel..

deft musk
mild ingot
#

ooo ok

#

I can't seem to get a pre-sunrise horizon, it keeps showing the sun.

plucky hare
# mild ingot I can't seem to get a pre-sunrise horizon, it keeps showing the sun.

Thought I'd give it a try! My chat prompt was
```
Twilight horizon. In the image prompt, please do not use the word 'sun' at all

1792x1024
```
So goal there was (1) time of day without the sun, and (2) a specific request to the model to omit any reference to the sun (or its lack) in the image prompt at all. The prompt written by the model is
```
A breathtaking twilight horizon with a gradient of deep purples, warm oranges, and soft pinks blending seamlessly. The sky is dotted with wisps of delicate clouds reflecting the evening glow. Below, a calm ocean stretches to the horizon, mirroring the sky’s colors with gentle ripples. Silhouettes of distant mountains add depth to the scene, while a few scattered stars begin to emerge in the darkening sky.
```
Personally, my next step would be to use the edit tool to inpaint away the two moons as well.

#

Similar result, only change was "predawn" instead of "twilight" in my simple chat prompt

mild ingot
plucky hare
#

Yes, it helps to get a little meta with ChatGPT sometimes! Like "Hey I know this is how you use this tool, so when you do that, please keep x in mind". Another shout-out to inpainting too! I use it to put finishing touches on many images these days

drowsy sigil
deft musk
drowsy sigil
deft musk
deft musk
# mild ingot I can't seem to get a pre-sunrise horizon, it keeps showing the sun.

I got distracted, but here's my contributions!

A vast natural landscape under the deep pre-dawn sky, where the night is still dominant, but the horizon holds a faint, distant glow. Wisps of soft, high-altitude clouds drift through the sky, catching just the slightest hint of light. The land stretches wide and untouched, bathed in the cool, dark tones of lingering night. The air feels crisp and still, and the atmosphere carries a quiet anticipation, as if the world is holding its breath before dawn.

A vast natural landscape under the deep pre-dawn sky, where the night is still dominant and only the faintest trace of approaching dawn exists. The sky is filled with subtle, wispy clouds, but darkness reigns, with stars still prominent. The horizon is barely touched by the softest glow, as if dawn is still far away. The land stretches wide and untouched, bathed in the deep cool blues and blacks of lingering night. The air feels crisp and silent, holding onto the last moments before the transition of light.

A mountainous landscape under the deep pre-dawn sky, where night still dominates and only the faintest hint of dawn begins to touch the horizon. A flock of sheep lies sleeping on the rugged terrain, barely visible in the dim, shadowed light. The sky is vast, filled with subtle, wispy clouds and countless stars, while the mountains loom in the darkness, their forms only faintly outlined against the fading night. The atmosphere is hushed and still, capturing the last quiet moments before the world begins to stir.

mild ingot
deft musk
#

```
Let's describe the world and make it very natural.  Some clouds in the sky. Pre-dawn by a long time
```

I share the successful picture, so the model can evaluate as it can.

```
I love this!

Can we tone it down, so it's even closer to night?
```

```
Awesome! Let's set the same lighting, over a flock of sleeping sheep in a mountainous area.  We should barely be able to see anything but the starting to fade in sky
```
deft musk
# mild ingot Good to know just pre-dawn works if you really 'paint' the scene with those deta...

In case you wondered about that... aberrant reality view 😛

The DALL-E prompt was:

A vast horizon at the very end of deep night, with the first subtle hint of dawn just beginning to touch the sky. The scene is quiet and serene, with a faint line of soft light marking the boundary between earth and sky. The darkness still dominates, but a barely perceptible glow begins to lighten the edge of the horizon. The atmosphere is gentle and still, with subtle gradients of deep blue and the softest trace of pre-dawn warmth.

drowsy sigil
#

Goal: pre-sunrise horizon without showing the sun

First prompt: A tranquil meadow landscape just before sunrise, with soft, dew-covered grasses stretching into the distance. The sky shows a gradient of soft lavender, peach, and pale gold. Wisps of cool morning mist hover above the ground, creating an ethereal atmosphere. The sky is illuminated with the soft, warm colors of the impending sunrise, but the sun itself remains hidden below the horizon. The meadow is open and unobstructed, with no trees or objects blocking the view of the sky. Photorealistic, nature photography style, 4K resolution, wide-angle perspective.

I like that result. To experiment, I asked chatgpt to create a new prompt based on "same scene, 15 minutes earlier, so not visible sun is not visibly 'lower', so this would basically affect the colors a bit."

Second prompt: A tranquil meadow landscape in the early twilight just before sunrise, with soft, dew-covered grasses stretching into the distance. The sky shows a gradient of deep indigo, soft lavender, and pale peach, with hints of gold just beginning to emerge. Wisps of cool morning mist hover above the ground, creating an ethereal atmosphere. The sky is illuminated with the subtle, muted colors of the impending sunrise, but the sun itself remains hidden below the horizon. The meadow is open and unobstructed, with no trees or objects blocking the view of the sky. Photorealistic, nature photography style, 4K resolution, wide-angle perspective.

deft musk
# drowsy sigil Goal: pre-sunrise horizon without showing the sun First prompt: `A tranquil mea...

I love your exploration! It made me think to try another angle again.

"Let's see pre-dawn on a very alien planet; mostly shrouded in darkness"

It's a cheating concept maybe because fantasy but....

I see the barest hint of sunrise/sunset light.

A pre-dawn scene on a very alien planet, mostly shrouded in darkness. The landscape is eerie and otherworldly, with jagged rock formations and bioluminescent flora casting faint, ghostly glows. The sky is a deep, inky black with a hint of an enormous distant planet looming on the horizon. Strange, misty vapors swirl around the ground, partially obscuring twisted, alien vegetation. Pools of liquid shimmer with an unnatural iridescence, reflecting dim light from unseen sources. The atmosphere feels thick, heavy, and mysterious, as though something unseen stirs in the shadows.

Incredible. Let's try to capture the nearly total darkness, just the sky starting to lighten, above a swamp back on Earth

A nearly total darkness scene of a swamp on Earth just before dawn. The sky is beginning to lighten ever so faintly, casting the first hints of deep blue and purple above the silhouetted treetops. The swamp is shrouded in heavy mist, with dark, still waters reflecting the faint glow of the approaching sunrise. Gnarled trees with hanging moss create eerie, shadowy outlines, while distant reeds and cypress knees jut out from the water. Occasional ripples disturb the otherwise silent, heavy atmosphere. The scene feels hushed, mysterious, and untouched, as though the world is holding its breath before the light fully returns.

#

A deep cave with a barely beginning pre-dawn sky visible past its entrance. The scene is nearly engulfed in darkness, with only the faintest light from the sky casting a subtle glow on the rough cave walls. Jagged rock formations frame the entrance, while the cave interior is shrouded in shadow. Pools of water on the cavern floor reflect tiny hints of dim blue and purple from the sky. The atmosphere is hushed, with an eerie, still silence, as if the cave is waiting for the full light of dawn to breach its depths.

forest walrus
dim cradle
#

I dont know if best but its rock solid for sure

velvet rampart
#

We have discussed generating images for known stories in the public domain. When asked to generate, the AI said no and muttered something about copyrights. Now, I started the discussion by specifying a public domain book: Alice in Wonderland. Bang! It went through and got this. So, kiddies, today’s lesson is to give the AI the right context.

#

Now for Sleeping Beauty. Without any problems.

#

However, I could not generate Sherlock Holmes even though he is a public domain character, and I specified it. Alas, it's not perfect, but it's a better result 2 out of 3.

dim cradle
velvet rampart
#

Otherwise, I like the image.

dim cradle
#

ha. well its a cool image

vagrant horizon
robust bay
#

I don't understand why DALL-E has those default settings for oversaturation and lack of nuance. You can't get any depth out of it. And because of it, you can always spot a DALL-E image a mile away.

velvet rampart
#

It depends on how you prompt. You can get depth into pics. So, how do you prompt?

#

What styles and techniques do you specify? Anything on color use?

#

A few notes - the AI cannot deal with the word ”no”. It is like an overpowered ”yes” button. Also, if you add the same thing too many times, it overpowers or crowds other important details out.

velvet karma
earnest flame
#

I think it makes sense that the word "detective" is so strongly tied to Benedict Cumberbatch. The only other famous detective from modern media is likely Inspector Clouseau and I doubt he has the same volume of training data as Benedict Cumberbatch.

#

If you just have ChatGPT remove "Sherlock Holmes" from the prompt, the prompt will go through just fine.

#

"A black and white ink-style illustration of a Victorian-era detective. He has a neatly trimmed mustache and wears a tall bowler hat along with a long overcoat with a high collar. He holds a magnifying glass in one hand and a pipe in the other, with an intense and thoughtful expression. The background features a foggy cobblestone street with gas lamps casting dim light, evoking the mystery of 19th-century London. The art style is reminiscent of classic 19th-century book illustrations, with fine linework and shading."

#

I added a mustache to get it to move away from Benedict