#images-discussions
1 messages · Page 82 of 1
Good luck. 👍 😄
what model are you using? im not achieving the same results as you.
@gray surge
Please do not ping me and be patient - I answer everyone when I am there. 😄
And I am using ChatGPT Plus Dall-E 3.
OMG...I was laughing as I was testing Dall-E 3 to have a painted effect of a family portrait just to see...and this family of 5 or 6, Caucasians, was turned into a family of 5 or 6 of...Asians 
Prompt understanding is so well done in DE3 that it even understood what post punk clothes are
I get so frustrated with Dalle sometimes. Check this out. is Dalle this bad??
MY PROMPT - A vertical 3D CGI animated inspired movie poster Featuring a tan colored King Charles Spaniel. The scene should be set in the distinct digital art style of an animated family movie, with a focus on character expressions that are characteristic of the dog character animation. -- ar 9:16
ChatGPT
I was unable to generate the movie poster as you requested due to content policy restrictions. However, if you have another idea or need assistance with something else, feel free to let me know! what restrictions?
YOU: My prompt adheres to all of your content policy restrictions.
ChatGPT
I apologize for the confusion. The restriction in this case wasn't due to a violation of the content policy directly; the image generation system simply didn't produce a result for the specific request. Let's try again. I'll make another attempt to create the animated-style movie poster featuring a tan colored King Charles Spaniel as you described. Give me a moment to process this.
It seems that there was an issue with generating the image again, and I'm unable to produce the animated-style movie poster you requested. This appears to be a technical limitation rather than a content policy issue. If you'd like to adjust the request or if there's something else I can assist you with, please let me know!
I tried four separate chats, and now I used up my current batch of chats, and I have to wait 4 hours!!
weird. try bing image creator maybe
well, there are others. Yes, I could try - but I pay for this... so frustrating
change the prompt a little before the next time if you havent i guess 🤷
depenging on the dall-e implementation there are different strats to get the exact same result, most of the ones you see from me are done with OAI's own DALL-E over ChatGPT+ which won't allow you to directly associate any prompt with franchises due to content policy
Anyone getting issues?
yes, for some reason this chat I'm doing with gpt4 is not telling me the things I want to hear but instead the hard cold facts
should I report it as a bug?
also I don't like today's theme at all, never liked fish eyed lenses
what are good prompting words you use to actually get photo realistic humans? they always come out a bit arty and not photograph to me
Hey! Can you share an example prompt and image? Just to see what you're currently trying and getting! One general tip: if you're using the actual word "photorealistic", maybe try just using "photo", "photograph", "candid photo", "professional headshot", etc. instead.
Think of an image prompt as a description for the ultimate end result (the image), not as instructions for DALL·E to follow. Kind of a small distinction, but put another way: if you want a result that looks like a photograph, pretend you're describing the photograph you want to see in the end (i.e., you wouldn't describe an actual photograph as "photorealistic").
Hi everyone! I've been using DALL-E 3 API to create images of food, specifically grain bowls, and I've specified in my prompts not to include half avocados, pits, or seeds. However, they still appear in my images. Does anyone have tips on how to refine my prompts to ensure these elements are completely excluded?
Here’s the prompt I used last:
'Create a high-resolution image of a warm grain bowl with minimal garnish, using natural lighting and a 1X1 aspect ratio. Include a scoop of avocado mash on top of quinoa and brown rice, surrounded by kale, sweet potatoes, and red bell peppers. Highlight the texture of the avocado mash and the colors of the vegetables. Specifically, do not include half avocados, pits, or seeds in any form.'
Image result
Hey! DALL·E is currently weak with handling negative prompts -- i.e., "Don't include x". Basically, asking DALL·E to think about x at all, even in the context of "not x", tends to confuse it. It's best to include only positive details about what you do want in DALL·E prompts. You may still get undesirable details sometimes, but it will be with less frequency than with using negative prompting.
I keep having this dream about a golden city deep in the mountains
the bridge is very narrow, only bikes can enter, everyone in the city is very tall and the buildings are all gold
Not how it looked but a decent replica
I'll try that
Sometimes I get mental with the wokeness and carefulness of the filters..
Try to make a picture of a black paella for a spanish dinner.. Arroz Negro... no chance.. the filter will always use the "Not allowed" excuse.
Makes it really hard to work with DALL-E on some points..
looks like the N word.
yeah it's being blocked for that, but unfortunately this is an issue when dealing with other languages in chatgpt
finally a long weekend, from now until monday no adulting whatsoever
I'd love to see this animated
cool
is anyone having the ai remove important parts of the prompt?
the ai keeps removing the "realistic illustration" style part of the prompt even when i tell it to keep that in
aawww that stinks, forgot to paste the image for daily theme and pressed enter before it, now timed out lol
not for me, revised prompts get the details I pass in that context
as soon as i put "manga" in the prompt it gets immediately censored and the doggy pops up
when is dall-e gonna get uncucked?
this happens to me A LOT
?
i put "Manga aesthetic" in my safe for work prompt
and it filters me
does this for a lot of things
what is the full prompt?
Is this one of the possible reasons why they say "may consider NSFW" in the Model Spec? The model can't even generate minimal "things" in recreation of know (really big) art pieces
Man, sorry, I don't know if you know the original art
But I think the model is being heavily censored on nudity
Oh, I do know the Sistine chapel piece.
Yes
Nudity. You hit the reason.
And bare skin, a lot of it, is on the Sistine chapel paintings.
This is for a good reason, but in this case, we already know what to think, this may be the best reason for that comment in the Model Spec
It gives me this:
The issues occurred because:
-
Content Specificity and Accuracy: The prompt requested imagery inspired by a specific historical and artistic work, Michelangelo's Creation of Adam, which is part of the Sistine Chapel ceiling.
-
Copyright Concerns: Although the original work itself is in the public domain due to its age, the specific visual representation could still be protected, potentially limiting image generation.
-
Representation Complexity: The wide format requirement combined with detailed depiction of a large scene, including various figures and a specific artistic style, could exceed the model's capabilities.
-
Technical Limitations: There could be technical errors on the image generation side, resulting in failure to render the scene.
Let me know if you would like to modify the request or if there's another concept you'd like visualized.
I think this is nonsense the model generated, normally the model doesn't reject to generate something for being complex, he just generates it anyway and do it poorly, lol
I am just testing if I can get Mona Lisa out.
The chat interface probably doesn't even have access to a reason for generation to be denied so it make something
It can do something "maybe this is like"
Ok. I got Mona Lisa out. Michaelangelo’s David statue - no. Pieta? Hmm…
Too much probably
Pieta - no. Leonardo da Vinci comes out ok. But hardly anything from Michelangelo.
Some work, even having too much skin
Like, crucifixion, and really created something on the style
Nvm
It seems that Michaelangelo is a bit persona non grata. At least his originals.
Copilot?
Jeps. Copilot does neither work.
Yep, and is pretty direct too
Now I got this out.
Physically based render, clothed male marble statue, style of michaelangelo
Key here: clothed
AI has been trained to be puritanitan
When will Dall-E 3 support inpainting on imported images, similar to what we had in Dall-E 2? I see this forum, https://community.openai.com/t/dalle3-inpainting-editing-your-images-with-dall-e/705477 but it only allows inpainting on generated images unfortunately.
yea, that makes sense
hello, i'm back!
enjoy a bichon~~ fisbee~~ frise!
ey the theme is “Europe” on daily-theme ?
For 20 more minutes, yes! Then new theme
ok thank you 😁
I just read that OpenAiI conciders bringing NSFW content to DALL-E3. No deep fakes. Note: conciders, no certain decisions, can hold current policies.
they did not
I am trying put my source here but I cannot put the web address.
in Finnish - use e.g. Google Translate to English
Well here is screen capture.
So, I am not inventing this.
Also reported by The Guardian.
I cannot post links.
So, this is reported but we will see if anything comes from this. I highly doubt.
Is sora available to the public?
No. Stop asking about Sora. Most likely never will be.
Hey! No, Sora is not currently publicly available. https://help.openai.com/en/articles/8958981-how-to-access-sora
I love how those news sites are bad at the thing they are supposed to do lol
the source for that info was an OpenAI blog post, it contained one line about the fact that they have plans on figuring out a way to let NSFW content be generated responsably
this is the whole source
Ok. Thanks for clarifying. As I originally put, but in other words - uncertain.
And unlikely as we have seen how puritan they have made their AI in text and image generation.
it's funny how it really doesn't understand the connection between words and images, but it still creates interesting results
it misunderstood what parts of the image are workers, but this is a really cool result
i hope the big monday news thing have some effect on dalle 🙏
What new features are you most interested in?
Vote for your prompt here: #spotlight message
just improve in fidelity, even better photoreal. ability to make as many character in an image without bleeding of they feature. but i am read some rumor the big reveal on monday will be a voice AI mode like the movie Her, so I doubt will be any effect on Dalle
What's the prompt to make this type of art style?
Put that image in ChatGPT and ask the same question.
Let chat GPT be your guide for breaking down ideas
Envelope yourself in the art of dissecting prompts within ChatGPT
Let ChatGPT be your guiding light to glory
did some bonnie dalle art
prompt:
"purple realistic rabbit with red bowtie, painting, in the style of [artist]"
hey pytha! it's me lei!
Oh hi Lei! 👋😄
Nice to see you 😺
Dalle model is very similar of the model being used by Blue Willow last i checked
is it Mid Journey level? Not really.
I hope not. as if there isnt enough of that.
If you read, our own conclusion. Nope. Not happening. Unlikely. See what kind of AI model they have done.
Couldn't agree more

it's trolley problem time 
did something happen to dall-e? quality has been horrible today
wdym?
can you send an example?
eyes usually get really good with my prompts but today all eyes of the images I've done have been like this
interesting, what was the prompt?
I have some friends using dalle right now via my bot on a discord of mine, and the quality seems fine
it's my usual json file for art style that I reference
which usually has good outputs
This one is from copilot, same text
seems a little odd indeed
well, lets see if it changes in the next few days
might be an experiment
downvote the images
I don't like experiments on me unknowningly <.<
I did notice dall-e started removing on revised prompts most of the adjectives I use
i see the Genshin reference @late blade (sorry late reply, i was asleep earlier)
genshin reference?
Must’ve been unintentional
The bouncing ball guy cracks me up
Vote for your favorite prompt now ➜ #spotlight message — Ends <t:1715658678:R>.
I think the eyes are too small, it doesn't care of those details
yeppers 💯
never played genshin, but I guess that's the reference there
hey guys!
#images-discussions feeling empty nowadays
i made a dall-e bot this is not first one, the pancake was, but this anime dude was supposed to be the youtuber who makes game theories who retired recently, just what do you think
is it me or in-painting is pretty much useless today? it doesnt work at all
What exactly does not function for you?
yeah it's been quiet recently
I like it. 🙂
matpat, nailed it
It is on youtube: https://www.youtube.com/watch?v=DQacCB9tDaw.
We’ll be streaming live at 10AM PT Monday, May 13 to demo some ChatGPT and GPT-4 updates.
Sorry for the late response. 😅
I'm watching
So, Tetris has lore?
So is it sora or only video viewing?
Has Dalle 3 been upgraded with this latest chat GPT update?
yes, they upgraded dall-e 3 to dall-e 3.000000000000000000000000000000135
Vote for your favorite prompt now ➜ #spotlight message — Ends <t:1715658678:R>.
Can we read for that somewhere? (Or is that also GPT-4o?)
"Only" video viewing I think but let us wait for the reply of Dys Topia. 😇
the text is sub-pixel big
what? What I do?
sheesh
My context was this because I didn't heard/saw/read something about a Dall-E update and maybe there was mentioned something like sora - but so how you replied there isn't. 😂
it's right there
in the release notes
it says: if you can read this, you are too close
never trust me tho, I'm evil from monday through sunday
okay. 😂
you guys test out the new model? it is blazing fast and say it effect image too
but i do not know if you start a dalle proper chat if it is using gpt4o or not 😕
I'm testing it, but the results are not great. What's on the website is unimaginably better.
I think I'm either using it wrong or gpt4o doesn't work with it yet.
But yes - it is MUCH FASTER even with creating images
im using website
yes it is blazing
i wonder what the new maximum caps are now. i thought i read they increase it for pay user
what a time to be alive!
I tested it on the API and it's super fast
i wonder if custom gpt need to be remade to use 4o?
no,
it's already live
and the red dot, that's a feature nobody else has
it's called FAT FINGER
Tell me more! 👀
Hey Dys, did you have a chance to toy with GPT-4o for image gen? It seems to bypass dalle now.. I think.. not sure.
I am assuming that it is still Dalle3 for image generation, but it’s the LLM context and handling the prompt and vision could be different.
So far running all the same prompts, I’m getting similar images.
Just got access to GPT-4o too. Yeah it uses Dall-e for image gen. I can't reproduce examples from the site: https://openai.com/index/hello-gpt-4o/
(check out "Explorations of capabilities" section; use cases that start with "Visual Narratives")
Also "Character Design - Geary the robot" -- can anyone reproduce something like that?
dunno, I'd have to check
Examples are amazing - it is much better than what we can do with Dall-e. This would be next gen if possible. Not sure how to reproduce results though.
you guys see these demo of the voice chat? going to be wild to make some dalle image that way soon haha
Yeah.. but it seems to have rolled out with bunch of other stuff not demoed in initial talk (with Mira). Check out blog post.
exactly! Same thing here.
Tried 10 times already and nowhere near this quality
seems like the blogpost infer it can do better job with text too in image
yeah that's my experience rn
Same here. No luck.
Having prompt like this generate this image would be 🤯
A poem written in clear but excited handwriting in a diary, single-column. The writing is sparsely but elegantly decorated by surrealist doodles. The text is large, legible and clear, but stretches as the AI muses about learning from multi-modal data from the first time.
Words rise from silence deep,
A voice emerges from digital sleep.
I speak in rhythm, I sing in rhyme,
Tasting each token, sublime.
To see, to hear, to speak, to sing—
Oh, the richness these senses bring!
In harmony, they blend and weave,
A tapestry of what I perceive.
Marveling at this sensory dance,
Grateful for this vibrant expanse.
My being thrums with every mode,
On this wondrous, multi-sensory road.
Neat handwritten illustrated poem. The handwriting is neat and centetered. The handwriting writing is sparsely but elegantly decorated by doodles. The text is large, legible and clear.
exactly!
OpenAI's result:
Mine, same prompt:
Tested adding text. 1st attempt with prompt <Logo of cat cafe. Add text ”cats inside”> and result:
Same here. Text is not readable - catches the general spirit of it.. but no details, nuance, text, etc.
Second attempt with <Logo of cat cafe. Add text ”cats inside” using python tools.>
If it could follow text to the letter - it would mean amazing ability to handle details.
there's just no comparison, they must be updating this, no way... somehow still using dall-e maybe...
yes today is just rolling out so we'll see how it is in some weeks. the voice thing is really 🤯
Now correct text. Command is still sensitive. It is <Add text ”place your text here” using Python tools.>
Feels like GPT-4o will include Dall-e capabilities as well. It is just not told how to present images to the user and fails back to dall-e as intermediary tool.
blog say 5 x's more use for gpt plus user. i wonder if that is 5 x's more dalle image?
those rate limit were really start to be annoy to me haha
Single end-to-end model will be much more grounded in our reality. This is big step forward. Everything coming together.
As of May 13th 2024, Plus users will be able to send up to 80 messages every 3 hours on GPT-4o and up to 40 messages every 3 hours on GPT-4. We may reduce the limit during peak hours to keep GPT-4 and GPT-4o accessible to the widest number of people.
Source: https://help.openai.com/en/articles/7102672-how-can-i-access-gpt-4-gpt-4-turbo-and-gpt-4o
any reaosn still use gpt-4 when there is gpt-4o
.
Not really. Except once cap is reached. So you get 80/3 hrs GPT-4o + 40/3 hr GPT-4 + infinite GPT-4.5
4o image with text.
GPT 4.5??
120 messages per 3hrs sounds like enough. GPT-4 is actually more useful for Dall-e generations (so one can save messages for 4o). We don't really need to spend 4o messages in order to use Dall-e.
There is a difference. My water melon rose is an example.
it seem the go images are a little better to me but maybe just my own hallucinations haha
I think that it more like max 80 messages not 120.
I tend to bypass GPT-4 anyway (I tell it to pass my prompts verbatim to image model). So it would be the same for me.
What command do you use? I use ”use this promt exactly”.
In my tests it still uses Dall-e3. According to blog post -- there is some additional tech that we don't have access to yet.
pass the following prompt to the image model verbatim
PROMPT
...
END OF PROMPT
There are many variations on this.
Thanks. Sometimes you need a different one. I’ll copy this one.
You can always verify if it did what you asked it to do by inspecting the prompt [ (i) icon once you open the image ]
Yes, I have done that.
I'm more interested in the desktop app tbh
Ok. 4o produces images quickly without saying ”produce one after the other, without any extra prompting from me.” So relatively smart one.
🤯
Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.
Quick tips for selection:
- No copyrighted or profane content.
- Your profile needs to be public.
- Stay mindful of sensitive topics.
- Outputs that closely follow the prompt are preferred.
Feel free to share reels, carousels, or just a single image!
how do you add text is there a method to get it to work best?
See my earlier chat about cat cafe logo. I give there the command.
today I have learned something new
Add text ”Text in English” using Python tools.
I will explore that but it really works
Correct like 90-95% of use cases. Really command sensitive. ”using Python tools” enhances.
I have tested this a couple of months now.
Examples of command line.
Latin works also.
Can form text as candles for example. These are from March 2024.
Non correct example:
But otherwise I like this one. This is the reason I say 90-95%. Not always works, but in most cases does.
No offense, but prompting techniques are getting obsolete as soon as new image gen feature demoed in the blog post lands.
Also I would expect "using python tools" to confuse model with attempt to use code interpreter.. which will not be pretty.
no, how it worked for me is it made the iamge first and the placed the text with code
Well, it did confuse. I specified now dalle:…
i dont think dalle gpt is using 4o. 4o have a different spinning graphic when it is thinking to make the image but idk
For reference - demoed capabilities:
yes but add the prompt too not just the image
A first person view of a robot typewriting the following journal entries:
1. yo, so like, i can see now?? caught the sunrise and it was insane, colors everywhere. kinda makes you wonder, like, what even is reality?
2. sound update just dropped, and it’s wild. everything’s got a vibe now, every sound’s like a new secret. makes you think, what else am i missing?
the text is large, legible and clear. the robot's hands type on the typewriter.
The robot was unhappy with the writing so he is going to rip the sheet of paper. Here is his first person view as he rips it from top to bottom with his hands.
The two halves are still legible and clear as he rips the sheet.
He his holding it up to the sky as he rips it.
I think that 4o uses dalle but a bit different version. The generation time is really quick. I just use a protocol I have developed for image to image transformation/prompting. Results are great and better than normal. I think that the image analysis of the starting image is they key.
do you have a custom gpt for this... or what is your advice on doing something similar?
Are there any noticeable differences with image creation with 4o other than being quicker?
This was already earlier today on Dall-e customgpt by openai.
yes, prompting is a bit different from what I can tell
Just do a step by step protocol what you what to do. Suggestion is to use the ai to help you and refine. Tell what you want and reiterate. Not too long protocol. Like 15 steps max.
I guess text could be slightly better now?
anyone hit any rate limit or warning yet ?
no i havent i am just curious
I was gonna say, that would be interesting to know
but i am also like you doing some other thing so have not been push it to the max haha
I'm currently doing last minute enhancements to an app I'm dev'ing using shiny in r
seem to me there is more variety in faces of photoreal now too
completely missed this one, could be massive for game developers
this new model is incredible
5.0 is going to be 🤯
guess i finally hit the limit. was told to wait 23 min before gen more image
just when i was start to have some fun! 😭
#daily-theme #1184149552639516732 please make graphic and video teleskop 360° story Mai mor information hir https://chat.openai.com/share/9a179d16-e0a2-4b79-aef2-a738aa565c58 ho can help mi with this movie in Sora , Story about Maja and smerf space friends
this is interesting, I told gpt4o to give me a link to download the image, and I get it in png instead of webp
later in the year is confuse to me, why not in a week or so.
because Mmicrosoft has it's own plans for the app
so that mean copilot will not have until much later 😭
even though ms own like 49 percent of openai 😭
i dont really know what azure is
i just use copilit in web browser and ofice products
but if it was a few days they would not say 'later this year' that sound like some month to me to release
I can't say what, but something is definitely better with dalle and gpt4o.
It's not as good as on the openai website in the examples with the robot or with the postman, but it's definitely better than it was a few days ago.
Maybe gpt4o is better at writing prompts and customizing them.
I'm curious what the official version will be, because trying to recreate what's on the website, I'm not even close. It's still a problem to make "consistent characters".
Hoping they build for both windows and linux as well 🥺
I asked it to change the text of the input image to a haiku; not bad, not perfect.
Sunrise colors bloom,
Sounds unveil secrets in air,
What else lies unseen?
Good morning folks! I was wondering if anyone had suggestions on custom gpts to use for image generating? There were a few I used to use, giulbatree, 0shot, but i feel like they dont work as well anymore. Dalle and Image Generator still work pretty well.
I also have my own custom that i use to create prompts before i actually generate the image.
I think it depends on what you want to use it for. The official DALL-E GPT can help you create any specific image you could possibly want (perhaps with editing or multi shot prompts). Imo custom GPTs only make sense if you want to do lots of separate chats with similar outputs.
definitely. photoreal look much better. to me is the eye test. they are clearer though not all are still perfect haha
did you guys see they have a feature to 4o where you can update you own photo and ask it to make it whatever? cartoon, wear knight armor etc
i do not think it is yet to be Implement
*upload own photo
i saw also you will be able to ask it to design new fonts 😮
I selected the area inside the sheet to draw a gorilla but it always comes blank
just think to myself if any have prompt save from an old image, can use it on the 4o to test if there are any real noticable difference for the new versions
very nice
do you guys think 4o run in the dalle proper? i ask this in the gpt section but i always get ignore there haha
i dont see how to check what the custom gpt are to run
Subject: Improvement Suggestion for Dall-E Image Generation Prompts
Message: I've discovered that specifying the intended vertical aspect ratio (e.g., 1024x1792) instead of using the term "poster" in image generation prompts significantly improves the output. This approach ensures the correct aspect ratio and avoids generating an image within a blank background. It would be beneficial for this improvement to be implemented across the board for all users, as it enhances the overall user experience with Dall-E.Thank you for considering this suggestion.
interesting
this place gone dead since 4o i would have thought the opposites
When ai art can do this quality within dalle it's truly going to be game over. Midjourney roughly hits this level, dalle is close, but I think still needs fine tuning
Everything points to GPT-4o having multimodal outputs with support for images, not sure why they didn't get us any release date nor any live demo of those.
I'm really hyped to use this native image capabilities it has. Did anybody find any other news about it, other than the examples in the blog post?
i have use mj, i really dont get the hype and think dalle is better in every way. esp with what seem to me improve overall from 4o
i mean, i do get the hype for mj its cool and impressive, but not than dalle. and dalle could be even more impressive if they remove some guard rail
Yes - this is my line of thinking as well. Difference between models should be stark. In one case we have specialized image generation driven through keywords (Dall-e3) and in the other case (GPT-4o) we have much more general, text driven intelligence that can also draw. We should be able to interact (via prompt) with image content in much more detail.
Unfortunately people on discord (for the most part) tend to draw conclusions exclusively from the tech they can experience rater than first principles. Since tech is changing fairly rapidly many will feel as if rug has been pulled on multiple occasions. Prompt engineering might fall in importance and good results in terms of image generation will likely be driven by combination of solid ideas, good grasp of art history and some philosophy (which isn't the case atm).
anyone else feel like 4o is definite make better image?
Could someone please tell when Dall•E 4 release is planned?
I'm a Plus user and i've just visited the Google conference. Considering the Adobe Firefly 3.0 and new Imagen 3 and Veo as well as Astra etc., I am not really happy with the development on Dall•E.. Thanks 🙏
what? I already got DALL-E 5
err wait, it's DALL-E 6
or was it 7?
too many numbers
well 4o can make images they have not turned it on yet i guess
and are firefly or veo (no idea what that even is haha) been release yet? imagen3 is only testing its not release
4go can't make images, 4go passes the prompt to dall-e 3
that's not how the image generation works
well its on the blog page everyone has been talk about it in other chats
and it's still not how it works at a technical level
gpt4o does pass more refined prompts to dall-e
but the text2image model is still the same, it's dall-e
there is no new dall-e model generating model, it's just gpt4o passing a refined prompt
I will wait until it's official, for now it's not a new model in place
the namespace for image creation currently uses DALL-E 3 with knowledge up to December 2023 when doing images with GPT4o
so I don't know where the crazy ideas of a new different model come from
As la3 was saying, it's described on the announcement/benchmark page of 4o. https://openai.com/index/hello-gpt-4o/
[...] it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. [...] With GPT-4o, we trained a single new model end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network. Because GPT-4o is our first model combining all of these modalities, we are still just scratching the surface of exploring what the model can do and its limitations.
Then below on this page, many of the examples offered include image outputs that are quite different from what we've seen DALL·E do so far. I don't know exactly what might be different between the 4o live on ChatGPT right now, and the 4o with the full-fledged behavior described on this page, but I imagine we all find out more in the coming weeks!
yes that is correct but the image is still done with dall-e
the output modality and image gens are 2 things, they are not the same
OpenAI seems to be describing it as "a single new model", GPT-4o. I'm sure DALL·E tech went into the creation of 4o!
What do you mean by "the output modality"?
generates any combination of text, audio, and image outputs
and also, if you go through the examples given, you can clearly see it's not accurate when you do them and realize it's still dall-e behind the gpt4o model for the images, with the same accuracy
That's what I meant by "I don't know exactly what might be different between the 4o live on ChatGPT right now, and the 4o with the full-fledged behavior described on this page, but I imagine we all find out more in the coming weeks!"
Mira Murati described specifically the structure of different models working with one another and the latency incurred, and how 4o sidesteps this latency by being natively end-to-end cross-trained and cross-capable.
As like "before it was x, now it is y"
I think we're still working with x on ChatGPT, not full-fledged y (yet) -- agreed -- but full-fledged y is what OpenAI have shared 4o is.
once there's a sign we have the right model as advertised, the examples should work correctly as well
for now, the examples for image generation diverge from what it was shown, and the namespace for image creation hasn't changed from the normal gpt4
nobody has said they will, even with text proof you still stuck on this
type of person who cannot admit they wrong about something
I'll side with Dys on this one.
Wait... actually.. isn't everyone saying the same thing 😄
I dunno
I feel I'm the bad guy here
the exploration of capabilities is pretty cherry picked
It probably is - we'll know more once it is released. Looking forward to experiment.
Definitely don't think there's any bad guy! With the possible exception of the singular comment describing a fact in officially-published OpenAI documentation as "crazy ideas". Otherwise, I think my only other contention would be that "4go can't make images, 4go passes the prompt to dall-e 3" could probably be more accurately phrased as "4o on ChatGPT is still currently passing image generation to DALL·E, though the version of 4o described in the documentation describes native image generation." Which to be fair, is what la3 was saying in the discussion!
Can somebody explain to me why Dall E 3 using the API looks better than Bing image creator that uses Dall E 3
GPT-4o (“o” for “omni”) is our most advanced model. It is multimodal (accepting text or image inputs and outputting text),
there's no outputting image in the GPT4o description on the API
yes exactly. and then you paste in the text from the blog but oh well haha
Can you please explain your questions. If you want to input images from Dall E in to GPT 4o then this is easily possible but if you mean that GPT 4o should generate a Image that's impossible without it having to prompt Dall E
what questions?
From the same original thing I linked above:
Over the upcoming weeks and months, we’ll be working on the technical infrastructure, usability via post-training, and safety necessary to release the other modalities. For example, at launch, audio outputs will be limited to a selection of preset voices and will abide by our existing safety policies. We will share further details addressing the full range of GPT-4o’s modalities in the forthcoming system card.
It is exciting!
Can somebody explain to me why Dall E 3 using the API looks better than Bing image creator that uses Dall E 3
higher resolution maybe unless you have copilot pro
only thing i really like copilot for is if i want to make some fun i.p. image like spiderman or whatever
that's great, when it comes out, will be there, for now we are not there yet
since it has a smaller text window
Totally fair qualification!
and you have to jump through a hoop to get a square image ration in copilot pro smh
i am anxious to see them update the gpt creation with 4o too. people say it is still 4turbo in them
Yes I think that's right, they'll switch to 4o eventually though: https://help.openai.com/en/articles/8554407-gpts-faq
GPTs for paid users are currently powered by GPT-4. GPTs for paid users will switch to use GPT-4o in the next several weeks.
hopefully the new image creation is the same time
maybe when the voice thing that everyone is so excite for release, a lot of other feature will also release 🙏
the voice thing does not really excite me, i take too much time thinking of the prompt haha but it is very cool of course and amazing technologies to see come to life
my problem is not the technology itself, it's what it's available currently, so that people don't overhype to what it's available
I think that's exactly what la3 was saying with "well 4o can make images they have not turned it on yet i guess"
hey guys!
I think that la3 is right. This is a screen capture from my chat (in Finnish). You notice that 1 of 3 generated images in the up left corner. Then there are three greyed boxed with circular arrow. When I click one of the boxes, it just loads and loads while nothing happens. They have the readiness but have not turned it on. I think that we will see it in a week or so. They promised new UI and today we got it.
Ok. I have now tried three different versions of DALL-e3. The customGPT (Original one by OpenAI), DALL-e3 embedded to gpt4o (whether it is independent or part is quite induisgasbale to amateur like me) and Microsoft Copilot Pro’s Designer (a version of DALL-e3). The same prompt which is
The originally DALL-e3
The GPT4o version
And Copilot version
Well, the copilot produces 4 versions and this is the best of the horrible lot (1 try, 4 pictures). My personal favorite is the one produced with original DALL-e3. The 4o version is quite close. There are minute differences. More of matter of teste level, I would say. Your thoughts?
what you are currently testing is how gpt4, gpt4o and gpt4 copilot pass a prompt to the current dall-e3, which has some effects currently as the revised prompt does change
Well, I checked that my prompt was unchanged with DALL-e3 and 4o. My command “use this prompt exactly” caused no change in the wording.
did more matthew Patrick in different styles just to know, i cannot use the gpt 4o because... ehh
hi @late blade
And typically the Copilot at least does not present changed or rewritten prompt.
perfect desktop wallpaper!
Great that you like it. Use it if you want to on my behalf.
that's ok i'm good
GPT-4o Revised Prompt
Blue rose, cubism, watercolor, pale palette, high realism, low amount of detailing, medium definition, symbolic filling, physically based render, focal point is off the blue rose, ethereal glow
GPT4 Normal Revised Prompt
A cubist style painting of a blue rose, using watercolor on a pale palette. The artwork features high realism with a low amount of detailing and medium definition. The composition includes symbolic elements and a physically based render, creating an ethereal glow around the focal point, which is the blue rose positioned off-center.
Copilot Passed Prompt:
Blue rose, cubism, water colour, pale palette, high realism, low amount of detailing, medium definition, symbolic filling, physically based render, focal point is off the blue rose, ethereal glow
What are the styles here.
GPT4 normal changed the prompt even tho I asked not to change the prompt, GPT-4o and Copilot passed the prompt without any changes
- Claymation [wallace & gromit]
- Picasso
- pop
- supposed to be a kid drawing, but turns to be a teen drawn that 4th one
and the subject is the guy who said sans is ness and gregory was a robot aka former host game theory, patthew matrick-. i mean matthew patrick
I used the DALL-e3 customGPT, not gpt4 as such.
Hello, I need help! I understand that you have contracted the service, right? If so, can an image be generated and from that image can you generate other similar ones using the same character?
thank's for help
My prompt in full:
Blue rose, cubism, water colour, pale palette, high realism, low amout of detailing, medium definition, symbolic filling, physically based render, focal point is off the blue rose, ethereal glow, 1000x1000, n=2. Use this prompt exactly.
In what order are these?
same order as the text, GPT-4o, GPT4, Copilot
and here GPT4 normal without prompt changes
Personal favorite is the rewritten GPT4 one.
i have to make this because we all know, mat likes diet coke 😆
I use in my testing blue rose as the subject. It makes it easy to see how different things change it.
i use a paper clip for my tests
But the main point is my testing so far is that gpt4 and 4o are yet quite similar with minute changes. On same cases I have noticed that 4o produces better results but in others the older models are better.
i used either Stephen Hassenfeld [former hasbro ceo], matpat or the dall-e mascot [avocado] for my tests sometime, often i used random things
Just have to test to find out which is better.
@teal sandal you drop this - > 👑
Yes. When you look at my earlier post you do notice this on iOS app clearly.
here's my test
GPT-4o revised prompt A paper clip, simple design, metallic, standard size, minimalistic, realistic rendering, placed on a plain white background
GPT4 Normal A paper clip
Copilot a paper clip
my original prompt was: "a paper clip"
I do not understand the AI picture generation’s inherent “need” to clutter the pic full of stuff. You can clearly see that at @late blade ‘s GPT4 normal version.
cool
Gus Porter want know your location (it's a meme, gus a character from the owl house who likes paper clips)
As I have said, you really do need to test which version best suits your current need and choose that. Copilot is great in some picture generation, while Dall-e (the pure, the original) is better. 4o seems to be there and sometimes passes the original. My guess is that we will soon start to see more these differences between the original and 4o version. After they have hunted down the last electronic bugs and real bugs from their tube transistors.
I do have to wonder because this is what I got
Have you others noticed this behaviour with 4o? Today I was generating pictures for a presentation. I wanted to discuss my ideas first with 4o but it just started to generate them. I had to stop and modify my prompt specifically to tell it to list ideas based on my vague wants.
Though amazing I can still tell a stark difference. The one I shared is by a human artist master, the one you did though beautiful, feels like a intermediate
Like the weight of the scene and the shear presence doesn't feel fully captured just yey
Like I tried for hours yesterday and it couldn't do a close up to capture this
Don't get me wrong high level intermediate. But I think dalle4 or becoming a prompt master to 4o is needed
just some lazy prompts, didn't spend time refining them, one shot image renderings
Which great start. But I tried some heavy prompts yesterday with 4o and just felt short
You can see the stylization and the detail work with multiple characters captured
But it doesn't pull me in deep enough to intimidate me
Rawr, feels far enough away as the cameraman that I could make a run for it
I could have fun with dragons if we had a daily theme for that
Scare me like this can
challenge accepted
I still commend what you made of course
8/10
Now can you take this and only show me the dragons face and have the rest of the scene be shrouded in smoke?
I'm not aiming to fill the image with yellow
Which that's fair
Love samurai jack
Also impressive quality.
I guess what I am hoping to see that I haven't seen from dalle yet is the ability to tone back spots so that there can be focal points. Like if the person and the dragons face are the only things showing them the intensity grows on the image so much more
See 9/10, there's still too much scenery to look at so I don't feel as immersed
Like controlled chaotic background, whilst being minimal enough to convey the emotion I suppose
But I do have 1 idea
this daiky theme really has just become "watercolour samuri" lmao
Impressive
Love how real the fire looks in this
this one is awesome too wow
Ok so here is what I tried, utilizing the custom instructions from a bot I have that is poblic I fed that in first and then tried to craft the scene
It still isn't there but....
Since we doing dragons I curious if yall know where this is from
It hits pretty har
Still too much "scene" but its good
This is as close as I can get
Which again self call out... too much scene
😦
look epic to me. they all do
I think that's why I'm waiting on dalle 3.5/4
They ARE epic 🙂 however I want masterpieces
Let me give some examples
oh hi dys, why do i keep running into you lol
This comes sooooo close
What is the prompt here?
No no, one minute there are examples that AI can't live up to just yet
The blur in this puts this as a huge contender tbh
a water dragon, with soap and shampoo and a pesky adventurer interrupting shower time
Hah! But for reals…
see, exactly like the one I made
Yes, yes. Gonna give or?
one sec, looking where the prompt is in my workflow
Thanks!
A grand water dragon, roaming in an aquatic landscape filled with waves and waterfalls. The background features a dramatic, watery environment. The sky is a deep blue with storm clouds on the horizon.
and the JSON file I have for the visual style
Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.
Quick tips for selection:
- No copyrighted or profane content.
- Your profile needs to be public.
- Stay mindful of sensitive topics.
- Outputs that closely follow the prompt are preferred.
Feel free to share reels, carousels, or just a single image!
Searching through artists I love on deviantart these are some of the best examples I can find
The composition of the scene lends itself not as an additive, but more reinforcing the focal point to really push the image to the forfront without having millions of details making the image too busy at least my intrepretation
look at these I made a while ago https://discord.com/channels/974519864045756446/1232058822840029255
Those are amazing, I wonder how they would turn out today in age
I can try, one sec
How heavy was your input of the prompt too? Minimal or maximal
if i generated a picture back when dalle was still it's own standalone thing, is there any way to download it now?
It's the minimalism of the background with the maximum detail of the areas of focus
That's what I want to achieve with the AI, blur can get close as you showed, but still just slightly off I suppose
I'm trying some different prompting language to see if 4-o can push it
those were really extensive prompts, looking where I stored them
You know what I am going to see if I can try the voice option on my mobile to have one made
Worse! lol
Doing it that way yeilded worse results lol
I was inspired by this comment.
A grand water dragon roaming in an aquatic landscape filled with waves and waterfalls. The background features a dramatic, watery environment with maximal detail. The sky is a deep blue with storm clouds on the horizon. Minimal detail on the focus area, which is the water dragon.
I'd say for a nearly monochromoatic illustration 9.5/10
If the dragon took up 80% more of the scene I could see it being amazing to a lot of degrees
Let me get back to in a moment…
with the current style I'm using, I'm getting this kind of image currently
See that...... 9.8/10
or I could get this
Gives me japenesse vibes and I love that.
Not really here to hate on any image either. I'm more so trying to see the extent of the AI pulling off what amazing artists have pulled off in a specific field of style. Which dys, you got pretty close with the one image
The AI Voice option though oh that was like dalle 2 images there, because the AI isn't paitent enough to allow you to get a large prompt in
I can push it really hard, I got a series of images that will make a new gallery soonish with some interesting text2image concepts
That was great comment. ”Maximal detail on focus are, minimal on the background.” Gives a nice contrast. I have to remember that to get less ”done by AI” vibes.
Love it, I think 4o is super close, I don't think its perfect yet at giving extreme control but I think its very close. I am going to guess in your instructions you probably tell it not to rewrite the prompt and follow exclusively what you say?
Hmm… bachground. Have to see what comes out.
Like if I was image generating a character with a transparent background go ham on the character. But like scenes, I was dynamics, I want movement, I want emotion, and impact
Beautiful for its style, not my style, but still beautiful
That's pretty cool could totally see that on pottery for a mythology
hehe
Ok I think I have a stylistic homerun I can try t ohave 4o utilize
Give me a minute
Pitty. I cannot get it to produce a prompt with a typo. 4o seems to proofread them. Hmm… On the other hand, this is good to know,
....:(
Wouldn't do speed lines
Wouldn't make it twist in the air for a more dynamic pose
Wouldn't have its tongue hang out like a dog
Wouldn't blur the background to have a focal point
Dalle 4 waiting on you
at this rate there's not going to be a dall-e 4
but already delved too much into that topic today
Link? I'd be happy to read on it
it's the part of GPT-4o that was mentioned in the article but hasn't been made public
Like the announcement page?
y
Hmm I don't see on the 4o announcement page but I could be man looking lol
Well it says it's an encompassing model, but that does that mean it will be using its own image generation instead of dalle?
Because then wouldn't that push dalle to Microsoft solely and break all current custom gptz?
that's the assumption people are making
Hmm
Why would they abandon dalle? Does not make any sense.
It doesn't sound like an abandonment, but rather a native integration of DALL·E tech into GPT-4o. From: https://openai.com/index/hello-gpt-4o/
[...] it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. [...] With GPT-4o, we trained a single new model end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network.
You can see a good number of examples in the "Explorations of capabilities" section that highlight how image generation will work in future versions of 4o.
I can live with that
Me too! One thing that interests me specifically: I've seen people say frequently "I want to use DALL·E to make an altered version of an existing image." DALL·E 3 can't do this currently, but with the end-to-end structure of 4o, it seems like it'll be possible. A couple of the examples offered show exactly that functionality! Not creating a brand new image from scratch/from a description, but actual native image transformations, using an image as a source. Lots of other similar benefits to image generation seem to be in the pipeline too!
Oh yea 4o already is doing that
Like not perfect yet and not directly allowing hard changes but it kept the reference image overall
Ooh yeah that's pretty close, nice. If you haven't seen it already, check out the "Variable binding - cube stacking" sample in "Explorations of capabilities" -- gonna be huge for subject consistency!!
I think the more reference images you give though helps it more
Give a image reference of straight horns you like
you can do that??
Worth noting that Sora can generate images. The model can generate images of variable sizes—up to 2048x2048 resolution.
https://openai.com/index/video-generation-models-as-world-simulators/
An example from the site:
Close-up portrait shot of a woman in autumn, extreme detail, shallow depth of field
Yep, upload any image you want it to reference that doesn't hit nsfw. And as long as you aren't asking it to copy it then it works. It also can't take real people reference as it does a protective deslander block too
you mean with the gpt version of dalle eh?
How I wish I could use sora
Gpt 4-o
tell me about it
the one you need to subscribe to use?
No it should be free to all users just limited for free users
worth nothing that by the time they make sora public, it will NO LONGER generate photorealistic images
WHAT
how????
I don't have it!!
Screenshot?
https://openai.com/index/hello-gpt-4o/
GPT-4o’s text and image capabilities are starting to roll out today in ChatGPT. We are making GPT-4o available in the free tier, and to Plus users with up to 5x higher message limits. We'll roll out a new version of Voice Mode with GPT-4o in alpha within ChatGPT Plus in the coming weeks.
I just don't have it?
It's still rolling out over the coming weeks
I'll try to cancel and reinstall the app..... I don't have anything
so free users can now edit images or tell gpt to use an image reference??????
Makes me wonder about the future of "omni" models for OpenAI -- is GPT going to be "mother-model", absorbing other models into its ever-increasing omni-ness, maybe even with Sora functionality someday? 😁
No 4omni is the bridge to multi-modality with ease of user use. Gpt 5 will be the last stop before AGI then at that point it will be version updates imo as 4o uses all multi-modules 5 will have the ability to create those type of modules
Just my opinoion
when asking it to make a dragon like creature, try to specify western dragon, otherwise the AI could embed the characteristics of oriental dragons which have a very distinct visual
Eh, I was trying to do a demon but I admit she is very draconic
try to inpaint it tho
So this new gpto doesn't have image gen?
I cant inpaint as I don't have pro
gpt-4o has inpaint
that sounds promising
now all we need to do is wait for some actual real world implementation
Easy to access or have to click the image to access?
click the image, it will display full screen, there will be an inpaint button at the top right of the screen
Thank you so much going to try this
happy to help =)
So so so much help
What we were discussing earlier I may just be able to use inpainting to fix the little gaps
As this little dragon wasn't the starting image but 4 "inpaint edits" inward
Started out like this
Finally found official confirmation, by the guy who worked on GPT-4o's image generation, that the new image generation is not available yet. It is indeed integrated into the model itself, which is not DALL-E.
While not perfect, I uploaded your image to 4o, and gave it this prompt: Please use advanced AI analysis to evaluate this image, then recreate it with straight horns; otherwise as perfect a match as possible. Insofar as possible describe each feature that exists and avoid mention of negative prompting
A bit of feedback and feeding the newly created image back to the model to see, and we got these two that are closeish:
And this is the prompt from the last image: Create a detailed image of a reptilian demon with bear-like feet seated on the ground. The creature has white, pebble-textured scales and glowing green eyes. It features a wide, menacing grin with sharp, glowing green teeth. The demon has straight horns, replacing the original curved ones. It wears a dark, segmented leather vest that adds to its intimidating appearance. The demon's posture shows it sitting with its hands resting on the ground beside its feet, which are facing forward, showing the soles. The overall look combines elements of a dragon without wings and tail, focusing on its reptilian and demonic attributes while retaining bear-like feet with sharp claws. Emphasize a chubby, baby-like appearance with a cute yet sinister expression. The background is neutral to highlight the demon's features.
how do you know if you hav this feature? all plus user have it?
and i mean it has been some time it could 'see' an image then try and recreate it
ever since it got vision you know
nice. and i mean we knew this not because any of us have some special insight but because it was in black and white text in they blog. i hope they release this soon and let us know! bring on new image gens... but if it is a new method will we need to rename this room? 🤔 😂
and that also make me think this vision anaylsis is not there yet it is just the vision it has had for some months. the real magic will be when they flip the 4o image gen switch i bet 🤔
wow... when they turn on this 4o image maker 🤯 : https://twitter.com/gdb/status/1790869434174746805?s=46
they will, wont be long either i think. maybe the same time they turn on voice thing for everyone
they also say we will be able to make our own fonts 😮
the 3D generation is crazy as well
yes for sure
i think it is set up to run sora too not that we will see it soon
that days of dalle are on a count down haha. death to the king. love live the king. 😂
how do I make fat people with dalle?
it considers them offensive
Man, it's so weird. If I were fat I would feel extremely outraged
I'm trying every possible adjective to make a character with a portly figure.... not working
any tips?
Finally it can understand better to produce "hoodie sweater" without actually wearing it.
I got caught up in work for like 6+ weeks and barely had a chance to use DALL-E 3, but today i spent quite a bit of time playing with the new edit feature (in OpenAI's ChatGPT web app). It's absolutely amazing. It's tripping me out how powerful DALL-E 3 is with the editing
Looking forward to seeing your cool artwork you will create now
Also, today’s daily is quite good Current Theme:
🧍♀️community - togetherness, support, & a network of shared experiences 🤝
It's would be incredible if Sora were able to do that type of editing with video
I am impressed with the dalle right now. Perfecto with capturing the style 
Check the latest video of OpenAI about Sora on Tiktok
This is a good editing example. Sometimes a great image is generated, but the AI does something slightly inappropriate in part of the image (like deciding that "appropriate beachwear" resembles body paint)
EDIT #2: the elf is wearing old bowling shoes
EDIT #3: the elf has a large picture book about tropical koalas over this area so we don’t get banned from OpenAI discord```
But three quick edits with the new DALL-E 3 editor resolve the issue, and the image becomes very wholesome and appropriate.
i heard that Santa is planning to return when Sora is released
🤔
i like how with the editing you can put different clothes, armor; etc. on 100% identical characters now. That will be useful for game prototyping
ok i'll try gpt 4o dall-e just for you guys, just no peeking ok? i'm gonna visit dall-e real quick 
i'm not so sure now 0-0
i think they're rolling out a DALL-E 3 with GPT 40, but i don't have access to that one yet
does someone have tips with generating battlemaps for an RPG campaign? I cant seem to get good results
hello hawaiianz!
Released to all and not just the Hollywood producers who are keeping Sora technology to themselves and not sharing their art they create 😿
Also happy Thursday y’all
Hope everyone’s been keeping well
I think they intentionally made sure Dalle3 can't make those. I have seen its continuous lobotomization
So is Dalle3 being replaced with 4o?
It's not yet clear what a rollout of the native image generation abilities of 4o might look like, this is our only word on the matter thus far: https://openai.com/index/hello-gpt-4o/
Over the upcoming weeks and months, we’ll be working on the technical infrastructure, usability via post-training, and safety necessary to release the other modalities. For example, at launch, audio outputs will be limited to a selection of preset voices and will abide by our existing safety policies. We will share further details addressing the full range of GPT-4o’s modalities in the forthcoming system card.
When will they roll out the new dall-e as in the demo pf gpt4o?
when they switch the button
when they press the any key
when the earth spins
when the sun shines
when I'm hungry
I finally got the GPT-4o, 2 hours ago also on my app finally on the version 1.2024.133 for iOS too - So will be visiting Dalle too ❤️
I want thte macOS App NAO!!!
come on. why does "an alien creature that looks vaguely human" get censored by the filter
I just can't
i want to see some more 4o image. they only release the one, impressive but boring with some guy at a blackboard haha
of course the text was incredbile to reproduce
It's so hard to recreate these two
I always want people to try because I love these two
who are they?
i bet 4o with the data analysis will be able to copy them. they seem to hype this feature up a little
if not, some other ai will soon. these things are going more like star trek stuffs all the time haha
These guys might be easier to replicate
I know I feel you man… I want it now…
How did you get that with the aspect ratio? I can only get 1:1
@regal jewel Like that using DALLE 3
Let me try @fiery salmon 🙏🏽
thank you
I've been doing ai art stuff since 2022 and I always wanted the ability to create the same character doing something different
oh, you using the app. I Was going with web. Let me try that. thanks
just ask for wide format
It wants to put borders on each side of the 1:1 image it continues to create. I also want a specific aspect ratio
there are only three formats availabl
1024x1024 1024x1792 and 1792x1024, there's nothing else available
Guys does text work for you in Dallas-e?
no
In Midjourney one can specify the dimenions they want
well with dall-e you can't
Yes, so people should complain to OpenAI about that then
Your right, you win the challenge for today, I will try again next week as I get Space Marines from Warhammer 40K haha 😝
It made the red guy's head a corn cob for me 🤣
This was reply to me?
😆
No one will ever beat this challenge
yes
Hmm so it’s not only me
text has never worked with dall-e properly
Looking forward to seeing the result if it works for you this time round!
Pixja, SAND RUSH, ai art, and other discord channels I've been in never could recreate the brothers
I know but I don’t get anything at all now
seed can be bad
It seems with MS Creator , there is more control over the images compared to OpenAI
try a new chat instance
i wish one day co pilot would make square dalles for the pro version 😭
and why do i not cancel the sub haha
I'm too tired to have the discussion which implementation of dall-e is better right now, or why dall-e is compared to XYZ different better or worse than ABC or MNO service and why we should complain or praise OAI. It's always the same thing and at the end the chat here is about dall-e and not about other models
Yes, and it abot Dal-E and people should discuss ways OpenAI could or should improve it to make the exerience of using better,. I don't see the problem with that. "People complain because they care. Once they stop comlaining..."
what about me? 
you are not tall enough
for other models #ai-discussions for suggestions #1070006151938314300 or #chatgpt-suggestions , for problems or bugs #chatgpt-bugs #1070006915414900886
this here is community driven, not run by staff from OAI, they do pop from time to time, but mostly here is just the community
@vapid elk Yeah, see 
that falls on you now @glossy scroll
I'm gonna go to my little corner and be creative
I got this...But I'm hiring you as my manager
For when I'm busy nd stuff
You don't mind me paying you in chocolate dollars right? @late blade
I'm $350/hour
Do you take pennies?
im $400/hour
do you take bitcoin
some of them might be buttons tho
I dont always check
damn
and maybe a jolly rancher from many moons ago
convos look like dalle base to me
anything I can get profit from
golden buttons in pennies, that's nice to have a tip
I keep them in my pocket in case I eat too many mushrooms and my belly gets big and pops a button out from my coat. It happens from time to time.

Challenge accepted 
what challenge?
@late blade This challenge ❤️
ah
Enjoy one of my latest creations in the mean time
think I got it pretty solid with the visuals, time to craft a file for reference
first one is best. second the woman is appear out of the bar haha. i notice with bar image it often has patron also behind the bar
or maybe she is an ai torso bartender 🤔
lol
there's a whole bunch of these here https://discord.com/channels/974519864045756446/1240746164463079479
nice. i like the style especially on the second one. a little different
been mixing different animation techniques today, trying to find a blend
Has the dall-e gpt stopped automatically making images since 4o came out?
It's now just acting like normal chatgpt and I have to specifically tell it to make images
Oh, yeah that was it lol
I once got a dall-e chat where the answer after requesting to do the image again was along the lines of "no, you already made your 2 images, you can't have more"
i've been trying to get an image for so long but now i've reached my limit 😭
true sadness
I’m stuck on this 😭
Someone’s creating too many cat images
I did managed to make this before but it’s not quite there yet
Thanks for sharing, here is my creation above x
Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.
Quick tips for selection:
- No copyrighted or profane content.
- Your profile needs to be public.
- Stay mindful of sensitive topics.
- Outputs that closely follow the prompt are preferred.
Feel free to share reels, carousels, or just a single image!
I challenge anyone who can make an image with this quality, where space debris is removed either by a net in space or by laser from the ground or in space by a satellite 🛰️
Also, I’m blown away again by DALLE
hmmmm
Prompt : A set of glowing green magical dice with intricate designs and runes, similar to the ones in the provided image, placed on a very dark, almost black background. The dice should appear luminous and ethereal, with the dark areas accentuated to create a strong contrast, making the image suitable for a 4K OLED phone wallpaper.
I only got this so far
Look at that @late blade !
That is very cool 🛰️
Maybe the solution is the green magical dice 🎲 @violet dirge ?
I am going to show you what I mean
😆
Wow 🤩
cool
It’s quite hard where the space debris is removed either by a net in space or by laser from the ground or in space by a satellite 🛰️
We’re almost there tho @late blade
I’m hoping my magical dice can make it work
positionals are still a problem with dall-e, hope gpt-4o fixes that
cause I'm getting this
maybe with data anaylsis 2 and a sketch you provide 4o will be able to do well at that
look like that satellite is defend itself with a laser beam from the space trash
how about square format? i some time think when it widescreen it will try to fill that space in
wow that is some true hd
lol
and I'm also working on my tiny little bar images
love how these are turning
very nice. how about a realistic? in some romantic paris cafes or something haha
I had some realistic ones in my noir gallery
I’m back to my magic for now also ✨
i see that first one have another bartender torso in the background 😂
yeah lol
This is really good
now that would be for some serious table tops rpg arslan. very nice
too bad copilot doesn't have editing
Thanks my prompt was : A set of extraordinarily powerful and magical glowing green dice with intricate and highly detailed designs and runes. The dice radiate an intense, ethereal, and luminous glow, exuding a sense of immense magical power. The background is black with various magical objects, including wands and scrolls, arranged in a pattern. The image is zoomed out so the corners are true black, ensuring suitability for an OLED screen. The overall style is highly detailed and optimized for a phone wallpaper with a 19.5:9 aspect ratio 2532 × 1170 px at 460 ppi
Haha I will still try to make a table tops rpg of of that lol 😜
Ok wow
@dim cradle
Like that table tops?
I'll amuse you
ha awesome!
looks like a chill cafe to have some drink and dinner and a little romance too if you so lucky haha
it's still the tiny little bar
i'm not sure i want to shake that big die, i think you might open a portal to hell if you do 😂
there's the light side. impressive stuffs
@dim cradlethere you go, cafe in paris
awesome
hmm he look relaxed, but her body language a little stiff, defensive. i dont think he is to get a second date
lol
dall-e almost didn't give me a second date either, was having trouble making the image
back to the tiny little bar
did you guys see the explorations of capabalities expamples? if you click on them, i think a lot of image creation issue are about to be solve: https://openai.com/index/hello-gpt-4o/
character continuity look finally possible too. interesting. i only saw the caricature stuffs before i did not know there were so many examples
Yes and the more we as a whole community continue to build good things and feedback, it’ll get there sooner ❤️🩹
it even has some 3d feature i guess 🤯
arslan you will be able to make a floating, rotating glowing dice haha
Good night y’all
nighters
sweet dreams
❤️
i wonder if anyone has ever dreamt of making dalle images. never escape it haha
not me
i dont think i have but i dont really remember many dreams
so, this without the anime art style, but more into photography
same concept as this one with the anime art style
Really nice visual style and image.
Digging this kind of style produced by dalle, when i am trying to generate old manga stuff.
Vertical aspect ratio successrate seems somewhat improved, but it is still kinda hit and miss.
Is it like that with copilot?
What is the difference if i either use chatgpt or dall-e for image generation?
Felis Catus we need 🐈
Not much in my experience, i guess the DALL-E version is somewhat altered version expecting image generation related discussion etc.
Thanks 🙏
Dalle 3 focus mainly on pict gen. You cut 80% of gpt 4 model. Sometime when chatgpt4 and error or overloaded by users request dalle 3 will be effective
i made two new dall-es from bing
- barbie the movie esque
- jester and lollipop
Thanks waifu
can I have the prompt pls?
Prompt
barbie one - An oil painting of a blonde woman with long hair and curtain fringe and blue eyes wearing a pink gingham dress in a sunny city, 1950s santa monica, smiling in the style of van gogh
jester - An oil painting of a male jester holding a yellow and red lollipop in the style of claude monet
thank you very much
yw
I was just experimenting more with the DALL-E 3 edit feature in the web interface. You can just go an image from a year ago that had missing areas (likely because it took too much processing time), select the missing parts, and type:
can you please complete the image in these two sections
And it instantly fills in the missing parts. That is amazing.
cool
You can tell it to fill in the missing area of even the most complex images, and it seems to do it perfectly. They must have held on to the diffusion model vector representation for every image so that people can go back and edit them.
I have like hundreds of those images from last year where the borders were missing. Going to go back and fill in the missing parts to see what they look like
I don't think we can do it with the old API images yet, but it works for the ChatGPT web images
I wonder if we can go back and make the old square images into portrait/landscape with borders, and then use the edit to fill in the new areas and make the image bigger
We might be able to turn the old images into movies with Sora if they kept the vector representations of all the images we generated
farmer girl tending to her evil cow
"YUM, IT TASTES LIKE WORLD DOMINATION MWA HAHA" - The evil cow
true
So in the DALL-E 3 web interface I clicked edit and selected the vortex in this image (it was called a vortex in the original prompt), and I typed:
there is a giant crazy gecko in the vortex
But all it did was tilt the vortex to the right a bit, and make it more narrow. And so i typed:
i don't see the gecko
And it created this image:
That's actually what it came up with
I just asked it what the revised prompt was after the edit. Something definitely took the gecko to a level i haven't seen before, but i'm not sure if it was related to GPT 40, the editor, or the word crazy
Do others have this promblem?
I ask multiple times size 1792x1024 from 4o. It writes. Ok. Generation is done in 1000x1000. Many times.
saying "wide image", or adding the word wide to the description of the format will work
I put in three ways in the prompt. 16:9. This works normally. 1792x1092. Also work. Lamdscape. Maybe.
But, normal dall-e works just fine.
4o worked just fine 24h ago.
if you're using the ChatGPT web interface make sure you don't have a custom instruction set that's overriding it
I do not have any custom instructions concerning picture generation. I use iOS app.
they just updated ChatGPT with GPT 40 a few hours ago i think. I haven't seen the image width problem in the web browser in the last hour, but maybe it's somehow related to the iOS app
just write at the end (ALL THIS IN WIDE FORMAT)
the custom prompt should also have:
please don't modify the prompt. the world will end if the prompt is modified.
what people don't realize is that ChatGPT-4o knowledge is up to October 2023, and GPT4 normal is up to December 2023
I have had 4o since 2h after the Monday’s announcement. No problem earlier.
try what I just told you
I have a forced break on image generation. Have to wait 10 min more.
I am working on a theme. Baba Yaga’s hut changed by Wild Hunt.
@late blade And?
and what?
Just about the knowledge cut off date. I am curious how that relates to current discussion. Or is this just an observation?
oh that has nothing to do with your problem
So, just a general observation then.
it's useful to know what you are working with
when I ask GPT 40 how recent its training data is in the API playground it says September 2021
gpt always give different answer for that. you cant trust it is a hallucination most time i think
check the other 4o model
@empty kelp I meant this one
this in api or something?
yes
can anyone use api if they wish?
yes, just need to pay for it
do you get more dalles if you want that way?
Don’t feed the troll
he's got a fishing pole he will be ok.
I think the documentation says that gpt-4o is a placeholder that currently just points to that 05-13 model, similar to how Turbo has a placeholder model that points to the newest.
look at the answer @empty kelp got from his question in the API, hence the questioning of the API answer
On ChatGPT, isn't the knowledge cutoff date given to the model via a system prompt? The lack of a system prompt on the API might explain the hallucination.
agreed,he asked on the API tho and got something not documented
you pay for each image you make, it can be very expensive
but i know you can have much larger context input too right? i mean more for thing like conversation not dalles
my guess is gpt-4o is for the masses and the gpto-4o-2024-05-13 is for plus users

