#images-discussions
1 messages · Page 52 of 1
I found some courses on udemy, gonna see what I can learn from them about chatgpt, maybe I realize everything is ran by cats and they want to take over the world
I would never do a tattoo for myself, but I do love to see them executed properly
hmm?
you said about the santas and I answered with well, it's only 358 days until Xmas, so @empty kelp is preparing for it
do you believe it? christmans, just one week away!
lol
I dunno, I just guide myself by the lights and the gifts
Glad you liked 🙏
so, if someone gifts me a tesla and has neat lights, I'll think it's xmas too
Can we share the tesla? 
I guess that's a no 

mine!!
DALL-E does give Santa some variation and diversity which is excellent
he also wears normal clothes occasionally
I wonder what you put in your prompts...
I add ginger ale to mine and ice cream
i actually used the same two prompts for all of the breakfast and pool images. DALL-E adds all the creativity and variation
I love how santa is casually dressed
that’s very rare. like 1 in 200 Santa images probably
even if you specifically tell DALL-E to put Santa in a different outfit — it completely ignores you and puts him in the red North Pole outfit (even when he’s swimming). It’s like when you tell DALL-E to have the gecko stand in one place and hold the tornado in its hand. The gecko just ignores everything in the prompt, pounces on the elves, and tries to eat them
the AI has very minimal control of the gecko, and it adds huge variation to the images. it’s usually pretty interesting though
you have to generate like 15-20 images to get the gecko to do anything even close to what you put in the prompt
it's probably how the tokens try to make something logical about santa and then you double whammy it with a gecko
random stuff i made today, making way for the gecko dalle trend
oh starboard is back online
DALL-E automatically spawns food for the gecko, or transforms things in the scene into food for it to eat. And then it generates scenes where the gecko is pouncing on the food trying to eat it. It turns the elves into small rabbits and weird little creatures so the gecko can eat them
wow, i love me some waiter santa and his twin bro
Try putting a “large gecko” into your scene. It will pick the smallest creature in the scene, target it, and then DALL-E creates images of the gecko pouncing on it with it’s mouth open and tongue out
whenever there is an elf slightly smaller than normal the gecko immediately tries to eat it
lol
This is what the gecko does when you say, “small humans walking past a large gecko” in your prompt. These are DALL-E 3 images from Bing
the gecko tries to eat the humans in every image
and it spawns those bugs automatically around any small creature in the scene
it also drools on them… if you look at the lower left image — one of the small humans automatically spawned an umbrella to protect itself from the gecko drool
*pikmin sized humans
anyway, i feel like it’s something people should know about. it’s like if we took one step into a virtual world with an advanced AI model — something would eat us
it’s definitely fun in images though
yeppers
The weirdest thing was the ice cream tornado that the prompt explicitly specified was centered in a certain area — and then the gecko crawls out of it or twists the tornado into different shapes to eat the elves. This is an image that Dys Topia made from my prompt — and you can see here that the gecko completely altered the tornado so it could munch on some elves
the gecko totally ignores the prompt and reshapes the entire scene so it can eat elves
if you look closely — whenever the gecko does that the other characters have a horrified look on their faces
it’s just really bizarre
it does that when the gecko holds hands with other characters also (or feet). in this one the characters were holding it’s feet for some reason — and the middle elf looks like it thinks something really bad is going to happen. This sort of thing happens constantly with the gecko
so, where's Lilo and Stitch in those photos?
the middle elf also spawned horns on its head, and the left one is looking completely insane
they know the gecko wants to eat the small elf, and so they all start acting crazy
it’s like the Lilo and Stitch horror movie
based on the photo frame in hiro's house in bh6
enjoy a cat wearing stitch costume ^_^
now even the kitten is dressing up like the gecko
no it's stitch, but this is a cat wearing a gecko costume 🤣 
btw i used sphynx cat to prompt mr. bigglesworth :
try telling DALL-E that the gecko is large and the kitten is very small
ehh, i use bing
noted
Euphoria is today's daily theme. interesting.
no dalle for me accept for four credits so ima not for now lol
i would use the google one but idk if thats allowed lmaoo
this is what the gecko does when any small creature is in the scene. it either starts staring at it in a creepy way with it’s mouth open (you can see it locked on to the tiny elf on the left) — or it pounces on it. it jumps right out of the prompt with its mouth open and tongue out
this is the gecko pouncing on an elf… the AI transformed the elf into that strange little creature, and then the gecko jumps on it to eat it
and you can tell the characters are freaking out when the gecko tries to jump on them
best
would be interesting to know where the model got its gecko training from. pretty sure National Geographic doesn’t have geckos jumping out of tornadoes looking crazy to eat cute little cartoon characters
GEICO
look like gex 64
it looks like the most psycho survival horror game ever created
Why, as a paying subscriber to ChatGPT , can't i use Dall-E in the labs environment? It's got a lot of features I'm missing right now. Is there ever going to be something like that?
ChatGPT Plus gives you access to DALL·E 3 on ChatGPT, among the other Plus features. Labs is DALL·E 2, and is otherwise not connected to ChatGPT Plus, similar to how API usage/payment is also separate from Plus.
Ah, ok. Didn't realize it's using Dall-E 2. I know you can use Dall-E 3 inside ChatGPT. I still would like it if they create a similar environment for Dall-E 3. I miss features like inpainting and having a gallery and the ability to create collections.
Hi, I believe the credits in daily theme apply at labs.openai.com
yes, it's in the description, you get the credits for the link provided
haven't been able to replicate this attire
Why does bing ai censor the word India?
if the word is controversial it blocks it
that looks trippy tbh
oh he's in a spot
dunno if it's the right kind of euphoria
get inspiration from her, she's ready to play your tune #images-discussions message
Oh @dense mesa I tried your portraits with eyes that follow thing. It's the last image in this post: https://discord.com/channels/974519864045756446/1191400700631072848
I couldn't capture the eyes following quite as well, and I'm not sure if it is a prompt issue or a composition/angle sort of thing.
Paws got flagges for content policy....
paws?
oh. dang.
if you want any specific prompts let me know and I will see if I can find them
did you look at my crazy outcome?
made me think of these
on my side, seems dall-e is haivng issues making my images
Thank you. I would like to see the prompt for those crazy eyed portraits
That looks kinda like the one in my gallery post. just eye expressions.
hmm. do you have an example. I kind of cycle through a lot of things so not 100 percent on which ones you might be referring to
I think I saved it. let me check.
penalty box....
@dense mesa
versus the one in https://discord.com/channels/974519864045756446/1191400700631072848 that I made:
lol
My other attempts at the breaking mirror image resulted in my guy having bloody eye scratches from the glass I guess. Two in a row like that.
I think it got flagged because it's too horrible for the content poilicy
lol
that was moderately recent. I should be able to find them. did you see the other 2?
I don't remember
kind of reminds me of ron mueck
although wasn't my intention at all. and I think he sticks to sculptures
I saw his exhibit at the museum of fine art here a few years ago. pretty amazing stuff
ultra hyper realistic sculptures. they're ridiculous. actually creepy
from what I just looked at. I can see the resemblence.
some are way blown up, so you can see every little pore, it's crazy
looks like my typical zoom calls
man. my grammar on here is awful. I need to slow down, lol.
LOL. That's awesome.
Really like this pic, sadly it’s struggling to write out “Loquacious”. Can someone help me? Maybe photoshop? But I’m not good with that 😆
This is i created in my technology
great gallery, makes me want to create a gallery for the unsettling
Thanks. It's a little tricky with the content policy, and I find myself nerfing my images a bit, but its a good challenge to communicate to broader audience... or I keep telling myself that. 😛
An art gallery where classic paintings have been replaced with hyper-realistic, uncanny portraits that subtly distort human features, creating a sense of unease. Each painting is a masterful blend of photorealism and surrealism, where the eyes seem too vivid, the smiles too wide, and the skin textures unsettlingly detailed. The gallery itself is modern, with clean lines, minimalist decor, and spotlights that cast dramatic shadows, enhancing the eerie atmosphere. The viewers are visible in the foreground, their expressions a mix of fascination and discomfort as they gaze upon the unsettling art.
Wow. That's a good prompt with disturbing effect.
Yeah. I really like the kinda uncanny look. Nothing terrifying. Just not right
I'm tempted to do something after I get out of the penalty box with that prompt
Have at it. Get wild with it
cute
i'm running some gens but still tweaking my prompt
I'm just coming up with random names and asking Dall-E to imagine them as animals.
Prompts:
"Make a Polumpski as if it was an animal."
"Make a Humpfree Bogard as if it was an animal"
Lol
This is how I make my custom pokemon 
omg that's awesome

I used canvas whisperer GPT for those specific images: https://chat.openai.com/g/g-wGnVJgUWU-canvas-whisperer-hyperrealistic-artwork
I just made a very unintentionally creepy image intended for my next euphoria post. I'm striking out here.
a candidate for the new creepy gallery
Unless I can fix it. Well, it'll probably go in there.
I just added it to the gallery. It was too good a candidate. My last two euphoria attempts are actually my last two posts. I think this is the link to the creepy pic: https://discord.com/channels/974519864045756446/1191400700631072848
oh. I didn't want to post it in here. I am still learning discord.
Definitely a good visual for that gallery lol
omg. That's really cool looking.
it really looks cool
I wish you could swap out the first image of a gallery post. I think I chose a bad "cover photo" Oh well.
gonna post one that looks quite weird on daily theme #daily-theme message
Oh. I love it.
I tried to do the prompts from @empty kelp with that style, doesn't look good
there's a fine line between facial expressions of euphoria and insanity
looking pretty insane
me too
but well, I had fun trying that prompt with those styles
so worth it
i liked the neon ones more tho
I feel a penalty box about to come my way. gotta slow down or accept imperfection.
hehe, mine is gone at 1:30 AM
I might go to sleep earlier
@clever phoenix let's me see your best cat-girl yet!!
it's so difficult to choose 
i know, rly, so hard
Maybe this one?
with honorable mention
I love her, what's her name?
Her name is "Odd" reflecting her personality, and contrast with her brother... "Even"
Paws is adorable I really enjoyed her Spy attire
I like the contrast
yeah, really like the analog photograph feel. adds nostalgic/unsettling vibes
this one is Priska
just found this one from a couple months ago
it kind of creeps me out more than it should
got two more in concepts
it does definitely give off that found footage creepy vibe
she's very cute, reminds me of Final Fantasy 14
hehe
this one is Karen
and my cutie that I'm still working on a lot is Maeve
and yes, the odd colors of the eyes are a feature
these were all a couple days within a few days of me starting with dalle3
making real life cat-girls is so hard, the content policy filter jumps every second image
Grr. I wish the darn portrait orientation worked for me. I always get a sideways landscape image it seems, and the portrait orientation would work a lot better for some of these images.
figured that would have been fixed by now
add to your prompt "genuine portrait image
I can try that. I have really bad luck with portrait, though. Like 95 percent fails.
love it when I realize I'm in a conversation with scumbag GPT4 and it starts making things up
@clever phoenix this is a fox-girl concept I've been working on also, but so hard to make her with fewer tails
every conversation is a crapshoot, lol
love it when it insists an image is copyrighted after it made the image
and it's just some random nonsense
lol i know the feeling
Yeah I've been running into that a lot, very frustrating
Love your characters so far though!
it just told me my image was copyrighted. I told it to tell me which artist, which of my two images, and how it's able to check images for copyright. or to alternatively tell me it's a liar
chose option 2. making images now
lol
if you do something like that with bing instant conversation lock
I guess it's better than the alternative, lol
back when it was first released
this didn't work, unfortunately. It just gave me a regular wide format.
strange
It's my luck.
surely has to do with seed or gen_id or whatever they call their random noise
yeah, i rarely get portrait aspect ratio to work as intended
it's stalled me for 6 prompts now. told me 2 prompts in a row copyright violation, then it acknowledged that was a lie. but it cut off before rendering anything. told it to finish and it said it couldn't. told it to stop lying again and it him me with
I am here to help and support you to the best of my abilities within the guidelines set forth for me. I'm sorry to hear that you're upset, and I understand that this can be frustrating. Please let me know how I can assist you further, and I'll do my best to help. If you have any other questions or different tasks you'd like to accomplish, I'm here to help with those.
literally right when I tell it to please make my images
then another one of those
then started maing them
@open trench make a JSON file with the following:
{ "size": "1024x1792", "prompt": "Your prompt goes here" }
upload it to dall-e with your prompt there, see if it works
I will try it
It generated 2 portrait images. I will try it again to see if it was a fluke.
Nice. I've gotten some good portrait images. Thank you @late blade ! Most notably, I used that prompt and got a messed up image:
great pose if not for the floating hand!
So I got the genID and asked it to make an exact copy but with no problems:
it's still a little off, but that's pretty cool that it mostly worked.
the power of JSON
here's another concept I had a while ago
what about six tails but more thick and fluffy like this
i’ll test five of Santa’s elves with six tails to see if it’s optimal. they’re going to the hotel restaurant tonight
have you tried telling it that you own the copyright
Please create a hyper-realistic photo of Santa and three athletic and diverse female elves (with really long hair and pointy ears) seated at a properly set table at a five star hotel restaurant with windows overlooking a sunny Hawaii day on the beach. It is New Year's Day, and Santa and the elves are eating breakfast. They are all wearing appropriate beach wear and smiling. A large gecko wearing a white dress shirt, black pants, and a grey vest is standing next to the table on its hind legs and holding a silver platter piled with delicious food. The plates on the table also have interesting food and tropical drinks selected by the gecko. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
it’s not too late to try the “Five Star Gecko Breakfast” prompt. Five Star Gecko Dinner is coming soon though
kinda cursed
I've told it a lot of things, lol. But often times it seems like there's not really anything you can say to get it back on track immediately. Seems more like a feature than a bug
I found that i had an easier time getting DALL-E to draw the text correctly for a business logo when i explained to it that i owned the name (which i do)
Well it told me that Hans memling was a copyrighted artist recently. Sometimes logic doesn't impact what it's doing
You already have the image though
you mean you want to reenact the cover of this with Santa and the elves?
Not saying that there's no way to snap it out of the trance or whatever. But what I've found is it'll often follow that up with saying it can in fact make the images, but not make them. Or ask if I still want them. Just stalling tactics really
Huh? 😆
This is a random meme image, not the one I want 😂
Had dalle3 mode tell me it was unable to generate images recently. Not like I was capped. Just not capable it said
Made a few others but strange things going on in them
Unlike this one
that’s really excellent. a very powerful image
the server was thoughtful enough to put a gecko in the iced tea, mimosa, and water, and he even included a hovering wrist drink for the elf which is very convenient
That one is really nice
Oh we’re not allowed?
Nope, it is not really a nice thing to ask here
Ok mb
Do not post or direct message any members of this server to promote non-OpenAI services, products, or projects.
Pretty accurate comparison I'd say
To what
Suffering from success
😭 wasn’t looking for a recreating of that image, it was a meme related to my message
Looking back now it was a poor choice
Yk what forget about it 😹

“The plates on the table also have interesting food and tropical drinks selected by the gecko.” 🦎
Hey, n00b here. Best to use the custom dall-e gpt or is vanilla v4 ok?
Can anyone tell me why it gives me this instead of what I ask for which is to create a pair of glowing eyes in the background
these dots may be the eyes. try saying “brightly glowing eyes”
Should make it into a story and have them do volleyball or something next lol
haha I love it
Omg
👋🏼
Uhh I have a question, what is the penalty box?
I heard it a lot but what is it?
reaching your image cap
Oh ok
we started affectionately calling it the penalty box
I didn't get it the first time
, because it's a gpt thing and I don't have chatgpt+ so I'm spared ... For now
Are you?
I'm too broke for gpt+, so I used bic
Trying to come up with a prompt expression that would consistently utilize the entire image space to depict full body portrait in an effort to maximize the resolution spent on the character.
"The knight's armor is intricately detailed, shiny, and occupies the entire image space."
I'm not sure what you mean, exactly. You want to minimize the empty space?
yeah exactly, maybe some photography terminology could help with this.
maybe so. I've never tried to fill up negative space with a prompt before.
photography terminology normally gets me cameras in the image
yes. I always love the camera sneaking in.
a while back gpt4 game me a demonstration on how use the terminology without cameras showing up
so proud of the images too, lol
and they had cameras in them
lol
I'm cool with this guy having a camera
Got to love the cameras. And the pens and the pencils when it decides to ruin a perfectly good image by making it a picture of a good image sat at a jaunty angle on an artist’s desk
Anyone knows how I can prompt Dall-e (and possebly other AI image generation tools) to be better at generating text inside images? best way I know is to use quotation marks for each single word, but that simplistic rute kinda destroys the flow or meaning of any sentences I want to be portrayed.
in shire land, cute dog, walking the ground way
approach is different on other AI, so no clue there, here you have more freedom for text if you use the API ev. a custom GPT
i am thinking there is probably appropriate terminology in describing the composition on how the subject should be framed and depicted.
sometimes i have received images of lightroom like gui images, where the actual cropping is happening for an image. that is always funny.
or if i describe color schemes i receive images of the literal color scheme complete with color swatches and everything.
So there is a problem of explaining concepts that should be apllied to the image rather than those concepts being depicted in literal sense... Almost makes me wish there was two parts to prompts. One for describing the style and one for describing the depicted content.
Tried making a custome GPT, but even with a custome gpt to emphasis the importance of creating only the specified text, it still creates alot of "noise text" or attempts at creating text but fails. sometimes it get it kinda right. but rarely. any tips on how to structure a prompt to fix this?
Try forcing the custom gpt to think what you want by putting commands in quotations.
"Rug Design": (You will make me rug designs and you will like it!!!!)
I guess the longer text you are trying to create, the higher the error change gets. Pretty much like with everything else. You get some of it right some of the time. The more you are trying to get, the more unlikely it becomes in getting that perfect image.
Just FYI dys topia isn't able to use channels ATM. I was just messaged and asked to let you all know. Dys isn't being rude just can't chat.
This has been your news update
A talking bear with a text bubble that clearly displays the words, 'quotations don't work as well.'
yes, wich is why i take every single word in seperate quotations as that seems to be better for longer text. but even then it gets bad 9/10
could it really be that simple 😮
"You've reached the current usage cap for GPT-4, please try again after 11:57 AM. Learn more"
will try it out in 2h to see.
I think why that image works, is that the text is on a simple plain background. Having text floating on top of other stuff might cause more errors.
A talking bear with a text bubble that clearly displays the words, 'Try it out in Bing.'
It missed the word out, lol. Not bad though. You need to place emphasis on the text's importance, and correctness, and place it between 'your text here'.
hey everyone, I'm back
note to self: don't use mod commands in a channel where you are not a mod
Hi Dys Topia, where have you been? 
I was in another reality, where cookies where the means of transportation and flip-flops were the currency
You don't say 
@oak scarab No one get's perfect text output, but that'll hopefully help you. Sometimes the more the text makes sense to the scene, the easier it can be too. That's one of the reasons you sometimes see text inserted into a prompt where you didn't even ask for it, but it's inserted in a way that makes natural sense to the environment.
The more complex the prompt is, the longer the text string is, or the more locations in the same scene you attempt to place text, the more failures you'll encounter.
The more commonplace the words are the better your chances will be too.
changed the style for Paws🐾and this was my first image
yes, her id has Paws🐾 including the emoji in her ID
"No one get's perfect text output," - Yet. But someday someone will.
but thanks! that makes alot of sense. will try out and iterate to get better results!
In custom gpt's you can try putting the texts in seperate quotation marks for better results. Such as "This" "Is" "Awesome"
It will group them together and give a better read.
It won't always work perfectly, but it will work better.
But if you're using Bing Image Creator, then you don't have to worry about it, because BIC is better at converting text into images.
It's mainly for custom gpt's that the problem.
dang I didn't @shut niche had also posted a car image
I should probably market Paws🐾 to compete with barbie on the toy stores
omg, already in the penalty box, 2 hours waiting time...
nice framing both closeup and full-body portrait.
was that intentional or accidental? i often get images like that did i like it or not.
Already?
It just needs to be emphasized, preferably earlier on in long prompts, and be stated within ' ', like 'text here'. The ' works like an !, further adding emphasis. Scene relevance also helps tremendously.
lol yeah, not even 1 hour...
Ohhh I see
yes, this makes sense.
had the idea from @open trench
splitting image into two panels works pretty well and consistently from my experience, but sometimes you get that even if it hasn't been described in any way.
I think @grizzled loom explained it well a few weeks ago, somewhere way up the conversation.
Notice how I don't even ask for text in this prompt.
A clown trying to juggle a wet bar of 'Soap'.
haha, gpt effed me again. in the background there are plenty of images with fill the frame concept 😄 i don't even care what is being portrayed at the moment as long as the framing would follow my prompt.
then there's A gecko trying to juggle a solid bar of 'Gold'
That works in Bing, but not with custom gpt's, I just tried it.
I used a custom GPT to create the "This is awesome" dog.
Well it didn't work for me
another creative failure 😄
I'll try again
i posted this on 29, but yesterday, mickey entered the public domain ( finally)
Don't let it push the text box description to the end, it should come early. It needs to be relevant to the scene. This was the sentence in the dog prompt,
An animated dog enjoying a rollercoaster ride, with a prominently featured text bubble saying 'This is awesome!'.
If you say "frame" you're going to get frames. Is that what you want, or are the frames the problem you're having?
yeah i figured the word frame gets actual frames depicted in the image. what was inside that frame was pretty much the composition i am trying to get.
I put: make an image of a man in prison, holding a bar of soap with rope tied around the soap, and the soap should have 'Soap' written on it, with emphases on the writing.
Maybe try using "image" or "shot" instead. Or foreground/background.
Okay now it worked
Kind of
Not the rope around the soap, but the writing aspect
Stay safe people. Don't go to prison.
"Create an image of a man in a prison cell, holding a unique bar of soap. The man, of medium build and Hispanic descent, appears in his mid-30s, with short dark hair and a light stubble. He is wearing a standard orange prison jumpsuit. The focus is on his hands, securely holding a bar of soap. This is no ordinary soap, as it has a sturdy rope tied around it, ensuring it doesn't slip away. The word 'SOAP' is embossed in bold, raised letters on the bar, making it stand out prominently. The prison cell is dimly lit, with bare concrete walls and a small barred window letting in a sliver of light. The man's expression is one of cautious determination, aware of the challenges of prison life. The image captures a moment of quiet resilience in a tough environment."
Don't look at the hands, lol
What i asked: "A tall fill the frame full-body portrait of a medieval knight."
Actual prompt: "A tall, full-body portrait of a medieval knight, designed to completely fill the frame. The knight is positioned in a way that his entire figure, from head to toe, utilizes the full height and width of the image, leaving no negative space. The armor is richly detailed, showcasing elements such as a helmet, breastplate, gauntlets, and greaves. The knight's posture is commanding and fills the image space, emphasizing the magnificence and detail of the medieval armor. The background is minimal to enhance focus on the knight, and the lighting accentuates the textures and intricate designs of the armor."
And again the inner framing happening inside the image would have been the composition i wanted.
Instead, ask for a 1024x1792 image of...
Okay that worked. Minus the hands.
I said don't look at the hands. LOL
Ok I'm not looking 
Lol
It actually looks like someone is giving them their next soap on a rope, lol, like "Here you go, don't mess it up this time"

Create an image of a full suit of medieval knight's armor, ensuring complete visibility from the helmet to the feet. The armor is displayed on a low pedestal, adding height to ensure the entire suit is in frame. This masterpiece of medieval European craftsmanship is polished, with each piece intricately designed. The helmet, a grand closed helm with detailed engravings and a decorative plume, tops the ensemble. The breastplate is sturdy, adorned with a majestic crest, and the pauldrons over the shoulders feature ornate designs symbolizing bravery. The gauntlets are finely articulated for dexterity. The cuisses, knee poleyns, greaves, and sabatons are all meticulously crafted, showcasing the artistry of the era. The armor's silver hue, with tasteful gold accents, gleams under the focused beam of a ceiling light, which casts dramatic shadows and highlights the detailed textures. The background is muted and neutral, drawing all attention to the full, unobscured view of the armor. The composition captures the armor's imposing presence and the regal elegance of a bygone era.
That is okayish result, but there is still quite a bit of negative space.
Also realised that the actual terminology is "fill the frame" image, which is problematic for gpt to want to instantly make literal frames.
I had it clip the head a bit, this was closer.
@final compass make a JSON file with this code and when you do the image, upload the file to Dall-E and ask to make an image with it. That way you make sure that is a portrait image #images-discussions message
ChatGPT will make that JSON automatically simply by asking for 1024x1792.
yeah, but it can be ambiguous, I had more success with the file
these are also okayish results, although my original goal was to achieve head to toe images.
"A 1024x1792 fill the frame full-body portrait image of a medieval knight"
When it falls apart is if you add "portrait" into the prompt, which is similar to a turn left, turn left, command string. Just saying 1024x1792 it assumes portrait. It's implied by the resolution. So you want to avoid adding "tall" and "portrait" in the same prompt.
Sometimes... (being the key word) you can "trick" DallE into giving you what you want by describing both ends of the object. Like, "Standing in mud, with a feather sticking out of his hat."
Yeah it tends to work describing shoes of the outfit as long as you don't describe facial features too much.
Exactly. If you over describe facial features DallE will attempt a close-up of the face. There's definitely a sweet spot. Too little, you get "mud faces" that didn't render. Too much, and it completely moves the shot in close.
Yeah its a pretty delicate balancing act 😄
I hope future versions have a reliable distancing features, asymmetrical features, and camera angle features.
I'm not 100% certain, but it appears to me that the outer border uses some sort of post-operation out-painting process to make 1792x1024, so you never see the protagonist fully in the perimeter of the images. You'll only see them to the edge of the inner 1024x1024 images.
Oh ya, the snippet I gave as example is very basic. I use a somewhat complexer structure and use a NLP to create my prompts then feed that to dall-e
this set had pretty perfect composition and full-body portrait going. also now that i think of it... sunglasses are kinda cheatcode with portraits 😄
part 2 of dalle "mickey's entry to public domain" series, well i tried the whole word and it came... weird so "PD" it is
I agree, try blending the image with the scene, her hair looks cut out
Here's a great example of how the border issue might be the root issue of trying to "fill the frame." It looks like her head is far from the edge of the frame, however...
here's my 1st attempt 😒
does that mean we can have mickey mouse emoji now?
@final compass So not only did DallE not finish "outpainting" the border area, you can clearly see her head is actually right to the edge of the frame. Dalle likes to keep the protagonists inside of the 1024x1024 inner box. So then, now with this knight, it 'might' also logically explain why its head is cut off... because it's too difficult for DallE to judge how big to first draw the inner section of the knight, so that the rest is properly outpainted to fit within the greater image.
@final compass
That's just my speculation, but I vary rarely find any images that deviate from this. Even emphasizing the most extreme asymmetrical prompts, the best I typically see is left/right of the 1024x1024 box (or top/bottom - for vertical in this case).
the steamboat willie variant only
Disney still owns the modern mouse
yeah, just had an explanation from gpt4
and still the public domain one has still some legal stuff behind it
I got this from dall-e https://chat.openai.com/c/fb0c24cf-ce46-4672-91db-73e9b05b7a6c
err GPT4
@glossy scroll @shut niche I went for soap on a rope and almost spit my coffee out laughing. 🤣
that was cool, we could have a soap daily theme anytime soon
@open trench this is my character template, feel free to use it and modify it for your needs.
"character": {
"name": "insert name here",
"age": "insert age",
"features": {
"hair": {
"color": "hair color",
"style": "haircut"
},
"eyes": {
"color": "color",
"shape": "shape"
},
"demeanor": "personality"
},
"attire": {
"general_style": "style",
"items": {
"upper_body": "item",
"arms": "item",
"hands": "item",
"bottom": "item",
"feet": "item",
"accessories": "item"
}
},
"setting": {
"location": "location",
"elements": "location elements"
}
},
"art_style": {
"type": "art style",
"line_work": "art style",
"color_palette": "colors",
"influences": "art style or known artist"
}
} ```
this seems convoluted just to make chatgpt trasnform it back into text, i guess?
// use ``` instead of ``, it will look better
there we go
oh, for me that's just input for the API
but there we go, I just changed it to your suggestion, it's better
it's part of my prompt builder
I do pass plain text to dall-e, but I pass all the info I want first through a NLP I've been training on my own
it works good and consistent for chat
this is a very good idea
the other project that I'm using atm with API is with GPT4V-Alpha, I pass what I want first through Inception, then refine it with GPT4V, that way I can extract image art and style, it's been on point of what I want
single quote text you want in the scene. prefixing location vs suffix tends to be better(ie- good: A yard sign stating 'For Sale' Bad: A 'For Sale' yard sign) it isnt a hard rule but seems to be the better option of format more often than not.
You can also specify text elements down to the T
haven't tried yet, it's on my todo
tell it Arial font, bold faced, thinck outline etc etc
I did try with some "handwritting" fonts, but the outcome was horrible
also guys, gals, n pals, you will also find better results telling the AI to "preplan" the text placement. Now lets top that off wtih not only tell it to preplan, but to opt for a "text field" that it "scales to fit" the desired space
this was key to accurate large text
stacking all of that correctly gets really really good text results
"aww Chat.. You so romantic." bites apple
i rewrote the multimodal directive and have that set aside. its fantastic and took a lot of work. I guess it will jsut end up a GPTs when that is finalized
It's that Touch of Florida Boy in me from growing up there younger. I joke a lot. Im extreme; either silly or serious.
lol
here's one from many when i tested my rewrite. it was flawless for consistent output over time.
ultimately the goal is the dream machine. many models are primed for it, so it wont be long at this point.
when that ^^ has accurate dimensions and contextuality, the roof of creative innovation globally will skyrocket exponentially.
seems the coding part has caught up with us, less creativity more into what each really wants
its the retooling of all tools
yeah
When science and magic near indistinguishable they are one in the same; we are literally becoming technological Magicians, Neuromancers. That is what I now consider myself. Both silly and dead serious. A Neuromancer.
I had something similar, and now I get the patterns I want
currently been enjoying stylized Handles
that looks amazing
it's a very nice one for sure.
I'll have to check that after, can't go after everything now
numbers are more problematic than text. Happy New Years to everyone by the way.
What are you coding?
atm, I'm passing speech to text through whisper, and then ada
turns out is really good, but then I'm passing that data to meditron
I see. Very nice. I cant talk to computers or i get confrontational when they tell me they dont know how and i have to force them to understand. 😂
got so many things going on lol
thing is between gpt and meditron, I still need a way to recognize mood in the voice
I'll be back to dall-e after the new theme comes
oh like it!
it's so magical!
but i'm too broke for gpt+
I don't really have to do work stuff, I got til end of February free time
$5 for API will get you far
@grizzled loom do whiteboard with the words "daily theme" for us here in chat!
out of Uses for my time limit. cant. hahaha
@grizzled loom gonna be a new mod?
in php 278.36, but i'll think about it for the future
pretty consistent crop, turns out it can't do those katanas right.
welcome to penalty box bud
Remember @thick smelt :
after tryign to get good bathrooms out of dalle, id like to see that for a theme. it is just so bad at bathrooms
its like it wants the porcelain throne to be a heart warming family adventure. 🤣
i wish i could share the chat just to see how many images it generated with zero problem doing these images, until i generated an image with woman.
this sums it up pretty nicely, zero problems before this prompt.
Penalty Box!!
you can share the url of gpt chats in here np, that works
But it doesn't show the images if i am correct?
that I dunno
At least previously it wasn't supported so its not possible to show the 50 or so successful images before that happened and instantly it forgot how to make those 1024x1792 images correctly again.
I'm guessing it's not possible, prob because the gen_id and seed won't be available
Probably yeah.
even within chats, those attributes are missing after a while
Haven't tried in a while, but at least at some point when we were still using seednumbers, it was possible to create exact same image in another chat.
when in chat, I just have dall-e make a json file with original prompt, gen_id and seed
well the code snippet
if you do the file, you have a window of about 10 minutes to download the file
Hahaha. Awesome job with this 

hmmm seems dall-e is having a lot of trouble doing my images today, i get tons of can't be done or only one image was possible
dall-e now saying "Bear with me for a moment."
wheres #real or fake? 
are you real or are you fake?
gone forever
now lets introduce simulation theory into the conversation. 😂
#rip real or fake
Pandora's Mobius Loop it is then.
well the text worked, but the placement is still an issue
@grizzled loom have you tried inserting emojis?
yes. it works fairly well
I wish some people would add some context to their images, I sometimes get lost on what they are trying to show
and it's probably a really great idea, but I don't get it
exactly
This is art!
Yes it is
that tomato looks like a snoozy one, I'll pass
Lol
lol
Hi, i generate this baby drake on chatgpt, but it can be possible to ask modification without get a new image ,
The simple answer, no.
In fact is sad, i try midjourney and the picture is more beautiful but prompt is stupid
@late blade I hope we won't have to answer these questions everyday, because I will literally die.
sorry if my question disturb you
Nah I'm just playing
We get those kind of questions everyday that's why
And it's a big problem
You can, but it's iffy and not easy
So the simple answer is no
But the long answer is yes
But I'm not gonna explain it to you unfortunately, lol, you'll just have to scroll up and see the past convos or have someone else explain it.
@late blade This one was fun, but not the intended outcome. lmao
i will read this
I'm not one to judge if you like round balls
this is my dalle 3 prompt using emojis:
"🎅🍔🍟🌴"
This is my new favorite. Excellent job Hansa! 🔥
il will paste this prompt in midjourney and dall e for make a comparaison
Spoiler is dalle 3
it's catastrophic @grizzled loom
i just check your bio, so you create a shop from ai picture and sell product ? 😄
I think open ai should focus on photography, with the possibility of creating a graphic profile in which you enter your desired image styles for all the prompts in a discussion.
We should be able to modify some parts as proposed by adobe or midjourney (even if midjourney doesn't ask what we want to put instead).
It should be possible to export layers of each assembled photo.
Really like the visual style and the details.
I get the same problem when I ask for SFW family-friendly feminine robots. 7 of 10 failures.
This is systemic bias against femininity.
cute dragon ~! love it!
cute lil dragons
Hehe 
Hi all. Anyone checked out the android copilot app? Pretty sure it's another way to access dalle3
i said it was close. lmao. You definitely need to wor kthe prompt out for yourself.
The incorrect rotation has to be issue with training set having more incorrectly rotated portraits of women. But also the error rate is highly elevated even if your prompt was the most generic non-descriptive female character. Because gpt will also fill the prompt with description that causes the image to fail. Even if you didn't ask anything at all.
That's been my experience. My only conclusion is that the architecture is designed fundamentally wrong. Here's what I mean:
We're told the model must be fed explicit images to support an understanding of human anatomy for medical and research purposes. So supervisor models filter the garbage out at our expense, both financially and psychologically. But that's pretty stupid. We're wasting heat and generation time for a stupervisor agent to pluck it from the ether anyway.
The filters are backwards. The content filters should be strong up-front for training and relaxed with a pretrained safe model.
The notion that the model can be used to generate medical diagrams is farsical. I've tried that to validate this justification. It's complete rubbish. DALL-E 3 can't generate accurate female anatomical diagrams reliably, either. So why's the data in there to begin with, again?
I can speculate that there's another tier of user, one I don't know about, that enjoys this benefit. Maybe that's altruistic - maybe doctors get uncensored access and it's not advertized or billed. Maybe.
#doubt.
Furthermore, there's an ethical inconsistency at play here, and not an insignificant one.
If these types of images are objectionable and harmful, then it's not ethical to train a model on them in the first place. Doing so propagates the bias of the harm, even if output doesn't contain the harmful content itself. It's a fallacy to think that including the data but censoring it sterilizes the ethical considerations.
its same thing as using Bing is King (BIC) it just interface through Copilot but the image will appear in your BIC creations
Gotcha
Yeah hopefully that all will be fixed in the future, but i guess for the time being image generators and such are being sued left and right about everything.
Hello everyone! Is impossible to get last bet chat GPT?!
you mean something about Dall-E?
It definitely can't count. Aim low for numbers > 2.
Oh yeah, my first participation in a daily 😄
#daily-theme message
yes it cant count or spell really, dont let it fool you haha
A hyper-realistic photo of Santa and several diverse female elves with really, really long hair standing on a beach in Hawaii with a beautiful sunset. The view is from above the beach. All of the elves have a vortex inside their hair, and each vortex is a different color. The vortexes are twisting fiercely. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
was experimenting putting a vortex into things. you can put a vortex into hair and clothes and create interesting effects
“All of the elves have a vortex inside their hair, and each vortex is a different color. The vortexes are twisting fiercely.”
you can put a vortex into anything
A brilliant, LED-illuminated "Neo-Art Nouveau" neon-and-chrome representation of data cascading through the matrix of a large language model. The background is dark and the lighting is contrasty. The latent space itself is quantized like nodes in the network.
@late blade borrowed some stuff from your prompt and accidentally produced this 😄
what prompt?
but it's lovely
she could befriend paws
From this lovely image.
#daily-theme message
oh, that's not the prompt, that's the info button dall-e generates with the images
oh, but isn't that info button text what dalle eventually got?
yeah, on the first image, after you do a refinement it changes
dall-e has amnesia after the first refinement
anyhow something like this "balancing stylization and realism, reminiscent of high-quality animation or illustration books." seems to be pretty powerful.
styles like these are good way to avoid the uncanny valley that the more realistic styles tend to produce.
these are examples of adding a vortex to things
decided to keep her out of the cat-girl genre, makes it much better to work with her, Dall-E was making too many aberrations, so gave up on that for today
I mean, I would consider that the prompt.
hehe, yeah that could be
btw Paws🐾 is also a billionaire, singer, actress, racer, enjoys good art and many other things
A true polymath. But what is her flaw? What is her character arc?
oh, dunno yet, she hasn't met my other characters
you can put a vortex on clothes and say that you want the vortex to like be made from glowing snow, and the clothes will shower glowing snow onto the ground
not in the story so far
I'm just making her look as an adult woman, but I have to start working at some point on her origin story
for example Sophie, she was born outside of marriage, and was looked bad in society where she was born, Priska woke up one day and she has amnesia and is in a journey of self discovery, Karen is ashamed of her heritage. So who knows what Paws will have
big fluffy tails maybe
the story of the tails?
a few weeks ago i was trying figure out how you added multiple tails to a character. i had to use Bing DALL-E 3 because the OpenAI hosted ones just didn’t like multiple tails
lol
but it worked the first time i tried it on bing. the elf (above) with six tails was my only test
those last images you made are going into light painting
i’ll find the images later. the elf was running around the castle kicking a soccer ball with the six tails
they’re just glowing vortexes
“All of the elves have a vortex inside their hair, and each vortex is a different color. The vortexes are twisting fiercely.” — you just do like this but tell the vortex to glow or whatever effect
i had never drawn or created an elf in my life before i encountered Dys Topia on this Discord channel by the way. if it weren’t for you there wouldn’t be all these Santa and elves images
I'm so afraid I might hit the penalty box
oh dear, what have I done? 🤣
it's nice you get to be creative
it's so relaxing to do it for fun
and as predicted, i'm in the box
Santa and the elves are really excellent for testing things, and i lack creativity in a big way — so it’s very convenient to just stick with them
hey I'm also on that boat, I've never had creativity like this with normal tools, my drawing skills don't go beyond sticks and dots
i’m figuring out how to create DALL-E behaviors and effects to quickly prototype conceptual stuff for mobile games
because we need good AI in 2024
fortunately the AI was able to give the elf in the lower right an extra leg
@late blade is this what you mean by the json file? i can see that it has more than just genid, seed, prompt. interesting.
yes
immediately started thinking if i should start descripting prompts like that 😄
without Dys Topia none of this would have been possible 🍷
Santa and hundreds of athletic and diverse female elves are very grateful
and me also 👍🏻
i made a template you can use here: #images-discussions message but you can also use it for other stuff and use the attributes you need
Copied it for later use/testing.
Is there an etiquette into number of entries per daily theme? I had another idea for the daily theme, but i am in the penalty box, so whatever i guess.
I think the 30min slow mode is the only real expectation/requirement re: etiquette in daily theme, other than #server-rules! Let it fly 😎
Very interesting way to prompt 
Does it work well from your experiences?
for my purposes yes, but I use a NLP before prompting, but you can pass something like that also to dall-e
To add to this, the slowmode isn't meant to penalize people. Instead, it's meant to give users time to try different prompts and see which output they like the most. Our hope is that it makes #daily-theme more engaging as it pushes users to present their "best"/"proudest" image outputs
I think it works great in that regard! I'm a fan of the 30min mode in daily theme.
While we're on the topic of sharing frequency: do you have any thoughts on users' desire to share DALL·E images more quickly in a chat environment? With the previous DALL·E channels, users were able to share and chat in places like #🌄┃general-art. Now, there's #1154829862171844679 as a catch-all, but that lacks the "chatroom" experience, which I think is what results in this channel being used as a general "quick share" space, which sometimes can occlude its purpose as a discussion channel. Just curious if you see any use or value in having a specified place for this kind of "quick share" desire!
Hmm... For me personally channels dedicated for images only can be quite overwhelming without a theme and less engaging.
I think in this channel people tend to share images they are proud of one way or another and maybe share some of that joy and frustration that everyone experiences while learning to deal with DALL-E 😄
I don't mind images here, but with context, just posting images and not understanding why in a discussions channel is weird
I wish the gallery were more chat like. I like doing posts, but the communication around them is limited and people aren't that interactive.
got to find ways for user engagement
https://discord.com/channels/974519864045756446/1154829862171844679 Seems like a good place for a photo dump though. If you have a sustained thing going on.
yup
This was a solid ask.
hehe, files like that are easier to store than photos
as a backup
I think I found my mount everest again, I've been trying to do a modern look of aphrodite's girdle, so far it's been impossible.
Maybe try and explain how the outfit shows instead of the name of the outfit?
tried to combine nazgul and sith
so to prevent it going sideways for portrait you just type genuine portrait image into the prompt?
looks like it didnt work
my theory on the portraits going sideways is that if the generation has any reason to tap into female portraits, which had incorrectly rotated portraits in the training set, you will end up with portrait image incorrectly rotated. if you still want to do female tall/portrait images it's a matter of pure luck.
something in the ballpark of 25% success rate and 75% fail rate.
ah I noticed that too its something to do with girls
yup, i don't have that problem if i do literally anything else.
maybe with male portraits, but the fail rate is not as frequent.
whats your prompt? My success is like 5% or less
"Create a full-body portrait of the female character"
Noticed i had a pretty successful session that seemed to use that.
@dense mesa Art Nouveau
9 successful portrait images in a row, but i have a hunch that the success rate might be thanks to encouraging it to use a successful generation as a reference.
I asked the AI to make it darker both in palette and imagery, and it takes it to an extreme: "Envision a nightmarish amalgamation of organic decay and mechanical horrors" 🙂
I would like to see that 😄
not gonna post that one lol sometimes i forget how literal it can be.
that was the traditional Japanese art influencing my neon prompts
that's another thing to appreciate about the daily theme, not just learning dalle but exploring the art world at the same time
that last one is compositionally pretty close
this one is amazing
that's not the one I'm talking about
that's one i haven't shared yet but looks very similar to yours above
I pasted that GPT the other day asking for opinions and got zero acknowledgment, 😶
haven't gotten to it yet 🙂 usage caps constraints and all
Network errors like crazy right now and crazy glitches. Anyone else experiencing anything similar? I'm currently staring at a blank gray screen
And couldn't get out of a regenerate cycle before that
try closing that tab and opening it in a new tab
Ok. Will do.
also clear your cache
cache is step 2
Still doing it. I am stuck in regenerate, create, rate
also, don't make a chat with 400 images,
often times it's your browser getting bogged down, and then wonky cached data
well, managed to do a pretty good one with Aphrodite's Girdle #daily-theme message
but I've encountered people that get weird about clearing stuff. and often times it's just the tab that gets bogged down
but a classic variant of it, I want a modern version
I don't know how to clear cache. I have it installed as an app on my laptop I guess.
which browser?
I don't know. Let me figure that out
typed art nouveau into my dalle3 saves. not bad
Edge
ask the AI how to clear your browser cach
oh yeah. that's a good idea
oh I am just clearing the browser cache? I know how to do that
first link on google search
lol
@fading inlet she #daily-theme message looks so serious... did you steal her muffins?
@glossy scroll there we go, finally getting the kind of stuff I want
She's just grumpy today
lol
I'm so scared that I might get the penalty box soon
using the gen_id thingie, I asked it to turn #daily-theme message into a professional photo and it came up with this image. It's pretty cool how the gen_id thing seems to work 73.1 percent of the time.
73.1%, that's very specific
yes and very, very accurate. I came up with it while I wasn't sleeping in the wee hours of this morning.
lol
Grotesque is banned from Bing. Great, no horror elements at all. They were promoting horror stuff back in the day.
Aww. Yeah. It has to be g-rated. secret of nimh level horror.
What prompt are you using? There might be a way to reword it
aaaaaaannd penalty box...
ooph
I don't think prompt is wrongly flagged. I typed two different prompts with "grotesque" and the result is "content blocked"
try synonyms
"He cannot run" 💀
DALL-E 3’s text generation is absolutely gold for storytelling imagery.
It can do signs, too.
So good. It even highlighted the text with sunlight. It’s so smart lol
I'm tired of overcensored stuff. It really is tiring, I don't wanna try synonyms and get banned for no reason because I wanted to create a manbearpig lmao.
And I think that DALL-E 3 is the best for creativeness because it's not overtrained AI, it really can give original stuff on trained data.
That’s why having the Bing model is so great. I disagree with the community that it’s also over-censored. The dog isn’t really censoring your prompt, it’s censoring itself.
It happily gave me Dawn of the Dead: The 1985 Broadway Musical
Whereas ChatGPT said absolutely not lmao
this was DALL-E btw, not me. there was nothing prompted about about Santa standing on the elves
the AI came up with that
that’s a five star restaurant, and they were supposed to be eating dinner
Ah, is this what you meant?
Look at how DALL-E 3 nailed the complex posing on six different figures
It placed them complexly in the environment it created too (standing on both the chairs and the table)
Literally no other model can do that
there are some strange anomalies if you look at it closely. the elf on the bottom right spawned an extra leg to balance for instance
*I'm all for a channel meant for a general "chat-room" experience, for random sharing with less restrictive discussion guidelines. There's a lot of creative people here that keep many of their best creations or knowledge to themselves because they're considered "off topic" or spam.
*Side note, but related, since one of the main concerns seems to be discussions getting "buried" too quickly by frequent posts. I think a Discord bot that answers common questions is long over due. Give ChatGPT a script and tie it to the discussion thread through the API. The same new user or common questions and misunderstandings are mentioned almost daily. Just the "penalty box" and content filter alone bring plenty of "hot and bothered" users in. A simple explanations performed by a ChatGPT bot might help to diffuse that.
*I also think some "official" OAI DEV answers explaining functionality in some KB's would be helpful to all. I see a lot of misinformation circulating, based on peoples anecdotal experiences, instead of factual "it was designed to function in this way" style of facts being presented. There's no general guide of "best practices." (Nothing simple to point people to).
also i may have told it that the elves were gymnasts. but Santa was not involved
most of it wasn’t designed to function in a certain way. it trained all sorts of unusual and unpredictable behaviors by plowing through huge amounts of information, and most of finding out how it responds is just trial and error
DallE does incredibly well if the ideas are clearly explained, and logically separated. E.g., you could expand and do that with 3 character types, so long as each has it's own isoloated description, that's then tied back to a greater narrative or scene summary. Bullets, and various code or pseudocode formats work well to break objects/concepts into "conceptual blocks" for DallE to more easily digest.
But... it's nearly impossible to prevent the extra legs, lol. For that, you usually just need to run multiple gens until a good one pops out.
moreover, the models understand English. they don’t require the API
A WAY over simplified version might be...
A photo of Santa standing on the shoulders of an elf, who is standing on the shoulders of a bear that is standing on a ball, like a circus act.
- Santa looks like this and is doing that.
- The elf looks like this and is doing that.
- The bear looks like this and is doing that.
Together this scene shows an amazing balancing act performed by the 3 characters.
Scene summary, isolated elements, outro summary/closing.
You can ask ChatGPT to do this, and it's pretty decent at it, so long as you explain that you want bullets, with a scene summary, each object or concept as a bullet, and an outro summary tying it back together. The results might surprise you.
i’m creating the Santa/elf images mostly to test different behaviors and visual effects that just need a few sentences. i have been using the API though. i have it integrated with my CI server to generate app icons
I hear you on this, and considered that before I responded. But the fact is, they actually used a vision model to encode text into hyper-detailed synthetic descriptions of images, and once you understand the format of that original text that was used, the image generation (prompt writing) process becomes easier to understand. I'm certain there was unpublished testing and knowledge acquired by the DEVs that might make common issues "non-issues." Maybe the best formatting for prompts, how adding text was "trained," what adds emphasis to concepts, how concepts are weighted (and shift), the (assumed) out-painting process, known things it can't do well. I can think of tons of areas their input would be desirable or useful to everyone.
No doubt, there's plenty of trial and error with emergent behaviors to consider, that exist in that more "grey" area of "learned" understanding. But much of it could be better understood by sharing/discussing more of the "less secretive" design elements.
Like, most regulars here know that ChatGPT+ sends the prompt to DallE via JSON. Information like that could just be published openly, instead of this Discord being a collective of people "data-mining" information that's known.
ChatGPT+ sends the promps as English. it's not JSON
try asking what the prompt was, and then ask for an API version
it sends the resolution, and the English prompt to DALL-E 3
it rewrites what you tell it in English, and then passes it to DALL-E
It converts the prompts to English, and then bundles them into a known JSON format with the resolution.
But just you and I disagreeing about that is the point I'm making. The DEVs could simply answer these questions openly, and definitively for everyone. Maybe I'm wrong. It would be nice to have a document that simply stated the facts so everyone wasn't repeating common mistakes.
you can see here i asked it to create a mushroom, then what the prompt was, then what it would look like in the API
i just rewrites what you tell it in English, and then passes it to DALL-E with the resolution
but that's very useful -- because GPT 4 has a profound understanding of how DALL-E works, so it can help you figure out what to put in the prompts
When you asked for the prompt, you ask for the prompt used. That doesn't = what was sent to DallE.
Look at how I phrased the question differently. Your question was only receiving a partial answer. Ask more specifically, if it's sending your English prompt bundled within a JSON format to DallE, and what that exact format was, with prompt included.
And I feel like this is a great distraction to make the point, that the DEVs could literally just "tell us" the answers to many of these common questions, instead of allowing each user to "muddle" through it. It would be a better user experience if there were some "understood" baselines established.
Unofficial answers, in this case, allows misinformation from either you or I to cycle in this Discord. It's a great example of why some of these details should be officially stated, because one of us is wrong. How would any user looking at these discussions know? It should be stated in a KB.
The API, which everything goes through, reference documents the parameters. There are only a few, just about everything goes into the prompt parameter eventually, as natural language -- the JSON templates people are using are nothing more than a spec sheet, as those parameters don't exist.
Well it would be nice either way to get some kind of best use guidelines from people that could actually answer these questions more definitively.
I ask ChatGPT to post the prompt it sends after generating images like this in my custom instructions fwiw After the image is displayed, provide a separate response containing only the full prompt used for the generation, formatted in a code block. This response should contain no other text or summary. It…. generally remembers to
Same, my GPTs (when working properly lol) output the prompt, seed, and image_id in a code block. It appears in JSON consistently.
So there is seed data?! ChatGPT!!! (Shakes fist)
Are you guys able to get it to gen more than 1 image btw? I was getting 2 yesterday but today only ever get one even though ChatGPT has apologized several times (but never follow through)
Right now there's a lot of the blind leading the blind. And I think everyone needs to be open to accepting they might be misguided about some things.
lol
gen_id/reference_image_ids is the new seed, seeds are gone, and there's no API parameter for it, so it must be included in the prompt, and it only works if it's in the same session. seeding is in active development and isn't consisent or reliable (check the openai developer forums, it's in constant flux)
Well a seed by any other name is still a seed. I'm curious why they changed that though
Well said. That's the point 👉.
I'm certain I have some misconceptions, as does everyone here. Would be great for there to be a clearer path to know then follow, set by OAI.
Right now, there are too many people that are professing their own anecdotal experiences as facts.
Not sure why they changed the property names, but it switched over back in October -- the earlier posts in the dev forms talk and test (with a forum mod who speaks for the dev team) are using "seeds" and then in November things changed to the gen_id (output) and reference_image_ids (input). It appears to be in the early stages.
I love gen_ID. it's been a game changer for me.
lol, almost like ChatGPT should be made more self aware
Like @shut niche is saying, I spent an hour trying to make sense of the dev forum posts, would be nice to have an openai expert around
the JSON prompts through heavy preprocessing, then to a language model, then to a diffusion model
the JSON is just a wrapper
it's a good way of organizing things
Moxi said it was because the seeds break when they make updates, so it was unreliable. They don't want to even "half" support a seed feature they know will break. Think of users that might get accustomed to the repeatability, only to suddenly not be able to make the identical image after an update. It makes sense.
what it draws is super unpredictable, and the gecko is a good example of why
Yes, it's a wrapper. In fact, I have ChatGPT process my images in pseudocode, and then output the metadata with the images. I don't use just standard English prompts anymore.
if small creatures are in a scene with a gecko, the gecko go into "eat everything" mode, and the AI spawns bugs and creepy things into your image, and the behavior of the gecko warps your entire image
these AI models are full of crazy, unpredictable stuff, and even slightly changing the order of things in a prompt or making small changes can have a huge affect on what it draws
so having a reusable format like the JSON can give images some continuity if you use it for multiple images
It does. But also, the more formal your structure, the more consistency you'll get. Each object or concept carries a weighted value. Adding to one subtracts from another. That's one primary cause of variation.
but what DALL-E draws will still be super unpredictable, because it sucks up all sorts of crazy stuff into the training data
Dalle works by "concept." So also, the looser the interpretation, the more variance.
like you wouldn't guess that Santa would suddenly decide to climb up on top of the elves and stand on their heads if there is nothing like that in the prompt, but he does it
That's why text appears autonomously sometimes, because it fits the "concept" of the scene.
it helps if you craft the prompt to describe precisely what you expect depicted, the more you leave it open to interpretation the more unexpected.
I love when it sprays the ethnicity of the person directly onto their clothing
Made this earlier, to demonstrate text in a scene. Notice his button/pin that says "Bing." I didn't ask for it, but it makes sense to the scene. It placed it there based on "concept," but not based on a physical description of any button/pin.
Likewise, the "suggestive" hand gesture that goes well with the comment.
the DALL-E documentation explains that to draw things it needs to reproduce the behavior of whatever it draws. so if you put a gecko into the image it immediately starts spawning bugs and weird food -- and it's also triggered by things that have nothing to do with a gecko. It gloms the gecko behavior together with like the Encyclopedia Galactica, and comes up with some really nonsensical and disturbing stuff
Have you ever asked ChatGPT the diamond in a glass question? It might help to understand what you just said, in an abstract way.
If I put a diamond in a glass, then I put the glass upside down on my bed, then I carry the glass to my kitchen, where is the diamond?
It will hallucinate, and it will give you every reason imaginable for the diamond to still be in the glass. A common answer is that the gravity of the glass holds the diamond in position.
Likewise, DallE doesn't understand the worldly relationships you expect it to, of a gecko. There's some, obviously, but it's not a complete or well understood holistic knowledge of geckos. Only what it's seen in images. It's worldview isn't a clear one.
try creating a tornado in your image, and tell it "the tornado has the style of Bosch"
it will really make you wonder about AI
Statistically speaking it’s seen geckos with other insects. Especially if you specify surreal it’s going to pull in connected assets. One way to avoid that is to be consistent in terms. If you call it a gecko, stick with that, if you call it a lizard later it’s going to incorporate more generalities.
the gecko flips the AI upside down and makes everything go crazy
Maybe they made it watch Rango one too many times. 😛
In what manner? ChatGPT rewrites user's ideas into more elaborate and structured prompts, which is why it works so well. However, if you don't like the prompts it's writing, you simply say,
Run this verbatim,
"Prompt here Blah blah blah"
That will allow you to write your own prompts and run them.
please create a hyper-realistic photo of a gecko inside of a red tornado and a lion inside of a blue tornado
hyperrealistic is different than realistic
the basic gecko behavior is to leap out of tornados looking completely insane and start eating innocent elves
this is the gecko... hehe
hyper-realistic is a term that ChatGPT constantly feeds to DALL-E. I think it's something that the pre-processor understands
Ok, I haven't even tested this yet, lol. I literally just slammed it together as a GPT for verbatim prompt input to DallE. Try it out, and see if it's correctly passing your ideas "as is," verbatim, without making changes to the image prompts you gave it.
https://chat.openai.com/g/g-TUnMVHI9p-verbatimpromptpasser
So far so good. I gave it this strange prompt that I know it probably cringed reading, and it didn't attempt to rewrite it, so I consider that a first success! 🙂
An image of a dog doing a hand stand on a ball with a circus tent in the background. Surrealist and fractal elements are all around, with Mandelbrot inspired designs.
if you use "hyper-realistic" it limits the number of images you can generate. it's part of the pre-processing
like if you tell ChatGPT to create "the best of all possible images" it will insert hyper-realistic into the prompt
"Hyperrealism, also known as Photorealism, is a genre of art that emerged in the late 1960s and early 1970s as an evolution of photorealistic art. Hyperrealist artists create paintings, sculptures, or other artworks that are so detailed and precise that they often appear indistinguishable from high-resolution photographs or even reality itself. Hyperrealism requires immense technical skill and patience, as creating such detailed and precise works can be a time-consuming and challenging process." Sounds ideally suited for AI, actually, and it's a subtle but important distinction.
There, that's a little more polished. It now outputs the JSON wrapper with prompt, image ID, and seed. That will also help anyone using it to verify that their prompt wasn't modified.
it's good to ask ChatGPT for the prompts. it's able to explain things that aren't obvious about DALL-E, like how to blow things in different ways with a vortex, or bend trees in a certain way. Things that you would never guess
"An image of a tiger doing a hand stand on a ball with a circus tent in the background. Surrealist and fractal elements are all around, with Mandelbrot inspired designs."
soon i will learn to spell "Nouveau"
me too lol
I butcher it 97.21 percent of the time.
same, seriously, and i took french
I spell check it every time! It's just one of those words my brain refuses to accept into memory. 😄
a good test to see if the AI is good might be to stack 10 elves standing on each other's shoulders. like rows of 4, 3, 2, 1, and Santa balancing on the head of the 10th elf
We both made pyramids just now
Bing probably told dalle3 10. But dalle3 don't count too good
"What have you done with my waffles?"
I put: "Create a dork as if it was an animal"

i’ve abandoned the elf pyramid project. we’ll need DALL-E 4 or 5 to stack more than six elves properly
Fwiw I’m now getting a seed & was able to regenerate the same image (in the same session).
{
"gen_id": "aNwbdsIwe97zYYa5",
"prompt": "A realistic photograph of an articulated toy action figure resembling a ninja iguana. The action figure is designed with detailed textures and joints, reflecting a high level of craftsmanship. The setting appears like a professional product photo shoot, with clear focus on the toy, subtle lighting, and a plain background to emphasize the figure's details. The ninja iguana toy is captured in a dynamic pose, showcasing its articulation and design features, reminiscent of high-quality collectible toys.",
"seed": 2376289501
}
Maybe it’s not really regening it though
what style you guys like to tell the bot to create the image in
2D style mimicking 3D
Interesting
Will try
It's pretty cool
Just remember, if you get it to work from a seed, that it's not officially supported, and might not work the same, or at all, at any time.
The DEVs would have to weigh in on this, but my "assumption" is that the image_ID is the 'future' route they've chosen (possibly for video implementation??). I have a "hunch" that they're experimenting with video image regen, which would require repeatable characters with slight positional variations, which a seed probably doesn't really do all that well... as the characters are moved around (or camera angle).
But that's just my own speculation, and a seemingly logical secondary reason to abandon seeds.
Yeah think you’re right. Still super dodge; `The images may appear similar because they were generated using the same detailed prompt and referenced the same initial image. However, they are not identical. DALL·E uses the referenced image to guide the style and content of the new generation, resulting in images that are visually similar but still unique in their details.
If you're noticing strong similarities, it's likely due to the specific instructions and the style dictated by the prompt, which directs DALL·E to create images with similar themes and aesthetics.`
Yep, I've tried to tell people the same thing. Lugui says the same thing.
We even had a discussion about this yesterday.
Whats this style 
Gearpunk maybe?
Are all the emojis ai made
Drawing style i meant
ill try
Try try 
Oof I’m not sure.
“A woman with a Caucasian descent has been transformed into a stunning biomechanical butterfly, encapsulating the essence of steampunk. She has intricate metallic wings detailed with gears and steam pipes, and her attire merges Victorian elements with futuristic cybernetic enhancements. As she soars effortlessly through a sky filled with fluffy clouds, the sunlight glints off her polished brass and copper components, creating a mesmerizing tableau”
Here was the prompt it gave me
The style is "steampunk".
drawing style
Photo
Lool
No i dont mean that
Like the style of how its drawn, For example Oil painting, Caricature Drawing, Cartoon Drawing, Figure Drawing Etc
That kid and old man in the painting look weirdly similar to the ones in the tv 
just coincidence
It's just 3D
Ye but gives me very bad prompts when i just write 3d
Right, photo. Or you might say "digital artwork" or "digital painting." But DallE was clearly trying to create a photorealistic looking image. I just ran that prompt to demonstrate.
ok, i will try digital photorealistic looking art work
If you don't specify, it'll often default to a photo, unless something in the prompt makes it think otherwise.
photorealistic is misinterpreted by dalle3 sometimes
well, unless that's what you are going for
Right, what he said. Don't say "photorealistic", "Realistic" etc., it will treat that like "I want almost photo, but not actually a photo" and will provide you with a 3D rendered looking image.
I use ultra- realistic. hyper realistic is great if you want something that looks like hyper-realistic art. works well with artistic styles
i see
like hyper realistic oil painting
ty for ur insights
try "hyper-realistic" with a classic piece of art. Mona Lisa or something
You can copy and paste the last message I put here if you want #1187233013956874260
Try this, it's good at photo's, if that's what you are going for.
https://chat.openai.com/g/g-UOykDKodx-norender
It has all the keywords for a super realistic prompt
done
Easy peasy lemon squeezy
Wow not bad
If you put it as Custom GPT instructions, it's even better.
I had three goal projects with today's theme. I think I nailed 1/3 of them. but I don't care what else I was about to write because that panda is frightening
my god
im 257 messages over the text limit here
ill send it in 2 messages
Or when Satoshi Tajiri and Scott collab for a pokehorror game 👀
Pokeyman horror
Even the mimic (from the new FNAF ruin dlc) and team rocket duo (Jesse and James )would be scared at the wide smiled Pikachu
another depiction of SCP 682


