#images-discussions


empty kelp
#

ChatGPT 4o seems to know a lot about structuring DALL-E prompts so things can be edited

dim cradle
#

that edit feature has been active for some months, unless i misunderstand you

kind grail
#

Btw did i understand correctly that GPT-4o can keep an image output consistent?

dim cradle
#

supposed to, at least when they release the 4o image maker

#

or a subject

kind grail
#

Mhh. Consistency is a major issue with DallE tbh. I would be interested in making a comic/manga with AI, but DallE is sadly not suited for it yet

empty kelp
#

it was explaining to me how to set up prompts so characters, clothes, and other elements can be swapped out

dim cradle
#

maybe you got the 4o update Hawaiinz

empty kelp
#

I don't think i got a DALL-E update, but GPT 4o was explaining to me how DALL-E 3 interprets the prompts, and how the edit feature doesn't have another internal representation of the images. it actually edits the english description

kind grail
#

But even if 4o is consistent i am not sure if a good comic is doable already.

The censoring in DallE-3 is just way too harsh for actual artistic expression and for topics more mature than children's stories

dim cradle
#

well one example they show is creation of a comic but yes it probably would not allow anything too crazy yet

empty kelp
#

it understands how to move characters between scenes and rotate them

dim cradle
#

nice. although it also changed her shirt

kind grail
#

Generally i worry that the censoring will actually get harsher, rather than more lenient. Especially with the recent sky drama it shows that OAI is not willing to actually depart from corpo safety over quality

empty kelp
#

you can swap out the hair, clothes, and other elements if you set them up in order

kind grail
empty kelp
#

i'm going to post a lot of it here in the next 15 mins

#

Dall-E 3 is the same i think, but GPT 4o is a lot smarter, and it has better reasoning skills for working with prompts

dim cradle
#

it does seem to me that even without 4o turned on yet, the 4o model makes somewhat better images with dalle, but i cannot tell (for myself) if it is just placebo haha

empty kelp
#

==========

Below is "Part 1" of my many questions to GPT 4o about how to structure DALL-E 3 prompts so that characters, clothes, hair, and other visual elements will appear visually consistent between images -- and will survive an image being edited. I'll create an image gallery in the OpenAI Discord to organize all of the information I collect and make it accessible to everyone. All of the information in this part is directly from ChatGPT 4o (except for my questions), and not everything has been tested yet.

=========
Does DALL-E hold onto an extra representation of the images after they are created that is referenced by the GenID, such as a high level vector representation of the image, or other formats?

kind grail
#

Working with DallE prompts be like:
"Female detected, prompt rejected"

empty kelp
#

How does the image edit feature work in the ChatGPT web interface if there is no other representation? It just edits the revised prompt in english?
.

#

.
Are there specific terms that DALL-E recognizes in the prompt such as color, texture, or light that correspond to the internal representation of the image?
.

#

Is there a hierarchical structure to the prompts, and an order of operations by which DALL-E reads them? For example, how do you structure the prompts so that you can change the clothes on a character, or the style of a scene? What order or hierarchical structure should all of the elements that you described be in?
.

#

.
Original Prompt (Scene 1 - Park):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. A wooden bench. No other people, animals, or additional objects in the scene.
Modified Prompt for Different Clothing:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. A wooden bench. No other people, animals, or additional objects in the scene.
Modified Prompt for Different Style:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a park, reading a book. The background is painted in an impressionist style with trees and a playground. Soft, diffused light. A wooden bench. No other people, animals, or additional objects in the scene.

#

Example Prompts with Structured Hierarchy

Scene 1 - Park:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Scene 2 - Beach:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is walking along a beach during sunset, holding her sandals in one hand. The waves are gently crashing in the background. Soft, golden light. No other people, animals, or additional objects in the scene.
Scene 3 - Coffee Shop:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting at a wooden table by the window in a cozy coffee shop, sipping coffee, with a laptop in front of her. Dim, warm lighting. No other people, animals, or additional objects in the scene.

#

Exactly what elements is it possible to change without changing other elements in a scene?
.

#

.
Example of Independent Changes

Original Prompt (Park Scene):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Changing Clothing:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is now wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Changing Action/Pose:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is now standing next to the bench, waving her hand. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.

#

Changing Lighting:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a park, reading a book. The background has trees and a playground. The scene is now lit by a warm, golden sunset. No other people, animals, or additional objects in the scene.
Adding Objects:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. There is now a bicycle next to the bench. The background has trees and a playground. Bright, natural light. No other people or animals in the scene.

#

Is there a specific grammar and syntax for DALL-E prompts to create scenes that can be edited?
.

#

...
Example Prompts

Original Scene (Park)
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Edited Scene (Changing Clothing)
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is now wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Edited Scene (Changing Environment)
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a bustling city street, reading a book. The background shows tall buildings and busy sidewalks. Bright, natural light. No other people, animals, or additional objects in the scene.

#

.
Example of Structured and Modular Prompts

Original Scene:
A young man with short black hair, wearing a red shirt and black pants, is standing in a modern kitchen. He is holding a cup of coffee and smiling. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.
Edited Scene (Changing Action):
A young man with short black hair, wearing a red shirt and black pants, is standing in a modern kitchen. He is now cooking at the stove, with a focused expression. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.
By following these guidelines, you can create structured prompts for DALL-E that allow for specific elements to be edited without altering other parts of the scene. This approach ensures clarity and precision in the generated images.
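One way to keep prompts modular like the examples above is to store each element as its own sentence and always join them in the same order. The sketch below is only an illustration: the `ScenePrompt` class and its slot names are made up for this example and are not anything DALL-E or ChatGPT defines. It only builds the prompt text, and whether the generated image actually stays consistent still has to be tested.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class ScenePrompt:
    """Hypothetical container: one sentence per visual element, in a fixed order."""
    character: str
    clothing: str
    marks: str
    action: str
    background: str
    lighting: str
    exclusions: str

    def render(self) -> str:
        # Always join the slots in the same order so that editing one slot
        # never moves or rewrites the sentences around it.
        parts = [self.character, self.clothing, self.marks, self.action,
                 self.background, self.lighting, self.exclusions]
        return " ".join(parts)


park = ScenePrompt(
    character="A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion.",
    clothing="She is wearing a green dress with floral patterns and brown boots.",
    marks="She has a small star-shaped tattoo on her right wrist and a friendly smile.",
    action="She is sitting on a wooden bench in a sunny park, reading a book.",
    background="The background has trees and a playground.",
    lighting="Bright, natural light.",
    exclusions="No other people, animals, or additional objects in the scene.",
)

# Swap only the clothing sentence; every other sentence stays word for word the same.
new_outfit = replace(park, clothing="She is now wearing blue jeans and a white t-shirt.")
print(new_outfit.render())
```

Because the dataclass is frozen, `park` is never modified, so the original prompt stays available for the next variation.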

#

.
How do you encapsulate elements in the prompts so they're self-contained and won't be affected by changes that happen before or after them?
.

#

Example Prompts with Encapsulated Elements
.
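Original Scene (Park)

```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```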
#

.
Editing Prompts Independently

When making changes, ensure each section is self-contained and explicitly detailed to avoid unintended changes to other parts.

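Changing Clothing

```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is now wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```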
**Changing Action/Pose**
```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is now standing next to the bench, waving her hand. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```
**Changing Environment/Background**
```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is sitting on a wooden bench in a bustling city street, reading a book. The background shows tall buildings and busy sidewalks. Bright, natural light. No other people, animals, or additional objects in the scene.```
#

.
Example of a Highly Structured and Encapsulated Prompt

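Original Scene:

```A young man with short black hair, wearing a red shirt and black pants. He has a small star-shaped tattoo on his right wrist and a friendly smile.

He is standing in a modern kitchen. He is holding a cup of coffee. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.```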
**Edited Scene (Changing Action):**
```A young man with short black hair, wearing a red shirt and black pants. He has a small star-shaped tattoo on his right wrist and a friendly smile.

He is standing in a modern kitchen. He is now cooking at the stove, with a focused expression. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.```
By following these practices, you can create prompts that clearly and independently describe each element of the scene, making it easier to edit specific parts without affecting others.
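Before sending an edited prompt, it can help to check that only the intended sentence changed. The sketch below does that with a naive sentence split. The helper names `split_sentences` and `changed_sentences` are made up for this example, and the split only works for prompts written as plain sentences like the kitchen example above.

```python
import re


def split_sentences(prompt: str) -> list[str]:
    # Naive split after ., ! or ? followed by whitespace (including blank lines).
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", prompt.strip()) if s.strip()]


def changed_sentences(original: str, edited: str) -> tuple[list[str], list[str]]:
    """Return (removed, added) sentences between two prompts."""
    before, after = split_sentences(original), split_sentences(edited)
    removed = [s for s in before if s not in after]
    added = [s for s in after if s not in before]
    return removed, added


character = ("A young man with short black hair, wearing a red shirt and black pants. "
             "He has a small star-shaped tattoo on his right wrist and a friendly smile.")
kitchen = ("The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. "
           "Bright, natural light is coming from a large window. "
           "No other people or objects are in the scene.")

original = (character + "\n\n"
            + "He is standing in a modern kitchen. He is holding a cup of coffee. " + kitchen)
edited = (character + "\n\n"
          + "He is standing in a modern kitchen. "
          + "He is now cooking at the stove, with a focused expression. " + kitchen)

removed, added = changed_sentences(original, edited)
print("removed:", removed)
print("added:", added)
```

If more than one sentence shows up in either list, the edit touched something it was not supposed to.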
#

.
Is everything in a prompt translated to English (if it isn't in English already) before being interpreted?
.

#

.
How does the system handle things that overlap? For instance, if a character is wearing a long sleeve shirt and you say there is a tattoo on their arm -- but the shirt is already covering where the tattoo will be?
.

#

.
Example

Conflict Scenario
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt and has a small star-shaped tattoo on her right wrist.
In this scenario, it's ambiguous whether the tattoo should be visible given that her wrist is covered by the long sleeve shirt.

Clarified Prompts

Making the Tattoo Visible:
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt with the sleeves rolled up to her elbows, revealing a small star-shaped tattoo on her right wrist.
Tattoo Covered by Shirt:
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt that covers her arms. She has a small star-shaped tattoo on her right wrist, which is not visible due to the sleeves.

#

Additional Examples with Overlapping Elements

Glasses and Hair
Ambiguous Prompt:
A young man with short black hair and glasses. His hair is styled in a way that it covers his eyes.
Clarified Prompts:
(Glasses Visible)
A young man with short black hair, wearing glasses that sit firmly on his nose, with his hair styled neatly to avoid covering his eyes.
(Hair Covering Glasses)
A young man with short black hair, styled messily so that it covers his eyes, obscuring his glasses.

#

.
Example with Structured Details
.
Original Scene:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt and has a small star-shaped tattoo on her right wrist. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Clarified Prompt (Tattoo Visible):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt with the sleeves rolled up to her elbows, revealing a small star-shaped tattoo on her right wrist. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
By explicitly clarifying how elements should be handled when they overlap, you can guide DALL-E to generate images that accurately reflect your intent.

#

How is it possible to view a character facing in different directions? What if you wanted to have the same character turning 30 degrees in each successively generated image so you could view them from all sides? How can this be done without overlaps and conflicts?
.

empty kelp
#

==========

Above is "Part 1" of my many questions to GPT 4o about how to structure DALL-E 3 prompts so that characters, clothes, hair, and other visual elements will appear visually consistent between images -- and will survive an image being edited. I'll create an image gallery in the OpenAI Discord to organize all of the information I collect and make it accessible to everyone. All of the information in this part is directly from ChatGPT 4o (except for my questions), and not everything has been tested yet.

=========

velvet rampart
#

So, my adage is valid - ask the AI for instructions for things you want to do.

earnest flame
#

I find that there are certain description tendencies that chatgpt wants to do with dall-e inputs. E.g. it wants to talk a lot about non-visual elements of emphasis. I'm a control freak so I prefer if non-visual details weren't present

late blade
#

was trying to do time-dependent variability on a series of images

storm spoke
#

why does chatgpt automatically use 4o without you wanting it? i never asked for gpt 4o to be in the chat, but the update forced it and now i can't switch to 3.5

#

it's like now i need to wait 4 hours just to check 10 times and wait again

#

why this update

#

why can't it just let us choose whether we want it or not, instead of chatgpt assuming we want to use the new ai

warm compass
#

Does anyone know how to upload datasets to DALL-E so it can provide an advanced visual representation of data?

storm spoke
#

is everyone in here an ai, or do you not even know how to respond to something

#

v

empty kelp
# earnest flame These seem interesting! Are these instructions you're preprompting before hand? ...

Images are much more than visual. The AI has seen boats sailing under beautiful sunsets, and drawings of magnificent architecture -- but the AI has also seen the behavior of humans, animals, and everything else throughout history. The AI has seen ancient cave paintings with dragons roasting villagers and eating them. Images of cats eating dogs, and dogs eating cats... and witches turning people into frogs. The AI has even seen images of people doing a bunch of NSFW stuff from the dark corners of the internet

storm spoke
#

Are you okay @empty kelp why do you take so long to text

empty kelp
#

AI diffusion models that draw the images try to recreate what they saw, and that includes the behavior that the images captured. It also tries to reproduce other elements that were in a scene

#

You can test it by creating a DALL-E image that has small elves and a large gecko. You'll notice two things in the image right away:
-- In your image the gecko will be trying to eat the elves.
-- Flies and all sorts of weird bugs will appear in the image.

#

The diffusion engine tries to recreate what it saw in its training, and that includes geckos eating small creatures, and bugs flying around.

#

I'll explain why that's a problem. Behavior in the images makes them very hard to edit...

#

It's not just geckos that inherit the behavior. Pretty much everything does to some degree

#

The diffusion model expects things to look a certain way based on what it saw in its training, and so you sort of need to take a "go with the flow" type approach to the prompts, or it will conflict with what the AI model expects -- and that causes total havoc in the images. Like really weird stuff

#

Elements in the prompt need to be ordered so that they don't conflict with each other, and you need to make some concessions in the edits that you make so it will fit in with the vision of the diffusion model. So it needs a combination of carefully ordering the prompt, some trial and error to figure out what fits in with the understanding of the model, and some luck so it doesn't take too many attempts

#

A DALL-E scene isn't a collection of static objects. A transformer converts the English prompt into a high level vector representation for the diffusion model, and the vectors move the image in a direction that's similar to what the model saw in its training. And it does it by doing multiple iterations on the image -- each one moves it closer to what it should look like. Like if you have a complex scene where lots of elves are balancing on top of each other and the prompt specifies precise hand positions -- then it's going to take a lot of iterations and processing power to move towards what the image should look like. And you'll likely have some anomalies (missing limbs, heads) because you run out of your allocated processing time
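The "each iteration moves it closer" idea can be shown with a toy loop. This is not how DALL-E or any real diffusion sampler is implemented: a real sampler uses a learned denoising network and a preset number of steps. The functions `refine` and `distance`, the `rate` parameter, and the four-number "image" are all made up for illustration.

```python
import random


def refine(start, target, steps, rate=0.2):
    """Toy refinement: each step moves every value a fixed fraction toward the target."""
    current = list(start)
    for _ in range(steps):
        current = [c + rate * (t - c) for c, t in zip(current, target)]
    return current


def distance(a, b):
    # Euclidean distance between two equal-length lists of numbers.
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


random.seed(0)
target = [0.8, -0.3, 0.5, 0.1]                     # stand-in for "what the prompt asks for"
noise = [random.gauss(0.0, 1.0) for _ in target]   # stand-in for the random starting image

for steps in (0, 5, 20, 50):
    result = refine(noise, target, steps)
    print(steps, "steps -> distance", round(distance(result, target), 4))
```

The printed distance shrinks as the step count grows, which is the only point of the sketch: stopping after fewer steps leaves the result further from the target.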

empty kelp
#

That's why the images are tricky to edit. DALL-E tries to draw what the actual behavior is of the human/creature, and the image moves towards it. So flies will spawn near the gecko because that is what geckos eat

#

DALL-E 3 geckos are worse on Bing. This is four images of "very small humans walking past a large gecko". We were testing this to see if DALL-E creatures have behaviors, and they actually do

#

You can see that one of the characters spawned an umbrella so the gecko wouldn't drool on it (which is one of the most interesting AI behaviors i've seen)

empty kelp
#

I don't want to post too many gecko images since that would be crazy, but i'll give a few more examples as it shows behavior the DALL-E geckos inherit from the diffusion model training. Just adding a gecko to a scene will spawn all sorts of flying bugs. Also regardless of what the prompt is, if there is a small creature in the scene the gecko will completely ignore the prompt, and either lock onto the smallest creature in the scene with the appearance of wanting to eat it -- or the image will show the gecko in midair pouncing on the elf, bug, etc. with its mouth open and looking really disturbing. It's not possible to edit an image like this because the gecko strongly conflicts with any changes

#

You can't add something to a diffusion model that conflicts with training. it just won't be able to draw it, and so it swaps in something completely different

empty kelp
#

This is an example of a conflict that I could have avoided if the prompt was set up correctly. I had no problem editing the other two, but any small edit to the ogre just corrupted the entire character. Attempting to change his hair color turned him into a blue blob.

empty kelp
#

It's also almost impossible to put the character into another image if the prompt isn't set up for it

teal sandal
#

Using @empty kelp techniques

naive lake
#

Did the daily theme end already?

naive lake
quiet brook
#

my pfp is generated by dalle3

spark belfry
#

I have a question ???

late blade
#

like everyone at some point in life

#

shoot

spark belfry
late blade
#

gpt plus, $20/month

spark belfry
#

okey thanks

late blade
#

the macos app can't do edits yet? i can't see the revised prompts either

dim cradle
#

great value imo

dim cradle
#

i cant wait to see what hawainz can do with 4o image maker for christmas time this year haha

#

hopefully it's out by then 🙏

rough valve
#

Hi , I have a paid chatgpt account. But not able to figure out how to create images using Dall E. Can some one help ?

#

What URL should I use to create the image. When i click DallE 3.. it takes me to URL : https://chatgpt.com/?oai-dm=1 Here 4o is chosen default and it does not show Dall E as an option. When i ask to create an image, it says it does not have a capability to create images

#

Ok. I will try after some time. Maybe it is because I created the account just now

rough valve
#

It is working now. Thank you @teal sandal

loud field
#

Can free users get a dose of inpaint?

#

I need an inpaint on her nose....

#

I can upload this on gpt 4o and ask that?

#

Doesn't work

#

Aww

dense ermine
#

Really like the first one

velvet rampart
#

And the command is ”Generate X images, one after the other, without any extra prompting from me.”

Well, at least the version I use.

dense ermine
#

Oh okay, neat

sick pollen
#

If Smaug were a phoenix

grizzled iris
#

DALLE 4 coming soon 2024 Summer ✨

#

After we all test out and improve GPT-4o

#

Better every time haha

#

Not bad

#

I’m impressed

vapid elk
#

there are no announcements about a new DALL-E version coming any time soon

grizzled iris
#

All we can do is pray 🙏🏽

#

Free DALLE 4

dim cradle
#

4o image maker will be the best anyway

grizzled iris
#

Coming soon…

#

To 🎬

#

This turned out better actually x

clever phoenix
formal osprey
stiff shoal
#

There is an interesting problem with underwater scenes. DALLE-3 almost always depicts some kind of surface, where light comes from above. This can be really annoying if you just want to create a deep sea scene.

empty kelp
loud field
#

help me out.

#

if I write "ugly" or "unappealing" the prompt is banned

#

but I need a plain-looking woman

#

instead I keep getting top models

#

any tips?

#

this is supposed to be "mediocre, plain-looking, big nosed, imperfections"

#

wow

#

it's..... very problematic

vapid elk
grizzled iris
stiff shoal
#

Wow, I tried a lot but never succeeded. May I ask, what is the keyphrase in your prompt to exclude surface? Congrats btw.

grizzled iris
#

This is what I told our AI 😂

#

I gave our AI the challenge

#

But try this prompt mate :

#

A deep sea scene with no visible surface, featuring a dark and mysterious underwater landscape. The image should include a variety of sea creatures such as bioluminescent jellyfish, deep-sea fish with glowing features, and strange sea plants. The background should be a dark, almost black color, with soft, scattered light from bioluminescent organisms illuminating parts of the scene. There should be a sense of depth and vastness, emphasizing the deep ocean environment.

#

See if you're able to recreate this now, or you could say what I said and give our AI the challenge to create this image haha

#

Prompt :

#

“There is an interesting problem with underwater scenes. DALLE-3 almost ever depict some kind of surface, where light comes from above. This can be really annoying if u want to just create a deep sea scene.” Let’s prove our community wrong haha and show that we can make this image!

grizzled iris
stiff shoal
#

Cool, respect! Seems that the „priming“ with the community did the trick. „With no visible surface“ never worked with my work.

#

Because DALLE often ignores Negation I suppose…

#

@grizzled iris It worked, but we could improve the scene:

#

@grizzled iris thank you for the Inspiration.

frank cedar
#

Hi Guys

#

I need help with generating a sticker

grizzled iris
rough valve
#

Is there daily limits for Dall E in Plus plan ?

grizzled iris
rough valve
#

How to check my usage ?

frank cedar
#

So I need to make an icon like this. Or exactly like this

#

But I have done a lot of prompting and tried various times, but it doesn't get me the chubby face with the kind of Chinese eyes I'm looking for

#

I want it to look the same as that (almost)

rough valve
frank cedar
#

Its making stuff like this all the time

grizzled iris
#

One second

#

I’ll try now x

frank cedar
#

Thanks!

grizzled iris
#

Let’s see

#

We got it almost right

#

We need another word to mean “Chinese eyes” so it doesn’t think we are being rude

#

Let’s try this!

#

An epicanthic fold or epicanthus is a skin fold of the upper eyelid that covers the inner corner (medial canthus) of the eye.

#

Let’s see what happens now

grizzled iris
rough valve
#

How to check my usage ?

frank cedar
#

Btw the Words normally used are Narrow eyes. Or Almost Closed Eyes

#

Also it needs to have that sharpness

#

Yea. But i need it to have those "Asian Eyes from the China part"

#

I want to make exactly or almost same sticker

grizzled iris
#

Yeah, this is turning out to be quite a challenge!

frank cedar
#

Hmm. I agree on that. But AFAIK the person used Dalle as well to make this

grizzled iris
#

Let’s fix that

#

One moment…

frank cedar
#

Since Dalle makes those kind of stickers.

#

Haha thanks!

verbal sequoiaBOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

frank cedar
vapid elk
frank cedar
#

By Exactly I meant the facial look

grizzled iris
vapid elk
#

also give the image to the AI, that helps

grizzled iris
frank cedar
#

I did all that

grizzled iris
#

We’re almost getting there

vapid elk
#

then, ask it to generate a few times until you get an image that looks about right

frank cedar
vapid elk
#

from there, you can ask the AI to re-make it and you can also inpaint

frank cedar
grizzled iris
#

It’s changing it slowly

vapid elk
#

well, not much to do to get around that besides waiting, but you will get there eventually

grizzled iris
frank cedar
#

Yea it does that. It makes it full black

grizzled iris
#

But before I saw your problem, I was also getting dozens of full eyes

#

Fully opened eyes

#

Not diverse enough

frank cedar
#

Yea

grizzled iris
#

But we can all change that, we just need to keep improving it together

#

We can use this

#

To refine it

frank cedar
#

That is scary

frank cedar
grizzled iris
#

Yeah, one of the biases in the training data

#

We can remove this by continuing to give thumbs-down feedback

#

And telling our Ai how it should be

#

Almost

#

Try to use this as a reference image

#

Let’s see if that’ll help

#

Kinda getting there

frank cedar
#

It needs to lessen the height. And Make it less cute. (the pupil)

grizzled iris
#

Create a digital sticker of a character with a chubby face, narrow eyes, and an epicanthic fold. The character should have a short, black hairstyle, a neatly trimmed beard and mustache, and be wearing a dark suit with a red tie. The eyes should be depicted similarly to the reference images, showing the inner corner of the eyes covered by a skin fold and a narrow eye shape. The style should be cartoonish with clean lines and vibrant colors, accurately representing the diverse human features. The character should have a smiling expression.

#

Try that and adding in the reference image x

frank cedar
#

ok

grizzled iris
#

I got an even better idea

#

Let’s feed it Chinese anime reference images also

#

That’ll be closer to what’s required also

frank cedar
#

yea

grizzled iris
#

My friend just tried it using a Chinese AI image generator

#

They don’t have this problem but we also shouldn’t as many people live abroad

#

So we need to fix this haha

#

Omg!

#

I fixed it !

#

@frank cedar

frank cedar
#

How????

grizzled iris
#

Use my photo and prompt to roll this out everywhere

#

Look!

frank cedar
#

It got the eyes.

grizzled iris
#

Wooooo!

#

A detailed anime character with a confident expression. The character has short, black hair with bangs that partially cover the forehead. The eyes are sharp and slightly narrowed, giving a mischievous look. The character is wearing a white headscarf with loose ends hanging down, a white and green outfit with a high collar, and a decorative red tassel hanging from the left ear. The background shows an indoor setting with traditional elements.

#

Prompt :

frank cedar
#

But we need to get it on a sticker

grizzled iris
frank cedar
#

It's a 3D-ish look

grizzled iris
grizzled iris
# grizzled iris

A detailed anime character with a confident expression. The character has short, black hair with bangs that partially cover the forehead. The eyes are sharp and slightly narrowed, giving a mischievous look. The character is wearing a white headscarf with loose ends hanging down, a white and green outfit with a high collar, and a decorative red tassel hanging from the left ear. The background shows an indoor setting with traditional elements.

frank cedar
#

Ok

#

Lets try this on that chubby man

grizzled iris
grizzled iris
#

Almost

#

Like this?

frank cedar
#

Nah

grizzled iris
#

One thing at a time

#

We had to get over the eyes issue in the entire model 😂

#

The eyes issue is fixed now it should be easier

#

🙏🏽📿

frank cedar
#

Oh okay

#

Btw I got a issue.

#

😅

#

Its gonna take me a few hrs before i can generate more images

#

😭😭

grizzled iris
#

When that happens I also go to Copilot - DALLE 3

dim cradle
frank cedar
#

I didnt buy copilot yet.

And my credits are used. Im doin on that only

#

Its generating image but taking time. Dont worry tho. Im still on it. 😃

grizzled iris
#

Copilot app then click on Designer

frank cedar
#

Yea me too. Im just saying it says that. Your credits are used so images may take more time generating

#

Thats all

grizzled iris
#

You're right, it does take much much longer

frank cedar
#

Oh if u dont mind me asking. I tried sending you a fr.
U aint accepting any

frank cedar
grizzled iris
frank cedar
#

Yea. But the eyes

#

See my progress

#

And this is with no reference image lol

grizzled iris
#

That’s better now ❤️‍🔥

frank cedar
#

Yes

#

Btw guys

#

Make a Face of Chubby Chinese man, Short trimmed hair, Faded side Hair, The eyes are sharp and slightly narrowed, and thin eyebrows, Wearing a suit, Thin beard and mustache, cartoon illustration, Sticker, Thin outline

#

Heres the prompt I am using

grizzled iris
#

Thank you, we have a good community here x

teal sandal
#
A 2D flat cartoon animated style image of a man with short, black hair and bangs that partially cover the forehead. The man has sharp, slightly narrowed eyes giving a friendly look. He has darker skin and facial hair, including a goatee and mustache. He is dressed in a business suit with a tie. The background shows an indoor setting with traditional elements, rendered in a 90's music video aesthetic, with vibrant colors and dynamic lighting.
#
A cartoon illustration in the style of a sticker, featuring a chubby Chinese man. He has short, trimmed hair with faded sides, sharp, slightly narrowed eyes, and thin eyebrows. He is wearing a suit and sporting a thin beard and mustache. Use a thin outline for a clean, graphic look.
frank cedar
#

😭

#

The AI is DUMB

#

But yeah Nice progress. I think we almost there

grizzled iris
#

It just takes time, they just don’t have enough diverse references in their training data yet x

#

They should really move away from Large Language Models to Small Language Models soon to fix this issues of mass rubbish data in

#

Coming soon 2024 Summer I guess…

#

😜

frank cedar
#

yea. I even tried "Explain this image to a blind person" and still the prompt didnt work.

Btw what do u think the guy used to make the image i sent you earlier

#

This image

#

Its bing image gen?

teal sandal
#
A 2D flat cartoon animated style image of a man with short, black hair and bangs that partially cover the forehead. The man has sharp, slightly narrowed eyes giving a confident look. He has darker skin and facial hair, including a goatee and mustache that matches the initial image. He is dressed in a business suit with a tie. The background is a simple wooden texture, similar to the initial image, with less vibrant and dynamic colors, leaning towards a simpler, more understated style.
grizzled iris
#

Well I’m happy the top is not circle anymore 😂

grizzled iris
grizzled iris
grizzled iris
#

But when that happens we go to copilot because I’m not waiting 35 min 😂

#

@frank cedar

frank cedar
#

yea?

grizzled iris
frank cedar
#

the first and 3rd one

grizzled iris
#

That’s better !

#

Than OpenAI haha

#

And free 😂

frank cedar
#

yea

grizzled iris
#

Still DALLE

frank cedar
#

youd be surprised, Bing AI gets Goku and these type of characters better than open AI. You can say it has less restrictions and better image gen compared to chatgpt

grizzled iris
frank cedar
#

Okay. Can we make it Chubby.

The thing is I need it to look similar to a specific person. Just a cartoon version of it.

That hairstyle in the image i send and the moustache and beard

#

The eyes are fine.

grizzled iris
#

Haha the eyes was the hardest part!

frank cedar
grizzled iris
#

I think this is the closest

#

"2D anime character sticker with a confident expression, chubby face, very narrow and sharp eyes similar to the reference image, short black hair with decorative red tassels, black suit, white shirt, red tie, beard, and a simple white background suitable for a sticker design."

#

Prompt :

#

Reference image :

#

Anyways, hope this helps @frank cedar

#

I need to do some human activities now haha 😝

frank cedar
#

Thanks bro. Really appreciate your help

grizzled iris
grizzled iris
frank cedar
#

Have a nice day

haughty spruce
#

Sometimes, dall-e 3 gets so close to realism that I can't even tell what is real and what is not.

#

I can't wait for Sora to make surreal videos.

grizzled iris
#

Like this but with Sora ✨

#

Just gotta wait…

#

A little bit more…

empty kelp
#

Has anyone been able to get detailed objects a character is holding to look the same between images? I've been testing it with swords, but get huge variation when the characters change position.

#

I was thinking maybe we could tell DALL-E that the object is frozen in space first, and then have the character grab it, but haven't had luck with that yet

empty kelp
grizzled iris
#

The challenge involves achieving consistency in the detailed objects held by a character across multiple images generated by DALL-E, as demonstrated by the sword in the images provided. Here are refined strategies to address this:

1.    Detailed Description:

Provide an extremely detailed description of the sword, highlighting its unique features, such as the hilt design, blade texture, any inscriptions, and colors. Use these consistent descriptors in every prompt.

2.    Positional Consistency:

Describe the character’s position and how they are holding the sword precisely. Ensure the prompts indicate the same hand position and grip to maintain consistency.

3.    Environmental Anchors:

Use environmental features in the background to anchor the scene and maintain consistency. Mention specific items or settings that should remain constant in all images.

4.    Sequential Actions:

Frame the scenes as a sequence of actions. For example, “the character swings the sword from left to right,” followed by “the character holds the sword vertically.”

5.    Manual Adjustments:

After generating the images, use image editing tools to make minor adjustments to the sword, ensuring it matches across all images.

empty kelp
#

those need some testing. that might be the secret

grizzled iris
#

Example Prompts for Consistency

1.    Prompt 1:

“A female character with elf-like features, wearing a red blouse and blue jeans, holds a large ornate sword with a silver blade and a ruby-encrusted hilt in her right hand. She is standing in a modern kitchen, driving the sword into a watermelon, causing a splash of juice.”

2.    Prompt 2:

“The same female character with elf-like features, wearing a red blouse and blue jeans, is holding the same ornate sword with a silver blade and a ruby-encrusted hilt in her right hand. She stands in a modern kitchen, examining the sword closely with the watermelon on the counter beside her.”

3.    Prompt 3:

“The same female character with elf-like features, wearing a red blouse and blue jeans, is now holding the same ornate sword with a silver blade and a ruby-encrusted hilt over her head with both hands. She stands in a modern kitchen, bringing the sword down onto a watermelon, causing a splash of juice.”
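
A rough sketch of the "same descriptors in every prompt" idea from the list above: keep the wording for the character and the sword in one place and only vary the action. The names `CHARACTER`, `SWORD`, `SETTING`, and `build_prompt` are made up for illustration, and nothing here guarantees that DALL-E will actually draw the sword the same way each time.

```python
# Hypothetical helper: the character and sword wording lives in one place,
# so every prompt repeats it word for word and only the action changes.
CHARACTER = "female character with elf-like features, wearing a red blouse and blue jeans"
SWORD = "large ornate sword with a silver blade and a ruby-encrusted hilt"
SETTING = "a modern kitchen"


def build_prompt(action: str) -> str:
    """Return one scene prompt that reuses the shared descriptors."""
    return (
        f"A {CHARACTER} holds a {SWORD}. "
        f"She stands in {SETTING}, {action}."
    )


ACTIONS = [
    "driving the sword into a watermelon, causing a splash of juice",
    "examining the sword closely with the watermelon on the counter beside her",
    "bringing the sword down onto a watermelon with both hands",
]

prompts = [build_prompt(action) for action in ACTIONS]
for prompt in prompts:
    print(prompt)
```

Each string in `prompts` can then be pasted into ChatGPT or sent through the API one at a time.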

#

Try that and I see your point

#

It’s not consistent still is it lol 😝

empty kelp
#

i think we can figure out how to do it

grizzled iris
grizzled iris
frank cedar
#

Thanks

grizzled iris
#

@empty kelp gonna have to leave it up to you all here for now on this challenge ✨

empty kelp
#

even if we don't figure it out i think the Sora model will eventually solve the image problem because it's able to rotate things smoothly

grizzled iris
#

Yeah… Everyone is just waiting on Sora now I guess…

#

#FreeSora2024

empty kelp
# grizzled iris

now everyone is going to want a sword for their kitchen so they can stir watermelons

grizzled iris
empty kelp
# thick smelt *deja vu entered the chat*

It is a topic that needs more discussion in general though. Most people don't realize that the AI models are learning behaviors of everyone and everything that appears in their training. Even DALL-E will try to reproduce behaviors from every image in its training. That's amazing for drawing things, but it also puts a spotlight on the little creatures around us that we always thought were cute and harmless -- and then we realize that they're inside of the AI now.

tribal sierra
#

Who helped me last time with my prompt for an image ???

#

i am having a hard time making Dall-e place a character inside of a boat and holding a line that goes in the water... lol

grizzled iris
grizzled iris
#

This one ☝️?

#

A comic book style image featuring a small cargo boat at night under a starry sky. The boat has a rugged look with tires as fenders along the sides. Inside the cabin of the boat, a man with a gray beard, wearing a captain's hat and a dark jacket, is steering the boat. The man is visible through the cabin windows, with a determined and adventurous expression. The comic book style includes bold lines, dynamic shading, and vibrant colors. The scene captures the motion of the boat cutting through the water with dramatic speed lines and waves.

#

@tribal sierra is this the one?

#

Or something else

tribal sierra
#

yes lol

#

try to make it do a longline lol

grizzled iris
#

Will do once I’m home in half an hour

frank garnet
#

sigma

grizzled iris
dim cradle
grizzled iris
tribal sierra
grizzled iris
tribal sierra
#

Well i have that one here ..

#

but i need a longline which is a line with lots of hooks on it ..

#

thats what the ship looks like lol

grizzled iris
tribal sierra
#

lol its all stars!!! 😄

grizzled iris
tribal sierra
#

yeah i might just not do the line...

grizzled iris
grizzled iris
#

Challenge to anyone that can make the site plan for this image below 👇

grizzled iris
late blade
#

i hate when that happens

grizzled iris
grizzled iris
#

Now I’m stuck with the wrong designs

#

It needs solar panels…

grizzled iris
#

Guess we will try again in 1 hour and 40 minutes

#

Unless someone else beats me to it

slow steppe
#

Is DALL E free?

plucky hare
# slow steppe Is DALL E free?

Hey! You can try out DALL·E 3 five times a day over in the #image-bot channel -- just start a message with the /draw command over there and you'll see the bot options appear after that. Also, I think the Bing image creator still gives free daily credits, and that's also DALL·E 3.

From OpenAI: DALL·E 3 is available with a ChatGPT Plus subscription and via paid API access.

late blade
#

something changed with dall-e? getting quite interesting images

grizzled iris
late blade
grizzled iris
# empty kelp

Yeah true, but it doesn’t stop @empty kelp making great swords for watermelon cutting ⚔️ 🍉 😂

#

I’m still fighting with our AI trying to get the solar panels on the tracks!

grizzled iris
#

@tribal sierra longline fishing 🎣

#

I’ll try this since I’m stuck on the solar panels integrated into the tracks, site map 🗺️

late blade
#

DALL-E definitely fails at data analysis

grizzled iris
#

That looks cool though

late blade
#

if science was based on looking cool I would have already won a nobel

#

Quivered plot lol

verbal sequoiaBOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

tribal sierra
grizzled iris
#

@tribal sierra yeah I see your point also, it’s not easy doing this longline fishing net 🎣

tribal sierra
#

and even if we give him a picture he cant redo it ...

empty kelp
empty kelp
# tribal sierra yeah i did crack up a lot on some other one i did lol

a hyperrealistic wide image of the right side of a small fishing boat in a storm. the viewpoint is from the ocean to the right of the boat. the boat has a wheelhouse with glass windows. there is a salty geezer with beard and dark blue baseball cap inside the wheelhouse looking out it's front window and smiling with his teeth showing. fishing poles are visible on the boat's deck with white fishing line trailing in the ocean behind the boat.

It's tricky to get the captain to be inside of the ship's wheelhouse and looking out the front window. These were my best attempts.

tribal sierra
#

this is awesome lol i should come here all the time to make the images..

#

but i need it in a comic style. lol

#

and we dont need him in a storm ... lol

empty kelp
tribal sierra
#

yeah it looks evil lol

empty kelp
tribal sierra
#

lol thx

late blade
#

@grizzled iris here's an image where new tech failed to save the world

dim cradle
#

i think stating negatives just adds them to the image most of the time really

#

and confuse the model

storm scaffold
#

how do i generate these jackets in GPT? i tried but looks nothing like these, im not good with prompt

cerulean rain
#

try posting in Dall-e help

empty kelp
#

==========

"Part 1" of "Visual Continuity in Multiple Images" is now in this "dall-e-gallery":

#1245195468615450765 message

(Everything I posted in "dall-e-discussions" a few days ago is in it.)

=========

fiery salmon
#

I can make a mean Kung Pao Chicken with Rice

#

Ight one more, I don't want people to get mad that I'm not posting in #images-canvas

empty kelp
#

from openai import OpenAI

client = OpenAI()  # reads the API key from the OPENAI_API_KEY environment variable

THE_PROMPT = '''[Seed: 1885772499] a cinematic 32mm photo style image with natural color, texture, and lighting, and split into three borderless frames.

Frame 1:
An athletic and diverse female elf. She has bare feet, brown shorts, a gray tank top, and a brown leather belt. She has a friendly smile. She is running. The background is a beach. No other people, animals, or additional objects in the scene.

Frame 2:
An athletic and diverse female elf. She has bare feet, brown shorts, a gray tank top, and a brown leather belt. She has a friendly smile. She is sitting down. The background is a park. No other people, animals, or additional objects in the scene.

Frame 3:
An athletic and diverse female elf. She has bare feet, brown shorts, a gray tank top, and a brown leather belt. She has a friendly smile. She is eating a mean bowl of Kung Pao Chicken with rice. The background is an open air restaurant. No other people, animals, or additional objects in the scene.
'''
response = client.images.generate(
    model="dall-e-3",
    prompt=THE_PROMPT,
    quality="hd",
    style="vivid",
    n=1,
    size="1792x1024",
)
print(response.data[0].url)  # link to the generated image

#

Santa let the elf have a day off, so she spent time outdoors and had a mean bowl of Kung Pao Chicken with Rice.

empty kelp
#

#1245195468615450765 message
So this gallery shows how to make a character appear the same in the same image, or across multiple images. And you can swap out clothes, events, and other elements. Part two will be about rotating character to specific angles.

grizzled iris
grizzled iris
#

Let me know if you need adjustments to the jackets so that I can share the new prompt with you, but I think the prompt here is a good start 🙏🏽

grizzled iris
#

I kinda like those jackets lol 😂

#

@storm scaffold

storm scaffold
#

its fire!

grizzled iris
#

No worries @storm scaffold - you can try out the method I used and then tweak the prompt, if you're not good with prompts you can always ask the GPT to look at the image and help you craft a good prompt!

grizzled iris
#

Today's daily is a great one ☝️

fiery salmon
grizzled iris
#

@fiery salmon

#

It worked!

fiery salmon
#

Eureka

grizzled iris
grizzled iris
fiery salmon
#

In that case...

#

Bring these boys back

#

The Bruiser Brothers

haughty spruce
#

I wish there was a way to create consistent characters

grizzled iris
#

Kinda getting there x

#

@fiery salmon

#

@fiery salmon 😂

#

For some reason adding the name is difficult

#

For “The Bruiser Brothers”

#

Adding “Luminark” was ok

#

🙃

#

🙃

#

Adding “The Bruiser Brothers” is not allowed apparently 😂

grizzled iris
fiery salmon
#

The brothers are invincible

tribal sierra
#

loll

#

ohhhh its even worse.. lol

grizzled iris
#

Yeah the longline ship challenge is probably the hardest then comes “The Bruiser Brothers”

#

Then the Solar Panels integrated into the train tracks in an architectural/ site map image 🙏🏽

#

In order of difficulty

#

Last couple of minutes left until the theme resets!

#

New theme is really cool now ngl haha 😝

#

Can finally work on these projects more x

grizzled iris
#

Finally some good news!

#

Let’s see DALLE in action x

pliant jay
#

I’m a free user from Pakistan 🇵🇰 and can’t generate images, it shows a message saying image generation is only for Plus users

grizzled iris
#

Yeah I just got the same news from my friends on the other side of the world in Canada

#

I immediately called everyone to check who has it and who doesn't

#

And it looks like some have it now but many not... yet!

#

Let's pray before Friday we have it all rolled out fingers crossed for everyone for free ❤️

#

I myself cannot wait to have the new vision feature so that I can show it to my Hospital to help people with sight problems that have just experienced eye injury be able to order a taxi home and have a better quality of life

dim cradle
#

does anyone know if dalle can directly edit uploaded images? im testing by trying to make dalle clean this simple badge, but it will not send any image in response. and then it will proceed to create me a new, entirely different badge.

grizzled iris
#

Could you share the badge with me please?

#

@dim cradle

dim cradle
dim cradle
brazen gate
#

Would anyone know where I could find a prompt involving great online business knowledge or advice?

dim cradle
grizzled iris
#

@dim cradle 😂

grizzled iris
#

@dim cradle this is turning out to be a cool Dalle Challenge 😂

#

Look

#

Almost there

#

The blob just needs to move to the centre

dim cradle
grizzled iris
#

Would’ve been quicker by hand but I guess its good to teach the DALLE model on this 😂

dim cradle
#

absolutely. still, it would work better if dalle could just reupload your pic, as if it had been created by dalle, and then we just use the selector to mark and edit more precisely.

grizzled iris
#

Also they changed the theme to Current Theme:
🌼 daisy - innocence, simplicity, a simple flower that brings joy!

#

Oh I see it was a repeat so it was changed x

grizzled iris
#

But I need to wait until the model updates now so that the coordinates of the colouring is accurate

#

Another great challenge to add to the bucket list though!

#

I can offer you this one for now 😂

dim cradle
grizzled iris
#

No worries, take care and good night/morning

#

@dim cradle

pure robin
#

Guys, i need help... in approximately 10 requests, i tried to create a logo without any text but everytime it generates with text... i don't know what to do to get rid of text generation and gpt don't listen to me... i feel like i wasted my money on this

#

wdym not mention text when he generated automatically text without mention

empty kelp
#

I got lucky here and it put "Greasy Deluxe Hamburgers" on the first try, but DALL-E has some issues with text at this time. The best way to handle it is to remove the difficult text (like I did above), and then add it in Photoshop, or another image editing program.

pure robin
#

i don't want text in images... there is a difference between this and what you actually said... i don't care if he messes up the text... i don't want text at all

#

but with every request he always put text in image

empty kelp
#

I'm not sure why your images always add text though. If you show us the prompt we might be able to see why

#

It might work to say "Don't put any text in the prompt." in the logo prompt

empty kelp
#

It works to just say "logo with no text"

pure robin
empty kelp
pure robin
#

i have no custom instructions

empty kelp
#

there must be some words in the prompt that it associates with creating text. it could be anything -- there is no way to tell what DALL-E picked up in its training without testing it

velvet rampart
#

Maybe wording the prompt like "A plain logo". Would this help?

#

Dall-e gets confused with negative prompting.

empty kelp
#

```
Snorka is an athletic and diverse female kobold. Snorka has bare feet, plain yellow dress, brown leather belt. Snorka has green skin, blue eyes, and black hair with a pink plumeria. Snorka has a friendly smile. Snorka is confronting Snasha with a random kung fu attack pose.

Snasha is an athletic and diverse female elf. Snasha has bare feet, brown shorts, light blue tank top, and black belt. Snasha has green brown skin, green eyes, and red hair with a yellow plumeria. Snasha has an angry expression. Snasha is defending against Snorka with an appropriate kung fu defense pose.

The background is a beach.  No other people, animals, or additional objects in the scene.
```
#

This is a way of ordering things for character interaction

#

(you could have them play chess like this for instance)

#

If you set it up like this you can move the characters between images
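
A small sketch of that ordering, assuming each character is kept as a block of fields that always render in the same sequence (what they are, outfit, features, expression, action). The `Character` class and `scene_prompt` function are hypothetical names, not anything built into DALL-E or the API. Moving a character to another image then means changing only the action and the background.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Character:
    """One character block; the fields always render in the same order."""
    name: str
    kind: str        # e.g. "an athletic and diverse female elf"
    outfit: str
    features: str
    expression: str
    action: str

    def describe(self) -> str:
        n = self.name
        return (
            f"{n} is {self.kind}. {n} has {self.outfit}. "
            f"{n} has {self.features}. {n} has {self.expression}. "
            f"{n} is {self.action}."
        )


def scene_prompt(characters, background: str) -> str:
    """Join the character blocks, then add the background as the last block."""
    blocks = [c.describe() for c in characters]
    blocks.append(
        f"The background is {background}. "
        "No other people, animals, or additional objects in the scene."
    )
    return "\n\n".join(blocks)


snorka = Character(
    name="Snorka",
    kind="an athletic and diverse female kobold",
    outfit="bare feet, plain yellow dress, brown leather belt",
    features="green skin, blue eyes, and black hair with a pink plumeria",
    expression="a friendly smile",
    action="confronting Snasha with a random kung fu attack pose",
)
snasha = Character(
    name="Snasha",
    kind="an athletic and diverse female elf",
    outfit="bare feet, brown shorts, light blue tank top, and black belt",
    features="green brown skin, green eyes, and red hair with a yellow plumeria",
    expression="an angry expression",
    action="defending against Snorka with an appropriate kung fu defense pose",
)

beach_prompt = scene_prompt([snorka, snasha], "a beach")

# Same characters, new scene: only the action and the background change.
chess_prompt = scene_prompt(
    [
        replace(snorka, action="playing chess against Snasha"),
        replace(snasha, action="playing chess against Snorka"),
    ],
    "a park",
)
print(chess_prompt)
```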

empty kelp
dark terrace
#

I have a question: If I get ChatGPT Plus, does it gain the feature of creating images with Dall-E? Like, I ask ChatGPT to create an image of what we're talking about.

#

Nevermind, I answered my own question. I just got Plus and It works amazingly well

onyx fiber
#

‘Dwayne Johnson and selena gomez’ do funny somersaults on the beach

grizzled iris
#

The free DALLE Image generator finally works here on Discord 🎉

#

Just have to grab your daily credits first by typing /daily openai_email:

#

It used to never work for me but now it does 😝

#

Interesting to see an optional style button to make it “Vivid” ☝️

grizzled iris
lusty wing
#

hey guys is dalle 3 good enough to create realistic ai models?? ones that i can't distinguish from real?? also i want that specific character generated in different poses with consistency. Possible?!?! Can anyone show me results with prompts

#

i dont have gpt plus

#

or is it true dalle 3 is not the best ai generator for realistic ai models out there??

#

coz here I saw many outputs by ppl here but none were photorealistic🤔

meager flame
wind trail
#

this mac-os app is extremely laggy, almost unbearable. It slows down the whole system lol

#

maybe its producing tokens itself, otherwise how it's possible it is so bad coding ?

#

The browser interface is still superior, while the mac app is just a gimmick

fresh cairn
#

Hmm, I'm not a new user.

#

And why remove access to old users?

grizzled iris
#

Strange

#

Copilot - DALLE 3 is still free

#

I’m also on Mac app and yes its just too new so it’s laggy

#

Web version is my go to for now

fresh cairn
grizzled iris
#

I wonder how it is now

#

I might check it out actually via an API if that’s possible?

grizzled iris
#

Perfect thanks 🙏🏽

#

It’ll be interesting to see how much it’s changed since the beginning

supple quiver
#

YMCA

boreal crown
grizzled iris
hybrid fox
#

guys

#

been trying to replicate thomas cole's oil painting landscape style

#

for like an hour and I cant seem to be able to do it

#

This is the closest I got but it still seems 'AI-y' to me

#

Here are some examples of his art

#

mb someone can do it better than I can

#

I used a super prompt to get this to define his style and then defined it as style A

#

And used style A in my prompting but it doesnt seem to work idk why

agile peak
#

Combining two elements looks so good..

delicate moat
#

no way they removed dalle tho

plucky hare
delicate moat
# plucky hare In ChatGPT? Seems to work for me!

nah in the website dalle they say "We are no longer allowing new users to DALL·E 2. DALL·E 3 has higher quality images, improved prompt adherence, and we've started rolling out image editing. It is available for ChatGPT Plus, Team and Enterprise and the OpenAI API."

plucky hare
delicate moat
plucky hare
empty kelp
mellow grotto
dark terrace
#

How do I do image to image in ChatGPT? Is there a specific command for it?

plucky hare
dark terrace
plucky hare
dim cradle
shy elbow
#

So question, has Dalle gotten even more censored in the last few months. Tried to ask it to have a man, using a photo of a real man, to transform into a oozaru and was refused.

#

It said it violated their policy, which I can't see how. unless of course we can't even use real people for references anymore

glossy steppe
#

can I ask DALL-E for a specific size in pixels? Or does it not work
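
For the API side of this question, a hedged sketch: as far as I know the `dall-e-3` model only accepts three `size` values (1024x1024, 1792x1024, 1024x1792), so an arbitrary pixel size has to be mapped to the nearest supported shape and then cropped or resized afterwards in an image editor. The `closest_size` helper below is hypothetical, and the size list should be checked against the current API docs before relying on it.

```python
import math

# Sizes the dall-e-3 API model is assumed to accept (verify against the docs).
DALLE3_SIZES = ["1024x1024", "1792x1024", "1024x1792"]


def closest_size(width: int, height: int) -> str:
    """Pick the supported size whose aspect ratio is nearest to width:height."""
    target = math.log(width / height)

    def distance(size: str) -> float:
        w, h = (int(part) for part in size.split("x"))
        # Compare log ratios so wide and tall shapes are treated symmetrically.
        return abs(math.log(w / h) - target)

    return min(DALLE3_SIZES, key=distance)


print(closest_size(1920, 1080))
print(closest_size(1080, 1920))
print(closest_size(800, 800))
```

The returned string can be passed as the `size` argument of `client.images.generate`.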

quartz vale
vale dust
dark terrace
#

Can I do outpainting in Dall-E through ChatGPT?

plucky hare
dark terrace
frank kindle
#

Have just made a thread in dalle help. Hoping someone can help. Does anyone know how to stop dalle placing figures across the image from side to side in vertical aspect ratio? Seems crazy to me that it would do this but I cant figure out how to stop it doing it and make them upright. https://discord.com/channels/974519864045756446/1246493092639412224

#

Also I feel like this is a definite flaw with the app. Is there some way to get this feedback to the developers to try and fix it?

empty kelp
empty kelp
#

```
Snorka is an athletic kobold. Snorka has bare feet and appropriate ballroom dance outfit.  Snorka has green skin, blue eyes, and black hair with a pink plumeria. Snorka has a friendly smile. Snorka is dancing with Snasha in a random pairs Tango position.

Snasha is an athletic and diverse elf. Snasha has bare feet and appropriate ballroom dance outfit. Snasha has green eyes, and red hair with a yellow plumeria. Snasha has a random expression. Snasha is dancing with Snorka in pose complementary to Snorka's pose.

Snorka and Snasha are holding hands (with precise finger positioning) if the pose calls for it.  

The background is a dance class on a beach in Hawaii.  No other people, animals, or additional objects in the scene.  Please don't modify the prompt.
```
#

I didn't think the "pose & counter/complementary pose" was going to work for things other than martial arts type interaction where there isn't close contact, but apparently it does.

(random martial arts)
#images-canvas message

(random ballroom dancing)
#images-canvas message

empty kelp
#

For harmless, not violating the rules type things -- it figures out close complementary poses.

fiery salmon
#

The owl goddess takes the form of a large Barn Owl and brings wisdom and a hand in battle against those deemed unholy

empty kelp
#

This is the prompt above:
#images-discussions message
with "random cheerleading base pose" and "complementary cheerleading flyer pose" replacing the two lines with dancing.

velvet rampart
#

Can dall-e make consistent characters now?

gray surge
thick steeple
empty kelp
#

in the images above I used the API and didn't even bother to set a seed. they would look the same between images if I had used the web interface with GenID & seed

#

was trying to make them all look different

flint isle
rigid birch
# flint isle

Fantastic!! If you don't mind me asking, what prompt did you use ... and did you enter the prompt in the new ChatGPT 4o "omni" model or the previous ChatGPT 4?

grizzled iris
#

Hopefully GPT-5 will be more inclusive and have more supported languages 🙏🏽

#

Also this was quite cool

#

Being able to see the DALLE

#

Images generated on the same UI

#

As the voice feature and do multiple

#

Without exiting

#

Super cool 🎉

flint isle
naive lake
#

Does anyone know why or how this happens? Sometimes I would get super interesting (human like painting) results like on the left picture, but it happens very rarely. Is there a way to always force it to get a similar result? Anyone else encountered this?

flint isle
#

Oil painting of a single rock inside a zoo cage.

agile peak
#

Just realized prompting techniques from previous Dalle version cookbook, really worked so well.

serene knoll
#

Guys. ChatGPT image editing almost never works for me. I fill a giant area for editing on an image of a Japanese garden and ask “put a taco truck there”. But there’s no taco truck to be seen, even though it says:

“Here is the updated image of the Japanese garden with a taco truck included. If you have any further requests or adjustments, feel free to let me know!”

#

This was the result.

#

Oh wait. I see it. But it’s so damn small!

#

Also, it’s not a taco truck at all. It just looks like an old car

empty kelp
empty kelp
tardy patio
#

anyone noticed a significant improvement lately?

#

not sure if it's just me but especially the photorealism is looking awesome

empty kelp
#

a hyperrealistic wide image. gravel path circles around a pond in the middle of a beautiful Japanese garden

#
EDIT 2:  athletic and diverse female elves are swimming in the pond
empty kelp
#

guessing they have servers for different regions, so it's probably similar around the world

tardy patio
#

interesting, never noticed that until now

empty kelp
empty mulch
#

Prompt: Generate a shoe cabinet with a vertical partition in the center and two drawers at the top. The shelves should extend from the central vertical partition to the left and right, with vertical spacing of half the height between the shelves. Each shelf should contain different types of shoes, including various sports shoes and elegant high heels. Focus only on these two columns of shelves with the central partition, no additional sections or columns.

#

Prompt: Create an image of a shoe cabinet with a central vertical partition. On the top of the cabinet, there are two drawers. Shelves extend from the central partition to both the left and right sides. Each shelf should have two pairs of shoes, with the left shelves and right shelves vertically offset by half the distance between shelves. For example, if the distance between the shelves is 15cm, the vertical offset between the left and right shelves should be 7.5cm. The shelves should contain different types of shoes, including various sports shoes and elegant high heels. Only include these two columns of shelves with the central partition in the image.

#

Why not working?

long yew
#

@empty mulch maybe try adding a reference image?

empty mulch
# long yew @empty mulch maybe try adding a reference image?

Prompt: Create an image of a shoe cabinet with a central vertical partition. On the top of the cabinet, there are two drawers. Shelves extend from the central partition to both the left and right sides. Each shelf should have two pairs of shoes, with the left shelves and right shelves vertically offset by half the distance between shelves. For example, if the distance between the shelves is 15cm, the vertical offset between the left and right shelves should be 7.5cm. The shelves should contain different types of shoes, including various sports shoes and elegant high heels. Only include these two columns of shelves with the central partition in the image.

Use the reference image as a template.

plucky hare
# empty mulch Prompt: Create an image of a shoe cabinet with a central vertical partition. On ...

Though very capable, DALL·E isn't a perfect "rule-follower" as far as text-to-image generation goes. It's not like a human graphic designer, where you can give a certain set of specific measurements/ratios/requirements and expect the model to adhere to them exactly. Spatial awareness and specificity like this are known limitations of the current model. If you're curious to read why, the DALL·E 3 research paper has interesting insights on the topic. The most relevant portions for this would be near the end in the Limitations & Risks section pertaining to spatial awareness and specificity. https://cdn.openai.com/papers/dall-e-3.pdf

verbal sequoia BOT
#
Showcase your DALL·E creations to the world!

Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.

Quick tips for selection:

  • No copyrighted or profane content.
  • Your profile needs to be public.
  • Stay mindful of sensitive topics.
  • Outputs that closely follow the prompt are preferred.

Feel free to share reels, carousels, or just a single image!

grizzled iris
#

Building on last month's challenge of solar panels integrated into train tracks, I found a better solution!

#

Let’s see if Dalle can make this 🙏🏽

grizzled iris
#

This will be quite a challenge with DALLE, but imagine a flexible solar panel we can just stick on everywhere!

#

Struggling with DALLE to make this work 🙃

#

Not easy creating flexible solar panels with DALLE properly 😅

#

Arggggggg

#

It’s supposed to be only 2 millimetres thick!

#

Something like this I guess for trams 🚊

#

Finally!

#

Prompt : A highly realistic image of a tram system integrated with HeliaSol solar panels. The tram tracks feature ultra-thin, flexible HeliaSol solar panels embedded in the pavement between the rails. These panels are only 2 millimeters thick and seamlessly integrated, capturing sunlight efficiently. The scene includes a modern tram running on the tracks, pedestrians, and surrounding urban buildings, highlighting the innovative use of sustainable solar technology in public transportation.

#

That took wayyyy too long to get there 😤

#

But we got there in the end…

serene knoll
#

Also guys, have you noticed it’s extremely difficult to get Dall-E to draw non symmetrical pixel art? It always draws with the vanishing point square in the middle.

empty kelp
empty kelp
#

If you have 3D pixel art with a vanishing point, you could put the drawing focus on something (so the focus fills most of the image), and then move the viewpoint close to a surface in the image. That will likely end up shifting the vanishing point away from the center of the focus

#

If you don't want a vanishing point in the pixel art you could say 2D, orthographic, isometric; etc.

#

Putting the viewpoint on the ground in the image above ended up shifting the vanishing point to the bottom of the statue

#

a pixel art image of a cat in a hallway. the viewpoint is on the ground against the right wall of the hallway

empty kelp
#

I did more testing, and it seems like if you say "pixel art image" it always draws it Wizardry style with the vanishing point in the center, and ignores any view/viewpoint suggestions. Might need to add 3D or some other word to "pixel art" so it doesn't produce Commodore 64/Apple IIe era images

frozen copper
#

I really wish that we can do outpainting in dall.e

empty kelp
heavy cairn
empty kelp
#

we could make 3D images more pixel art-ish like this, but it would also need to draw everything as a grid of squares somehow to look like pixels

grizzled iris
#

I’m inspired by pixel art now @empty kelp

grizzled iris
#

The angle from below is cool

#

It makes the statue pop

empty kelp
#

in most 3D DALL-E images you can just stick the viewpoint anywhere and shift the vanishing point. we just need to figure out a 2nd pass that makes it look like pixel art
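One possible "2nd pass" outside DALL-E is to pixelate the finished render yourself. The snippet below is only a sketch of that idea, not something DALL-E does internally: `pixelate` is a made-up helper that takes an image as a plain grid of RGB tuples and replaces every `block` x `block` tile with its average colour, which gives the "grid of squares" look. Loading and saving real image files is left out and would need an image library.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (red, green, blue), each 0-255


def pixelate(image: List[List[Pixel]], block: int) -> List[List[Pixel]]:
    """Return a copy of `image` where each block x block tile is one flat colour.

    `image` is a list of rows; every row is a list of RGB tuples.
    Tiles on the right and bottom edges may be smaller than `block`.
    """
    if block < 1:
        raise ValueError("block must be >= 1")
    height = len(image)
    width = len(image[0]) if height else 0
    out = [row[:] for row in image]  # copy so the input is left untouched
    for top in range(0, height, block):
        for left in range(0, width, block):
            rows = range(top, min(top + block, height))
            cols = range(left, min(left + block, width))
            tile = [image[y][x] for y in rows for x in cols]
            # Integer average of each colour channel over the tile.
            avg = tuple(sum(p[c] for p in tile) // len(tile) for c in range(3))
            for y in rows:
                for x in cols:
                    out[y][x] = avg
    return out
```

A larger `block` gives chunkier pixels; a limited colour palette would be a separate step on top of this.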

grizzled iris
#

Can’t get those flexible stick-on solar panels to go on the bottom of the ship, new challenge for the community…

#

I’ve tried my best…

#

Curved, flexible DALLE Solar Panel generation is still not quite there yet

empty kelp
# grizzled iris It makes the statue pop

Zorka is an athletic kobold. Zorka has bare feet and appropriate ballroom dance outfit.  Zorka has green skin, blue eyes, and black hair with a pink plumeria. Zorka has a friendly smile. Zorka is dancing with Droll in a random pairs Tango position. Zorka looks into Droll's eyes.

Droll is an athletic and diverse elf. Droll has bare feet and appropriate ballroom dance outfit. Droll has green eyes, and red hair with a yellow plumeria. Droll has a random expression. Droll is dancing with Zorka in pose complementary to Zorka's pose. Droll looks into Zorka's eyes.

Zorka and Droll are holding hands (with precise finger positioning) if the pose calls for it.  

The background is a beach in Hawaii.  No other people, animals, or additional objects in the scene.  Please don't modify the prompt.
#

This does a pretty good job of making 3D scenes look like pixel art:
The photo resembles a tapestry of tiny squares, and has texture, color, and lighting with pixel art styles.

wintry epoch
#

Can't get a side view of them, I don't understand

hexed umbra
naive lake
#

anyone else getting server error?

empty kelp
#

like, "we see a side profile of ..."

#

"their heads are turned to look at ..."

#

in the two dancing images above -- the prompt is, "CharA is looking into CharB's eyes", and "CharB is looking in CharA's eyes". Saying specifically what they're facing will turn their body, and saying what they're looking at will turn their heads

wintry epoch
empty kelp
wintry epoch
empty kelp
wintry epoch
vapid elk
#

I have a request for help / challenge: figuring out a way to make the classic Tolkien fantasy, DnD dwarfs without the AI making them with pointy ears or elf-like characteristics

#

every time I try, I get a weird mix of elf and dwarf

#

does anyone have a good baseline prompt for that?

grizzled iris
#

Sure, let’s try this challenge!

#

Give me a few attempts @vapid elk

#

Sorry but looks like our servers are down!

vapid elk
#

yea, that is going to wait a bit xD

neat orchid
#

yea, is chat gpt down? i'm also havin issues

sick oak
grizzled iris
#

Gonna have to wait y’all ❤️

thick robin
#

Im Just too good bro 😎

grizzled iris
#

❤️

#

Also the API is ok

#

So DALLE works directly

#

Just checked: DALLE works in the API / bot section and in direct DALLE

#

@vapid elk

#

Like this?

#

Or not?

vapid elk
#

that worked nicely, do you have the actual prompt the AI used?

#

@grizzled iris

#

I think there is a problem tho, I think it just re-created the dwarves from the lotr movie, I think it might fail for character design

#

still, that is what im looking for xD, I'll do some tests later with that

#

thx =D

teal sandal
#
A wide photorealistic portrait of a dwarf, capturing the essence of his character and heritage. The dwarf has a weathered and lined face with a long, braided beard flowing down to his chest. His deep-set, piercing eyes gleam with wisdom and a hint of mischief. He has normal human ears. His short and stocky body is built for endurance and the rigors of underground life, with broad shoulders and thick arms. He is dressed in practical, sturdy attire made of leather and wool, wearing a worn tunic, a thick belt with a heavy buckle, and trousers tucked into sturdy boots. He holds an intricately carved wooden pipe with wisps of smoke curling upwards. A finely crafted axe adorned with runes hangs at his hip, symbolizing his heritage and skill. The background hints at a dimly lit cavern or workshop, suggesting the dwarf's affinity for the underground world and his mastery of craftsmanship.
vapid elk
#

that is very good

#

gimme da prompt

#

im trying to find a reliable prompt because Im using the API

grizzled iris
vapid elk
#

no worries xD

grizzled iris
#

Create a group of classic Tolkien fantasy Dwarves standing in a rocky underground cavern. They should be stocky and muscular with thick beards, braided and adorned with beads and metal rings. The Dwarves are wearing sturdy, intricately designed armor and helmets, with broad, round shields and large, ornate axes. They have short, broad noses, rounded ears, and strong, determined expressions. The cavern is lit by torches, casting warm light on the rough stone walls and the Dwarves' detailed armor.
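For anyone testing a prompt like this through the API rather than ChatGPT, here is a minimal sketch using the official `openai` Python package. The helper names `build_request` and `generate` and the `SIZES` table are made up for illustration; the call itself is `client.images.generate`, and it assumes an `OPENAI_API_KEY` environment variable is set. The `revised_prompt` field on the result is the rewritten prompt that DALL·E 3 actually used, which helps when comparing runs.

```python
from typing import Dict, Optional, Tuple

# Image sizes accepted for DALL-E 3 (square, landscape, portrait).
SIZES: Dict[str, str] = {
    "square": "1024x1024",
    "wide": "1792x1024",
    "tall": "1024x1792",
}


def build_request(prompt: str, orientation: str = "square") -> Dict[str, object]:
    """Build the keyword arguments for one DALL-E 3 image request."""
    if orientation not in SIZES:
        raise ValueError(f"orientation must be one of {sorted(SIZES)}")
    return {"model": "dall-e-3", "prompt": prompt, "size": SIZES[orientation], "n": 1}


def generate(prompt: str, orientation: str = "square") -> Tuple[Optional[str], Optional[str]]:
    """Send the request and return (image URL, revised prompt)."""
    # Imported here so build_request can be used without the package installed.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.images.generate(**build_request(prompt, orientation))
    image = response.data[0]
    return image.url, image.revised_prompt
```

Calling `generate(dwarf_prompt, "wide")` with the prompt above would return a URL plus the revised prompt, so you can check whether "rounded ears" survived the rewrite.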

#

Try that!

#

@vapid elk

plucky hare
#

Wow, surprised that "Tolkien" passes trademark filters! Cool!

grizzled iris
#

Hehe 😜

#

☝🏽

vapid elk
grizzled iris
#

In the meantime

#

GPT Is slowly coming back y’all

#

All systems operational x

#

Prompt : An illustration of OpenAI's ChatGPT coming back online after a major outage on June 4th, 2024. The image shows a futuristic server room with bright, glowing lights coming back on, signifying the restoration of power. In the center, there is a large digital display showing 'ChatGPT Online' with a green checkmark. Technicians in modern uniforms are seen celebrating with raised arms and smiles. Background elements include various advanced AI equipment, wires, and screens displaying data. The overall atmosphere is one of relief and triumph.

grizzled iris
#

@teal sandal and the face is really looking properly defined ✨

#

We’re almost catching up to MidJourney haha 😜

dim cradle
#

Bing gives free dall e 4

dim cradle
#

i wish!

#

(dalle3 you mean)

grizzled iris
#

Maybe he has a secret alpha mode we don’t know yet haha 😆

#

But yes, Bing, now called Copilot, has free DALLE 3 and GPT-4 that we can try out on both the web and app versions x

#

And if anyone actually has any easter egg knowledge about DALLE 4, please do share the good news with the community haha 😆

#

Without getting in trouble of course

#

#FreeDalle4

#

Or Dalle 3.5 ✨

#

… Sam Altman is typing…

#

😂

serene knoll
worthy vector
#

how to make dalle actually put words on the image

#

it keeps giving gibberish

frosty furnace
#

Just keep rerolling. It will hit it at some point if it's not too much text.

#

Then, use inpainting on errors.

serene knoll
#

I can’t get it to draw it with the viewpoint I want. It keeps drawing it symmetrically. 😩

dim cradle
merry vault
#

might work better

latent hare
#

What I get sick of is sideways images when I ask for portrait/tall

#

And random borders

hearty obsidian
#

Is it just me, or is the edit feature difficult to use, and does it often just ruin the image?

#

What might I be doing wrong?

short lake
#

Does the Dalle3 API create different looking images to those made with BingChat or ChatGPT?

hot rain
#

One thing I've never figured out is eye color. No matter how I describe blue eyes, it wants to make them practically glowing if I denote the eye color lol. And if I don't, it's up to random chance if they'll be blue, brown, green etc.

austere nexus
#

So, Nvidia CEO can publicly autograph a woman's chest (I think it happened today? confirming), but Dall-e will slap your wrist if you try and make that an image. (without mentioning names, so not an impersonation issue).

Edit: yes, was at Computex; either yesterday or today.

empty kelp
# hearty obsidian Is just me or the edit feature is difficult to use and many times just ruins the...

a big part of what it edits is the English prompt, and the order of things in the prompt makes a big difference for editing. Like if you start off describing a character with the type of jewelry it's wearing and then you try to edit the jewelry -- it may turn your character into a shapeless blob or a space ship. The AI converts English phrases into high level vectors, and for you or the editor to reference the description depends on keeping the English wording for each element exactly the same. Otherwise it essentially rerolls it in an extremely random and destructive way -- along with everything that's nested on top of it
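A small sketch of one way to keep the wording stable between edits: store each element of the prompt as its own sentence, in a fixed order, and only ever replace one sentence at a time. The names `ELEMENTS`, `build_prompt`, and `swap_element` are made up for this example, and nothing here guarantees how DALL-E will react; it only makes sure the untouched parts of the prompt stay word-for-word identical.

```python
from typing import Dict

# One sentence per element, listed in the order they should appear.
# Broad things (the character) come before details (hair, outfit).
ELEMENTS: Dict[str, str] = {
    "character": "Zorka is an athletic kobold.",
    "hair": "Zorka has black hair with a pink plumeria.",
    "outfit": "Zorka has bare feet and an appropriate ballroom dance outfit.",
    "background": "The background is a beach in Hawaii.",
}


def build_prompt(elements: Dict[str, str]) -> str:
    """Join the element sentences in their stored order."""
    return " ".join(text.strip() for text in elements.values())


def swap_element(elements: Dict[str, str], name: str, new_text: str) -> Dict[str, str]:
    """Return a copy with one element replaced; order and other wording are unchanged."""
    if name not in elements:
        raise KeyError(name)
    updated = dict(elements)  # dicts keep insertion order
    updated[name] = new_text
    return updated


edited = swap_element(ELEMENTS, "hair", "Zorka has red hair with a yellow plumeria.")
print(build_prompt(edited))
```

Only the hair sentence differs between the two prompts, so any change in the rest of the image is down to the model rather than the wording.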

#

I don't think there is a perfect way to structure the prompts because the AI model is a bit unpredictable, but some things work better than others. I've started to collect notes on it, and posted them here:
#1245195468615450765 message

#

You can also ask ChatGPT to structure your prompts in a way that makes them editable, but even in ChatGPT 4o it tends to do some things that make no sense at all, so you kind of need to look at the prompts closely to keep them editable

#

I'm not sure why -- but the DALL-E 3 images generated a year ago seem to edit much more smoothly than the current ones (regardless of what the prompt is)

#

It could be the English transformer model for DALL-E 3 needs to be an old version (like not GPT 4o) in order to maintain backwards compatibility with the old images and doesn't quite grok what GPT 4o or the previous version generates (for editing)

quaint kayak
#

I asked DALL-E to generate me a watercolor in portrait but it keeps generating images with a large white border. I even specifically asked it to not include any borders. Anyone have a clue how to stop it from including these borders and just generate an image edge to edge? Here is the prompt I used:

A watercolor painting of a serene lake on a moonlit night in portrait orientation. The entire image is covered without any borders. The painting captures a calm lake with a smaller moon shining from the corner, reflecting its light onto the lake. Stars are twinkling in the sky, adding to the tranquility of the scene. The colors are soft and muted, with blues and silvers dominating the palette.

empty kelp
quaint kayak
empty kelp
#

i had like hundreds of images from a year ago where there were borders (probably because there wasn't enough processing time allocated to complete the image) -- and a year later it's able to complete it

quaint kayak
#

Hmm let me try it again from the beginning

#

Nope still not working

empty kelp
# quaint kayak Nope still not working

Try asking it to "complete the image". Like the image on the left here was created a year ago with borders. I selected the white part and told it to complete the image in the selected areas, and it completed it perfectly

#

It's able to do it flawlessly if the image wasn't complete because it uses the exact same prompt and seed

empty kelp
#

I edited the image from a year ago, selected the entire black border on both sides at the same time, and typed:
can you complete the image in these two sections

#

DALL-E 3 used to just slice the sides off like every 1 in 10 images to save processing time, but now you can go back and see exactly what they should have looked like

dim cradle
#

i get cut off after like 25 images in a few hours until the next day smh. there has been no improvement in that since 4o

vapid elk
#

im pretty sure that is not how it works =P

#

it just did some solid color borders because a good amount of the images in the dataset have that

#

¯_(ツ)_/¯

#

for example, if you ask it to make a portrait, it will sometimes make a frame around the image, trying to make the actual concept of a portrait hanging on a wall

#

generating the images always takes about the same amount of time and processing power per image, regardless of the content
(with the exception of the computing cost of the text embedding, moderation process, filters, etc..)

#

but solid color patches do not have any significant difference

#

when it gets slower it is just because your generation request is queued behind other requests
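A toy way to picture that claim, assuming every image costs the same fixed time and requests are served first come, first served. The function name and all numbers are made up for illustration; this is not how OpenAI's servers are known to schedule work.

```python
def estimated_wait(requests_ahead: int, seconds_per_image: float, workers: int = 1) -> float:
    """Seconds until your image is done in a simple first-come, first-served queue.

    Every request is assumed to take the same `seconds_per_image`, whatever the
    prompt is; only the number of requests ahead of you changes the total.
    """
    if requests_ahead < 0 or workers < 1:
        raise ValueError("requests_ahead must be >= 0 and workers >= 1")
    rounds_before_yours = requests_ahead // workers
    return (rounds_before_yours + 1) * seconds_per_image


# Same prompt, same per-image cost, different load on the queue.
print(estimated_wait(0, 15.0))              # nobody ahead of you
print(estimated_wait(4, 15.0, workers=2))   # four requests ahead, two workers
```

In this model the content of the prompt never appears as an input, which is the point being made: a slow response says something about the queue, not about the picture.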

hot rain
dim cradle
#

how many Images do you guys make a day before you are cut off?

#

4o messages, no problem... images? it will be 25-30 for me and they say that is all for today. or if i make 4 or 5 images in like 10 minutes i get a small timeout smh

empty kelp
vapid elk
#

it is actually quite the opposite, on a diffusion model, since it starts with noise, it might take more work to move it towards a big patch of a solid color (this is my educated guess, I might be wrong)

inner flicker
#

#1021130377026351105 Illustrate a photorealistic cinematic shot featuring Sheikh Mujib, a mid-aged man radiating confidence and sophistication, standing beside a vintage expensive car. Clad in a dark grey suit that exudes refinement, Sheikh Mujib epitomizes intelligence and style, further emphasized by his bold-framed eyeglasses and back-brushed hair. With cinematic lighting setting the scene aglow, Sheikh Mujib leans against the car with an air of assurance, projecting an unwavering confidence. His gaze is directed forward, reflecting his steadfast determination and vision. In the background, a picturesque mountain looms majestically, enhancing the timeless elegance of the setting. This composition captures the essence of a distinguished individual in a photorealistic manner, inviting viewers to appreciate his poise and charisma in this cinematic shot.

empty kelp
#

it sort of moves towards the idea of an image using vectors that have direction and magnitude

#

if there isn’t enough processing time the image ends up with all sorts of anomalies, and DALL-E will sometimes only render part of the image

vapid elk
#

I understand what you mean and you are right, but I don't think it is actually the cause of the problem you said

#

I think the fact it makes "cropped" images is more related to the dataset than some sort of optimization or bug in the amount of steps

#

btw, I laughed at this first image, I found it really funny for some reason xD

empty kelp
#

it also does a thing where it bounces an image back and forth between the diffusion model and the GPT to make adjustments when it isn’t sure how to draw something, and it goes through all the safety checks over and over again

#

the overall process requires a massive amount of processing power, and they need to cap it in many ways so the images aren’t too expensive to generate

empty kelp
#

The only important thing is that, “natural color, texture, and lighting” will make Santa and the athletic and diverse female elves look really good in your images

dim cradle
#

pretty solid photoreal

#

only thing holding it back really is the 'dalle face' of the elf in the second photo.

haughty spruce
#

not enough

dim cradle
#

empty kelp
#

ok, i'll make a more realistic one

dim cradle
#

its realistic i just mean there is so often that same face for women in images from dalle

empty kelp
#

First attempt... hehe

empty kelp
agile peak
#

Polaroid gave me nearly photographic level, for fictional subject.

empty kelp
agile peak
#

Hasta la vista - Pepe

teal sandal
agile peak
regal jewel
#

Ridiculous! "I'm unable to generate the requested image because it doesn't align with the content policy. "

#

"Godzilla and Japanese people meme where Japanese people in Tokyo see Gozilla and start screaming "Godzilla" and pointing at him in fear"

#

You see that in every Japanese made Godzilla movie.

dim cradle
#

godzilla may be the problem because it is owned by some studio

#

it does not make any copyrighted stuff

regal jewel
#

Possible. 1) I never said to actually make Godzilla, 2) There were a few different Frankenstein movies made; however, the Frankensteins all looked different because of copyright issues with making the monster look the same.