#Visual Continuity in Multiple Images

1 messages · Page 1 of 1 (latest)

red ore
#

==========

This is "Part 1" of my notes about how to structure DALL-E 3 prompts so that characters, clothes, hair, and other visual elements will appear visually consistent between images -- and will survive an image being edited.

This the information in this first part is directly from ChatGPT 4o (except for my questions), and not everything has been tested thoroughly yet

=========

#

.
Does DALL-E hold onto an extra representation of the images after they are created that is referenced by the GenID. such as a high level vector representation of the image, or other formats?
.

#

.
How does the image edit feature work in the ChatGPT web interface if there is no other representation? It just edits the revised prompt in english?
.

#

.
Are there specific terms that DALL-E recognizes in the prompt such as color, texture, or light that correspond to the internal representation of the image?
.

#

.
Is there a heirarchical structure to the prompts, and an order of operations by which DALLE reads them? For example how do you structure the prompts so that you can change the clothes on a character, or the style of a scene. What order or hierarchical structure should all of the elements be in that you described?
.

#

.
Original Prompt (Scene 1 - Park):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. A wooden bench. No other people, animals, or additional objects in the scene.
Modified Prompt for Different Clothing:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. A wooden bench. No other people, animals, or additional objects in the scene.
Modified Prompt for Different Style:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a bench in a park, reading a book. The background is painted in an impressionist style with trees and a playground. Soft, diffused light. A wooden bench. No other people, animals, or additional objects in the scene.

#

.
Example Prompts with Structured Hierarchy

Scene 1 - Park:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Scene 2 - Beach:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is walking along a beach during sunset, holding her sandals in one hand. The waves are gently crashing in the background. Soft, golden light. No other people, animals, or additional objects in the scene.
Scene 3 - Coffee Shop:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting at a wooden table by the window in a cozy coffee shop, sipping coffee, with a laptop in front of her. Dim, warm lighting. No other people, animals, or additional objects in the scene.

#

.
Exactly what elements is it possible to change without changing other elements in a scene?
.

#

.
Example of Independent Changes

Original Prompt (Park Scene):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Changing Clothing:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is now wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Changing Action/Pose:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is now standing next to the bench, waving her hand. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.

#

Changing Lighting:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a park, reading a book. The background has trees and a playground. The scene is now lit by a warm, golden sunset. No other people, animals, or additional objects in the scene.
Adding Objects:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. There is now a bicycle next to the bench. The background has trees and a playground. Bright, natural light. No other people or animals in the scene.

#

.
Is there a specific grammar and syntax for DALL-E prompts to create scenes that can be edited?
.

#

.
Example Prompts

Original Scene (Park)
young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Edited Scene (Changing Clothing)
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is now wearing blue jeans and a white t-shirt. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Edited Scene (Changing Environment)
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile. She is sitting on a wooden bench in a bustling city street, reading a book. The background shows tall buildings and busy sidewalks. Bright, natural light. No other people, animals, or additional objects in the scene.

#

.
Example of Structured and Modular Prompts

Original Scene:
A young man with short black hair, wearing a red shirt and black pants, is standing in a modern kitchen. He is holding a cup of coffee and smiling. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.
Edited Scene (Changing Action):
A young man with short black hair, wearing a red shirt and black pants, is standing in a modern kitchen. He is now cooking at the stove, with a focused expression. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.
By following these guidelines, you can create structured prompts for DALL-E that allow for specific elements to be edited without altering other parts of the scene. This approach ensures clarity and precision in the generated images.

#

.
How do you encapsulate elements in the prompts so they're self contained and won't be affected change that happens before or after them?
.

#

.
Example Prompts with Encapsulated Elements
.
Original Scene (Park)


She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```
#

.
When making changes, ensure each section is self-contained and explicitly detailed to avoid unintended changes to other parts.

Changing Clothing


She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```
**Changing Action/Pose**
```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is now standing next to the bench, waving her hand. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.```
**Changing Environment/Background**
```A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a green dress with floral patterns and brown boots. She has a small star-shaped tattoo on her right wrist and a friendly smile.

She is sitting on a wooden bench in a bustling city street, reading a book. The background shows tall buildings and busy sidewalks. Bright, natural light. No other people, animals, or additional objects in the scene.```
#

.
Example of a Highly Structured and Encapsulated Prompt

Original Scene:


He is standing in a modern kitchen. He is holding a cup of coffee. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.```
**Edited Scene (Changing Action):**
```A young man with short black hair, wearing a red shirt and black pants. He has a small star-shaped tattoo on his right wrist and a friendly smile.

He is standing in a modern kitchen. He is now cooking at the stove, with a focused expression. The kitchen has white cabinets, a stainless steel refrigerator, and a marble countertop. Bright, natural light is coming from a large window. No other people or objects are in the scene.```
By following these practices, you can create prompts that clearly and independently describe each element of the scene, making it easier to edit specific parts without affecting others.
#

.
Is everything in a prompt translated to English (if it isn't in English already) before being interpreted?
.

#

.
How does the system handle things that overlap. For instance, if a character is wearing a long sleeve shirt and you say there is a tattoo on their arm -- but the shirt is already covering where the tattoo will be?
.

#

Example

Conflict Scenario
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt and has a small star-shaped tattoo on her right wrist.
In this scenario, it's ambiguous whether the tattoo should be visible given that her wrist is covered by the long sleeve shirt.

Clarified Prompts

Making the Tattoo Visible:
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt with the sleeves rolled up to her elbows, revealing a small star-shaped tattoo on her right wrist.
Tattoo Covered by Shirt:
A young woman with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt that covers her arms. She has a small star-shaped tattoo on her right wrist, which is not visible due to the sleeves.

#

.
Additional Examples with Overlapping Elements

Glasses and Hair
Ambiguous Prompt:
A young man with short black hair and glasses. His hair is styled in a way that it covers his eyes.
Clarified Prompts:
(Glasses Visible)
A young man with short black hair, wearing glasses that sit firmly on his nose, with his hair styled neatly to avoid covering his eyes.
(Hair Covering Glasses)
A young man with short black hair, styled messily so that it covers his eyes, obscuring his glasses.

#

.
Example with Structured Details
.
Original Scene:
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt and has a small star-shaped tattoo on her right wrist. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
Clarified Prompt (Tattoo Visible):
A young woman in her mid-20s with long, wavy brown hair, blue eyes, and a fair complexion. She is wearing a long sleeve shirt with the sleeves rolled up to her elbows, revealing a small star-shaped tattoo on her right wrist. She is sitting on a wooden bench in a sunny park, reading a book. The background has trees and a playground. Bright, natural light. No other people, animals, or additional objects in the scene.
By explicitly clarifying how elements should be handled when they overlap, you can guide DALL-E to generate images that accurately reflect your intent.

#

.
How is it possible to view a character facing in different directions. What if you wanted to have the same character turning 30 degrees in each successively generated image so you could view them from all sides? How can this be done without overlaps and conflicts?
.

#

==========

This is the end of Part 1 -- this is only about 3% of it...

Please feel free to comment!

This is a link to the beginning:
#1245195468615450765 message

=========

red ore
austere stream
#

I ain't reading all that cooldog