#Drawing needs a way to specify geometry of scene

1 messages · Page 1 of 1 (latest)

celest runeBOT
#

Reported by @oak coyote

Bug Report: Drawing needs a way to specify geometry of scene
`Steps to Reproduce`

Creating a complicated scene with Dall-E needs a way to specify accurately the origin of the ligh, the orientation of the object and the position of the camera. Ideally an engineer plan with x y z coordinate system; instead of vague text indications like "more to the left". Otherwise we enter in an endless cycle of corrections where hallucinations bring us further and further away from the desired result.

`Expected Result`

a) In this creation, I wanted six different pyramids aligned in a row. Which implies that they each have the same perspective, same angle of view.

b ) In a different example, I had an already constructed landscape with well positionned elements.

`Actual Result`

a) Each pyramid had a different perspective, making manual correction impossible.
(I have images, but how to add them here?)
b) instead of rotating the whole scene as instructed, Dall-E mixes the elements and place them at random, making a different landscape.

`Environment`

Dall-E via the most recent GPT, windows 11, Brave web browser

#
Additional Information

Please provide relevant details to help resolve the issue, such as:

  • ChatGPT Shared Link (if applicable).
  • Screenshots or videos demonstrating the problem.

-# ➜ Need to contact support? Visit the OpenAI Help Center.

oak coyote
# celest rune

Six pyramids made each starting form the previous... Six different perspectives!

#

-Image 1: correct
-Image 2: I asked Dall-E a different view, from further and from a dirrerent angle. On a dozen attemps, none went close from the logical result. Here is just one example. However Dall-E was able to get a side view of an aircraft, starting from a cartoonish view from the front. This may help to understand why it failed at the landscape: the aircraft is a known one-element scene, while the landscape is a collection of elements. Dall-E knows to represent them, but not to place them properly. This is why I took this example to illustrate this bug.