I'm using GPT4 to come up with prompts for somewhat complicated images to generate with DALL-E. I like how it's capable of generating large amounts of text within the image itself (I know it makes frequent typos, but that doesn't really matter for me).
The problem is that the more complicated the image prompt becomes, the less likely it is to even attempt to generate words in it at all. I would like to be able to guarantee that I'll have words in my image without having to retry or change the desired output.
Does anybody have any prompt engineering tips to convince DALL-E to generate images with words?
Here's an example of a simple prompt you can put in ChatGPT with DALL-E enabled that sometimes fails in this manner:
"Come up with a complex scene for a humorous image with words in it, then draw it."