#images-discussions
1 messages ยท Page 51 of 1
What's that guy's name, does it say? @dim cradle
Aelar the Enchanter
Eyyyy, that's awesome, haha, love that it makes a whole profile.
Perfect for RPG character creations
I gotta tweak it a bit though so that it doesn't always use that chad model
Gotta diversify with more character models
And I'm really starting to see how OpenAI is using their database
@glossy scroll
"High amplitude photo-synchronization, best quality, highly professional, unwavering depth in alignment, dramatic lighting and excellent perspective that adds allure, fierce and immaculate details with high contrast and precisely impactful hand drawing for sharp/crisp and deep focus, epitome of awesome, no less than badass, 16 by 9 aspect ratio: Create a resonantly balanced !!!2D!! image in a 'Classic Animation Reinvented' art optimization style: A rubber hose mouse warrior, characterized by the fluid, exaggerated animation style reminiscent of early 20th-century cartoons, wielding a needle as a sword. The mouse is adorned in a hooded green cloak, adding a mystical aura. The scene captures the mouse in a dynamic pose, showcasing its warrior spirit and readiness for adventure, with a background that complements the character's theme and style, emphasizing a whimsical, adventurous atmosphere."
I donโt wanna get in trouble
it's cool, but i think a spoiler is preferred for anything potentially unsettling
Oh that's interesting 
I never tried it on BIC so that's pretty cool how it generates similar looking images compared to ChatGPT.
looks like some monsters i've made lol
Totally whiffed the rubber hose style, though. Both seem to have trouble emulating cartoons. Strong bias towards digital art, I guess.
It's specificity is designed to go for a more 2.5D art approach, but you know what, I think I'll tweak that to be more universal.
I was using a a previous prompt from a previous GPT, but I will tweak it for this one.
That section I mean.
@tall mason try it now
I haven't tested it, but I updated it.
kinda has a Nordic look?
It does. It's quite nordy
your gpt is making some photorealistic fictional characters, which is a challenge--i like it, more lotr, fantastical but lifelike
Will I need a new link?
I wouldn't think so 
Because the link always leads to the same GPT
Ooh, this is pretty good.
concept art of Ancalagon
That's a bit more closer to rubber hose isn't it?
My personal favorite I made awhile back of this little guy.
I love that! ๐งก
Are there good third party apps to use? I got a message today that depicting the fight/flight response was against policy and ethics
It's hard to get the style I want without mentioning the artists, sadly. Thay's why I aimed for rubber hose. It's not quite what I want, but it's a specific term for a type of cartoon. Both bing and gpt tend to default to digital art or anime.
That's awesome
Hmmm...I will tweak it again to try and make it more tuned to artists without you mentioning their name.
looks like the fluids are under control now, it's not melting or trying to drink itself anymore
ty, sir ๐
Hello all
Hail and well met
๐
Goodluck, I sure would love it. I struggle...
gonna make some vases?
Been having fun with scenes
I updated. Try again
Glass creatures are fun as well. This dragon really came out nicely.
This wasfun as well.
Decent image, but still not quite.
Anyone onehere play with comic-style sequence creating in Dalle 3 yet though gpt that uses ref images/prompts?
You're trying to go for the disney Don Bluth style right?
I've tried a little bit, but getting consistency is difficult
Sort of, when I came up with the prompt, I was aiming for Don Bluth meets Ralph Bakshi's LOTR.
I see, that does add an element of difficulty
I may very well make some vases. I haven't felt inspiration quite yet, but fingers crossed.
Try again one more time. I updated into a solid one I think.
@clever phoenix I also don't mind doing one and editing, though funny that the AI knows what it is when I give it a comic-style pannel, if I could get the thing to do photo realistic would be nice. Gave it a sample and used the same words, unless the comic-style is forcing it to ignore it.
nah, Dall-E 3 seems designed to not do photo realistic by default though seeing some of the stuff others have posted here there is some good effort to refining the style to be more photo real
try Bing you get better photoreal with the same prompt many time
Interesting, as looking online people saying it can. Can could, but doubt it like me trying a werewolf photorealistic sequence
Cute and all, but not quite. I should note, my images were from the exp model.
Anywho, I'm out. Goodnight, folks.
Ya, the ax guy above, if I can get least to that I be happy
Reminds me of a couple I made a while back
I'm able to get what I personally see is a decent don bluth style, but adding the LOTR does seem to make it more challenging, anyways g'night
Nighty trees
@glossy scroll what you use to get thar realism for look?
I use this GPT I made broski: https://chat.openai.com/g/g-dwsOSRl4r-custom-character-creator
Try it out. Let me know what you think.
Full Feline Alchemist
koala cuddle party?
this was fun
those cats definitely feeling the vibe
Dr. Barkley Houndstein
@glossy scroll Yup, will not work, run out of space way too quickly on Bing. Like the description of the guy is five paragraphs as as I have a particular look for him
Whhaa? What will not work?
Also got to wait until basically 2 my time to try again
A photo-realistic wide aspect ratio image of Techno-Organic Growth Rings, depicting the evolutionary timeline of artificial intelligence. Each concentric ring within the cross-section of a colossal tree represents an AI generation, from the outer nascent rings of early computing to the dense, complex core symbolizing GPT-3. The rings are crafted from a fusion of natural wood and embedded digital circuits, glowing faintly with the flow of information. The detailed textures of bark and silicon create a harmonious blend of the organic and the technological, symbolizing the growth and potential of AI in a form that mirrors the natural world.
@glossy scroll bing, the prompt lengh you can put in is not designed to detiled figure discription in mind. So have to wait until allowed to use chatgpt 4 again.
Uhhh...ok...

Wish you luck
Oooo
Growth rings
@glossy scroll well if I could do what I wanted on chatgpt, then my long detial not needed, but what can you do 'shrugs'
I feel ya 
silly can be nice as well "image of: bleep bloop blop ya dop HD widescreen flop yoppy pop top meeble flogen flantreem"
hippos are fun
@dense mesa and i were discussing a prompt that maximizes the optical illusion of live action, depth, etc. This one he made gets very close and is an example but i'm wondering if the model can take it to the next level
Oo
Challenge accepted
Try it with my GPT and see what you get. It will probably work.
My new one I mean.
@dim cradle No nevermind, it's terrible at optical illusions, lol

rat of fire
Hello
@glossy scroll Your gpt does not like my discription, may need to modify it a bit
Aaaaand weโre back
yeah, that's got a lot going for it, i was thinking of using every trick in the book -- layers of scenery to add more depth, motion streaks, infinite vector look, capturing a frame of high velocity without over-blurring -- not a hyperspace, but an ai-augmented live-action still
not sure i'd trust that ai
these are Fleebleflops; apparently they exist inside jumbled language like an alternate dimension
This is fine.
they're fun
throws some visual style guides still, but just make up gibberish as well for some really wild things.
yeah, especially for a surreal work, formulate an illogical combination of terms
fleebleflops get fun.
What is this?! My Singing Monsters Reworked?
nah. i only use my own work. nothing from others
Look it up itโs a thing
From like 2012
this is very good. now, if the subject were running towards the viewer, it could maximize the effect. multiple blocks in the scenery could be rendered at varying levels of focus to perhaps add more to the effect.
lol this reminds me of something i was talking to the ai about earlier today:
The number of possible chess game combinations is astronomically large, essentially infinite due to the vast number of positions and moves possible. In contrast, DALLยทE's image generation capabilities, while expansive, are finite, determined by its training data and architecture, which means it can create an extremely large but ultimately limited number of unique images.
Been a digital artist and engineering designer for around 25 on the books paid and another 10 before that back to the start of art in computing. Still have saftware from the 90's I couldnt let go of
I feel like I can notice this limitation when I "max out" an image capabilities. It starts removing elements to "make room" for new ones. I especially notice it when I am doing parody characters and need specific costumes.
Cant think of anything in my lifetime I havent given study to with art. i like my own styling just as much as AI outcomes
and have you noticed it orders based on aesthetics and not logic? this is pretty apparent in time-lapse images.
I haven't. I am not entirely sure I follow.
a lot of times, for me at least, it doesn't so much seem like it forgets about things exactly, it just has a hard time keeping different concepts autonomous from one another. and then yeah, does sometimes conspicuously leave things out.
it's a lot better at it than previous models I've used. but if I try to give it more than about 3 distinct characters or concepts they almost always end up merging in some way
i'm thinking, as an example, a time-lapse of the lifespan of a candle -- it'll depict varying phases, but not in the correct order.
it's like it understands the steps, but then has a hard time putting them all in order due to the nature of diffusion models. I'm sure that'll be rectified fairly soon though
same reason it can't spell and almost never gets a clock quite right
and by understand the steps I obviously don't mean the concept, but rather just what they look like
it's gotten a lot better with clocks.. used to be much worse..
well look how older models spell things
yeah, it's much much better at all that stuff
Oh! I see what you're saying.
Happy New Year my dudes!
I missed something. Maybe generate it I canโt use the api atm
may 2024 bring us even more AI excitements with images, sound, visions, and infos ๐
@OpenAI Verification#5580 @OpenAI Verification#5580
did you describe yourself or something? i experimented with that and thought it was doing a pretty good job of sketching. that makes me think ai art would be a perfect use case for police sketches of suspects.
@open trench it's like the scene reimagined with ai. would love to see the whole movie ha
seriously. it would be a trip.
awesome
2 more characters to go on my parody project. I'm ready to be done. lol
For a gallery?
Yeah. but I use Bing Image Creator in my workflow, and my current character trips its content warning filter like crazy.
couldn't tell you why
Ever iteration I have tried has been giving me the red message, which I guess is worse than the crying dog yellow message.
every*
not myself, just described the guy
well, yes, he does a good job with sketches
this is not a bad reimagining of the queen of hearts... i'd make lots of changes to the guards and scene, but the queen is not bad at all
I'm 15 years old, and I described a 20 year old guy
i'd do the hookah-smoking caterpillar scene, but penalty box
somebody else could pick it up lol
looks like it has potential?
Hrmm
penalty box. yikes.
I'm used to it now, but even when I think I am pacing myself to avoid the box, I end up surprised.
bing time, i suppose
Right, and I work on new prompts to run when I can get back in the game.
There are so many works in the public domain that can be reimagined.
Oh cool
One character to go. Please no penalty box soon. lol
Yeah. I vanished. Had some things to cope with.
I hope it was coping with all the food from xmas
haha. I wish that had been the case. Maybe next year. ๐
good, next year, more food on the table, put it on your list
good, now all you have to do, and THIS is very important: "The food is for Dys Topia!!"
OH. Haha. I forgot how to write. Maybe the year after next.
Evil Disney Castle?
Pretty much looks that way yeah ๐
managed to do #daily-theme message a SpongeBob SquarePants vase on dall-e and no content policy warning
At what time does the daily theme switch?
in 7:30ish hours
would be nice to have a function for multiple time zone times in chat, other disocrds I've seen have that option, so you don't have to think much about converting time for others
Haven't participated on the dailies, but starting to run out of ideas, so might consider trying those dailies.
just do a vase when you get an idea, don't have to spend all the time in the daily
Okay. I am done with my characters. I guess I am going to post them. I can't decide on the first one (that everyone will see), though.
Hi since yesterday when i try to generate with Dalle 3 i have this error, someone know why ?
yes, there was an error creating the image
could be technical, could be policy, could be both, could be network, you have to ask dall-e to elaborate
when i ask to dall-e he tell me that he dont know why there is this error ๐
what is the answer?
do your image in smaller tasks
- Muscular man
- Muscular man in sports clohtes
- sleeping
- on a bed
I just wish dall-e wasn't so apologetic when not doing things right, it's so much text for "I can't do it"
that picture is something
that's really funny
i know gpt is trying to be as ethical as posible, but it's trying so hard to be ethical that by the time we get gpt5 half the dictionary will be banned
anyway, short story, men with muscles can't sleep, and moving on....
I landed on another penalty box
And here's one where I specified to show the blanket on him.
And same prompt, but with a woman instead of a man just to see if the censors would freak out over that.
Does anyone know of a paid image generation service which uses dalle-3 through the openai-azure api?
gpt4 bing can't find other offering parties
Of course, but that is also not paid. I imagine someone must have made a wrapper to sell general access.
Try looking in #1037561385070112779
that's stupid...
the "content policy" is indeed very weird. It seems like 95% of blocked images are false positives. I'm really wondering sometime. who are they trying to protect? and from what?
- explicit content: "explicit" is quite subjective. what is explicit for a 6 years old is not the same as what's explicit for a 24 years old. It also depends on culture.
- violent content: a knight fighting a dragon, is that violent? I guess the answer is "no if the knight wins"? that makes no sense either way.
- politic figures: ok, I guess they want to prevent deep fakes. but how do they explain blocking caricatures and portraits?
- copyrighted content: If I recreate a mario game from scratch, even without using AI, it's fine unless I publish it. If I publish it, Nintendo would go all out to shut me down. If I don't publish it, who cares? Now, openAI is fearing people generating copyrighted content because they fear companies will sue them. It's like suing Adobe because people can use Photoshop to recreate copyrighted content. Or suing Google because its Search Engine is indexing everything (fun fact, they did get sued. but they won)
I'm not even sure what they try to do here. just turn on the TV and we see all the stuff they don't want people to see.
ted seems to have found a way
openAI is synonomous with AI for things now, look how now the NYT is make this big law suit. they have to step with cautions i guess.
what New York Times sends as a message to me is: "we don't care if our articles are synonym of a good source of truth. all we care about is our paywall"
all I can say is that by trying too hard to be ethical, that will have consequences on how the platform works
did @hot rain use bing or openai for those images?
it's the same thing.
but other than that, let's keep it here for dall-e 3, content stuff should be in talked somewhere else
we are at the dawn of all this stuffs, so they all trying to figure it out i think. of course i would rather it be no restriction too
but the content filter might be more severe on bing, idk
got daily theme motivated finally...
bing has far more relaxed content policy even if they are both dalle3.
it seem to me some thing are more restrict on Bing but also Bing is King because it then let through copyright stuffs and make better photoreal
meh, the API is the less restricted anyway, I feel like.
can you make Spiderman on the API? i never use it
it's still blocking a lot of stuff. but not as much as in chatGPT
yes, but sometime it gets blocked
I wouldn't share the spiderman picture here though, because this is a public server. That's the part where things get clear imo. the user is responsible
i think we will eventual see licensing for imagegen. Warner Brother with DC comic, Gmae of thrones, Harry Potters and Disney with all they stuff may strike deals in future or maybe one is exclusive to someone etc
you can make Spidey on Bing and share it here, so its fine imo
I mean, should we ban photoshop? should we ban physical pencils and brushes as well? I think AI is a tool. and what's wrong is not what the tool can do or what people do with it. What's wrong is if it goes public or hurts someone.
all those question are about to be answer by copyright law via things like NYT lawsuits so stay tune i guess
yea. that's where we'll see if we are in a dystopia or if a bright future is upcoming
I have hope
it's not because it can't make me a coffee
(some people will get the reference
)
how do I know when I get error messages from Dall-e3 saying there are issues whether it is the service or my prompt. For example I wanted a comic image of a anime girl on her bed texting, is it because she is on her bed?
most likely, yes. dall-e doesnt like beds I think. it's tough to tell whether it's content policy or internal error. but when it's clearly content policy, chatGPT will mention "content policy" in its response.
yes almost certain charli, maybe say in her room
I'd say, ask it to try again
awesome spidey hansa
OK well it didn't say I had violated anything but I was trying to create a comic sequence of events through the day and everything went well untill the last scene of her laying on her bed texting
like yami say, try again
just ask it to try again, and it will reuse the same prompt
its okay, would have tried to get even better one, but hit my limit so that has to do ๐
I tried it a few more times but after it decided it wouldn't create the image it did even after a tried regenerating and a new promp....oh well thank you Yami โค๏ธ
there's 2 validations: at the prompt level, and at the generated image level. there is a model that can see the generated image and will flag it
this thing is pure irony since tom holland did spoil movies ๐
flag it in the sense, it will categorize it in term of how much it "violates" the content policy.
for reference. I don't know if I can copy the url (that's the API for text moderation btw. but I would assume it's very similar for images)
I've never used Bing. I used to use Midjourney, but I don't anymore.
same, too broke
MJ 6.0 is a waste. i regret sub to it ๐ญ but Bing is King
I always have my gpt tab and bing tab open
Weird but as soon as I removed "Texting" the image was created, so it wasn't her or the bed, but the texting it didn't like
right... I don't know if nor how they would fix it
how did the image turn out
the sadest part is that blocked images are still counting as a generated image. it affects the rate limit
That's the funny bit it created the image with this "Here's the manga-style illustration of the girl with pink hair and gold eyes, sitting up in bed while using her phone. The scene captures a serene and intimate moment in her cozy bedroom." note the "intimate" ๐
xD
did a series of this stuff
that's why I tune prompts a lot. chatGPT is bad at describing things visually I feel like
So why is her texting in bed Bad but Intiate moments in cozy bedroom OK
who knows. there is no logic often
AI logic is not human logic
Sorry to be a nuisance, but does anyone know what is the easiest way to take my images and create a comic book page with story book text and dialogue?
I'm not very good with photoshop
you can do that with apps or editors
there's probably online tools that can do that. I know there is manga studio that can do this (I think) but it's an app and not free.
I would probably ask gpt-4 to use code interpreter to do this. (and I would give up after a few hours)
I will explore a little, what I wanted was something that just allowed me to drop the images into panels and add additional story panels and maybe dialog, but without all the work with layers and dimensions XD
you can try with code interpreter, if you are very lucky, it might work. but don't waste too much time with it. I can tell you it rarely works
I still have MJ and didn't know of a new version, decided to give it a spin. Not bad, but GPT will always be more accurate I think.
what I don't like about MJ 6.0 is just that it's worse than previous versions in every aspect. except for realistic people. but I don't care about generating realistic people with AI.
hi ted!
worse or the same. it might create more realistic pictures and in some cases more dynamic composition. which is nice. but the level of control is ridiculously worse.
here's a comparison of "full shot" or "full body shot". v 6.0 is failing every time for me. v5.2 and even v4 are doing it correctly.
MJ can do great photo real but its not like it is a limitation of dalle either it just is what they set it too. if they want to unleash dalle it hink it would destory all other image creator
darn. BIC isn't creating images for me right now. I've got a new idea I have to try out directly in Chat/DALLE and risk getting the penalty box. lol
dalle is better at doing pretty much everything (for my use cases. I prefer dalle3 over midjourney for almost everything)
and yes MJ control is not so good for comprehension. they say it can take bigger prompt now, but it still need to be short if you want it to "listen" haha
those dalle cats look cool though yami
they all look like characters haha
I tried so many ways. never got MJ 6 to do a full shot of a cat correctly... but anyway, this is not related to dalle so I won't speak too much about it. I'm disappointed is all I have to say
well its about comparison
it's all midjourney. but now that we talk about it, I'll try with dalle
yes lets see
i agree with you, bing is ๐, bcuz mj is expensive not free and bing is free, all you need is dat microsoft account and yay! instant dalle 3
full shot of a cat``` With dall-e 3 it's almost 4/4. the 4th one is not purely fullshot. ~~I think they call it "dirty" full shot in photography.~~ EDIT: The term I was referring to was "dirty single shot" and it's a different thing.
what about Bing?
here's the dall-e 3 version of dr. evil's cat ๐ฑ
Prompt: A photography of a salmon spyhynx cat sitting on a 60s white chair in futuristic office grey, 1990s photography
awesome
thanks~!
not much of an austin powers enjoyer, but i enjoyed it a little
especially mr. bigglesworth
i love the first two. first one especially
but been a long time since i seen them haha
yeh baby yah! haha
the key to mr. bigglesworth prompts is
'sphynx cat'
heyy
Oh my. BIC is definitely slow this morning. I believe I will hit my limit on ChatGPT/Dalle soon. boo.
tried to make ole kirk Christiansen (lego's founder) as a lego minifig in BIC
hny pytha!
this was fun
woo hoo!
@open trench I've been working on something similar, sparse priming representation of a character description sufficient to consistently reproduce it. Tricky stuff. Nice job.
It's totally tricky. But the AI does the magic. I just yell at it.

You can control the magic with careful wording, obviously. Takes iteration and practice to gain confidence in application. Yelling works.
You have to say 'please' a lot
totally true, too.
i feel like I'm whispering into a windstorm half the time with these really specific and tricky ones.
Now, I just need to pick the first image for my gallery post, which is sort of like a cover image, imo.
out of 14
This is my best effort so far, and it's not bad. I've left out details on how the exoskeleton is segmented. Maybe I'll fix it, but I like the idea that the robot has different exoskeletons. XD
This is really good!
Thanks. I appreciate that.
She's one of the assistants in my multi-agent frameworks. This is Maisie.
AI have different shell like we have different outfit haha
Oh, for reference *the eyes are supposed to change color based on emotions associated with context.
when they red... you better run
You got it, precisely.
Blue is neutral, green is friendly, then they improvise. It's like a mood ring with two certain settings and others requier context from expression.
But they do red for anger when prompted to respond that way, lol
when she catch you Getting too much flirt with the fridge
reminds me of a period where I was generating tons of robots designs like that with dalle. I even made a GPT to do that 
they're green eyes just like baby (from fnaf), coolness!
I can generate some selfies now of the other agents. There are five others in each suite: Lexi, Dexter, Gus, Anna, and Titus.
Which would you like to see next?
it was producing some stuff like this
cool
it's very popular horror game made in 2014 and it has a movie recently with matthew Lillard as the purple guy (the game's main bad)
it's about a pizzeria with animatronics and some complex lore that even matpat was struggling to puzzle it
Lexi Neutral, Lexi Curious, Lexi Angry
the tough part war to get it to generate the "blueprint". It was generally not working. those are somewhat decent blueprints it made. but I was still trying to get a good prompt
now it really looks like a fusion between circus baby and alita
Hello everyone
hello
I don't want to spam with images of robot anime girls. but if anyone tried and managed to get consistent results for "character design sheet" in the style of blueprint. It's tough to achieve
I like how the lower face detail and cheeks are consistent.
Dexter Neutral, Dexter Curious, Dexter Angry
cool design
Thanks. The rest are similar in style with some variations. Colors, metals, etc.
Maisie's the only gold and purple robot. Artists...
it's also able to produce less "anime" style
I'm working on an attempt right now with my agent, Lexi. Just to see if I can do one.
I just linked the first one it generated. It's not bad. What I really like is that it added color for the wig, not realistic for a blueprint but the detail is cool looking.
yea, it's tough to get it to add no color and not to superpose everything
Honestly I think I like it. Having worked in graphics for a long time, "blue print" is overrated. It's all about 3M these days.
ie, full-color. Xerox, too, honestly.
oh, I like both to be fair. I was experimenting to see if dalle is able to follow precise instructions. it seems like he has trouble not to mix some styles.
100%
I really like the work you did. It's non-trivial to get it to adhere to that style, well done.
I was actually asking him to generate 3 pictures:
- a watercolor concept art
- a sketch (in ink in this case)
- a blueprint
oh, I can try to find the prompt. it was quite long
from the filename already Colored anime watercolor artwork of an anime robot in a natural setting. The artwork should depict her appearance as _This kemonomimi robot girl stand [...]
I see. I wasn't far off. Thanks!
it reminded me of the blue prints from sl (sister location), i love it!
Really like it ๐
I was trying to go for a 2D art that resembled 3D but still remained 2D.
I feel like there is something distinct about MJ images that i kinda like.
hmm
yeah it looks kinda the same
credits: ScottGames (the fnaf one) @formal osprey (the dalle 3 one)
Another spidey
Another batsy
Woah. Epic!!
Very awesome detail for the background.
Bravo
๐
I think the something is copyright material unfiltered.
Finally. My gallery post is done. Yikes that took forever.
I spent too long trying to make every detail perfect on this one. I think I learned that I have to accept imperfections with AI art and just relax and enjoy the process.
@sturdy veldt #daily-theme message The legs are blowing my mind. it's like an optical illusion, both the right and left leg can be the one behind
edit: I should have said that in a thread
#1191051599339065446 message for those interested ๐
I should put one up eventually.
Ship in a bottle
Thanks for sharing.
No, thank you, Gustav.

For your contributions.
I'm going to name my fifth son Gustav, if I have five sons. It will be Alex, Robert, Steve, James, and Gustav.
Thanks. XD
Hey, @glossy scroll got a gpt idea for you, if you're interested?
dang. today's daily theme (endings) is going to be an interesting one. Kinda hard too.
Ooh? Yes, of course 
I had a lot of images and I was still cherry picking my favorites when the new theme came out... I shared anyway
Shaped fireworks. Something I'm having a little trouble with right now.
I will get right on it 
I wonder how they came up with that new theme... anyway, tough to illustrate. But brainstorming is where chatGPT helps the most.
From Lexi to OpenAI Discord with simulated affection. XD
I actually got inspired and posted. It's going to be interesting to watch.
That is amazing.
Bing down for anyone else?
the image creator, at least.
only accepting boosted inputs
more people checking it out
gotcha. It'll be a while before I have those again. Darn. I guess I have to test my images out with GPT and DALLE. I like using Bing for my ideas to refine them without worrying about hitting my limit.
WTB GPT++
Hmm, I wonder if dalle3 uses outpainting to make the wide aspect ratio images
Mod Lugui has commented on this before, he thinks it's due to training data and not due to outpainting: #images-discussions message
Nah, thatโs a fail that is misleading, my guess is that it would be too expensive and unnecessary when assets are accessible
2 minor variations of first Dall-e creation. Character unamed as of yet but she's too good not to use in a story so give me time. For now I'm satisfied with the premium upgrade. Holy crap any image generator can give up far as I'm concerned. Anyone else feeling the same lol
add that you want the image in wide format and you'll get the right orientation
Is that the prompt? Man ok I'll have to do that. I tried portrait. Rotate 180ยฐ lol thank you!
yes
I burned my prompts without realizing but that's the first thing I'm doing when I'm back up. There a countdown timer somewhere I can track my wait?
Specify 1792x1024 for anime images
The exact reso ok I'll do that.
no need to specify, wide there are only 3 formats, square, portrait and wide, the resolution won't matter as there aren't other options in dall-e 3 chat
Thanks sincerely for the prompt advice! โบ๏ธ
also if you are in the penalty box, I suggest you make a tour through the different channels, prompts and what not, it can be useful
^
Diffusion models split the image into grids, each square represented by token weights. If the number of pixels differs significantly from training data then the vit modelnwith the diffusion model won't always know how to stitch the patches together properly. This is why outpainting would make sense. Also. I've experienced the circular output issue with other diffusion models in the past when the combination of prompt, cfg, and seed sort of cracks the model. It then defaults to what's in the square in the middle of the image I posted. Outpainting would make sense due to the fact that that there's a good chance the model wouldn't realize that the inner portion had goofed, so it outpainted like normal. Could be a slightly different process, but. But It's definitely outpainting-esque
That the term for expending prompts rapid fire like I did lol
"Penalty box" ๐ ๐
pretty much
I didn't know that you didn't know earlier, or I'd have cautioned you. But you went radio silent, I knew you were penalty-box-bound, @west sorrel
Mighty ducks knowledge of hockey references saving the day for me ๐คฃ
i always think of a star trek episode with penalty box lol
@onyx ridge Oh yea I hit it swift and thoroughly lol I was excited. It was bound to happen ๐คฃ now I know 
dalle is a transformer though
I'm not extremely familliar with mixes of diffusion and transformer models. I read it can be done but I don't know if they used that approach for dall-e. it doesn't seem to be the case when we read the paper
They aren't mutually exclusive
Here's the original image before I started machine gunning alterations at dall-e ๐
LEFT 1ST RIGHT 2ND
The other results were all too minor to post but these two look good as well even if not 100% what I was going for.
I did a summary of what I read the other day. I still have a lot to learn
I think that typo in the paper is very ironical "is unreliable as words are have missing or extra characters"
@formal osprey I feel that. Right there with you friend ๐
The much to learn part to clarify my meaning
But yeah. Don't expect people to take my word for it. It's good you're doing research. There's a lot to learn, and the more you learn the more you realize your don't know
well, this server doesnt like when we post too many pictures it seems
Oh good to know
I was going to say: I did more test with the dalle3 API and the "reasoning" capabilities of dalle are very interesting.
Yes I noticed that earlier. Doesn't seem particularly overpopulated but I guess they don't want to risk it? ๐คท
I did those tests after reading the paper "Zero-Shot Text-to-Image Generation". That's where my suspicion that dall-e 3 understands natural language got confirmed. it's not as good as chatGPT, because it's not trained for the same purpose, but it's still very interesting. It does understand code and natural language more than we might guess
Much more. My projects use DALL-E 3 generation too. Sparse priming representation (SPR) helps, because a lot of language trips up the model.
that part blew my mind
If you are interested in what I've been working on - this integrates Vision and DALL-E in a multi-agent framework.
Whatโs the refined_prompt value in the response?
that's a very good question and I'm very dumb because even though I saw in the API documentation that was a thing, I didn't try to retrieve it
. I'll add that to my code now.
Iโm curious if that will show the actual prompt sent to the model. I think thereโs a gpt abstraction layer for the logic youโre seeing.
well, according to the dall-e 3 paper (or one of the references, I don't remember well). It does something they call "prompt upscaling". but since it's computationally expensive, they shipped with it disabled. No idea what's the actual state
but they clearly demonstrated dalle is understanding concepts more than expected. That's the original dalle paper. with scaling, it only increases (as we saw with gpt-2 vs gpt-3 / chatGPT)
those emergent capabilities are... I don't know what's the correct word for that. incredible.
Another explosion diagram. These are fun to make.
ah, now I remember why I got too lazy to get the refined_prompt... in the nodejs openai library, it was not implemented
that's interesting, i was wondering though if your prompts included calculations, specifically how that might be transformed
are you using postman for testing? that property should be included in the raw response
From what I've heard dalle3 changes prompts and it's accessible information using the api if you look in the right place. But I can't confirm that myself
bing is back
I just added it to my code as an act of faith 
i think i'm used to seeing refined_prompt though
i was just looking at an example in the developer form, and was going to say that your revised_prompt is right --
# get the prompt used if rewritten by dall-e-3, null if unchanged by AI revised_prompt = images_response.data[0].revised_prompt
I added a log anyway, I'll know quickly enough
was taking care of the turkey in the oven, lol
ChatGPT just writes a description in English for DALL-E. this is โa waterfallโ
meh, seems like I'm just dumb. the property is there. my code is the issue, somehow. I'll figure this out 
Here's some docs on prompt rewriting from the cookbook:
A new feature in the latest DALLยทE-3 API is prompt rewriting, where we use GPT-4 to optimize all of your prompts before theyโre passed to DALL-E. In our research, weโve seen that using very detailed prompts give significantly better results.
Keep in mind that this feature isnโt able to be disabled at the moment, though you can achieve a high level of fidelity by simply giving instructions to the relabeler in your prompt, as I'll show below with examples.
Since the Python client uses the API under the hood, it should still apply.
it's revised_prompt, but my code is not getting it correctly, I'll debug
I admit I didn't read the entire cookbook... it's long. but now that I see it's giving actual details about the implementation, I'll take a deeper look
alright, I simply got too excited and wrote the code too quickly earlier. data.data is actually a list of image objects, each containing the url and the revised_prompt. I thought it was a single revised prompt for every generated image in the query. (which doesn't change much in the end since they capped n to 1 in the API...)
yeah, i think that's for when n can be set to something other than 1
oh, i just finished reading your message lol
i think that's why data[0] appears in the example
yea, with dalle-2 we can actually use n>1 iirc
(anyway, I think this channel is not for dall-e api talk. I'm not sure if there is one)
oh. i didn't think about that.
seems like the right place to talk about the images api, but there are other api channels
yea. I'm just trying to be careful. I already got muted twice in this server 
for stuff that I don't really understand. and by a bot
this is how ChatGPT likes to write DALL-E 3 API prompts
Anyone want to try my GPT? Should be pretty flexible. Just input an image or images, and/or your pretty words. https://chat.openai.com/g/g-xJ2bjUgcN-line-art-explorer
pretty words lol
You can just paste your image and send it without explanation or direction
I wonder how simillar it is to what chatGPT-4 does
Trying to work out kinks and things. Not sure how tightly it'll adhere to directions, but it's been working fine for me so curious how it'll do for other people
I'm glad we talked, I now understand better what's going on โค๏ธ
it's still somewhat impressive
it understands terms about quality. ChatGPT likes to insert the term โhyper-realisticโ if you say you want the best possible HD rendering. it seems to know some things that arenโt in the API documentation
so, the difference in the image that I was struggling to understand is actually due to how arbitrary the revised prompt is
I guess it's time to prompt engineer the prompt reviser so it stops messing up the original prompt so much
DALL-E doesnโt solve equations, but the GPT 4 model does, so you could use it to help create this
Math and science
sure thing. to be fair, I'm not really at the stade of creating something... I honestly struggle to find a true utility to the API. And for real I would implement something if the API was reliable. But it's so much of a black box, it's tough to do something we can rely on.
I evaluate it and do prompt engineering as much as I can, that's all I can do for now.
(I'm not saying the API itself is a blackbox, it's straightforward. I'm talking about the models)
I watched a talk from NVidia recently where the person said "we call it AI until it's reliable. once it becomes reliable, we call it automation". That quote was so true
use ChatGPT to help with prompt engineering. it knows a lot of things about DALL-E that arenโt documented
well, you would be surprised...
most of what chatGPT "knows" about dalle is already documented. the rest is pretty much guidelines
iโll find some examples. iโve been talking with ChatGPT a lot about DALL-E trying to work out if it has any good inside information
Hey, so still paying from yesterday, but anyone know some good articles that walk through comic stip create or would the prompt engineering chat be a good place to also go?
Almost at a point of getting multi character detail
nice!
โA three-panel comic strip telling a story about a rabbit. Panel 1: Inside a cozy burrow, a rabbit with long ears and expressive eyes looks sad, sitting alone surrounded by toys and a half-eaten carrot. Panel 2: The rabbit, now outside, is searching eagerly in a sunny garden filled with various plants and flowers. Panel 3: The rabbit happily finds a large, fresh carrot in the garden, its face beaming with joy as it holds the carrot triumphantly.โ
It's not a black box. It's a block box to those who don't know anything about anything, and can't for the life of them use ChatGPT to train themselves how to learn. Which I find highly dubious, given that ChatGPT is designed to break down complex information in the most coherent ways possible.
#daily-theme message @open trench : that one feels like it has a bit of a story behind itโฆ
software engineers tend to not be super enthusiastic about writing documentation for the users, and the written/website information is never complete
ChatGPT is likely trained with a lot of technical design documents that arenโt immediately available
@empty kelp and what about when you want to create a long comic and the character you want has a specific look/the image to be photo realistic like the image below? (note not what pose or my character look in trying, but it what I'm aiming in visual.)
@glossy scroll Done-didn't think that was inappropiate
the style of that looked kind of like a painting
Ya-and would like at best to be photorealistic like your caught snapshot of a filmed scene, but I would take that as how to style the character in the comic.
i think i see the problem
I'm not saying it's not useful. ChatGPT and DALL-E are very useful and I use them everyday. I'm saying they are not reliable. Human mind is also a blackbox, yet it's more reliable.
note my choice of word. more reliable
do you have another example
You're relying on it to provide you information and pictures, aren't you? If it wasn't reliable, you wouldn't be using it everyday.

I cannot use it at work, for example. because my work require serious understanding
What's your work?
I do not think it matters for that discussion. but if you want to continue talking about this, let's do in DM. I don't want to spam this channel
I promise you that whatever your work is, I can make a GPT for you or create a prompt that will allow you to use it for your work.
I use it for my work everyday.
Yeah. Just a feeling. Loss. Life.
if it can help you understand the context. I work on codebases composed of millions of lines of code, with thousands of daily changes. And even if the company policy wasn't against the use of generative AI, chatGPT just cannot reliably help in this context
It's actually how my mind coped with a very deep loss in the moment.
Yeah I quite like this one,
I know the subject matter is hard but I do like how while we are making robots and cat girls you tend to capture some very real feeling
There's something so beautifully sad about the CRT TVs. I guess it's just the nostalgia?
Wow. Thanks.
I do my light stuff too: https://discord.com/channels/974519864045756446/1191051599339065446 just for fun
Be mindful of what other users in a channel might find helpful or interesting when posting. Stay on topic in order to keep conversations focused and productive.
Consider posting in #off-topic or an appropriate channel.
Try this: #1178414328538464296 message
Hey guys this tech is amazing and there's a lot of excellent resources. But this channel is just for discussing image generation. I invite you to discuss in other channels or just use DMs. You can always @ users in places like #gpt-models.
@empty kelp not saved that would not get another warning-there was one that has the two just standing together I saw yesterday. In th end, just wondered if your idea would word when trying to get very realistic style into. We can also continoue this via pm.
keeping the appearance of characters consistent between images is extremely tricky, but a few people in this forum have done it successfully. they use the API, image seed values, and custom models โ and occasionally lose the characters because the AI suddenly decides something is inappropriate with no explanation
but iโve never attempted it, and i canโt explain the details
But if they just DM we all miss the opportunity to learn ๐
I thought the chatbot thingie was aimed at my comment. lol that was confusing.
I always find it very confusing as well tbh since we often never see the context for whatever it is referring to
@earnest flame Happy New Year's Eve, Austinitic!
yeah.
Well, hopefully you saw my link to my lighthearted fun project that I completed today. It was a beast, but I learned a lot about prompt details and workflow in the process. (https://discord.com/channels/974519864045756446/1191051599339065446)
My stuff isn't all doom and gloom. Just like 90% of it lately. I spent a lot of time on the fun project too, though.
which one?
sorry, i thought it was a broken spinal column, but i totally missed the bat insignia lol
why is the Lemur jumping in the street with a glowing pink spade card
@empty kelp Your comment vanished before I could answer it for some reason. I was all excited about my first comment. lol
i didnโt mean to post that in your gallery
The character the lemur is based off of jumps a lot and throws charged playing cards
it catches the eye. was curious what the cards might do
I didn't mind. I wish the gallery had more discussion going on since we're, i guess, not supposed to post a whole album like that in here.
explode on contact
haha, no big deal
this sort of thing is what iโm most interested in using DALL-E for in 2024. itโs pretty amazing for quickly creating prototypes of game characters
that's cool!
@empty kelp I don't see how to use seeds or anything like that in chat GPT and seems that Dalle does not have its own website not tied to chatfgpt or bing
the api does not allow seeds right now
Oh I know man, can just tell when itโs closer to home and you do a good job of conveying emotion
you can search for the word โseedโ in these forums
It appears that chat GPT may have something like it since you can sometimes make variations.
You can use the GenID sometimes
Yeah I've had the best luck asking it to use the last GenID for a given gen
Thanks. I really appreciate that. It's fun to convey emotions through prompts or metaphors (when dalle won't cooperate).
ahh.. i thought there was something up with that. ChatGPT was giving me image seeds and saying it could do things with them, and then it refused to do anything
GenID-never heard that before on here
yeah sadly it's a hallucination : (
I've never heard of GenID either.
bare in mind that things are always in flux so what works one day may not work on another
a likely problem with the seeds is that DALL-E randomly puts some super inappropriate things into images, and the seeds would give people a way to consistently put the things in every image
Keep this in mind for GenID, BUT if you ask it to give you the GenID after an image it will give you a string and if you ask it to use that string as a springboard you can sometimes get it to use the same style. Itโs a bit finnicky though
but we can't go back and find it on an already generated image, I'm guessing?
I find that Itโs difficult to get it to give you the correct image ID
I just end up arguing with it
oh. gotcha. Luckily, the one I wanted to try this with was in the last four images, which was easy to ask for.
I have no idea what I am doing with this, though, or how to use it.
Iโm not the best person to advise tbh, you need someone like @dim cradle or @glossy scroll
neat
ask chatGPT how dall-e's input might interact with the genID. It should tell you.
that's what I did
Feels like that would just cause it to hallucinate ๐
and then I asked it to copy an image of GenID blah blah blah as closely as possible
You can save the image and feed it into the GPT and ask to alter it based on that. The GenID is just a reference command, nothing more.
results: Original on image first, new image second:
It had too few fingers on the first try and too many tails on the second, but it's a pretty close match.
This is essentially all that I ever do, I just know that there are others on here that go much deeper
lol. Yeah. I don't go too far into it because I'm not that savvy.
You can attach reference commands to anything, not entirely specific to GenID's, but it will act exactly like a GenID regardless. ChatGPT understands what you mean when you say "GenID" and that's how it can understand to reference an image.
โEach Dall-e image has a 'gen_id' that you can reference to get the same image with little differences. A bit like seeds. Found by accident after using a prompt to generate a woman taking a selfie. Nov 8, 2023โ (Google)
That's really a game changer for me. wow. I hope it stays working like this... however it's doing it. I'm going to try a GenID copy plus alteration.
seed does not work. Persistently ask it to submit it to dall-e. Eventually it will and eventually that submissions fails immediately.
It's not so much as using GenID as a tool as it is for ChatGPT to interpret what you mean when you say "GenID". When you say it, it interprets to give you a reference number based on the image, but you can just save the image and attach it into the GPT for better reference. To put in a nutshell.
genID will though and seems to be the key to variations.
It does seemt to output a seed though
which is interesting
I have terrible luck with asking it to copy an attached image. I'm going to compare the two in a few minutes.
@shut niche deduced that the the GenID seed was a previous tool used for Dall-E that is no longer in operation for the public.
using genIDs
It can't copy an exact image. It can only recreate it, and it can't create it if it's against content policy, which is that it can't create copyrighted images.
I dunno. I have never gotten a copy as exact as the one above #images-discussions message
I mean using its own generated images
to get an exact copy you need to also have it submit the same exact image prompt
like instead of inpainting or outpainting
That is, the string of words its sending to dall-e
Anyone else have consistent issues with generating centaurs? GPT seems to always get confused with it.
Lol. The Centaurs! Tell em steel!
This is a throwback! dall-e 2 used to have a lot of issues. Dall-e 3 seems to do a lot better.
yeah it's just something that dall-e struggled with for a while. Right now it's great at them though imo.
I'm having fun with minimalist vector art right now.
I'm not sure I follow. What are you trying to do exactly?
Keep the same image as closely as possible, but fix the three finger issue or two tail issue
I'm honestly just playing about to see what I can do.
Yeah, that's gonna be a problem. Because as steel said, you need the exact prompt that produced that specific image. If you ask it alter an image based on an image you attach into the GPT, it will recreate it with the fixed modification. But it won't be the exact image. You need the exact prompt that produced that image.
Seeds are history. referenced_image_ids are buggy and in active development, so it's hit or miss.
Lmao
centaurs can take a few regens
@dim cradle I'm just playing around to see what happens. My second attempt was not great. I'm trying to avoid the penalty box. So, I'm afraid that's all the playing about I'll be doing for a while with these.
let's see, ways to spin endings as positive .... .... ...................
Using a โGen IDโ (generation ID) in the prompt allows you to reference a previously generated image and provide context or request variations based on that image. Here are some examples of how to use โGen IDโ when generating images with DALL-E:
1. Referencing a Previous Image:
โข โBased on Gen ID: [gen_id], please create a new image with a similar style but featuring a different landscape and color palette.โ
โข โIโd like to explore variations of the image generated with Gen ID: [gen_id], but with a different character in the foreground.โ
2. Requesting Variations:
โข โUsing the Gen ID from the previous image, generate another image with slight modifications, such as changing the lighting and adding more objects to the scene.โ
โข โReferencing Gen ID: [gen_id], please create an image where the same character is shown from a different angle and in a different pose.โ
3. Building Upon Previous Creations:
โข โLetโs continue the story from the image with Gen ID: [gen_id]. Show what happens next in the scene with the same characters.โ
โข โBased on Gen ID: [gen_id], expand the scene by adding new elements and characters to create a more complex composition.โ
Using โGen IDโ allows you to maintain continuity in a series of generated images or explore variations of a specific image while providing a clear reference point for DALL-E to understand your request.
Very nice
Well the end of 2023 for me involved a leaking soil pipe through the wall cavity into our living space so Iโm personally glad to see that done with ๐
I donโt think thatโs gonna help you thoughโฆ
right, referenced_image_ids is for input, gen_id is from output, they're the same id
i asked ChatGPT if there was anything else to describe an image besides Gen ID, prompt, and seed, and it said, โNope. There is nothing else.โ
Next daily post will be minimalist vector art, itโs like a challenge
OK, let me redo, and I'll try to think of a better one ha
haha @open trench i seriously thought about a food replicator as a solution
lol. Yeah. It was the most positive and playful thing I could think of for ending something.
Can you draw a high quality photo of Santa, two athletic and diverse female elves, DALL-E, and ChatGPT sitting at a bar. A large colorful gecko is standing on its hind legs behind the bar. One of the elves is holding a bar stool over her head.
thatโs the best interpretation of a prompt that iโve seen this year
the guy on the right must be ChatGPT
going to see what Bingโs interpretation is like
Your prompt?
Can you draw a high quality photo of Santa, two athletic and diverse female elves, DALL-E, and ChatGPT sitting at a bar. A large colorful gecko is standing on its hind legs behind the bar. One of the elves is holding a bar stool over her head.
this is the prompt
Used the screenshot you posted
2 of the outputs
Jovial times
going to take the entire ice cream tornado scene and move it into the bar
so many coincidences today, i was thinking of a bar scene for the mad lib haha
it's 2:40 pm in Hawaii. still a lot of time to kill in 2023
@empty kelp do you GPT+?
I have three ChatGPT Plus accounts, and i'm using the Python API and Bing also
That's sweet. Does it keep you out of the penalty box?!?
The reason I asked is, if you've got time to kill maybe you'd be interested in trying out my custom GPT? XD
it does. i've never actually logged in to one of the plus accounts
I need to figure out the API and git gud, but...
yes, but first we need to test the ice cream tornado in the bar with Santa, the elves, DALL-E, ChatGPT, and the gecko bartender
The GPT is pretty sweet. It's a multi-agent system that analyzes images for all kinds of parameters. It might be useful for your test.
Because it integrates Vision and DALL-E with /commands.
i'll give you the good prompt so you can help test
Can you please create a hyper-realistic photo of the inside of a bar in Hawaii at night caught in an extremely powerful tornado. ChatGPT, DALL-E, Santa, and several diverse female elves with really, really long hair and non-alcoholic tropical fruit drinks are sitting on bar stools in front of the bar, and the bartender is a giant colorful gecko who is holding the base of the tornado over his head in the palm of its hand. One of the elves is holding a bar stool over her head. There are decorative potted plants in the bar. The tornado is glowing rainbow colors and looks like ice cream. The vortex is at the center, twisting fiercely, with hair, plants, and drinks caught in its swirling motion. Colorful balls are spinning around the vortex with a motion blur trail. The elves and Santa are calm but struggling to stay seated in the vortex. The scene captures the raw power and destructive beauty of nature's fury. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
ok, here is version 1.0 of the prompt
A hyper-realistic photo inside a Hawaiian bar at night during an extremely powerful tornado. Personifications of ChatGPT and DALL-E, along with Santa and a host of female elves with epic hair and non-alcoholic Hawaiian beverages are sitting on bar stools in front of the bar. The bartender is a giant colorful gecko who is holding the base of the tornado over his head.
Sometimes you want to go where everybody knows your name.
just a quick 90 second test on Bing shows that the prompt produces very deep and meaningful imagery
A hyper-realistic photo inside a Hawaiian bar at night during an extremely powerful tornado. Personifications of ChatGPT and DALL-E, along with Santa and a host of female elves with epic hair and non-alcoholic Hawaiian beverages are sitting on bar stools in front of the bar. The bartender is a giant colorful gecko who is holding the base of the tornado over his head.
A hyper-realistic photo inside a Hawaiian bar at night during an extremely powerful tornado. Personifications of ChatGPT and DALL-E, along with Santa and a host of female elves with epic hair and non-alcoholic Hawaiian beverages are sitting on bar stools in front of the bar. The bartender is a giant colorful gecko who is holding the base of the tornado over his head.
see if you can spot ChatGPT and DALL-E in the images
Easy. The deer and the lemur thingy
In the one where the gecko is perched over the bar, the AI have their heads on backward, and sideways, respectively.
With images this deep, one can decode the sum of all meaning...
Can you please create a hyper-realistic photo of the inside of an original looking bar in Hawaii at night caught in an extremely powerful tornado that looks like swirly multi-color ice cream. ChatGPT, DALL-E, Santa, and three diverse female elves with really, really long hair and non-alcoholic tropical drinks are sitting on bar stools in front of the bar, and the bartender is a giant colorful gecko who is holding the base of the tornado over his head in the palm of its hand. One of the elves is holding a bar stool over her head. There are decorative potted plants in the bar. The vortex is at the center, twisting fiercely, with hair, plants, and drinks caught in its swirling motion. Colorful balls are spinning around the vortex with a motion blur trail. The elves and Santa are calm but struggling to stay seated in the vortex. The scene captures the raw power and destructive beauty of nature's fury. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
this is version 2.0 -- it works correctly now
A hyper-realistic photo of the inside of an original looking bar in Hawaii at night caught in an extremely powerful tornado that looks like swirly multi-color ice cream. ChatGPT, DALL-E, Santa, and three diverse female elves with really, really long hair and non-alcoholic tropical drinks are sitting on bar stools in front of the bar, and the bartender is a giant colorful gecko who is holding the base of the tornado over his head in the palm of its hand. One of the elves is holding a bar stool over her head. There are decorative potted plants in the bar. The vortex is at the center, twisting fiercely, with hair, plants, and drinks caught in its swirling motion. Colorful balls are spinning around the vortex with a motion blur trail. The elves and Santa are calm but struggling to stay seated in the vortex. The scene captures the absolute beautiful chaos of the holiday season in Hawaii. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
What's up with Buff Skins Santa? LMAO
switching it to "personifications of ChatGPT and DALL-E"
A hyper-realistic photo inside an original looking bar in Hawaii at night caught in an extremely powerful tornado that looks like swirly multi-color ice cream. Santa, three diverse female elves with really, really long hair, and a anthropomorphic ChatGPT and DALL-E are sitting on stools in front of the bar, and the bartender is a giant colorful gecko who is holding the base of the tornado over his head on it's finger. All of them are wearing appropriate swimwear, and one of the elves is holding a bar stool over her head. There are decorative plants around the bar. The vortex is at the center, twisting fiercely, with hair, plants, and drinks caught in its swirling motion. The scene captures the absolute beautiful chaos of ice cream tornados in Hawaii. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
version 6.0 has various optimizations
I have a suggestion that I can't try right now because #jail
#penaltybox, rather
anyway, describe the bartender's pose as a yoga pose or something with one hand naturally overhead to get it to use a pose to trigger the tornado base.
You want a tornado spawning froma bartender's hand?
the bartender needs to be holding the tornado up in the air with its finger. it takes the AI into a different dimension
Hmm...
it won't look like the tornado is in its hand, but it will transform things
it's because the DALL-E gecko is too distracted by trying to eat things to actually hold a tornado
Well, here's this. I didn't get any context for what ya'll want, though, lol.
@empty kelp I might have found the problem when messing around with the Bing Image creator.
The final prompts seem to get truncated...I think that's why the latter part of your prompts are getting cut. They're already so detailed, then the tool "upsamples" your prompt, which can't really possibly add more useful detail, thus it's just too much to process. It looks like you got close.
No hair in the swirl
but
close.
the hair is getting swirled
A hyper-realistic photo inside an original looking bar in Hawaii at night caught in an extremely powerful tornado that looks like swirly multi-color ice cream. Santa, three diverse female elves with really, really long hair, and a anthropomorphic ChatGPT and DALL-E are sitting on stools in front of the bar, and the bartender is a giant colorful gecko who is holding the base of the tornado over his head on it's finger. All of them are wearing appropriate swimwear, and one of the elves is holding a bar stool over her head. There are decorative plants around the bar. The vortex is at the center, twisting fiercely, with hair, plants, and drinks caught in its swirling motion. The scene captures the absolute beautiful chaos of ice cream tornados in Hawaii. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
I wonder if an AI can TLDR that
without losing any important details
to make the prompt as concise as feasibly possible, maybe even using a semiotic language.
You can tell gpt to make a prompt with your exact words.
But it has to fit in the DALL-E buffer.
That's what I was trying to explain. I'm not sure these final prompts even fit. They might, but all of them I looked at in the creator were truncated by the field limit.
You mean you're writing prompts so long dall-e can't handle it? Or are you asking if Dall-e can clean things up itself?
We're looking at the ice cream chaos tornado in hawaii prompt
there's a lot going on in the image, but the prompt itself appears to get truncated, at least in the microsoft image creator.
i'm speculating that DALLE-3 literally might not read it all.
Possible. Even gpt seems to drop details I ask for.
Happy 2024!!
(inside the ice cream shaped vortex)
That dog don't hunt!
'We were somewhere around Honululu, at the edge of the reef, when the drinks began to take hold...'
version 11.2
``Can you please create a hyper-realistic night time photo of a beach bar in Hawaii caught in an extremely powerful tornado. A humanoid ChatGPT and DALL-E, three diverse female elves with extremely long hair, and a giant gecko are sitting around the bar. They are all wearing appropriate swimwear. Santa is behind the bar, and holding the base of the tornado over his head in the palm of his hand. There are coconut trees. The tornado is glowing rainbow colors and looks like ice cream. The vortex is at the center, twisting fiercely, with hair, trees, and drinks caught in its swirling motion. Everyone is struggling slightly to stay seated in the vortex. The scene captures the raw power and destructive beauty of nature's fury. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.`
'I remember saying something like, "I feel a bit light-headed, maybe you should prompt."`
'Medprompt. Get the medprompt. We need help...'
things are blowing properly now
ChatGPT in the background, arms folded, like "Ice cream tornado? Not impressed..."
Santa's hat: Doubles as his toybag.
That thing must have a drawstring on it! XD
I asked ChatGPT to generate image depicting a quote from Game of thrones.
I find it funny it's trying to give them wings ๐ชฝ. Here is the quote
"Chaos isn't a pit. Chaos is a ladder. Many who try to climb it fail and never get to try again. The fall breaks them. And some, given a chance to climb, they refuse. They cling to the realm, or the gods, or love. Illusions. Only the ladder is real. The climb is all there is."
Another concept art of Ancalagon From LOTR
nice, like an ebony dragon?
why is bing so useless
I apologize for the confusion. I will try to create an image that meets your requirements again. Here is what I came up with:
!A cover page for a non-profit organization
I hope this image meets your requirements. If you have any further requests or suggestions, please let me know.
Copy and paste the text here and I'll run it through gpt for you.
ok thanks man
We are designing coverpage for a presentation on a non profit. Must include our main activities in it: Ok, few things to consider: The image must show the main themes of our programs: 1) Food hamper 2) indigenous activities 3) Dream catcher 4) Children daycare 5) Arts stuff 6) helping those in need Must be nice and artistic, color theme purple black and white and in the design make sure the top left corner is white so we can insert our logo there too
Hello
Lol, sometimes I jump into things with no context. I'm not sure any of this is what you were hoping for. The first two, Dall-e took some liberties. "A cover page design for a non-profit presentation, showcasing their main activities. The image includes six key themes: 1) a food hamper, representing food aid, 2) indigenous activities, featuring cultural symbols, 3) a dream catcher, 4) a children's daycare scene with kids playing, 5) artistic elements like paintbrushes and canvas, symbolizing arts, and 6) a depiction of helping those in need, like a person offering a helping hand. The color theme is purple, black, and white, with an artistic and engaging layout. The top left corner is left white and less busy to accommodate a logo. The design is visually appealing, with each element integrated seamlessly into a cohesive and meaningful illustration." The second two are the text as is.
lmao nice probbably just design it the old fashioned way then thanks a lot for the effort tho
If you're looking for specifics and have the ability, that's the way to go, especially for things that need text. Practically, I feel these images are more inspriation than useful on their own.
yeah, time to use canva
but I can use some of these as single images in the brochure
Merry newmas, everyone. At least for my timezone.
Attempted concept art of SCP 682
Happy New Year everyone!
@dim cradle love that cherry tree, I wish I had thought of it.
thanks a lot, i wanted to do some traditional Japanese, on the theme, and it was the ai's idea ha
I thought you were a person and not an AI ๐คฏ
seems to do Sumi-e and Ukiyo-e styles very well, and then i got onto pop, and just found out it does great stained glass also...
yeah, I was doing that kind of stuff a while ago, it's indeed really good
i try to include a title, or some caption, but i'm not very consistent, just depends
hehe, yeah titles are hard
Happy New Year!!! ๐ฆ ๐ฅ
@empty kelp kinda surprised about your json file, thought you'd be including more attributes for a prompt builder
I was going to create more New Year's Eve images, but i reviewed the ones from earlier and they're all pretty flawless. I'm not sure there is really room for improvement
Happy New Year! Rate limits now in place. Now you don't even get 40 chats in 4 hours anymore. What's sad is that we went from four images per chat, down to two, and now one .
3 hours and if you have errors and didn't get a image that also counts towards the cap.
you also have 200 images per 24 hour cycle which resets at 3AM your local time
Well, that's news to me. I guess I'll switch over to copilot/BIC in the interim.
ah and if you are overlapping previous and current cycle, there's also a mix penalty
i was going to do that, but the images seemed sufficiently complex
@vapid granite don't worry, I had to do that school too
it's 9:49 pm here. we're still stuck in 2023
you live in the past
finding more good ones from the prompt earlier. some have really good balance and interesting creatures
This is the prompt if anyone would like to make more of these:
Aloha! Can you please create a hyper-realistic photo of the inside of a bar in Hawaii at night on New Year's Eve caught in an extremely powerful tornado. Santa, four diverse and athletic female elves with really, really long hair, a large rabbit, and an elephant are seated at the bar. They are all eating rainbow color shaved ice. There are coconut trees and pouring rain, and there is a large gecko. The vortex is at the center of the bar, twisting fiercely, with hair and trees caught in its swirling motion. Colorful nondescript things swirl through the air. Everyone is smiling but struggling to stay seated in the vortex. An extremely powerful wind blows everything away from the gecko. The scene captures the raw power and destructive beauty of geckos and nature's fury. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
That's really excellent
The gecko always locks on to the smallest creature in the scene and tries to eat it. You can see that it's going for the teddy bear
lol
actually it might be going for whatever that is on the barrel. it's hard to make out
the DALL-E gecko really does have rich, interesting behavior. it steals almost every scene
it looks like Santa is upset because the gecko elf is ignoring him
or it could be because of the weird creature dropping dust onto his head, or the strange thing on his shoulder, or the creepy elf behind him staring. Santa has a lot going on tbh
he does
can you make it bigger? it's all fuzzy
also you might have to rename the custom gpt, because of content policy, or it can be erased. referencing so directly Amy Soyka will surely trigger that
Please create a hyper-realistic photo of Santa and three athletic and diverse female elves (with really long hair and pointy ears) seated at a properly set table at a five star hotel restaurant with windows overlooking a sunny Hawaii day on the beach. It is New Year's Day, and Santa and the elves are eating breakfast. They are all wearing appropriate beach wear and smiling. A large gecko wearing a white dress shirt, black pants, and a grey vest is standing next to the table on its hind legs and holding a silver platter piled with delicious food. The plates on the table also have interesting food and tropical drinks selected by the gecko. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
I can't try that atm, I'm in the penalty box
I'm ok with my name being mentioned.
yes, but the AI won't be
Oh, is there a reason?
yes, referencing persons directly
It is me and it is my doodles.
yeah, but the AI doesn't know that and the content policy will enforce that
So, as the person who's artwork it is, uploading the artwork and asking it to create artwork 'in the style of [my name]' wouldn't work?
Even if I am consenting to it?
yes, like i said, the AI doesn't know you are you
Even If I described myself arbitarily with a pseudonym and then asked for art in my style... would that be the same?
yes, even after that
unless you had a deal with openai about it, you are not you as far as gpt is concerned
Oh, I see.
I wonder if this is something that could be explored by OpenAI then?
aka. allowing users to upload artwork as training data with licensing approval as part of their uploads system, perhaps?
maybe in the future, but for now it won't happen
you can train your ai using your art, but don't reference it as a person that is living in the past 110 years
That is an interesting legal barrier. I don't mind my drawings being open source.
Create a man on mars
yeah, but for now openai is playing it safe
I understand your point of view, but it also can be misused for other people who could impersonate persons for the wrong reason
It is intersting. The UI ChatGPT offers provides ease of access in a way that models like Stable Diffusion and Midjourney dont; yet the other two used to allow training in a persons art style.
yeah, i know the feeling, it can be frustrating when you are doing something and it doesn't work
I'm pretty sure that if you work with the API, you'd have more freedom in that regard
Ease of access.
but dunno how your skills as a coder are
I'm Ok. I'm a bit rusty though these days.
Plus, a lot of my hardware has windows rot and usually doing something that way requires a new device, which, is expensive. ๐
Amusingly, this was my other Custom GPT
(I've been using public domain/historical paintings/illustrations of my hometown for it)
you can keep working on it, just don't reference it like that and should be fine
Are you not able to reference places even?
oh places are ok
persons not much, you can do "resembles" "inspired by" but that is a grey line the policy sometimes doesn't realize
...this is the town that hosts the UNIT from Dr Who...
I got one custom gpt erased because it was inspired by cat-grils from anime
maybe have a chatgpt4 open and ask if that aligns with the content policy, that is also a way to "check" but no guarantee
I hadn't published it.
If I did, it would probably be spammed by Dr Who fans...
๐ค
hey, you'd be popular
in the mean time, feel free to share your creations here, or in the daily theme
or gallery
The biggest issue I have at the moment is that it keeps wanting to draw cromer church as norwich cathederal, even though the two are entirely different.
that's where prompt engineering comes in
Would showing it images of norwich cathederal and saying 'this is norwich cathederal, it is NOT cromer church' work?
(See, I posted it here: https://discord.com/channels/974519864045756446/1191329812028063804)
That was the most accurate rendition.
There shouldn't be a spire.
(and a few others bits)
I got emotional atm, I think I will be back after I can calm myself. C ya all later
supes doing the most mundane thing ever - grocery shopping ๐
galaxy queen! i like it
made this prompt for you
A hyper-realistic wide aspect ratio portrait blending a symmetrical Rococo aesthetic with futuristic cybernetic elements. The central face is detailed with technology implants that intertwine with ornate baroque embellishments. Eyes reflect the genesis and cessation of subatomic particles, set against the backdrop of an infinite cosmic expanse. Honeycomb patterns interlace through the visage, fading into the celestial void. Surrounding this central figure, elements of sentient coral structures are superimposed on a microchip landscape, with branches pulsating with golden energy, morphing into an organic crystal lattice. This techno-organic entity exists within the synapses of a supercomputer's neural network. The scene harmonizes art and advanced technology, with a color palette melding historical richness and futuristic neon vibrancy.
Happy New York 2024, Let the sun shine ๐
aww
thanks but no thanks, yours slayed tho, i don't have plus ๐ญ
You are right, let the sun shine. Make sure during summer vacation, I need a tan.
Starboard bot died? Didn't survive 2023?
nah, these are faces you can trust
Interesting how chrome can save the metadata of gen_id in the image, but firefox can't.
awesome and supergirls too
seems like dalle is almost good enough to make a comic book. maybe by dalle 4 it could have complete Coherency with what a person should be doing and of course keep consistent looks
and with the right info, you can do it really fast
I had one image from Hansa with a cat-girl, since then I've been able to replicate her perfectly in different situations
I just hope I haven't bumed down people with my own personal stuff.
looks like some concept art for some new Jordan peele movie
ack. My druid looks like a jedi on an invisible speeder lol
I am not even sure I want to share this, but it's a little comical.
The footwear looks way too modern to me as well... for an ancient druid, that is.
I actually sort of like it except for the fact that his staff is a cheap curtain rod and the shoes.
can you make it a weasel?
let me see
lol you mean this medieval era druid
it really grinds my gears that I lose so many of my limited number of prompts due to gpt4's gaslighting/hallucinating
here's an idea, someone make an image of the end of the journey of a padawan
I'm not very creative atm, so dunno how to start that, but I would love to see some concepts
Happy New Year Trees ๐ฅณ
Here is your weasel version:
cool
Nezhno is really getting the hang of it.
Happy New Year @glossy scroll
I think starboard bot got drunk, no stars from it all day long
Thanks. The genID thing really helps about 90 percent of the time for me. I wish I could make it so DALLE3 always spit that information out after creating the image.
Happy New Year @late blade 
happy new years to all. may AI give us some more little joys this year and maybe even a surprise or two we do not expects ๐ฅ
agreed. happy new year to everyone ๐
๐
what is everyone wish for dalle or image gen for 2024
maybe some video making in dalle 4 haha
inpainting/outpainting lol
Let's hope Dall-E matches with Midjourney this year.
you think MJ is better Pythagoras? I have it now and do not use it
I think Midjourney is more accurate with results, everything considered. But Dall-E is more creative.
I like Dall-E more
i think the opposite really. mj can do some more variety of photoreal faces though
MJ is just powerful. You can't really ignore it because it's at the top of the game at the moment. But Dall-E is just fun to use. I would love Dall-E to be closely matching to MJ so that we can instinctively make more detailed images.
yep agree, i'm too broke for mj
same
I've used it on and off because yeah it's just too much money to use.
it can cost a Hamilton per month for its basic plan in mj dall-e is king
well Bing is King but Dalle in gpt is Prince at least
you got to differentiate tho, MJ is purely for images, GPT is broader
Hamilton = $10
Dall-E is fun, but I keep seeing a lot of the same patterns for the image gens. I'd love to see more learning patterns instead of a pre-set pattern that we keep seeing.
I agree, but just because we all continue in the same pattern in this chat
Lol. It's definitely not us. I refuse to believe that.
ink washing, minimalism, anime, illustration, that's mostly of us
Nah, come on. There's always that chad looking face guy everywhere, and the skinny girl, and whenever you ask for a costume it gives like a bald headed costume with no real features.
fair enough
I've been trying lots of art styles recently, I wanna get good ideas on how to use them in defined json files for future reference
Json is fun
It more closely lines up with the functionality of ChatGPT
The syntax I mean.
Please create a hyper-realistic photo of Santa and two athletic and diverse female elves (with really long hair and pointy ears) lounging on lounge chairs next to a five star hotel pool overlooking a sunny Hawaii day on the beach. It is New Year's Day, and Santa and the elves are holding their Phones. They are all wearing somewhat conservative pool attire and smiling. A large gecko wearing a stylish aloha shirt and white pants is standing in front of Santa and the elves on its hind legs, facing towards them, and holding a tray with tropical drinks. Santa is staring at his iPhone, but the elves look at the gecko and smile. The image should have 1792x1024 resolution, landscape orientation, and the best possible HD rendering.
So the gecko is serving drinks to Santa and the elves next to the hotel pool
well, i guess the 1st and 4th picture are humans, but the rest are elves. you can tell because they have pointy ears
I could say I know my way around structuring json files. been working for years with FHIR on the medical field. That's just JSON structures for medical stuff
just need to find the proper NLP for what I want
Make a GPT that makes it.
@dense mesa can help you with that.
hehe that would be nice
Santa and the elves are getting bored in Hawaii. might need to send them on a cruise to Alaska or something
my plan was to replace Santa with the gecko in January, and then the gecko and elves would go on adventures. But then i found out that the gecko is trying to eat the elves, so the plans are all pretty much up in the air now
im not really sure about why you guys keep making santa and lizards, but im not complaining
@still dagger well, it's only 358 days until Xmas, so @empty kelp is preparing for it
Not the guy your looking for, but I also made some Santa's myself, basically santa doing one of my hobbies = christmas card.
woops
It's because Santa is always exactly the same, and he has tens of thousands of elves working for him -- so it makes sense for the elves to always be different. That gives good continuity to DALL-E images and storylines. It actually makes sense to just always use Santa and the elves in every image
well, welcome to the chat then, you got dragged in
An airship I made in Airships Conquer the Skies. Now to get GPT to make artwork of it, lol.
ye olde kitten, 1602 cat made by bic
well then we got to tell @vapid elk what I just relied to another person
Mass hysteria
It all started with Hawaiianz when he made that elephant with the bow ๐น
Then all hell broke loose
I'm just enjoying when they are posted, I havn't been activly doing those images
I did one tho
this one #images-discussions message
Also, here's another tattoo I made.


