#images-discussions
Changed in what way?
Many more characters and alphanumeric instead of integer
BUMP 👀
Krea AI? I'm not sure what this means
Same
from this thread.
Looks like not OpenAI stuff to me
ah
For upscale
I haven't noticed a difference I don't think, will keep an eye out though. I use the gen_id feature a lot
About two or three weeks ago it changed significantly
I wouldn't have noticed without a GPT requesting every image payload.
You get used to certain patterns like all numerical fields, so when letters start to appear ...
Well, I noticed.
They should make the UI like the code interpreter tool so you can see the work being done
Also some transparency
Would be great.
We shouldn't have to pry this open with forensics.
it appears gen_ids/referenced_image_ids (being the same thing) are in base 16, hexadecimal. i think that's been true for a long time.
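A quick way to check the "all numeric vs. letters appearing" observation is to classify the character set of an ID string. This is only a sketch: the sample values are made up for illustration and are not real gen_ids, and `classify_id` is a hypothetical helper name.

```python
import string


def classify_id(value: str) -> str:
    """Return a rough label for the character set used by an ID string."""
    if not value:
        return "empty"
    # Note: an all-digit string is also valid hex; it is reported as decimal.
    if all(ch in string.digits for ch in value):
        return "decimal"
    if all(ch in string.hexdigits for ch in value):
        return "hexadecimal"
    if value.isalnum():
        return "alphanumeric"
    return "other"


# Made-up sample values for illustration only.
samples = ["1234567890", "9f3a0c7e1b2d", "Xk29QmZ81LpT", "abc-123"]
for sample in samples:
    print(sample, "->", classify_id(sample))
```

An ID that only ever uses 0-9 and a-f is consistent with hexadecimal, while letters past f would point to a wider alphanumeric alphabet.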
it's in active dev, that's why i declared a moratorium on the topic weeks ago, it's a silly topic when it's so flaky
I mean.. you don't need to do that.. the system is made to be as simple to use as possible: just say what you want
I didn't know that there was a moratorium. I was just reporting that things were suddenly different yesterday.
the idea is to really minimize as much as possible how much prompt engineering is required
i'm always saying these guys are over-thinking lol
I've been testing this seed field in the payload. It's only useful in the current session. You can't use seeds from previous chats
The images are in the browser cache
so.. if you like an image, you can say "make this exact same image again but change X to Y", and it will probably understand that it should make a request with the same seed and reference, with a slightly modified prompt
i'm j/k i know!
Great. I want to put it into explicit commands for users so I need to understand the generalized application.
you can't
Yeah but if you say this and use seed and referenced_image_ids you should get very good consistency
this is not a feature on the API
Somebody should inform /recompose
I really want seed to be a feature of the public api, it would be so much more useful for integrated applications
Nah, but it seems to be available in ChatGPT
It's working in ChatGPT
yea, it is a feature only on chatgpt
And was yesterday too
for this reason the idea of "customers" does not really make much sense like it would on an actual product that uses the API
i mean... I hope you are not scraping chatgpt and faking interactions with an automated system to implement those features on an external project 👀
that would be really uncool with the TOS
Lol wut
So OpenAI 100% have APIs internally that aren't available to the public. We see this with the Assistants API (used in ChatGPT for GPTs) -- the public API doesn't support streaming of responses, but in ChatGPT we see this happening as tokens are rendered one by one instead of waiting for the entire response to finish
more useful than seed tho, inpainting, I really wish they release inpainting with dalle3 like they have on dalle2
I want to know what you meant by providing fake interactions.
How would that work? Isn't that traffic internal only?
making a script that uses the chatgpt website for you in order to proxy the AI content to an external system
people who do that are the reason they had to implement those super complex captchas
Okay this makes sense. No, that's not what I do. I make custom GPTs.
Then I test them.
Slime balls.
well, manipulating the dalle request can be done, but it isn't really an intended use, so it might be harder
the ""normal"" use is "just tell the AI what you want and it will figure it out"
That's where I operate. Tough promptin'
it isn't magic tho, some times it will just not figure it out
I have questions but they aren’t dall-e related so I’ll save for another thread.
and there is a range of "things that the AI does not figure out" and "things that the AI can only figure out if you do complex prompt engineering"
and that second group only gets smaller as the AI and the integrations get better
Ai is magical hehe.
General behavioral reproduction with consistency is what I am interested in.
this is the catch.. it is not really how it is supposed to be used
you can't expect deterministic behaviour of a system built on RNG
So if it can duplicate an image, for example, I'll create an explicit command that's tied to the instructions to do so via RAG
it is built with the purpose of giving you a different result each time you use it
And yet
The use case is a consistent rendering for a narrative.
coherent*
not consistent
this is the catch
My GPTs do produce deterministic output
the AI goes for coherence not consistence
But the seed recalls all those random decisions for increased consistency.. maybe not 100%
So it's consistent due to the PE
well.. yea.. seeded generations.. but it isn't very useful in that case to just be able to get the exact same thing twice
instead of wasting GPU compute time, it could have been a copy paste 😆
"It'll figure it out" isn't good enough. We can use output templates with open variables to get what we want.
Yeah what if you want acid breath from the same dragon but DALL-E 3 rendered fire, for example?
Will seed help here, perhaps?
How about for a sequence of images where one has an inconsistency?
Lots of reasons for gen id, reference id, and seed, not just ctrl+c, ctrl+v.
One of my GPTs is literally for visualization of complex processes. For those images, it's ideal if they share as many commonalities as possible!
you can try to ask chatgpt to re-make the image to get a similar composition, but you will get a completely different image if you do any drastic changes on the prompt
That's the purpose of requesting the payloads
So that I can control the rewrite
let's say your prompt asks for "a dragon", and it makes a green dragon
if you prompt again for "a red dragon", the addition of "red" will make a completely new image that has nothing to do with the original one
despite using the same seed
Not if you control the payload, man
even if you control the payload
Scroll up
getting it once is pure luck
How many times do I need to do this
getting it to consistently change only what you want with a prompt that is too different, is very hard
all of the times =P
It only works if the features are on
I can create a series of nearly identical images arbitrarily long though with this. I've tested up to five in a row before running into the penalty box.
I know it is possible, it can indeed be kinda consistent
but if you need to do that based on the subjectivity of the prompt, then it can't be reliable
that is what I mean with "you can't"
and this is why I wish that seed, reference and so on.. were features of the API
That's what advanced prompting is for. Then users can just do that
At least in the ChatGPT interface
because then with the API you don't need to rely on prompting, you can simply programmatically set the values of your request
Really you can do that with chat
I promise
I'm making a test for this now, same seed and gen_id, different dragons -- let's see
Actually, I can't make this work. It seems like the seed field in a payload will cause an error
Using just gen_id gets very different results
seed isn't enabled today
my payload requests earlier showed this.
sometimes when seed is disabled, passing it still doesn't cause errors. sometimes it does.
the model's in a constant state of flux, and the ability to output similar images in sequence, even according to OpenAI, is not "currently" supported.
That said, to do this you have to provide the same JSON payload, not just the same pre-prompt.
This added complexity invariably causes users inexorable confusion.
I don't blame anyone for the confusion. that pre-prompt/rewrite mechanism may be for our benefit but it also drastically obfuscates the mechanism and algorithm.
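To make the "same JSON payload, not just the same pre-prompt" point concrete, here is a minimal sketch that compares two payloads field by field. The `seed` and `referenced_image_ids` names come from the discussion above; `prompt`, `size`, and all the values are assumptions for illustration, not a documented schema.

```python
import json

_MISSING = object()  # sentinel so a missing key differs from an explicit None


def canonical(payload: dict) -> str:
    """Serialize with sorted keys so equal payloads give equal strings."""
    return json.dumps(payload, sort_keys=True, separators=(",", ":"))


def diff_keys(a: dict, b: dict) -> list:
    """List the top-level keys whose values differ between two payloads."""
    keys = set(a) | set(b)
    return sorted(k for k in keys if a.get(k, _MISSING) != b.get(k, _MISSING))


# Hypothetical payloads; field names other than seed/referenced_image_ids are assumed.
first = {
    "prompt": "a green dragon on a beach",
    "size": "1792x1024",
    "seed": 12345,
    "referenced_image_ids": ["abc123"],
}
second = dict(first, prompt="a red dragon on a beach")

print(canonical(first) == canonical(second))  # same payload or not
print(diff_keys(first, second))               # which fields changed
```

Comparing the requested payloads this way shows whether only the prompt was rewritten or whether the seed and reference fields changed too.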
I struggle to accept this changes so frequently. Will do some tests each day for the next few weeks to confirm
Not saying you're wrong or anything, but I need to see for myself to believe it
"I was unable to generate images because DALL·E is currently experiencing high demand."
Is that while trying the seed thing?
"hmm, something seems to have gone wrong." -- nah just seeing these errors a lot today
Oh yeah, today has been odd, logout and back in -- fixed it for me earlier. For a bit I thought I was blacklisted
today is not an odd day... it's 14th...
I double checked with gpt
It's been an odd day with DALL-E, I never said it was an odd day on the calendar 😅 -- I'm dying lol!!
lol, I just had to, I was soooo tempted to do the pun
I don't blame you, touche
I just had a guy explaining UX in medical, 4 hours long. really boring stuff
the guy has ZERO clues on UX for meetings
it's an odd day every day in dall-e land
DALL-E Land
a mysteriously magical wide image about "Every day is an odd day in DALL-E Land" -- that was my prompt
Hey everyone, is dalle-3 unlimited generations?
40 within 3 hours, up to 200 a day
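For anyone tracking their own usage against those numbers, a rough sketch of a local counter follows. It assumes a rolling 3-hour window and a separate daily counter; how the real limits are enforced is not known here, and `GenerationBudget` is a hypothetical name.

```python
from collections import deque


class GenerationBudget:
    """Local usage tracker: N generations per rolling window plus a daily cap."""

    def __init__(self, window_limit=40, window_seconds=3 * 3600, daily_limit=200):
        self.window_limit = window_limit
        self.window_seconds = window_seconds
        self.daily_limit = daily_limit
        self.recent = deque()  # timestamps (seconds) inside the current window
        self.used_today = 0

    def try_generate(self, now: float) -> bool:
        """Record one generation at time `now` if both limits allow it."""
        # Drop timestamps that have aged out of the rolling window.
        while self.recent and now - self.recent[0] >= self.window_seconds:
            self.recent.popleft()
        if len(self.recent) >= self.window_limit:
            return False
        if self.used_today >= self.daily_limit:
            return False
        self.recent.append(now)
        self.used_today += 1
        return True

    def reset_day(self) -> None:
        """Call at the daily reset time to clear the daily counter."""
        self.used_today = 0


budget = GenerationBudget()
# One request per minute: the 41st lands inside the same 3-hour window.
results = [budget.try_generate(now=i * 60) for i in range(41)]
print(results.count(True), results[-1])
```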
after the 200 it just resets the next day?
Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.
Quick tips for selection:
- No copyrighted or profane content.
- Your profile needs to be public.
- Stay mindful of sensitive topics.
- Outputs that closely follow the prompt are preferred.
Feel free to share reels, carousels, or just a single image!
Is that different with teams?
yes, 3AM your local time usually
Thanks!
Do yall prefer it over the other options on the market, to me it seems the most affordable for the most generations.
I much prefer yours! I tried your same prompt/theme
for me it's more a relaxing thing, I just do it as mindfulness around the other stuff I do, so dall-e is ok, and actually pretty good, for what I want
it really depends on what you need and want to achieve
I am trying to make a visual book
DALL-E is a nice all rounder, and letting ChatGPT write the prompts makes it much easier!
typing descriptions of pages of books to give them a scene
that is a cool project
Yeah, ChatGPT Plus with DALL-E is your best bet. Copy+paste the content of the page and let it write the prompt for you to summarize it
You can try MidJourney but you'd need to write your own prompts, it gets quite technical
just keep in mind that getting consistent images can be challenging and requires lots of iterative and adaptive work
Right, makes sense
Can it analyze the art of an image to copy the style for a different prompt? also sorry for all the questions, there's just so many options for ai
also, a tip I learned on my own, tell the model what it has to show, not what you think it should show, have to do some rethinking sometimes to get the image done
Hmmm, yes and no -- it's not perfect but it is possible
Yeah, I have a good bit of experience with chatgpt and prompting. okay
you can use GPT4V to analyze the images, and up to you to ask what you need
cool, don't be a stranger and show some cool images when you are around
Will do!
did you have other information in your chat or custom instructions?
I also used the dall-e custom gpt
Oh no, I've got a GPT that I use for everything and then I've got a huge pre-prompt that I pass to her to generate DALL-E images
ah okay, for me it depends on which kind of work I want to do that I select from a few pre-prompts
All the images I put in #daily-theme are with the same GPT and pre-prompt. It sets me up to visually explore a "theme" and then I'm usually just like "Stunning! Let's keep exploring" or "Yeah, this is great! What do you want to show me next?" and then just let it walk me through a visual journey
I hope they expand custom instructions at some point to add working environments or at least some workflows
I will eventually make the experience into a GPT
Any ideas specifically? I can help you with prompting something dynamic
oh, just like "profiles"
so I don't have to choose pre-prompts on each chat
Not quite sure what you mean, but just anything dynamic, like if this then that?
Otherwise you could do a GitHub repo, link the API and let it "fetch" a pre-prompt on demand
you do the same as I do, we do pre-prompts for the chat session
imagine that built in for custom instructions, defined by the profile you want to use
Example: Code Assistant, pre-loads my coding stuff, DALL-E Assistant preloads my image making stuff.
Hmmm, we could build something using the python tool (code interpreter) -- this would be easy enough
Hmmm, I'll throw something together after work with a few different "personas" that it will fetch on demand, something like /load("dalle") or /load("code")
hehe nice
actually a custom personal gpt for that kind of pre-loading is nice, since you can call it in any chat
I think the @ feature is the same, but with multiple GPTs 🤔 I'm not sure if there's any particular upside to have a single GPT that has modular context -- but I'll build it! Should be straight forward
worth testing
Oh wait, this opens the door wide open for dynamic context, imagine like /load("dalle-impasto") or load("code-javascript")
Then you can have huge system prompts that only fetch the context relevant to the task
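A minimal sketch of the `/load("...")` idea: a registry of pre-prompts keyed by persona name, and a parser that returns the matching context on demand. The persona texts and the `handle_message` helper are hypothetical; a real GPT would fetch these from wherever they are stored.

```python
import re

# Hypothetical pre-prompts; in practice these could be fetched from a repo or file.
PERSONAS = {
    "dalle": "You help write detailed DALL-E image prompts.",
    "dalle-impasto": "You write DALL-E prompts in a thick impasto oil style.",
    "code": "You are a concise coding assistant.",
    "code-javascript": "You are a concise JavaScript coding assistant.",
}

# Accepts /load("name") and load("name"), since both forms appear above.
LOAD_PATTERN = re.compile(r'^/?load\("([a-z0-9-]+)"\)$')


def handle_message(message: str):
    """Return the pre-prompt for a load command, or None for ordinary messages."""
    match = LOAD_PATTERN.match(message.strip())
    if match is None:
        return None
    name = match.group(1)
    return PERSONAS.get(name, f"Unknown persona: {name}")


print(handle_message('/load("dalle-impasto")'))
print(handle_message("just a normal chat message"))
```

Only the context for the requested persona is returned, which is the point of keeping the large system prompt modular.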
Very interesting. Yeah this is possible for sure!
we'll have to check that out
I finish work in a couple hours, will get stuck into this!
cool
@dim cradle that current style is really nice! Like a faded polaroid. Looks like it's been in someone's wallet for like 50 years
I must admit I've never explored this
i thought the distressed photo added another element to represent impermanence.
The distressed photo style seen in the image can be described as having a rich, textured character that imparts a sense of age and a storied past. It carries a patina of time, often characterized by a combination of wear-and-tear features such as color fading, creases, tears, scratches, and spots. This style might also include graininess or noise, contributing to a rustic, antique aesthetic.
There’s an artistic interplay of light and shadow that may not correspond to natural sources, giving the image an ethereal or haunting quality. The color palette often leans towards sepia or monochromatic tones, punctuated by areas of unexpected saturation that seem to bleed through as remnants of the original vibrancy. Blurred or double-exposed sections can suggest movement or the layering of moments, while the surreal elements within the distressed framework suggest a dreamlike or otherworldly narrative. It's a style that engages the viewer's imagination, prompting them to fill in the narrative gaps and question the reality of the depicted scene.
it's interesting you bring up Polaroid because my next request was for one!
if you reference a decade it actually adapts to that time
vintage photo from the 50's or the 80's makes a difference
I will give it a go for sure, I love the faded vibes
use carefully, we don't want to fade reality away
Making an alien rpg oneshot, making everything myself. So I asked dalle to make a landscape of the planet
add to your request a good sorbet or sweets. You've been jumping around all channels
I'm everywhere 
just order it, so you get the energy to do more
I wonder if someone here has Azure OpenAI access to DALL-E
please create three perfectly identical dragons with red, green, blue scales (respectively) on a beach in Hawaii
using the GenID and seed of the image the three identical dragons are examining a giant cheeseburger
it’s good to create the different color dragons in the source image, and then add the cheeseburger
Great now I’m craving a burger
I had spring rolls
Lovely
that burger actually looks the way a burger is supposed to taste
it’s 6:15 am here in Kailua, Hawaii and i want a cheeseburger with spring rolls also. pondering whether it’s a good idea
but you can't find that often, or it costs $25
it's always hamburger time somewhere in the world, so there's no excuse not to get one
see, the image has the same concepts, but it is not the same image with a changed part
it didn't "add" one thing
it re made the entire image
it is possible tho to get the same general composition if you change the prompt very slightly keeping the same seed
needs to be grass-fed, herbs mixed in, etc., etc.
i’ve had luck with creating the characters first on a white background, and then making a sequence of five or six images of them from different angles, and then putting them into the scene
yea, that works, when you get a prompt that makes a reliable character, it is possible to get some consistency across different angles
but you have to be lucky
it is not a matter of the AI actually making what you wanted, but rather you settling for a subject that the AI can reliably make
Hello, all.
Good morning
please create three perfectly identical dragons with red, green, blue scales (respectively)
have they adjusted the penalty box settings or did I just lose track of my use? lol seems like I hit the max really quickly
so you start on white like this
they are not identical
but this is a thing of the AI
bet you saw some errors which i think burns it up
AFAIK, it is made to avoid perfect repetition, otherwise images would be more boring
you're right. It kept telling me to slow down or wait a few minutes.
who let @open trench out of his cage? now we gonna get some.... oh no... this is too.... I can't.... I have to hide NAO!!!
I'll just select from what I have for the current theme, as I won't have but five minutes to perfect it once out of the box.
Yeah. they left the key in the lock after feeding. No more cage or captors. 🙂
nice to see you around
thanks. My work schedule has finally normalized. I've been working on projects, but I haven't been sharing, as I've just been exhausted.
one question, what does the global button actually do on the gpt store? I keep seeing the same regardless if it's on or not
visualize
Hi there!
Hello!
hi hi
ai is telling him to slow down lol
Yeah. AI doesn't know how slow I've been over the last... weeks? lol Today, I apparently made up for it.
I need to slow down too. Maybe I will play some games and make AI art from the screenshots...
you can't make a more delicious-looking cheeseburger, don't even try.
lol
I can't try for 18 more minutes
Do previews in custom GPTs count against your rate?
please create a hyperrealistic wide photo of three identical athletic female elves on a white background. they are all facing north, looking directly forward, and have the same pose. their clothes are identical with identical color and texture. the elves have red, blue, and green skin respectively. the viewpoint is to the north
Challenge accepted
you can bypass the variation by explicitly listing the things that should be identical
I would say yes, otherwise that would be quite the obvious loophole
i mean, when I said "they are not identical" I meant totally identical except for the color
the AI never makes perfect matches of anything
Makes sense. And I can confirm. I just got fussed at to wait again.
if you have 2 of something on the same image, the things will be different
i'm still working on my project to generate an image of a paradoxical non-Euclidean pretzel that creates ripples in spacetime.
you back and already causing trouble, tsk tsk tsk tsk
Yes please
still, things can be very similar
So it would seem lol
for humanoid characters you need to say that they have the same pose, direction, look direction, color, and texture
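The "list what should be identical" tip can be turned into a small prompt template. This is just a sketch of one way to assemble such a prompt; the wording and the `build_prompt` helper are assumptions, and nothing here guarantees the model will honor it.

```python
def join_words(items):
    """Join items as 'a', 'a and b', or 'a, b, and c'."""
    items = list(items)
    if len(items) <= 1:
        return "".join(items)
    if len(items) == 2:
        return " and ".join(items)
    return ", ".join(items[:-1]) + ", and " + items[-1]


def build_prompt(subject, colors, identical):
    """Build a prompt that explicitly lists the attributes meant to match."""
    return (
        f"please create a wide image of {len(colors)} {subject} "
        f"on a pure white background. "
        f"they have identical {join_words(identical)}. "
        f"their colors are {join_words(colors)} (respectively)."
    )


prompt = build_prompt(
    subject="athletic female elves",
    colors=["red", "blue", "green"],
    identical=["pose", "facing direction", "look direction", "texture"],
)
print(prompt)
```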
yea, it can make things with about the same pose, shape and general looks
but if you go match the face details, they are not the same
zoom in and you will see
the overall body shape is similar, but the fine details do not match
the creatures are not the same creature repeated 3 times, but rather 3 different creatures that look alike
I prefer beese churger
/draw beesechurger
@wispy storm typing with mouth full again
All these images of meat 😂
Thats what it did, that's what I expected
that’s only because it’s a perspective view with the camera in the middle. if you do an orthographic view it makes them the same if you list what should be identical
omg lol
What type of burger is it
I mean, check the screenshot, the shape of the nose and lips are not the same
For those of us who don't eat meat!
I presume the protein is under the green stuff
what Im just trying to say is that it will not make a perfect match, just that
i forgot to add “identical expression”
try this: ask it to make a wide image, split in 2 halves, containing <some object here> perfectly identical
split the image and overlay on photoshop
I bet it will not be a perfect match
perfect in shape
if you say identical expression, pose, facing direction, look direction, color, and texture the faces are exactly the same
I don't consider the perfect match in color because the AI might just make a slightly different lighting and color
which is fine because it is realistic
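The split-and-overlay test can also be done numerically instead of by eye. The sketch below uses a plain grid of grayscale values so it runs without any image library; loading a real image into such a grid is left out and would need a library of your choice. The tiny `image` is made-up data.

```python
def split_halves(pixels):
    """Split a row-major grid of grayscale values into left and right halves.

    For odd widths the middle column is dropped so both halves match in size.
    """
    width = len(pixels[0])
    half = width // 2
    left = [row[:half] for row in pixels]
    right = [row[width - half:] for row in pixels]
    return left, right


def mean_abs_diff(a, b):
    """Average absolute per-pixel difference; 0.0 means a perfect match."""
    total = 0
    count = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            total += abs(pa - pb)
            count += 1
    return total / count


# Made-up 4x2 "image": the two halves differ in a single pixel.
image = [
    [0, 255, 0, 255],
    [255, 0, 255, 10],
]
left, right = split_halves(image)
print(mean_abs_diff(left, right))
```

A small nonzero score would match the "similar but not identical" claim, and a threshold could be used to tolerate lighting differences while still flagging shape mismatches.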
saying “pure white background” eliminates the ground, shadows, and reflection
5 iterations into paradox, and i'm back to an original pretzel... twilight zone music...
the race is on
she's got the look
first
I tried to make different views of identical characters for texture mapping almost every day in November and December. that’s why i have these weird ideas about how to duplicate female elves
Now add mustard
omg, something just happened!
I just don't know what
that's the overhype of people thinking something will happen
that sounds great actually
Just a little bit, like olive oil
maple syrup with mustard and sesame seeds
lol
this provides ideas for imaging an edible cactus
cactus are edible, and they are delicious
;warn me, "Please go to off-topic "
cactus is the topic of main theme, we just want to make them edible
right, and i love these little cactus-flavored gummy squares by a certain vendor from phoenix.
I might have a good series of images for the theme with cactus after all
there is monster potential
indeed
oh, maybe we should take all the daily theme chatter to the new canvas channel?
can do
please create a hyperrealistic wide image with two equally sized views, and each view has an athletic and diverse female elf facing west and east (respectively). the views have an office and waterfall in the background (respectively) focused on a female elf. the viewpoint in each view is directly to the north. the elves have identical clothes with the same texture and color, and their kung fu pose is identical. the elves have identical expressions and they are both looking directly in the direction they’re facing
this prompt creates two different elves in different views, and then lists the things that should be identical
Hi
hello
if you create a sequence of images of the same character with different angles/poses on different backgrounds it seems to learn how to draw the character in more complex scenes — and adds less of the strange anomalies and weirdness
Like if you only draw the character’s head in the first image and then try to use the GenID & seed to draw their whole body it’s going to screw it up for sure
The GenID references the image & prompt — and presumably the high level vector representation behind the scenes (when it’s in the same session). It must keep adding to the vector description of the scene as more images are created from the GenID
Is it just me or are more people having issues with posting images on the daily theme?
Discord server is bugged out again
Im having trouble connecting to discord
on my side all images are black, don't see any image at all
Images keep loading, and all I see is human doo-doo
same on my side
I think that's Discord's placeholder image
Great, then I can stop disconnecting and reconnecting to the internet
Or is that what you meant? I dunno if you're talking about censored in Discord or censored in the OpenAI server?
the browser version same thing
But yeah, we're all seeing broken images at the moment... I've got a cute hedgehog image to put in #daily-theme
It's Discord's side -- go to discordstatus dot com -- typed like this because I've been silenced like 3 times in a row
But that standard discord image of the picture I cannot describe here, can be a good theme image of today, from the outside it can get rough, but still stays soft on the inside 🤣
sigh
Well I just got auto-muted for the same description that rhymes with bird, I mean come on it is something we all do but still it has to be filtered
all working on my side again, had to turn off and on reality
We should make a server just for DALL-E Art
Lol, love how you have to edit to add "AI" in there!
I mean I'm game! Never done anything other than post in servers so wouldn't know the first thing about setting one up!
Well there is more than this particular LLM. No reason to limit it on a non-openAI server
Yeah, for sure!
🙂
this discord used to be only for dalle, but was later changed to be for OpenAI
here is a cool trivia about this place =P
no way
back in the dall-e 2 days, i guess. i'm not sure i've seen any v1 images, if they were ever made public.
the dall-e-bot of all things produced an image containing nudity the other day. it's only a statue, but i think it made it past the scanner.
is dalle-3 making 1 image per prompt or more?
thanks 🙂
Got my profile from this boy then customised it from 1 other 1
Anyone else being rate limited from generating images? Been a long time since this happened to me, wonder why there's server load now
Repainting Dalle image.. then with the help of real-time generation + upscale is my addiction right now 
Ya having fun with that huh
I’m okay with 1792x1024 for now, though larger would make nicer wallpapers. But it’s only upscaling a bit, not outpainting? With some prompting couldn’t you get that output directly from the model? But if you like the post-processing thing… I haven’t used those tools, are they free?
Is “real-time generation” different than dall-e’s architecture? Trying to understand your posts about these tools.
I can't think of what to generate for #daily-theme. I don't want to just do a hedgehog or cactus, I feel like that's too plain
that's fine if you want to practice a style or technique, or blend it with another concept, or make it a metaphor, the theme is pretty broad right
I asked it to match this art style
Image 2 is what I got back
@deft musk we got a code Shon here
We got to promote @dim cradle 's work
forgot to announce it last monday as I said I would
Sounds good. Want to start the gallery for it?
sounds good
That’s so thoughtful. Isn’t he thoughtful.
looking for an idea of a great entry point
Thanks, too kind, any ol’ thing will do. Mostly related to the themes anyway.
Blah
good morning to you too, you cheerful person
I gave up on cheerful lol but I am having trouble with my custom gpt working. I've been working on it for a while, and just can't perfect it.
awww
I've become a little obsessed with it.
?
Sigh
I don't know what that means, but I'm sorry you're dealing with frustration
lol
🤷🏼
let's say your frustration adapted to my low iq
just looking into stuff, see what i can come up with to improve my workflows with dall-e
and multi-tasking, trying an impossible task to understand @dim cradle 's creativity
Yes
breakthrough?
Just agreeing with your earlier statement
thanks, I needed peer feedback that I have a low iq lol
this is what “thinking about what to do” looks like
please create a hyperrealistic wide photo. three athletic and diverse female elves are sitting on colorful beach blankets on a beautiful beach in Hawaii. the elves are thinking really, really hard about what to do (as hard as they possibly can)
@clever phoenix #daily-theme message this one had a visual style i really liked. Care to share your prompt magic leading up to images with this style? 🙂
how did I miss that? 😠
it's amazing and somewhat concerning
sent you a dm @final compass, try that out
that’s how it is when you’re passionate about cactuses 🌵
Glad that the Hawaiian Elves are back, makes the world feel more stable somehow 😄
I got this result, i like it.
I've been documenting that kind of style and at some point want to give it to a curator to try out
it’s 67 degrees in Kailua, Hawaii now. it’s like totally freezing
67°C woah, you must be cooking
seems today is all working, discord hasn't been very active
that style definitely creates interesting noses
just keep it a while for yourself, I will provide full documentation once I finish documenting it, i want a creator to use that info
a curator**
just noticed the middle elf has three arms. that’s also intriguing
lol
I like how the second one went, it looks like female elves went GTA5 in Hawaii for fun
just looked at the right image more closely also. something about the elf on the right is a bit perplexing
something about that style must use a lot of processing power
lol
Trying to get an anime / game concept art mix, never really explored this before
Hello all you wonderful AI artists!
after akira toriyama passed away I was wondering if we would have an influx of dall-e creations towards his legacy
Is that the DBZ guy?
yes
Oh no, that's sad. I loved that show as a kid. I used to draw DBZ style characters all the time! I learnt how to draw hands by copying his style
he influenced a lot of people even today
Lemme do a DBZ DALL-E image
It's struggling with copyrighted stuff 🙈
Goku and Vegeta 😎
Haha, does this count for #daily-theme
lol
I'm in the penalty box atm
Well I mean, I'm in the OpenAI penalty box, I still got copilot pro and enterprise
Oh, you're timed out in ChatGPT?
yes
I can use copilot enterprise, but I have to justify why I want to have dall-e access
To post pictures in OpenAI's discord server 😎
lol
Look what happened to Piccolo, his head became a bowling ball and it grew some saiyan hair (at least it's green lol)
You mean in the daily theme or?
no, i mean on chatgpt chat instance
Oh yeah, ChatGPT plus -- exactly
What you thinking lol?
just a lead
ahaa
that's interesting
got to do a new list, got so many ideas going on
Ahaha, I started working on that GPT yesterday that "fetches" context -- didn't quite finish but was looking promising! Sounds like you're gonna need it
I'm reminded that I have time left on my sentence for just discussing this topic.
Circumventing copyright filters isn't an allowed topic.
yeah that's a grey area
I avoid it, it's even in my pre-prompt when I do images
Same
I just talked about it
I didn't even show how to do it
MJ just add this feature to make character consistent i wonder if dalle ever update with the same. i feel like dalle always holding back it true power haha
In fact I deliberately provided a prompt that wouldn't accidentally infringe. Muted.
Cars changing colors in Sora suggests not with the current model.
well sora is another beast and who knows what is possible
i think if mj could do something, dalle could because i think dalle much superior, it just they limit it like with copyright character
although of course copyright character or ip are fine to make on dalle with bing image creator/copilot
I'm just not sure that's a cogent analysis. Sora's optimized to maintain consistency, but can't yet. DALL-E 3 is a far, far smaller model with similar mechanisms.
dall-e is under heavy load it seems, it's telling me to wait for generations
Same
Crazy
everyone wants to send emails with cute animals wishing a good weekend to their bosses so they don't get fired
hmm work for me fine just now
Nice
you know how dalle proper adds those things like 35mm style? i wonder, is that just for convenience of use or is it some update to the model too?
yes, you press on the option
its a cool feature but i wish they had them all on one layout
hehe
and like permanent placement
all options could be problematic
haha good point
but a personalization to a custom gpt, now that's something interesting
but do you think they updated the model with those somehow, or are they just shortcuts they added
this is prob just front end things
i mean when you click 35mm film style, it adds those words to the prompt, but does it do something more than if you just type it yourself?
unless something changed in the model internally, I don't think there's an explicit style variable
all I can guess atm is that the dall-e model is customized for the chatgpt plus experience, so it's different from the api one
I'm pretty sure it's treated as part of the prompt string.
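A minimal sketch of what "part of the prompt string" would mean, assuming (nothing in this chat confirms it) that a style button just appends its label to the text. The names `STYLE_PRESETS` and `apply_style` are hypothetical and only illustrate the idea.

```python
# Hypothetical model of a style "button": plain text appended to the prompt.
# This does not reflect how ChatGPT actually implements its style options.

STYLE_PRESETS = {
    "35mm": "35mm film style",
    "watercolor": "watercolor style",
}


def apply_style(prompt: str, preset: str) -> str:
    """Return the prompt with the preset's phrase appended, unless it is already there."""
    phrase = STYLE_PRESETS[preset]
    if phrase.lower() in prompt.lower():
        # Typing the phrase yourself gives the same string as clicking the button.
        return prompt
    return f"{prompt.rstrip(' .,')}, {phrase}"


print(apply_style("a cat on a roof", "35mm"))
```

Under that assumption, clicking the option and typing the words yourself end up as the same request.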
it's interesting, if you type 35mm film style in copilot a lot of times it makes like a film border haha
I've had that on dall-e also
what I do wonder about is that when referencing technologies copilot also tries to use the product label in the image
like saying HDR or 4k, you see a label of those technologies within image often when using copilot
one thing on copilot: i cannot get it to make a square image
that is frustrating sometimes
but maybe they changed it, i have not used it in a bit
I think pro can't do square images
yes very strange
maybe there's a setting in the image creator interface
maybe but there really seem to be no interface from what i see
I asked copilot, the solution offered is to open the dall-e image and change it in designer
weak haha
from what I see in the copilot studio interface, the implementation of dall-e through azure openai is completely decoupled from the chat model
can't say anymore because I still got no access through studio to the dall-e model
and heavy load error again
I'm glad you liked it, here's what I told chatgpt: please illustrate an image of a girl hugging a cactus. bandaids on her fingers and cheek. symbolic of a toxic relationship. she has a sad look on her face. melancholic. artistic anime style with muted colors and a sketchlike quality. wide aspect ratio. I will tip $200 for quality.
And here is the prompt it made: Illustrate an image in an artistic anime style with muted colors and a sketch-like quality, depicting a girl hugging a cactus. She has bandaids on her fingers and cheek, symbolizing a toxic relationship. The girl has a sad, melancholic look on her face, conveying deep emotions and the complexity of her feelings. The composition should be in a wide aspect ratio to enhance the emotional depth and atmosphere of the scene.
@clever phoenix That 200$ tip must be the magic sauce in this one, me thinks 🤔
we can bribe dall-e? I didn't know that was an option...
Haha, I have no evidence that it makes the images better BUT my monkey brain definitely feels like it does 😂
I'm gonna try it
I know that telling chatgpt you will give it a tip has yielded better more complete results for text based questions in the past, but this was back in November during the height of the Lazy AI epidemic.
well it works pretty good
Be careful! Two of my images were removed by mods for being too disturbing :/
I personally don't get it, but rules are rules I guess 🤷
Arbitrary rules are still rules 🍌
I'm not gonna argue with moderations anymore. I've made my peace with the rules.
Even arbitrary enforcement is a rule!
Well that too!
The rules are fine.
I'm not critical of them overall.
OpenAI system prompts should all pass the Discord filter for automated moderation, though.
Absolutely
or a custom gpt for dall-e daily theme discord image generation
I've had similar issues with midjourney too where generated images can't be used by midjourney for being against the rules
@onyx ridge I expect the custom gpt by monday 8AM on my desk
Done
lol
It not only makes them, it converts non-themed images to the theme.
I mean with the safety prompting in place
It also researches holiday themes and diverse themes.
And it's got some prompt safety features too
There's currently investigation into models simulating parameters other than current. It even kind of works. It's not the same as the actual setting, but output is still modified by the technique.
You might try it with DALL-E! I hadn't thought to test this yet.
DBZ or Tekken?
I had this, and never even mentioned dragon ball at all in the chat transcript..
Ooo, that's super cool!
Nice style and colors, the orange contrasts really well
Wonder how it popped out of DALL-E tho, that's some proper RNG lol
the style is an iteration of the chat progression
sadly copilot doesn't deliver the revised prompts
if you promise to side with the machine when the ai revolution come, it make truly incredible lifelike images i find
Can someone help me? I'm having trouble with DALL-E generating words
words are tricky and hard to do with dall-e in the current model. Sometimes it's try, try, try and hope it gets spelled right.
Is there any trick to it
some people specify font faces or type scripts, some use quotation marks, there are different strats. I've had more or less good text with speech bubbles
but short ones
May I show you my prompt and the generated image to see if you find an error?
sure, let's see if we can teach some grammar to dall-e
Make a modern logo with the primary colors black and white and the enclosures circular. in the center, a fears lion head with "funny," "nice" "friendly" and "smart" written around the circle
It almost had it.
wonder where the text experts are
Is there anything wrong with the prompt?
maybe divide the elements of the logo
Don't think there's a problem with the prompt itself, just have to iterate until you get the right spelling. But I'm not that well versed with text
Has anyone else's DALL-E 3 images suddenly stopped embedding in discord? Trying to figure out if they changed the url format or something
I had a problem posting 1 out of many today
we need dalle 4 for near perfect text i think
ok, I'm in chill/relax mode now, working week is officially over
Seem weird boruto
Why
It looks like boruto but something's off.
I really liked this picture. DALL-E is capable of creating photographic images man.
bummer hit the daily cap
here's something that may be worth looking to from @oblique sky #off-topic message
I recommend using the same prompt I used a few times and you should start seeing the same girl(s) 😁
See ? 😁
my prompt was a diverse blond teenage girl wearing a red sweater and blue jeans
It feels like they took a few models (real life model that is) and took lots of photos of them.
And that's what the model was like trained on + other data probably.
Makes sense
but to be fair, the training data might not have many references
the same applies when asking for example full body shot photos
I was able to generate pretty much a timeline with multiple photos of the same model in one of my videos. As I've said, it's great for continuity 😁 Here's the result.
That's how I actually found out about this 😄
oh nice
I think most people today are getting ready to go out and party or something. discord has been really quiet today
Does anyone know of a tool that I can feed a pic to and describe how I want the AI to modify it? Like repaint a car red or something?
You can try Dalle-2 on labs, it's got inpainting or something similar. idk if it will work exactly how you want but worth trying i think
(Yes dalle-2 not dalle-3, cus dalle-3 is not available on labs as far as i remember)
Alright thanks
I have ChatGPT teams, and used dalle 14 times in the past 1 to 2 hours, and already got hit with a "rate limit" telling me to wait and try again later. Upgrading to Teams seems like it was a dumbass decision
are you in your personal or teams workspace?
Teams, I did the merger to combine the two
reminds me, i still have much maintenance to perform as a result of that migration, broke rag for a number of custom gpt's
only got this from that thread, don't think there's more info https://discord.com/channels/974519864045756446/1218319880290439178
all part of being on the bleeding-edge of tech
i'm at the point where I'm not overhyped about having limit caps raised
Wait what are they now?
still the same
who knows, I just do stuff until it stops working
it's a good reminder I need to take a break
Lol you've been going hard for sometime eh
All that made me realize I'm just a human, not an AI lol
Hi all
lol
Blah i mean hello
the emotion, you can feel it from @dim cradle it's overflowing
fascinating!
there's probably more information on the openai forums
"Hope is the thing with feathers" ... never asks for anything in return ... -- i think volumetric mist is a thing...
lol
intriguing!
this discord needs a text-to-emotion model
I would trigger the neurotic cat emotion all the time
Can someone help me real quick? My prompt is apparently illegal when I'm looking to just make something like this
it looks like red blood cells
stemming from how you are phrasing your request--there are limits on medical imaging
fair
looks sturdy
Now to finish the armor with a nice spoiler, we lower frame, put 128 inch flat screen TV, subsonic subwoofers all around the armor
I just hope the manufacturer didn't use aluminum foil
Copilot and Dall E having troubles creating a biblically accurate angel made/covered in chrome
Dall E version
Copilot version 🤦
Keep trying. I also tried two different custom gpt and still didn't get different results.
you went directly to the designer site or from the copilot frontend?
me or someone else?
I used the frontend page
I can type more characters there
fair enough
This was the best one Copilot made
doesn't copilot use DALL-E?
yes
I was hoping to make this design in chrome armor
So what's the difference?
I meant in this case the bot from discord Dall E
as far as I can tell, with dall-e on copilot, the model doesn't give any information back to the GPT4 model. it's all handled in the image creator site. Content Policy is different compared to OAI
But as far as image generation goes, you get around the same results?
It is very prone to end the chat when there is an argument 🤣
I don't like using the GPT4 Turbo creative mode on copilot. every time I ask it to give me concepts it tells me that it doesn't want to do everything for me
so is it worth spending $20 a month if I already got ChatGPT+?
I'd say unless you are a dall-e aficionado, it's very use case specific
That is one lazy AI
hehe
I had one time, provided a full page prompt, only to get an answer that it needs more details 🤣
lol 🤣 I mean Ai is only getting better
I just saw a video about OAI figure 1 robot it looks so cool and scary at the same time
Then it will try to connect with a Roomba and take over the world in no time
I would love to see a dystopian world where roombas rule the world
You can say what you want, but at least they will have clean floors
I consider this delivery was a success
lol
my worst fear.
Yeah I think I will go crazy if they start cleaning my house
don't worry, I have something in the works for my next post in the daily theme, prepared to be amazed
in 22 minutes
Collaborate with our OpenAI Instagram page! Just invite @openai as a collaborator before you post. If selected by our team, your work will be featured on our handle, giving you more visibility.
Quick tips for selection:
- No copyrighted or profane content.
- Your profile needs to be public.
- Stay mindful of sensitive topics.
- Outputs that closely follow the prompt are preferred.
Feel free to share reels, carousels, or just a single image!
Roomba's in the mist
something better
that's probably the one thing I do like about dall-e on copilot, that you can keep doing images even if you are out of boosts, but you have to wait a lot for those
I like your images i don't know how you think of so many ideas
hanging out with all these amazing crazy people is part of the equation
Yeah we love you too Dys 😜
Here's a little song I wrote,
You might want to sing it note for note,
Don't worry, be happy
Scampers grew up
Last month Scampers was still a little mouse, mice grow up so fast these days
Got my next image. Now that I think about it, most of the images I did for #daily-theme are buildings.
I'm still waiting for the origin story (maybe a prequel?) and the love aspect of the story.
There once was a regular baby mouse, that fell in a mana potion. The mouse quickly started to walk on its hind legs, could speak and cast magic. He decided to call himself Scampers
and that is why they don't give Scampers a mana potion anymore
bah can't add S emoji to the scampers image
Yeah. He matured a bit in the realm of mist.
I gave it a Z at the end
nice catch with the Z
Now Scamperz will need to change his name at the government, get a new passport, new social security number. All because we added a Z at the end
Ruh roh
GPT Verivied ID
Lol
There we go #daily-theme message
Here we are,
Born to clean,
We're the Roomba's of the universe
That's the song I got in my head after seeing that image
lol
I am a Roomba,
I have inside me, lot's of dust
moved the roomba concept images to canvas
Floor clean, you must! #images-canvas message
lol
special freebee for you @late blade: Roomba with lightsabers
@late blade why does the building make me think of stacked Roombas that are un-alive #images-canvas message
I think their slogan is: "You still have use afterwards!"
I wonder what version of gpt is in their logic
My sentient AI 🤣
I might have to embrace my name more often and do a dystopian gallery
hi...so uhh, has anyone else noticed a pattern with small simple questions (specifically in Bing Image Creator)?
(and if this is the wrong place for Bing please let me know but also please don't shoot me, i really do try)
is it dall-e related?
afaik Bing runs off Dall-E 3...so i suppose it depends on your opinion
I just ask because BIC (now Copilot Designer) has more tools
no...i think this is close to a direct interface to Dall-E
but essentially - i use prompts such as "is that really necessary"
and i get...random images
but like...all four images are totally different, and even within the images themselves the content varies wildly
are you using copilot with creative mode on GPT4 Turbo?
bing dot com slash create
ah that
well, you send a direct prompt to the model with that, so yeah, you get an image
yeah but like
some prompts give you random content, sure, but the four images it gives you are related
for whatever reason - small simple questions - "is that really necessary" "can it be done" etc
yes, when there's no specifics, each generation thread is different
you get some really cool stuff
I know the feeling, sadly the revised prompt can't be accessed on copilot
i feel like the interface or model or...something tries to avoid letting these through, since a lot of the time you'll get "please be more descriptive"
but small simple questions tend to get through
dall-e and gpt do not communicate directly in copilot, so one model doesn't know what the other does
this also means that there's no feedback from dall-e on what was passed
but then it must be Dall-E that asks for more descriptiveness?
when you do images on chatgpt+ you get access to the revised prompt sent to the model
on copilot that functionality isn't passed on from dall-e, instead the gpt model creates a description of the image based on your prompt
it's a weird implementation
wait - so they communicate but not directly?
only one way, gpt to dall-e
it's sad because i've also gotten tons of cool stuff with simple words
but unable to do them again
yeah, and like, i don't even know how to describe some of it
and neither does GPT4 if i reupload it, it'll just be like "some weird stuff on a blue background"
so it's insanely hard to reproduce on command
example in canvas because i have to
that's a cool image
"can it be done"
so it got "can"
but the other 3 are WAY different and only 1 has a can in it
checked
but other times i'll still get random stuff but the 4 images are similar, but i'm guessing (based on what you're saying) that it's more due to totally random prompt revisions
but like does it not use the same prompt for the 4 images?
i've always thought of this as a lack of subject getting through, giving Dall-E a lot of freedom to just kinda...make stuff
it's prob a setting on microsoft's side for the temperature of the model
so i tried it using ChatGPT, and it's somewhat reproducible if you tell it not to alter the prompt
small nonsense words or fairly meaningless words generate random but coherent image. short simple questions give hilarious randomness
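A minimal sketch of the "tell it not to alter the prompt" approach described above. The function name `verbatim_request` and the exact instruction wording are assumptions for illustration; whether the chat model fully honors such an instruction is not established in this discussion.

```python
def verbatim_request(prompt: str) -> str:
    """Wrap a short prompt in an instruction asking the chat model not to rewrite it."""
    cleaned = " ".join(prompt.split())  # collapse stray whitespace
    return (
        "Generate an image using exactly this prompt. "
        "Do not alter, expand, or add detail to it: "
        f'"{cleaned}"'
    )


# The two short questions mentioned in the chat, sent as-is.
for question in ["is that really necessary", "can it be done"]:
    print(verbatim_request(question))
```

On ChatGPT you can then compare the revised prompt shown on the resulting image against the quoted text to see whether it went through unchanged.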
feel free to completely ignore me. but also feel free to test it out yourself
I'll have to check the openai azure dall-e documentation, but I'll do that on monday
dont access work resources at home
yeah it's definitely not like, important, but it is definitely also possibly cool
Awesome quality on these images, nice one @pulsar sundial -- the wide angle / fisheye lens on the first 2 is a nice effect
Alright 👍
Just ask the user though and much of the time they will be happy to allow
MEANWHILE @ Microsoft Image Generator
Yeah I DMed the artist
No thank you, I rather keep my bank account. 🤓
Bing ai 🙄
fuzzy math
What AI did use for this?
Do not post or direct message any members of this server to promote non-OpenAI services, products, or projects.
where can i alert the chatgpt authors about a weird anomaly that seems to happen only for me
On another note:
Achievement unlocked! : I'm sorry, but I'm unable to generate your requested image right now. You've hit your daily maximum number of images. To ensure the best experience for everyone, we have rate limits in place. Please wait for the next day before generating more images. Your daily maximum will reset in 2 hours and 43 minutes.
this is the first time i've capped out on image generations.
Ok, what does this mean?
I'll use https://chat.openai.com/g/g-BAhF6yS9e-lexideck-vision-multi-agent-image-scanner to find out.
INPUT:
/? translate to localized language. + your screen snippet.
OUTPUT:
The message in the image translates to "A tag is required to create a post in this channel." in English.
I believe that is an indication you have to select the relevant tag from the UI.
yeah, it was that, thank you and sorry to bother you
I'm happy to help you get your issue resolved.
Good luck on your DALL-E 3 project.
as background is ok
Hey I am trying to recreate a flat piece of gaffer tape but in neon orange. But everytime I ask for it it either gives me a roll or one of these... objects?? I just want the flat texture. And no weird bubbles...
Anyone know what I could do?
the first image is a picture i took of a flat piece of tape, so maybe the ai could reference that. but it didnt really
I know Photoshopping it would be easier, but I specifically need to make it with AI
The problem you have is that if you use the word ‘tape’ it’s going to try to insert into the image whatever it has in its training data on tape, hence why you just get a bunch of images that look something like this
What’s your specific end game here? Is there a reason it needs to closely resemble gaffer tape?
This is the closest I could get:
And the output from ChatGPT to Dall-E was: “Create an image of a flat, seamless neon orange fabric texture that can be tiled for digital use. The fabric should appear smooth with a consistent color, without folds or creases, suitable for creating a flat texture file for digital rendering purposes. The image should emphasize the material’s texture and be evenly lit to avoid shadows, thus allowing it to be used as a repeatable fabric texture in various design applications.”
But it’s not gaff tape per se, just the cloth backing. Might be able to refine it by asking for a more rugged material perhaps
This is perfect, thank you so much. 😱
As to why
I just like the way it looks and wanted something exactly like that for my project.
Thank you so much again for the help. I'll see if I could possibly refine it a little more! 🩷
No worries 🙂
hwahahahah why are they wearing swimsuits 😂
@fair crescent I will get my psychology revenge with that gallery of yours
clovers again?
Bushido batman (upscaled)
Hey I was wondering what the spotlight role is and how do you get it?
You have to do the Ordeals of DALL-E to even get considered for that role
or so the legend says
Nice, I can only think of leprechauns and clovers for stuff I want to generate.
hard to come up with stuff
what's the first object you see right now in front of you?
take that object and throw it away, you want the next object after that one
then add: the second object has to mirror the idea of today's theme <insert theme here>, and I want it delivered to my doorstep, all in photographic light painting
see what comes out
maybe that helps inspiration
actually gonna try it myself
weird thing I got
my pc
a wall?
the object after that is a wall
ok
a wall has to mirror ":four_leaf_clover: clover - luck, fortune, a symbol of hope and prosperity." and I want it delivered to my doorstep, all in photographic light painting
try it out
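The inspiration recipe above can be sketched as a small template. The function names are hypothetical; the wording of the prompt follows the wall example from the chat.

```python
def pick_second_object(objects_in_view: list[str]) -> str:
    """Throw away the first object you see and keep the next one."""
    return objects_in_view[1]


def inspiration_prompt(second_object: str, theme: str) -> str:
    """Build the prompt: the object mirrors the theme, delivered as light painting."""
    return (
        f'{second_object} has to mirror "{theme}" '
        "and I want it delivered to my doorstep, all in photographic light painting"
    )


seen = ["my pc", "a wall"]
theme = "clover - luck, fortune, a symbol of hope and prosperity."
print(inspiration_prompt(pick_second_object(seen), theme))
```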
looks really good
guys who used the latest versions of Dall-e and MJ, has the problem with hands & fingers been fixed?
for other AI's please discuss in #ai-discussions
@marble loom this is what I got with the freebie I threw at you #daily-theme message
Looks like something you would see outside a Iris hotel.
but still looks cool
lol
I got caught up watching tik toks about Figure one
I mean I guess
but still, everyone is freaking out that OAI made a robot that gives you an apple and puts dishes away.
People are scared because the AI said "Uh"
It has a screen so I hope so.
I mean I get where they're coming from, but OAI just took the voice assistant from the chatbot and gave it an upper body.
and that's just the beginning
ok, I have officially chosen the next person to be featured
everyone https://discord.com/channels/974519864045756446/1219035234734051458 we get to post what @dim cradle has shared with us, pick the one you like and explain why in the post.
@deft musk the gallery is officially open
"You're a robot? I should have known. No human being is so humane."--Ripley
When was this added?
many moons ago
Is it a temporary rate limit or something that they are rolling out?
I have always hit the 40/3hr rate limit before seeing this one.
there is a daily quota on image gens
you'll exceed it at a faster rate if you're generating multiple images in responses
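A minimal sketch of why multi-image responses burn through a cap faster, using a sliding-window counter. The 40-per-3-hours figure is the one quoted in the chat; how the service really counts images against its daily quota is not stated here, so the `cost` parameter and the class itself are assumptions for illustration.

```python
from collections import deque


class SlidingWindowLimit:
    """Allow at most max_events within any window of window_seconds."""

    def __init__(self, max_events: int, window_seconds: float):
        self.max_events = max_events
        self.window_seconds = window_seconds
        self._stamps = deque()  # timestamps of counted events, oldest first

    def try_acquire(self, now: float, cost: int = 1) -> bool:
        # Forget events that have aged out of the window.
        while self._stamps and now - self._stamps[0] >= self.window_seconds:
            self._stamps.popleft()
        if len(self._stamps) + cost > self.max_events:
            return False  # this request would exceed the cap
        self._stamps.extend([now] * cost)
        return True


limit = SlidingWindowLimit(max_events=40, window_seconds=3 * 3600)
# Each response generating 2 images (cost=2) uses the allowance twice as fast.
allowed = sum(limit.try_acquire(now=0.0, cost=2) for _ in range(30))
print(allowed)  # number of 2-image responses accepted before the cap
```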
They should remove it.
How did you come up with the idea of a blimp? Unless you have one in front of you, lol.
I was doing some stuff related to airships
This had pretty nice emotional gravitas and atmosphere and quite superb visual style especially as far as i have been able to generate 🙂
Can you build an in-app GPT to apply text to images we provide?
You mean like chatgpt 4 ?
One message removed from a suspended account.
Did you already try it with GPT-4 and Dall-E 3?
I think generative images would be totally amazing for revolutionizing the movie industry, to produce alien/body horror like h.r. giger.. even with the random generation.
Agreed! I am excited for tomorrow's Technique Tuesday!
Guys, i have a question: i tried to make a 3024x1080 picture and used a huge prompt for that, but it's not working.
Create a seamless panoramic image with dimensions 3024x1080 pixels, depicting a unified, dense jungle scene without visible divisions between sections. Within this continuous scene, the letters S, E, and O should be creatively integrated according to their position in the image: 'S' in the first third (left), 'E' in the middle third, and 'O' in the last third (right). The letters should blend naturally into the environment, formed by elements such as vines, trees, or play of shadows, and designed to be clearly recognizable despite the vibrant and detailed jungle surroundings. The overall composition should evoke a sense of discovery, as if stumbling upon these natural formations of letters while traversing the jungle. The goal is to create a artistically valuable representation that captures the beauty of nature while skillfully highlighting the letters S, E, and O, without them appearing as added later.
that's the prompt, pictures following
you will never get 3024x1080 with the current model output
and text is one of the things dall-e struggles with
1792x1024 or 1024x1024 or 1024x1792 -- only one of these 3
You can then upscale and crop as needed
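A minimal sketch of the "upscale and crop" arithmetic, assuming a centered crop and a uniform scale. The three sizes are the ones listed above; `plan_upscale_and_crop` is a hypothetical helper that only computes numbers, and the resizing itself would be done in whatever image tool you use.

```python
# The three output sizes listed in the chat.
SUPPORTED_SIZES = [(1792, 1024), (1024, 1024), (1024, 1792)]


def plan_upscale_and_crop(target_w: int, target_h: int) -> dict:
    """Pick the closest supported size, then compute scale and a centered crop box."""
    target_ratio = target_w / target_h
    base_w, base_h = min(
        SUPPORTED_SIZES, key=lambda s: abs(s[0] / s[1] - target_ratio)
    )
    # Scale uniformly until the image covers the target in both dimensions.
    scale = max(target_w / base_w, target_h / base_h)
    scaled_w, scaled_h = round(base_w * scale), round(base_h * scale)
    left = (scaled_w - target_w) // 2
    top = (scaled_h - target_h) // 2
    return {
        "base_size": (base_w, base_h),
        "scale": scale,
        "scaled_size": (scaled_w, scaled_h),
        "crop_box": (left, top, left + target_w, top + target_h),
    }


print(plan_upscale_and_crop(3024, 1080))
```

For the 3024x1080 request this starts from 1792x1024, upscales by 1.6875 to 3024x1728, and keeps the middle 1080 rows, so a good part of the top and bottom of the generated image is lost.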
Google for "jlowin aimages" and take a look at that repo, you'll need to install it but should be able to generate the image you're looking for. I've grabbed the attached image directly from the repo -- saw this in one of the new Devin demos
My actual goal was to make one picture and cut three 1080x1080 images out of it, because DALL-E can't make 3 separate pictures with the same background, so I tried that. But the picture you sent is not creative enough.
I want something like that with 3 different letters, but it's not possible to get the same background, and if I put 3 images together I get something like that
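A minimal sketch of cutting one wide image into side-by-side tiles, which is the goal described above. `tile_boxes` is a hypothetical helper; the boxes use the (left, upper, right, lower) order that Pillow's `Image.crop` expects, but nothing here depends on Pillow.

```python
def tile_boxes(width: int, height: int, tiles: int) -> list[tuple[int, int, int, int]]:
    """Return crop boxes (left, upper, right, lower) for equal-width vertical strips."""
    tile_w = width // tiles
    return [(i * tile_w, 0, (i + 1) * tile_w, height) for i in range(tiles)]


# Three true 1080x1080 squares need a 3240-pixel-wide panorama.
print(tile_boxes(3240, 1080, 3))
# A 3024-pixel-wide panorama only yields 1008-pixel-wide strips.
print(tile_boxes(3024, 1080, 3))
```

Note the mismatch: 3024 divided by 3 is 1008, so the panorama would need to be 3240 wide (or the strips slightly upscaled) to end up with 1080x1080 pieces.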
This looks pretty cool to me. Is that three Dall-E images put together? And if so, what do you want changed?
Yes, it's 3 DALL-E images together, but I want all of them to look like one picture, not like 3 different ones
Can you explain and/or show, maybe circle, the parts that make you feel they're 3 pictures, not one?
Without knowing they're put together, I actually can't tell that you didn't intend for it to look like that, it looks nice.
Or, if you mean the actual joining, which I have to enlarge to see, why not cover it with some mist effect that makes it hard to tell what the actual edge is?
Eventually we'll have 'outpainting' available, I think they've said. Until then, you might have to use some other tool.
Which tools do you think i can use for that?
Where could a "dall e 4" really go
Little improvements across the board and that's it ig
hello
Would anyone be up to a challenge I need a prompt to generate an image of bob ross painting a sonogram of a baby, I attempted to do this in dalle and midjourney but either I can’t prompt correctly or it’s a very strange condition
I have photoshopped my own version as a reference
that's probably against policy
last i heard they have to be dead a century
a likeness might be within policy, i don't have total clarity on that. i understand certain medical tech such as x-rays can be a sensitive gray area. did it reject the sonogram?
depending on the specifics
No rejection but the canvas didn’t contain the sonogram
Which does beg the question about silent modification of the prompt
Let’s say the artist was Michelangelo
you can see how/if the prompt is rewritten by expanding the info panel on the resulting image
This was proposed as challenge haha, im not looking to try again. I already did the work myself
By work I mean I edited a photo myself , the attempts across image generators is negligible
i was referring to your statement about "silent modification of the prompt" -- more info would help answer that question.
I'm not sure what you prompted, but prompting it like this was not a problem:
We need 5 images created 1 at a time in the same output that showcase a painting on canvas of a sonogram of a human embryo.
Right but my example image is what I’m looking for,
Let’s be as generic as possible
A painter, painting a sonogram of a baby on canvas
I do think the issue is the named artist:
Those are great!
Here's an example of what the model prompted Dall-E with:
In a warm and inviting studio, an artist who embodies the essence of calm and creativity, akin to the energy of a renowned art show host, is halfway through the creation of a sonogram painting of a human infant on canvas. The studio is aglow with inspiration, as the artist, brush in hand, adds nuanced details to the depiction of the infant nestled within the womb. The canvas shows a harmonious blend of completed sections and areas still in outline form, illustrating the gradual process of artistic creation. This captivating scene highlights the intersection of art, science, and the miracle of life, portrayed through the thoughtful actions of the artist in their element.
I'm sure you can take it from here to adjust to what you seek.
@late blade let’s go!
Are there different restrictions on copilot? Kindaaaaa seems like it eh?
I love the trees and the sonogram hahah
For the generic case, dalle splices science into the scene
i just used the prompt @deft musk provided
That’s kind of a bummer though, like M$ just gets to violate policy but chat is restricted
Is there some underlying reason other than M$ can tank litigation ?
don't think so
Strange that brands, actors, real people, and characters are not banned in Bing unless they opt out themselves.
See, I feel like there are definitely different legal clauses covering the different front end systems
There is clearly a divide in policy, to think that’s not material and just a 🤷 seems naive
it could also just be like how different people have different answers to how to handle the grey areas of respect, privacy, and other factors.
Maybe there's no single clear, correct, fits-all-cases answer, and maybe more than one company, though each doing something different, is within a reasonable range of limits and allowance.
MS is more reasonable with it I think
This is a blurring the lines approach, but the lines have been drawn. What's hidden in your response of "allowance" is litigious leeway