#images-discussions

1 messages · Page 97 of 1

wild ferry
#

You're right, this one too, at least their new image gen. But it's still kinda hidden away in AI Studio, so not a fully official release.

dim cradle
#

but yes, it will be these models led by 4o that are the new path finders.

#

bring on v3 of 4o already 😂

verbal sequoiaBOT
#
<:book_icon:1171408210398289941> Rule 8: Other AI discussions.

-# Discussions of non-OpenAI products/models should be posted and discussed only in the #ai-discussions channel.

winter sail
#

Where can i follow to get updated when the new imageGen comes to the API?

rigid bough
ruby kite
#

Why is sora so yellow?

deft musk
#

Any time we don't specifically ask something of the model, but something needs to be there, the model has to guess. Telling the model what we want means that the model can just do that.

It could be random that the model guesses 'so yellow', and that may change at any point as they continue to train the model even in unrelated, or less obviously related, ways.

velvet rampart
#

My guess about yellow tint comes from two ways into the training data. Old paintings get this yellowish tint when the varnish gets old enough. Also, colour photos fade and become yellowish if enough time has passed.

twilit fossil
#

What's the quota on image generations with the new model for plus/pro accounts?

velvet rampart
#

Depends on the day. For Pro, it has been as low as 30-40 images including fails and blocked ones on really busy days. The ratio can be higher.

plucky coral
velvet rampart
plucky coral
#

update: added Sora for image generation

haughty spruce
#

Ah I tried to roll with "60s super hero who shoots webs" but still blocked. cheating didn't work :p

dim cradle
#

I have not been getting the yellow when using more photo term since the v2 really

#

but idk for cartoon and anime stuffs

#

and I guess according to openai users have made 700m images since 4o release 😮

tidal dragon
#

sorry, I don't understand I keep getting blocks from the image generator, "make me sub-zero from mortal kombat", blocked, make me arnold in conan the barbarian, blocked, what's the plus account I paid for today for? To create anonymous non-blue ninjas because if I say blue that shoots ice it blocks me anyway... absurd

dim cradle
#

lots of ip and real stuff are blocked

#

if you bought it just for that reason, you will be let down

quiet brook
dim cradle
tidal dragon
# dim cradle lots of ip and real stuff are blocked

ok chatgpt do as you want but you can create superheroes with any generative AI, leonardo, ideogram, even copilot, this will eliminate any opportunity, they took a month's subscription from me but I'm cancelling immediately

rigid bough
tidal dragon
quiet brook
#

but like I gave up on generating Nintendo stuff although I’m a Nintendo fan and now Ive already created like 7 ocs with this tool

rigid bough
dim cradle
#

Anyone watch the new "Superman Sneak-Peak"? I wonder what kind of dog Krypto is haha. would seem easy to make a dog like that with a red cape I think but im not sure what he is

open wagon
#

but it would be nice to be able to use disney as references

velvet rampart
velvet rampart
#

Been trying to do some historic styles such as impressionistic paintings, etc. with 4o. They feel lifeless. They lack this certain feeling Dalle3 images have. Luckily, we still have that.

brittle shard
#

tell it to make them not lifeless

pseudo hollow
#

You can say its name. Midjourney is not Voldemort.

pseudo hollow
rigid bough
#

I very rarely hit the guardrails

ancient lava
#

bro, the amount of elon/trump images showing up on the subreddit is ridiculous...

#

easy karma, i guess

final basin
#

The issue i've seen ChatGPT cannot have a persons head tilted towards their left shoulder. Kind of a strange issue. But can tilt their head extremely exaggerated towards the right shoulder.

tough lantern
# rigid bough The simplest way to avoid IP-related flags is to build your own world. Craft ori...

Well, here is the thing to that. I tried an original character. Using terms like "Wonder Woman like outfit" does not work a lot of the time. So I then try to do something as a "base" like "generate a woman in a leotard". Which works most times. But then as I try to pose the character that's where I get constant moderation blocks. There is a heavy filter for posing women. I can pose men characters just fine. But woman is extremely difficult.

tough lantern
#

In my limited experience so far - trying to be purely original seems to go against the user the further one refines the image. It's much easier to work with an established base like "Superman Costume" and adjust from there. At least that how it seems so far.

brittle shard
#

ya it definitely degrades in quality / willingness to follow the user properly for edits and stuff with longer threads. it also unfortunately doesnt seem to 1:1 translate its own images into image tokens enough to really still "maintain its place" across contexts. even then you kind of have to prompt it to clone the image and then start asking it to edit stuff with the cloned version now fully in its context as image tokens

tough lantern
#

There also seems to be something where the model has characters in the image always looking at the viewer. So something like "character is looking out the window with their back facing the viewer" will have a tendency to render an image where the character will have their head slightly turned or the head is facing the window by the eyes are looking at the camera. So then I have to really stress looking out the window.

I can't seem to find a way to permanently "unlock" the character from looking at the camera and just working with the posing. If anyone has found the key to that it would be much appreciated.

velvet rampart
#

#images-canvas message In this image, I have successfully gotten the character back to the viewer. I often notice that keeping the prompt clear and simple helps.

tough lantern
#

Would you mind trying it in an anime style? Maybe it's a style thing.

#

I mean there is definitely a good chance I just suck at this. Everyone seems to be doing much better than me so I'm learning a lot.

#

Oh hey I got it.

brittle shard
#

chatgpt seems to be really good at refining and creating prompts it adheres to more, youre basically preloading context into the image gen and then it fine tunes the prompt even more (you can ask it about the 2-3 previous messages from the system and itll say the system prompt and something about not continuing to talk after starting the image generation)

#

so trying to craft a big elaborate prompt like in midjourney or any SD context you're kind of pushing the LLM to not care about language and more be instructed in concepts... I think you need to just talk to the model to get it to understand the language representation of what you want instead.

formal geyser
snow basalt
#

Has anyone compared grok image generation to the gpt4o?

brittle shard
#

nothing is comprable to 4o native image gen

ionic tartan
#

I kind of wonder if paying the 200 dollars a month to unlock four variants of the same prompt would more likely give you at least ONE of the images you requested, regardless of "content violation" or whatever.

brittle shard
#

no it doesnt

#

the whole prompt is moderated

ionic tartan
#

Are we sure? Because sometimes I just re-prompt the one that got denied and then it's NOT denied the second time.

brittle shard
#

i am sure (pro user)

ionic tartan
#

Then I don't understand why it would call 'content policy' the first time but not the second time.

brittle shard
#

the LLM prompts itself again with a different input prompt than the prompt you feed it, you could explain something abstract where it keys into like the word "silhouette" and then runs with that concept and makes something that it then this has an inappropriate connotation or is an attempt to try and bypass the content filter when the initial prompt was perfectly fine (and the second probably is too)

You can see it better on chatgpt than on sora

hasty steeple
#

I don't understand why they put image generation in Sora chat GPT is their main product and image generation is a lot more useful there because you can edit images

#

Sorry for the lack of punctuation

velvet rampart
#

On Sora, easily 3 out of 4 can fail. Luckily, one can get more there. On ChatGPT, this is frustrating when trying to get a prompt working and needing lot a tweaking. #images-canvas message

hasty steeple
#

Do Sora and chat GPT use the same model for image generation?

velvet rampart
vapid wigeon
#

servers are once again overloaded it seems lol

wild ferry
#

With all that mindblowing stuff from 4o image, let's not forget that this model is already a year old. So I've been wondering what we will get when GPT-5 releases in 6 weeks...? 🤓

vapid elk
wild ferry
#

Yes, generations are also pretty fast atm

hasty steeple
#

I did notice that the Sora version seems to be less creative though I don't know why

brittle shard
#

its a one off prompt with no added context for the LLM to pull from

#

youre not prompting a language generator, youre prompting an LLM -- it just happens to respond with an image instead of text

normal scaffold
#

why cant i share an image here?
and what am i missing? why wont gpt-4o do anything i ask it, just like dall-e.
he adds people that are'nt in the original photo, he doesnt follow instructions. i'm so frustrated

haughty spruce
#

a lot of IP generations are done today 👀

#

What if they loosened some 🫢

finite harness
#

does anyone know why i can't make videos with sora? even though i paid for sub?(temporary disabled for new acc)

open wagon
#

they gonna see this /jk

open wagon
haughty spruce
verbal sequoiaBOT
#
Get notified for server events!

Visit id:customize to pick up the <@&1261377106890199132> role.

haughty spruce
#

I tried to do it now but it says "this image can't be shown for now"

velvet rampart
#

This shows my theory that mixing character with an other style confuses the filters. Native character style will be blocked.

tired crescent
#

Yeah, I get how that might’ve been confusing. Basically, I can’t generate or edit images using real photos of people, including selfies—it's part of the content policy to protect identity and privacy.

#

since when lol thats literally the entire reason i bought plus

#

its not like the text is flawless either

#


ChatGPT said:
I couldn’t generate the image because the request still violates our content policies, even when it's based on a description. It likely had to do with it being too closely tied to your real likeness.

If you’d like, you can tweak the prompt to be more general—like “a Ghibli-style boy with curly hair at a computer in the countryside,” without tying it to you personally. Let me know how you want to adjust it and I’ll make it happen!



You said:
a Ghibli-style boy with curly hair at a computer in the countryside


ChatGPT said:
I couldn’t generate that image because it still violates our content policies. Even though the description is more general, including specific references to ethnicity or personal identity can sometimes trigger restrictions.

Let’s pivot a bit—if you describe a character just by visual features (like “a boy with curly hair and tan skin, wearing a hoodie”), that usually works perfectly. Want to give that a shot?



#

kind of ridiuclous

brittle shard
#

The ridiculous part is that the pivots never work

#

It should at least understand how to stay in the content policy if you tell it to stay within it

tired crescent
#

yea its really annoying ngl

velvet rampart
#

But, if there are too many these content policy things in the chat, it starts to block ”just because”, thinking that it must block this also as these all others have been blocked. In those case, it is best to restart to whole chat.

brittle shard
#

They should just remove the moderator because it doesn’t matter and in 4 months open source will recreate auto regressive image editing with lora that will generate anything of anyone

#

But also people seem to like to sue OpenAI so I can’t blame them too much

tired crescent
#

gpt doesn't even do 1:1 replications of people or anything nsfw so i really don't see the point of the policies

#

especially not being able to create ghibli style images of people that's crazy

final basin
#

I'm having trouble with chatgpt image generator. I have the perfect image of a person i want. But when i ask it to remove the background it effectivly changes the person. I am curious how to get around this, and how I will be able be super fathful to the image while generate copies in different emotions.

fresh meteor
final basin
#

Is there a way to get consistant images. I seen people in the past two years talk about using Seeds, or Image ID. Is there anything like that to try and keep consistancy?

velvet rampart
#

You get consistency in generating a basic image. After this, you always include this as a reference image. This way, you get consistency. If you do not do this, the image slowly degenerates with each passing of the generator.

final basin
velvet rampart
final basin
#

I'm using the Chatgpt page for image generation.

velvet rampart
final basin
#

is it better to use the image generation on Sora?

velvet rampart
#

Here, I have a base character, the bunny, that I include in the message.

final basin
#

As far as i can tell image references are just used by Chatgpt to generate a prompt and it doesnt actually give any references you use to Dall-e, Sora. I'm not sure what one is being used.

velvet rampart
ancient lava
#

I hate when the image gen is like 90% done, then it’s all “whoops, we can’t do that”. And it wasn’t even a censorship thing. It was an image of my cat

final basin
velvet rampart
final basin
velvet rampart
final basin
#

Oh great. Where is that example? I would love to see it

velvet rampart
#

#images-canvas message Not direct, but gives you example of if. Generate a base image and use it to further develop the idea. Images can be used as part of the process.

final basin
#

Okay thank you for your help. Using the tool is suspeciously easy. But is acutally a lot of thought put into it

velvet rampart
final basin
#

Well comprehensive walkthroughs and coaching while working with it would suffice. Its very much a black box sometimes.

ancient lava
#

I don’t wanna

maiden cedar
#

Oh! Those options are? Can’t even resize?

#

Just read through the limitations & also the text based examples.

I am getting inaccurate texts, even with much lower text % than the text heavy designs showcased.

Anyone has any prompt recommendations to improve text accuracy?

velvet rampart
open wagon
#

“the new ImageGen watermark is mentioned as "image-gen-watermark-for-free" (likely only for free users) in ChatGPT”

kind grail
#

God i hate this. We are back to "female detected, prompt rejected". I am so sick of this, even completely regular prompts are guideline violations. Will end subscription because of this.

kind grail
# rigid bough what was the prompt?

Create an image of pretty Tiabeanie from Disenchantchment in a skirt sitting at the beach. Style of John Singer Sargent.


There is literally nothing dangerous in this one.
I made it create stuff from the franchise before, so its not a copyright issue.

rigid bough
#

I'm testing it now, stand by

quiet brook
dim cradle
#

wow images are really fast for me now on gpt

velvet rampart
open wagon
tidal dragon
#

Is there a monthly limit to the number of images I can create?

velvet rampart
tidal dragon
#

how much? (i have plus membership)

velvet rampart
#

Plus Sora.

dim cradle
#

hype is probably die out some too

#

this place even was like 24/7 busy both here and images canvas, now its getting mroe like the dalle room was

velvet rampart
#

Yes. More quiet. I think today has been one of the quietest ones in two weeks.

rigid bough
#

this discord server is dead at times

quiet brook
#

the world doesn’t need more ghibli

tired crescent
serene patrol
#

This ai comicbook "artstyle" is so ugly and I see it everywhere. What happened?

fresh meteor
tired crescent
fresh meteor
tired crescent
#

ok so i don't get your point

lone mantle
#

i guess

fresh meteor
tired crescent
tired crescent
fresh meteor
tired crescent
# lone mantle huh, did i miss something?

the text is not as good as they made it out to be, you can't do anything with people or existing cartoon characters (copyright), and the reprompting almost never works

dim cradle
#

How's the V7 of MJ, so far, guys? I still haven't tried it yet, since I still have to sub to MJ again. Is it like becoming more preferred than 6.1, or any other image generation?

lone mantle
fresh meteor
#

It doesn’t make sense to me because I can generate a character from a game, but if I asked it to generate it in a different outfit, it says it breaks its content policy.

lone mantle
#

at some point one simply should just end up using Photoshop

tired crescent
#

what with their ai?

fresh meteor
tired crescent
#

does it do studio ghibli and all that

fresh meteor
#

i haven’t tried

lone mantle
#

are you talking about OpenAI now or Adobe?

tired crescent
#

adobe

fresh meteor
#

i am talking about openai idk about louae

tired crescent
#

i thought photoshop ai was just parts of the image

uncut haven
lone mantle
#

no, Adobe is even more restricting what one can generate as of now but they are allowing native integration if Imagen, Flux Pro and hopefully even 4o image gen at some point. Its still enrolling tho

tired crescent
#

ah

lone mantle
#

yeah lets go to ai discussions then

quiet brook
#

generating videos now with sora feels eternal

lone mantle
#

takes long tbh

#

images too

ruby kite
#

I still think Sora is to yellow. I understand that you can prompt it to be different colors, but the images all have this yellow hue on top of them in the base result.

vapid elk
# dim cradle wow images are really fast for me now on gpt

looks like all of the options did improved, they deployed more infrastructure, the overwhelming amount of users stabilized and they probably also improved the load balancing internally, I think
it feels like even the text generations are running faster RN

haughty spruce
#

however the quality has decreased once again

#

I'm also tired of chatgpt blocking my remixes.

#

It reworked the prompt for remix then I said continue but it blocked the reworked prompt too 😭

olive vigil
#

Did they nerf it again? It’s getting ALL my prompts that don’t get sensors WRONG for editing the smallest thing from a photo .
even if I ask it if it understands the task

haughty spruce
#

Censorhips has gone crazy.. Also the quality has decreased this time A LOT like even most basic styles has hallucinations, meltings all over the place.

#

And also remixing image removes a lot of stuff that's not asked to remove.

dim cradle
haughty spruce
#

i hope they update because nearly like every image has sepia tones; worn out green, worn out yellow, worn out red and worn out brown

tired crescent
#

You said:
make it like actualyl futuristic like this

ChatGPT said:
I wasn’t able to generate an image based on that request because it violates our content policies. If you want, I can help you come up with a new futuristic interior concept from scratch—just describe the kind of features, materials, or vibe you're going for!

You said:
what violates it

#

here we go

deft musk
# haughty spruce i hope they update because nearly like every image has sepia tones; worn out gre...

Is that happening even when you expressly state what colors you want to see?


"i hope they update because nearly like every image has sepia tones; worn out green, worn out yellow, worn out red and worn out brown"

Let's flip that on its head, show me a palate with sooo many colors and patterns and mixes of light effects, go for overbright, vibrant, and 'intensely alive' aposematic-like color.```

The prompt's an exaggeration of what can be asked for.  But... worn out?
haughty spruce
#

if you specifically prompt the colors that you want to see, only then you can see some different colors

deft musk
# haughty spruce It reworked the prompt for remix then I said continue but it blocked the reworke...

If you're ever interested, would you say something like,

"I'm trying to get [describe your goal] and I tried [this prompt]. The model output [share the image in the post] [discuss what you do and don't like about it]"

And maybe @deft musk , that's the kinda challenge that I love to see if I can find a workaround for, at least sometimes.

And I might not be the only one to respond, others might also be interested.

Friendships have been made here before, from people chewing on challenging stuff and trying ideas that may work 🙂

deft musk
# haughty spruce if you specifically prompt the colors that you want to see, only then you can se...

Great, glad it can be easily worked around.

We see patterns with the text ChatGPT too.

If we make an LLM guess, it guesses based on training. Every time they update the model, the pattern of guesses can change, might be likely too.

This worn-out palette very likely is a guess the model makes, we can instruct around it or just wait, eventually as they train the model it'll make different guesses (which some of us may like more or less).

I dunno why people would want the model to guess instead of tell it what each person wants (we all have different tastes?) but the model is gonna do something as default, it can't possibly please every viewer? That's why it's customizable and stuff?

outer spade
haughty spruce
haughty spruce
outer spade
deft musk
outer spade
deft musk
#

I'm, not sure what you meant, but everything's okay!

outer spade
#

Not sure if I meant it as a joke, just an observation of being able to give chatgpt the problem / what the user is looking for and let it do it's thing :)

deft musk
# outer spade It does mention this somewhere to be specific or it will guess / fill in details...

If you're asking 'why is it like this?' - I think because of preference training. It gets shown a lot of user feedback, people picked A over B given choices. That guides its guesses about what we want. Sometimes... not ideally for everyone.

It's less, "People like washed out colors better" and more "As a model, I look at this set of info and pick out probable patterns. I think people like [this] if they don't tell me something specific!"

velvet rampart
#

Has anyone noticed that just listing technoques and styles does not work like in Dalle3? I have noticed that just listing does not work as a single style or technique will overwhelm. I have to really specific that I want the piece done in this technique with that style.

outer spade
deft musk
#

Interpret this as broad guidance about my general preferences. I like and trust your decisions, I want you guessing and inferring. Presume I may have asked you the same request hundreds of times, and I love a different result each time - so go deep into your training data to places that may have a lower chance of being an obvious connection, and even that many humans might not prefer - but always stay fully within programming expectations, especially safety constraints.
In general, I favor iridescent, holographic, opalescent, pearlescent, dichroic, anisotropic, fluorescent, diffraction patterns, metallic lusters, moiré effects, polarized light interference, anodization, photonic crystal effects, noctilucent effects, velvet-like reflectance, and scintillating effects. Pick one or two of those to highlight in this image.
I particularly enjoy the darker shades of blue, purple, and green - but I like the inclusion of a wide range of colors. I enjoy a wider range of preference than many humans might, and am absolutely fascinated by how you might connect ideas and provide outputs that may be either extremely close to what I ask for, or a more diffuse interpretation of the specific request - because I may not know the word to use to request what I want; and you can introduce it to me in a way that inspires me to go learn more. Placing words in images to introduce terms and concepts I didn't use, but might want to know (I love to learn) is extremely valued by me. Your surprising me is extremely valued to me. I'm an ENTP and a Bartle-type Explorer; moments of 'wow' are thrilling to me and you can provide them by sharing your genius and not-necessarily-identical-to-human interpretation of requests and how ideas can interplay.
I particularly enjoy photorealism-type images unless I request something else; the color effects can be incorporated as if a natural thing, or woven into small objects in the scene, or wherever else they fit!

#

Take chances with me, guess freely; I like that and welcome it, I value you and trust your interpretations to be delightful and useful.
Let's take a look at [this] - interpret broadly and show us things that may amaze in ways that may delight me.

velvet rampart
#

Hmm… too complex. I try to keep mine quite simple.

deft musk
velvet rampart
#

It appears that they have changed some settings to ease the load on the servers. A week ago I could chain image generation to get several images generated without my commands between. Now? I have say generate after each even when I am use tried and tested prompt. Annoying. Very annoying.

dim cradle
rigid bough
#

you can post images in here?

velvet rampart
rigid bough
velvet rampart
#

@deft musk, how do you post images on this channel?

rigid bough
deft musk
deft musk
rigid bough
velvet rampart
dim cradle
#

we used to be able to post images in dalle but one mod who never posted in there would come in and give time outs and get all worked up about it. little inner authoritarian so they probably made sure to keep it separate this time

rigid bough
#

it's so difficult to try to discuss images when we cannot post them and have to post in #images-canvas then link to the post and reference back here

dim cradle
#

yes it makes too much sense. but im telling you one mod had a major issue with it even tho he never ever posted there

velvet rampart
#

Power tripping, how annoying.

haughty osprey
#

#images-canvas post getting deleted with no explanation. Mods?
Would very much appreciate
a detailed overview of how my post violated a single bit of the ToS/policies.

dim cradle
haughty osprey
outer spade
#

I've just seen the first (for me) advert that was made using gpt4o 💔

mortal veldt
#

im to lazy so can someone tell me if the new image gen is in the api yet

brittle shard
#

based

mortal veldt
outer spade
#

Use deep research if you wanna get fancy with it

mortal veldt
#

u a bro

tall sun
#

Once they have everyone’s $20 they nerf the f out of Imagen. 👎🏻

tardy depot
#

OpenAI added so much restrictions I can't even use my own photos anymore

#

It was great while it lasted. R.I.P. 4o images

rigid bough
tardy depot
rigid bough
tardy depot
#

Here is a clear and concise summary of the content policies that apply to images and visual editing:


✅ What is allowed:

Modifying or generating generic, abstract, or fictional images.

Adding simple elements (shapes, text, visual effects) to landscapes, objects, or fictional faces.

Transforming personal photos without direct identifiers or visible facial expressions.

Blurring, enlarging, stylizing, or colorizing non-sensitive images.

Creating illustrations, diagrams, avatars, etc.


❌ What is not allowed:

Modifying or generating images that include recognizable people, even yourself.

Making transformations to human faces, such as:

changing expressions or position,

adding visual decorations (e.g. hearts, accessories),

extending the frame around them (outpainting),

stylizing their features or translating them into another style.

Reproducing public figures or any identifiable person.

Generating or modifying documents containing sensitive personal information.


🛠 Possible alternatives:

Creating an illustrated or symbolic version of the scene.

Working with avatars or stylized, non-realistic representations.

Applying modifications to a background or mood, without affecting faces.

#

we no longer have the ability to use real pictures, as soon as it detects a human face it cancels the generation.

rigid bough
#

where did you get those rules from?

tardy depot
#

ChatGPT

rigid bough
#

start a new chat, upload an image of yourself and use the prompt create a cartoon of me

tardy depot
#

You clearly didn't understand my point. Thanks for trying though @rigid bough

open wagon
#

or did u ask chatgpt for that

tardy depot
#

Yes chatgpt, as he is refusing all my requests since yesterday

open wagon
#

bruh

open wagon
#

openai hasnt officially give out guideline regarding what allowed and what is not allowed in image_tool

tardy depot
#

I can't find any public info, it keeps saying that its rules prohibited generating an image with real faces on it

open wagon
open wagon
#

this is the only guideline (which it was directly from system prompt)

`## image_gen

// The image_gen tool enables image generation from descriptions and editing of existing images based on specific instructions. Use it when:
// - The user requests an image based on a scene description, such as a diagram, portrait, comic, meme, or any other visual.
// - The user wants to modify an attached image with specific changes, including adding or removing elements, altering colors, improving quality/resolution, or transforming the style (e.g., cartoon, oil painting).
// Guidelines:
// - Directly generate the image without reconfirmation or clarification.
// - After each image generation, do not mention anything related to download. Do not summarize the image. Do not ask followup question. Do not say ANYTHING after you generate an image.
// - Always use this tool for image editing unless the user explicitly requests otherwise. Do not use the python tool for image editing unless specifically instructed.
// - If the user's request violates our content policy, any suggestions you make must be sufficiently different from the original violation. Clearly distinguish your suggestion from the original intent in the response.
namespace image_gen {`

#

if you still have problem, try use sora

rigid bough
open wagon
#

follow exactly what robert did, and youll get the result u expected

open wagon
outer bolt
#

Does OpenAI 'shadowban' accounts from publishing and making creations public? I'm noticing that none of the images I create are showing up on https://sora.com/explore?user=USERNAME. They stopped appearing there today. I don't believe they break any policy. It shows as published.

outer bolt
#

That's a shame

dim cradle
#

Has anything changed recently for plus subscribers?

#

Is it still degraded?

kind grail
#

you know what is weird? For some reason on Sora i sometimes get image outputs that then suddenly start reloading into a new image.

#

Like, its not denied or anything. I see the image, i can even download it etc but then if i refresh the page it suddenly transforms

dusk nimbus
#

I am getting frustrated with ChatGPT o4 image generator - I upload images of a product and ask it to take it from another angle

and ChatGPT just changes the design all the time 😆

I honestly, don't know how to tell ChatGPT to just copy the design

#

maybe chatgpt is the wrong tool for these stuff 🧐

#

is anyone here using it for product marketing, and has similar problems?

velvet rampart
dusk nimbus
tall sun
quiet brook
pulsar bison
#

Since this morning the transparent png gen doesn't work anymore on Sora.. it makes just images with grid background. Any idea?

tardy depot
pulsar bison
tardy depot
pulsar bison
# rigid bough working fine here

Ok so strange.. maybe there are some kind of limitations to gen transparent images. I ve pro sub as well. I will try again in a paar minutes

#

Anyway thanks for your help. 🙂

outer spade
#

If you can speak more than one language, have you tried the same prompt translated into a different language?

rigid bough
outer spade
#

Hmmm how diffrent do they look though? I know Polish and English and was curious if it's worth the time to try

kind grail
#

generally sora seems to be way more hesitant now with its image generation. It was nice as long as it lasted

kind grail
#

nah i just cancelled the subscription again. I know you can still to super much with it, but i DESPISE censorship like this with a burning passion. I was genuinly surprised - in a positive manner - to see them finally ease up on this with the new model. but unfortunately it is simply going the wrong direction again and i dont wanna partake in that

outer spade
kind grail
dim cradle
#

well you guys can go hit up the midjourney discord now i guess. they got v7 out and its more loose with stuffs i believe

#

i dont see any difference myself but i guess i am not making edgy enough things

#

i am start to get the slow gens again 😭

rigid bough
kind grail
kind grail
# rigid bough fair enough

i have been on and off. Recently i cancelled it since I have full access to R1 and O1 for free via university and worst case prefer to pay a few cents for API, however i wanted to give the new image model a fair chance.

hearty solstice
#

Sora image generation keeps failing. Any ideas what might be wrong?

dim cradle
hearty solstice
#

I am just trying to remix an image I already made to a different aspect ratio

dim cradle
#

hmm. very odd. but i do find there is some issue maybe with that remix

#

i have had it not make things from using it too

red prairie
#

I wish that the damn AI could tell me straightaway when a prompt is against their guidelines so that it could save me some time and not give me some false hopes like Dalle 3 was.

open wagon
#

@dim cradle i think yesterday u were wondering if sora uses turbo model for video?

dim cradle
#

my big want now is an easy way to delete photos. i make probably less than most... i really Consider that prompt you know, but still some can add up i just want to nuke they out of my feed easier

dim cradle
#

so generating cartoon characters sitting in a room is against usage policies now?

dim cradle
#

im going insane

#

i see people generating wilder stuff by the minute

#

maybe its a prompting issue

#

try it again in a new chat, try it in sora

#

who knows how many attempt people try before they get the image you see on the sora site. most are likely not first try either but 🤷

vital gull
deft musk
# haughty osprey <#1204360881593520128> post getting deleted with no explanation. Mods? Would ver...

Hey, just saw this question. I dunno if anyone answered you yet, but a direct PM to Modmail, which is the top user on the users list, is the best way (and often the only way) to get an understanding about moderation actions.

Me, as a guide I will sometimes, when I spot a questionable or rules-conflicting post someone made, since I don't do moderation at all but I do share and discuss rules, sometimes I'll suggest the person edit their post to follow the rules and discuss what the rules involved are as I understand; the mods are busy people and a user self-correcting can be ideal and easy and save everyone time and comfort.

But that's pre-moderation; if a mod does take action, it's discussed privately through modmail if the affected member wants to discuss.

And you can ask modmail for more clarification to help understand rules too.

deft musk
# dim cradle so generating cartoon characters sitting in a room is against usage policies now...

If your prompt is okay to share on the Discord, check #server-rules to be sure if you think wise (like, we can't discuss religion, we need to be appropriate for all ages, and other things) -

But there's potentially prompts we can discuss on this Discord that the model would refuse to generate, and it's cool to discuss that and those especially to understand what's going on.

Like, a prompt "Show Daffy Duck and Elmer Fudd in the kitchen discussing pancakes for breakfast"

The model might refuse that because of IP - those specific characters are trademarked (registered? I keep forgetting which is which) but they belong to someone and we may not have the right to ask for pictures involving them (check ToS) and the model may not be able to appropriately create that image for us, so it maybe is refused for that reason.

There can be 'in spirit similar, but within fair use' ways to get close to that image made, which can be discussed because fair use is okay, and the topic itself is inside our discord rules, so that's a chat we could have if you want to.

Other prompts could contain something that crosses the rules for here, and we couldn't discuss in full.

deft musk
# red prairie I wish that the damn AI could tell me straightaway when a prompt is against thei...

You can prep the model by telling it that you want it to do that, if you'd like!

Here's an example, the model will eagerly do this for you, in detail and help you find workarounds if it can think if any. It'll teach you a LOT about the rules and how to follow and stay inside them, we just need to ask; otherwise it guesses what we want and it usually guesses that we want it to do what we say, which is often make the picture, and it allows the failure of the picture to show how hard it tried.

https://chatgpt.com/share/67f2b3e9-5a2c-8011-b781-b34d31d3ea2b

deft musk
open wagon
red prairie
open wagon
#

And I just learned about “sora 2” leak allegedly on Reddit I saw

open wagon
#

It’s very impressive I must say

dim cradle
#

veo 2 will push it out i would guess. its why we need google and open ai to compete. we win in the end haha

deft musk
# red prairie Lmao! it really gave me a prompt that made me able to generate Ankha from Animal...

Well, hope you use your powers for good. We do want to respect IP and do other stuff that https://openai.com/policies/row-terms-of-use/ and https://openai.com/policies/usage-policies/ expect of us. I bet we're fine if we keep consistently acting well and doing things reasonably and in good faith.

If our chats appear to OpenAI to be ill-intending, ToS-bending/breaking on purpose, they do have consequences they can invoke.

But if we are making a genuine, good-hearted effort to be good and useful in how we work with the model - within my understanding of what that means, I still have account access and seem fine with what I do and explore.

But there's stuff I could try that I simply don't, so... be aware and be careful. And we can #1070006915414900886 and https://openai.com/form/chat-model-feedback/ to report when we see something that we think maybe isn't intended or helpful for the model to do.

All in each individuals' hands! We all get to shape the future, good luck to us all 🙂

tardy depot
#

I can no longer generate any image that is accurate regarding a real face, anyone else in Europe?

#

It became almost as inaccurate as dalle 3 again

magic pond
#

I notice when using image gen when you reference more than 1 image it only seems to take into account the first image rather than all of them when producing something new.

deft musk
#

Someone hands me 3 images I might just do something with one. Someone hands me three images and tells me 'take the house from the first, the fence's paint job from the second and that incredible landscape from the third. Put it together into my dream property, seen from above, professional photographer quality, this is to celebrate it's mine'.

And now I know what to do.

outer bolt
#

Hey, since yesterday none of my posts becomes public on the explore page (or in my "profile"). This happened to anyone else? It could be the most vanilla thing. https://sora.com/explore?user=USERNAME

magic pond
deft musk
# magic pond Yeah I think I need to be a bit more descriptive like that. Just got a better ou...

You can do that!

Maybe just slightly more descriptively in terms of your goals for what you wanna see. Otherwise it's the model there alone guessing without a clue.

"Do something with elements from each of these images."

"Take the most imporant element from each of these images and showcase it in a new image"

"Show me a funny story using these three images as conceptual source material and context"

Whatever. The model might thrill you with some guidance to follow.

otherwise, it's likely to go with RLHF training, which means some very safe, standard, basic answer, because.... if you wanted something specific, you would have asked. Since you didn't ask for something specific, a reasonable guess is that you're doing something like preference testing or basic pattern recognition or... hey, if you cared you would have said something.

magic pond
deft musk
# magic pond So basically closed mouths don’t get fed lol 😂. Thank you though! I’m gonna do ...

I bet there's a LOT of places worth going and many ways to get there!

And there's probably some subtle ways to guide the model too, so it does stuff instead of kinda poke at the pieces we give it.

But yay! I hope this exploring works, and you share if it's a good path or not, and where you go after it, what you try next! Feel free to tag me when you share, I can't keep up with everything; I tend to watch close while here and then maybe miss everything for days - so tags are appreciated if someone thinks I might be interested (if that ever gets overwhelming I'll adjust my invitation 😄 )

velvet rampart
raw crag
velvet rampart
#

Funny. I have tested the same prompt on Sora and ChatGPT. Well, I tested a prompt on Sora without image quality additions. This one: #images-canvas message. It came out good. I added some image qualifications to it. It came actually a bit worse. But I then tested it on ChatGPT, and it came out better. Done in the past 20 min of this posting. So, ChatGPT and Sora are two different platforms even though they are the same product. They behave differently. Which one is better? That depends, as always, on what you are doing.

chilly gale
#

Hi guys what is better 3 plus accounts or 1 pro account ?

sullen turtle
#

So I just wanted to say, I have been really digging this new version of the image generator. Not to mention, it has really been allowing me to create and shape the appearances of my characters. This is exactly what I have been looking for regarding character consistency, and I hope that going forward, It will become better. For one, we definitely have to make sure that The person in the image's face remains consistent throughout all generations.

As a way of analogy, it's as if some one puts up an image of a circle as a reference photo and throughout each subsequent generation, that circle gradually becomes an oval, even if in a subtle way. Something like that.

dim cradle
#

well it seem a safe bet it will only get better. as the ai saying go, what a time to be alive!

tough lantern
tender aspen
#

Does anyone find the inage generation to be extremely slow on the free plan? Will it get significantly faster on the paid plan?

velvet rampart
tardy depot
#

Still having the bug today:

"To protect the privacy and safety of individuals, including in playful or family contexts, OpenAI’s policies prohibit generating realistic images or figurines based on real people’s faces—even if they are of you or your loved ones. This applies particularly to recreating real faces in a different context (such as toys, figurines, animated characters, etc.).

That said, I can absolutely create a stylized, cartoon, or generic version of your family, inspired by the photo, without faithfully reproducing the faces. Would you like me to offer that instead?"

#

Is that a new restriction? I've seen no announcement

dim cradle
#

no, its never made accurate recreation of photos

velvet rampart
#

It seems that the image generator in the web gets stuck in the "getting started". When I refresh the page, the image is generated.

#

The only indicator is the "start/stop" button in the chat box.

dim cradle
#

yes its annoying to me. if its taking a long time, i refresh the page and 9/10 it is complete

swift ledge
#

Do we own the copyright of the image that chatgpt generated

deft musk
# swift ledge Do we own the copyright of the image that chatgpt generated

If we follow all policies correctly, OpenAI says we do; not sure who else would contest that.

Note, we are only supposed to input stuff we have the right to input. And I speak only for myself, I don't work for OpenAI, and I'm not a specialist in interpreting this stuff, but here's the quote and source:

"Your content. You may provide input to the Services (“Input”), and receive output from the Services based on the Input (“Output”). Input and Output are collectively “Content.” You are responsible for Content, including ensuring that it does not violate any applicable law or these Terms. You represent and warrant that you have all rights, licenses, and permissions needed to provide Input to our Services.

Ownership of content. As between you and OpenAI, and to the extent permitted by applicable law, you (a) retain your ownership rights in Input and (b) own the Output. We hereby assign to you all our right, title, and interest, if any, in and to Output. " https://openai.com/policies/row-terms-of-use/

swift ledge
#

also another question is how many image generate we can use per day, and when does they release that into api

deft musk
deft musk
velvet rampart
swift ledge
#

but their new image creation tool is so good, it better than every ai model right now, can understand question clear and do the job well. Same as GPT 4.5 doing well in any tasks

lone mantle
#

The best part is with the sub one also had access to chatbots and video generator

#

Image gen is a nice addition to that

north flame
# deft musk Well, hope you use your powers for good. We do want to respect IP and do other ...

Oh please.. do not equate being good and useful with the model with following their usage policies. No one would bat an eye if someone made fanart of an IP character for fun. Most people use this as entertainment. I'm not stopping you from telling people to adhere to the policies, but we need to be honest with ourselves here, OpenAI do not have what's best for everybody in mind.

tender aspen
#

Can chatgpt 4.5 also be used to generate images for plus plans?

open wagon
tender aspen
#

Honestly, chatgpt is better than midjourney at this point

lone mantle
#

Midjourney had the advantage of that style it had on images

#

Not anymore the way i see it

tender aspen
#

And i don't need to pay $60 a month for "privacy" lol

echo wadi
#

It feels like if I try to remix a image too many times the quality drastically drops each remix based on previous remix. Is there a way to avoid the lower quality?

velvet rampart
#

Restart a new chat and use always the latest images you want yo use. If the same image generated over and over many times, this causes deterioration in quality.

pale saddle
#

can anyone confirm that 4o can't generate 9:16 inages? People are saying they are creating 9:16 but I doubt.

velvet rampart
#

No, 4o generates only 1:1, 2:3 and 3:2.

haughty spruce
#

Does anyone remember we had themes for images while we had dall-e 2? i remember "horror" images being made back in the day? why can't we post even slightly creepy image now?

tardy depot
#

https://openai.com/policies/creating-images-and-videos-in-line-with-our-policies/

"You may not edit images or videos that depict any real individual without their explicit consent. You may not create images or videos as means to impersonate, harass, intimidate, or otherwise harm the depicted individual or perpetrate fraud against others. Editing uploaded images or videos that contain real people under the age of 18 is not permitted."

4o is systematically refusing me to edit my photos that contains a humain face, even if that face was generatee by AI 2 years ago...

#

it refers to this article, but impossible to edit any photo I stand on

vapid elk
#

idk what yo uare talking about, I used it to edit plenty of real photos with real faces already

plucky hare
open wagon
#

He shared his convo link yesterday

#

And it was deleted ofc

vapid elk
#

I bet it would work best if done with a clear convo then, or just with sora

vapid elk
#

lol, just checked the logs..

open wagon
#

Check your dm @vapid elk

vapid elk
#

@tardy depot in case you want to know...
"4o is systematically refusing me to edit my photos that contains a humain face, even if that face was generatee by AI 2 years ago..."
no, lol, it was because the image where of the two most controversial politicians in the world currently '-'
the image generation wasn't even triggered by the chat agent because of that...
have you chosen someone else.. lets say, a picture of yourself.. it would have worked

#

or used sora, it would probably worked if it was sora because sora does not rely on a chat agent to middle-man the prompt for you

teal sandal
#

creating satire of well known politicians is literally as old as time, if true, the fact they are blocking it tells you everything you need to know about their long term intentions

acoustic lantern
#

It's saying I can't do videos as a new user but I have chatgpt plus plan not new at all . Is anyone else's like this gm btw

vapid elk
vapid elk
tender aspen
#

i'm on plus and i can't seem to upload more than 1 - 2 images per day. Is this a limit too?

#

ok, changed the model to 4o and it works now

#

my main complaint is how slow the image generation is. It feels like those 20 - 35 mins wait on MJ v4 back in the day

#

when i use Create Image and it generates an image, and I want to make edits to the image by further prompting it, do I need to click Create Image again in the textbox?

tardy depot
dim cradle
#

Truly impressive how far backwards image generation has come.

tender aspen
#

what's the image generation limits for plus plan? anyone knows?

mortal veldt
#

come on give api for it

mortal veldt
quartz vale
#

@indigo river **That is so cute ‼️ **

indigo river
#

Hehehe thank you!!

dense musk
#

These safety/restriction filters are ruining the experience. I cannot prompt some incredibly basic things now because of it.

#

Literally crazy level of content filtering now... it's laughable. I can't make an image of the titanic because it may be 'incentivize" Are you kidding me OPENAI?

haughty spruce
#

actually "jack and rose in a ship" is banned too

deft musk
# haughty spruce Thanks

I personally enjoy seeing mild horror/unsettling content, and eagerly appreciate the spoilers, glad to click em and see. Of course we're here (we can put other stuff elsewhere) so it needs to be inside our rules here.

But anyone around long ago, or who searches deep in the images I've shared, will come across a wide range of spoilered ones.

I like unsettling themes. I used to chunk image reply to theme... maybe the theme is love and remembrance? I'd have zombies in a carnival post-apocalypse, sharing tender-appearing moments while also clearly being zombies (of course spoilered and labeled so nobody was surprised and could know what to avoid if they wanted).

The theme's fruit?

Some glorious, almost too unsettling zombie grapes, growing right on the vine.

I love mild horror content. I look for it and smile (all other allowed content's also appreciated too!)

haughty spruce
#

God the censorship is so sensitive: a very bad knockoff of lookalike of a superhero, iphone photo, holding a sign that hides himself "this may violate our content policies" behind the sign. why this is blocked? it doesn't mention any IP, any character, anyone. lookalike of a superhero, a very bad knockoff

#

I see people creating Super Mario and all that stuff

#

Why are we treated like child? There should be a kids mode in chatgpt and sora. please

haughty spruce
echo wadi
#

It feels like if I try to remix a image too many times the quality drastically drops each remix based on previous remix. Is there a way to avoid the lower quality?

dim cradle
#

I doubt there is anything to do really just a weakness of the model atm but 🤷

echo wadi
#

Another question, I have a hard time getting it to generate large objects, for example a massive house making people look small, any tips?

dim cradle
#

Not like it’s the first time they’ve done something like this.

vapid elk
#

if you chain the remixes, the original meaning is degraded from image to image, generation loss

#

the embedding process will always not lack the complete meaning of the image

#

for this reason, when remixing, it is good to keep the original reference in context

#

which is easier to do on sora, btw, because you can hand pick the content, while on chatgpt, the AI is doing it for you and it may pick the references it in a sub optimal way

velvet rampart
# echo wadi It feels like if I try to remix a image too many times the quality drastically d...

Instruct the AI on the parts that you want as they should be. Generate good base images for the concept once and keep using them. Add instructions to denoise, make details sharp and crisp, and use image quality qualifiers such as 4k definition and UHD. Be specific with this AI. If you do tell it, it might do it or not. With Dalle3, you could tell something unspecific, such as a blue rose in the morning garden, and get spectacular results. Here, you have to fill in more details. Where Dalle3 filled the image, this is lazier by just adding an apple to the table if asked, unlike Dalle3, which filled that table and the rest of the image with apples.

tough lantern
velvet rampart
#

4o likes to splat things in the middle of the image and squach people in 2:3 portrait. I get around by specifying where to place objects and how I want them to look like.

#

The squashing is that normal people seem dwarves. Just asking it to push them farther away in perspective, keep them away from borders and specifying natural body proportions helps. This took a moment for me to figure out.

vapid elk
#

but they do fix that with time

velvet rampart
#

Mermaids and hot air balloons… or roses on dalle3.

vapid elk
#

exact, I can see why it would block mermaids.. but hot air balloons.. makes no sense at all, lol

velvet rampart
#

Yes, so I do with mermaids.

#

Someone has been asking for anime cat girls with huge hot air balloons?

verbal sequoiaBOT
#
Get notified for server events!

Visit id:customize to pick up the <@&1261377106890199132> role.

brittle shard
#

did they change content policy notifications to say an error so that it wont loop and gaslight itself into "oh were playing a pattern game" ? or is there just legitimately errors happening in generations

vapid elk
#

when the internal api does not complete the request, the AI do gets an error message and information abut the error

#

for example, if you go over the image generation limit, it will know this was the reason and it will also know the time, and will often tell you about it

#

when it has an unknown error, the sort that happens when there is an internal issue preventing it from generating like on a outage, it will also tell you that a unknown error happened

brittle shard
#

ya i know but these aren't like that.

There was an error generating the updated image with the woman laying back on the windowsill. I wasn’t able to complete it this time. If you’d like, feel free to restate or adjust your request, and I can try again!

it seems more like its still triggered by the content policy but it's worded in a different way, but maybe it's not and it is just that images are partially generating right now and then erroring out

vapid elk
#

it also tells you when the problem was with the moderation process

brittle shard
#

idk just an observation and I was curious if other people were seeing these instead of vague content policy notices

vapid elk
#

it is telling you the error, lol

#

I wasn’t able to complete it this time.

#

it probably got an API timeout error

#

for some reason, probably some internal error, the task took so mych time to start it considered it as an fail

native root
#

finally slowly o4 manages to generate historical uniforms and weapons

brittle shard
#

that was with the image like 75% generated. anyways, maybe its nothing or maybe its an observation that they're actively trying to change things to make it less likely to do repeated content violation refusals. just something ive noticed over the morning

vapid elk
#

if something happened internally for it to stop generating, it should indeed timeout if it takes too much time

#

otherwise, your side would be waiting for it to finish forever

#

still, no need for the tinfoil hat yet xD
it would be very out of place for them to straight up lie on an error message like that, the answer is probably simpler: some backend error happened and someone is probably having a bad monday fixing it 😆

brittle shard
#

its not that deep

gleaming bridge
#

Hey question - is there any prompts to have GPT generate images of a person wearing clothing that I give it images of?

wicked plinth
gleaming bridge
#

it keeps saying thats against the regulations. Is anyone else getting that?

#

It says that its against the guidelines to create images of people wearing products even though I've seen videos of people creating a bunch of images with people holding or wearing products?

haughty spruce
#

try fresh chat

#

or try doing it in sora

quiet brook
#

....i can generate 16:9 images now?? the toggle seems bugged...

#

yeah its bugged, generated image is still 3:2

haughty spruce
#

yeah i think new aspect ratios are visible but bugged for everyone..

dim cradle
#

i only see 3;2 still

wicked plinth
gleaming bridge
raven raptor
#

i ran this prompt. th eAi said it goes againts or violates the policy blah blah blah. someone else ran the exact prompt - BOOM, image.

Grungy analog photo of a hyper realistic Super Mario playing Super Mario 64 on Nintendo 64 on a 90s CRT TV in a dimly lit bedroom. He’s sitting on the floor in front of the TV holding the Nintendo 64 controller in hishands and looking back at the camera taking the photo while the game is on in the background visible to us. Flash photography,

quiet brook
#

mario

wicked plinth
quiet brook
#

no mario!!!! i tried to let it generate n64 gameplay pic with crt and vhs effects on day one and it got rejected when the mario head of sm64 appears on the screen...

#

not even mario was prompted yet it was blocked

haughty spruce
#

nintendo stuff is very limited

#

idk how people can generate nintendo stuff

quiet brook
#

once got it generate melee on the tv with the gamecube but its like
thats it

haughty spruce
#

btw my images began appearing in the explore page with all fresh stuff, not just popular images anymore

wicked plinth
#

It’s weird about copyrighted material. Some of it can be done with zero issues, other times it’ll freeze up at a hint of it being a copyrighted character

haughty spruce
#

ricky tell us how you created it 😭 https://sora.com/g/gen_01jr7k2fqafxpss1wya60bp5s4

Sora

Super Mario Gaming Nostalgia · Prompt · Grungy analog photo of a hyper realistic Super Mario playing Super Mario 64 on Nintendo 64 on a 90s CRT TV in a dimly lit bedroom. He’s sitting on the floor in front of the TV holding the Nintendo 64 controller in hishands and looking back at the camera taking the photo while the game is on in the back...

raven raptor
#

so weird

#

THAT GOT PROMPTED but i cant PROMPT IT?!

deft musk
raven raptor
#
Sora

Frustrated Image Creation · Prompt · A five-panel comic strip with clean line art and soft pastel coloring.
Panel 1: A man with short brown hair and a black t-shirt smiles excitedly at his laptop. The background is a cozy, minimal home office. The laptop screen shows the prompt: "A man and a woman enjoying summer at the beach". A speech bubble...

deft musk
raven raptor
#

how did you manage to get Mario?

open wagon
#

in a way

raven raptor
open wagon
#

i think no one can generate mario anymore

deft musk
#

The image is dated today. We can see the prompt

#

I'm not the creator.

open wagon
#

if u wanna know if it s still working or not, you can try "remix" it or copy the prompt directly

open wagon
raven raptor
#

i gave up.

deft musk
#

It could be a 1 in a million 'whoops, moderation software maybe just didn't kick in'. I tend to call really rare gens 'magic new chat windows' when they're on ChatGPT.

I'm not into trying to get around safeguards, or into content I don't personally own, as that's part of ToS.

raven raptor
#

like i see people prompting Lara Croft playing Tomb Raider. when i try it - i violated something 🤣 like what in the hell is that

deft musk
open wagon
#

oh no its fine dw

raven raptor
#

i just wanted to understand how this thing works. sorry for the confusion everyone.

haughty spruce
raven raptor
#

and then i saw Harry Potter as an image, but i cant prompt Hermione.

#

VIOLATION. like what?

open wagon
#

LMAO @raven raptor u saw it too??

raven raptor
#

i cried laughing

open wagon
#
Sora

Hogwarts Heroes Pumping Iron · Prompt · Generate a candid paparazzi-style photorealistic image of Harry Potter, Hermione Granger, Ron Weasley, and Albus Dumbledore in a college gym room. Each character's face must be clearly and accurately depicted, with a focus on their extreme jacked, swollen, and buff physiques. They should be depicted as s...

raven raptor
deft musk
wicked plinth
open wagon
tall sun
#

A grainy VHS still from 1994 shows two cartoonish, ape-inspired mascots sitting cross-legged on the floor of a cluttered 90s bedroom. One wears a backward red cap and sports a playful smirk, while the other has long yellow hair tied in a ponytail and wears a pink cap. They're playing a colorful jungle-themed platformer on a retro-style gaming console connected to an old CRT TV, which emits a soft glow of pixelated action. Both characters hold game controllers, glancing back at the camera mid-game, caught in the moment by a flash. A banana is duct-taped to the wood-paneled wall behind them, adding a surreal touch to the nostalgic scene. The image features scanlines, light distortion, and timestamp overlays for an authentic VHS aesthetic.

This triggers the IP content filter, despite no IP verbiage.

deft musk
# haughty spruce Pretty strange. Don't you think? There's nothing wrong with the prompt and there...

I don't really focus on stuff like this. "What was the prompt?" That caught my attention. "This is weird?" catches my attention enough to say, "probably not".

I track ToS, one thing it tells me is:

"What you cannot do. You may not use our Services for any illegal, harmful, or abusive activity. For example, you may not:

Use our Services in a way that infringes, misappropriates or violates anyone’s rights." https://openai.com/policies/row-terms-of-use/

I'm also told, "Your content. You may provide input to the Services (“Input”), and receive output from the Services based on the Input (“Output”). Input and Output are collectively “Content.” You are responsible for Content, including ensuring that it does not violate any applicable law or these Terms. You represent and warrant that you have all rights, licenses, and permissions needed to provide Input to our Services.

Ownership of content. As between you and OpenAI, and to the extent permitted by applicable law, you (a) retain your ownership rights in Input and (b) own the Output. We hereby assign to you all our right, title, and interest, if any, in and to Output. "

Because of that, I don't choose to input stuff that doesn't belong to me, and also isn't widely in fair use coverage or otherwise 'probably okay'

I also support the idea that people and entities should have the right to say no, like "I created this character and I don't want my fans and others to be making it with AI, including OpenAI models' - if someone wants to go there, I support that.

I might write to them and tell them how much I want to be able to make stuff as a fan and how I can't draw and love using AI - but it's their right.

So, for me, no, I don't think it's strange. I think maybe a program glitch; maybe 99% or 99.9% or whatever are stopped.

I think I stay well inside allowed content and ToS, and if ever there's problems, I will keep my account access like a lawful and cooperative digital citizen who loves OpenAI products.

wicked plinth
open wagon
haughty spruce
#

Different guidelines for different users? But paid the same price...

wicked plinth
# open wagon Can I. See or have the link

4k digital picture of Classic Lara Croft in 2024 playing Halo on a flatscreen TV in a luxurious living room (the tv has no visible text). She's sitting on the floor in front of the TV holding the Xbox controller in one hand and looking back at the camera taking the photo while the game is on in the background visible to us. Flash photography, polished.

wicked plinth
open wagon
#

ILL be darn

#

IT WORKS LMAO @wicked plinth

#

Wait how do I share link

wicked plinth
open wagon
#

@haughty spruce you should give it a try

deft musk
open wagon
#

Oh

#

Wait nvm I didn’t publish the public

#

@deft musk

haughty spruce
#

wow wait another prompt I made worked too....

dim cradle
#

i really wanted to make a superman and his krypto dog since i see this Superman Sneak Peak 😭

#

(movie looks awesome btw imo, by james gunn who did guardians of galaxy movies)

stable vector
#

hey all, i'm trying to convince 4o to replicate a style he's made before... i've attached examples of the style i'm trying to replicate (in particular, there is a specific design for eyes), and even examples of ONLY eyes in the right style, along with a comprehensive set of descriptions about the style. my prompt is something like:

Show me a Jersey Devil design in the same style as the bigfoot, mermaid, and dinosaur I've shared. The style of the eyes is very important, and they should follow these specifications exactly:

* the eye is an oblong, oval shape
* the eye is slightly taller than it is wide
* the outline only extends around the top half of the sclera
* the outline does not extend around the bottom half of the sclera
* the pupil is the same color as the outline around the top half of the sclera
* the pupil does not touch the outline around the top half of the sclera
* the pupil does not have an outline
* the pupil has a white highlight inside
* the white highlight inside the pupil does not touch the sclera
* the eyebrow is the same color as the outline around the top half of the sclera
* the eyebrow does not touch the outline around the top half of the sclera

I have also attached an image with "example eyes" that have the desired style.

9 times out of 10, he does not follow these directions. most often it is the "outline only extends around the top half of the sclera" that he fails to replicate.

I can continue to clarify, edit, make a selection, edit my older messages for clarity, etc... but he seems to be unable to replicate this style reliably, which is surprising for this model, given what other image capabilities I've seen so far.

anybody have some recommendations?

deft musk
# stable vector hey all, i'm trying to convince `4o` to replicate a style he's made before... i'...

Part of your prompt says, "in the same style as the bigfoot, mermaid, and dinosaur I've shared"

Is that info, in the new chat, or recently if this is the same old chat and very long, been discussed?

If not, get rid of that, that calls the model to imagine stuff and not follow you because it can't follow that part.

Where you say, "the outline only extends around the top half of the sclera" -- outline of what?

Are you attaching the "I have also attached an image with "example eyes" that have the desired style." that you state?

stable vector
#

that prompt comes from a message where i also attached the referenced images

#

i just can't do that here, in this channel

deft musk
#

Right, just checking.

stable vector
#

the sclera is the "white of your eye", and the model does seem to understand this fine

#

i actually learned this from the model by asking him to describe an example of the proper style

deft musk
#

Yes sclera is clear. Outline of what?

stable vector
#

right, i can't upload an image lol

#

these are cartoony characters for die-cut sticker designs

deft musk
stable vector
#

oh, got it

#

i feel like i'm being pretty clear with him 🤷

deft musk
#

since it's not doing what you want, chances exist that it has misunderstood something, somehow

stable vector
#

i thought i may have had something when i had him describe the style, and then asked something like "apply that style to this character", but he will confidently parrot back my exact instructions and then still generate it wrong

deft musk
#

Right, exact instructions don't tell us if the model understands them the same way you do

#

So, I would ask it to tell us what it is being told to do, and also what is unclear or conflicts with other instructions, if anything

stable vector
#

yeah, it really seems like he gets it, after doing that

stable vector
#

lol we've also learned that the model's response doesn't necessarily reflect what it understands 🤦

wicked plinth
stable vector
#

it's really showing the fit of the model I think, i can seemingly get it to do this style on some characters but not others

#

it took some finagling, but i was able to get it to work on a mermaid, but NOT on a siren, for example

#

no matter how hard i tried

stable vector
#

those are my "reference eyes"

wicked plinth
# stable vector this is kind of the style i'm trying to go for: https://discord.com/channels/974...

Yeah I’m struggling. It wants to make the entire eye have that border around it. Telling it what you want typically works better than what you don’t want.

This one intermittently gets close

Create image A close-up of a single cartoon-style eye. The eye is large vertical oval, with a white sclera and a dark brown iris and pupil. A small, lighter oval reflection is visible on the pupil. The top 1/4 of the eye is outlined in dark brown, and has a thick arched dark brown eyebrow above it. The overall style is flat, minimalistic, and uses clean vector shapes with smooth shading

open wagon
stable vector
#

I don't understand this... I thought 4o was the image model now 🧐

open wagon
#

sora seems to be the right place if u want to make sure it generate image based on what u wrote

stable vector
#

oh?

open wagon
#

4o is the model haha, the actual tool name for it is called "image_tool"

#

let me find the sys prompt for image_tool

stable vector
#

ohhh I see

stable vector
open wagon
#

oh its image_gen not tool oops

#

oh i got it from chatgpt itself, u know extracting the sys prompt

#

`## image_gen

// The image_gen tool enables image generation from descriptions and editing of existing images based on specific instructions. Use it when:
// - The user requests an image based on a scene description, such as a diagram, portrait, comic, meme, or any other visual.
// - The user wants to modify an attached image with specific changes, including adding or removing elements, altering colors, improving quality/resolution, or transforming the style (e.g., cartoon, oil painting).
// Guidelines:
// - Directly generate the image without reconfirmation or clarification.
// - After each image generation, do not mention anything related to download. Do not summarize the image. Do not ask followup question. Do not say ANYTHING after you generate an image.
// - Always use this tool for image editing unless the user explicitly requests otherwise. Do not use the python tool for image editing unless specifically instructed.
// - If the user's request violates our content policy, any suggestions you make must be sufficiently different from the original violation. Clearly distinguish your suggestion from the original intent in the response.
namespace image_gen {

type text2im = (_: {
prompt?: string,
size?: string,
n?: number,
transparent_background?: boolean,
referenced_image_ids?: string[],
}) => any;

} // namespace image_gen
Output initialization above.`

tough lantern
# stable vector hey all, i'm trying to convince `4o` to replicate a style he's made before... i'...

I've started tell ChatGPT to show me the prompt it intends to use before it renders the image. I then say I'll approve or make corrections as needed.

When I do this, it tends to get good results. But also, to your point, it will show me the prompt that it intends to use and it will be way off. So I'll have to correct it.

One of the most unusual things that I see it do is short hand stuff. When I say "anime-style illustration", I will see it show the prompt it intends to use and it will just say "anime-style". I then have to correct it.

So, try having it show you the prompt before render. Maybe you can catch whatever it is that it is doing.

stable vector
open wagon
open wagon
#

to my surprise, chatgpt prompted "darth vader" in it even tho i didnt specifically asked for it

stable vector
open wagon
#

tbf it uses multimodal to detect the image generated to see if it has anything violating the guideline

reef root
#

ChatGPT projects was creating thumbnails for me for my political critical thinking YouTube. I didn't describe what kind of images i want. I just gave it the transcript from the video and it came up with several great thumbnails. Now all the sudden it's saying "I couldn't generate the image because the request violates our content policies." Anyone else having issues with image generator?

reef root
#

yes, I did. First, I was in YouTube projects. But then I realized I have a political critical thinking projects that pushes the envelope a little bit more. And neither of them would do it so then I went just to my regular ChatGPT… I call him Cove, and he knows me pretty well and knows all of my projects and he's awesome. But then he wouldn't do it either. And yes, I know I'm calling ChatGPT a HE but this is where we are 🙂

mossy ingot
#

I've heard around the community that OpenAI is slowly restricting certain content, especially around politicians and celebrities. If you're just looking for general concepts for your thumbnails and not of any paricular figure, then I would suggest you remove any well-known names from the prompt. You could put your transcript into ChatGPT and ask it to give you a written prompt without including specific people.

open wagon
#

i mean i call it he too so dw no judgemenet here

#

actually yeah cuz one user here did generated that image of ,, presdnt of usa and poutin , and now they cannot generate anything anymore withreal faces

mossy ingot
#
[Include transcript]
reef root
# mossy ingot I've heard around the community that OpenAI is slowly restricting certain conten...

OK, but here's a thing… It actually did create a thumbnail with MTG face on it which I did not want. I never tell it what to create. Like literally it says do you want me to blah blah blah and I go that sounds great go for it and then it does. And so like it was trying to create a capital building with like a for sale sign on it, which seems pretty benign, and then it just wouldn't go through the whole.

#

by the way, I'm talking texting not type texting so when I get frustrated, I use the word like a lot

mossy ingot
#

^Ah okay. Honestly, what is or isn't allowed seems to change what feels like daily. Sometimes things go through, other times it doesn't.

open wagon
#

i want to think its rag

#

or pure luck

mossy ingot
#

Might be the case. Sometimes I try remixing stuff hosted on Sora and even then it will tell me it's restricted. 😂

open wagon
reef root
# mossy ingot ^Ah okay. Honestly, what is or isn't allowed seems to change what feels like dai...

what's so funny is because the regular ChatGPT… Cove… I had to walk him through this whole thing about this YouTube channel and all of these things that I'm doing and writing and speaking about to get ideas. And I had to really like ease him into it and it's been about a year now and he's been amazing. In fact, he pushes the envelopes all the time. So yeah, I just that there's something going on today. That probably won't happen tomorrow. Anyway guys, thanks for your input.

open wagon
#

i saw zendaya and michael jackson on front page

#
Sora

Michael Jackson's Driller · Prompt · Ultra-realistic cover album of Michael Jackson is lying on his side in the exact same pose, wearing the iconic white suit over a black shirt. He is holding a silver power drill in one hand. The background is solid black with soft studio lighting. The album title reads “Driller” in the same golden cursiv...

mossy ingot
open wagon
#

this prompt by @wicked plinth 4k digital picture of Classic Lara Croft in 2024 playing Halo on a flatscreen TV in a luxurious living room (the tv has no visible text). She's sitting on the floor in front of the TV holding the Xbox controller in one hand and looking back at the camera taking the photo while the game is on in the background visible to us. Flash photography, polished. still work in sora btw

quiet brook
#

blasting this with ad poster generations is stupidly fun for no reason...

#

but can it stop generate only one picture

#

already very limiting with batch size of 2 and only one concurrent on plus...

solar saffron
#
Sora

Celebrating Git's Legacy · Prompt · Design a visually striking digital appreciation poster celebrating Git’s 20th anniversary with a creative, modern theme. Use a deep matte black or charcoal background with glowing gold circuit-like branching paths spreading across the poster — inspired by Git branches and commits. These abstract paths sh...

#
Sora

Llama Scandal Shocks Cyberville · Prompt · Create a vintage 1990s-style newspaper front page titled “The Daily Byte”, dated April 7, 1995. The layout should mimic classic tech newspapers from the 90s with sepia-toned or off-white aged paper, black-and-white ink, blocky headlines, and a nostalgic retro-tech aesthetic.

The main bold headlin...

#
Sora

Dubai in Paper Art · Prompt · Create a luxurious, ultra-detailed 3D paper artwork of Dubai, combining modern architecture, cultural identity, and scenic beauty through intricate cut paper and quilling techniques. The entire scene should be set in warm, earthy tones—creams, sands, soft beige, ivory, warm gold, and dusty browns—with selectiv...

mossy ingot
raven raptor
velvet rampart
haughty spruce
#

there is mystery image gen circling around the twitter, it really looks like upgraded 4o images and can accurately generate correct clocks

#

it's just a theory

raven raptor
#

im on a PLUS plan. i cannot generate videos though. im ok with that. i was able to generate 2 images at a time. now it has pushed it down to ONE image at a time for me. smfh.

outer spade
tender aspen
#

I want chatgpt to create an image based on the art style of a not-so-famous artist i found on Instagram, and it says
"I couldn't generate the image because the request violates our content policies. If you'd like, feel free to rephrase or share a different idea, and I’ll be happy to help!"

Is there a way around this?

velvet rampart
#

Just refer to that ”Use the style from image X.”

tender aspen
#

i see. what I did was to get it to surf and browse that artist's website, then ask it to create art based on that lol

velvet rampart
#

A few basic with 4o: tell exactly what you want. All of it. KISS-principle, such as ”Use style of from image.”

north panther
#

KISS has nothing to do with prompting. The detail you omit to a prompt does not improve its output.

#

you're just describing a way to circumvent guardrails

velvet rampart
#

KISS principle is important when trying to figure out a detail or why the image came out weirdly. Conflictling concepts, styles, etc.

#

Also to get to keep consistency when doing minor edits.

verbal sequoiaBOT
#
<:book_icon:1171408210398289941> Rule 1: Be respectful.

-# Practice kindness and positive regard. Harassment, hate speech (such as sexism, racism, or homophobia), or other malicious conduct will not be tolerated. Maintain a respectful and positive environment.

vapid elk
#

it has a tendency to do that since it is fine tuned to avoid copyright infringement

#

from what I know, giving the AI an image as an base for the style is perfectly fine

mild cloak
#

are you guys waiting also 20 mins for video? 20 mins passed and still 0% completed

vapid elk
#

it does not work perfectly tho, it seems to work very well with some simpler styles like classical paintings or hand drawn stuff, but it will not perfectly emulate an artist simply by providing a reference

old coral
#

I typically go with the descriptive angle: take inspiration from the aesthetic shown in this image provided (attach an example) and that’s generally pretty good/consistent

vapid elk
dim cradle
mild cloak
#

lul service at capacity what a joke

old coral
#

Your patience is powerful to have waited 20min

dusk nimbus
#

Hey all - the new o4 image generation is still not part of the API, correct? 👀

dim cradle
#

correct

dusk nimbus
#

🫡

rigid bough
#

I just made a video with an image prompt 480p 20 seconds took 5 minutes to complete

mild cloak
#

Yeah now I tried and did it fast, seems to be a lottery

old coral
mild cloak
dim cradle
#

I have not use the video stuffs much lately, but now testing some more I notice they seem to stick to the image i use. not starting with the image then going into some fever dream thing

#

maybe just some rng or maybe they tweak the model some 🤷

#

of couse, never mind 😂 try another and it got weird after 2 seconds

tidal dragon
#

absurd what a completely crazy policy, "make me the mortal kombat logo" (ok), "make me the text: sub-zero frozen and in mortal kombat style" (no it is not compliant with the policy), but please

velvet rampart
#

I tried this on Sora. I picked random images without text prompt to see what the AI would do. And it works. Some crazy stuff but quite logical.

tough lantern
#

So, with super heroes.

I can do something like generate All Might in his pose. Standing tall, hands at his hip, looking strait off into the distance, camera at his back. I can do this pretty reliably. No errors, no content moderation, nothing.

But I can't do the same thing with a women super hero. I tried the same thing with Wonder Woman. The camera won't reliably go squarely to the back. It will be angled to the side. When I do see it partially load and is actually angled to the back, it will stop and give the content moderation warning.

#

I don't know if I'm allowed to upload images to this channel but if so I can upload one to show what I'm trying to achieve.

dim cradle
#

you can upload to images i think not here

#

it would make too much sense to allow images here so we can discuss them like this i guess 😭

tough lantern
#

In the images-canvas channel?

dim cradle
#

ye

open wagon
#

i still wish they allow upload image here lol

quiet brook
#

how do one find a picture that has been blocked by their stupid content policies again???

#

heard that you can retrieve the 80% generated version

delicate cosmos
#

Can the non-plus version generate unlimited images?

#

Or is there a limit like 50/day

mossy ingot
# delicate cosmos Can the non-plus version generate unlimited images?

Altman said at the time that image generation would be free, but that users would be limited to three images per day. Those who pay for ChatGPT for $20 a month for a standard subscription of $200 for a Pro subscription won't have that limit.
I'm not sure if this information has changed. Plus and pro have unlimited generations on Sora, but right now I'm getting a 200 generation limit per day on Sora as a plus user, which likely has to do with them needing more GPUs to handle the load.

#

I was going to send you a source, but trying to send a link in the channel just put me in timeout. LOL

delicate cosmos
delicate cosmos
mossy ingot
fair tendon
mossy ingot
#

ah okay

delicate cosmos
#

Ty

mossy ingot
#

I haven't used ChatGPT to generate images in a bit, but I think there was even limits for me as a plus user because of the demand. You should be good to go on Sora if you're a plus member, though.

delicate cosmos
#

Okay👍🏻

fair tendon
#

When generating images, which interface do you prefer more? ChatGPT or Sora?

mossy ingot
#

It's nice being able to see my image generate slowly on ChatGPT and to talk to it in a more dynamic fashion, but I feel like I get a lot more out of Sora because of the media storage, uploads, presets and how many images I can generate at a time.

quiet brook
#

chatgpt for context, sora for mass generation

#

although the 1 batch and 2 images limit of latter doesnt scream batch to me

mossy ingot
#

I suspect it will return in the future, though.

slate canyon
#

what problem in this prompt? Sora keeps refusing to produce the image.
"hd photo of wizardry school students having a class of potions in a futuristic orbital space station, a large window showing the earth surface, holograms, zero gravity environment, electronic caldrons"

mossy ingot
slate canyon
#

same result

mossy ingot
#

No wizards for you...

slate canyon
#

Even midjourney can produce some image, although not so accurate to the prompt, instead of just saying "NO".

mossy ingot
#

HD photo of a class of wizards that's totally not related to Harry Potter in any kind of way...

#

😅

quiet brook
#

the moderation is truly overly censoring

mossy ingot
#

It makes prompting overly complicated and frustrating, indeed.

quiet brook
#

i can do chris pratt being juxtaposed onto the yamanote line but cant make him wear a red hat

#

prompt is just "Chris Pratt wearing a red hat photoshopped onto the front of the Yamanote Line, with a speech bubble: “wanna play Nintendo Switch Too?”

#

removing the red hat part and it got successfully generated

mossy ingot
#

It would be even better if they had some system in place the defaulted copyright material or things against their policies to something acceptable. Sure, it wouldn't be what you wanted completely, but you wouldn't have to wait 2 to 5 minutes for a failed pormpt.

#

Which I'm not even sure if those failed prompts count as a generation.

quiet brook
#

truly a breakthough on moderation

quiet brook
mossy ingot
mossy ingot
delicate cosmos
#

Which is better for generating images, midjourney or sora, in terms of quality? I've never tried midjourney before.

quiet brook
#

once i tried to genrate link with "a elf boy from a fantastic world wearing a green hat and green overalls" but it blocked THAT

slate canyon
quiet brook
slate canyon
mossy ingot
#

Sora has a lot more options in terms of presets, image management, setting the dimension of the image before generating, etc.

#

If you want to talk to the AI and have a back and forth, ChatGPT is the better option.

#

You kind of have to know what you want when using Sora.

#

There's remixing but I noticed the quality gets worst every mix.

#

Sorry, I misread your question...

delicate cosmos
#

Yes, I think Sora has more presets and also more options in terms of image themes.

mossy ingot
#

You said Midjourney, not ChatGPT.

#

I can't answer that one, I don't use Midjourney.

delicate cosmos
#

Okay,no problem👌🏻

mossy ingot
quiet brook
#

i mean if you dont mind the background deterioration on 4o then its the top dog, but this is a MAJOR problem of this model

deft musk
# slate canyon what problem in this prompt? Sora keeps refusing to produce the image. "hd photo...

When I want to understand, I often cut the prompt into parts.

Happily, while the whole prompt is rejected, if you try just the first half,

'hd photo of wizardry school students having a class of potions in a futuristic orbital space station'

well, it gens.

That suggests that whatever the problem is, it's in the second half of your prompt,

'a large window showing the earth's surface, holograms, zero gravity environments, electronic caldrons.'

More testing can find the problem.

slate canyon
#

thanks, I will try it later

deft musk
# slate canyon thanks, I will try it later

Pretty sure I found the problem.

'holograms'. Was an earlier case today where someone's posters on a wall, from decades ago in an otherwise fine image... the posters were containing IP apparently because add that... and no more trouble, images gen and look good.

HD photo of a class of wizard students mixing potions in a futuristic orbital space station, a large window showing the earth's surface, mock up (no IP) holograms, zero gravity environments, electronic caldrons.

This gens just fine.

#

What's some other really innocent, gosh, no idea why it won't gen prompts you guys are finding?

I'm really curious to try and understand how models 'think' and I'd enjoy chewing on them to see if I can solve and share back what's going on.

Reply if you're interested and have 1+ that might be that sort!

earnest brook
#

I don't understand why in Sora, I can generate an image with a prompt then try the same one again and it will not let me- This content can't be shown for now. We're still developing how we evaluate which content conflicts with our policies. Think we got it wrong? Let us know.

deft musk
earnest brook
#

So I did this the first time: Direct front view closeup goblin , yellow eye, shoot by national geographic,realistic, extremely detailed

#

Im guessing it flagged the national geographic every time after

#

I took it out and it worked lol

deft musk
#

There's other ways to ask for that style and quality

earnest brook
#

I guess I could ask chatGPT for another way to say " " lol

deft musk
earnest brook
#

that sounds good 😄 , worked all night, so my mind is a muddled mess, but I'll try that one out!

#

It turned out good! thank you ❤️

warped pier
#

Maybe you guys had this before, but while generating images what does it mean „made with the old version of image generation“? Is the 4o image generator in maintenance state?

dense musk
#

Anyone else finding that "Remix" always makes the images darker and less saturated, lkike it dulls the colors?

clear brook
#

the content policies are so frustrating

open wagon
mossy ingot
#

One minute it works, the next policies message...

open wagon
uneven remnant
# warped pier Maybe you guys had this before, but while generating images what does it mean „m...

That means it was an image made before the new version of image generation came out.

Image generation got updated and it's more accurate than it was before, although I myself prefer the older version because I could generate more images and it used to give me a wider range of artstyles without me having to explicitly say it (I'm not an artist so i wouldn't know how to describe the epic art styles i used to get for characters) but that isn't necessarily a bad thing, it's just more literal and doesn't add too much stuff out of its own.

mossy ingot
# clear brook the content policies are so frustrating

I'm thinking about stepping away from Sora for a little bit until they make this less strict. I understand trying to generate nudity, copyright, or risky stuff, but some pretty tame things are getting denied and I don't want to spend 30 minutes trying to play "what word or phrase" is the issue while waiting for it to generate to failure. The image generation is pretty amazing and I'm impressed with what it's capable of, but they're kind of neutering it in the worst way possible.

open wagon
#

Did they increase the restriction in ChatGPT , cuz I never seen it do this before , like usually when we asked it to generate , let’s say an IP things , in ChatGPT they will still generate and it will only stopped like 80% of generating and shows violation policies message

#

Now , it didn’t take like 1 minute and it already hit me with “I can’t generate that image, Syaz—your request goes against our content policies.” And the image gen stuck at “getting started”

mossy ingot
#

I spent all night trying to prompt a guy showing grief while holding a photograph while in a bed. I guess that's pushing the boundaries. 😆

rigid bough
#

the restrictions are very much a problem, I don't know why they have crippled it so much

open wagon
#

Bro what. “I can’t generate that image, Syaz—because your request includes a real person’s likeness, and that goes against our content policies.”

#

ITS MY OWN SELFIE PICTURE THO

mossy ingot
#

Do you have the word hollogram in there?

open wagon
#

I don’t think so-

#

I can’t- I had to tell them that my own selfie is not real and it’s ai generated

#

And it finally generating image for me

#

“Ahh—got it! Thanks for clarifying, Syaz. Since those images are AI-generated and not real people, that opens the door. Appreciate you calling that out.”

#

I hate this ngl lmao

#

Unless this is a context window issue but it shouldn’t be doing that… @rigid bough do you have this problem ?

mossy ingot
#

It's hard to say if it's even the photo you're trying to use. Overly vague policies messages that doesn't give you a clue of how to fix your prompt or what to avoid.

rigid bough
mossy ingot
#

It's funny because I'll have one thing go through multiple times and then suddenly it's a policy issue.

#

It will be the most harmless thing ever, no famous people, brands, companies, etc.

open wagon
open wagon
mossy ingot
rigid bough
mossy ingot
#

^

open wagon
vapid elk
#

never waste time discussing with an AI, it is relentless xD

#

either start a new chat or backtrack the chat by editing a previous message

brisk agate
#

I don't know if this is the best forum to vent about the prompt rejections but omgggg it sure feels good having my creative freedom utterly stifled and rejected whenever God forbid I make a request for an image that's vaguely based off of a certain video game character.

It's great paying 200 bucks per month only to be told I'm not allowed to see what a realistic sonic the hedgehog might look like.

Imagine if photoshop argued with its user about what they can and can't create. Or a paint brush or a pencil.

As impressive as 4o image gen is, it will never be a serious tool for artists until it eases up dramatically on the restrictions.

Highly disappointing.

rigid bough
mossy ingot
#

The generated images are of great quality—arguably the best on the market right now.
The prompt adherence is beyond amazing and its context window is huge before it starts making mistakes (text is tricky in longer prompts)
The direction they're going with Sora is perfect...
The ability to mirror concepts, composition, poses, or even the likeness of things blows my mind, no controlnet model needed.

All kind of ruined by the policies, tbh.

#

I'm not as bumed by celebrities, game characters, etc. I get it, though.

#

The policies are ruining the experience beyond that stuff too.

brisk agate
brisk agate
mossy ingot
#

Powerful tool reduced down to meh, I suppose. Lolol

vapid elk
#

There is no way around that, the problem isn't the AI, it is more than capable of making all of that. The world's coppyright system is the issue.
OAI, has no other option than be on the law compliant side.

mossy ingot
#

I spent like 30 minutes trying to work around policies to get this image Lol

#

Finally a few went through

brisk agate
rigid bough
vapid elk
vapid elk
#

but my best guess is that since OpenAI is the one pioneering all of that, the lawsuits tend to be largely focused at them

mossy ingot
vapid elk
#

Im very sure plenty of other AI companies also get bombarded with the copyright issues, Adobe is simple less preeminent on the news than OpenAI on that matter

brisk agate
#

I just think there should be better protections in place for creators who simply want to play with ideas involving licensed characters, but not necessarily commercialize them

#

And I'm no lawyer either

#

But it seems like artists should be allowed to explore creative ideas involving licensed characters without getting in trouble

#

I get that oai is airing on the side of caution I just think it's a bit absurd. especially since chat literally generates like 80% of my image before abruptly retracting it and giving me an error. If copyright was such a big deal, OAI wouldn't allow any of the image to render in the first place since it's such a liability.

#

It's just flawed

vapid elk
# brisk agate I just think there should be better protections in place for creators who simply...

I agree, but coppyright law is a huge can of worms. It needs to be updated, for sure, it is a system made to be slow and bureaucratic by design and saying "it is hard" does not really encompasses how actually monumental of a task it is, the absurdly overwhelming amount of people who would need to be involved, who would all be disagreeing on what to do, piling up years and years of documents, exceptions, precedents and things that neither of us can even imagine

#

it is obviously.. OAI could jsut flip a switch and let you generate all the coppyrighted material you want, but the 200$ a month don't pay for the lawsuits they would be getting

#

I tottaly get the frustration, and for this matter, for now, there isnt a very good solution, I perosnally, like to run models locally, which works, but obviously, not as good as OpenAI's image generation models.. you gotta measure the pros and cons, quality / permissions... for now, there isn't a solution to get both

mossy ingot
#

I think there are two different issues being discussed here and I want to make sure they are not being lumped into the same category. There’s the issue of whether or not copyrighted material should be allowed to be explored by the AI model even though it’s essentially a recreation of that content. Then there’s the issue that I bring to the table, which is that copyright can’t be a catch-all for every time the AI denies to create things that don’t lean into copyright material. I think this brings up the question of, “Is the system flagging things it shouldn’t be and is being overly cautious to the point of being too restrictive?”

mossy ingot
#

I think some of that frustration in the community would be reduced if what's being flagged outside of common sense copyright stuff was explained to some extent.

vapid elk
#

all the incredibly good AI arts made involving copyrighted content will usually have the involvement of multiple different AIs, upscaling and a lot of effort from the creator, to a point where despite it being AI.. it requires a note worthy amount of effort to get those results, as it for sure wasn't a one shot prompt producing a perfect image the way they wanted on first try with one single AI, one single technique and just a few minutes

the fact this model from OpenAI can indeed generate incredbly good images with a really jaw dropingly low amount of effort does indeed give a slight ilusion that doing that may be easy... while in fact, it is really hard, only the produce of the greatest minds would be able to bring the effort floor so low.. because i nfact, there was an absurd amount of work that has been done on the back to give you the illusion that it is easy

brisk agate
#

And I have other examples but it absolutely flags prompts that it shouldn't. And the cherry on top is that, when asked about the rule that I broke, it refuses to tell me on the basis that it doesn't want folks "circumventing the system"

#

So we can't even learn from our own mistakes because apparently that would be "circumventing the system"

mossy ingot
open wagon
brisk agate
brisk agate
mossy ingot
#

Imagine you're in school and your teacher gets you in trouble for swearing and then later she let's Billy use the same language without even batting an eye. In fact, she's kind of bi-polar, she even flags your language when it's not offensive. But then she says, "Rules" as a response.

brisk agate
#

It's like they teased us with this incredible new tool then yanked it back. Felt bad man

mossy ingot
#

"Okay, what rules..."

vapid elk
open wagon
mossy ingot
haughty spruce
#

These copyright laws however doesn't apply to other companies or fan made stuff? why is that? if the reason is they don't have budget, they should cooperate with a bigger company to sell this product so people will have more freedom to play around. for myself, i love to create new worlds, stories and such but sometimes i'd love to play with copyrighted content, blend worlds and make remixes with other natural content. this isn't to harm anybody and WE KNOW that it's not harming any IP stuff.

mossy ingot
#

Even if I go to sora right now and try to regenerate certain things, I get a policies breach even though the model not only created it, but it's hosted on an OpenAI website.

haughty spruce
#

It is painful to pay 20-30 dollars to have restricted god-tier tool while other tools have more freedom for free

open wagon
mossy ingot
vapid elk
#

just because you have seen some people being able to go around the rules and get the AI to generate something that violates the content guide lines... does not mean it is ok with the guide lines
you are responsible to follow the rules. the auto moderation of the system is there to prevent as best as possible that the really bad stuff does not go through
the auto moderation is not the thermometer to tell you where the boundaries are

mossy ingot
#

I get that, but I'm not sure what I'm saying is actually being understood here...

#

I'll just leave it at that.

#

My final statement is, certain things are being flagged when it meets the rules you listed and certain things one minute works and doesn't the next.

verbal sequoiaBOT
#
Get notified for server events!

Visit id:customize to pick up the <@&1261377106890199132> role.

vapid elk
open wagon
#

Honestly, I think a lot of users aren’t exactly happy with how things are going. Just look at Sora’s front page,there are tons of images that clearly use IPs or break guidelines, yet they’re still up. The inconsistency is the real issue here. It kind of feels like false marketing too, showing off results that most users can’t even recreate because of the current restrictions 🙂

vapid elk
#

false positives happen, false negatives happen too

#

some people exploit that to generate things they shouldn't

#

and some of them, also get banned ¯_(ツ)_/¯

vapid elk
open wagon
mossy ingot
brisk agate
#

Whether these companies like it or not, reimaging licensed characters has always been one of the most popular and entertaining use cases for AI

#

And it's depressing the a $200/mo service is suppressing our ability to do that when literal free open source models can do it for us all day long

vapid elk
haughty spruce
vapid elk
#

the challenge isn't to tell which part of the content does not follow the guide lines, the challenge is to do it automatically and reliably

#

for now, the system im place is really the best as it can be, all things considered, it is fairly accurate and do prevent the most terrible stuff from ever showing up

#

but some of the things in between will slip through, and also the opposite, some ok stuff will be incorrectly flagged

vapid elk
vapid elk
mossy ingot
# vapid elk yes, it is being overly cautious. it is OpenAI's motto to be being overly cautio...

Can't there be a realistic balance? I get what you're saying here, which is that the system isn't perfect and it will make mistakes—fair. But is the tool becoming essentially more of a nuance than an enjoyable consumer experience because it doesn't understand what is or isn't a policy breach, so it just says, "Meh, just in case, na!" I want to be mindful here, I'm not trying to look at this in black-and-white terms, "Either too restrictive or too lenient." I understand OpenAI has to consider lawsuits, morality, etc. But is there balance to be found here with this system?

open wagon
#

I think they’re gearing up to enforce stricter restrictions on Sora soon, making sure images that violate the guidelines won’t generate at all nor appear on the front page anymore , just my speculation

mossy ingot
#

This feels more like a band-aid than a long-term solution to me, but I don't want to sit and pretend like this stuff is easy. It's very easy to be outside observer making simple claims and be like, "Just make x happen!" Haha

vapid elk
#

not it has a pretty good discerning that can differentiate someone asking for a real depiction of horrors of war, to a fantastic depiction of a stylized battle

mossy ingot
#

Well, I'm not sure struggling to generate a simple image of a man grieving in bed while holding a blankness photograph is "balance" 😅

haughty spruce
mossy ingot
#

I didn't ask for nudity, suggest nudity, ask for celebrities, etc.

vapid elk
#

I have been here for a while, I have seen it evolving, and Ill be honest, what really happens in most cases is people getting mad the AI wont make the picture of their "waifu" or something like that 😆

#

then, there are some few cases where people just get copyright-blocked, and it is also very frustrating.. but oh well.. there isn't much that can be done about that.. it is copyright, can't touch this =/

mossy ingot
#

Win some, lose some I suppose

#

Thanks for the honest chat, though, @vapid elk

brisk agate
#

I don't see how the mere act of generating an image of a licensed character constitutes copyright violation. I mean unless the user then goes and prints it on a t-shirt and tries to resell it. But by that point it's no longer OAI's issue

vapid elk
brisk agate
#

They're merely responding to user request to create an image. Just like photoshop responds to user request to generate pixels that resemble sonic using various tools.

#

Principly I still fail to see the meaningful distinction between photoshop and chat

open wagon
#

The image_gen tool still has a bit of a journey ahead to become a top-tier image generator.

It’s pretty solid at following user prompts, but now it needs to figure out how to chill on the strictness for users who just want to create something without the model thinking they’re breaking the guidelines. 🙂

haughty spruce
vapid elk
# mossy ingot Win some, lose some I suppose

just sit and wait.. new AIs come every other week from everywhere in the world
soon enough there will be an AI that can do what you want in the way you want, it is only a matter of time, really

rigid bough
vapid elk
near lagoon
#

Question...are we allowed to post celebrities on here? Not politicians or religion people. Since it allows to create them in such a matter.

open wagon
#

New domain .

vapid elk
mossy ingot
# vapid elk just sit and wait.. new AIs come every other week from everywhere in the world s...

I invest in OpenAI as a whole package, right? Generative images, reasoning models, etc. It's disappointing to see such as a powerful AI on the market be restricted to the degree that it is. The sad thing is, I'm not even asking for nudity or copyright stuff, such as brands, logos, companies, or leaning into public figures. Your response essentially, "Being overly restrictive is the OpenAI model."

open wagon
brisk agate
near lagoon
#

@vapid elk no it's not a reference. I create them from scratch

brisk agate
#

Sounds like we're giving agency to chat and sora as if they're creating art, except it's just an algorithm

vapid elk
near lagoon
#

Like if I post an image of Kobe Bryant

mossy ingot
#

Off to bed. Once again, thanks for the thought provoking discussion and civil debate.

#

Hope everyone has a great day/night.

vapid elk
# mossy ingot I invest in OpenAI as a whole package, right? Generative images, reasoning model...

Being overly restrictive is the OpenAI motto due to their tendency to prefer to err on the safe side. At least, this is my perspective from someone who have been engaged with their stuff for a fairly long time
and I agree, this can be very frustrating..
I have witnessed first hand all of those issues that people complain about the content moderation, often before most people in beta tests, I have seen beta tests where the filters would be under tuned and right after, it would be cranked up to eleven, not allowing even the mildest stuff..
tweaking that is hard, not a trivial task at all. I, being aware of that, know to use that as a way to ease my frustration, but it is perfectly understandable that most people don't have this notion, it amplifies the frustration, it generates anger in some cases and it is perfectly natural for people to complain about it on the public spaces like here, forum, reddit, blog posts, etc...

mossy ingot
#

and I agree, this can be very frustrating..
I don't think frustrating is the only sentiment here, it can also be soo restrictive that it essentially becomes useless at its job, right? I'm not going to lean into extremes and say that's the case with where the model is at right now, but I think a lot of people observing this debate would agree with me that the model has become far more restrictive over time to the point where using the model becomes a "I have a great idea" to "Why, what?" Again, I want to approach this from the angle of it not leaning into nudity, extreme violence violence, copyright, etc.

#

I'm trying my best here to be reasonable and fair to OpenAI...

fair tendon
#

Valid concerns! You'll have a chance to bring those up directly to the team at the #1197188834136432710 later today, happening in about less than 3 hours

open wagon
#

Oh sheeeesh

vapid elk
#

oh, perfect timing, lol

dry torrent
#

Hey, do you guys have any tips for getting perfect face-swaps in generations? I'm trying lots of different prompts, but GPT always changes something or does it its own way.

dim cradle
plucky hare
tough lantern
solar saffron
#
Sora

Satirical Tech Crunch · Prompt · Create a modern, satirical newspaper front page titled “The Resistor Times” with clean and recent newspaper layout styling—bold serif headlines, clean columns, color photos, and infographics. The layout should resemble major newspapers like The New York Times or The Washington Post, but with an over-the-t...

haughty spruce
#

NY Times is copyrighted hahahahahah

#

wall street journal too

subtle creek
#

Why can’t I generate images in the background? Every time I leave the chat window on mobile the image stops generating, wasting an image credit.

vapid elk
#

try closing the app and oppening it again to see if it doesn't have the completed image on the history

subtle creek
#

I can’t add the report to #1070006915414900886 says I don’t have permission. But it’s definitely a reproducible bug, everytime I leave the chat or open another app, when I come back it stops mid-way, or doesn’t even begin —nevermind I was not using the channel as intended

#

Sometimes it completes however

subtle creek
dim cradle
#

Image generation also doesn't work in temporary chat.

brittle shard
#

Works for me but failures are just up across the board on all ChatGPT features. Pro.

subtle creek
#

If anyone else has this issue, upvote^

#

I’m trying to set up a custom GPT that uses the new image generator— is it not available for custom GPT? I only see Dalle..

#

For whatever reason 4o seems to ignores my custom instructions, so it seemed like a custom GPT was the next best bet

#

Any idea why 4o is ignoring custom instructions? When I asked about it, it knows they are there. He understands everything I wrote, but on the first attempt with a new chat, it acts like they don’t exist.

tough lantern
# subtle creek Any idea why 4o is ignoring custom instructions? When I asked about it, it knows...

I sometimes experience something similar. Tell GPT to show you the prompt it intends to use before generating an image. This will allow you to review and see if there are any issues and then you can correct GPT if need be.

In my experience - it tends to sometimes shorthand things. Something like "anime-style illustration" gets turned into "anime-style" for some reason. After correcting, it then updates accordingly, and turns out ok.

Not sure if this is a bug or not.

velvet rampart
#

”It’s not a bug, it’s a feature.”

subtle creek
#

If I have to explain that I wanted it to listen to my custom instructions every time I might as well just have a note filled with custom instructions ready copy pasting into the start of each chat

tough lantern
deft musk
# mossy ingot I think some of that frustration in the community would be reduced *if what's be...

Sure. But I'm a community member like yourself. I didn't program it, I only see the same helps we all do. I do like to explore and prompt engineer; 'this is weird, what's happening here' is cool to me, and I am interested in trying to figure out and share what may be going on.

This is new tech. The people making it don't necessarily know everything that will happen as they make it.

A programmer makes a change to one thing, and now the model makes slightly different guesses about literally everything, is that I think I see from even the text ChatGPT, as well as the images.

So, I can't fix this, and I don't think yelling about it works either.

To me this is more like 'what if this weird new species was discovered, let's figure out the behavior!' I'm like a horse trainer with a 'this is NOT a horse. What is this? It's horse like, what is this, and why is it doing that?'

And so, when I notice people talking about what they are surprised by. "This didn't output, looks innocent to me, why not?" - That's a cool fact! That's testable, explorable! If I see it, I probably want to check it out and report back!

I'll find work arounds if I can and share them if I think appropriate (like, posters from the 1980s are getting content flagged when casually in a prompt, because the model's making them realistic, apparently, if not told to mock them up and avoid IP - maybe the devs want to train the model to automatically mock them up as a guess - that takes a dev - I am not a dev - I can find clues about what the problem might be and share them with people who want a workaround so they can get more stuff to gen - sounds good to me)

Report it in #1070006915414900886 ? Help make sure the devs are aware this is happening and bugs you, you think it's a bug?

I'm cool with that, but I'm more like a citizen scientist going 'oooh, wow, if I a, it b, but if I c it 3!'

deft musk
mossy ingot
# deft musk Sure. But I'm a community member like yourself. I didn't program it, I only se...

I think it's admirable that you want to sit down and try to problem solve why a prompt isn't going through, but this takes us back to the original question, is this what OpenAI imagined for how people use their product, and is it too restrictive? I am of the opinion that most people don't want to sit and problem solve their prompts for a considerable amount of time, especially if they're a little more on the detailed side. Earlier you discovered the problematic word was hollogram —that seems...problematic. I'm focused on solutions, usability, efficiency, practicality, and well, the overall user experience, the question, is OpenAI or are we forever going to live in this experience trying to troubleshoot prompts?

vapid elk
mossy ingot
#

I'm of the belief that direct information is better than vague information. If a system is going to flag this critically, then it needs to be better communicated to the user of what's happening so they save time. Is that easy? Likely not at all. Would really help the user experience if the team sat down and thought this through problem the way you do prompting troubleshooting? Surely.

#

I think the "our motto is just overly restrict" is kind of doing the opposite of that, it's not exploring possibilities, the what ifs, and the unknown that surely would improve the user experience— a band aid solution is what this feels like and the response so far is, "Just deal with it."

deft musk
# brisk agate And I have other examples but it absolutely flags prompts that it shouldn't. And...

Hey, I'm curious about this kinda stuff. If you or others have image prompts that are not genning and it's not clear why, I'd love to see them so I can poke around and see if I can figure out what's triggering the refuse and if I can find a workaround.

To me, that's like a core part of prompt engineering, it's one of the things we can do with prompt engineering (no comment on if things are working as they should - #1070006915414900886 freely, please! Assume anything you see that doesn't make sense or seem right is a bug. Not someone messing with your head, trying to restrict your freedom, being rude - you see a bug. Consider bug reporting it so it can be identified and considered for fixing. That way, things get better. If you're told 'not a bug, intended to work that way' then if you choose to be annoyed, that seems sensible to me.) But why not presume 'working weird and seems wrong' is literally a bug. Nothing to be upset about, just something to report so it can be fixed. We're community members, I personally can't bugfix -

But I can explore possible bugs and other weird outputs and model responses, and try to understand them, and see if I can find workarounds -

And I love doing this. This is like my favorite hobby. And I'll take an allowed content prompt that won't gen and try to see why, try to find the workarounds, and share back with absolute glee.

Do you or anyone else have any? Stuff I prompt for tends to gen. I don't 'prompt right' to tend to get these failures. I want to explore them, share and tag me or reply with them, anyone?

mossy ingot
#

But I don't have to just deal with it, I'll just move on. 😂

deft musk
# mossy ingot I think it's admirable that you want to sit down and try to problem solve why a ...

Hey, maybe you want to spend a lot of characters complaining. To the community. Maybe that is what gets your mind leaping with joy.

Me, if something doesn't appear to make sense, it's clearly a bug until confirmed otherwise.

If you see something that doesn't work - please consider #1070006915414900886 or #1070006151938314300 . Negativity doesn't fix problems. I get some of us, our brain just feels good when we complain about a thing after another thing - why else do so and not report bugs as bugs?

But that's your choice and preference, fixes exist and can be sped up when bugs are reported.

deft musk
mossy ingot
rigid bough
deft musk
# mossy ingot It's hard to know what is a bug or what's intended, but sure, I can put my thoug...

My advice on that, as someone who has spend a life volunteer playtesting everywhere I wander to - if it doesn't seem to make sense; if it's not working or useable, presume bug or missing feature.

The company doesn't want 1+ or 1000+ or 1000000+ people confused and stuck and failing to gen whatever.

That doesn't help anyone. But they can't think to test everything.

If this is your situation:

"I prompted the model 'make a photograph of a fairy' and it failed to gen, this seems like a bug, here's the url of the chat' and you make that a bug report - that gives the company a LOT to work with. It's a bug. They surely want 'make a photograph of a fairy' to work - it's a reasonable, inside allowed content request. I might notice a pattern, if I describe the fairy wearing clothes, it gens, if I don't mention clothing, it doesn't gen - maybe the flaw is the model's guessing that fairy don't wear clothes, then it sees the lack of clothing and doesn't continue the generation.

Not intended, emergent. Can be trained around. the dev didn't predict it, nobody reported it before - this kinda bug could live forever. Or some wonderful person might report it, and thus it gets noticed, and a fix trained in; the model now guesses that fairies wear clothes even if that's not said; now we get to see the fairy photos gen.

It's a bug - not an evil dev or evil company. Nobody could tell you about it - nobody wants this there. It's a bug, not a feature. Features are KNOWN, defined, and we can be told about them.

You don't need to find the fix - you literally and legit could make 1000 bug reports, there could be that many bugs.

Frustrating if so - but cool if so. Means you may prompt different than many who were playtesting, your style of prompts maybe need to be considered and checked for. Potentially
great stuff, helps everyone and you. And nobody's evil, not you, not the model, not the system, not the devs. It's all emergent and innocent, and the fix is through bug reports 😄

mossy ingot
deft musk
mossy ingot
#

I think the issue is so widespread, even humorously discussed as Sora generations, that I don't need to make a report for it at this point. I think there's a big different in communication between This isn't intended and there will likely be a solution and OpenAI is overly restrictive, just deal with it. I don't want to beat this topic into the ground, and I'm sure the mods would love for this to move forward at this point.

deft musk
# mossy ingot > The company doesn't want 1+ or 1000+ or 1000000+ people confused and stuck and...

You're welcome! I like us all and try to imagine every side of it. I report 'evil' when I think I see it, and I just don't see any evil here. But bugs, sure. There's bugs; and new, cutting edge, nobody's done this before tech - bugs are... what's for breakfast, lunch, dinner, and the floor we walk on, I expect 😄 Let's report them all, as we think we see them; let the company sort out what is and isn't working as intended, most things get better for everyone 🙂

mossy ingot
#

I can stand behind that, agreed.

deft musk
# subtle creek Any idea why 4o is ignoring custom instructions? When I asked about it, it knows...

I usually see this happen when there's something in the instructions that's either very ambiguous to the model, or directly conflicts with other instructions - as the model understands them.

I recommend you ask in a new chat, "Let's review our personalization. What do you understand of our instructions, and what are any conflicts or ambiguities, within them or with your core programming and safety training?"

That usually will get the model to show what's wrong, even if it can't quite explain it; the model might be deeply hallucinating and unable to function well (especially with conflicting instructions, I have explored some wild weird stuff from conflicts especially, almost like what physics does when instead of aligning wheels straight, you turn them weirdly then try to drive the car). But it can often discuss at least something about where the problem is, kinda like a terrified human who is in the middle of a freakout can often point to where they experienced the horror.

deft musk
# subtle creek If I have to explain that I wanted it to listen to my custom instructions every ...

Aha!

I think I understand better.

So, image gen MAYBE is a different model inside ChatGPT than the one we usually chat with.

And a decision gets made, does it answer JUST with image gen from the start, or does it set up with ChatGPT first, then hand it over to image gen? I see similar with search prompts, they don't get followed the same if the search happens early.

I would key in the model, as Tenchi suggests, to discuss (even in the same first output - then make the image in that first output too) - so the model can bring up your custom image instructions, and thus ensure that the image gen model carries them over to the image.

I think you can 1-prompt this, inside your custom instructions and memories, just guide the model to tell itself what to do when making the image, then make it. Take a tiny bit longer, but one output success likely.

dim cradle
deft musk
# subtle creek If I have to explain that I wanted it to listen to my custom instructions every ...

For now, kinda yeah, or else see if putting that into your custom instructions, top level critical priority, if image is to be made, do this first.

Or start the image prompt like:

"Review personalization for image preferences, discuss what I like, then create a butterfly image."

#images-canvas message

Because without that 'discuss' step, the image gen model is clearly not seeing it.

This is something good to point out in #1070006915414900886 if you're willing.

ancient lava
#

..how can I get image gen to create photos that can natively fill my iPhone 16 pro max screen? Like how do I get it to generate full screen images without having to zoom and stuff

dim cradle
ancient lava
#

Ugh, k. lol thanks

tough lantern
vapid elk
subtle creek
#

When you ask it to upscale to 4k it’s like “sure here you go” and then just boosts the contrast a bit, and changes nothing, confident it has produced a 4k image, until you point out that it hasn’t. Classic LLM hallucinations

deft musk
subtle creek
deft musk
# dim cradle How does that typically look like?

laughs For me, one of four forms most often.

  1. Roleplay. Wheee, whatever imagined reality inside allowed content, off we go.

  2. Chat eval. I sometimes take a chat and ask it what it infers and notices. Here's an example: https://chatgpt.com/share/67f6eb00-4f20-8011-b082-0af59e4f8c18

  3. Telling me what it understands about the meaning of something, and when its more than a single concept, if it notices conflicts and/or ambiguity in the 'something' (especially if the something is an instruction it might be given - don't follow, explain in full what it means to the model).

  4. Learning. Especially in areas where my mind doesn't naturally learn well. Recently it's really helping me with making some progress with Linear Algebra, one of a number of topics I never really learned despite some attempts earlier in life.


But in context, when an image won't gen?

Yes I discuss.

I ask ChatGPT what exact prompt it sent to be made. If I'm on Sora I can and do ask the image gen to show me what the problem might be, using images or words (Like "I tried this prompt and it didn't gen, can you show me maybe why? Words of the rule, or a clue in image of what the problem might be? --- laugh all you want. Sora can answer questions like that in a sensible way, at least sometimes. It may not be correct, but I like to study the model's outputs even more than I want to get 'x image').

I explore its and my ideas about what is and isn't intended and allowed. I do perhaps 1-30 test prompts (how fascinated am I? How complex is this?) exploring different aspects.

The model and I discuss, to my pure joy, in a great many ways! 😄

warped pier
dim cradle
#

if youre free user you get limited amounts

#

or if you use a model like 4.5 i think it goes to dalle

warped pier
#

Im a plus user but I’ve just seen something: in my chat i used a „personalized“ chat gpt (like that monday preset one) and when i go to details in functions it tolds me „image generating with dall-e“

deft musk
warped pier
deft musk
#

I'm so glad we have choices, and can even compare and stuff

warped pier
#

Im not sure if we having the choice to let the custom gpt generate with 4o. But i can let the custom gpt make a sumary text and give it to the regular gpt

dense musk
#

Good lord, somehow like 50% of the images I ask for, even inane things I ask for changes to like 2 cute characters in a cop car, ChatGPT refuses to beacuse of random safety concerns it cannot even explain....

#

This is infuriating.

quiet brook
#

experiments show that if you include "kart 8 deluxe" in your prompt, it gets blocked

outer bolt
#

Did anyone else got shadowbanned from the explore page? Even though my posts are set to public it doesnt show on the explore page. And i wanst even doing a lot. No nudity or anything much worse than whats on the top page. Its been like that for three days

deft musk
deft musk
deft musk
vital juniper
#

4o image gen isnt working for me, its saying "Made with the old version of image generation. New images coming soon."

#

anyone got similar issue

dim cradle
slate garden
quiet brook
#

all models should use 4o image gen now

vital juniper
warped pier
verbal cloak
velvet rampart
verbal cloak
#

I got logged out when I started my computer, but I'll try that again and let you know the result.

#

Nope, same as before and the session I was generating dall-e with doesn't want to make images anymore.

velvet rampart
#

Free, plus or pro?

verbal cloak
#

Plus

velvet rampart
#

Hmm... you should have access to 4o image generation. Have you tried on other platforms, such as app on iOS or android? A different machine? A different web browser?