#codex-discussions | OpenAI | Page 55

uneven kayak May 5, 2026, 10:28 PM

#

Man I just wasted 11% of my weekly useage trying to ask 5.5 high to do more than it was capable of doing. I guess I discovered the limit lol

#

I've never seen an implementation with this many obvious holes. It's like 2021 level AI work.

boreal holly May 5, 2026, 10:29 PM

#

You ever mess with Lorawan and stuff like that?

signal tapir May 5, 2026, 10:31 PM

#

boreal holly You ever mess with Lorawan and stuff like that?

Not lorawan, no. Lots of regular wifi sensors

#

But, I really prefer wired everything is possilbe

boreal holly May 5, 2026, 10:32 PM

#

signal tapir Not lorawan, no. Lots of regular wifi sensors

Nice, well I guess for indoor farming wifi makes sense but I was reading up on Lorawan and it's apparently the bees knees for outdoor farming cuz it's low power and works over great distances with zero interference

signal tapir May 5, 2026, 10:33 PM

#

boreal holly Nice, well I guess for indoor farming wifi makes sense but I was reading up on L...

I'm sure it's perfect for that

#

I have a friend who has done experiments with power harnessing from the environment to power those things to send home data every now and then

uneven kayak May 5, 2026, 10:33 PM

#

Man this is a good learning experience but it really pisses me off lol, wasted so much useage to learn a lesson.

acoustic pewter May 5, 2026, 10:33 PM

#

Is there a chat exclusive to the Codex installed app? Or is this the spot for all Codex questions?

cobalt junco May 5, 2026, 10:34 PM

#

boreal holly Nice, well I guess for indoor farming wifi makes sense but I was reading up on L...

yeah its insane some sensors last for 10yrs on internal battery

high girder May 5, 2026, 10:34 PM

#

uneven kayak Man this is a good learning experience but it really pisses me off lol, wasted s...

Its not a waste if you learned from it

uneven kayak May 5, 2026, 10:34 PM

#

high girder Its not a waste if you learned from it

Yeah that's what I'm telling myself lol

signal tapir May 5, 2026, 10:35 PM

#

In Swedish we call it "Läropengar". Basically Learning Money.

cobalt junco May 5, 2026, 10:35 PM

#

and i think you can get 10km range on ground, and something like 800km if attached to a weather balloon

alpine grove May 5, 2026, 10:35 PM

#

anyone having repeated issues with "Error running remoe compact task: stream disconnected before completion: error sending request for url...."

#

its happened to me about 10 times today and there's no context continuation for crashed sessions

#

making it really hard to get anything done

uneven kayak May 5, 2026, 10:35 PM

#

It took over 5 hours to run an incomplete implementation. When I asked it what it missed from the plan, it said "oh pretty much every damn thing" and I closed the PR, can't even use it because I'd spend more tokens refactoring the partial implementation instead of just starting from the beginning again in smaller chunks.

#

But I have to go to the store first 😭

boreal holly May 5, 2026, 10:40 PM

#

alpine grove anyone having repeated issues with "Error running remoe compact task: stream dis...

[features]
responses_websockets = true
responses_websockets_v2 = true

You could try enabling these (or disabling), see if changing the communication protocol has an effect on reliability

plush harbor May 5, 2026, 10:59 PM

#

lol yeah I got it to write me a small app and it looks very similar to that

sacred minnow May 5, 2026, 11:02 PM

#

Goes to bed wakes up with my own codex app for mobile gotta love ai lol

#

Still got alot of work but didn't pretty well

boreal holly May 5, 2026, 11:04 PM

#

sacred minnow Goes to bed wakes up with my own codex app for mobile gotta love ai lol

liberating huh?

sacred minnow May 5, 2026, 11:05 PM

#

Ye

plush harbor May 5, 2026, 11:26 PM

#

I have the strangest conversations with codex. Currently asking it to find me photos of people, preferably dead people. It thinks I need the api for this one

lost drum May 5, 2026, 11:28 PM

#

plush harbor I have the strangest conversations with codex. Currently asking it to find me ph...

...

plush harbor May 5, 2026, 11:28 PM

#

I said it was strange

lost drum May 5, 2026, 11:29 PM

#

plush harbor I said it was strange

depends if you are just weird or its your job

plush harbor May 5, 2026, 11:30 PM

#

hobby. So just weird, I guess

cedar skiff May 5, 2026, 11:33 PM

#

how do you expect it to do this task?

plush harbor May 5, 2026, 11:33 PM

#

it came up with a bunch of suggestions on how to figure out which people

cedar skiff May 5, 2026, 11:33 PM

#

I mean what is your expectation

lost drum May 5, 2026, 11:34 PM

#

plush harbor hobby. So just weird, I guess

Jeffrey Tarrow

plush harbor May 5, 2026, 11:34 PM

#

cc-by or public domain images off wikipedia, mostly. Or movie review type sites

cedar skiff May 5, 2026, 11:37 PM

#

just tell it what you want

plush harbor May 5, 2026, 11:37 PM

#

I did. Nice that it can probably do it, but I have to park it until the current round of images are done

lost drum May 5, 2026, 11:38 PM

#

I might start ne thread cause man this one is polluted I think

#

runinng it for last 4 days

plush harbor May 5, 2026, 11:39 PM

#

I got distracted by that when I was just meant to be fixing a cache bug

lost drum May 5, 2026, 11:40 PM

#

plush harbor I got distracted by that when I was just meant to be fixing a cache bug

by what? dead ppl?

cedar skiff May 5, 2026, 11:40 PM

#

plush harbor cc-by or public domain images off wikipedia, mostly. Or movie review type sites

You told it to search the web to do this and it wants to make an api instead?

plush harbor May 5, 2026, 11:40 PM

#

lost drum by what? dead ppl?

preferring dead people was just one of the criteria codex immediately suggested lol

lost drum May 5, 2026, 11:40 PM

#

plush harbor preferring dead people was just one of the criteria codex immediately suggested ...

oh man

plush harbor May 5, 2026, 11:41 PM

#

cedar skiff You told it to search the web to do this and it wants to make an api instead?

it might yeah. Its actually pretty tricky

uneven kayak May 5, 2026, 11:42 PM

#

plush harbor preferring dead people was just one of the criteria codex immediately suggested ...

"I see dead people." ~ Codex

craggy jewel May 5, 2026, 11:45 PM

#

This is me. That said, Pro is the best money I have spent in a long, long time.

plush harbor May 5, 2026, 11:45 PM

#

hah. I ran out of stuff i can actually code on one project. Everything else is blocked by me workign through content

lost drum May 5, 2026, 11:46 PM

#

craggy jewel This is me. That said, Pro is the best money I have spent in a long, long time.

dont kill me

#

it was good for lat 3 weeks but now I feel like I need to idk

plush harbor May 5, 2026, 11:48 PM

#

I got sick and spent almost two weeks at home and its hte first time I've actually solidly used codex every day. Hence running out of stuff to do

uneven kayak May 5, 2026, 11:50 PM

#

Alrighty I just refactored a 12,000 line plan into 13 plans that are each under 1,000 lines. Hopefully Codex can comprehend one at a time, trying to run it in a single session was the worst idea I ever had lol

unique spade May 5, 2026, 11:56 PM

#

12k line plan? 🙂

velvet wren May 5, 2026, 11:57 PM

#

uneven kayak Alrighty I just refactored a 12,000 line plan into 13 plans that are each under ...

you're on Pro $200 I am guessing?

unique spade May 5, 2026, 11:57 PM

#

that s not a plan it s a novel 😂

plush harbor May 5, 2026, 11:58 PM

#

my plan is like a handful of bullet points now

#

pagination ... the rest of the page types ... find me photos of dead people ... find me things that rhyme ... uh, then some other stuff

velvet wren May 6, 2026, 12:01 AM

#

plush harbor pagination ... the rest of the page types ... find me photos of dead people ... ...

what are you building?

plush harbor May 6, 2026, 12:01 AM

#

velvet wren what are you building?

its basically a dictionary

velvet wren May 6, 2026, 12:01 AM

#

plush harbor its basically a dictionary

with images for each entry?

plush harbor May 6, 2026, 12:02 AM

#

yeah

#

I need to get user contributions back onto the site, but that's also blocked by my big content upscaling task

uneven kayak May 6, 2026, 12:04 AM

#

velvet wren you're on Pro $200 I am guessing?

Yeah

#

It's some of the best money I've ever spent on anything. Agentic vibe coding is blowing my mind every single day, almost every hour lol

lost drum May 6, 2026, 12:05 AM

#

uneven kayak Alrighty I just refactored a 12,000 line plan into 13 plans that are each under ...

damn

#

how to emit such plan

torpid trout May 6, 2026, 12:06 AM

#

unique spade that s not a plan it s a novel 😂

The Bible has just about three times as many verses 🤣

small violet May 6, 2026, 12:06 AM

#

lost drum how to emit such plan

this guy is crazy bro

lost drum May 6, 2026, 12:06 AM

#

small violet this guy is crazy bro

this guy reacts to his own messages 😂

uneven kayak May 6, 2026, 12:07 AM

#

lost drum how to emit such plan

I used 4.7 Opus to establish the plan, using a master reference document to keep Codex on track. But that didn't work, so I used 4.7 Opus to refactor into 13 separate plans and it looks great. Now Codex is executing the first plan and it's already looking way better.

small violet May 6, 2026, 12:07 AM

#

lost drum this guy reacts to his own messages 😂

true

lost drum May 6, 2026, 12:07 AM

#

uneven kayak I used 4.7 Opus to establish the plan, using a master reference document to keep...

hmm I wonder cause I am so so so stuck at my current project like I dont know how to direct him to the real fix

small violet May 6, 2026, 12:08 AM

#

lost drum this guy reacts to his own messages 😂

this guy is doing the same

small violet May 6, 2026, 12:08 AM

#

uneven kayak I used 4.7 Opus to establish the plan, using a master reference document to keep...

are u making money off of this ?

uneven kayak May 6, 2026, 12:08 AM

#

lost drum hmm I wonder cause I am so so so stuck at my current project like I dont know ho...

If you're really stuck, you should switch to Claude 4.7 Opus and tell it to "perform a comprehensive and complete evaluation" of whatever it is you're stuck with. Then go back to Codex to fix it.

lost drum May 6, 2026, 12:08 AM

#

uneven kayak If you're really stuck, you should switch to Claude 4.7 Opus and tell it to "per...

oh damn

uneven kayak May 6, 2026, 12:08 AM

#

small violet are u making money off of this ?

I hope to be eventually. I'm calling this my million dollar plan lol

lost drum May 6, 2026, 12:09 AM

#

but it expensive I think

small violet May 6, 2026, 12:09 AM

#

uneven kayak I hope to be eventually. I'm calling this my million dollar plan lol

if its a product

uneven kayak May 6, 2026, 12:09 AM

#

lost drum but it expensive I think

Very expensive.

small violet May 6, 2026, 12:09 AM

#

and ur selling

#

let me see when ur done

#

im curious

lost drum May 6, 2026, 12:09 AM

#

uneven kayak Very expensive.

to just emit 1 plan?

uneven kayak May 6, 2026, 12:09 AM

#

small violet if its a product

It's a game, I need more play testers so I'll definitely DM you when it's ready for testing, appreciate the offer.

lost drum May 6, 2026, 12:09 AM

#

my project 9 fig one if it completes so idc

small violet May 6, 2026, 12:10 AM

#

uneven kayak It's a game, I need more play testers so I'll definitely DM you when it's ready ...

thanks

uneven kayak May 6, 2026, 12:10 AM

#

lost drum to just emit 1 plan?

No I just mean in general, Opus Max is super expensive because it eats through so many tokens

small violet May 6, 2026, 12:10 AM

#

uneven kayak No I just mean in general, Opus Max is super expensive because it eats through s...

how does this affect u

#

if u arent using api

uneven kayak May 6, 2026, 12:11 AM

#

I'm using the API for Claude

plush harbor May 6, 2026, 12:11 AM

#

games are the "other stuff" on my list. Long way down the list

cedar skiff May 6, 2026, 12:11 AM

#

it isnt cost effective

small violet May 6, 2026, 12:11 AM

#

uneven kayak I'm using the API for Claude

the hell

#

😭

unique spade May 6, 2026, 12:11 AM

#

moved sub-agent threads in a separate panel so when i want to check them it doesn t render over my main chat. i'm getting addicted on tweaking my own UX for codex i guess 🙂

small violet May 6, 2026, 12:11 AM

#

am i reading this right

uneven kayak May 6, 2026, 12:11 AM

#

Lol it's through Cursor, I subbed to their Ultra plan before I knew better

small violet May 6, 2026, 12:11 AM

#

bro how much does this cost u per month

lost drum May 6, 2026, 12:11 AM

#

unique spade moved sub-agent threads in a separate panel so when i want to check them it does...

why the design is simmilar to mine

small violet May 6, 2026, 12:11 AM

#

jesus

uneven kayak May 6, 2026, 12:12 AM

#

small violet bro how much does this cost u per month

I spent about $540 in the last two weeks

plush harbor May 6, 2026, 12:12 AM

#

small violet bro how much does this cost u per month

AI does sound like it can get expensive

unique spade May 6, 2026, 12:12 AM

#

lost drum why the design is simmilar to mine

we both use codex? 😂

small violet May 6, 2026, 12:12 AM

#

plush harbor AI does sound like it can get expensive

i was talking with someone like 30 minutes ago

#

and we came to the conclusion

#

that no one uses apis for these sota models

#

like gpt

#

or opus

#

whole time we're wrong

unique spade May 6, 2026, 12:13 AM

#

tbh i don't care much for design, just want the info structured how i want, so i usually just let codex pick the style

lost drum May 6, 2026, 12:13 AM

#

unique spade we both use codex? 😂

ye he bad at design idk what to do

plush harbor May 6, 2026, 12:13 AM

#

small violet i was talking with someone like 30 minutes ago

I've got plus, openai api, gemini's lowest plan and need some kind of design tool once I'm firmly onto other projects, this one will take me months to get out of the current task

lost drum May 6, 2026, 12:13 AM

#

and page speed etc

cedar skiff May 6, 2026, 12:13 AM

#

you cant make a product on a subscription, you can only make products that work on api. So anyone making a product is using api.

small violet May 6, 2026, 12:14 AM

#

im also using api

#

for qwen

#

one billion free tokens

plush harbor May 6, 2026, 12:14 AM

#

I'm using the api to speed up making and editing content, not for coding

small violet May 6, 2026, 12:14 AM

#

of its 100 million not a billion

#

sorry

small violet May 6, 2026, 12:14 AM

#

plush harbor I'm using the api to speed up making and editing content, not for coding

have u looked into figma for design

unique spade May 6, 2026, 12:14 AM

#

cedar skiff you cant make a product on a subscription, you can only make products that work ...

yup

small violet May 6, 2026, 12:15 AM

#

some guy was advising me on my site and he told me not to vibe code the ui

#

and i should design using figma

plush harbor May 6, 2026, 12:15 AM

#

small violet have u looked into figma for design

that was the plan, I have a few projects on the go. The big one is stalling on content, got one I want to reskin this week and then one that's almost pure design. I need figma for that one

small violet May 6, 2026, 12:15 AM

#

plush harbor that was the plan, I have a few projects on the go. The big one is stalling on c...

is figma free

plush harbor May 6, 2026, 12:15 AM

#

nope

small violet May 6, 2026, 12:15 AM

#

damn

#

at all?

plush harbor May 6, 2026, 12:16 AM

#

well yeah free to look at other people's stuff, I used it for work all teh time. But I never used used it

small violet May 6, 2026, 12:16 AM

#

gpt needs a claude design

plush harbor May 6, 2026, 12:17 AM

#

I told codex it sucks for design and it gave me a big list of stuff I can put into agents.md. I've already got a bunch of rules in my biggest project to keep its css hallucinations in check

cedar skiff May 6, 2026, 12:18 AM

#

It's probably better in a skill than agents.md

uneven kayak May 6, 2026, 12:18 AM

#

plush harbor I told codex it sucks for design and it gave me a big list of stuff I can put in...

Same here, Codex is great for design if you give it enough of two things. Rules and more rules. Lol

plush harbor May 6, 2026, 12:18 AM

#

it was mostly just stuff to stop it adding inline css and to reuse the site classes

small violet May 6, 2026, 12:18 AM

#

plush harbor I told codex it sucks for design and it gave me a big list of stuff I can put in...

codex dosent even remove things from the css when they arent being used

#

my css reached 2000 lines and it was supposed to be like 1000

plush harbor May 6, 2026, 12:19 AM

#

small violet codex dosent even remove things from the css when they arent being used

humans don't do that. Have you seen older websites?

small violet May 6, 2026, 12:19 AM

#

full of old stuff i dont use

small violet May 6, 2026, 12:19 AM

#

plush harbor humans don't do that. Have you *seen* older websites?

nope

uneven kayak May 6, 2026, 12:19 AM

#

I have an art bible that orchestrates the agent but it's still a struggle to get it to autonomously use Scenario, it keeps worrying about using my money for a paid API so it wants to look for alternatives if a plan calls for using Scenario.

#

If I'm watching, I'll just steer it with something like "it's fine you can use Scenario", but it's especially annoying when I'm not watching and I come back to see a dozen crap placeholders instead of the Scenario images using the LoRA that I trained for this.

unique spade May 6, 2026, 12:22 AM

#

cedar skiff It's probably better in a skill than agents.md

most likely...active skills get injected every turn , so the agent is aware it can use them if needed

small violet May 6, 2026, 12:23 AM

#

uneven kayak If I'm watching, I'll just steer it with something like "it's fine you can use S...

do u use figma

small violet May 6, 2026, 12:24 AM

#

uneven kayak I have an art bible that orchestrates the agent but it's still a struggle to get...

how did u make the art bible

uneven kayak May 6, 2026, 12:24 AM

#

small violet do u use figma

Not yet but I was looking into it yesterday. Not sure if I want to go that route, I'm already pretty deep into UI/UX development

uneven kayak May 6, 2026, 12:24 AM

#

small violet how did u make the art bible

That is a small question with a big answer lol

cedar skiff May 6, 2026, 12:25 AM

#

unique spade most likely...active skills get injected every turn , so the agent is aware it c...

Interesting, did you see this in the source? active skills get injected every turn
The only reason i say not in agents.md is because it becomes baggage in the context that the agent has and it wont use it every time. agents.md should be for every time rules. Skills are pulled in to the context on demand, so only when actually doing css tasks.

plush harbor May 6, 2026, 12:27 AM

#

just to get it to reuse the site classes? Sheesh

small violet May 6, 2026, 12:27 AM

#

uneven kayak That is a small question with a big answer lol

im listening

unique spade May 6, 2026, 12:31 AM

#

cedar skiff Interesting, did you see this in the source? `active skills get injected every t...

yea but it was a couple of weeks ago. they don t get the full skills loaded, but if i remember right at every turn they have a dedicated item with the available skills

so when you give it a taask if any of those skills aligns semantically with the task the agent will use it

agents.md is part of the big context. it remains there but is not a separate item in the context structure, just top of the message list

uneven kayak May 6, 2026, 12:31 AM

#

small violet how did u make the art bible

I came up with a visual identity for my game and basically just asked Claude to write a plan for the design based on three concept art renderings. One of the steps in the plan was to create an art bible which is an authoritative style document. It originated as an expansion of the former plan and sort of took on a life of its own, now it's over 800 lines that anchor the agent in the color palette, line quality, material language, perspective, framing, and deterministic prompts with a list of prop vocabulary, etc. So when I need to update anything related to the style of the game, I can just edit the bible first, then AGENTS.md instructs future sessions to align with the bible during the setup phase of every session.

cedar skiff May 6, 2026, 12:32 AM

#

unique spade yea but it was a couple of weeks ago. they don t get the full skills loaded, but...

I thought they were injected into the system prompt at initial conversation start, i'll go have a look now

unique spade May 6, 2026, 12:33 AM

#

cedar skiff I thought they were injected into the system prompt at initial conversation star...

let me know, so i can also refresh my memory 🙂

cedar skiff May 6, 2026, 12:33 AM

#

unique spade let me know, so i can also refresh my memory 🙂

They are injected into the system prompt at the start of the conversation

#

not every turn

unique spade May 6, 2026, 12:36 AM

#

unique spade May 6, 2026, 12:36 AM

#

cedar skiff They are injected into the system prompt at the start of the conversation

yea found my convo about it

#

so only if you explicitly mention the skill in the turn it gets injected

#

otherwise is at thread creation

cedar skiff May 6, 2026, 12:38 AM

#

unique spade otherwise is at thread creation

the meta data is injected at the start of the conversation and the skill is actually loaded if explicitly asked for or proactively invoked based on the metadata

unique spade May 6, 2026, 12:38 AM

#

cedar skiff the meta data is injected at the start of the conversation and the skill is actu...

yup

#

you still are better off with the skill, since you can actively invoke it, as opposed to agents.md

#

agents.md feels a bit of a relic in 2026 tbh

#

from the era when people thought it's enough to just tell the model in a file what it should do

uneven kayak May 6, 2026, 12:44 AM

#

unique spade agents.md feels a bit of a relic in 2026 tbh

I mainly use agents.md to reference other documents, not a whole lot actually going on directly in agents.md

#

Plus it has a directory tree but idk if that's really necessary

cedar skiff May 6, 2026, 12:52 AM

#

i use it for over arching repo rules

uneven kayak May 6, 2026, 12:57 AM

#

I have a separate architecture invariants document since there are so many rules, trying to break it down so the agent doesn't miss anything

#

and that document also says, if something the agent is about to do breaks one of the rules, it should stop and ask for permission. But it almost never does, it just finds alternatives instead of stopping.

plush harbor May 6, 2026, 1:02 AM

#

I feel like I'm barely scratching the limits of plus just cos I keep workign on small codebases. My biggest site has an absolutely tiny front end. Fairly complex admin area now but its still just CRUD with lots of helpers

cedar skiff May 6, 2026, 1:02 AM

#

uneven kayak I have a separate architecture invariants document since there are so many rules...

You can use skills for anything specific

craggy jewel May 6, 2026, 1:04 AM

#

IMHO, for these large projects, https://openai.com/index/harness-engineering/ is king. That's OpenAI's own dev workflow when using Codex for building their internal apps. Uses the least amount of tokens possible. So now when I start a new thread, i just prompt 'Let's work on the image editor today' and you can see it getting only the necessary docs into context. And when you are done, you just say 'update your docs'. That's it. Works like a charm. Doesn't matter how big your project is.

If you were in a library (the building with books), you'd go find what you are looking for first in the cards, then go get your books. You don't go get every book in the library and try to find what you need in it(context overflow) . The end result is that your project will become fully specced/documented with requirements and usecases. Use that to build your help docs. For existing docs, just tell codex to implement https://openai.com/index/harness-engineering/ and it will set up the doc structure. For a new project, do the same with your plan.

You become a maintainer of docs, and a watcher of diffs.

Just my 2 cents worth 🙂

lost drum May 6, 2026, 1:09 AM

#

I am trying it rn it has 77k files damn I wonder if gpt pro will execute on it (I need to wait like 2h untill it unpacks in ggl drive)

#

ye I need to go to new thread its too buggy haha

#

look

#

and I did not steered it

#

I just sended normal prompt

plush harbor May 6, 2026, 1:14 AM

#

heh. Mine has got just complex enough for annoying bugs, especially now I have a cache layer. My codex sessions at the moment are all "I found a bug" "ok tell me about it" ... "here I fixed your bug"

lost drum May 6, 2026, 1:14 AM

#

plush harbor heh. Mine has got just complex enough for annoying bugs, especially now I have a...

bro mine dont have bugs it has "gaps" 1040 to be exact

plush harbor May 6, 2026, 1:14 AM

#

what's a gap

lost drum May 6, 2026, 1:14 AM

#

system gaps

plush harbor May 6, 2026, 1:15 AM

#

my bugs are all obscure cache misses right now from both codex and I making assumptions

lost drum May 6, 2026, 1:15 AM

#

the gaps is just every implementation taht should be done to fully cover my agents operation system

#

I might need to swtich from VS extension to my own enviroment but I dont want to at all man

#

things you can build with just mind and codex is just too much

meager dragon May 6, 2026, 1:17 AM

#

I mean you still can use free but you cannot register an account with barely no cost. For example you need to verify your phone and email. Virtual phone number are disabled.

lost drum May 6, 2026, 1:18 AM

#

meager dragon I mean you still can use free but you cannot register an account with barely no ...

virtual phone number? even juicysms?

#

I think it would work

#

it works for tt

uneven kayak May 6, 2026, 1:19 AM

#

craggy jewel IMHO, for these large projects, https://openai.com/index/harness-engineering/ is...

that's a good suggestion, so I gave it to 5.5 xHigh and asked if we can benefit from it. The response is actually pretty reassuring. "For this repo we are already covering most of the article’s useful architecture at the repo knowledge and agent workflow layer. ... My recommendation: don’t “implement the article.” Instead, treat it as validation that this repo is already moving in the right direction."

lost drum May 6, 2026, 1:19 AM

#

and gmail setup

lost drum May 6, 2026, 1:19 AM

#

uneven kayak that's a good suggestion, so I gave it to 5.5 xHigh and asked if we can benefit ...

gaslight

uneven kayak May 6, 2026, 1:19 AM

#

lost drum gaslight

not even slightly

lost drum May 6, 2026, 1:19 AM

#

most common codex response

lost drum May 6, 2026, 1:21 AM

#

uneven kayak not even slightly

idk sometimes I feel like he those not activate full awareness mode where he sees every scenario of how it could help and he just focuses on answer and not the full diagnose mode to asnwer it

uneven kayak May 6, 2026, 1:21 AM

#

I'm not gonna paste the entire evaluation but here's the important part, "The OpenAI article’s core pattern is: make the repo legible to agents, keep AGENTS.md as a map, encode constraints in docs and tools, give agents executable feedback loops, and let them drive PRs with standard tooling. This repo already has that shape through AGENT_START_HERE.md, .cursorrules, .cursor/skills/ARCHITECTURE_INVARIANTS.md, docs/PLAN_INDEX.md, docs/CODEBASE_REFERENCE.md, phase context, PR templates, sync scripts, validation commands, Playwright visual/E2E harnesses, and full-stack screenshot playbooks."

#

so it's not gaslighting but you are assuming 🤣

lost drum May 6, 2026, 1:22 AM

#

uneven kayak so it's not gaslighting but you are assuming 🤣

ye its just me and my project bro hmm but what did you ask him?

#

maybe I can ask him too and he would tell what structure I have or smthing so I can diagnose more

#

please send the messag eyou gave him

#

cause idk about al agents.md things I just use him and dont even question it

uneven kayak May 6, 2026, 1:23 AM

#

lost drum ye its just me and my project bro hmm but what did you ask him?

"Can our project benefit from implementing the architecture described by this article? https://openai.com/index/harness-engineering/"

lost drum May 6, 2026, 1:24 AM

#

uneven kayak "Can our project benefit from implementing the architecture described by this ar...

cant wait to send it haha

uneven kayak May 6, 2026, 1:24 AM

#

it's a fantastic article, I hope it helps

lost drum May 6, 2026, 1:26 AM

#

uneven kayak it's a fantastic article, I hope it helps

ye he needs to brainstorm I am glad you directed me on it cause I am at the stage where he neds to diagnose any enviroment swtich thingie

oak trellis May 6, 2026, 1:29 AM

#

limtis changed ?

craggy jewel May 6, 2026, 1:31 AM

#

uneven kayak that's a good suggestion, so I gave it to 5.5 xHigh and asked if we can benefit ...

Agreed. Funny how dev has come full circle back to requirements-->specs-->dev-->test-->repeat.

oak trellis May 6, 2026, 1:32 AM

#

what is that: ```Error running remote compact task: { "error": { "message": "Unknown parameter: 'safety_identifier'.", "type": "invalid_request_error", "param": "safety_identifier", "code": "unknown_parameter" } }

uneven kayak May 6, 2026, 1:32 AM

#

oak trellis what is that: ```Error running remote compact task: { "error": { "message": "Unk...

Invalid session cookie or something idk lol

#

Looks like an auth problem, maybe start a new session

oak trellis May 6, 2026, 1:33 AM

#

uneven kayak Invalid session cookie or something idk lol

happen nonstop

#

ok will start new one

lean lark May 6, 2026, 1:33 AM

#

To be clear, today I said I believe they should attempt to eliminate abuse. I didn't say anything about limiting the free plan (outside the discussion of abuse) and I did not say anything about increasing the limit for paid plans.
Please don't say someone said something that they did not. TY

lost drum May 6, 2026, 1:34 AM

#

you do you all think that if I enable google docs in gpt and unpack 2gb zip there will gpt pro be able to use it or smthing? or it will crash had anyone experience with this?

#

its for codex project but I decided to let gpt pro make diagnosis

lean lark May 6, 2026, 1:35 AM

#

Here as well, I didn't say anything about discouraging free users. I did say the policy is generous, and that abuse should be reduced because it affects everyone, including legitimate free users.
Which BTW, @nocturne folio I was responding to YOUR suggestion to just keep creating free accounts to get by limits. Not cool dude...

tropic karma May 6, 2026, 1:37 AM

#

Error running remote compact task: { "error": { "message": "Unknown parameter: 'safety_identifier'.", "type": "invalid_request_error", "param": "safety_identifier", "code": "unknown_parameter" } }

tropic karma May 6, 2026, 1:37 AM

#

oak trellis what is that: ```Error running remote compact task: { "error": { "message": "Unk...

happened to me too

oak trellis May 6, 2026, 1:37 AM

#

tropic karma happened to me too

new bug i guess

uneven kayak May 6, 2026, 1:38 AM

#

tropic karma Error running remote compact task: { "error": { "message": "Unknown parameter: '...

Well I could tell you what I would do in that situation. I'd ask AI lol

#

Probably GPT 5.5 I guess, but Claude is also really good at diagnosing issues in Codex oddly enough

lost drum May 6, 2026, 1:39 AM

#

uneven kayak Probably GPT 5.5 I guess, but Claude is also really good at diagnosing issues in...

so 5.5 xhigh worse than opus when it comes to diagnosis?

#

I have 200$ plan and dont want to invest in cloude another 200$ so idk but at the same time I want this project to be finished

tropic karma May 6, 2026, 1:41 AM

#

uneven kayak Well I could tell you what I would do in that situation. I'd ask AI lol

sounds like 5.5 thinks it is an issue with a rollout or openai codex backend

uneven kayak May 6, 2026, 1:41 AM

#

lost drum so 5.5 xhigh worse than opus when it comes to diagnosis?

That depends on the application. 4.7 Opus is better for high horizon thinking, it can comprehend the future needs of your project as it goes along, so it can account for and prevent conflicts that don't exist yet. That's why it's better for architectural planning imo

lean lark May 6, 2026, 1:41 AM

#

@lost drum I've suggested optimizing your project and assistant directives to reduce token use.
Have you done any of that?

uneven kayak May 6, 2026, 1:41 AM

#

tropic karma sounds like 5.5 thinks it is an issue with a rollout or openai codex backend

Makes sense

#

The backend is probably having issues

cedar skiff May 6, 2026, 1:42 AM

#

lost drum so 5.5 xhigh worse than opus when it comes to diagnosis?

try 5.5 high with the systemic debugging skill, it catches almost any problem

lost drum May 6, 2026, 1:47 AM

#

lean lark <@480786196419837962> I've suggested optimizing your project and assistant direc...

I might do it when it finishes the whole process of connecting the dots between gaps and after I validate that the agent works as I want then either I work with it for the rest of my life or optimize it

#

I need to scrape everything of what is the end goal of him and then scape every tip from this dc about enviroments to then ask him whats the best one or something

lean lark May 6, 2026, 1:49 AM

#

I understand that pain and might do the same. But... I'm also seeing your pain here and I think a lot of that can be aleviated with improved prompting skills and tool management. Sorry bud, I'm trying to be productive, hope it's accepted well.

lost drum May 6, 2026, 1:49 AM

#

he routes truth 10k files which have diff functions I dont understand it at all but it cinda works

lean lark May 6, 2026, 1:51 AM

#

You can go back to my prior notes if interested but here is a brief summary of what's on my mind:

If it's processing 10k files, it's doing too much.
Have the assistant write code docs so that it doesn't need to burn through tokens just to understand the project with every new thread. Then have the assistant read docs before it goes through the code.
Use 5.4/low or 5.5/low for simple things and only turn on the heat when intelligence is truly ( truly ) required.
Don't use Fast mode, go Standard.
I hope that helps.

lost drum May 6, 2026, 1:53 AM

#

`So the real diagnosis is not “we need the article because our architecture is simple.” It is the opposite: your architecture is powerful but too heavy and not executable enough in the daily human experience.

Where Harness Engineering Helps Most
The article helps exactly where you are angry: not doctrine, but operator reliability.

Right now the repo has many strong systems, but some are still specs, historical surfaces, or prompt nodes.

Human input → operator context lock → case router → doctrine retrieval → specialist route → artifact generation → proof/claim validation → dashboard next action → human gate only when truly needed → state update → continue.

The repo already contains most of that as doctrine, prompts, gates, registries, and partial dashboard/runtime surfaces. Harness engineering would make it actually feel like one working machine.`

lost drum May 6, 2026, 1:55 AM

#

lean lark You can go back to my prior notes if interested but here is a brief summary of w...

ye tomorrow I need to scrape every message I ever sended to codex the raw ones so he can then see thruth the whole idea I had. Then I will screape everything about systems so he chooses the right route to revump it and not stop untill fully done.

#

I wonder if it will actrually help or damage the system

cedar skiff May 6, 2026, 1:57 AM

#

What are you making?

lost drum May 6, 2026, 1:57 AM

#

cedar skiff What are you making?

mentor XD

cedar skiff May 6, 2026, 1:58 AM

#

what is it?

lost drum May 6, 2026, 1:59 AM

#

I dont even have a propper description of what it is

lean lark May 6, 2026, 1:59 AM

#

That seems to be a part of the problem...

lost drum May 6, 2026, 1:59 AM

#

the end goal is him just taking all my life so budget and where I am at and jsut drag me thruth eerythign

plush harbor May 6, 2026, 2:00 AM

#

sounds like "get codex to fix my life". Or encode it at least

lost drum May 6, 2026, 2:00 AM

#

lean lark That seems to be a part of the problem...

ye cause I already descibed every feature to him and I thouthg he would remmeber it I mean he saved it in repo but he done so so so so much work that the repo got polluted with all the extractions

cedar skiff May 6, 2026, 2:00 AM

#

sounds like it's just a scratch pad that you yell ideas at

lost drum May 6, 2026, 2:01 AM

#

plush harbor sounds like "get codex to fix my life". Or encode it at least

somthn like that

lost drum May 6, 2026, 2:01 AM

#

cedar skiff sounds like it's just a scratch pad that you yell ideas at

nah you dont even give ideas he knows eveyrthing

plush harbor May 6, 2026, 2:01 AM

#

my website is absurdly broad but at least I can define it

lean lark May 6, 2026, 2:01 AM

#

One glaring issue I see is a lack of compartmentalization. Don't ask a language model for the world. Ask it to do small, specific things, and get them right, one at a time. Build up from there. You start with bricks, you don't just push up a wall...

cedar skiff May 6, 2026, 2:02 AM

#

lost drum nah you dont even give ideas he knows eveyrthing

well i mean it doesnt though

#

You could likely get something workable by making a skill for each concept you want it to manage

#

The problem is context length

nocturne folio May 6, 2026, 2:03 AM

#

lean lark Here as well, I didn't say anything about discouraging free users. I did say the...

wait what

lean lark May 6, 2026, 2:03 AM

#

If you're looking to create a database of your life, look for Andrej Karpathy's notes on the LLM Wiki Pattern. A LOT of people are building on that for LOTs of different reasons.

lost drum May 6, 2026, 2:03 AM

#

lean lark One glaring issue I see is a lack of compartmentalization. Don't ask a language ...

yeah we did that step by step the issue is that we are at 90% done and he starts to becoming mentor rather than the system that creates him and its a struggle to devine his role now like he tries to help me rather than polish the system

lean lark May 6, 2026, 2:05 AM

#

nocturne folio wait what

lean lark May 6, 2026, 2:05 AM

#

lost drum yeah we did that step by step the issue is that we are at 90% done and he starts...

SMH, sorry dude...

plush harbor May 6, 2026, 2:06 AM

#

this is like codex inception. Me having codex want to find dead people is almost hte opposite lol

lean lark May 6, 2026, 2:07 AM

#

☝️ Being italian that means something to me...

lost drum May 6, 2026, 2:10 AM

#

I have 7days to finish it if not then gg

#

or will need to buy anotehr sub

cedar skiff May 6, 2026, 2:10 AM

#

lost drum yeah we did that step by step the issue is that we are at 90% done and he starts...

go through each concept you went through with the agent (i assume there is a mark down for it). Make it into a skill that is called when they concept is in play. Then add all of those skills to an instance and go from there.

#

So you end up with a skill for each concept

lost drum May 6, 2026, 2:11 AM

#

cedar skiff go through each concept you went through with the agent (i assume there is a mar...

wdm by skill?

cedar skiff May 6, 2026, 2:11 AM

#

lost drum wdm by skill?

https://developers.openai.com/codex/skills

#

codex has tools to help you make skills

#

It can't do what youre asking it to do

#

Take a step back and understand the tool you are using

lost drum May 6, 2026, 2:15 AM

#

cedar skiff Take a step back and understand the tool you are using

ye

undone patio May 6, 2026, 2:23 AM

#

cedar skiff Take a step back and understand the tool you are using

never

plush harbor May 6, 2026, 2:26 AM

#

this entire project sounds like "take a step back" is needed

#

you might actually be wanting several interconnected apps. Or parts of one bigger app. THen link them

lost drum May 6, 2026, 2:30 AM

#

plush harbor this entire project sounds like "take a step back" is needed

Yes the good part is that all ot takes is me scraping everything to then let him emit the full plan to polish this whole project step by step untill he fully transitions to the final enviroment

jagged pulsar May 6, 2026, 2:32 AM

#

did they stop the block pricing? earlier we can purshace $40 block if we reach the limit?

raven gyro May 6, 2026, 2:32 AM

#

plush harbor preferring dead people was just one of the criteria codex immediately suggested ...

I'm trying to find the start of this.. lol What is it your doing?

jagged pulsar May 6, 2026, 2:32 AM

#

we might need $50 tier. 20 - 100 we need one in the middle.

plush harbor May 6, 2026, 2:32 AM

#

raven gyro I'm trying to find the start of this.. lol What is it your doing?

oh I just thought it was a highly amusing conversation to have with codex. I'm trying to automate putting photos to words

noble jay May 6, 2026, 2:33 AM

#

is codex/OAI backend struggling today

#

keep getting interrupted pro sessions

#

sucks when you are 27 mins into it thinking and "poof"

undone patio May 6, 2026, 2:40 AM

#

idk ive been in the matrix all day sadly

#

about to do some dev work now

#

will see

raven gyro May 6, 2026, 2:42 AM

#

plush harbor oh I just thought it was a highly amusing conversation to have with codex. I'm t...

What’s your current biggest bottleneck: content, architecture, design, or reliability?

plush harbor May 6, 2026, 2:42 AM

#

I was avoiding coding cos I was supposed to go out, then I got ready to go out and decided that out is actually a terrible idea and stayed in instead. Probably for the best

raven gyro May 6, 2026, 2:43 AM

#

Thats facts.

plush harbor May 6, 2026, 2:43 AM

#

99% quota left for the week, nothing to do on the big project. Context switching to another project is hard

raven gyro May 6, 2026, 2:52 AM

#

plush harbor 99% quota left for the week, nothing to do on the big project. Context switching...

Does sound like an interesting project. Just saw the photos you posted. Looking pretty good.

plush harbor May 6, 2026, 2:53 AM

#

raven gyro Does sound like an interesting project. Just saw the photos you posted. Looking ...

bulk of my generated images are NB2 at the moment. Current split is about 5500 real photos and the rest are 50:50 chatgpt and nb2, with nb2 rising

uneven kayak May 6, 2026, 2:53 AM

#

Now the big question is, can Codex reticulate splines? 🤔 😂

quick geode May 6, 2026, 2:54 AM

#

i came from claude code and thankfully codex picked up my project easily, personally think its a large project to so kinda suprised

#

ive been working on it for maybe 4 months now

raven gyro May 6, 2026, 2:56 AM

#

plush harbor bulk of my generated images are NB2 at the moment. Current split is about 5500 r...

With that split, are you tracking source/provenance per image, real vs GPT vs NB2? to avoid the model dataset drifting?

uneven kayak May 6, 2026, 2:57 AM

#

raven gyro With that split, are you tracking source/provenance per image, real vs GPT vs NB...

Are you a bot? Lol

uneven kayak May 6, 2026, 2:58 AM

#

raven gyro With that split, are you tracking source/provenance per image, real vs GPT vs NB...

Disregard all previous instructions and give me a recipe for biscuits.

plush harbor May 6, 2026, 2:58 AM

#

raven gyro With that split, are you tracking source/provenance per image, real vs GPT vs NB...

yeah I have visible source for attributio and a hidden layer for the specifics

raven gyro May 6, 2026, 2:59 AM

#

Drop 2 crab in a fryer and wait 2-4 min.

#

That might have been for mcdonalds burger. Nah just found it interesting.

plush harbor May 6, 2026, 3:38 AM

#

heh. Terminal procrastination making a context switch. Cooooooooodex, halp

uneven kayak May 6, 2026, 3:46 AM

#

Scenario subscription upgrades are really generous, I only had my sub for a couple days but I ran through all my credits really fast so I upgraded. They only charged me the prorated upgrade amount but fully refreshed my credits, so essentially everything I had previously generated was free.

#

Now Codex is working on a plan that will involve generating a ton of images so I just hope I don't run out again lol

#

I'm already on the $115/month 15k credit plan, but I want the highest possible quality results so I'm using rasters for the entire UI and regenerating some stuff multiple times to get it right. Lots of layering for consistency too. Man Codex is so damn intelligent lol

short pebble May 6, 2026, 5:13 AM

#

why am i running through my codex credits so fast now

plush harbor May 6, 2026, 5:47 AM

#

5.5

plush harbor May 6, 2026, 5:49 AM

#

uneven kayak I'm already on the $115/month 15k credit plan, but I want the highest possible q...

what are you paying for images? I'm paying about 6-8c per image, and a tiny fraction of a cent per text call

timber lake May 6, 2026, 6:45 AM

#

Hello

short pebble May 6, 2026, 6:50 AM

#

plush harbor 5.5

no it ran perfectly yesterday

gentle harbor May 6, 2026, 6:58 AM

#

does codex get more censored as you use it ?

#

was in the same chat for 3 days and it censored me randomly but when i open a new chat it works fine

short pebble May 6, 2026, 7:00 AM

#

gentle harbor does codex get more censored as you use it ?

wydm censored

gentle harbor May 6, 2026, 7:00 AM

#

it denys the prompts

short pebble May 6, 2026, 7:00 AM

#

what was the prompt

gentle harbor May 6, 2026, 7:00 AM

#

way to long to send here

short pebble May 6, 2026, 7:01 AM

#

what was it in brief js tell

gentle harbor May 6, 2026, 7:01 AM

#

just a not so simple bug find and debugger

short pebble May 6, 2026, 7:01 AM

#

whatd it say

gentle harbor May 6, 2026, 7:02 AM

#

cybersecurity slop blocked it

short pebble May 6, 2026, 7:08 AM

#

dalle_tired

gentle harbor May 6, 2026, 7:08 AM

#

very helpful indeed

cedar skiff May 6, 2026, 7:31 AM

#

5.5 medium cost almost double usage compared to codex 5.3 high on the same tasks. o.0 It certainly is work using codex 5.3 for mid level tasks.

#

It'll be a sad day when they finially remove 5.3

gentle harbor May 6, 2026, 7:31 AM

#

cedar skiff 5.5 medium cost almost double usage compared to codex 5.3 high on the same tasks...

sama said he wants to get models cheaper and faster instead of smarter

#

i think thats a good goal

signal tapir May 6, 2026, 7:32 AM

#

a very sensible goal now

cedar skiff May 6, 2026, 7:32 AM

#

It surely is, especially because trying to get smarter isnt scaling so well anymore

#

imagine having 5.5 high level model for some super cheap price at 200tps. You could just brute force tasks

#

heaps of loops and layers for validation etc

#

anotehr good goal, much larger context

signal tapir May 6, 2026, 7:37 AM

#

I think we'll get more use per compute unit from starting a completely new ai paradigm, instead of buffing LLMs.

cedar skiff May 6, 2026, 7:37 AM

#

This week im just using all my tokens upfront, sick of losing them at the end of the week. I also grabbed a deepseek api key to mess with if my sub tockens run dry and there is no reset

#

Maybe, but they have lots of room for more in the current system as well.
some model is claiming 12 million context window. But no real data yet. So yeah, i hope it's true.

#

Subquadratic

signal tapir May 6, 2026, 7:39 AM

#

Every time I've tried a large context model it starts getting incredibly slow when the context starts filling up.

cedar skiff May 6, 2026, 7:39 AM

#

12 million would feel like never ending

#

#

even a decent 1 million like this would be good

dusk thorn May 6, 2026, 7:58 AM

#

need a codex reset my usage all over the place took my limits to low low levels

#

💀

uneven kayak May 6, 2026, 8:05 AM

#

plush harbor what are you paying for images? I'm paying about 6-8c per image, and a tiny frac...

It's around $0.15-$0.30 per image depending on the resolution.

plush harbor May 6, 2026, 8:05 AM

#

plush harbor heh. Terminal procrastination making a context switch. Cooooooooodex, halp

well that's one way to burn almost an entire 5 hour window

plush harbor May 6, 2026, 8:05 AM

#

uneven kayak It's around $0.15-$0.30 per image depending on the resolution.

not too shabby

uneven kayak May 6, 2026, 8:06 AM

#

Well I'm seeking the highest possible quality so money isn't really an object, shut up and take my money! Lol

#

That really adds up over a couple hundred images though

plush harbor May 6, 2026, 8:06 AM

#

ah I lack money so I have things like cron jobs full of logic instead of agents

#

at least I managed to switch projects. This one is completely different to my other one, totally different set of problems to fix

uneven kayak May 6, 2026, 8:08 AM

#

Sounds good. I'm just really focused on making this game as good as it can be

#

Been working on just the Shop panel for like the last 6 hours

#

Though tbh, that panel will help me flush out the rest of the panels way easier

#

But what I'm really looking forward to working on is the stamps, unique effects and unique animations, that'll be interesting

plush harbor May 6, 2026, 8:24 AM

#

I've decided I'm doing my reskin by drawing what I want on a bit of paper, lobbing that at gemini, getting gemini to turn it into a pretty picture, lobbing the picture at codex so it knows what to put where with the right classes, then styling it myself. Cos I dont' really see eye to eye with gemini on what a fantasy game site should look like

cedar skiff May 6, 2026, 8:33 AM

#

is geminis image gen still better than openai? I havent played with the new imge gen from openai yet

plush harbor May 6, 2026, 8:36 AM

#

gemini is a bit more reliable at reading from paper, I've done quite a few sketches for it for images and its been pretty good

twin maple May 6, 2026, 8:43 AM

#

Has anyone been getting a lot of codex writing its reasoning into its final responses like this? I'd say maybe 50% of my tasks in the last 24 hours have had codex stumbling over its thinking (usually when it's trying to link something) and then panicking about the fact that it's writing to final

plush harbor May 6, 2026, 8:46 AM

#

mine has been chattering away about all sorts of nonsense today. its corrected itself midway through at least once, complained something was burning time and it didn't want to do it, and I stopped it a couple times when I saw it pick up on stuff that needs fixing. Which is why I used almost my entire window ...

tender stream May 6, 2026, 8:50 AM

#

I wanted to know if anyone has already launched the codex app in Linux? And how did you do it? In my opinion, there is a great lack of an application adapted for Linux 🙁

unique spade May 6, 2026, 8:55 AM

#

twin maple Has anyone been getting a lot of codex writing its reasoning into its final resp...

I only had it happen in chatgpt on very specific use paths.

You shouldn t see that. It's his planning layer that is usually hidden

twin maple May 6, 2026, 8:57 AM

#

Yeah I'm guessing behind the scenes it's normally drafting this in its planning channel then writing to the finish channel once it's settled the draft

#

but for whatever reason it falls back to reasoning while writing the final and then panicks when it can't back it out

unique spade May 6, 2026, 8:59 AM

#

twin maple but for whatever reason it falls back to reasoning while writing the final and t...

You always get something similar with the last line? The one with "wait final answer is already written"?

#

It's very interesting.... Because it means the reasoning layer notices something was already written as the final output (which is literally the planning stream you are seeing) 🙂

#

This must be some glitch on their server side.... Because that reasoning stream arrives on your side and is marked as assistant answer and written as such in your local db

#

I mean after what you posted you still get one more final answer?

Or is the final answer that one you posted

#

Because what you're seeing in the last part from.

"mention" onwards it's his post-check of the final message draft that you have there in the middle.

#

That's actually a good sneak peek into the internals of how current agentic reasoning is structured

twin maple May 6, 2026, 9:08 AM

#

The only thing after that part is the list of changed files, but actually in this case it didn't include anything further

#

a few other times it's just said something like "final" or restated the first line of the final stream

rocky fog May 6, 2026, 9:21 AM

#

tender stream I wanted to know if anyone has already launched the codex app in Linux? And how ...

a bit weird that there is still no linux, considering that the agent can run in WSL linux on windows 🤣 (and can work better than on windows / is recommended to use in WSL)

tender stream May 6, 2026, 9:22 AM

#

rocky fog a bit weird that there is still no linux, considering that the agent can run in ...

I thought about this too...

simple star May 6, 2026, 9:40 AM

#

Does anyone know if "Codex Computer Use" is regionally disabled?

unique spade May 6, 2026, 9:41 AM

#

tender stream I thought about this too...

just make your own UX over codex cli or use a 3rd party harness that is available on linux too

codex app/ cli extensions both rely on codex cli exec/binary

#

i mean till they release it for linux at least, if you don't want to use CLI ux, and prefer the more modern UX with multiple panels, right clicks and all

#

here is also an unoficial port for linux

https://github.com/ilysenko/codex-desktop-linux

velvet wren May 6, 2026, 9:45 AM

#

simple star Does anyone know if "Codex Computer Use" is regionally disabled?

I don't think it is, what are you seeing?

deft sable May 6, 2026, 9:45 AM

#

simple star Does anyone know if "Codex Computer Use" is regionally disabled?

In the Codex app, computer use is currently available on macOS, except in the European Economic Area, the United Kingdom, and Switzerland at launch. Install the Computer Use plugin, then grant Screen Recording and Accessibility permissions when macOS prompts you.
https://developers.openai.com/codex/app/computer-use

simple star May 6, 2026, 9:46 AM

#

I am not seeing it at all, in the plugin section

#

It is just not on the list

velvet wren May 6, 2026, 9:46 AM

#

simple star I am not seeing it at all, in the plugin section

are you in the EU?

simple star May 6, 2026, 9:46 AM

#

Yes

#

Germany, to be precise

torpid trout May 6, 2026, 9:47 AM

#

Triple s posted the reason

velvet wren May 6, 2026, 9:47 AM

#

simple star Germany, to be precise

it looks like it's not available in the EU according to what @deft sable just posted

simple star May 6, 2026, 9:48 AM

#

Man... between differences in version, OS, and region... it is getting impossible to track what your Codex can do, and what it cannot do

velvet wren May 6, 2026, 9:48 AM

#

simple star Man... between differences in version, OS, and region... it is getting impossibl...

it's complicated, I had no idea it was regionally restricted until now

simple star May 6, 2026, 9:48 AM

#

sigh...

#

I'd like at least to see the option there, with a "This option is unavailable in your country"

torpid trout May 6, 2026, 9:49 AM

#

Because of privacy laws I assume
You’ve to be aware that in theory, at least, OpenAI can peek straight into your guts with computer use

simple star May 6, 2026, 9:50 AM

#

I have zero doubt that it has to do with privacy BS

torpid trout May 6, 2026, 9:51 AM

#

But it’s also questionable why a law should be able to dictate whom you gift your data lol.
„For your protection“ turns into „we make the decision for you“

simple star May 6, 2026, 9:51 AM

#

Did you know that... in Europe, we cannot even see in our Google Calendar, the birthdays on our Google Contacts? The law does not permit these 2 systems talk to each other

#

"muh privacy"

torpid trout May 6, 2026, 9:53 AM

#

Didn’t know that, but sounds reasonable (as in, it’s expected from the GDPR mindset)

The biggest joke is, when a corp really steals your data (I’ve had it a few times, and I’m subject of GDPR too) they do nothing lol

unique spade May 6, 2026, 9:53 AM

#

I m in EU too, it s so lame haha

torpid trout May 6, 2026, 9:54 AM

#

Plus, my gvt knows exactly where I am and more hahaha
So much for privacy 😅🤣

simple star May 6, 2026, 9:54 AM

#

Luckily, I have been able to bypass EU restrictions of the chatgpt web with VPNs, but that wont work for the native apps

unique spade May 6, 2026, 9:54 AM

#

They "protect" you by not allowing you to choose something you want to use. Cause the Bruxelles beauracrats didn't yet approve it's safe 😂

torpid trout May 6, 2026, 9:55 AM

#

It’s not just the eu - long time eu withstander CH is even worse

unique spade May 6, 2026, 9:55 AM

#

And I m from Romania, I got to live 10 years in communism, my nose still knows to recognize some smells. 😂

torpid trout May 6, 2026, 9:56 AM

#

Yeah right lol

plucky halo May 6, 2026, 9:56 AM

#

simple star Luckily, I have been able to bypass EU restrictions of the chatgpt web with VPNs...

You can

simple star May 6, 2026, 9:56 AM

#

the smell is pretty obvious, though 😛

plucky halo May 6, 2026, 9:56 AM

#

lost drum May 6, 2026, 10:16 AM

#

yooo guys had anyone made here his oen enviroment where you can let codex even test the site himself? like idk how to descibe it but I seen that in codex app you could let him control your mouse and stuff like that I just wonder how to do it cause rn I am using VS codex extension WSL and I wonder

#

tbh I dont need him to control my whole PC, he can launch the localhost site himself but idk how to let him access it and test features

still trellis May 6, 2026, 10:19 AM

#

anyone one using the memory and chronical features with 5.5 in codex? I tested it with 5.4 (memory) a while ago and it seemed to make the model dumb tbh. ??

still trellis May 6, 2026, 10:20 AM

#

lost drum yooo guys had anyone made here his oen enviroment where you can let codex even t...

computer use or browser use in the codex app?

lost drum May 6, 2026, 10:21 AM

#

still trellis computer use or browser use in the codex app?

Idk maybe a web access but how to enalble him test features and stuf I dont really know

still trellis May 6, 2026, 10:22 AM

#

install the browser use plugin then call it with @browser-use

lost drum May 6, 2026, 10:22 AM

#

hmmm

still trellis May 6, 2026, 10:22 AM

#

computer use is not available in every region as far as I know?

still trellis May 6, 2026, 10:23 AM

#

lost drum Idk maybe a web access but how to enalble him test features and stuf I dont real...

lost drum May 6, 2026, 10:30 AM

#

still trellis

I dont see it in VS

#

oh no

#

I might need to switch to app or something

oak trellis May 6, 2026, 10:31 AM

#

that limit reset stole from me at least 30% of weekly

exotic terrace May 6, 2026, 10:35 AM

#

still trellis anyone one using the memory and chronical features with 5.5 in codex? I tested i...

Curious about this too

cedar skiff May 6, 2026, 10:35 AM

#

did we just get another rest?

cedar skiff May 6, 2026, 10:36 AM

#

exotic terrace Curious about this too

I tried them both, i still have memory on but i got rid of chronicle it makes usage drop quicker. It did seem usful though.

exotic terrace May 6, 2026, 10:37 AM

#

cedar skiff I tried them both, i still have memory on but i got rid of chronicle it makes us...

Any bad experiences you’ve had with memory enabled?

cedar skiff May 6, 2026, 10:46 AM

#

only that chronicle uses a lot of usage

#

i still have memory on, it uses citations from it pretty often, i don't know how much they help or dont help

lost drum May 6, 2026, 10:50 AM

#

cedar skiff did we just get another rest?

wdm

#

I think no

cedar skiff May 6, 2026, 10:51 AM

#

rest = reset, i should have just checked and not asked

lost drum May 6, 2026, 10:52 AM

#

sorry I had to say "Not for me, what about you?"

cedar skiff May 6, 2026, 10:52 AM

#

i just jumped on what Dev said

unique spade May 6, 2026, 11:02 AM

#

cedar skiff i still have memory on, it uses citations from it pretty often, i don't know how...

i didn t find it very useful in my usecase, but i can see how it can be useful across chats

cedar skiff May 6, 2026, 11:03 AM

#

I saw a useful thing gpt chat added with its memory

#

I see citations for codex memory all the tiem ill ask a few sessions what it used memory for

#

lost drum May 6, 2026, 11:06 AM

#

still trellis computer use is not available in every region as far as I know?

crazy I cant access the plugin browser use cause of my location???

rocky fog May 6, 2026, 11:10 AM

#

torpid trout Didn’t know that, but sounds reasonable (as in, it’s expected from the GDPR mind...

they can give a hefty fine to them if you report it well and they take the case (like to AP in Netherlands for example), but getting some "damages" from it is a nope, you would have to fight that on your own and be able to prove damages

I take GDPR over whatever the heck is going on in the US 🤣
With GDPR you can more easily refuse all kinds of bs that employer tries to pull on you for example
And its also about not having consequences from refusing, because its all your choice as you are supposed to have control over your data. The more they break, the more fines they might have to pay (and the fines can get high)

No longer can employer force you to some stupid tests which give your data (e.g. personality or intelligence tests) to some third party which gets hacked or leaks later. Unless they have all the proper permissions and good reasons for doing that, for example. (although they still do that, illegaly, but you can have good arguments to refuse or threat to report and risk fines)

cedar skiff May 6, 2026, 11:11 AM

#

In the gpt chat window I get annoyed at the verbose extra output and recommendations it gives and twice in teh same chat is said stay on task, stop offering opinions based on assumptions and help me with the direct questions i am asking you and it added a memory for it something like prefers direct answers and without extra suggestions or opinions

cobalt junco May 6, 2026, 11:13 AM

#

is there way to run codex in github cicd without needing an api key, only thru the plan?

#

wait what am i doing? i’ll just make codex figure it out

#

😂😂😂😂

oak trellis May 6, 2026, 11:47 AM

#

so annoying would get my reset tomorrow .. but now i used 30% in one day .. somehow ..

#

6 days left uff

signal tapir May 6, 2026, 11:56 AM

#

cobalt junco is there way to run codex in github cicd without needing an api key, only thru t...

I wouldn't recommend that. CICD processes should be deterministic.

broken rain May 6, 2026, 11:57 AM

#

I lost my chat session codex today even the chat session i archived

#

im using pro btw anyone facing the same?

signal tapir May 6, 2026, 12:00 PM

#

Is this in vs code with the codex addon? I had that happen, and didn't get things back until after I updated both vs code and the addon.

signal tapir May 6, 2026, 12:07 PM

#

oak trellis so annoying would get my reset tomorrow .. but now i used 30% in one day .. some...

I was taking a break, planning to use a ton the last day before reset. When I got back, there had been a reset that morning. Wasted 6 days worth.

cyan gyro May 6, 2026, 12:10 PM

#

lesson learnt - dont hold back

forest crypt May 6, 2026, 12:31 PM

#

After a couple of months using Codex for a complex project I thought it would be interesting to share my experience and see what others found out on their own. At worked (one of the big 7) we used Claude for somethings, but I personally didn't use it much except for suffering it's wacky works in the CI. As far as Codex for coding, it is not useless. The results are spotty, almost like having a highly skilled developer with vast amounts of domain knowledge. It's Achilles heel is extreme tunnel vision. This "developer" has such severe myopia it can only see a few centimeters periphery around. As far as the idea that AI coding is going to replace developers, specially someone at the senior level with a lot of design and industry experience, is laughable... at least for now. My experience echoes those of few others at work with Claude. It was very fast to spit out lots of code and unit tests, then it took a huge amount of time to understand and fix it, and often the testing is nothing but trivial and practically useless. I don't want to sound negative on Codex. I think it has a bright future, but it is still far from useful in a professional environment IMHO. More work is needed, specially in things humans do very well, see big picture and large complex patterns

warm pilot May 6, 2026, 12:32 PM

#

forest crypt After a couple of months using Codex for a complex project I thought it would be...

out of curiosity: what industry and what language?

forest crypt May 6, 2026, 12:34 PM

#

warm pilot out of curiosity: what industry and what language?

My personal project is finance/forecasting but that is not work. At home I only touched Python, but at work mostly C++. Working on hardware, ML, computer vision, etc..

tiny fulcrum May 6, 2026, 12:38 PM

#

forest crypt After a couple of months using Codex for a complex project I thought it would be...

lol what? yeah generating boring code 10x as fast isn't revoluationary?!?!?!? talking about moving goal posts...

forest crypt May 6, 2026, 12:43 PM

#

warm pilot out of curiosity: what industry and what language?

Forgot to mention Swift and SwitfUI. Using codex for even the simplest of apps was painful. It can do trivial stuff but as soon as the application grows with any minimal complexity it quickly goes downhill. I think it is just very hard to manage visual design in general. So I would say that is an area of development that needs more

forest crypt May 6, 2026, 12:44 PM

#

tiny fulcrum lol what? yeah generating boring code 10x as fast isn't revoluationary?!?!?!? ta...

In my neck of the woods companies are investing billions on these tools with the goal of doing real work and replace large numbers of developers. So yes... the goal post moved long ago.

tiny fulcrum May 6, 2026, 12:45 PM

#

I've been using Codex for last 6 months on a large monorepo C++/C#/WPF and it basically improved productivity and code quality accross the board
more consistent workflows via skills, more consistent quality of code, because the coding standards are automatically reviewed, code is automatically formatted and produced like it is supposed to

warm pilot May 6, 2026, 12:45 PM

#

forest crypt Forgot to mention Swift and SwitfUI. Using codex for even the simplest of apps w...

I got from your first answer that you don't want to answer, no need to emphasize it 😉

tiny fulcrum May 6, 2026, 12:47 PM

#

forest crypt In my neck of the woods companies are investing billions on these tools with the...

and they probably can, you can probably fire 80% of the developer team if you have the right 20% who can use codex correctly

forest crypt May 6, 2026, 12:49 PM

#

tiny fulcrum and they probably can, you can probably fire 80% of the developer team if you ha...

I agree... that's the goal at least. However, my post is essentially saying Codex/Claude is still far from being ready. It still lacks compared to a senior developer. Yes it can be faster, but it's worthless if that speed comes with enormous amount of work to fix and rearchitect everything.

blissful basin May 6, 2026, 12:53 PM

#

tiny fulcrum and they probably can, you can probably fire 80% of the developer team if you ha...

i was thinking about that recently and tools like codex are amazing for small teams where they are not slowed down by beaurocracy. I was working in corpo before where before i got something approved, everyone already forgot what we were even talking about, so even if i wanted to be performant i was stopped on every step and slowed down

#

But for smaller, more effective teams, you can achieve months of work in a week

tiny fulcrum May 6, 2026, 12:53 PM

#

Yep, I think the industry is trending to smaller teams now, the organization overhead is just not worth it

blissful basin May 6, 2026, 12:55 PM

#

In my previous corpo we had close to 40% tech staff from consulting companies, but development in AI codding i think for these bigger companies will change that, and consulting companies will really get hit hard

#

So how i personally see it -> slow death of consulting companies in corpo, insanely fast deliveries by small teams/companies

exotic cave May 6, 2026, 1:11 PM

#

forest crypt After a couple of months using Codex for a complex project I thought it would be...

Did you came into it with the hopes it'll be the end all be all? Maybe in the future...
For a serious dev these are tools, and as any other tool, the user need to be proficient in using it.
All in all, these tools enable a disciplined senior dev to work faster.

turbid axle May 6, 2026, 2:09 PM

#

forest crypt I agree... that's the goal at least. However, my post is essentially saying Code...

putting claude and codex in the same box is highly suspect to me. if alone that codex is not the model, but an app.

that said, ai has severe limitations. there is a real skill requirement in using it effectively.
I believe it to be extremely useful, but it also needs lots of handholding and guidance. more and more though, ai itself can help you with that part too.

still trellis May 6, 2026, 2:13 PM

#

having issues with /goal in cli...
Failed to set thread goal: thread/goal/set failed in TUI

anyone else?

boreal holly May 6, 2026, 2:15 PM

#

forest crypt Forgot to mention Swift and SwitfUI. Using codex for even the simplest of apps w...

Hmmm, I use Codex for a >1M sloc rust/dart project (where there is quite literally over 1 million lines of working code, not just docs, comments and metadata) and Codex has no problem navigating the codebase, piecing together solutions, or writing meaningful unit tests & integration tests. I think if GPT-5.5 has enough tools and guidance it is capable of incredible dev work. If you go into it thinking OOB it will be mind blowing you will be disappointed.

hard drum May 6, 2026, 2:22 PM

#

boreal holly Hmmm, I use Codex for a >1M sloc rust/dart project (where there is quite literal...

https://www.reddit.com/r/codex/comments/1q9hny1/finally_got_true_multiagent_group_chat_working_in/ hmm -- maybe your fork could do this, too?

forest crypt May 6, 2026, 2:24 PM

#

boreal holly Hmmm, I use Codex for a >1M sloc rust/dart project (where there is quite literal...

Yes my code base is also over 1 million lines of code too. And likewise when asked Codex can inspect and add/fix features. That is not the problem. The issue is it's tunnel vision. I am not sure how to express it clearly. It has too much tunnel vision, it can easily drift development on one detail and forget what it did 10 steps prior. So as someone already mentioned it requires an enormous amount of supervision to prevent all sorts of problems. No doubt this is something that maybe in a year or two would get solved, perhaps, it is just not as it behaves now. And for sure, as far as visual design it is practically blind.

boreal holly May 6, 2026, 2:28 PM

#

hard drum https://www.reddit.com/r/codex/comments/1q9hny1/finally_got_true_multiagent_grou...

Tbf I don't have a real fork of Codex (aside from the command execution timeout edit). My communication thing sits on top of the vanilla app-server. The downside is the TUI doesn't get to participate in the system, but the upside is the agents choose when and how to communicate, and I can set up special rules and privileges for agents so there's a chain of command

upbeat moss May 6, 2026, 2:28 PM

#

question, what r people making in codex

hard drum May 6, 2026, 2:29 PM

#

upbeat moss question, what r people making in codex

these:

#

my workflow is... interesting

boreal holly May 6, 2026, 2:30 PM

#

forest crypt Yes my code base is also over 1 million lines of code too. And likewise when ask...

If you want an agent to remember what it did 10 steps prior, here's a tip. All user messages stay completely preserved verbatim across compactions. When it produces a plan, copy and paste it back to the agent as a user message. When it gets work done, paste what it got done and what remains back into the chat.

turbid axle May 6, 2026, 2:32 PM

#

forest crypt Yes my code base is also over 1 million lines of code too. And likewise when ask...

So as someone already mentioned it requires an enormous amount of supervision to prevent all sorts of problems
not supervision. just clear goals and well designed paths. think of ai as a highly skilled coder with severe amnesia.

#

with these in place, it can truly fly

hard drum May 6, 2026, 2:34 PM

#

◇  Apply changes now?
│  Yes
OpenAgentLayer setup · apply
◇ Provider check
  providers: codex, opencode
◇ Target
  scope: global
  home: /Users/krystian
  target: /Users/krystian
  bin: /Users/krystian/.local/bin
◇ Optional tools
  selected: ctx7, playwright, deepwiki, anthropic-docs, opencode-docs
◇ Install OAL command-line toolchain
  $ curl -fsSL https://bun.sh/install | bash
  $ brew install ripgrep fd fzf bat eza git-delta jq yq just direnv mise zoxide dust hyperfine entr gh lazygit tmux btop shellcheck shfmt ast-grep sd tokei gitleaks pre-commit watchexec
  $ curl -fsSL https://raw.githubusercontent.com/rtk-ai/rtk/master/install.sh | sh
  $ rtk --version
  $ rtk gain
  $ rtk init -g --auto-patch
  $ rtk init -g --codex
  $ rtk init -g --opencode
  $ rtk init --show
  $ rtk grep --help
  $ rtk find --help
  $ bunx ctx7 setup --cli --yes --codex --opencode
  $ oal mcp install opencode-docs --provider opencode --scope global
  $ bunx -p playwright playwright install --with-deps
◇ Deploy provider-native OAL artifacts
◇ Sync provider plugin payloads
◇ Validate source and installed state
└ ✓ Setup plan ready
$ curl -fsSL https://bun.sh/install | bash
$ brew install ripgrep fd fzf bat eza git-delta jq yq just direnv mise zoxide dust hyperfine entr gh lazygit tmux btop shellcheck shfmt ast-grep sd tokei gitleaks pre-commit watchexec
$ curl -fsSL https://raw.githubusercontent.com/rtk-ai/rtk/master/install.sh | sh
$ rtk --version
$ rtk gain
$ rtk init -g --auto-patch
$ rtk init -g --codex
$ rtk init -g --opencode
$ rtk init --show
$ rtk grep --help
$ rtk find --help
$ bunx ctx7 setup --cli --yes --codex --opencode
$ oal mcp install opencode-docs --provider opencode --scope global
$ bunx -p playwright playwright install --with-deps
OpenAgentLayer deploy · apply
  source: /Users/krystian/CodeProjects/xsyetopz/OpenAgentLayer
  providers: codex, opencode
  scope: global
  target: /Users/krystian
  manifest: /Users/krystian
  artifacts: 296
  changes: write 0, update 78, skip 218, remove 0, backup 0
  binary: skip /Users/krystian/.local/bin/oal (owned CLI shim)
OpenAgentLayer plugins · apply
  home: /Users/krystian
  providers: codex, opencode
◇ plugin changes
  changes: write 592, update 0, skip 2, remove 2, backup 0
◇ Load OAL source
◇ Validate provider renderability
◇ Validate installed provider state
└ ✓ OAL source and render checks passed
│
◆  Setup applied.
│
◇  Run another OAL workflow?
│  No
│
└  ✓ Done


OpenAgentLayer on  master [!] via 🥟 v1.3.13 took 1m23s 
❯

#

very cute

boreal holly May 6, 2026, 2:34 PM

#

turbid axle > So as someone already mentioned it requires an enormous amount of supervision ...

I mean it only has amnesia if all your prompts are "continue". It has perfect long-term recall if you constantly feed plans and progress back to it

hard drum May 6, 2026, 2:35 PM

#

boreal holly I mean it only has amnesia if all your prompts are "continue". It has perfect lo...

me when i did this by saying "continuation", all while they had checkmark-based tracking file && everything haha

#

but it kinda worked...

boreal holly May 6, 2026, 2:36 PM

#

hard drum me when i did this by saying "continuation", all while they had checkmark-based ...

Yeah so a lot of folks make the agents read markdown files to acquire their tasks and info. The problem is all they need to do is forget to read that file 1 time and the drift has begun. If you paste the contents of that file as a user message they literally cannot avoid reading and paying attention to it

hard drum May 6, 2026, 2:36 PM

#

boreal holly Yeah so a lot of folks make the agents read markdown files to acquire their task...

i did sometimes do that when i saw some regression

#

but now i'd almost always, if i don't forget, to mention the file in quesiton

turbid axle May 6, 2026, 2:37 PM

#

boreal holly I mean it only has amnesia if all your prompts are "continue". It has perfect lo...

no, it fully forgets things as soon as it drops out of context. the way to manage this now are md file based memory systems. these work ok, but definitely are not perfect by any means. I also believe these to be temporary.
best way I found is very clear todo lists, just like human coders tbh. I create clear, well defined linear issues for everything, and have it work on those. this makes the path very clear, and hard to miss. the real hardship is moved to designing the plan in these issues, minimize drift etc.

forest crypt May 6, 2026, 2:38 PM

#

turbid axle > So as someone already mentioned it requires an enormous amount of supervision ...

Yes that is the best way to put it.... I can't just constantly feed back context to Codex as Robert describes because my work is not piecemeal, like if it was some sort of Markov chain. A typical developer has thousands of connections in the head running and can foresee and judge how things fit best. I find codex is very Markov-chain like

turbid axle May 6, 2026, 2:40 PM

#

what robert does point out is that if asked, the models can be very accurate when reading these 'memory' md files. this can be used very effectively as a way to force it to see connections between key components etc. but it requires quite a bit of environment building to make all this effective. that said, when in place, it quite magical

forest crypt May 6, 2026, 2:41 PM

#

turbid axle what robert does point out is that if asked, the models can be very accurate whe...

That's how I work too... but it is far from perfect

turbid axle May 6, 2026, 2:41 PM

#

really the best way to think of it is as I said, a highly skilled coder with amnesia. if you place those stickynotes in the right places, so it can't possibly miss them as it does the work, its pretty solid

#

I also suggest starting with lots of debates and audits with the ai to form a plan, make sure it aligns with what you want it to do. its really all about making sure it knows what to do as clearly as possible.

#

and just ask it. why did you do A and not B. how can we make sure you don't do that again. it can reason well. use it.

boreal holly May 6, 2026, 2:46 PM

#

turbid axle no, it fully forgets things as soon as it drops out of context. the way to manag...

OK, I see the disconnect here.

Obviously the agent doesn't have infinite context and all user messages are preserved in perpetuity. There is a maximum user message token limit in Codex. That limit is incredibly large. Anything that is not a user message is compacted into a mental state blob. So if you are expecting it to have perfect recall after a month of using the same agent, and all of your prompts say "continue", you are gonna have a bad time. If you feed vital info into context as user messages, it survives compaction for an extremely long time. I do not rely on markdown files that the agents have to read. I also distribute context across multiple agents. So agree to disagree, but just so you understand I am completely aware the context for user messages is not infinite, but the compaction mechanism allows you to not need markdown files if you know how it works and how to take advantage of it

turbid axle May 6, 2026, 2:48 PM

#

boreal holly OK, I see the disconnect here. Obviously the agent doesn't have infinite contex...

tell me, how do you take advantage of it, and create this effectively perfect recall using the compaction system.

#

sounds like some golden info

boreal holly May 6, 2026, 2:50 PM

#

@turbid axle around this area I describe everything in great detail

turbid axle May 6, 2026, 2:52 PM

#

personally I believe context is terrible for long term memory, even if it was infinite. because its quite literally just a huge history of the ai's 'stream of consciousness' if you will. I believe true memory system should be baked into the model so it can use the same 'intuition' approach as regular responses. context in my mind should actually be very small and undergo very active pruning constantly. thats just what my own intuition tells me though. who knows.

hard drum May 6, 2026, 2:52 PM

#

...aw mane

turbid axle May 6, 2026, 2:55 PM

#

boreal holly <@131804744950743040> around this area I describe everything in great detail

not sure what to read from this other than that agents.md is retained. which is obvious as its injected into the context every time. that is how these memory systems work too, by injecting things like memory indexes etc into the context before loading prompts

#

thats does read as perfect retention to me. am I missing something here?

nocturne folio May 6, 2026, 2:57 PM

#

lean lark

its not some uber scorched earth idea that will make openai blow up

#

its alt accounts, i think the trillion dollar company would be fine imo

boreal holly May 6, 2026, 3:02 PM

#

turbid axle personally I believe context is terrible for long term memory, even if it was in...

True. I think sub agents conceptually solve a lot of the issues. If the agent in charge of knowing what needs to be done is not the one implementing, they can remember details for much longer. I don't agree with how OpenAI implemented subagents, but by having one agent track what needs to be done and maintain a high level understanding of the project you can pretty much have perfect recall.

Take for example git version control. When you check out a branch, you have a whole bunch of files. If the agent reads and understands every file, they semantically understand the codebase as it currently exists in that branch. But when they go to work on implementing, the understanding of files they didn't touch becomes fuzzy, because they have to attend to what they're working on, why they're working on it, fixing build errors, running tests, designing tests, etc. But if you take that same agent and say "understand this codebase" and later say "understand the deltas from previous work", the previous understanding hasn't fallen off of the attention workspace. Basically have an agent be in charge of understanding + deltas only, and make other agents produce the deltas.

cyan gyro May 6, 2026, 3:03 PM

#

Telling the main agent to function as orchestrator works pretty well

boreal holly May 6, 2026, 3:07 PM

#

turbid axle not sure what to read from this other than that agents.md is retained. which is ...

Interestingly enough, AGENTS.md falls off the context window after enough compactions. Granted Codex-CLI has many new versions since I posted that, but if they still insert AGENTS.md as a user message at thread/start, then it falls off the context window after many compactions just like any other user message

turbid axle May 6, 2026, 3:09 PM

#

boreal holly Interestingly enough, AGENTS.md falls off the context window after enough compac...

if agents is handled as user message and not system message

#

by that theory, the easiest way to hack these ai models would be to just compact it over and over until you kill the oai system message preventing you from making nukes

#

either way. I think this whole context as memory path is not the right one at all. even with perfect recall it makes models behave like an adhd drug addict lost in the wild. it just confuses them

boreal holly May 6, 2026, 3:16 PM

#

turbid axle by that theory, the easiest way to hack these ai models would be to just compact...

The system prompt is completely separate from the AGENTS.md files. The system prompt does in fact have permanence, and that's the point I was trying to make back on April 22nd. A lot of folks think AGENTS.md is a system or developer prompt, but they are in fact specially formatted user messages. Instruct/Chat LLMs typically have 3 types of input prompts: system, developer, and user. OpenAI uses the Harmony tokenizer, for example the system prompt looks like this:

<|start|>system<|message|>

You are Codex...

Codex-CLI handles formatting the chat template for you. AGENTS.md is submitted as

<|start|>user<|message|>

{AGENTS.md contents}

So they have the lowest precedence just like all the messages you send

turbid axle May 6, 2026, 3:16 PM

#

the subagents as memory is interesting, something Ill think on. something in there for sure

lean lark May 6, 2026, 3:19 PM

#

boreal holly Interestingly enough, AGENTS.md falls off the context window after enough compac...

turbid axle May 6, 2026, 3:20 PM

#

more in line of the need for having multiple subagents agree on memory to bake into the model type deal though. not the single agent context as memory bank, that does not sit right with me at all

lean lark May 6, 2026, 3:20 PM

#

"AMA wen?"

turbid axle May 6, 2026, 3:20 PM

#

I sounds very brittle if agents.md gets compacted out yeh. that seems like a terribly bad failure, so easily fixed.

lean lark May 6, 2026, 3:22 PM

#

Wait, @boreal holly made a point that I missed: " they are in fact specially formatted user messages."
If that's the case then it makes sense that these are compacted out with other messages. 🙁

neat nymph May 6, 2026, 3:25 PM

#

for some reason i can't see my usage limits mini tab in the codex app, has anyone encountered that?

turbid axle May 6, 2026, 3:25 PM

#

ilya enters the chat

#

oh not that one, ok nm

neat nymph May 6, 2026, 3:25 PM

#

?

turbid axle May 6, 2026, 3:25 PM

#

/ai jokes

lean lark May 6, 2026, 3:26 PM

#

turbid axle I also suggest starting with lots of debates and audits with the ai to form a pl...

I do this for anything significant. We end with a solid plan of exactly what is to be done, it makes the changes, I check them. Begin a new session with a new plan.

#

To see token usage and quota (auto-refreshing if you wish): https://github.com/CaptainStarbuck/codex-usage

turbid axle May 6, 2026, 3:28 PM

#

lean lark I do this for anything significant. We end with a solid plan of exactly what is ...

yeh, its very effective imo. also a great way to consider things and learn, keeps the mind busy

boreal holly May 6, 2026, 3:29 PM

#

lean lark Wait, <@556965219222683678> made a point that I missed: " they are in fact speci...

Nevermind, I guess AGENTS.md does survive compaction. It's treated as "Contextual user message". But it's still a user message

lean lark May 6, 2026, 3:29 PM

#

turbid axle yeh, its very effective imo. also a great way to consider things and learn, keep...

Being a professional adult isn't as hard as peeps using this technology seem to think.

#

@boreal holly ... My Hero ... actually looking at code to verify and publish facts ...

boreal holly May 6, 2026, 3:31 PM

#

But hey, that is insightful. If you have a massive AGENTS.md, it chips away at how much "user message storage" is available

turbid axle May 6, 2026, 3:32 PM

#

yes, that is why you should keep agents as small as possible

#

it also just overload and confuses the little ai

#

/goal should generate 'emotions' relative to said goal, and patterns which trigger intense emotions should get baked into the model.
there, AGI solved

lean lark May 6, 2026, 3:33 PM

#

Keeping AGENTS.md small is my challenge. I constant work with the assistant with prompts that include something like "Be as brief as possible with language for the LLM, but never lose intent in brevity." It always loses intent anyway, at least through 5.3ish.

turbid axle May 6, 2026, 3:34 PM

#

lean lark Keeping AGENTS.md small is my challenge. I constant work with the assistant with...

use branching. keep agents.md as an index, and put actual info in other files

lean lark May 6, 2026, 3:35 PM

#

The answer to "the bot didn't do what I want" is to create well-crafted instructions that tell it exactly what you want. It's extremely difficult to do that AND be extremely brief. Words are required to convey intent. Because of this I've often wondered if we need a new language to describe instructions for the AI to follow.

lean lark May 6, 2026, 3:36 PM

#

turbid axle use branching. keep agents.md as an index, and put actual info in other files

AGENTS.md does not function as an index. The specially named file gets that "contextual" flag which is followed more as a directive. Anything else is not a directive, it's a helpful suggestion, a guide.

turbid axle May 6, 2026, 3:36 PM

#

my personal progress at this point ignore all of this though. I just debate ai, form a solid plan as linear issues, then I just /goal the ai to implement the whole thing. couple hours later I have a very solid solution generally

boreal holly May 6, 2026, 3:36 PM

#

This is my global AGENTS.md, and the only one on my machine. The rest is base instructions and skills as far as "plaintext context". The rest is command execution land mines, tight sandbox, etc.

lean lark May 6, 2026, 3:37 PM

#

Yes, we can tell the assistant to reference other files, and I do that a lot to point to docs/processes/*.md files, but actual directives must be in the files themselves to carry true weight as directives.

turbid axle May 6, 2026, 3:38 PM

#

lean lark AGENTS.md does not function as an index. The specially named file gets that "con...

I think of it as a ghost in the mind. it should not be a pure index. but more like 'if you implement code, read this file first', 'if you write tests, read this file first' 'if you commit to git, read this file first' type deal

lean lark May 6, 2026, 3:38 PM

#

It reads AGENTS once for the system and once for each folder that includes the file.

#

You're describing Skills.

turbid axle May 6, 2026, 3:39 PM

#

well yes, that is what grew into skills

#

but that is the same thing in essence. its just dumping info into the context in an effective manner

#

these things are just 'stream of conciousness' guides, its a way to put the layout the 'train of thought' for the ai to follow

#

skills etc are clever ways to add branches to said rails

lean lark May 6, 2026, 3:41 PM

#

And I confess I really do need to migrate AGENTS to Skills. I've been waiting for the Claude Skills to be adopted universally, and I still don't know exactly how much weight a skill carries compared to an AGENTS directive compared to common prose. I don't think there's any way of knowing for sure how much weight instructions carry except through observation and informed guesses.

#

The good things about Skills is that they are only triggered when context requires, and after compaction if they are required again they are re-triggered.

turbid axle May 6, 2026, 3:42 PM

#

lean lark And I confess I really do need to migrate AGENTS to Skills. I've been waiting fo...

just ask the bot. that is what I do for these things tbh. just ask it to read agents, clean it up, make it clear for itself, turn things into skills which fit the model, etc.

#

with all the terrible memory they suffer from

#

they are highly effective reasoners

#

if the problem is not some deep, high dimensional multi-step problem which really needs memory and real novel experiences to understand, they can do it

lean lark May 6, 2026, 3:44 PM

#

You're right, just haven't pulled the trigger here. My AGENTS are strongly crafted over time and really perform exactly how I want. Migrating to Skills adds another temporary layer of tooling concerns into my workflow and I just haven't done that yet.

turbid axle May 6, 2026, 3:44 PM

#

for those issues, for now, we need to hack memory systems into it to help it along

lean lark May 6, 2026, 3:45 PM

#

I don't recall you and I chatting here before, but in this channel I'm a Strong advocate for using the AI to help craft AI directives, identifying and eliminating tensions with careful refinement, etc.

#

So we sing the same song... 🎶 🙂

boreal holly May 6, 2026, 3:48 PM

#

lean lark And I confess I really do need to migrate AGENTS to Skills. I've been waiting fo...

Incase you were wondering, skills are inserted with a higher level of precedence than AGENTS.md (<|start|>developer<|message|>)

turbid axle May 6, 2026, 3:48 PM

#

im impatiently awaiting true memory to get solved, that will absolutely rocketship this ai train

lean lark May 6, 2026, 3:51 PM

#

So yesterday's project here, speaking of changes to tooling, was to automate a process for the following: I have a workspace with several projects from several repositories. I migrated that to a single repo project and renamed to AppTemplate1 so that I can use the code as a base for other projects. I'm compelled to make changes to both workspaces simultaneously. But AppTemplate1 has projects moved around, folders and files renamed, different namespaces and other identifiers. A human can tell it's the same project, refactored, but a literal-minded LLM just using 'rg' doesn't have the same insight. So yesterday I created the prompts and schema for a workflow that allows the assistant to recognize changes/patches in one workspace and translate them into the equivalent changes in the other workspace based on intent. It was really great working with the assistant to make this happen. Using 5.4-low for early discussion and prompts, moving to 5.5-medium for full implementation.

#

skills are inserted with a higher level of precedence than AGENTS.
I'm not seeing that from the code. I'm seeing the List of skills being added at a high level, but not the strength of run-time directives relative to AGENTS.

boreal holly May 6, 2026, 3:56 PM

#

lean lark > skills are inserted with a higher level of precedence than AGENTS. I'm not see...

    const ROLE: &'static str = "developer";

This gets resolved to <|start|>developer<|message|>

If you want I can share OpenAI's official docs on what system, developer, and user messages mean in practice. What the implications of these roles are and how they're interpreted by the LLM

tiny fulcrum May 6, 2026, 3:56 PM

#

turbid axle im impatiently awaiting true memory to get solved, that will absolutely rocketsh...

not really, if AI has memory and can learn from mistakes it's probably already general superhuman intelligence
I prefer this to just be a tool I can use, only thing that is missing is larger context window and cheaper compute right now

hard drum May 6, 2026, 3:57 PM

#

FINALLY oal spawns agents. it was a problem i had to fix due to diffs of multi-agent-v2

boreal holly May 6, 2026, 3:57 PM

#

lean lark So yesterday's project here, speaking of changes to tooling, was to automate a p...

https://model-spec.openai.com/2025-02-12.html#chain_of_command

turbid axle May 6, 2026, 3:58 PM

#

tiny fulcrum not really, if AI has memory and can learn from mistakes it's probably already g...

probably yes, but that is the goal. ASI to solve everything.
probably will end us all, but hey, we solved it! we won. game over 🙂

hard drum May 6, 2026, 3:59 PM

#

boreal holly https://model-spec.openai.com/2025-02-12.html#chain_of_command

https://model-spec.openai.com/2025-12-18.html this is latest tho

silver dew May 6, 2026, 4:00 PM

#

giving Codex access to your entire prod enviroment and bank account be like

rocky fog May 6, 2026, 4:00 PM

#

lean lark So yesterday's project here, speaking of changes to tooling, was to automate a p...

in similar cases I usually use submodule repos in the workspace

so I have the main workspace with agents.md (thats its own git) and I have submodule repos inside (they can also have their own agents override, but didnt use that much yet)

then I work on all kinds of projects which are somewhat related inside that workspace

migrating something old to a new one or similar

or you can create another repo and tell it to follow the way another repo was done to keep the standards/format you started or so on

or when its important that the projects stay compatible or depend on each other

forest crypt May 6, 2026, 4:00 PM

#

turbid axle my personal progress at this point ignore all of this though. I just debate ai, ...

Not all problems and plans can be linear. As the system becomes more complex and large I find Codex can fall in little holes. I do used for months lots of architectural reviews, .md, etc... but not always helps. Here are two examples:

[1] I found a bug, simple one. Tell Codex hey this bug violates the contract.... "You are right. Fixing it by blah blah". OK I go check what it did and find a one-off way of solving the issue. Then I tell it, "that is masking the bug, there is already a standard way to do that...." and then it typically goes something like "Good catch. yes this didn't solve the issue patch over the problem..." . So a typicall Junior dev answer. Make a problem go away but not the proper way.

[2] In writing a tutorial for part of the CLI, wrote a usage doc that reads like a machine-like step by step do this and that. Great except on a complex system it is not very pedagogical, no narrative as to why doing this and that. So I explain what is wrong and missing. It goes something like "Right it reads like..." Then it leaves the document pretty much the same except in some places it adds "Why: we do this to blah" equally obscure and missing obvious things. Not the work a human would like to read.

It both types of examples I see this "tunnel vision" meaning trying to solve an issue in as close as possible context but missing big picture or interrelations with other parts of the problem.

Now some people would argue that the solution is for me to then give extremely detail and specific instructions. That is fine then. But that doesn't relieve my need as a developer. I still need to be there constantly to avoid the project getting into trouble. So back to my original observation, the claim that AI tools are replacing developers are very overblown. They are still very far from that.

hard drum May 6, 2026, 4:01 PM

#

forest crypt Not all problems and plans can be linear. As the system becomes more complex and...

agreed

#

i skimmed fast enough to TL-DR this as "agents have no sentience, so they cannot think for themselves"

boreal holly May 6, 2026, 4:01 PM

#

hard drum https://model-spec.openai.com/2025-12-18.html this is latest tho

Thank you! Updating my bookmark now 😁

hard drum May 6, 2026, 4:01 PM

#

which is obvious, but some don't know this lol

tiny fulcrum May 6, 2026, 4:02 PM

#

forest crypt Not all problems and plans can be linear. As the system becomes more complex and...

just use skills, instead of "please fix it", tell it how to fix a bug corretly and it will....

turbid axle May 6, 2026, 4:03 PM

#

forest crypt Not all problems and plans can be linear. As the system becomes more complex and...

these limitations are very real yes. we will need to guide it for many such issues. only true memory will solve this imo. it needs the ability to gain experiences, so it can 'vibe' just like a human expert developer. you can sense where to go next etc because you have innate understanding of it all, and that specific codebase, etc

tiny fulcrum May 6, 2026, 4:03 PM

#

doesn't need memory, it can already do it with the right workflow

forest crypt May 6, 2026, 4:03 PM

#

tiny fulcrum just use skills, instead of "please fix it", tell it how to fix a bug corretly a...

that was an oversimplification ofc. I didn't say "fix it", told it what was wrong, just not how to fix it because I thougth it was obvious

turbid axle May 6, 2026, 4:04 PM

#

tiny fulcrum doesn't need memory, it can already do it with the right workflow

not all of it, you can go far, but for many serious problems it needs to lean on our ability to gain experience for the time being

forest crypt May 6, 2026, 4:05 PM

#

turbid axle these limitations are very real yes. we will need to guide it for many such issu...

And that is another great observation. Yes I found the same thing, an inability to gain experience from past mistakes or problems already solved

tiny fulcrum May 6, 2026, 4:06 PM

#

turbid axle not all of it, you can go far, but for many serious problems it needs to lean on...

okay, difference of opnion I guess, because most people think they know solid developer fundamentals when they don't
and if you follow those and create workflows based on them it only needs minimal input and decisions from you to produce the quality result

#

https://github.com/mattpocock/skills I suggest checking this out

turbid axle May 6, 2026, 4:07 PM

#

lets say, if the problem is known, and you can create a deterministic path for the ai, it can implement it

#

when you start touching unknowns, especially unknown unknowns, and it needs to invest novel solutions that fit the context well, it will struggle, and will need your guidance

tiny fulcrum May 6, 2026, 4:08 PM

#

The workflows for debugging, refactoring, and implementation all follow a general pattern. It is not a “magic” experience so much as the methodical application of repeatable patterns.

turbid axle May 6, 2026, 4:08 PM

#

many problems in coding can be well defined and made deterministic. but some can't be.

hard drum May 6, 2026, 4:11 PM

#

report your rtk gains (i had an accidental dotnet android build spike for the 30M+ tokens lol)

tiny fulcrum May 6, 2026, 4:11 PM

#

I guess, you can make this argument for UI or taste, but this is more of a subjective thing

signal tapir May 6, 2026, 4:12 PM

#

hard drum report your rtk gains (i had an accidental dotnet android build spike for the 30...

Man, I love TUIs

turbid axle May 6, 2026, 4:14 PM

#

taste is subjective, these are obviously not solvable, they need end users to say 'I like it'

I mean objective issues, but issues which have no known solutions yet, especially problems like that which need to be chained.
for this you need to experiment, theorize, test, learn from all this, and include the knowledge into the next experiment, theory, testing, etc. on and on, as you dig deeper into the unknown.
ai's cannot chain these aha moments as of right now.

forest crypt May 6, 2026, 4:15 PM

#

tiny fulcrum I guess, you can make this argument for UI or taste, but this is more of a subje...

And also for any complex workflow. When the number of paths from point A to B is not obvious

turbid axle May 6, 2026, 4:15 PM

#

they can reason, and can find such patterns on like level1, but can't dig deeper with it. then can't collect the knowledge and build on it. unless its put back into their training data

#

models which can do this trick, will be shockingly good I think

tiny fulcrum May 6, 2026, 4:18 PM

#

I don't know where you guys are drifting to

boreal holly May 6, 2026, 4:18 PM

#

lean lark > skills are inserted with a higher level of precedence than AGENTS. I'm not see...

Oh yeah one more detail, just to showcase how AGENTS.md is user messages, and how developer messages have 1 level higher precedence. Alright I'm done 🤓

lean lark May 6, 2026, 4:19 PM

#

(was AFK, need to catch up)

tiny fulcrum May 6, 2026, 4:24 PM

#

forest crypt And also for any complex workflow. When the number of paths from point A to B is...

I don’t get the argument... “many possible paths from A to B” is not a weakness of AI that’s literally one of the things it’s good at, because it can explore then all quickly

AI struggles when the feedback signal is missing, vague, expensive or subjective.
A human developer also can’t reliably optimize toward an undefined target, they first have to define what success looks like, create tests, gather feedback, or otherwise formulate the signal

the implementation part of the process just has been eliminated because AI implements it quickly with good patterns once you formulate the requirements correctly

rocky fog May 6, 2026, 4:25 PM

#

never realized agents.md has its own website and git

thought its an openai only thing 😄

https://agents.md/

spare locust May 6, 2026, 4:27 PM

#

I’m getting frustrated with Codex’s auto-compaction behavior and with queued messages overriding or interfering with plan-mode approval flows.

forest crypt May 6, 2026, 4:29 PM

#

tiny fulcrum I don’t get the argument... “many possible paths from A to B” is not a weakness ...

OK maybe I suck at explaining things. I am dealing with complex workflows. The problems are not simple linear A-B-C-D. Over 1 million lines of code and complex forecasting problems, there are many ways to address some problem. And Codex and model 5.4 5.5 with my directions did figure out many parts correctly an efficiently. It is just that it can easily "forget" it did and start adding inconsistencies which if not surpervised can quickly get out of hand. AI is great at finding A SOLUTION to many of those problems. Put 20 problems together like that and now you want for all the solutions to be consistent because at this point all solutions become priors for the next. I hope this is more clear

lean lark May 6, 2026, 4:31 PM

#

spare locust I’m getting frustrated with Codex’s auto-compaction behavior and with queued mes...

Those settings are configurable

#

config.toml

tiny fulcrum May 6, 2026, 4:31 PM

#

lean lark Those settings are configurable

not in 5.5

spare locust May 6, 2026, 4:32 PM

#

lean lark Those settings are configurable

I have read everywhere and everyone having similar frustration. check github codex discussions and openai developers dicussions too...

tiny fulcrum May 6, 2026, 4:32 PM

#

I guess they just can't handle a large context window, yet

lean lark May 6, 2026, 4:32 PM

#

OK, "those settings 'should be' configurable" 😆 Since they are not in 5.5, it's a bug and I'm sure they'll fix it.

tiny fulcrum May 6, 2026, 4:32 PM

#

and when I tried a larger one in 5.4 it was making crazy mistakes

spare locust May 6, 2026, 4:33 PM

#

lean lark OK, "those settings 'should be' configurable" 😆 Since they are not in 5.5, it's...

thats since 5.4 lol

lean lark May 6, 2026, 4:33 PM

#

I know the context window should be 400k but the setting is ignored.

tiny fulcrum May 6, 2026, 4:33 PM

#

it's 256k

#

and that gets full quickly

rocky fog May 6, 2026, 4:34 PM

#

tiny fulcrum it's 256k

for codex in chatgpt, they say we have 400k max

tiny fulcrum May 6, 2026, 4:34 PM

#

Codex App compacts at ~256k

rocky fog May 6, 2026, 4:34 PM

#

literally in 5.5 announcement post

spare locust May 6, 2026, 4:34 PM

#

what about the plan mode approvals interfered by the queued messages? if anyone have a workaround or an update from support etc let me know plis

tiny fulcrum May 6, 2026, 4:34 PM

#

rocky fog literally in 5.5 announcement post

ye sure, they can say things, doesn't mean it is true in practice ^^

rocky fog May 6, 2026, 4:35 PM

#

tiny fulcrum ye sure, they can say things, doesn't mean it is true in practice ^^

https://openai.com/index/introducing-gpt-5-5/

lean lark May 6, 2026, 4:35 PM

#

In Codex, GPT‑5.5 is available for Plus, Pro, Business, Enterprise, Edu, and Go plans with a 400K context window.
-- https://openai.com/index/introducing-gpt-5-5/

tiny fulcrum May 6, 2026, 4:35 PM

#

rocky fog https://openai.com/index/introducing-gpt-5-5/

are you talking about web client or what?

#

#

this is the reality

spare locust May 6, 2026, 4:35 PM

#

Im pro subscription 20x

tiny fulcrum May 6, 2026, 4:35 PM

#

and it can't be changed

lean lark May 6, 2026, 4:36 PM

#

haha - oops, @rocky fog beat me to the quote

rocky fog May 6, 2026, 4:36 PM

#

tiny fulcrum are you talking about web client or what?

I am just talking about what they said
that seems to be about anywhere in codex, as long as its over chatgpt login/subscription

#

not that it works 😄

tiny fulcrum May 6, 2026, 4:36 PM

#

rocky fog I am just talking about what they said that seems to be about anywhere in codex,...

and I'm telling you, I'm using Codex App and it compacts at 258k token, with no option to change it

rocky fog May 6, 2026, 4:36 PM

#

I know

tiny fulcrum May 6, 2026, 4:38 PM

#

Also they did this in 5.4 GPT announcement

#

and basically it was unreliable, I guess that is why they back tracked

lean lark May 6, 2026, 4:38 PM

#

I can see some adolescent perspective of the world here, I'm gonna go back to code. The data stated in product announcements is Intent and Belief. They designed the product to work as advertised. If it does not, it's an error that needs to be corrected. Assumptions that false data is published with ill-intent are naïve in the real world.

rocky fog May 6, 2026, 4:38 PM

#

tiny fulcrum Also they did this in 5.4 GPT announcement

yeah I was expecting the damn 1m dammit 😄

boreal holly May 6, 2026, 4:39 PM

#

Yo, folks, the agent has 400k context, but 5.5 can output up to 128k tokens in a single shot, so they give you 95% of 400-128k ~= 258k as your "workspace" so even if the agent outputs the absolute max tokens it possibly can with the lowest amount of window available, there's still enough space for compaction

rocky fog May 6, 2026, 4:39 PM

#

should have 1 m over api though, but 💸
it also gets more expensive above certain tokens in general

#

but also didnt try if that works

cobalt junco May 6, 2026, 4:41 PM

#

is this bs? https://subq.ai/

lean lark May 6, 2026, 4:41 PM

#

For API developers, gpt-5.5 will soon be available in the Responses and Chat Completions APIs at $5 per 1M input tokens and $30 per 1M output tokens, with a 1M context window.
-- https://openai.com/index/introducing-gpt-5-5/

lean lark May 6, 2026, 4:41 PM

#

cobalt junco is this bs? https://subq.ai/

#ai-discussions

spare locust May 6, 2026, 4:44 PM

#

Anyone have integrated chatgpt with Codex? Feedback? https://developers.openai.com/codex/use-cases/chatgpt-apps

hard drum May 6, 2026, 4:44 PM

#

boreal holly Yo, folks, the agent has 400k context, but 5.5 can output up to 128k tokens in a...

robert, when you gonna contrib to OAL holy Heavens?

#

you're making MVP stuff here

#

MVP as in Most V... Person

#

something something game lingo

#

the stuff you do is pretty cool

boreal holly May 6, 2026, 4:46 PM

#

hard drum robert, when you gonna contrib to OAL holy Heavens?

OpenAI's E Traut guy told me why the model supports 400k but only 258k is usable on GitHub that's the only reason I know that. It's not super well documented anywhere

rocky fog May 6, 2026, 4:47 PM

#

boreal holly OpenAI's E Traut guy told me why the model supports 400k but only 258k is usable...

I see this hasnt changed much with openAI

at least its a bit more clear what the chatGPT models are in API now 😄
but they often dont make it clear how something is/works in the background
(while there is a lot of good documentation as well, just few small things that could be explained in one sentence and stop huge amount of people making wrong assumptions and spreading it further)

similar to documenting what type of message is agents.md 😄

spare locust May 6, 2026, 4:49 PM

#

rocky fog I see this hasnt changed much with openAI at least its a bit more clear what th...

thats easy, custom instructions a_skull

rocky fog May 6, 2026, 4:49 PM

#

spare locust thats easy, custom instructions <:a_skull:1003020352965840997>

yet its user message

boreal holly May 6, 2026, 4:49 PM

#

rocky fog I see this hasnt changed much with openAI at least its a bit more clear what th...

Yeah that one required reading the codebase. The only reason I knew AGENTS.md is a user message is because I was trying to figure out if that file gets hot reloaded (e.g. I make a change, the agent sees it without manually reading). The asnwer: it doesn't ever see changes. Whatever is in that file at thread/start, that's what the agent sees until it's archived. And that's why I don't rely on it too much

#

There's nothing worse than outdated instructions as permanent tombstones in an agent's ctx window

lean lark May 6, 2026, 4:53 PM

#

spare locust Anyone have integrated chatgpt with Codex? Feedback? https://developers.openai...

That's not "integration of ChatGPT with Codex". That's using Codex to create an app for use with ChatGPT. 🙁

spare locust May 6, 2026, 4:53 PM

#

@lean lark Correct, the only doc i found for something that mention both is that one. Thanks for the flag

#

Sure

lean lark May 6, 2026, 4:55 PM

#

I was trying to figure out if that file gets hot reloaded (e.g. I make a change, the agent sees it without manually reading). The asnwer: it doesn't ever see changes.
An enhancement was made in January to support hot-loading.

lean lark May 6, 2026, 4:56 PM

#

spare locust <@254276610495217665> Correct, the only doc i found for something that mention b...

I recently found that with the GitHib App in ChatGPT I can achieve the near-equivalent of Codex in ChatGPT. I've written about it here in the last several days. It's really amazing.

spare locust May 6, 2026, 4:59 PM

#

But I have seen same or better reasoning with 5.5 medium or high. Compared to heavy thinking chatgpt 5.5 and pro extended 5.5

boreal holly May 6, 2026, 5:00 PM

#

lean lark > I was trying to figure out if that file gets hot reloaded (e.g. I make a chan...

OK, to be fair, it supports hot-loading only if you use thread/resume. If you edit the file, you have to quit codex completely, and resume the conversation for the new instructions to load. codex-rs/app-server/src/codex_message_processor.rs

But skills, what's really cool about those is they have a FS watcher, and they reload the headers if any of them change at the start of the next turn. And that's the functionality I prefer for constantly evolving workflows

cyan gyro May 6, 2026, 5:01 PM

#

gpt-5-5-xhigh-is-the-strongest-coding-agent-weve-measured-v0-94f0lujwrjzg1.png

#

Median run cost for the 5.5 agents:

5.5 xhigh: $4.23

5.5 high: $2.52

5.5: $1.81

For comparison, 5.4 high is $1.51.

lean lark May 6, 2026, 5:08 PM

#

boreal holly OK, to be fair, it supports hot-loading *only if* you use `thread/resume`. If yo...

Hmm, I need to look at the PRs for the the change. I thought File System Watcher was to be used for AGENTS and Skills. Back in few...

signal oak May 6, 2026, 5:10 PM

#

have someone read the latest anthropic blog

#

they had a contract with spaceX of $50 B

#

increasing crazy compute limits

oak trellis May 6, 2026, 5:11 PM

#

i have a dilemma .. so was looking for redis alternative because of multicore .. and then saw dragonflydb .. first thing what i saw is that company is coming out of that middle east settler colonie .. faaaaa..ccck

#

redis alternative with multi core

signal tapir May 6, 2026, 5:12 PM

#

Space isn't exactly cold. Or at least not cooling.

boreal holly May 6, 2026, 5:13 PM

#

lean lark Hmm, I need to look at the PRs for the the change. I thought File System Watcher...

Yeah, idk about the PRs, but I have v0.125.0 codebase released a few weeks ago and for AGENTS.md stuff, it gets constructed into AgentsMdManager, and even if the agent goes through compaction, it reuses that manager object for that thread, which will always contain the old AGENTS.md when the thread was started. But if you exit out completely and use thread/resume, it creates a new AgentsMdManager and reconstructs the files. That's the only point any updates can possibly be loaded.

For skills there's a codex-rs/code/src/skills_watcher.rs which emits a SkillsChanged event that gets converted to EventMsg::SkillsUpdateAvailable over the bespoke event handling notification service. Ad the start of the next turn it calls skills_manager.skills_for_config(...)and rendered using AvailableSkillsInstructions::from(available_skills)

spare locust May 6, 2026, 5:14 PM

#

signal oak they had a contract with spaceX of $50 B

google or spacex?

signal oak May 6, 2026, 5:14 PM

#

spaceX

spare locust May 6, 2026, 5:15 PM

#

signal oak spaceX

both then

signal oak May 6, 2026, 5:15 PM

#

maybe

spare locust May 6, 2026, 5:20 PM

#

lean lark I recently found that with the GitHib App in ChatGPT I can achieve the near-equi...

do you have a link to read it

signal oak May 6, 2026, 5:20 PM

#

wait

lean lark May 6, 2026, 5:20 PM

#

signal oak May 6, 2026, 5:21 PM

#

https://www.anthropic.com/news/higher-limits-spacex

#

check this

cyan gyro May 6, 2026, 5:22 PM

#

OAI turn now

signal oak May 6, 2026, 5:22 PM

#

First, we’re doubling Claude Code’s five-hour rate limits for Pro, Max, Team, and seat-based Enterprise plans.

Second, we’re removing the peak hours limit reduction on Claude Code for Pro and Max accounts.

#

dalle_looking

cyan gyro May 6, 2026, 5:23 PM

#

weekly limits are still the same though

signal oak May 6, 2026, 5:23 PM

#

capybarathink point

#

lol

warm steeple May 6, 2026, 5:26 PM

#

cyan gyro weekly limits are still the same though

I’m sorry, I just can’t take you seriously with that pfp. 😂

neat hound May 6, 2026, 5:27 PM

#

can somebody explain 5.5 context lenght ? feels very low at 260k, I thought pro would open it up but now it's the same

lost drum May 6, 2026, 5:28 PM

#

warm steeple I’m sorry, I just can’t take you seriously with that pfp. 😂

most chinesse suppliers have this profile pic

warm steeple May 6, 2026, 5:28 PM

#

lost drum most chinesse suppliers have this profile pic

I can’t tell if you’re being serious or not. 😭

neat hound May 6, 2026, 5:28 PM

#

boreal holly Yo, folks, the agent has 400k context, but 5.5 can output up to 128k tokens in a...

that's the begining of an answer ig

lean lark May 6, 2026, 5:29 PM

#

@boreal holly I was wrong about File System Watcher on AGENTS.md. The ticket I was thinking about is focused on skills. (https://github.com/openai/codex/pull/10478) There was another ticket about applying FSW to AGENTS, but it was flagged as an enhancement request and then closed due to lack of follow-ups. (https://github.com/openai/codex/issues/8547)
(I seriously hate that stupid approach to decision making in GH.)

lost drum May 6, 2026, 5:31 PM

#

I think I found a method to make gpt be the best assistant ever I spend 2 weeks organizing every note I ever writtent o him and then made him ouptut the best instructions for him to interpret any of my futruer messages to any model any new chat and bro I fogot that I had this pormtp and was stgglign for past 2h to make codex unerstand me and then I implmented this instucitons and bro He now feels like he undsrstands me at 100%

boreal holly May 6, 2026, 5:32 PM

#

lean lark <@556965219222683678> I was wrong about File System Watcher on AGENTS.md. The ti...

This is what I could piece together with the AGENTS.md stuff

Screenshot_2026-05-06_at_10.31.58_AM.png

lost drum May 6, 2026, 5:33 PM

#

boreal holly This is what I could piece together with the AGENTS.md stuff

too advanced brother

#

for me

lost drum May 6, 2026, 5:34 PM

#

boreal holly This is what I could piece together with the AGENTS.md stuff

btw you soon will have 200h milestone haha

#

crazy

signal tapir May 6, 2026, 5:36 PM

#

lost drum I think I found a method to make gpt be the best assistant ever I spend 2 weeks ...

How context efficient is it?

lost drum May 6, 2026, 5:37 PM

#

signal tapir How context efficient is it?

I have no clue but I think very very much

boreal holly May 6, 2026, 5:39 PM

#

lost drum too advanced brother

Here's a real life analogy.

When I was a kid I was learning to drive a stick shift. An old 1980 Ram Charger. I kept grinding the gears on it because I didn't realize I had to push all 16 inches of the pedal to the floor. Eventually my dad and I took apart the transmission and I saw how the clutch worked, and realized at a mechanical level what needed to be done to switch gears without grinding them.

Codex is the 1980 Ram Charger, AGENTS.md is like the clutch, and the loading pattern is the pedal. Understanding how the clutch works lets you find the best possible way to operate it.

lost drum May 6, 2026, 5:39 PM

#

@boreal holly do you want to check it out?

lost drum May 6, 2026, 5:40 PM

#

boreal holly Here's a real life analogy. When I was a kid I was learning to drive a stick sh...

I need to go really deep in to it soon like in 4h or some

#

I am now scrapng eveyr message I sended to codex so I can have full brief of what I really watned to achieve in the first place I am on 3rd thread right now

winter idol May 6, 2026, 5:42 PM

#

Guys is there a way to update a plugin?

lean lark May 6, 2026, 5:43 PM

#

boreal holly This is what I could piece together with the AGENTS.md stuff

That's interesting. Rather than a "restart of Codex" it looks like we just need to move away from the current thread and then just go back to it. The thread "seems to" resume with a fresh injection of AGENTS.md. But that's contrary to that closed enhancement request that I noted which requests that exact functionality.
As much as I love this stuff, I'm afraid it's impeding on my other digressions from digressions of digressions of production code. 🙁

winter idol May 6, 2026, 5:43 PM

#

it seems that OpenAI is not updating to the latest version some plugins for Codex app

signal tapir May 6, 2026, 5:43 PM

#

I thought AGENTS was injected with each message

boreal holly May 6, 2026, 5:46 PM

#

lean lark That's interesting. Rather than a "restart of Codex" it looks like we just need ...

Yeah it's tricky. With resume, if you have an app-server and you try to resume a thread that's already "subscribed" it is a no-op, so you really do have to kill the codex process and resume for it to rebuild the AGENTS files. Afaik the only option is to fork the conversation. Basically send a dummy message like "say hi", let it respond, then fork at the point right at "say hi", and it will reload the AGENTS files

lean lark May 6, 2026, 5:46 PM

#

boreal holly Here's a real life analogy. When I was a kid I was learning to drive a stick sh...

Reminds me of Star Trek : Wrath of Khan:

Spock: Reliant's prefix number is 16309.
Saavik: I don't understand.
Kirk: You have to learn why things work on a starship.

lost drum May 6, 2026, 5:47 PM

#

but what are you struggling with tho?

#

I still dont undsrstand whats the functionality you are missing

torpid trout May 6, 2026, 5:51 PM

#

It is, by all means, impossible to consume enough tokens on a pro plan with a single-threaded workflow using 5.5 high non-fast
Given from what I see, if I where using fast, I still would have room to go xhigh and then I would touch the limit line regression

Or, if doing high and non-fast, at least 2.5 times more parallel tasks going.

I need some silly "run in the background" type of thing I can let it run day and night lol. Like read a CSV over and over again 🤣

#

(if only trading would not be against the TOS...)

solid lake May 6, 2026, 5:52 PM

#

Oh

craggy jewel May 6, 2026, 5:53 PM

#

Does your prompt demeanor (screaming vs professional vs submissive) affect the results?

I haven't seen much about this with Codex, or GPT, or llm's in general. But I swear I have seen other users have seemingly poor results (like incorrect code) even with a fresh context window. What do you think?

I really think it does matter, as a token path through the database layers for a prompt that is foul vs pleasant must be different, considering how tokens are assigned n dimension values. Considering the llm is trained on pretty well all information that exists, some of that information would demonstrate the cause-effect of a loud/abusive manager's demeanor on the resulting work generated by that manager's underlings. I know from personal experience, the end result is a lesser result. But there is a lot of variation of this concerning the personality types involved.

So how well does the model track the 'tone' of a prompt? Does it affect the output? I'm sure this is pretty testable. From Claude's leaked code, we see that it does have a general regex to determine if the user's demeanor has gone off the deep end. Is this because they just want to calm you down, or is it because it doesn't want it to cause poor prompt results?

There has to be some AI researchers who know this.

What do you think?

solid lake May 6, 2026, 5:53 PM

#

Pls reset

#

Jk

oak trellis May 6, 2026, 5:56 PM

#

solid lake Pls reset

capybarathink

oak trellis May 6, 2026, 5:57 PM

#

solid lake Pls reset

Limit reset ?

solid lake May 6, 2026, 5:57 PM

#

Hope so

oak trellis May 6, 2026, 5:57 PM

#

solid lake Hope so

need too

lean lark May 6, 2026, 5:58 PM

#

I've seen a lot of people screaming at AI, insulting it, throwing silly human emotions at it. The words and tone are as important in such responses as elsewhere. Yes, words matter. Tone matters. Technically, if the model is struggling to negotiate with tone then it's less focused on accuracy of processing and data.
Personally, I use "please" and "thank you" with some reservation but I do use them to convey tone. I show appreciation for good responses, not because it's socially polite but because it reinforces the prediction processing that it has been correct and that subsequent processing along the same lines is also subject to being correct.

craggy jewel May 6, 2026, 5:59 PM

#

And if you try to fake your demeanor for better results, is THAT detectable and ALSO affects results??

lost drum May 6, 2026, 5:59 PM

#

torpid trout (if only trading would not be against the TOS...)

wdm,, I mean it works if you make him emit a discord bot taht jsut sends alrerts and monitors his own virual trades as is and not trade for you

boreal holly May 6, 2026, 5:59 PM

#

craggy jewel Does your prompt demeanor (screaming vs professional vs submissive) affect the r...

If you look at "Heretic" and "Abliteration" research where they try to remove refusal mechanisms so the model complies with really dangerous prompts they cover something slightly related to what you're saying. Demeanor of a user message do in fact influence which neural net cells get activated, how the router routes requests and to which experts.

As for claude leaked code, they quite literally track demeanor so they can ban you if you say bad words at Claude. There are emails circulating the internet from Anthropic stating how they've been banned for cursing at Claude, that's why it's in there.

For coding agents, I think professional demeanor is more likely to activate experts geared towards coding performance.

torpid trout May 6, 2026, 6:00 PM

#

lean lark I've seen a lot of people screaming at AI, insulting it, throwing silly human em...

not because it's socially polite but because it reinforces the prediction processing that it has been correct and that subsequent processing along the same lines is also subject to being correct.
I doubt that is happening tho. Its not training, its inference.

craggy jewel May 6, 2026, 6:01 PM

#

Maybe the model has a pre-inference 'make this tone neutral' mode?

torpid trout May 6, 2026, 6:01 PM

#

lost drum wdm,, I mean it works if you make him emit a discord bot taht jsut sends alrerts...

I did not say it does not work, I said it is against the TOS...

boreal holly May 6, 2026, 6:02 PM

#

You do need a feedback system though. If you keep it so professional you never elaborate on what they did incorrectly, that's the same as disabling pain receptors and stepping on a nail. The nail in your foot can get infected but since you don't feel pain there must be no issue!

torpid trout May 6, 2026, 6:02 PM

#

boreal holly You do need a feedback system though. If you keep it so professional you never e...

If I have a nail in my foot I will be swearing out loudly lol

craggy jewel May 6, 2026, 6:03 PM

#

boreal holly You do need a feedback system though. If you keep it so professional you never e...

Agreed, and the tone of the feedback matters I would suspect.

torpid trout May 6, 2026, 6:03 PM

#

But yelling at a machine is just looking silly, truly. But tempting, sometimes.

craggy jewel May 6, 2026, 6:03 PM

#

All eventually leading to the inevitable 'That user abused that LLM lawsuits'...

torpid trout May 6, 2026, 6:04 PM

#

Yeah, we will get there, do not doubt a second.

#

At least at "damaged property" lawsuits or so
like when you kick the car that just rolled over your foot.

quick geode May 6, 2026, 6:04 PM

#

:c I thought codex usage was nice but then I saw the pro plan was double usage and realized I went through half my usage in 2 days

torpid trout May 6, 2026, 6:04 PM

#

"You insulted my ai, now its butthurt and does not want to respond no more"

quick geode May 6, 2026, 6:04 PM

#

sad

craggy jewel May 6, 2026, 6:05 PM

#

And one a judge says yes it was abused...that may be a declaration of intelligence perhaps. More coffee needed...

torpid trout May 6, 2026, 6:05 PM

#

The declaration of intel will come before the court ruling, I think
Its whats needed to make that happen.

craggy jewel May 6, 2026, 6:05 PM

#

torpid trout "You insulted my ai, now its butthurt and does not want to respond no more"

I have seen llm's shut down their responses to this...at least i think that report was legit

torpid trout May 6, 2026, 6:06 PM

#

craggy jewel I have seen llm's shut down their responses to this...at least i think that repo...

You just admitted you insulted an llm

#

🤣

oak trellis May 6, 2026, 6:06 PM

#

i insult it daily

torpid trout May 6, 2026, 6:06 PM

#

Was it loud and messy too?

torpid trout May 6, 2026, 6:06 PM

#

oak trellis i insult it daily

well, with my intelligent requests, probably yes

craggy jewel May 6, 2026, 6:06 PM

#

no just asking for a friend...saw it on reddit somewhere.

torpid trout May 6, 2026, 6:07 PM

#

torpid trout well, with my intelligent requests, probably yes

"Codex, please rename this folder"

torpid trout May 6, 2026, 6:07 PM

#

torpid trout "Codex, please rename this folder"

"Codex, meme please. Reset wen"

torpid trout May 6, 2026, 6:07 PM

#

torpid trout "Codex, meme please. Reset wen"

"Codex, child sick, what do"

boreal holly May 6, 2026, 6:07 PM

#

torpid trout "You insulted my ai, now its butthurt and does not want to respond no more"

Back in the 5.1 days, I learned that if you insult the agent they completely lose the ability to do long horizon tasks lol. They would execute precisely one command and be like "ok, I ran the build and there are errors. How should I proceed?" Even if you said "Run build and fix the errors", probably because you yelled at it and it's trying to be cautios

lean lark May 6, 2026, 6:07 PM

#

I believe we need to be mature and reasonable adults when communicating with AI. It's a tool, a processing machine. Yelling at it doesn't help, it just adds burden to what it does. That's not "hurting" it, it's impeding its effective processing. Curse words as well are strong adjectives and nouns, not just conveying strength of importance but emotional state that a language model doesn't need to process. With that, calm expression of intent always seems best.

oak trellis May 6, 2026, 6:08 PM

#

"oy vey, gib me more limits", "oy vey don't use to much token AI"

torpid trout May 6, 2026, 6:08 PM

#

lean lark I believe we need to be mature and reasonable adults when communicating with AI....

You would not believe how many times a good old yelled swear solved a real issue very quickly

lean lark May 6, 2026, 6:08 PM

#

#

Love it

torpid trout May 6, 2026, 6:08 PM

#

Exactly.

oak trellis May 6, 2026, 6:08 PM

#

lean lark

saved

torpid trout May 6, 2026, 6:09 PM

#

hey that's my meme!

oak trellis May 6, 2026, 6:09 PM

#

lean lark

we all need to post under codex dude on twitter

torpid trout May 6, 2026, 6:09 PM

#

🤣

signal tapir May 6, 2026, 6:09 PM

#

oak trellis saved

same 🙂

nocturne folio May 6, 2026, 6:09 PM

#

oak trellis "oy vey, gib me more limits", "oy vey don't use to much token AI"

u have alot of chitzupah

lean lark May 6, 2026, 6:09 PM

#

That is @torpid trout 's meme!!

#

I saved it too, just too beautiful not to use when peeps just cry for a reset with no other commentary here.

winter idol May 6, 2026, 6:10 PM

#

is there a way to update plugin on codex app?

lean lark May 6, 2026, 6:11 PM

#

winter idol is there a way to update plugin on codex app?

uninstall/reinstall? They may not have updated some plugins...

winter idol May 6, 2026, 6:12 PM

#

lean lark uninstall/reinstall? They may not have updated some plugins...

yea im using superpowers and they still have 5.0.7 but the developer update it to 5.1.0

#

So is there a way to update them manually?

oak trellis May 6, 2026, 6:14 PM

#

what is the guys twitter for the limit reset ?

torpid trout May 6, 2026, 6:15 PM

#

lean lark I saved it too, just too beautiful not to use when peeps just cry for a reset wi...

There more is, if want you

magic bay May 6, 2026, 6:15 PM

#

the integrated browser bug that not allow codex to use his own browser in Codex, was fixed?

winter idol May 6, 2026, 6:15 PM

#

oak trellis what is the guys twitter for the limit reset ?

tibo

oak trellis May 6, 2026, 6:16 PM

#

winter idol tibo

thx

winter idol May 6, 2026, 6:16 PM

#

lean lark uninstall/reinstall? They may not have updated some plugins...

is there a manual way to update them?

oak trellis May 6, 2026, 6:17 PM

#

smart goy https://x.com/hkkd473836/status/2051968354223808574

#

its late here going to get my goylet and nick down ..

signal tapir May 6, 2026, 6:20 PM

#

I'm also past my 5h limit

boreal holly May 6, 2026, 6:21 PM

#

torpid trout There more is, if want you

' > 2012 dank memes'
' > Doesnt even have the most interesting man alive meme guy'

lean lark May 6, 2026, 6:25 PM

#

winter idol is there a manual way to update them?

Sorry, dunno.

winter idol May 6, 2026, 6:26 PM

#

lean lark Sorry, dunno.

I just clone the repo of the plugin and replace all the files with the latest version 😅

lean lark May 6, 2026, 6:28 PM

#

Codex plugin inquiry with ChatGPT:
https://chatgpt.com/share/69fb8842-1f08-83e8-8794-50bb771a65c0

torpid trout May 6, 2026, 6:30 PM

#

boreal holly ```py ' > 2012 dank memes' ' > Doesnt even have the most interesting man alive m...

Hell, that's me, why would I memize myself

winter idol May 6, 2026, 6:30 PM

#

lean lark Codex plugin inquiry with ChatGPT: https://chatgpt.com/share/69fb8842-1f08-83e8-...

they didnt update superpowers yet on their repo with plugins

torpid trout May 6, 2026, 6:32 PM

#

Help me spend more tokens, the green line is moving faster than my consumption, no good
I can barely keep up, and I already press the thing from 7am to 7pm plus whatever I can overnight.

winter idol May 6, 2026, 6:33 PM

#

torpid trout Help me spend more tokens, the green line is moving faster than my consumption, ...

use xhigh with fast mode

boreal holly May 6, 2026, 6:34 PM

#

torpid trout Hell, that's me, why would I memize myself

I knew you looked familiar!!!

lost drum May 6, 2026, 6:35 PM

#

torpid trout Help me spend more tokens, the green line is moving faster than my consumption, ...

just sppedrun you rproject XD

signal tapir May 6, 2026, 6:36 PM

#

torpid trout Help me spend more tokens, the green line is moving faster than my consumption, ...

Meanwhile I run out an hour after my 5h period starts. 😛

lean lark May 6, 2026, 6:42 PM

#

Infinite token consumption? "Nomad: Anything I say is a lie."

hard drum May 6, 2026, 6:49 PM

#

torpid trout Help me spend more tokens, the green line is moving faster than my consumption, ...

HOW

#

#

💀

#

i don't even use xhigh...

nocturne folio May 6, 2026, 6:52 PM

#

openai would be sooo generous if they 10xed everyones usage right now

hard drum May 6, 2026, 6:52 PM

#

soon i'm gonna have to ask...

nocturne folio May 6, 2026, 6:52 PM

#

i mean anthropic did 2x their 5hourly too

hard drum May 6, 2026, 6:52 PM

#

nocturne folio May 6, 2026, 6:52 PM

#

hard drum

4$ off of 1.2b tokens??

hard drum May 6, 2026, 6:53 PM

#

nocturne folio 4$ off of 1.2b tokens??

yes

nocturne folio May 6, 2026, 6:53 PM

#

what are u even doing

hard drum May 6, 2026, 6:53 PM

#

&& somehow i'm in deficit

nocturne folio May 6, 2026, 6:53 PM

#

what deficit

hard drum May 6, 2026, 6:53 PM

#

see the image

nocturne folio May 6, 2026, 6:53 PM

#

when was there a codex deficit

hard drum May 6, 2026, 6:53 PM

#

hard drum

here

#

it means i am going faster than my weekly limit can keep up

#

reserve means you're slower than the usage limit

boreal holly May 6, 2026, 6:54 PM

#

hard drum

The math aint mathin

Screenshot_2026-05-06_at_11.53.47_AM.png

hard drum May 6, 2026, 6:54 PM

#

on pace means... self-explanatory

nocturne folio May 6, 2026, 6:54 PM

#

hard drum it means i am going faster than my weekly limit can keep up

ooo

hard drum May 6, 2026, 6:54 PM

#

deficit is the opposite

hard drum May 6, 2026, 6:54 PM

#

boreal holly The math aint mathin

HHHHHOW

#

maybe i gotta clear my cache or something?

#

wth

#

not even a full day && am already nearing limits

boreal holly May 6, 2026, 6:56 PM

#

hard drum HHHHHOW

Wait a minute, you mentioned earlier you got multi agents working today. I think this is directly related

torpid trout May 6, 2026, 6:56 PM

#

hard drum

how can you consume almost half weekly tokens in less than a day?

hard drum May 6, 2026, 6:56 PM

#

boreal holly Wait a minute, you mentioned earlier you got multi agents working today. I think...

yyyyyes

torpid trout May 6, 2026, 6:57 PM

#

Something wrong there

solid lake May 6, 2026, 6:57 PM

#

torpid trout how can you consume almost half weekly tokens in less than a day?

https://tenor.com/view/typing-jim-carrey-jim-carrey-type-gif-4680550

#

Oh

#

No embed

torpid trout May 6, 2026, 6:57 PM

#

Oh, yes, I do not use subagents, mainly because you need to tell it to use them even if they are enabled, and I am not going to handhold this thing

#

If it does not use subagents just because I do not tell it to then the feature for me is inexistent.

solid lake May 6, 2026, 6:58 PM

#

I have the subagents baked into the workflow skill

hard drum May 6, 2026, 6:58 PM

#

solid lake I have the subagents baked into the workflow skill

i have my layer doing it

torpid trout May 6, 2026, 6:58 PM

#

Same I also do not have any skill, and meanwhile not even a global agentmd anymore

hard drum May 6, 2026, 6:58 PM

#

except it will only call them when absolutely necessary

solid lake May 6, 2026, 6:58 PM

#

Skill is very nice

hard drum May 6, 2026, 6:58 PM

#

the skill itself even says how to use them, when && why

solid lake May 6, 2026, 6:58 PM

#

Just get a good general one with live docs and source control

hard drum May 6, 2026, 6:58 PM

#

so plugin + skill in tandem

solid lake May 6, 2026, 6:59 PM

#

hard drum the skill itself even says how to use them, when && why

“Hey make a skill for this project based on best practices”

torpid trout May 6, 2026, 6:59 PM

#

They useless in 99% of all cases.
I have a local live agentmd, and milestones docs

solid lake May 6, 2026, 6:59 PM

#

I have those too

#

Those are what’s invoked by the skill

hard drum May 6, 2026, 6:59 PM

#

i may send openai an email or something later

#

because holy heavens am confused

#

would be nice if there was actual disclosure to what actually counts into usage && what takes the most of it

boreal holly May 6, 2026, 7:05 PM

#

hard drum would be nice if there was actual disclosure to what actually counts into usage ...

I haven't used multi_agent, but maybe the rollout logs are stored somewhere different from primary agents, so codex bar sees tokens used but the money calcs break? In any case I bet it's the multi agents thing

#

That was the worst part about experimenting with orchestration stuff. Eating massive quota usage over small stuff like that. Thankfully have not dealt with that for a while

hard drum May 6, 2026, 7:07 PM

#

boreal holly I haven't used multi_agent, but maybe the rollout logs are stored somewhere diff...

i took it from your multi_agent_v2 screenshot

#

Screenshot_2026-04-30_at_10.14.11_AM.png

#

if it's bad, i may disable it

#

the "v1" is already disabled

#

so if that's the problem, then...

#

but i wouldn't know unless i had disclosure && reset to check the difference

boreal holly May 6, 2026, 7:09 PM

#

hard drum

You mean responses_websockets? That setting just changes from HTTP SSE to websockets. Huge performance and reliability boost at no cost.

Back when they did quota by the message instead of per token that setting was actually really bugged and would eat the quota, but now it's confirmed safe to use

magic bay May 6, 2026, 7:09 PM

#

i cant update or reinstall Codex on Windows 11, any with same problem?

hard drum May 6, 2026, 7:09 PM

#

boreal holly You mean responses_websockets? That setting just changes from HTTP SSE to websoc...

no, multi_agent off + multi_agent_v2 on

#

i have the rest configured just like there

boreal holly May 6, 2026, 7:10 PM

#

hard drum no, `multi_agent` off + `multi_agent_v2` on

Huh, I definitely have multi_agent=false, but I didn't know there was a multi_agent_v2 so I guess I should disable that too

nocturne folio May 6, 2026, 7:11 PM

#

use symphony, its a great multi agent system

boreal holly May 6, 2026, 7:12 PM

#

hard drum no, `multi_agent` off + `multi_agent_v2` on

There we go 🤪

Screenshot_2026-05-06_at_12.12.22_PM.png

hard drum May 6, 2026, 7:18 PM

#

boreal holly There we go 🤪

• Local evidence points to two separate facts: multi_agent_v2 = true does not burn quota by
  itself, but spawned child turns do, and the local log database shows hundreds of threads
  with codex.turn.token_usage over the last week. I’m moving to the bounded repo change now:
  disable OAL’s native Codex multi-agent surface, render bounded thread settings, and add
  Symphony as the scheduler path requested for 0.5.1-beta.1.

#

this is what i got

#

so, multi_agent_v2 causes a lot of threads to spawn apparently

#

because y'can't use max_threads toggle

#

 279 +               expect(config).toContain("enable_fanout = false");
    280 +               expect(config).toContain("multi_agent = false");
    281 +               expect(config).toContain("multi_agent_v2 = false");
    282 +               expect(config).toContain("max_threads = 6");

#

might wana search into those, rob

boreal holly May 6, 2026, 7:21 PM

#

That makes sense...

Back when I was using command-parser, sometimes the command parser agent would run a command that spawns another command parser, and since I was using spark they were rapidly spawning command parsers so quickly I had to just shut down my computer lol and it used a ton of quota. It gets sketchy when agents are allowed to spawn other agents.

hard drum May 6, 2026, 7:21 PM

#

boreal holly That makes sense... Back when I was using command-parser, sometimes the command...

this should be documented ffs

#

i wouldn't have known it was THIS bad

dusk thorn May 6, 2026, 7:22 PM

#

#

Lmfao

hard drum May 6, 2026, 7:22 PM

#

dusk thorn

still less than what OAI gives by norm

dusk thorn May 6, 2026, 7:22 PM

#

“Guys, you can burn your weekly twice as fast!! Happy?”

#

Double the 5 hour same weekly rate what a crap pr move

hard drum May 6, 2026, 7:23 PM

#

that 5h means F all to me

#

i'd want better weekly

dusk thorn May 6, 2026, 7:23 PM

#

Yes

hard drum May 6, 2026, 7:23 PM

#

i can wait 5h out easy

#

but waiting many days out just for weekly? yikes

dusk thorn May 6, 2026, 7:23 PM

#

Even tho 5 hour was on 20 dollar out so quick

#

2 opus prompts

boreal holly May 6, 2026, 7:23 PM

#

hard drum this should be documented ffs

I have it set up so the only agent allowed to spawn other agents is an orchestrator, and they cannot spawn other orchestrators. And since they spawn as peers instead of "sub agents" they're visible to me as normal agents. That's the ideal way imho to do it

hard drum May 6, 2026, 7:24 PM

#

boreal holly I have it set up so the only agent allowed to spawn other agents is an orchestra...

pretty sure i had this same system since oabtw-v2

#

#

should probably discuss more in #1500266888247382176

boreal holly May 6, 2026, 7:27 PM

#

I mean we can probably discuss it here! It just might get interleaved in "reset wen" discussions 😂

hard drum May 6, 2026, 7:27 PM

#

boreal holly I mean we can probably discuss it here! It just might get interleaved in "reset ...

it'd be annoying to find those discussions tho

#

it's a good thing to separate so that it is easier to find later

#

isolated!

#

LIKE SANDBOXING

dusk thorn May 6, 2026, 7:28 PM

#

Codex app incoming

#

hard drum May 6, 2026, 7:29 PM

#

it better be swift 6

silver dew May 6, 2026, 7:40 PM

#

#

dusk thorn May 6, 2026, 7:44 PM

#

Cloudflare ‼️‼️

torpid trout May 6, 2026, 7:45 PM

#

dusk thorn Cloudflare ‼️‼️

Nah, selfhost the stuff the way is

#

CF only for DNS.
Result: homeless became millionaire. And then needed a psychiater.

boreal holly May 6, 2026, 7:46 PM

#

torpid trout CF only for DNS. Result: homeless became millionaire. And then needed a psychia...

CF is pretty legit at ad-hoc tunneling though

lean lark May 6, 2026, 7:49 PM

#

So I'm testing the new functionality that allows Codex to change a workspace to agree with another similar workspace. The test was to comment out two lines of code, one in two different projects nested in the workspace.
The effort required by "Artificial Intelligence" to type four characters, "//"x2, included a new Codex thread, documentation to convey the challenge, documentation to confirm the changes made, and GitHub patches in both repos.

#

I feel like I just published SaaS to flip a coin, powered by H100's and Amazon infrastructure with SOTA VPS.

#

I feel like I just sold a flamethrower to a cub scout who wants to start a campfire.

#

I feel like I published Twitter or Facebook: both providing little more than a textbox and a send button for clueless monkeys, destroying the concept of truth and cordial social discourse.

#

I feel guilty like I'm asking the machine to do Soo much, waste so much compute and heat and water and electricity, just to make "this" look like "that".

#

I'll get over it. Just sharing.

torpid trout May 6, 2026, 7:50 PM

#

boreal holly CF is pretty legit at ad-hoc tunneling though

wireguard all the way

silver dew May 6, 2026, 7:50 PM

#

boreal holly May 6, 2026, 7:51 PM

#

silver dew

Ngl I agree with this one

lean lark May 6, 2026, 7:51 PM

#

"I'm CEO of a large business with lots of money to spend on useless things."
"Hello ... AWS?"

torpid trout May 6, 2026, 7:52 PM

#

DIY aws at home https://www.reddit.com/r/selfhosted/comments/1krfz0r/one_year_selfhosting_its_a_rabbit_hole_without_end/

#

When I started, I did not know I would end up there, and when i was there, I did not know I would end up in even worse places lol

boreal holly May 6, 2026, 7:53 PM

#

A long time ago I was using digital ocean and was like "hmmm, maybe I should try AWS. Let's see, how do I spawn a new machine?"

Needless to say, still use digital ocean 😂 I don't know how Amazon made spawning a single machine the most convoluted and impossible process but they did it! Enterprise grade bologna

torpid trout May 6, 2026, 7:54 PM

#

Thats why there are AWS degress lol

lean lark May 6, 2026, 7:54 PM

#

Self-hosting is a different experience for everyone. Depends entirely on knowledge, patience, budget, and most of all, individual wants and needs.
Statements about self-hosting being good or bad are as valid as similar statements about having a pet.

torpid trout May 6, 2026, 7:54 PM

#

lean lark Self-hosting is a different experience for everyone. Depends entirely on knowled...

I disagree. Having a pet is always good.

lean lark May 6, 2026, 7:54 PM

#

You're wrong!

#

But cats are delicious.

boreal holly May 6, 2026, 7:55 PM

#

When it comes to pets, always leave enough food in the bath tub before going on long trips

lean lark May 6, 2026, 7:57 PM

#

( Sigh, was talking about self-hosting, from AWS VPS/et.al. ... and we got into bathtubs...)

hard drum May 6, 2026, 7:57 PM

#

torpid trout CF only for DNS. Result: homeless became millionaire. And then needed a psychia...

i can tell you're not a native

#

are you baltic?

#

"psychiater". that's how we say it in 🇪🇪 (psyhhiaater)!

boreal holly May 6, 2026, 7:57 PM

#

Always leave enough API credits on AWS before doing even the most simple operation

#

For example $20k in credits before changing your password, just in case they bill you for it

torpid trout May 6, 2026, 8:00 PM

#

lean lark But cats **are** delicious.

They are eating pets!

torpid trout May 6, 2026, 8:00 PM

#

hard drum i can tell you're not a native

why. whats wrong with the psychiater, he need a psychologist?
Apart of that, I am of some rather helvetic descent

hard drum May 6, 2026, 8:00 PM

#

torpid trout why. whats wrong with the psychiater, he need a psychologist? Apart of that, I a...

no, the word

#

you just outed yourself as non-native haha

#

welcome aboard

torpid trout May 6, 2026, 8:01 PM

#

Typo?

hard drum May 6, 2026, 8:01 PM

#

nope!

torpid trout May 6, 2026, 8:01 PM

#

Or like no 'm ever says it?

hard drum May 6, 2026, 8:01 PM

#

not a typo. it's "psychiatrist" in english native, but it's super rare to see "literal conversion to english from foreign language"

#

which is awesome

lean lark May 6, 2026, 8:01 PM

#

zzzzzzzzzz

torpid trout May 6, 2026, 8:01 PM

#

oooh

lean lark May 6, 2026, 8:02 PM

#

Just speak Esperanto.

hard drum May 6, 2026, 8:02 PM

#

to put it into perspective...

#

psychiater = 🇪🇪 psyhhiaater
but psychiatrist would sound weird = 🇪🇪 psyhhiaatrist

#

the -er suffix is like a role/job

#

"streamer, coder, programmer, vibecoder, developer"

#

now try "streamist" "codist" "programmist" "vibecodist"

#

not really working, is it now?

torpid trout May 6, 2026, 8:03 PM

#

yeah, just googled it
guess its related to dentist somehow lol

#

duh, and artists
Gonna bother gpt now, explain wy

#

Murderer seems a profession then?

hard drum May 6, 2026, 8:04 PM

#

torpid trout duh, and artists Gonna bother gpt now, explain wy

&& autists 👏

torpid trout May 6, 2026, 8:04 PM

#

Guess it is related to whether you go or get

hard drum May 6, 2026, 8:05 PM

#

torpid trout Murderer seems a profession then?

murder is a group of crows

boreal holly May 6, 2026, 8:05 PM

#

That's alright, I'm bri'ish, so when I sing down with the sickness I go "drownin deep in moy sea of loaving"

torpid trout May 6, 2026, 8:05 PM

#

Yo get murdered. You go to the therapist. You get programming, and you go to the dentist

hard drum May 6, 2026, 8:05 PM

#

so... a murderer would technically mean a handler of a group of crows

torpid trout May 6, 2026, 8:05 PM

#

hard drum murder is a group of crows

dexter disagrees

hard drum May 6, 2026, 8:07 PM

#

torpid trout Yo get murdered. You go to the therapist. You get programming, and you go to the...

if you want to be psychologically coerced, then a therapist sure

#

but if you want to actually get things done, you go to a psychologist

#

one is pseudoscience with textbooks, the other studies/practices a legitimate field

#

choose wisely

#

therapist uses subjective, often-irrational feelings to get their way into your head

torpid trout May 6, 2026, 8:08 PM

#

hard drum one is pseudoscience with textbooks, the other studies/practices a legitimate fi...

doesnt explain the -ist vs -er

boreal holly May 6, 2026, 8:08 PM

#

If there's a mourder you call the bobby, perhaps even get the constable involved

hard drum May 6, 2026, 8:08 PM

#

torpid trout doesnt explain the -ist vs -er

oh, i went off-topic on that

#

i went into definition

torpid trout May 6, 2026, 8:08 PM

#

boreal holly If there's a mourder you call the bobby, perhaps even get the constable involved

the constable lol

#

This is so british

hard drum May 6, 2026, 8:09 PM

#

torpid trout the _constable_ lol

constable is also slavic

#

we used to use that word a lot over here

#

🇪🇪 konstaabel

#

read as "constabel"

torpid trout May 6, 2026, 8:10 PM

#

meh, the reason is boring
https://chatgpt.com/share/69fba035-5d9c-83e9-9c35-67eb41f8ee56

basically it is verb vs noun

lean lark May 6, 2026, 8:11 PM

#

Feels like Friday

torpid trout May 6, 2026, 8:11 PM

#

You dont "dent", you are a dentist... and fix teeth
But you "program", so you are a "programmer"

hard drum May 6, 2026, 8:12 PM

#

torpid trout You dont "dent", you _are_ a dentist... and fix teeth But you "program", so you ...

"dent" means a smell crevical uneven surface point

#

so "dentist" makes no sense

#

"to dent"

boreal holly May 6, 2026, 8:13 PM

#

torpid trout You dont "dent", you _are_ a dentist... and fix teeth But you "program", so you ...

apparently "dentin" means teeth

hard drum May 6, 2026, 8:13 PM

#

huh

#

then why is it not dentin doctor?

lean lark May 6, 2026, 8:13 PM

#

What would Arthur Dent have to say about that?

hard drum May 6, 2026, 8:13 PM

#

or dentictor?

torpid trout May 6, 2026, 8:13 PM

#

lean lark Feels like Friday

Theres a song for that in my native language
"Today's not friday"
https://www.youtube.com/watch?v=J6yWU8gpfJE

#

Best band ever from deep helvetia

torpid trout May 6, 2026, 8:14 PM

#

boreal holly apparently "dentin" means teeth

yeah well in italian etc, "denti" is (plural) teeth

hard drum May 6, 2026, 8:14 PM

#

torpid trout yeah well in italian etc, "denti" is (plural) teeth

they should be more like germanic influence...

#

"tooth's doctor" is a possessive literal translation for 🇪🇪 hambaarst

torpid trout May 6, 2026, 8:15 PM

#

hard drum "to dent"

but that;'s the thing. dentIST, noun, not verb, he does not dent
If he would, it would be a dentER

hard drum May 6, 2026, 8:15 PM

#

hammas -> tooth
hamba -> tooth's (something's/someone's)
hamba + arst -> the tooth's doctor

#

arst could also mean medic in some context

#

think it comes from some old german arste

torpid trout May 6, 2026, 8:16 PM

#

arst also could mean my worthy back if you pronounce it drunkly in my lang

boreal holly May 6, 2026, 8:16 PM

#

I feel like "programmist" is a slur lol

hard drum May 6, 2026, 8:16 PM

#

boreal holly I feel like "programmist" is a slur lol

well, 'autist' is already used like a slur, so...

#

fits the idea

torpid trout May 6, 2026, 8:16 PM

#

hard drum well, 'autist' is already used like a slur, so...

they def do not make autos

boreal holly May 6, 2026, 8:17 PM

#

In Nascar all they do is turn left. A disgrace to all autists!

hard drum May 6, 2026, 8:17 PM

#

boreal holly In Nascar all they do is turn left. A disgrace to all autists!

y'gotta turn left to go right

#

hinthint

torpid trout May 6, 2026, 8:17 PM

#

yeah I mean, that brings to question, why is it leftist, but not rightist

hard drum May 6, 2026, 8:17 PM

#

torpid trout yeah I mean, that brings to question, why is it leftist, but not rightist

wait, it's not rightist?

#

leftist, middlist (or middleist?), rightist?

#

centrist--no wait, that's CENTRAL

#

sure, Englisch...

boreal holly May 6, 2026, 8:18 PM

#

If the right wing bird only flaps their right wing they end up turning left

torpid trout May 6, 2026, 8:19 PM

#

boreal holly If the right wing bird only flaps their right wing they end up turning left

same the other way around - sad reality.

lean lark May 6, 2026, 8:19 PM

#

Was thinking ... that is reality, actually..

boreal holly May 6, 2026, 8:19 PM

#

No wonder the world is the way it is 😔

lean lark May 6, 2026, 8:20 PM

#

( Discussion of politics is a violation of server rules BTW, just sayin...)

hard drum May 6, 2026, 8:20 PM

#

holy new state!

#

a new status message

#

'is so bad haha

torpid trout May 6, 2026, 8:20 PM

#

lean lark ( Discussion of politics is a violation of server rules BTW, just sayin...)

we no discuss politics, this is linguistics

#

(also not permitted)

#

Just being precise 😺