#codex-discussions

1 messages ยท Page 50 of 1

plush harbor
#

weird. I've had the api hooked in for a week now and everything is zero other than the money they are taking off me

hard drum
#

It's also a social coercivist, as i'd like to call.

torpid trout
#

That is new

#

They removed the usage menu?!

plush harbor
#

oh great so they broke something right when I implented the api

torpid trout
#

And now it is a dahboard ugliness? And yes, it is zero, and should not, clearly, one of my api keys is massive in usage and was reporting usage in the old screen for ... now years

hard drum
#

there's a reason you need something like this in memor/custom instructions:

Does not want unsolicited advice, guidance, reassurance, suggested next steps, coping strategies, interpretations framed as what they should do, or suggested wording/scripts in emotional, interpersonal, imagined, dream, memory, trauma, or hypothetical scenarios. Do not provide 'you could/should/might say/do/try,' 'the safest goal would be,' 'a better/more accurate thing would be,' or similar constructions unless the user explicitly asks for advice, guidance, action steps, or wording. Stick to direct answers and interpretation only.

general-purpose

#

it could be made better, of course

native agate
#

Anyone tried the new Deepseek free model? legendary

hard drum
#

it's basically the whole, "did i ask?"

plush harbor
torpid trout
#

what even is this new measuremnet dashboar dsupposed to be... a vibecoded insult?

plush harbor
#

lol I regret mentioning this now ๐Ÿ™‚ Clearly I missed something good

torpid trout
#

It was horrible before, but at least one was able to see what is used where and when

boreal holly
hard drum
#

since the web version is a freak

boreal holly
#

Oh yeah, web version... I still have not tried it. Sounds frustrating!

cedar skiff
hard drum
#

It's just annoying in thought form

torpid trout
boreal holly
plush harbor
cedar skiff
#

I ended up just inserting some stuff via developer_instructions

#

It's not ideal because it could have conflicts

plush harbor
boreal holly
# cedar skiff It's not ideal because it could have conflicts

Yeah, if you look at a rollout log you can pull the Codex provided, fully formed system prompt out of the first message and refactor it to fit your needs. For example I told em "you don't need to use the apply_patch tool if there's a better way to edit something. Python works fine." And these suckas do incredible amounts of refactoring with that simple tweak

hard drum
#

Mine's been using python3 for a lot of edits, too, for some reason

#

also in middle of rebooting OABTW as OAL

cedar skiff
boreal holly
hard drum
#

i should let them use python3 more for it

boreal holly
# hard drum i should let them use python3 more for it

I think they put that "You must use apply_patch for edits" line in there because in Codex Desktop & VSCode, every apply_patch shows up in the UI with an undo button, so they want the agent to use it as much as possible. I see it as a hinderance. They're pretty clever with their python-fu!

white rover
#

hello codex gang ๐Ÿซฃ

#

bit of a pickle im in, any easy way to get codex chats from one device over to another device, or letting codex that has the chat get evrything it needs, files, handover sheet etc, then downloading what it sends, and sending to the other device??

#

any help would be great gang ๐Ÿ™

plush harbor
#

never tried but you might be able to lift and shift the session directory

hard drum
white rover
#

main codex is on ios and other device is windows ๐Ÿซฃ

hard drum
#

๐ŸŽ

white rover
#

๐Ÿฅต

cedar skiff
#

you mean macos?

hard drum
#

i raise you... so much

white rover
boreal holly
hard drum
#

one is iOS game thing, other is android emu for reference

plush harbor
#

I'm over here on command line like its 2002

hard drum
#

cannot imagine using terminal agents unless i had "vscode but terminal" equivalent

hard drum
#

&& if you say neovim, i will personally force thigh-high socks on ya

plush harbor
#

no agents here, just codex. Old site, small codebase. I have cron jobs not agents lol

hard drum
#

|| even better: nano

plush harbor
#

sublime text

hard drum
#

use nano like real people haha

lost drum
hard drum
#

๐Ÿคฎ

boreal holly
hard drum
#

beautiful hole

nocturne folio
#

somethings off here

tulip osprey
#

FINALLY

#

it is about time they got a disocrd server

boreal holly
tulip osprey
#

ew no pic perms...

hard drum
nocturne folio
hard drum
nocturne folio
plush harbor
#

I have 10,000 unread emails but nothing anywhere else >.>

lost drum
#

just dont go to ecom communities

hard drum
#

English, please?

cedar skiff
lost drum
#

e-commerce

cedar skiff
#

I bet your phone notifications feed is endless

cyan wing
#

so what do when Codex team members just fake click bait like this

#

i thought today was super app day

lost drum
cedar skiff
strange leaf
#

Does changing this option make any difference?

cedar skiff
#

There still heaps of stuff to be built, including the models that run it.

red mulch
#

codex is tuff

#

but too less credits

#

tbh

nocturne folio
#

Maybe check my profile

cyan wing
#

dont check their profile

glacial shadow
#

Lol

steady vigil
#

looks like my weeky limit was moved forward without actually resetting it. (pro) โ”‚ Weekly limit: [โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘โ–‘] 43% left (resets 06:23 on 5 May)

this should be day 3 from the last reset

#

anyone else see this?

frail meadow
#

I thought they reset last night

steady vigil
#

yeah the timing would make sense

#

my % left not so much

#

that makes this literally a reduction in usable credits vs no reset for me

frail meadow
#

Where do you live?

steady vigil
frail meadow
#

Hmm interesting, I feel like not everyone gets the same resets or at the same times

#

I'm in US and got it around midnight Central Time last night I think

steady vigil
nocturne folio
boreal holly
drowsy turtle
#

made codex connect to claude using prive p[ythopn script private connecytion with user

frail meadow
#

What are you guys working on tonight?

frail meadow
lost drum
#

someone make a scan of every message taht comes when seearching "codex" in here and make a complexed guide for new commerse here

#

and create a command like /guide

#

its out now bro

inland sonnet
lost drum
undone patio
ebon skiff
#

guys, please help, my Codex windows cant use 'Browser Use' Plugins

oak trellis
#

how is the limits they coming to an end tomorrow ?

ebon skiff
#

Iโ€™m trying to use the Codex in-app browser automation through the browser-use plugin / browser-client.mjs.

The in-app browser itself is open and working correctly, and I can manually navigate/login in the browser UI. However, Codex cannot attach to or control the browser through the Node REPL automation bridge.

The failure happens when running the Node REPL browser setup code:

const { setupAtlasRuntime } = await import(".../browser-client.mjs");
await setupAtlasRuntime({ globals: globalThis, backend: "iab" });
The returned error is:

Node runtime too old for node_repl (resolved C:\Program Files\nodejs\node.exe):
found v22.18.0, requires >= v22.22.0.
Install/update Node or set NODE_REPL_NODE_PATH to a newer runtime.
Important detail: the system also has a newer bundled Codex runtime available:

C:\Users\yusuf.cache\codex-runtimes\codex-primary-runtime\dependencies\node\bin\node.exe
v24.14.0
But the node_repl tool still resolves to:

C:\Program Files\nodejs\node.exe
v22.18.0
So the issue seems to be that the Node REPL used by Codex/browser-use is not picking up the newer bundled Node runtime, even though it exists locally.

Question:
How can I make Codex / node_repl use the newer Node binary? Should I set NODE_REPL_NODE_PATH, update system Node, restart Codex, or configure this somewhere inside Codex/plugin settings?

oak trellis
#

now thinking buying codex or cc .. with all that new codex stuff

#

their limit reset tricks

oak trellis
#

which one you prefer right now ..

undone patio
#

anthropic is behind and lacking compute

#

codex ez

#

i have all three

oak trellis
#

yes thought that too ..

undone patio
#

i have codex, claude, and google ultra

#

and i dont even use the other two

#

lol

oak trellis
#

loool

#

you do backend ?

undone patio
#

ML/Fullstack

oak trellis
#

yes same here .. backend right now and then frontend

undone patio
#

i like frontend more but its hard to not be fullstack these days

oak trellis
#

what about their limit you think they continue upcoming month ?

undone patio
#

idk wym tbh

oak trellis
#

they have 2x limits for a certain time

#

but i think with the reset mess they doing .. its no more haha

gentle harbor
#

why is it black ?

strange leaf
cedar skiff
#

looks like adobe?

strange leaf
#

OpenAI Codex

inland sonnet
strange leaf
#

Does it interfere with anything? That was my question.

inland sonnet
strange leaf
#

Okay, thank you very much.

strange leaf
gentle harbor
#

how does fast mode work ? does it just tell the model to hurry up or does it just give it more power ?

solemn acorn
#

that's a lot of how openai seems to be handling their pricing now--with a single model and pricing depending on how quickly you want an output

gentle harbor
#

oh ? thats it, then they should make a slow mode as well that saves tokens

solemn acorn
#

there is in the API, but it's very slow

#

like you're fine waiting a few hours slow

gentle harbor
#

in 2 days i used 76%

solemn acorn
#

are you on plus?

gentle harbor
#

on normal mode

gentle harbor
solemn acorn
#

uh

gentle harbor
#

or what ever the 100 usd one is

solemn acorn
#

how?

gentle harbor
#

no clue, 5.5 medium burns through rate limits

solemn acorn
#

I can say that's definitely not true on the $100/mo tier

gentle harbor
#

i thought 5.5 med would be about the same as 5.4 extra high but nope its still higher

solemn acorn
#

I've been using it pretty heavily over the past two days and I'm in no danger of running out of usage lol

solemn acorn
#

idk it just doesn't use that much?

#

granted a lot of what I'm doing requires waiting for things to build which doesn't actively use tokens

gentle harbor
#

maybe its plan mode ? i do spam that alot, i use normal gpt 5.5 thinking to refine my prompts then i put it into codex plan mode and then send it, seems to give better results but kills my rates

solemn acorn
#

maybe, I use plan mode sometimes but usually only once or twice per thread

gentle harbor
#

i was using it for every message unless it was simple

solemn acorn
#

I think maybe you just need to start making separate chat threads for things

solemn acorn
#

granted I have opinions on things and already know how I want codex to do them

#

but a big part of effectively prompting is knowing how to communicate what you want

gentle harbor
solemn acorn
#

claude or gemini, though I would say no LLM is super great at UI design, they all have their telltale styles

#

kinda on you to bring the creativity in that regard

#

even if you bring them a template, they'll apply their telltale AI slop style if you ask them to use it in a design so if you do want something original, kinda just have to mock up the UX flow in figma yourself and give the model that

#

unless its a very opinionated UI library, that can help

gentle harbor
#

im very bad at ui so that wont be happening sigh, this is gpt 5.5 after a quick 5 min prompt

#

just as a test at how good it can do it

solemn acorn
#

doing effects with CSS is one thing, using those effects tastefully is something all LLMs struggle with rn

#

claude loves its purple gradients, gpt loves putting everything in poorly spaced squircles, gemini does wacky stuff which is cool but then has no ability to tweak it upon request

gentle harbor
#

huh, it cut me off when im at 1% not 0%

#

for only 100 more i might get 200 usd plan

#

when i upgrade that would apply right away right ?

solemn acorn
#

for what you're using it for, you could probably bump 5.5 down to low reasoning

solemn acorn
gentle harbor
#

the 5 hour limit is so annoying im not sure why they dont let me burn through my full week usage at once

solemn acorn
#

you're getting subsidized tokens, so there are limits on them to prevent you from burning through them all in the same way that paying full price would afford you

#

spreading out the usage is cheaper for openai and also makes it less likely that you use 100% of your weekly usage

gentle harbor
#

i wonder, how much less is 500 credits compared to a 20 usd plan ?

solemn acorn
#

the $20/mo sub is roughly 150-750 credits worth per 5 hours

#

probably closer to 150

#

I don't think there really is a way to compare the credits to the plan limits though since they're calculated differently

gentle harbor
# solemn acorn it would

sigh, time to wait a few hours until my 5 hour timer is up, should i wait until im at 0% weekly until i upgrade so i go back upto 100% weekly ?

solemn acorn
#

it would get you 4x more tokens for the week, but that is a jaw-dropping amount of tokens that you might not need to be using

#

actually the $100/mo tier is temporarily doubled so it would only be a slightly more than 2x increase through may 31st and then only 2x from what you have right now after that

gentle harbor
#

im assuming its the over use of plan mode and the very long prompts when they dont need to be that long

static maple
#

Does updating the Codex version affect the MCP configuration of Supabase? Why do I keep having to re-add my Supabase MCP every time after an update?

jovial finch
#

They pushed an update after my complaint to them, now everything is working, simply update and you should be good.

short linden
#

$100 plan is cheap than buy extra credit.

#

It's token base usage anyway.

prime ice
#

When we will see codex available for Linux?

tiny fulcrum
oak trellis
#

hmm is 5.4 more dumb now ?

#

very very bad

low cosmos
#

why does my codex randomly stop responding

#

like for no reason at all

#

mid request

chrome raven
oak trellis
chrome raven
oak trellis
#

ok ok .. i thought using 5.4 to save limits..

split oracle
split oracle
# oak trellis its worth it ?

dunno exactly, but i guess it worths if what you are doing with it has risk in any form, not just project for fun.

split oracle
#

i am building several apps with codex just for my personal use so i dont risk anything perhaps codex does a mistake. i can just pull the latest commit of my project

split oracle
oak trellis
oak trellis
#

wow 5.5 is ultra fast

#

lol working entire day and only 4% of weekly used

#

please openAI do not change .. i know you going to bankrupt as your CFO said you cant pay the compute contracts anymore in near future.. just let me finish my project and let me print some coins ..

#

iam greatful for your service

hallow iron
vast crater
#

Anyone else has their Codex App on Windows struggling hard with the path when used with WSL2 agent?

#

5.5 always tells me that node, bun, npx etc aren't on the path, it can't see the MCP... I have to default back to the CLI for a full experience

unique spade
#

Tell it to put them on the path

wicked briar
#

๐Ÿ˜ญ

unique spade
#

Started to use my own UX for codex. Maybe it's due to always loving to customize the software I use, but psychologically is so much better to be able to have all the features I want and need, where I want them

nocturne folio
dawn seal
#

so I realized power of automation

#

with this I can do agentic work so easily

#

create like, an agent that manages my website

#

and such

unique spade
# torpid trout API or codex server?

Using appserver via codex cli exec as default

But separately I'm building my own direct connect path to oai server for gpt subscription

I ve experimented for last half of year with custom functionalities done via codex fork

But it's too much overhead to keep fork updated to codex releases, most of which have 0 impact on my own functionalities, but they still change a lot the core crates so I end up having to re-align the fork every release.

So for the harness, I'm just switching to custom implementation from scratch

low cosmos
unique spade
#

Gptpro 5.5 is really fast

Asked it to prototype me a few design variants for my custom UX surface to also have a multiplexing view for multiple codex threads

It took it 2 minutes and they seem to be using the design I already use (it has access to the repo in that thread)

oak trellis
#

yes its ultra fast!

unique spade
#

I could never use gptpro for brainstorming cause the time I had to wait for any answer was way too long. But things seem to have finally changed

wicked briar
#

very fast

#

how is the usage on $100 plan

unique spade
# wicked briar how is the usage on $100 plan

For gptpro? Not sure is a difference. I never hit a limit inside chatgpt, but is true I don t send gptpro tons of requests.

But right now, my mental model

Is that the key difference between the 2 pro subs is mostly codex qouta

Otherwise for practical purposes (inside chatgpt) they seem quite equivalent

oak trellis
wicked briar
wicked briar
#

I am planning to downgrade

cedar skiff
#

on the x20 chatgpt is unlimited

unique spade
oak trellis
#

i code all day long 5% now LOL

oak trellis
#

sam hear me out .. if you there do NOT change antyhing this next 2 month..

unique spade
wicked briar
oak trellis
#

suck all the data and send to mossad

#

iam fine with it for now..

wicked briar
#

bro like anthropic also sucks data but doesn't really provide value thats why they are losing market

oak trellis
#

anthropic sucks it and kick us when we down already ..

#

openai at least gives us gud service

#

better evil

#

hope elon lost in court too ... he the biggest leacher

wicked briar
#

yeah bro elon just mad he sold his shares

#

codex computer use is black magic

oak trellis
torpid trout
#

So, I like to write articles from time to time, hobby investigative pieces
I used GPT a lot for research, rewriting, etc
So I had this brillinat idea, you know. Codex could create me a full newsroom, basically an app that replaces BBC's offices, and lets me write professional real investigative pieces

oak trellis
#

he would like to use openai to pump is spacex scam..

torpid trout
#

But, life is a b***

oak trellis
#

they all try to get into sp500 .. retirement funds..

#

if they get there .. auto funds coming in

lost drum
oak trellis
#

that is so crazy .. didnt know that .. as soon you hit sp500 .. its auto buy .. your stock

oak trellis
#

wait for bottom .. long

lost drum
#

brez went crazy today

oak trellis
wicked briar
lost drum
oak trellis
#

was about to write you .. we can do soemthing nice with API endpoints those free ones

lost drum
lost drum
oak trellis
#

they track all infos .. related to war zones

oak trellis
lost drum
#

we can later see and use it tohgether

#

now I try to improve it

oak trellis
oak trellis
#

cant explain but codex right now is insane .. long time ago i had that speed .. and accuracy .. wow

#

lucky i ditched cc

oak trellis
#

still so amazed 5.5 is so smooth .

#

cant believe today i used 6%

#

from total week

#

wth

lost drum
#

yo what is the best approach to create android app cause I previouslyt thouthg its a good idea to ask gpt 5.5 pro to pormpt gemini in android studio to create whole app the issue is gemini is not as great as codex when it somes to executing prompts so I now think its better to tell gpt pro to build himself whole app in html and then somehow make codex make it an app but idk about the gradle building and other stuff that codex cant do within his enviroment (its just app for me not cmmercial use)

oak trellis
#

its crazy 5.5 just always update the docs now ..

#

before i had to beg opus ..

#

i have early cc feelings ..

#

i know it will disapear soon

cedar skiff
unique spade
#

My UX philosophy i can now materialize with the help of gpt/codex

lost drum
#

sounds like team of 20 to work on it

spring remnant
#

Day 3 of telling codex desktop team to remove comment in annotation

oak trellis
#

5.5 is so FAST

boreal holly
lost drum
oak trellis
#

6% ..

#

wow

oak trellis
#

its so good unbelievable

#

like self reflecting questions ..

signal tapir
#

I just had the classic "I'll remove mention of the problem instead of handling it" bug. ๐Ÿ˜

torpid trout
#

Just recently codex wisely told me that it fixed the bug instead of updating the documentation to justify the bug.

#

I was like... wowz. I appreciate it fam.

#

Did not know the other thing was an option really.

signal tapir
#

I could have used that today ๐Ÿ˜›

torpid trout
#

๐Ÿคฃ

signal tapir
#

"All the tests pass now!"

torpid trout
#

Yeah well, that is like the essence of tests isnt it

#

๐Ÿ™„

signal tapir
#

I've even seen it in fully test driven projects.

#

Or.. I guess in projects that were supposed to be fully test driven. ๐Ÿ˜

boreal holly
#

Yeah, whenever I find a bug of any kind I make the worker fix it, prove it with integration tests, then fire up the simulator and manually follow the repro in the test to confirm it's fixed. That way I know the fix is real, the test actually tests for the thing it's supposed to protect against, and the behavior is locked in.

inland sonnet
#

GPT 5.5 flags reading or fixing a captcha solver in my codebase as "This content was flagged for possible cybersecurity risk. If this seems wrong, try rephrasing your request. To get authorized for security work, join the Trusted Access for Cyber program: https://chatgpt.com/cyber"

#

Like bro..

celest stag
#

I got that message yesterday when I wanted it to delete a bunch of build pipeline runs and I was feeling too lazy to do it manually (I was doing exploratory testing and didn't want to clutter the build pipeline history). I just did the ID verification, whatever, still annoying though.

rocky fog
# unique spade Using appserver via codex cli exec as default But separately I'm building my ow...

you can just tell codex to sync to upstream and resolve any conflicts
it will just rebase it on latest and apply your "patches"/commits on top (there are also different ways)

I also tend to mention to codex right when forking something that we want to be syncing to upstream later and resolving any conflicts in the best possible way and to try to keep our changes compatible with syncing to upstream and to note all that in agents.md

then you just tell it to sync to upstream and resolve any conflicts any time you want to update

I fork a lot of stuff that way and it works fine to sync and keep your own changes and include new

I guess unless you really change it a lot/fundamentally

woven canyon
#

Guys, can you use codex desktop app on the free plan?

young locust
#

yes with very low limits

jovial finch
oak trellis
#

weekly me working hole day is 94% ..

#

wow

celest stag
#

I used 33% of my 7 day quota for my $200 plan on day 1 of the reset. I'm taking it easier and turned off Fast mode for now. ๐Ÿ˜ข

torpid trout
oak trellis
#

me coming from cc

#

yesterday 20 minutes in it ate 58% of the 5 hours and 10% of the week

#

that ended my sub ..

torpid trout
#

It will not always be like that.
As earlier said, I have a feeling their load balancers randomly mis-assign unsage to other accounts

#

Are you accidentally somewhere in the Southern American area?
It would be intersting to see where who is when they have high usage and low usage

#

I am in a Souther American area, so likely the load balancer sees me as either americas or latin america

frozen shore
#

can anyone help me there is no gpt 5.5

oak trellis
frozen shore
#

its in the codex app but not IDE extension

frozen shore
dark sun
#

Does anyone know what the codex rate limits feel like in comparison to the claude code rate limits?

#

I've been hitting my limits with claude code a lot recently, and I'd like to know what people's opinions are without paying $200 right away

#

oh, i'm on the claude max 20x plan

#

i used my 5 hour in like an hour

#

then i used $50 usd of extra usage in 10 minutes

#

๐Ÿ˜‚

frozen shore
#

how big is your project and mcp etc

dark sun
#

sigh

#

I daily Opus4.7 xhigh

#

the real question is: will i be able to finish this task without running out of the free api credits i got

frozen shore
#

if you want to extend the usage use the smart models for planning and the cheaper model for writing the code.

unique spade
unique spade
#

and then use codex to add or align the prototype to 100% aligned code

unique spade
#

you can start with the 100$ since it's literally half of the 200 plan insofar codex usage.

plus (20) plan is trickier to compare because they made some tweaks to the 5hour limit there and not sure how it compares with the 5hour limit on the pro plans

boreal holly
#

Lol this was really interesting. The worker basically suggested "archive me and spawn a new worker so worktree creation passes" and the orchestrator did it

torpid trout
# signal tapir I could have used that today ๐Ÿ˜›

Got one now lol
"Hey codex its not working"
"ok let me see ah user used the app wrong so let me adjust the code so user mistake is now no mistake anymore"
๐Ÿ”ซ
ctrl+c you have to tell me when I am wrong! enter lol

unique spade
main nimbus
#

A million times better

#

But keep in mind thatโ€™s 2x and essentially what a regular $200 plan feels like

boreal holly
rocky fog
# unique spade in theory yes, but if you want to keep stuff clean long term you want to check w...

I dont neccesarily mean that it will be only doing changes in separate files which patch the main code
(although you could also do that)

it just puts the commits over the top (rebase)
and GPT calls them "your commit patches" ๐Ÿ˜„
(alternatively you can also have your commits in between, proper timeline, but thats much more messy)

I still change/update everything and it merges/fixes/resolves conflicts during the sync I ask

or of course you can also tell it later to put it all together better in another pass

to "try to keep it compatible with syncing to upstream" does not exactly mean that it will only be making separate patches on top in the files

so far didnt have much issues with that and it works great (time to just fork literally everything open source and customize/fix/improve it lol)

but the whole point is that you can just have codex handle that, or also ask how to best set that way of working up

torpid trout
#

This is the first time Codex created me something I have no freaking idea how to use ๐Ÿคฃ
I literally need to ask it "guide me through the app because it is my first time using it"

#

Holy hell. This could probably kill newspapers in one go.
It allows you to run a whole editorial enterprise in one tab.

signal tapir
#

Happens to me when I have it build dev tools

torpid trout
#

Inclusive factchecks, research, analysts, yada yada yada

signal tapir
boreal holly
torpid trout
mossy nest
#

Use computer plugin removed after update codex what should I do?

boreal holly
mossy nest
#

Mac mini m4

torpid trout
mossy nest
#

I also reinstalled but same issue

torpid trout
#

This is too much for me, I do not know where to start, I will go back to asking GPT to write me the article lolol\

boreal holly
# mossy nest Mac mini m4

Hmmm, check Privacy & Security settings, make sure codex has screen recording and accessibility enabled. See if there's a new feature flag in the configs or app settings

boreal holly
torpid trout
#

๐Ÿคฃ

mossy nest
boreal holly
boreal holly
unique spade
mossy nest
boreal holly
hard drum
mossy nest
boreal holly
lean lark
#

@torpid trout Those are some seriously impressive screens. You may recall my ongoing mantra here about using AGENTS.md to direct the assistant to document code functions, and to generate and maintain user documentation for all features. That would include the README, features.md, detailed docs for each primary feature/screen, and developer documentation to explain how things work. This is not only good for us to understand what's going on, and for users - the assistant gets to read the relevant documention in each thread, significantly reducing the time to process changes. Otherwise the assistant needs to learn the app with every new conversation - that's wasteful of time and tokens, and significantly increases opportunities for errors.

torpid trout
#

Hence my earlier meme

#

Bsaically for once I bite off more than I can chew lol

#

Now I want the same thing for Developer biz lol
At least there I will understand the domain-specific terms haha

lean lark
#

The other day I had that same moment : I don't understand what it just did for me. Sometimes I don't understand the challenges it notes when it finds something wrong and I need it to explain ... in my own project.
I suspect those of us who care to discuss the projects with AI will all have these moments from time to time and the weirdness will become the norm.

small violet
#

wheres 5.5-cyber at

lean lark
#

Isn't cyber a separate product?

signal tapir
#

I think you can request access to it

young locust
#

yeah its separate with id check

torpid trout
#

What is that?

#

Me wants, i dont know what I will get but me wants.

signal tapir
#

I believe it's a cybersecurity specific model

#

Correct me if I'm wrong

torpid trout
#

aah

lean lark
torpid trout
#

That this here:
... yeah you faster. There was also a discord channel, but cannot find it now

#

before it was called something else, not cyber, but security or so

lean lark
#

https://openai.com/index/introducing-aardvark/

Introducing Aardvark: OpenAIโ€™s agentic security researcher
Now in private beta: an AI agent that thinks like a security researcher and scales to meet the demands of modern software.
March 6, 2026 Update: Aardvark is now Codex Security, and is available as a research preview.

torpid trout
#

lol

#

what the heck

lean lark
#

[ those flashes happen when people post BC stuff and other ads or off-topic images ]

small violet
torpid trout
#

Is that over?

torpid trout
lean lark
#

Codex Security works with connected GitHub repositories through Codex Web. OpenAI manages access. If you need access or a repository isnโ€™t visible, contact your OpenAI account team and confirm the repository is available through your Codex Web workspace.

#

I'm surprised but pleased that your cat meme survived filters. ๐Ÿ™‚

torpid trout
#

Its because codex creates them lolol

torpid trout
lean lark
#

Ardvark was renamed to Code Security.

slender plinth
#

guys what does feeling codexy mean

#

im new to this

#

lol

#

i got plus sub for like 1 week

signal tapir
#

maybe that someone was feeling like using codex?

lean lark
torpid trout
#

Release incoming

potent mason
lean lark
#

Hmmmm, that could indicate a feature drop.
More likely it just means Tibo is getting more use out of the platform today.
We get into a groove using these tools...

torpid trout
#

Dang, so it was not model release, it was codex release ๐Ÿ™
5.6 still pending then grin

boreal holly
#

It's probably gonna be some big feature like "upload the picture of the back of your head, Codex will generate a picture of your face with 90% accuracy"

lean lark
slender plinth
#

is he gonna reset the limits

torpid trout
slender plinth
#

people say goblins or smth

#

or usage rest

torpid trout
slender plinth
torpid trout
#

off, off, off, off?
ReallY?

#

๐Ÿ˜›

slender plinth
#

im 14 okayy ๐Ÿ˜ญ

#

dont insult me pls

lean lark
#

Just please don't ask about resets. ๐Ÿ™

torpid trout
slender plinth
slender plinth
torpid trout
slender plinth
lean lark
#

๐Ÿคฆโ€โ™‚๏ธ

slender plinth
#

im allowed

torpid trout
signal tapir
#

we all hope for them, but nobody can affect when it happens

slender plinth
torpid trout
#

hows that an offense lol

signal tapir
torpid trout
#

Did you drop your humour somewhere?

#

Sorry deeply if said meme offensed you, I will duly report it to the machinery that created it

boreal holly
lean lark
#

As a buck-toothed brainless troll, I was feeling a bit insulted here by that meme too...

hard drum
lean lark
#

lefties, yeah

hard drum
#

i thought it was "if you RH, then watch on LH, && in reverse"

hard drum
lean lark
#

you thought "right"

hard drum
#

i thought some people in my country telling me i'm wearing my watch wrong was true

#

but am right-handed, so left-hand watch

lean lark
#

u do u

boreal holly
hard drum
signal tapir
#

Actually, if I did, I would be in so good shape. Mentally and physically.

lean lark
#

That's dangerous - Like a politician who believes when ChatGPT tells them how to write new policy to defend against AI ๐Ÿ™„

signal tapir
#

hehe, yea.

#

But I do get some good advice most of the time. Especially when it comes to health and cooking.

#

It's just that I already know it all. But AI don't care. AI halp anuwai!

mossy nest
boreal holly
# mossy nest still same issue .. Computer use plugin unavailable

You should run

/Applications/Codex.app/Contents/MacOS/Codex

In the terminal, which will launch the GUI, but you may see errors being logged in the terminal. Also if you open the Console.app (macOS built-in logging tool), you may be able to find official logged errors from Codex GUI pointing to the real issue

exotic cave
#

in 0.128.0 js_repl has been dropped, good or bad? and does code_mode works the same?

boreal holly
lean lark
boreal holly
lean lark
#

I believe js_repl is only used for internal calcs and determinant processing.

exotic cave
#

the model can run js logic to call multiple tools, transform output, loop, filter etc... while staying inside one tool flow.

boreal holly
#

I think I prefer giving the agent a jupyter kernel for stuff like that

halcyon ingot
#

Hello, I have a question, please. Does the ChatGPT Go subscription have better limits for Codex than the Free subscription?

lean lark
#

It seems to me that it is very Jupyter-like but only for internal use by Codex. Since Codex can do this with Python, I'm thinking they're deprecating the internal dependency.

chrome raven
#

this /goal how can we use it?

lean lark
halcyon ingot
#

Because I'm looking for an affordable subscription to do some development work in my free time.

boreal holly
lean lark
#

Many of us do all of our professional development with Plus, with no requirement for additional credits.

cyan wing
#

anyone else not seeing /goal command?

rain gorge
#

same issue

torpid trout
#

Js repl is sort of a browser tsting thing

#

Thatโ€™s gone now!?

#

Guess I didnโ€™t update yet since it still used it here just a minute ago

torpid trout
lean lark
boreal holly
torpid trout
#

At least thatโ€™s what Iโ€™ve understand so far looking at it when it used it

lean lark
#

That makes sense... again, for determinant content.

#

So if it generates JS for us it should be able to build/transpile that code just to verify it. If it can't do that I'd be annoyed. That said, if the Code functionality in 5.5 is now so good that it doesn't need to build code to verify it, I'm kinda OK with that too.

boreal holly
#

I wonder how it works with sandbox enabled haha

torpid trout
#

it asks a million times for permissoon

#

unless you have the guided permission on

#

I gues they left again

#

They did so before, due to Phisicyans

boreal holly
#

I'm pretty sure codex with unified_exec=true already has repl. They just launch node, then pass

cat << 'EOF'
let hw = "Hello world"
print(hw)
EOF

to stdin, and it stays alive even after they finish their turn. They just keep passing stuff into stdin on that process and it keeps repl'ing their commands. No need for the extra feature, already obeys the sandbox

lean lark
small violet
#

who remembers the "Physics PHD" guy

#

that was his name]

torpid trout
small violet
#

i doubt he has a phd

torpid trout
#

Phisicyans are scammers

small violet
#

lol

#

all of them are just trying to sell their product

lost drum
#

all in one

boreal holly
#

Oh yeah? The physics phd guy was selling something?

#

I just remember him enforcing the "be noice" rules

small violet
#

let me show u

lost drum
#

haha

#

this guy too

small violet
idle cypress
#

new codex update might be the best one yet

lost drum
#

am I missing osmething

idle cypress
#

its clean af

small violet
lost drum
#

ye I jsut use the vs codex extension so ye I might be

orchid plume
#

goals feature was added to Codex as well, for long running tasks, like a ralph loop

small violet
small violet
lean lark
#

Actually the Physics PhD guy seemed to note channel decorum only after I did. I think he was mimicing or being facetious.
He disappeared after insisting on some detail at OpenAI that I verified was completely incorrect. As in HHGTTG, he disappeared in a puff of logic.

lost drum
small violet
lean lark
#

time for ๐Ÿฅช ๐Ÿฅฃ ๐Ÿฅก โ˜ข๏ธ

lost drum
#

he just got payd to advertise that service

small violet
#

whats the last thing

small violet
lost drum
#

yes

small violet
small violet
lost drum
#

depends what you selling

small violet
#

with that context

lost drum
small violet
lost drum
#

ye it has like 33 chapters

boreal holly
# small violet

Probably realized it was a dead end and went back to pressuring old ladies into buying lots of gift cards

small violet
small violet
#

i thought 5.5 cyber was going to be generally available

#

bro thinks he's mythos

cyan wing
lost drum
small violet
lost drum
#

its just mine its not for sale

small violet
#

why would u make a book for urself

#

๐Ÿ˜ญ

#

ggs guys

nocturne folio
#

i havent seen codex cli in so long

nocturne folio
#

the name atleast

small violet
nocturne folio
rain gorge
#

probably using some sort of hacked account

#

to power the service

cyan wing
lean lark
#

I need to schedule, dedicate evening time just to keep up with Codex features. ๐Ÿคฆโ€โ™‚๏ธ

chrome raven
chrome raven
lost drum
chrome raven
south compass
#

I miss the commit button and diff view from previous update..

#

now I have no idea what it did without clicking in the UI and im stuggling to find the commit button

lean lark
#

@teal cargo @left pecan Any chance to get regular updates in #codex-updates for changes to Codex NPM, Apps, Web, or Extension? There's no unified "Codex" place to go for the latest info - info is all over the place or simply non-existent. Thanks.

#

From the latest Codex Extension update:
"Introducing" Fast mode : The feature that's been in the product since v0.0 that most people should actually disable to avoid having their token quota gobbled up.

small violet
lean lark
#

I dunno when this happened but it looks like after I updated to 11.13.0 the permissions for Codex changed from Full Access to Default. Did I miss some notes in here about that?

grim perch
lean lark
#

I'm not saying full access Was default. I have a good sandbox so I have it set for current use - at some point something changed it back to Default (limited perms).
Whoops.... Looking at config.toml, the one project I just opened is not listed with a trust_level. That must be the issue. ๐Ÿ™„

#

Hmm, no, that's not it. Oy. Will look later.

plucky orchid
#

Anyone know if GPT 5.5 is available on AWS Bedrock yet ? On OpenAI website they said that yes but on AWS there is nothing at all.

rocky fog
#

๐Ÿ’€

slender plinth
nocturne folio
#

if openai acquires github i think they would fix the uptime

solemn acorn
#

oh that's cool, the codex desktop linux project has computer user support (for people who do have access which I don't yet :c)

jolly lily
#

Honestly unlike Opus 4.7, GPT 5.5 seems to spend such a little amount of the model limit

torpid trout
#

where do you get this from

#

Jane was not banned. Nor did she sell anything (not on her anyway)

boreal holly
#

Jane leaves when the "newcomer" badge disappears and eventually comes crawling back and pretends to be a newcomer

torpid trout
#

That is a smart move, allows for more tailored ragebaiting.
I might need to adpat that

#

Leave every day and rejoin and drop an offeffeffeffensive memememme evilgenius

boreal holly
#

uwu ๐Ÿ‘‰๐Ÿ‘ˆ is opus better than gtp?

#

And then somebody comes in and says "use gemini"

cedar skiff
neon girder
jolly lily
torpid trout
cedar skiff
fleet geyser
cedar skiff
boreal holly
lost drum
#

5.5 xhigh worse gpt 2.0 worse than 3 days ago

#

only me?

boreal holly
lost drum
#

oh

boreal holly
#

Use medium

lost drum
#

wdm

#

why would medium better

boreal holly
#

More decisive

torpid trout
#

Medium rare

lost drum
#

he just seems to reason less than before now it feels like 5.4

torpid trout
#

I am suing a mix of medium and high on 5.5, I really so far not once used xhigh with it, completely forgot it exists.
Let me try.

#

hello codex create me a meme...

boreal holly
#

I removed xhigh so I don't accidentally select it, that's how useless it is

lost drum
#

I dont get it at all

boreal holly
# lost drum wdm why

I'm not researching an entirely new branch of physics, or solving the mysteries of the universe, so xhigh is worthless

torpid trout
#

So, thats what xhigh can do to code.

#

You ask it to create a quick edit in docker compose file and it comes back with an assertation as of why this is not a new chatbot app

#

๐Ÿคทโ€โ™€๏ธ

lost drum
#

I feel like talking with not thinking model at all when using xhigh like idk he seems to not go deep enough when I tell him too

torpid trout
lost drum
#

I mean maybe I was on stages in my project where it was more to finish and now teh polishing steps are blicking himeach run cause he assumes eveyrthing is validated and works

#

but I remember taht he used to create better plans 3 days ago with same context

boreal holly
lost drum
chrome raven
#

seems /goal is pretty nice

lost drum
# chrome raven seems /goal is pretty nice

I dont have it in vs codex extension and he already has my goals I think its jus matter of me testing the system with him I just cant inget the fact that its now testing it out haha

chrome raven
small violet
#

5.5 low is nice

#

it dosent complicate things

boreal holly
glacial shadow
gentle harbor
#

why is it black ?

#

is that a glitch

#

i love the black way more then grey

boreal holly
gentle harbor
#

windows

boreal holly
boreal holly
gentle harbor
#

wha ? why would i unplug usb devices

boreal holly
#

I mean if your keyboard and mouse are USB then try different keyboard and mouse, or just unplug everything but those two devices

glacial shadow
boreal holly
#

If you have USB headphones, bad drivers or a bad device can cause the graphics stack to do weird stuff.

boreal holly
glacial shadow
lean lark
#

( he's a bot, don't listen to him )

glacial shadow
#

Hopefully we see a flowing state of reasoning effort in future

gentle harbor
boreal holly
lean lark
#

click Retry.

gentle harbor
lean lark
#

Goblins ... and ChatGPT ... not related to this channel.

#

โ˜๏ธ only applies to tiers above plus

glacial shadow
unique spade
# gentle harbor welp

got the issue twice in a gptpro thread on the follow up prompt

that thread was odd from the beginning cause gptpro took 90 min oin the 1st task, even though in a different thread it did it s usual below 10 mins on a task that is not very different

gentle harbor
unique spade
#

yea at least chatgpt

#

codex seems fine

lost drum
#

my pro one is thinking now for 5min and telling he cant access zip

jolly lily
#

I honestly never thought the plus plan would be more than just "worth" it, hopefully they don't smack down the limits in the future

shrewd flint
#

what do you guys use codex for? i wonder if it can be useful for someone who isn't a developer / founder

shrewd flint
jolly lily
jolly lily
boreal holly
jolly lily
#

The limits are honestly very generous

jolly lily
#

Why tho?

unique spade
lost drum
shrewd flint
#

I'm doing a bachelor's in actuarial science and interning at a company where using AI isn't allowed, no idea how to make use of my codex

unique spade
shrewd flint
unique spade
jolly lily
small violet
shrewd flint
#

oh hell no this guy's here aswell

unique spade
boreal holly
shrewd flint
shrewd flint
#

it makes sense tbh, we work with sensitive data and one mistake can cost us a lot

unique spade
shrewd flint
#

i mean i'm an intern so i have less responsibilities haha

shrewd flint
#

research yes but no ai allowed according to my uni

boreal holly
unique spade
lost drum
#

nahhhhhhh pro model bonkers

#

Investigating potential cause of ClientError It seems I might need to create the files in chunks, though the command length looks fine. The ClientError could be due to CPU overload, but Iโ€™ll wait to see if the processes timeout after 10 seconds. Iโ€™ll give it another 15 seconds to see if they die naturally.

shrewd flint
shrewd flint
boreal holly
shrewd flint
#

it's just not allowed at all

small violet
#

how

shrewd flint
#

for any student

#

yeah

small violet
#

๐Ÿ˜ญ

boreal holly
#

Huh, I did my internship at the end of my bachelors

shrewd flint
#

I have a lot of issues w my attendance %

boreal holly
#

AI didn't exist yet (GPT 3.5 was not all that handy lol)

shrewd flint
#

due to my internship, but I wasn't going to my classes anyway so

small violet
shrewd flint
#

now I have something to blame it on haha

#

yeah

small violet
#

damn

#

career maxxin

shrewd flint
#

why would I do an unpaid internship

#

that should literally be illegal

#

the people I work with get paid a lot more than me though, about 10x as much

#

8x actually

boreal holly
shrewd flint
#

oh god

#

why would u

small violet
unique spade
# shrewd flint it's just not allowed at all

maybe it's not allowed to use in the papers/homework you submit

they can't disallow you to DYOR with it

university doesn't own you. they can only impose limitations on certain artefacts you are required to produce

boreal holly
#

Experience ๐Ÿ˜•

small violet
#

because its a requirement for grad and most companies offer unpaid

#

lol

#

paid internship in 1st yr is crazy

#

i hope i get a nice internship

boreal holly
#

What really sucked was doing 4 months internship while finishing degree working as a HVAC service tech full time. Still don't know how I made it through all that ๐Ÿฅธ

shrewd flint
shrewd flint
#

if only codex could attend my classes/lectures for me

unique spade
shrewd flint
unique spade
#

yea :)))

small violet
#

imagine being in uni pre gpt

#

i dont think id survive

shrewd flint
unique spade
small violet
#

who are u to decide that

#

lol

cedar skiff
#

I have a degree with honours never needed to do anything like that. Not deserved?

#

Yes?

small violet
#

i dont care

#

๐Ÿ˜ญ

shrewd flint
#

๐Ÿ˜ญ

#

this guy

unique spade
#

lol

cedar skiff
#

That's Mr Wimp to you

small violet
#

@bronze sable are u physicist

#

i changed it

boreal holly
small violet
shrewd flint
cedar skiff
#

phd in fine arts?

shrewd flint
#

a person called jane invited me with their referral link ๐Ÿ˜ญ

boreal holly
#

We get em a lot

cedar skiff
#

or maybe general humanities

shrewd flint
#

they had their whole dedidcated scam referral link it was so funny

unique spade
#

is that time of the day when this chat is mostly useless banter?

cedar skiff
shrewd flint
#

what are u guys talking about

#

no

cedar skiff
#

5.5 so good at working out how to get stuff going for skills that can be used by weaker models

unique spade
small violet
#

phd in bafoonery

cedar skiff
#

i have a converstation with 5.5 i have been working on manual app testing and it's been through 5+ compactions and just doesnt even matter. It still just works

unique spade
#

for example codex desktop instead of telling me when was each thread last used, it tells me when it was created

which is very useless info in finding the threads in each project i am actually active

small violet
#

i literally dont care

cedar skiff
boreal holly
#

sounds like you're stalking angel yang. sus

unique spade
# cedar skiff what do define as a UX/harness?

UX is just the interface, you can combine it as i'm sure you know with appserver via cli exec for backend

or you can also implement your own backend harness connecting directly to oai server upstream

shrewd flint
#

why are you lying?

cedar skiff
boreal holly
unique spade
#

and sorry, i thought you were Robert haha, he has his own ux

boreal holly
cedar skiff
# unique spade i like it too as an app but it s still good experience to craft your own

I spent a lot of time working on this sort of thing early on with claude code. I had so many ideas of what i wanted and how i could make it better, i spent weeks working on things that would improve my work flow etc. But turns out i wasn't getting my real work done.
This is a common path i see, if your fine spending the time great go ahead and do it. I think it happens with devs because this is our domain of expertise, so it's where we see opportunity. I reevaluated my ideas and focused on what is making me money. Which are all outside of the software improvement domain.

boreal holly
cedar skiff
#

It feels like a trap where the perfect solution is just a few more prompts away every time

unique spade
boreal holly
small violet
#

and im idle now

unique spade
#

or this tiny tweak. when i have gptpro do reviews which i copy paste to codex, in current surfaces i end up with huge replies of mine that i keep have to scroll

so i just made my own messages to keep open only 10 lines with the option to expand if i ever need it

#

also loading by default only last 10 turns

with the option to load more if i ever needed it

#

for me all these, win time , but sure they are tweaked on my own habits

boreal holly
unique spade
#

with only last 10 messages displayed by default, my chats load instantly though, even if they have hundreds of messages

torpid trout
#

Is it time for a meme?
Or are yโ€™all going to be nice to each other?

#

I leave this chat for 5 minutes and all hell breaks lose

boreal holly
unique spade
#

any of you actively using claude too?

#

how it "the other one" doing lately

torpid trout
boreal holly
torpid trout
# boreal holly

You read my mind.
I realized my latest meme wasnโ€™t as funny as could be, and was thinking of doing a physician scamming lol.
Guess you beat me

#

But the joke landed late

#

I get itโ€ฆ Iโ€™ll scale back on the memes ๐Ÿ˜…๐Ÿคฃ

oak trellis
#

I hope we donโ€™t have a anthropic wave coming in and make things worse lol ๐Ÿ˜‚

boreal holly
boreal holly
mental urchin
#

is there any claude/codex skill/plugin for cf

boreal holly
unique spade
mental urchin
#

cloudflare the company that protects like 20% of the internet

unique spade
frail meadow
#

I think they have their own skills, at the very least codex knows how to use the CLI really well and investigate the docs

mental urchin
#

lol yeah ik but cf is widely regarded to as cloudflare

mental urchin
boreal holly
frail meadow
frail meadow
#

Isnโ€™t there a cloudflare codex plugin? Iโ€™m not on my laptop atm

frail meadow
#

I just saw it too haha

unique spade
mental urchin
#

wait i might be missing smth

#

am i slow lemme check that out

#

holy there are like 90k skills but damn its not open source. could've forked it and deloyed it on my own domain

#

thx trippyprism

dark sun
shrewd flint
#

not saying you're lying but that's almost unbelievable

dark sun
#

It's 5:25

#

This is about half-tilt, if I go full-tilt I can use about 20% in 5 minutes, sometimes up to 30%

#

This is Claude btw

cedar skiff
#

Seems to be a common story now days

unique spade
high girder
#

Max 5x best value. Same with the gpt $100 plan. Both get you a lot more than what either $200 option would

unique spade
#

oai is orders of magnitudes better in this respect

maiden escarp
#

Is it preferred to use Claude Code and Codex together to build a tool or an app?

Or can use either one to build

unique spade
high girder
#

You can use either, but it comes down to preference. I use both

unique spade
#

currently my weekly workflow aligns quite well with the 100 plan so i am using that

high girder
unique spade
#

but my experiences in actual implementing in code what i asked opus were rather bad, so i am not using it for anything serious

high girder
#

I understand why opus 4.7 gets the crap that it does, but that's also a workflow issue for the most part. It's the change in model behavior that people need to adjust to.
Ideally, opus is excellent at holding and sticking to a well drafted plan, codex is a little monster when you let it execute the claude plan. It just goes to town and cleans up claudes mess

shrewd flint
#

what's ths physics phd lore I want to knoww

cedar skiff
# high girder I understand why opus 4.7 gets the crap that it does, but that's also a workflow...

This is a common thought in the claude code community. That we see much less of in the codex community. The idea is you can solve it with a better work flow, maybe it needs more detail or another layer, some extra guard rails etc. Lots of the conversation in the claude code channel is about how to wrestle opus to do what you want it to do.
Moving from claude to gpt for coding is not intuitive. The first thin that happens is you notice a mistake codex makes that claude doesnt - some idiosyncrasy that needs to be handled. This leads to a miatake in thinking that because claude doesnt make that mistake fit's better. But claude requires much more work to get results, the difference is people are already used to it's quirks.

#

codex and gpt are just better at coding. It used to be that it was really only if you know what you your doing with coding it was better, but with 5.5 they really closed that particular gap. 5.5 just gets it better now

high girder
# cedar skiff This is a common thought in the claude code community. That we see much less of ...

I agree, but I also think the use case comes into account there. I don't use either of them for just code. In fact, at code, I find codex is usually the better choice. Claude is much better at systems though. When you're working across multiple systems and keeping things managed, claude usually handles that much better than codex would. They both have things they're better at than the other.

frail meadow
#

Found another alt account

cedar skiff
high girder
#

I need to test more with 5.5 for the things I usually have claude handle then. 5.4 didn't cut it, but I'll need to try that soon

cedar skiff
#

5.5 digs more and finds ways to get it done

high girder
#

I usually use a claude parent session for multi agent work. spawns in claude, codex, and other local models as needed, but if GPT 5.5 can handle that, then I can try something else with claude

#

it's one of my favorite things about opus 4.7, is the ability to follow directions and keep them for the duration of the session

frail meadow
cedar skiff
#

5.5 is the best orchestrator of all the gpt models. I go to bed and run automated orchestration sometimes, and i always got up and it would have stopped and asked me something(5.4) 5.5 was still going, it worked for 12 hours straight in orchestration and it only stopped because codex app had a memory leak that start caused apps to pause on my system.

high girder
#

that's one of the things Claude was much better at. I guess I'm about to go test it out

cedar skiff
glacial shadow
#

What was he saying, what were people arguing about towards him?

cedar skiff
#

I read your message history @bronze sable it's all just spam

high girder
#

dude. It was a racist and chaotic mess. Leave it over there, keep this place clear. He also got nuked from the claude discord

cedar skiff
#

Wont take long for him to get the boot here either

quaint hazel
#

Has anyone found a way to make codex write less tests?

Bro writes 20 LoC on the feature, then 600 LoC on tests.

Currently running an autoresearch loop with the sole goal of reducing test LoC and so far it has removed 78.8% of the test code it wrote ๐Ÿฅฒ

quaint hazel
#

Yes, but just regular tests that a human would write, not 200 test cases

#

It's down 78.8% without losing actual test coverage. It's insane.

cedar skiff
#

I have a guard rail skill that is used anytime unit test are written that tempers it. has stuff like this in it:

## Core Rules

- Assert observable behavior: outcomes, state transitions, emitted effects, persisted changes, and externally visible interactions.
- Do not assert internal triggers, private mechanics, or incidental call sequences unless that interaction is itself the public contract.
- Keep assertions resilient to harmless refactors.
- Keep test-writing work focused on proving behavior; do not change production implementation solely to make a newly added or corrected test pass.
- If a correctly scoped test exposes an implementation bug, keep the bug evidence visible; do not weaken the assertion or rewrite the test around incorrect behavior.
- Keep test files concise and comment-free; encode intent with clear test, group, and matrix names.
- Keep touched test files analyzer-clean.
- Prefer the shared outcome assertions (`expectOutcomeSuccess`, `expectOutcomeFailure`) over bespoke outcome destructuring.

## Test Layer Selection (Cheapest Proof First)

- Prove behavior at the lowest-cost layer that can validate the contract:
  - pure logic: unit test
  - provider/notifier behavior: provider or notifier test
  - UI composition/semantics: widget test
- Escalate to a heavier layer only when a lighter layer cannot prove the contract.
- Avoid duplicating the same contract across multiple layers unless the additional layer protects a distinct risk.
- If the required proof crosses app/runtime boundaries and needs integration or emulator coverage, stop using this reference and switch to the integration-testing guidance for that stack instead of stretching unit/widget/provider rules to fit.
cedar skiff
#

I also have some stuff about using test harnessing so test are a little dry because it reinvents the wheel a lot.

#

Cuts down on the tokens and time taken to write and fix them.

quaint hazel
#

Thanks, I'll have a go at fixing it further thru prompting ๐Ÿ™‚

cedar skiff
#

To make the skill i got a session to write the tests for the same code over and over. It was get a subagent to write the tests, audit for my requirements, update the skill, remove the tests, repeat.

#

One thing you have to be careful of is that it doesnt start getting explicit in the skill updates. It might start making rules in the skill that are specfic to one test.
So you have to prompt to be generalised fixes that will work in all cases

#

If you get one explicit instruction, then it will take that as a green light to do that all the time.

oak trellis
#

5.5 is so good

quaint hazel
#

had it go through the changes and propose a plan to make it better

Gave me quite a few promising tips that it's now implementing

oak trellis
#

thank you openai for the great codex .. its sooooo good .. math .. speed

#

uff

#

just something the weekly limit something off but its ok .

boreal holly
oak trellis
boreal holly
oak trellis
#

got twitter API for 100x cheaper .. works flawless

oak trellis
#

yes was thinking .. is there a phone app for codex -> my server

quaint hazel
#

Robex was right there

boreal holly
high girder
oak trellis
boreal holly
#

It could get completely dog piled with messages and still stay focused

high girder
#

That's a problem I would love not to have as often. We'll see how it goes. My longest Claude session was about 10 days, and I had some errors, but not a ton. I'll give 5.5 the same time frame

icy hinge
#

Did codex usage just got significantly trimmed?

cedar skiff
#

what task takes 10 days?

boreal holly
high girder
cedar skiff
#

Well yeah, but for example i can say write these integration tests which is the goal, then there is like 120 task. or what ever

#

But you get what i am asking right?

nocturne folio
#

oh my gooodddddd 1500$!!!! whereeee

high girder
#

I get you. So the parent session I used for the agent itself, even when not actually doing anything, has it going. It's linked to my entire stack. So it depends on what I'm working on. I have local image gen and modeling workflows, physics work, and a bunch of other projects in between. The parent session itself ran for 10 days. Sub agents take care of individual tasks. The main thing the parent session handles is governance and continuity

#

The main reason I left it for 10 days is I forgot to turn it off and let it run on one of the other machines

#

it made for a good test though

cedar skiff
#

No i understand how orchestration works, i just want to know what you did for the 10 day session.

deep totem
#

codex 0.128 has new feature /goal, but I don't see it in cli, how to use it?

cedar skiff
#

with subagents i can see 10 days being 100kloc

boreal holly
#

I was gonna say I spend at least 2 weeks at a time on external integration work. Just now with 5.5 its one agent. Been doing QBO sync for 2 weeks straight. Rigorously testing it

high girder
#

Oh, literally bits of everything. I'm what you would call neurodivergent, so I do lots of things either at once, or some days nothing. I've been working on building out. Also, I don't actually use either claude or codex for code work or scanning PRs or any of that, unless I'm scanning something I'm going to put out. I'm also pretty toasted right now, if I'm being perfectly honest, so hopefully I'm making enough sense

cedar skiff
#

Oh, i miss took what you were saying as an automated orchestration session that went for 10 days.

#

I was like, if this is real i gotta get in on that

high girder
#

Ohhh. I'm not against automation. I just only let things automate once it's been manually done by me, and the acceptance output is continually up to standard. even then, evertyhing is logged and inspected.

oak trellis
deep totem
oak trellis
#

just a reminder: please openai .. do not change anything within 2 month .. if you wanna do the model better that is ok .. but do not downgrade the limits even if you cant afford the compute contract. Just make a new oracle deal .. send the stuff to mossad.. everyone does it anyway.

oak trellis
#

if they keep it stable for 2 month i buy openai stocks

#

sentimental right now i would dump anthropic and buy openai stocks

cedar skiff
#

i didnt notice any degradation since i joined up in 5.3

#

Rcok solid the whole time

oak trellis
#

i have paranoia .. hanged to much around cc and opus .. they did us the rollocoaster .

cedar skiff
#

It's just better i wish i had have moved to the light side sooner

pearl zinc
#

Why am I constantly getting? :

This chat was flagged for possible cybersecurity risk
If this seems wrong, try rephrasing your request or submit /feedback. To get authorized for security work, join the Trusted Access for Cyber program.

Frustrated.

cedar skiff
#

I know you probably don't need to.

#

But it makes life easier

undone patio
cedar skiff
#

I thought you got banned

pearl zinc
cedar skiff
#

slopus

undone patio
#

cant ban me