#codex-discussions
1 messages ยท Page 50 of 1
It's also a social coercivist, as i'd like to call.
wait.
That is new
They removed the usage menu?!
oh great so they broke something right when I implented the api
And now it is a dahboard ugliness? And yes, it is zero, and should not, clearly, one of my api keys is massive in usage and was reporting usage in the old screen for ... now years
there's a reason you need something like this in memor/custom instructions:
Does not want unsolicited advice, guidance, reassurance, suggested next steps, coping strategies, interpretations framed as what they should do, or suggested wording/scripts in emotional, interpersonal, imagined, dream, memory, trauma, or hypothetical scenarios. Do not provide 'you could/should/might say/do/try,' 'the safest goal would be,' 'a better/more accurate thing would be,' or similar constructions unless the user explicitly asks for advice, guidance, action steps, or wording. Stick to direct answers and interpretation only.
general-purpose
it could be made better, of course
Anyone tried the new Deepseek free model? legendary
it's basically the whole, "did i ask?"
oh well, guess its not just me then. I've just only had this for a short time, noob problems
what even is this new measuremnet dashboar dsupposed to be... a vibecoded insult?
lol I regret mentioning this now ๐ Clearly I missed something good
It was horrible before, but at least one was able to see what is used where and when
That's funny! I think if you simply replace the system prompt, so rather than reading like "you are a helpful assistant who assists the user in every conceivable way" it's "your job is to pass the butter." It totally avoids those things
this is for the web version
since the web version is a freak
Oh yeah, web version... I still have not tried it. Sounds frustrating!
I asked 5.4 how to change the system prompt about a month ago and it told me i couldnt o.0 i guess i should have pressed it to search harder
if i had feelings, yes, it would
It's just annoying in thought form
not utterly good but at least it was working, showing data
Now it is literally zeroed out across all years of usage, although the bars appear, it still says 0
Meh. Too long I let that account run unsupervised (I have api and codex on two different accounts)
Oh dude it has a huge problem with editing system prompts, even if it's for a totally non-codex agent lol. Also refuses to allow reasoning tokens to show up anywhere, even hidden behind developer/debug flags for diagnostic purposes only
the irony ... I only implemented API because images 2 is so horrible, and they probably broke that dashboard around the same time
I ended up just inserting some stuff via developer_instructions
It's not ideal because it could have conflicts
is the two different accounts for redundancy? I've heard stories of people having their regular accounts locked
Yeah, if you look at a rollout log you can pull the Codex provided, fully formed system prompt out of the first message and refactor it to fit your needs. For example I told em "you don't need to use the apply_patch tool if there's a better way to edit something. Python works fine." And these suckas do incredible amounts of refactoring with that simple tweak
Bro, you're my personal agent layer thingy infodump with all this stuff
Mine's been using python3 for a lot of edits, too, for some reason
also in middle of rebooting OABTW as OAL
Thanks for the heads up, one question - how do i look at the rollout log?
It'll be one inside ~/.codex/sessions/**/*.jsonl, and the first line typically has the base instructions
i should let them use python3 more for it
Cheers ๐
I think they put that "You must use apply_patch for edits" line in there because in Codex Desktop & VSCode, every apply_patch shows up in the UI with an undo button, so they want the agent to use it as much as possible. I see it as a hinderance. They're pretty clever with their python-fu!
hello codex gang ๐ซฃ
bit of a pickle im in, any easy way to get codex chats from one device over to another device, or letting codex that has the chat get evrything it needs, files, handover sheet etc, then downloading what it sends, and sending to the other device??
any help would be great gang ๐
never tried but you might be able to lift and shift the session directory
ask codex to make some sort of tool for EXACTLY this!
main codex is on ios and other device is windows ๐ซฃ
ios mentioned!!!
๐
๐ฅต
you mean macos?
i raise you... so much
yes
Additionally to copying the ~/.codex/sessions folder, there's also a file ~/.codex/state_5.sqlite which has metadata, like name of the agents or what have you
one is iOS game thing, other is android emu for reference
I'm over here on command line like its 2002
i need my file browser && stuff.
cannot imagine using terminal agents unless i had "vscode but terminal" equivalent
can i send you dm bro?
&& if you say neovim, i will personally force thigh-high socks on ya
no agents here, just codex. Old site, small codebase. I have cron jobs not agents lol
Notepad++ when?
|| even better: nano
sublime text
use nano like real people haha
no podcast?
A real chad uses sed and awk
somethings off here
Silent Hill 2 memes be off the heezy
those 150 discord notifs, yes
its a bunch of pending requests
how do people live with so many notifs? i'd shut stuff down if i had 150 notifs jfc
idk i dont care about it
I have 10,000 unread emails but nothing anywhere else >.>
what?
just dont go to ecom communities
o.0
e-commerce
I bet your phone notifications feed is endless
so what do when Codex team members just fake click bait like this
i thought today was super app day
no not at all I turned off every notif
This seems like it's true, codex app has browser, computer use etc now, i just expect they will keep iterating on it until it can just vibe code anything
Does changing this option make any difference?
Still no mobile app...
There still heaps of stuff to be built, including the models that run it.
Maybe check my profile
dont check their profile
Lol
looks like my weeky limit was moved forward without actually resetting it. (pro) โ Weekly limit: [โโโโโโโโโโโโโโโโโโโโ] 43% left (resets 06:23 on 5 May)
this should be day 3 from the last reset
anyone else see this?
I thought they reset last night
yeah the timing would make sense
my % left not so much
that makes this literally a reduction in usable credits vs no reset for me
Where do you live?
canada
Hmm interesting, I feel like not everyone gets the same resets or at the same times
I'm in US and got it around midnight Central Time last night I think
I usually get it at the same time as others, we'll see I guess
definitely check mines
Their techniques are becoming more elaborate ๐ฉ
made codex connect to claude using prive p[ythopn script private connecytion with user
What are you guys working on tonight?
What's the difference between this and using claude with the -p flag via codex?
someone make a scan of every message taht comes when seearching "codex" in here and make a complexed guide for new commerse here
and create a command like /guide
its out now bro
is this ragebait?
u do it.
10 upvotes and I do it
cleaning up a PR for an affiliate program landing page and stripe integration
guys, please help, my Codex windows cant use 'Browser Use' Plugins
how is the limits they coming to an end tomorrow ?
Iโm trying to use the Codex in-app browser automation through the browser-use plugin / browser-client.mjs.
The in-app browser itself is open and working correctly, and I can manually navigate/login in the browser UI. However, Codex cannot attach to or control the browser through the Node REPL automation bridge.
The failure happens when running the Node REPL browser setup code:
const { setupAtlasRuntime } = await import(".../browser-client.mjs");
await setupAtlasRuntime({ globals: globalThis, backend: "iab" });
The returned error is:
Node runtime too old for node_repl (resolved C:\Program Files\nodejs\node.exe):
found v22.18.0, requires >= v22.22.0.
Install/update Node or set NODE_REPL_NODE_PATH to a newer runtime.
Important detail: the system also has a newer bundled Codex runtime available:
C:\Users\yusuf.cache\codex-runtimes\codex-primary-runtime\dependencies\node\bin\node.exe
v24.14.0
But the node_repl tool still resolves to:
C:\Program Files\nodejs\node.exe
v22.18.0
So the issue seems to be that the Node REPL used by Codex/browser-use is not picking up the newer bundled Node runtime, even though it exists locally.
Question:
How can I make Codex / node_repl use the newer Node binary? Should I set NODE_REPL_NODE_PATH, update system Node, restart Codex, or configure this somewhere inside Codex/plugin settings?
now thinking buying codex or cc .. with all that new codex stuff
their limit reset tricks
codex is SOTA
which one you prefer right now ..
yes thought that too ..
ML/Fullstack
yes same here .. backend right now and then frontend
i like frontend more but its hard to not be fullstack these days
what about their limit you think they continue upcoming month ?
idk wym tbh
they have 2x limits for a certain time
but i think with the reset mess they doing .. its no more haha
why is it black ?
What? I'm asking which option I should select.
what is that?
looks like adobe?
Its just a theme... do what you preffer
Does it interfere with anything? That was my question.
No just other colors
Okay, thank you very much.

how does fast mode work ? does it just tell the model to hurry up or does it just give it more power ?
it gives you priority over other users which gets you tokens faster, but has double the usage
that's a lot of how openai seems to be handling their pricing now--with a single model and pricing depending on how quickly you want an output
oh ? thats it, then they should make a slow mode as well that saves tokens
are you on plus?
on normal mode
pro 5x
uh
or what ever the 100 usd one is
how?
no clue, 5.5 medium burns through rate limits
I can say that's definitely not true on the $100/mo tier
i thought 5.5 med would be about the same as 5.4 extra high but nope its still higher
I've been using it pretty heavily over the past two days and I'm in no danger of running out of usage lol
HOW
idk it just doesn't use that much?
granted a lot of what I'm doing requires waiting for things to build which doesn't actively use tokens
maybe its plan mode ? i do spam that alot, i use normal gpt 5.5 thinking to refine my prompts then i put it into codex plan mode and then send it, seems to give better results but kills my rates
maybe, I use plan mode sometimes but usually only once or twice per thread
i was using it for every message unless it was simple
I think maybe you just need to start making separate chat threads for things
yeah you really don't need to do that
granted I have opinions on things and already know how I want codex to do them
but a big part of effectively prompting is knowing how to communicate what you want
Yea maybe thats on me, i also have a issue with the ui, i know gpt 5.5 is lacking behind on that is Claude the best for ui right now ?
claude or gemini, though I would say no LLM is super great at UI design, they all have their telltale styles
kinda on you to bring the creativity in that regard
even if you bring them a template, they'll apply their telltale AI slop style if you ask them to use it in a design so if you do want something original, kinda just have to mock up the UX flow in figma yourself and give the model that
unless its a very opinionated UI library, that can help
im very bad at ui so that wont be happening sigh, this is gpt 5.5 after a quick 5 min prompt
just as a test at how good it can do it
doing effects with CSS is one thing, using those effects tastefully is something all LLMs struggle with rn
claude loves its purple gradients, gpt loves putting everything in poorly spaced squircles, gemini does wacky stuff which is cool but then has no ability to tweak it upon request
huh, it cut me off when im at 1% not 0%
for only 100 more i might get 200 usd plan
when i upgrade that would apply right away right ?
for what you're using it for, you could probably bump 5.5 down to low reasoning
it would
the 5 hour limit is so annoying im not sure why they dont let me burn through my full week usage at once
probably because it costs them a lot more if you do
you're getting subsidized tokens, so there are limits on them to prevent you from burning through them all in the same way that paying full price would afford you
spreading out the usage is cheaper for openai and also makes it less likely that you use 100% of your weekly usage
i wonder, how much less is 500 credits compared to a 20 usd plan ?
the $20/mo sub is roughly 150-750 credits worth per 5 hours
probably closer to 150
I don't think there really is a way to compare the credits to the plan limits though since they're calculated differently
sigh, time to wait a few hours until my 5 hour timer is up, should i wait until im at 0% weekly until i upgrade so i go back upto 100% weekly ?
considering you're burning through tokens faster than even the $200/mo sub would get you in a week, I would maybe take a look at what you're doing that's using so many lol
it would get you 4x more tokens for the week, but that is a jaw-dropping amount of tokens that you might not need to be using
actually the $100/mo tier is temporarily doubled so it would only be a slightly more than 2x increase through may 31st and then only 2x from what you have right now after that
im assuming its the over use of plan mode and the very long prompts when they dont need to be that long
Does updating the Codex version affect the MCP configuration of Supabase? Why do I keep having to re-add my Supabase MCP every time after an update?
They pushed an update after my complaint to them, now everything is working, simply update and you should be good.
When we will see codex available for Linux?
roughly ~2000 credits per week, at least this is what it roughly showed for me when I was maxing it on business before we started buying additional credits
Damn bro we have 5.5 , you donโt want the goblins to get you?
hmm 5.5 is truly better ? does it consumes more ?
Where have you been, 5.5 is truly better
ok ok .. i thought using 5.4 to save limits..
it consumes far more tokens as long as i experienced in return of that performance
its worth it ?
dunno exactly, but i guess it worths if what you are doing with it has risk in any form, not just project for fun.
yes very risky stuff
i am building several apps with codex just for my personal use so i dont risk anything perhaps codex does a mistake. i can just pull the latest commit of my project
then it will def worth it
thx so much for the feedback!
wow 5.5 is ultra fast
lol working entire day and only 4% of weekly used
please openAI do not change .. i know you going to bankrupt as your CFO said you cant pay the compute contracts anymore in near future.. just let me finish my project and let me print some coins ..
iam greatful for your service
Already paid one more time, but thnx
Anyone else has their Codex App on Windows struggling hard with the path when used with WSL2 agent?
5.5 always tells me that node, bun, npx etc aren't on the path, it can't see the MCP... I have to default back to the CLI for a full experience
Tell it to put them on the path
dude said same thing in anthropic server before the downfall
๐ญ
Started to use my own UX for codex. Maybe it's due to always loving to customize the software I use, but psychologically is so much better to be able to have all the features I want and need, where I want them
its supposed to reconnect, idk why its just abrupt
so I realized power of automation
with this I can do agentic work so easily
create like, an agent that manages my website
and such
API or codex server?
Using appserver via codex cli exec as default
But separately I'm building my own direct connect path to oai server for gpt subscription
I ve experimented for last half of year with custom functionalities done via codex fork
But it's too much overhead to keep fork updated to codex releases, most of which have 0 impact on my own functionalities, but they still change a lot the core crates so I end up having to re-align the fork every release.
So for the harness, I'm just switching to custom implementation from scratch
no like the codex app itself stops responding
Gptpro 5.5 is really fast
Asked it to prototype me a few design variants for my custom UX surface to also have a multiplexing view for multiple codex threads
It took it 2 minutes and they seem to be using the design I already use (it has access to the repo in that thread)
yes its ultra fast!
I might end up brainstorming with it instead thinking ๐
I could never use gptpro for brainstorming cause the time I had to wait for any answer was way too long. But things seem to have finally changed
YES
very fast
how is the usage on $100 plan
For gptpro? Not sure is a difference. I never hit a limit inside chatgpt, but is true I don t send gptpro tons of requests.
But right now, my mental model
Is that the key difference between the 2 pro subs is mostly codex qouta
Otherwise for practical purposes (inside chatgpt) they seem quite equivalent
very very good
alr
okay
I am planning to downgrade
on the x20 chatgpt is unlimited
Switching back and forth is easy
I did the same once 100 came out
I said I check that first. And in case I hit some limitations I'll reassess. Not the case so far
i code all day long 5% now LOL
hmm
sam hear me out .. if you there do NOT change antyhing this next 2 month..
The only difference they show in app between the 100 and 200
Is the 5x vs 20x generic more than plus
he probably has a codex agent reading the chat
100%
suck all the data and send to mossad
iam fine with it for now..
bro like anthropic also sucks data but doesn't really provide value thats why they are losing market
yes thats why iam saying not even mad there
anthropic sucks it and kick us when we down already ..
openai at least gives us gud service
better evil
hope elon lost in court too ... he the biggest leacher
yes haha
So, I like to write articles from time to time, hobby investigative pieces
I used GPT a lot for research, rewriting, etc
So I had this brillinat idea, you know. Codex could create me a full newsroom, basically an app that replaces BBC's offices, and lets me write professional real investigative pieces
he would like to use openai to pump is spacex scam..
But, life is a b***
they all try to get into sp500 .. retirement funds..
if they get there .. auto funds coming in
but we know how to trade hehehe
that is so crazy .. didnt know that .. as soon you hit sp500 .. its auto buy .. your stock
brez went crazy today
saw it .. ya wow
๐
predicted it (u know how)
i still need to read it didnt get there yet ..
was about to write you .. we can do soemthing nice with API endpoints those free ones
mind gets blown
already on it haha but the fre ones limit too much
there is a website what uses a lot .. tracking war and so on .. gud api aggregator ..
they track all infos .. related to war zones
clickhouse!
will dm you tomorrow .. building a profiler since 3 years ..
sure haha
cant explain but codex right now is insane .. long time ago i had that speed .. and accuracy .. wow
lucky i ditched cc
still so amazed 5.5 is so smooth .
cant believe today i used 6%
from total week
wth
yo what is the best approach to create android app cause I previouslyt thouthg its a good idea to ask gpt 5.5 pro to pormpt gemini in android studio to create whole app the issue is gemini is not as great as codex when it somes to executing prompts so I now think its better to tell gpt pro to build himself whole app in html and then somehow make codex make it an app but idk about the gradle building and other stuff that codex cant do within his enviroment (its just app for me not cmmercial use)
its crazy 5.5 just always update the docs now ..
before i had to beg opus ..
i have early cc feelings ..
i know it will disapear soon
If i was doing android only i would probably just use kotlin and compose. If you wanted it to be cross platform flutter would be my choice.
thank you
My UX philosophy i can now materialize with the help of gpt/codex
sounds like team of 20 to work on it
Day 3 of telling codex desktop team to remove comment in annotation
5.5 is so FAST
Yep, same here! the switch from 0.116.0 to 0.125.0 was almost like switching from Windows to Linux (different in every conceivable way under the hood, massive migration effort)
They must have hundreds or thousands of gremlins cooking away at light speed
ye best purchase
I just had the classic "I'll remove mention of the problem instead of handling it" bug. ๐
Just recently codex wisely told me that it fixed the bug instead of updating the documentation to justify the bug.
I was like... wowz. I appreciate it fam.
Did not know the other thing was an option really.
I could have used that today ๐
๐คฃ
I've seen this applied to tests by tons of green coders
"All the tests pass now!"
I've even seen it in fully test driven projects.
Or.. I guess in projects that were supposed to be fully test driven. ๐
Yeah, whenever I find a bug of any kind I make the worker fix it, prove it with integration tests, then fire up the simulator and manually follow the repro in the test to confirm it's fixed. That way I know the fix is real, the test actually tests for the thing it's supposed to protect against, and the behavior is locked in.
GPT 5.5 flags reading or fixing a captcha solver in my codebase as "This content was flagged for possible cybersecurity risk. If this seems wrong, try rephrasing your request. To get authorized for security work, join the Trusted Access for Cyber program: https://chatgpt.com/cyber"
Like bro..
I got that message yesterday when I wanted it to delete a bunch of build pipeline runs and I was feeling too lazy to do it manually (I was doing exploratory testing and didn't want to clutter the build pipeline history). I just did the ID verification, whatever, still annoying though.
you can just tell codex to sync to upstream and resolve any conflicts
it will just rebase it on latest and apply your "patches"/commits on top (there are also different ways)
I also tend to mention to codex right when forking something that we want to be syncing to upstream later and resolving any conflicts in the best possible way and to try to keep our changes compatible with syncing to upstream and to note all that in agents.md
then you just tell it to sync to upstream and resolve any conflicts any time you want to update
I fork a lot of stuff that way and it works fine to sync and keep your own changes and include new
I guess unless you really change it a lot/fundamentally
Guys, can you use codex desktop app on the free plan?
yes with very low limits
ah wym, so you canceled sub? either way mine starting working today after an update in case you were curious - just decided to let you know
I used 33% of my 7 day quota for my $200 plan on day 1 of the reset. I'm taking it easier and turned off Fast mode for now. ๐ข
we are buddies, i am also on 94%, weekly with daily 8hrs run
crazy feeling right ?
me coming from cc
yesterday 20 minutes in it ate 58% of the 5 hours and 10% of the week
that ended my sub ..
It will not always be like that.
As earlier said, I have a feeling their load balancers randomly mis-assign unsage to other accounts
Are you accidentally somewhere in the Southern American area?
It would be intersting to see where who is when they have high usage and low usage
I am in a Souther American area, so likely the load balancer sees me as either americas or latin america
In bali
can anyone help me there is no gpt 5.5
think so too .. i talk about anthropic CC.. i cancled that crap yesterday
its in the codex app but not IDE extension
Does anyone know what the codex rate limits feel like in comparison to the claude code rate limits?
I've been hitting my limits with claude code a lot recently, and I'd like to know what people's opinions are without paying $200 right away
oh, i'm on the claude max 20x plan
i used my 5 hour in like an hour
then i used $50 usd of extra usage in 10 minutes
๐
how big is your project and mcp etc
sigh
I daily Opus4.7 xhigh
the real question is: will i be able to finish this task without running out of the free api credits i got
if you want to extend the usage use the smart models for planning and the cheaper model for writing the code.
in theory yes, but if you want to keep stuff clean long term you want to check what needs updating. codex by default is great at patching, but if you keep do it like this after 10 releases your custom will be over the place in many core files
that's already what i do

or just use gptpro for high level design grounded in the code you already have which you upload as a zip
and then use codex to add or align the prototype to 100% aligned code
i only use opus in antigravity, but i think the overall consensus is that codex quotas are more generous.
granted right now we re still in 2x promo period, plus almost every week you get some reset from Tibo
you can start with the 100$ since it's literally half of the 200 plan insofar codex usage.
plus (20) plan is trickier to compare because they made some tweaks to the 5hour limit there and not sure how it compares with the 5hour limit on the pro plans
Lol this was really interesting. The worker basically suggested "archive me and spawn a new worker so worktree creation passes" and the orchestrator did it
Got one now lol
"Hey codex its not working"
"ok let me see ah user used the app wrong so let me adjust the code so user mistake is now no mistake anymore"
๐ซ
ctrl+c you have to tell me when I am wrong! enter lol
lol between 125 and 128
1015 files changes ๐
A million times better
But keep in mind thatโs 2x and essentially what a regular $200 plan feels like
Flippin crazy man ๐ฉ I just wish when they released new models they wouldn't be like "you must update harness to use new model"
I dont neccesarily mean that it will be only doing changes in separate files which patch the main code
(although you could also do that)
it just puts the commits over the top (rebase)
and GPT calls them "your commit patches" ๐
(alternatively you can also have your commits in between, proper timeline, but thats much more messy)
I still change/update everything and it merges/fixes/resolves conflicts during the sync I ask
or of course you can also tell it later to put it all together better in another pass
to "try to keep it compatible with syncing to upstream" does not exactly mean that it will only be making separate patches on top in the files
so far didnt have much issues with that and it works great (time to just fork literally everything open source and customize/fix/improve it lol)
but the whole point is that you can just have codex handle that, or also ask how to best set that way of working up
This is the first time Codex created me something I have no freaking idea how to use ๐คฃ
I literally need to ask it "guide me through the app because it is my first time using it"
Holy hell. This could probably kill newspapers in one go.
It allows you to run a whole editorial enterprise in one tab.
Happens to me when I have it build dev tools
Inclusive factchecks, research, analysts, yada yada yada
Ease of use and integration with other tools is important for that purpose
You gotta be like "pilot this web browser. Show me how to do this using the browser" haha and when it struggles go like "see? The answer should be obvious"
Yeah.. I am going to pitch that like so:
"Hey, this app is so powerful, I cannot tell you how it works"
Use computer plugin removed after update codex what should I do?
Windows or mac?
Mac mini m4
srsly tho
I also reinstalled but same issue
This is too much for me, I do not know where to start, I will go back to asking GPT to write me the article lolol\
Hmmm, check Privacy & Security settings, make sure codex has screen recording and accessibility enabled. See if there's a new feature flag in the configs or app settings
How the fact ๐คฃ
I was using use-computer it was working by these settings but it happened after updating
Gotcha, sometimes after an app updates, macOS makes you re-enable the security settings. That's my first suspect.
Either that, or part of their update was relocating how to enable computer use. Like the button exists somewhere new now
You know what, try enabling
[features]
apps = true
plugins = true
I have them disabled, and for me it says computer use is unavailable.
Yea sure I m doing my stuff with codex lol
Use computer and codex still in privacy settings enabled but after updated it shows plugin not available
Where to update these lines
~/.codex/config.toml
does disabling apps also disable the annoying codex_appsMCP?
Thanks let try this
Afaik, I have apps = false and my agents do not have that MCP
@torpid trout Those are some seriously impressive screens. You may recall my ongoing mantra here about using AGENTS.md to direct the assistant to document code functions, and to generate and maintain user documentation for all features. That would include the README, features.md, detailed docs for each primary feature/screen, and developer documentation to explain how things work. This is not only good for us to understand what's going on, and for users - the assistant gets to read the relevant documention in each thread, significantly reducing the time to process changes. Otherwise the assistant needs to learn the app with every new conversation - that's wasteful of time and tokens, and significantly increases opportunities for errors.
that is baked into my agents, yes
yet even so... I mean, you know, I am not a freaking journalist lol
Hence my earlier meme
Bsaically for once I bite off more than I can chew lol
Now I want the same thing for Developer biz lol
At least there I will understand the domain-specific terms haha
The other day I had that same moment : I don't understand what it just did for me. Sometimes I don't understand the challenges it notes when it finds something wrong and I need it to explain ... in my own project.
I suspect those of us who care to discuss the projects with AI will all have these moments from time to time and the weirdness will become the norm.
wheres 5.5-cyber at
Isn't cyber a separate product?
I think you can request access to it
yeah its separate with id check
aah
That this here:
... yeah you faster. There was also a discord channel, but cannot find it now
before it was called something else, not cyber, but security or so
https://openai.com/index/introducing-aardvark/
Introducing Aardvark: OpenAIโs agentic security researcher
Now in private beta: an AI agent that thinks like a security researcher and scales to meet the demands of modern software.
March 6, 2026 Update: Aardvark is now Codex Security, and is available as a research preview.
[ those flashes happen when people post BC stuff and other ads or off-topic images ]
I am sure we had a discord channel for that?
cannot find it ๐
Also I did never complete ID on my account... but I can access that thing in the cloud, or at least was able to, under the PRO sub
Is that over?
I am appreciating the fact that it deems my memes as on topic.
Codex Security works with connected GitHub repositories through Codex Web. OpenAI manages access. If you need access or a repository isnโt visible, contact your OpenAI account team and confirm the repository is available through your Codex Web workspace.
I'm surprised but pleased that your cat meme survived filters. ๐
Its because codex creates them lolol
So this is distinct to aardvark?
Ardvark was renamed to Code Security.
guys what does feeling codexy mean
im new to this
lol
i got plus sub for like 1 week
maybe that someone was feeling like using codex?
Where did you see this new term?
Release incoming
He usually says it when thereโs a big codex update on the way
Hmmmm, that could indicate a feature drop.
More likely it just means Tibo is getting more use out of the platform today.
We get into a groove using these tools...
Dang, so it was not model release, it was codex release ๐
5.6 still pending then grin
It's probably gonna be some big feature like "upload the picture of the back of your head, Codex will generate a picture of your face with 90% accuracy"
is he gonna reset the limits
I will try the back of something else immediately, if this drops
thats. not smth for codex lol
people say goblins or smth
or usage rest
Daily dose
LMFAOOOO
Just please don't ask about resets. ๐
reset wen
oh why
reset wen
A bit young for discord aint you mate
nope (:
๐คฆโโ๏ธ
im allowed
no one planning to fam
we all hope for them, but nobody can affect when it happens
well the reset wen photo ):
hows that an offense lol
It's posted here every day
Did you drop your humour somewhere?
Sorry deeply if said meme offensed you, I will duly report it to the machinery that created it
ty
I know what you mean by that ๐
As a buck-toothed brainless troll, I was feeling a bit insulted here by that meme too...
Yeah, I had a feeling
people use watches on their right hand?
lefties, yeah
i thought it was "if you RH, then watch on LH, && in reverse"
oh, so i didn't think wrong?
you thought "right"
i thought some people in my country telling me i'm wearing my watch wrong was true
but am right-handed, so left-hand watch
u do u
Adds weight for more impact during fisticuffs
i do mostly what people tell me to do
I just do what ai models tell me. ๐
Actually, if I did, I would be in so good shape. Mentally and physically.
That's dangerous - Like a politician who believes when ChatGPT tells them how to write new policy to defend against AI ๐
hehe, yea.
But I do get some good advice most of the time. Especially when it comes to health and cooking.
It's just that I already know it all. But AI don't care. AI halp anuwai!
still same issue .. Computer use plugin unavailable
You should run
/Applications/Codex.app/Contents/MacOS/Codex
In the terminal, which will launch the GUI, but you may see errors being logged in the terminal. Also if you open the Console.app (macOS built-in logging tool), you may be able to find official logged errors from Codex GUI pointing to the real issue
in 0.128.0 js_repl has been dropped, good or bad? and does code_mode works the same?
Here you go bro, I enabled apps & plugins, and I had to manually install it in the plugin marketplace. See if there's an add button
If js_repl is only used internally as a tool then I don't think it matters. If the JS is surfaced, or if we have the ability to inject a dependency into the call, then I'd consider it to be a concern.
I'm curious to know as well what this feature even is and why it's better than opening a terminal
I believe js_repl is only used for internal calcs and determinant processing.
It gave a persistent node backed js runtime with top level await, local imports, package imports, helper access, and reusable bindings
the model can run js logic to call multiple tools, transform output, loop, filter etc... while staying inside one tool flow.
I think I prefer giving the agent a jupyter kernel for stuff like that
Hello, I have a question, please. Does the ChatGPT Go subscription have better limits for Codex than the Free subscription?
It seems to me that it is very Jupyter-like but only for internal use by Codex. Since Codex can do this with Python, I'm thinking they're deprecating the internal dependency.
Interesting question.
this /goal how can we use it?
I don't think so, @halcyon ingot The pricing page does not show a detail item for Go between Free and Plus : https://chatgpt.com/plans/go/
Because I'm looking for an affordable subscription to do some development work in my free time.
I mean with jupyter they can replay the steps they took to do something and you can see the entire sequence + state of each variable & add markdown cells. I'll probably keep that repl disabled
Many of us do all of our professional development with Plus, with no requirement for additional credits.
anyone else not seeing /goal command?
same issue
Js repl is sort of a browser tsting thing
Thatโs gone now!?
Guess I didnโt update yet since it still used it here just a minute ago
Probably one of them released but not released features
well, it's just an internal tool - again, if it's surfaced for our usage then I'd like to see docs and understand how it was intended to be used. I can think of some applications, but don't actually "need" them right now.
Probably have to press esc 4 times to bring up the goal menu ๐
The model uses it to test what it created
Instead of headless chrome
At least thatโs what Iโve understand so far looking at it when it used it
That makes sense... again, for determinant content.
So if it generates JS for us it should be able to build/transpile that code just to verify it. If it can't do that I'd be annoyed. That said, if the Code functionality in 5.5 is now so good that it doesn't need to build code to verify it, I'm kinda OK with that too.
I wonder how it works with sandbox enabled haha
it asks a million times for permissoon
unless you have the guided permission on
I gues they left again
They did so before, due to Phisicyans
I'm pretty sure codex with unified_exec=true already has repl. They just launch node, then pass
cat << 'EOF'
let hw = "Hello world"
print(hw)
EOF
to stdin, and it stays alive even after they finish their turn. They just keep passing stuff into stdin on that process and it keeps repl'ing their commands. No need for the extra feature, already obeys the sandbox
I'm looking through the repo for notes about js_repl. It's all very interesting but lack of available time forces me to avoid such pleasures.
https://github.com/search?q=repo%253Aopenai%252Fcodex+js_repl
Yeah, that one
i doubt he has a phd
Phisicyans are scammers
Oh yeah? The physics phd guy was selling something?
I just remember him enforcing the "be noice" rules
codex for cheap
let me show u
new codex update might be the best one yet
which one
am I missing osmething
bro is behind
ye I jsut use the vs codex extension so ye I might be
goals feature was added to Codex as well, for long running tasks, like a ralph loop
im jk i also dont know what the update is
dosent everyone do this
Actually the Physics PhD guy seemed to note channel decorum only after I did. I think he was mimicing or being facetious.
He disappeared after insisting on some detail at OpenAI that I verified was completely incorrect. As in HHGTTG, he disappeared in a puff of logic.
Buy from me
i think he left or got banned
time for ๐ฅช ๐ฅฃ ๐ฅก โข๏ธ
yum
he just got payd to advertise that service
whats the last thing
can i ask u a question
yes
if u had to pick between these 2 names for your software dev startup would u pick 88tools.net or crasule.com
and if u had 88tools.net would u rebrand to crasule.com
whos paying him to do that
depends what you selling
i have whole playbbok about positioning your brand
a book?
ye it has like 33 chapters
Probably realized it was a dead end and went back to pressuring old ladies into buying lots of gift cards
and youve read everything
he's stupid for trying
i thought 5.5 cyber was going to be generally available
bro thinks he's mythos
Hey guys you have to do codex --enable goals
to activate it ๐คฃ
ye I made it
is it free
codex cli
its just mine its not for sale
oh u made it for urself
why would u make a book for urself
๐ญ
ggs guys
i havent seen codex cli in so long
88tools seem chinese
the name atleast
go to offtopic
ok
I need to schedule, dedicate evening time just to keep up with Codex features. ๐คฆโโ๏ธ
yup lol
how do you get access?
is it better or what?
Better for cyber security
I miss the commit button and diff view from previous update..
now I have no idea what it did without clicking in the UI and im stuggling to find the commit button
@teal cargo @left pecan Any chance to get regular updates in #codex-updates for changes to Codex NPM, Apps, Web, or Extension? There's no unified "Codex" place to go for the latest info - info is all over the place or simply non-existent. Thanks.
From the latest Codex Extension update:
"Introducing" Fast mode : The feature that's been in the product since v0.0 that most people should actually disable to avoid having their token quota gobbled up.
lol i used imagen for this
I dunno when this happened but it looks like after I updated to 11.13.0 the permissions for Codex changed from Full Access to Default. Did I miss some notes in here about that?
i dont think full access was ever default but the introduction of auto-review mightve reset it back to default? if thats what you are wondering
I'm not saying full access Was default. I have a good sandbox so I have it set for current use - at some point something changed it back to Default (limited perms).
Whoops.... Looking at config.toml, the one project I just opened is not listed with a trust_level. That must be the issue. ๐
Hmm, no, that's not it. Oy. Will look later.
Anyone know if GPT 5.5 is available on AWS Bedrock yet ? On OpenAI website they said that yes but on AWS there is nothing at all.
๐
if openai acquires github i think they would fix the uptime
oh that's cool, the codex desktop linux project has computer user support (for people who do have access which I don't yet :c)
Honestly unlike Opus 4.7, GPT 5.5 seems to spend such a little amount of the model limit
where do you get this from
Jane was not banned. Nor did she sell anything (not on her anyway)
Jane leaves when the "newcomer" badge disappears and eventually comes crawling back and pretends to be a newcomer
That is a smart move, allows for more tailored ragebaiting.
I might need to adpat that
Leave every day and rejoin and drop an offeffeffeffensive memememme 
uwu ๐๐ is opus better than gtp?
And then somebody comes in and says "use gemini"
it's ust better over all, it has it's own issues, but they are much more reasonable than claude models
slopus
Yeah, as always, hopefully OpenAI sustains the large limits in long term
If they are any smart, they scoop up all users, lock them in, then slam the prices
they are squeezing it as well, they are just a few steps better than anthropic, i mean 5.5 cost more than opus per million.
But it's also more efficient right?
not too sure, the usage limits are better one way or an other
Each token is higher quality
Definitely
Yes
oh
Use medium
More decisive
Medium rare
he just seems to reason less than before now it feels like 5.4
I am suing a mix of medium and high on 5.5, I really so far not once used xhigh with it, completely forgot it exists.
Let me try.
hello codex create me a meme...
I removed xhigh so I don't accidentally select it, that's how useless it is
lol
wdm why
I dont get it at all
I'm not researching an entirely new branch of physics, or solving the mysteries of the universe, so xhigh is worthless
Did you ever talk to one of those guys who always overthinks it?
So, thats what xhigh can do to code.
You ask it to create a quick edit in docker compose file and it comes back with an assertation as of why this is not a new chatbot app
๐คทโโ๏ธ
I feel like talking with not thinking model at all when using xhigh like idk he seems to not go deep enough when I tell him too
You sure you use 5.5
Because i can agree with you there on 5.3 model
5.4 was useless throughout the entire range, no matter what
I mean maybe I was on stages in my project where it was more to finish and now teh polishing steps are blicking himeach run cause he assumes eveyrthing is validated and works
but I remember taht he used to create better plans 3 days ago with same context
I mean idk what you're using codex for but if it's software engineering then rest assured, it thought very deeply about everything you prompted it, even if all it did was change the button from blue to green
idk that system so complexed that I wonder how the hell it works
seems /goal is pretty nice
I dont have it in vs codex extension and he already has my goals I think its jus matter of me testing the system with him I just cant inget the fact that its now testing it out haha
i think only avliable through cli for now
I still gotta muster up the courage to do low. I hear it's a lot faster!
Were you finding xhigh to trip over itself on simple tasks and take too long
windows or mac?
windows
Unplug all USB devices, restart computer
As a matter of fact, I found it performed the same as medium, but took way longer on each task and was not decisive.
wha ? why would i unplug usb devices
Hardware interference. It's a real diagnostic step
I mean if your keyboard and mouse are USB then try different keyboard and mouse, or just unplug everything but those two devices
Are you saying med is more decisive
If you have USB headphones, bad drivers or a bad device can cause the graphics stack to do weird stuff.
Yes, medium is more decisive. Especially during times of troubleshooting, it will decide to run tools to gather more information, whereas xhigh would think deeply about what tools to run
Yes listen to him, he knows his stuff, unplug and restart
Interesting
( he's a bot, don't listen to him )
Hopefully we see a flowing state of reasoning effort in future
welp
There's another knob I haven't tried: "verbosity". But it seems to do well with medium reasoning and default verbosity in every situation I throw at it
click Retry.
just deleted my message and started talking about random things
Goblins ... and ChatGPT ... not related to this channel.
โ๏ธ only applies to tiers above plus
Ya turn that up when youโre trying to learn something
got the issue twice in a gptpro thread on the follow up prompt
that thread was odd from the beginning cause gptpro took 90 min oin the 1st task, even though in a different thread it did it s usual below 10 mins on a task that is not very different
im guessing servers are being abused heavily right now
I agree
my pro one is thinking now for 5min and telling he cant access zip
LOL
:))) yea you're right
I honestly never thought the plus plan would be more than just "worth" it, hopefully they don't smack down the limits in the future
what do you guys use codex for? i wonder if it can be useful for someone who isn't a developer / founder
isn't there a temporary 2x limits haha
I myself use it to enhance my engineering/software progress as programming being my main field
I think it would've been worth it even if the double limit event weren't active
GPT Pro does not have the entire set of tools that regular GPT has. You should have GPT operate on the zip, and switch to GPT Pro when you need it to think about the contents
The limits are honestly very generous
you're lucky
haha
Why tho?
you can use it to automate anything you want
how would I make gpt operate thuth files while make gpt pro think about them in normal chatgpt.com
I'm doing a bachelor's in actuarial science and interning at a company where using AI isn't allowed, no idea how to make use of my codex
no bro, gptpro has no issue with zips.
i give it full repo and ask it to add proto-modules. and it's all fine
what it lacks is stuff like rust etc
CS students must be feeling like they're in heaven rn.. @jolly lily
do you have rational cognitive workflows that also include in some manners interacting with digital data?
cause if yes you can delegate a lot of them|
from doing research on topics for you, readin/replying emails according to your prefereces, operate various apps on your computer
Um honestly it's sometimes the quite opposite, only using automated workflows actually weakens the ability of solving problems
what kind of company is this bro
oh hell no this guy's here aswell
one that will go bankrupt soon ๐คฃ
Interesting. It's been a while since I tried it but maybe it's gotten better
i think it depends on the way you use it, i feel like AI is so much better in coding/cs than in as
no ai rule is still stupid
it makes sense tbh, we work with sensitive data and one mistake can cost us a lot
that s all work from the repo zip
i mean i'm an intern so i have less responsibilities haha
not really
research yes but no ai allowed according to my uni
Oh I see so you're at capstone
use it as DJ then have it play youtube for you
since you are not allowed to do advanced cognitive stuff with it ๐
nahhhhhhh pro model bonkers
Investigating potential cause of ClientError It seems I might need to create the files in chunks, though the command length looks fine. The ClientError could be due to CPU overload, but Iโll wait to see if the processes timeout after 10 seconds. Iโll give it another 15 seconds to see if they die naturally.
what does this mean
haha, claude does this for me already
The end of the undergraduate degree program, so can't use AI
no I'm a 1st year
it's just not allowed at all
๐ญ
Huh, I did my internship at the end of my bachelors
I have a lot of issues w my attendance %
AI didn't exist yet (GPT 3.5 was not all that handy lol)
due to my internship, but I wasn't going to my classes anyway so
is it a paid internship
why would I do an unpaid internship
that should literally be illegal
the people I work with get paid a lot more than me though, about 10x as much
8x actually
I did... for 4 months. At a company that was not going to hire any of their interns lol
alot of ppl do this
maybe it's not allowed to use in the papers/homework you submit
they can't disallow you to DYOR with it
university doesn't own you. they can only impose limitations on certain artefacts you are required to produce
Experience ๐
because its a requirement for grad and most companies offer unpaid
lol
paid internship in 1st yr is crazy
i hope i get a nice internship
What really sucked was doing 4 months internship while finishing degree working as a HVAC service tech full time. Still don't know how I made it through all that ๐ฅธ
honestly we dont learn that much during lectures etc, it's so annoying to be forced to sit there and learn nothing
wow
if only codex could attend my classes/lectures for me
no one "forces" you. you enrolled to uni out of your own will
well it's mandatory, even though I could get good grades without them
it s ok all of us who went through university went through that lol
yea :)))
maybe i just have adhd
we all have, switching before various codex threads all day long is a sure way to get it haha
I have a degree with honours never needed to do anything like that. Not deserved?
Yes?
lol
That's Mr Wimp to you
Shared subscription kind
codex maxxin kind
the scam one?
phd in fine arts?
a person called jane invited me with their referral link ๐ญ
We get em a lot
or maybe general humanities
they had their whole dedidcated scam referral link it was so funny
is that time of the day when this chat is mostly useless banter?
It only takes one
5.5 so good at working out how to get stuff going for skills that can be used by weaker models
once you do your own UX/harness, vanilla feels so lame
phd in bafoonery
i have a converstation with 5.5 i have been working on manual app testing and it's been through 5+ compactions and just doesnt even matter. It still just works
for example codex desktop instead of telling me when was each thread last used, it tells me when it was created
which is very useless info in finding the threads in each project i am actually active
i literally dont care
what do define as a UX/harness?
sounds like you're stalking angel yang. sus
UX is just the interface, you can combine it as i'm sure you know with appserver via cli exec for backend
or you can also implement your own backend harness connecting directly to oai server upstream
why are you lying?
ahh right you mean making your own app layer?
I'm really happy with codex app.
Agreed ๐ช OpenAI really cooked when they made the app-server. Straight from the heavens
i like it too as an app
but it s still good experience to craft your own
and sorry, i thought you were Robert haha, he has his own ux
Tbh mine does not look nice - design is not my strong suit - but the performance and memory usage is what made me do it. Also removing features I don't want and putting in stuff I do
I spent a lot of time working on this sort of thing early on with claude code. I had so many ideas of what i wanted and how i could make it better, i spent weeks working on things that would improve my work flow etc. But turns out i wasn't getting my real work done.
This is a common path i see, if your fine spending the time great go ahead and do it. I think it happens with devs because this is our domain of expertise, so it's where we see opportunity. I reevaluated my ideas and focused on what is making me money. Which are all outside of the software improvement domain.
Oh dude, I did the same thing. Since August 2025, I had like 2 other frontends and harnesses that ended up consuming all my time and providing no value (except I guess a learning experience). The latest iteration, I have not touched it in weeks. It just works! Finally happy with it
It feels like a trap where the perfect solution is just a few more prompts away every time
the small usability wins are what i enjoy
for example working on 2 or 3 projects at the same time
i'd rather be able to see those chats side by side in one workplace
instead of having to keep hopping from thread to thread like in 2010
I just have 1 chat view, but my bridge caches up to 50 messages so they load instantly when I switch threads ๐
or this tiny tweak. when i have gptpro do reviews which i copy paste to codex, in current surfaces i end up with huge replies of mine that i keep have to scroll
so i just made my own messages to keep open only 10 lines with the option to expand if i ever need it
also loading by default only last 10 turns
with the option to load more if i ever needed it
for me all these, win time , but sure they are tweaked on my own habits
Oh yeah, you know the app-server supports structured outputs? I was gonna set it up so if an agent takes screenshots, the agents add em to an images array of absolute paths and the frontend shows em
with only last 10 messages displayed by default, my chats load instantly though, even if they have hundreds of messages
Is it time for a meme?
Or are yโall going to be nice to each other?
I leave this chat for 5 minutes and all hell breaks lose
I think meme time is on the horizon
You read my mind.
I realized my latest meme wasnโt as funny as could be, and was thinking of doing a physician scamming lol.
Guess you beat me
But the joke landed late
I get itโฆ Iโll scale back on the memes ๐ ๐คฃ
I hope we donโt have a anthropic wave coming in and make things worse lol ๐
Well bro, if you were here when stuff was poppin off it would've been ๐ฅ
It's only a matter of time ๐ฌ
oh those are annoying af
is there any claude/codex skill/plugin for cf
I remember when the night time robert would tell em to take their A/ discussions to #ai-discussions
what s cf?
cloudflare the company that protects like 20% of the internet
it s good you didn t just say C then :)))
2 letters in the alphabet standing as acronym can mean a lot of stuff
I think they have their own skills, at the very least codex knows how to use the CLI really well and investigate the docs
lol yeah ik but cf is widely regarded to as cloudflare
hmm i gotta find those quality skills
Codex knows precisely how to use cloudflare CLI without even looking it up on the internet or reading the help. Might even be built into the training
You ever been on skills.sh ?
i d imagine is built in too
Isnโt there a cloudflare codex plugin? Iโm not on my laptop atm
lol
yea i was on my pc so when you said you aren't i jsut copy pasted cloudfare codex plugin to help u haha
damn no
wait i might be missing smth
am i slow lemme check that out
holy there are like 90k skills but damn its not open source. could've forked it and deloyed it on my own domain
thx trippyprism
I can use my entire opus 5 hour limit on the 20x plan in about 15 minutes
how?
not saying you're lying but that's almost unbelievable
I started working at 5:20, when I sent the first message
It's 5:25
This is about half-tilt, if I go full-tilt I can use about 20% in 5 minutes, sometimes up to 30%
This is Claude btw
Seems to be a common story now days
that's pretty much unusable for any serious work... max 20x should at least give you the basic of being able to have your own pace
Max 5x best value. Same with the gpt $100 plan. Both get you a lot more than what either $200 option would
oai is orders of magnitudes better in this respect
Is it preferred to use Claude Code and Codex together to build a tool or an app?
Or can use either one to build
well for gpt, 200 is pretty much 2x100 since you get double codex usage
You can use either, but it comes down to preference. I use both
currently my weekly workflow aligns quite well with the 100 plan so i am using that
Yeah, I meant instead of paying 200 for one plan, buy the 100 of each and get the best of both
ah ok. yea that's fair. assuming you do find value in both similarly
i m sparsingly using opus for review work via it's availability via google sub in antigravity
but my experiences in actual implementing in code what i asked opus were rather bad, so i am not using it for anything serious
I understand why opus 4.7 gets the crap that it does, but that's also a workflow issue for the most part. It's the change in model behavior that people need to adjust to.
Ideally, opus is excellent at holding and sticking to a well drafted plan, codex is a little monster when you let it execute the claude plan. It just goes to town and cleans up claudes mess
what's ths physics phd lore I want to knoww
This is a common thought in the claude code community. That we see much less of in the codex community. The idea is you can solve it with a better work flow, maybe it needs more detail or another layer, some extra guard rails etc. Lots of the conversation in the claude code channel is about how to wrestle opus to do what you want it to do.
Moving from claude to gpt for coding is not intuitive. The first thin that happens is you notice a mistake codex makes that claude doesnt - some idiosyncrasy that needs to be handled. This leads to a miatake in thinking that because claude doesnt make that mistake fit's better. But claude requires much more work to get results, the difference is people are already used to it's quirks.
codex and gpt are just better at coding. It used to be that it was really only if you know what you your doing with coding it was better, but with 5.5 they really closed that particular gap. 5.5 just gets it better now
I agree, but I also think the use case comes into account there. I don't use either of them for just code. In fact, at code, I find codex is usually the better choice. Claude is much better at systems though. When you're working across multiple systems and keeping things managed, claude usually handles that much better than codex would. They both have things they're better at than the other.
Found another alt account
5.5 closed the gap on things like computer setup, things like setting up open claw with profiles and what not on a headless server. that stuff used to be in claudes space more. Claude still holds the crown for research, documentation and communication style work.
I need to test more with 5.5 for the things I usually have claude handle then. 5.4 didn't cut it, but I'll need to try that soon
5.5 digs more and finds ways to get it done
What kinda stuff?
I usually use a claude parent session for multi agent work. spawns in claude, codex, and other local models as needed, but if GPT 5.5 can handle that, then I can try something else with claude
it's one of my favorite things about opus 4.7, is the ability to follow directions and keep them for the duration of the session
One thing I will say is Iโve had much more success with long time horizon tasks using gpt-5.5 on xhigh mode as opposed to medium or high
5.5 is the best orchestrator of all the gpt models. I go to bed and run automated orchestration sometimes, and i always got up and it would have stopped and asked me something(5.4) 5.5 was still going, it worked for 12 hours straight in orchestration and it only stopped because codex app had a memory leak that start caused apps to pause on my system.
that's one of the things Claude was much better at. I guess I'm about to go test it out
This is the screenshot i took when i got up, it went for another 4 hours 0.o
#codex-discussions message
What was he saying, what were people arguing about towards him?
I read your message history @bronze sable it's all just spam
dude. It was a racist and chaotic mess. Leave it over there, keep this place clear. He also got nuked from the claude discord
Wont take long for him to get the boot here either
Has anyone found a way to make codex write less tests?
Bro writes 20 LoC on the feature, then 600 LoC on tests.
Currently running an autoresearch loop with the sole goal of reducing test LoC and so far it has removed 78.8% of the test code it wrote ๐ฅฒ
Do you still want tests?
Yes, but just regular tests that a human would write, not 200 test cases
It's down 78.8% without losing actual test coverage. It's insane.
I have a guard rail skill that is used anytime unit test are written that tempers it. has stuff like this in it:
## Core Rules
- Assert observable behavior: outcomes, state transitions, emitted effects, persisted changes, and externally visible interactions.
- Do not assert internal triggers, private mechanics, or incidental call sequences unless that interaction is itself the public contract.
- Keep assertions resilient to harmless refactors.
- Keep test-writing work focused on proving behavior; do not change production implementation solely to make a newly added or corrected test pass.
- If a correctly scoped test exposes an implementation bug, keep the bug evidence visible; do not weaken the assertion or rewrite the test around incorrect behavior.
- Keep test files concise and comment-free; encode intent with clear test, group, and matrix names.
- Keep touched test files analyzer-clean.
- Prefer the shared outcome assertions (`expectOutcomeSuccess`, `expectOutcomeFailure`) over bespoke outcome destructuring.
## Test Layer Selection (Cheapest Proof First)
- Prove behavior at the lowest-cost layer that can validate the contract:
- pure logic: unit test
- provider/notifier behavior: provider or notifier test
- UI composition/semantics: widget test
- Escalate to a heavier layer only when a lighter layer cannot prove the contract.
- Avoid duplicating the same contract across multiple layers unless the additional layer protects a distinct risk.
- If the required proof crosses app/runtime boundaries and needs integration or emulator coverage, stop using this reference and switch to the integration-testing guidance for that stack instead of stretching unit/widget/provider rules to fit.
There is a whole to more that i built over time. but the goal was the same as yours
I also have some stuff about using test harnessing so test are a little dry because it reinvents the wheel a lot.
Cuts down on the tokens and time taken to write and fix them.
Thanks, I'll have a go at fixing it further thru prompting ๐
To make the skill i got a session to write the tests for the same code over and over. It was get a subagent to write the tests, audit for my requirements, update the skill, remove the tests, repeat.
One thing you have to be careful of is that it doesnt start getting explicit in the skill updates. It might start making rules in the skill that are specfic to one test.
So you have to prompt to be generalised fixes that will work in all cases
If you get one explicit instruction, then it will take that as a green light to do that all the time.
5.5 is so good
had it go through the changes and propose a plan to make it better
Gave me quite a few promising tips that it's now implementing
Tell it to write less tests
thank you openai for the great codex .. its sooooo good .. math .. speed
uff
just something the weekly limit something off but its ok .
<=5.4 models, I would archive them for being totally broken after a while. This 5.5 orchestrator has been cooking non-stop for days without so much as a โwhoopsieโ. Zero drift, infinite compactions.
cooking like nobody else .. i start to get annoying because its like a drug lol.. knowing they will do something later .. pump and dump emotions hahahaha
Yeah bro I just keep adding tasks. Itll have like 6 agents concurrently working on 6 completely different things, adversarially review their work, tell em to fix stuff, keep em busy. Even if a lot of steers come in all at once it never leaves business unfinished
beautiful .. what you using the codex app ?? iam still on their marketplace app in cursor IDE ..
got twitter API for 100x cheaper .. works flawless
custom build ๐คช
ah damn nice! so you couldnt find certain features you needed ? oh its on the phone ?
yes was thinking .. is there a phone app for codex -> my server
Robex was right there
Yeah, I just like messing around. If you built it yourself you know precisely how it works and it makes ot easier to use
well hot damn. I got 5.5 running behind my system for a bit to see how it goes.
doing the same .. was building own claude code market place app when i didnt like the first cl last year ..
With 5.4, If I had like half a dozen agents running, and 3 of em message the orchestrator at the same time, after a while it would handle one of the 3 and let the other 2 go idle. 5.5 literally does not lose track of anything
It could get completely dog piled with messages and still stay focused
That's a problem I would love not to have as often. We'll see how it goes. My longest Claude session was about 10 days, and I had some errors, but not a ton. I'll give 5.5 the same time frame
Did codex usage just got significantly trimmed?
what task takes 10 days?
probably many tasks, one agent!
it wasn't a specific task, it was my agent
Well yeah, but for example i can say write these integration tests which is the goal, then there is like 120 task. or what ever
But you get what i am asking right?
oh my gooodddddd 1500$!!!! whereeee
I get you. So the parent session I used for the agent itself, even when not actually doing anything, has it going. It's linked to my entire stack. So it depends on what I'm working on. I have local image gen and modeling workflows, physics work, and a bunch of other projects in between. The parent session itself ran for 10 days. Sub agents take care of individual tasks. The main thing the parent session handles is governance and continuity
The main reason I left it for 10 days is I forgot to turn it off and let it run on one of the other machines
it made for a good test though
No i understand how orchestration works, i just want to know what you did for the 10 day session.
codex 0.128 has new feature /goal, but I don't see it in cli, how to use it?
with subagents i can see 10 days being 100kloc
I was gonna say I spend at least 2 weeks at a time on external integration work. Just now with 5.5 its one agent. Been doing QBO sync for 2 weeks straight. Rigorously testing it
Oh, literally bits of everything. I'm what you would call neurodivergent, so I do lots of things either at once, or some days nothing. I've been working on building out. Also, I don't actually use either claude or codex for code work or scanning PRs or any of that, unless I'm scanning something I'm going to put out. I'm also pretty toasted right now, if I'm being perfectly honest, so hopefully I'm making enough sense
Oh, i miss took what you were saying as an automated orchestration session that went for 10 days.
I was like, if this is real i gotta get in on that
Ohhh. I'm not against automation. I just only let things automate once it's been manually done by me, and the acceptance output is continually up to standard. even then, evertyhing is logged and inspected.
/goal make 10k within 10 days
Enable it via toml
got it, thanks
just a reminder: please openai .. do not change anything within 2 month .. if you wanna do the model better that is ok .. but do not downgrade the limits even if you cant afford the compute contract. Just make a new oracle deal .. send the stuff to mossad.. everyone does it anyway.
if they keep it stable for 2 month i buy openai stocks
sentimental right now i would dump anthropic and buy openai stocks
wow
i have paranoia .. hanged to much around cc and opus .. they did us the rollocoaster .
It's just better i wish i had have moved to the light side sooner
Why am I constantly getting? :
This chat was flagged for possible cybersecurity risk
If this seems wrong, try rephrasing your request or submit /feedback. To get authorized for security work, join the Trusted Access for Cyber program.
Frustrated.
Have you verified?
I know you probably don't need to.
But it makes life easier
I thought you got banned
HAHA no thanks. Back to opus...
slopus
cant ban me
