#codex-discussions
1 messages Β· Page 4 of 1
Has the new codex drop Sam mentioned happened yet?
All I want is 5.3 on the api so I can use it at work
time to upgrade to alpha 11
yes
where is the mobile app, probably stuck in review
I might risk the upgrade to alpha... might give me access
can't use cursor/codex? both support no-training for businesses.
npm install -g @openai/codex@alpha
join us
same
we use an ai gateway to proxy requests so we give everyone flexibility
Sam, you cant do this. Telling us about auto-software, and then not give it to us
Different topic: I just had an experience where a GH repo that was authorized for access to Codex was suddenly de-authorized. I found out when Codex was reporting a 403 from GH. I don't understand this at all. Either the DB at OpenAI or GH got corrupted, or there is a timer somewhere. GH security log does not show anything related.
Does this resonate with anyone here?
Different topic? Good luck with that π
feel like cursor solves this problem plus some.
Wait, so this is less accurate than 5.3-codex on low?
its like flash models so yes they are built for speed and comes at a tradeoff
yeah, fast but less accurate
not to mention, direct API users lose out on subsidized credits.
they are probobly intended to be sub agents whenever 0.100 drops
Does anyone ever use low reasoning? Like who is willing to trade it being wrong more often for it being wrong faster?
More like how explore in claude cli uses haiku
okay, for subagents it makes sense
Yes - for making requests that don't really require judgement. Returning values from code etc when I'm feeling too lazy to look
Problem is the time it takes to switch models takes longer than the time saved for those types of requests.
Even in the CLI where I can hotkey a different model, I seldom do.
5.3 high plans and steering and helping when 5.3 spark hits a wall... until then it hammers until it works.
Am I the only one using xhigh? I know it is only .8% better in the benchmarks, so I guess it isn't worth it?
usually default to high ngl, its perf balance for me performance/speed
also we have infinite rate limits basically during the 2x promo till april 2 so i dont think using xhigh is a issue now
I have closed and re-opened codex 10 times in the last 10 minutes. I need to touch some grass.
same the 2x time it takse to work is annoying
is spark, high?
ask codex to write code to check codex till it sees the codex spark model and then send you a mail
I use mostly high for complex stuff. havent tried xhigh yet tbh. High was slow enough for me lol
I don't know but it really thinks its rug ties the room together
now we just need the mobile app
spark is its own model?
It's a nihilist. It believes in nothing!
alright this is getting out of hand
I'm glad you got the reference!
I wanted to say something about Donnie but would've got automodded lol
Is Snake game the new "Hello World"?
You're out of your depth, Donnie!
always. lit always
I also use only xhigh
only xhigh
So codex is obviously moving towards something where you'll use xhigh in /plan mode and then spin off a bunch of -spark models to implement.
Copying Anthropic's homework as usual?
Rollout will be slow and very capacity constrained.
slopus 4.6 fast is 2.5x faster at 6x the price
this is 15x faster and they are not even making us pay for it yet (separate rate limits)
codex -m gpt-5.3-codex-spark
anthropic getting cooked on all fronts, both open source competition and US private
still waiting for the codex update to get pushed :'(
Aha! v0.100. That will do it. Installing. I know it. It has to work
u can put any model name u want, it will still use 5.3-codex
doesnt work yet...
ah fair enough, my apologies π
Damn, it didnt work
@finite sorrel There was a recent commit that meant it doesn't matter what you put as the slug, it'll still work with the base model
RUST_LOG='codex_api::sse::responses=trace' codex exec --sandbox read-only --model gpt-5.3-codex-spark 'ping' 2>&1 \
| grep -m1 'SSE event: {"type":"response.created"' \
| sed 's/^.*SSE event: //' \
| jq -r '.response.model'
this command will tell u the real model, u can periodically check to see if u have access :D
tyvm
Unrelated to current discussion: VSCode output window always getting messages like these:
[CodexMcpConnection] cli:} message=
ERROR codex_core::rollout::recorder: Falling back on rollout system
WARN codex_core::state_db: state db record_discrepancy: list_threads_with_db_fallback, falling_back
"Request failed conversationId" / "Failed to resume conversation conversationId" : "no rollout found for thread id"
No clue what any of that is talking about yet.
Time to set up a terminal on a seperate monitor running:
while true; do RUST_LOG='codex_api::sse::responses=trace' codex exec --sandbox read-only --model gpt-5.3-codex-spark 'ping' 2>&1 | grep -m1 'SSE event: {"type":"response.created"' | sed 's/^.*SSE event: //' | jq -r '.response.model'; sleep 0.5; done
gonna burn your rate limits spamming your model checker
2x mate :p
tru
codex increase my rate limits. make no mistakes.
Imagine - it finally switches and then I've used up the weekly limit
"That didn't work. Try again"
β¨ is the new π
codex -m gpt-5.3-codex-spark
still uses normal 5.3
codex -m gpt-5.3-codex-you-can-put-anything-here-and-itll-still-respond
Lies
Codex in VSCode suddenly really adamant about not working. Error messages are useless "submit failed" :
[error] [Composer] submit failed cwd=/opt/codex/repo/my_repo error={} followUp=local mode=local
Ok, I think I need to do a Pull Request and fix this, because it is kinda laughable π
crazy
gpt 6
He doesn't even need to type a prompt. gpt-6 just knows
You open your computer and everythings there
gpt-6.7 when
Cyber verification is very simple, get picture of ID and then of yourself to verify that you're the person on the ID. Invasive, but they think you're working on security, so required.
yes im working with security stuff but that's weird anyway
i thought it would redirect only specific prompts
nop
(and also 5.2 isn't really less capable than 5.3-codex, and in some cases even more capable, so this redirect is weird)
Until you verify then you're back in business
i wish i had my passport with me atm π
anyone see the new model ?
What is this spark thing
cerebras model
I am afraid of the model they will release on April 1st... They have the potential to enrage the whole internet
That is too much power for one company
Is it just a faster version? Im just wondering what it does different, like what does 'spark' mean
crazy
Look at your token usage for 5.3 compared to 5.2.
smaller, 15x faster model
less capable tho
pretty much that, except actually good
Initially the impression I got was that "spark" was somehow tied to games/graphics. π€·ββοΈ
Hmmm Iβm in on a pro subscription and on v0.100 and I donβt see the model Iβm assuming itβs being rolled out?
@drifting granite Get in the queue. I saw it first. Dont cut the line. Or you might get biten.
They seen my crocs and jean shorts and moved me to the front of the line
Honestly I have a great use case for my agents with spark. Effectively a middle man agent for summarizing long tool usage (obv not read)
I found it
βοΈ "10" π€£
you gotta wwait for summe 26 for gpt 10 obviously
this kind of annoy, I am seeing X new and pull the latest binary and nothing happend π
you can set the model on plus too
codex -m gpt-5.2-codex-spark
but i'm quite sure its not working
I prefered to use gpt-10
codex -m gpt-10-codex-spark
we can unlock any model we want
Hey - I made this for you. It can materially improve ui from Codex. Explain what you want clearly and the style and you'll get great results.
It creates an image first - tries to build the ui to match that image and does another pass to check parity
Helps if I link to it
I would rather not give my id to openai
dw it's probably in their training data already anyway
For better or worse, they're not getting the ID.. It's a third-party that just confirms to their client that we've been verified.
My id is the least sensitive info i have given them..
that doesnt make it better
I don't want to go off-topic but that makes no sense. If your beef is with OAI and they don't get info, then there's no issue. If your beef is with security in general in today's world, then we're completely in agreement. But for now we need to do what's required to get what we want.
my beef is with you actually
@versed fjord
I don't want anyone to have my id if that makes it clear
@plucky halo in frontend, we are not still there π
i'd rather a company have my id than the government tbh
It's a step up from the white on navy default haha.
that makes no sense
governments suck
...but they issue the ID...
they made your id π
They must do it with their eyes closed.
Anyway, i think we're veering off topic!
GUYS
It's go time
lucky, not there for me yet
Snake game literally in 7 seconds
One more enemy
See you guys later!
you never played snake? π
all the plebs are talking about snake games
but here's the real unlock
(jk about the pleb thing LOL)
Woah
mfw neither of my two pro accounts have it
I hope spark can dry up my tears
You felt the fear of the banhammer the second you pressed SEND
nah lmao
i just didnt want him to actually get offended lol
this is the unlock.
the smartest models provide the clearly outlined implementation plan broken down into individual tasks
you dont need the smartest model to implement
only to plan.
spark will kick butt at this
We might be getting way too spoiled, you know? I have been trying to fix the bug of β¨codex -m gpt-7β© and it's taking more then 10 seconds, and Im getting annoyed π€£
"make app pls"
"continue"
"continue"
"app no work"
"pls make app work"
That is the future of AI
missing the most important one
make no mistakes
future? This is me daily
still no codex app update
Nope - just shows as 'custom'
uh that is the today of AI
π
i think you got into my clipboard somehow
AYYO anyone has access to spark yet?
everyone on pro has it
i just got it
yep
i'm on pro plan, updated codex mac app + codex cli
got it
try this
rolling out
Everyone except me, then
YESSIR
5.3
yeah my bad
Man, that's a dbugger. Hopefully you get it soon
see told u skill issue
here's the other thing no one is talking about.
is it a seprate limit
yep
the benchmarks are lower yeah? but you know what we dont see? a mini model
if this is a replacement for mini...
ow i still don't have it
intresting
codex -m gpt-5.3-codex-spark
if this is a replacement for mini, at the same price, its literally amazing.
idk if it is tho
it wont be the same price lol
I am surprised cerebras even has enough capacity for all codex pro users
Dont troll
?
@hushed storm That command gives the impression that it is working, even if you dont have access to the model
(im writing a PR to fix it)
lmao
(I am writing == Codex is writing)
how did you get that tho
@vestal monolith skill issue, so they say
it triggers a refetch of models
they dont accept external prs lol
@hushed storm the welcome box shows that model being used, even if not accessible
@hushed storm Yes, they do. I already got one accepted months ago
I've got it π
senpai pls help T.T
they changed it
how generous tho
Curious
People be opening Github issues to get access
even very fast on xhigh damn
So is spark a new model. Is it a dumb model. For the sake of being thst fast? If so why use it?
its fast
that why you use it
Why wpuld I use speed if its gonna ruin all my code.
it wont of course
HOLY F this is fast! O_o!
spark is out!
smart, base model to plan
spark to execute the plan
Because you might feel the need
The need for speed
Sub agents, maybe thatβs why it has a separate usage limit
Codex is literally taking more time on reading files, than on inference
My hard drive has become the bottleneck o_O
like tooooooo fast damnnn
Idk if yall coders or not but. I dont think you guys realize that if it produces more errors in the codebase because its speed over detail. That just gives me even more work to go back and debugg
sub agents will be so damn sweet with spark
have you used this model?
Anybody have access to this model?
can you set the subagent model tho?
I thought it just uses the current model for its subagents
@frosty zealot All the pro users have access now. It is 100% rolled out
I feel like smoke testing with mcp's with spark will be sweet
its weird man.
it doesn't always follow
i think subagents use parent model
I do not want too until I see proof in other people's large coding tasks
not good, cant make flappy bird w medium in one shot
What about the pricing?
Why your doing medium level reasoning
they used it in the demo vid
huh I get it in the cli but not the desktop app
wow its so fast.
using it in the cli is like holy moly
im running the ol' make pacman test on it.
Let's go!
Update is out for Codex Mac App
niceee
Hell yeah lets go
I picked the right time to hop over from Claude
OpenAI be cookin
they're always anime profile pics
No offences @chrome raven you seem pretty normal π
Ooof the Pacman test was rough lol
Didn't one shot it, and broken gameplay. thats okay def isn't the main use case.
xhigh?
Yes, on Spark.
I had to tell it not to use the existing codebase from other models lol
which version?
@vestal monolith
switching all my subagents to spark rn
lmao my ATH was few days ago - 800m opus 4.6 and 150m codex 5.3
https://github.com/PanicIsReal/codex-webstrap
I created a wrapper for the MacOS desktop client, it lets you run Codex App in your web browser, I made it cause I'm usually remote from my machine quite a bit, and didnt want to use the CLI cause of the creature comforts of the app, this solves that problem for me.
it not having image support sucks alot
"Did you try GPT-5.3-Codex-Spark yet?" lmao they really think it's that easy to try $200 + chance it actually rolls out to you
this is the first version of them on a new stack, let them settle
Actually? Damn now I need to actually decide if it's worth $200 a month lol
In theory it pays for itself in a couple hours ig
It has a problem right now... itβs really slow
Really?
I think it's running on special hardware, maybe they need to scale it up more
it... is.... fast...
lol
and do nothing
WHat exactly is Spark?
Fellas... I am seeing the "Context used %" dropping... like a rocket losing fuel
What if you give it a real coding prompt?
It can drop 3% in-between eye blink
i gave
its only 128k
its 128k
@cedar bear It is codex going BRRRR
so much better then the "old" version?
after I stop it i want it to run again.
its 122k context window
@cedar bear Depends what you mean with "Better". It is definitely faster. DEFINITELY
Will be interesting to find out if the quality is similar for most common stuff
what's the difference between Codex and normal GPT?
codex for code
Codex is specifically for coding and terminal usage
i think the main difference is that codex can control your computer.
I often want GPT to look at a couple source files but not specifically from a repo
idk how to do that
oh really?
yeah
can you do that on the free mode? lol
yes
I also got Plus for free this month
Just with the CLI or with the Codex App too?
both
Codex app too, in the app you basically use the folder as your project
codex app just a GUI, maybe makes you feel better.
yeah
ah
So its not bound to repos anymore?
Yeah it lets you run stuff locally
you can, by using git
Codex app is much nicer than the terminal version imo
uhh
So bad that im a windows user atm π
You can use repos but it's not bound to it like they were asking
But the web version is bound to repos right>?
You can run the terminal version on Windows I believe
true
We need some benchmarks and specs on spark
Would Codex respect my GPT instructions on how to format the code or you have to re-enter them?
Freaking 100% context filled up so fast
yes
Hmm never used the cloud version so not sure how it works, I assume it uses repos
Ah I see why it has a 122k context
mhh yes to re-enter them or it will respect them?
Is the codex 5.3 spark only in CLI ?
both
CLI and IDE ?
Ah yes, doesnt include IDE i guess
Rolling out today to ChatGPT Pro users in the Codex app, CLI, and IDE extension.
rate limits are quite low compared to normal 5.3-codex ngl
i tested..... 5.3 works better for me
i did some code... not working. fallback to 5.3 and it see what was the bug.
Ohh, im on my Plus personnal account
So if i switch to my pro account, it'll be there?
i guess so
Alright
i dont really use extension so not really sure
fwiw this model is not for coding.
Idk why they advertised it. It should be for using/interpretting agent tools.
I am also mildly disappointed so far , really strange , also on xhigh
oh it's editing my files?
it's a small model too
yes it will
It's got some valid use cases, but if your expecting workable code then you will be dissapointed.
mainly because the performance is not actually better. if it is just simply faster, I do not know what scenarios it would be suitable for.
thats what i thoguth
Because I don't have it. ?
For a lot of repetitive tasks that don't require the model to be brilliant
Seems like a simple lightweight version of Codex 5.3
i dont have spark in my cli bruh
pro plan? or update the app
its not much dumber and its WAY faster
Is github intermitently lagging out again
honestly the model is good for what it is: a proof of concept of the speed the openai/cerebras collaboration can achieve. and it would be very useful for a lot of things, but the rate limits are just too low right now.
As long as it's seperate from the main one I don't really mind
exacccctly
Is this only available for the Pro plan? I have the Plus plan.
Yeap only pro plan for now
Yes, pro Only for now
yes only pro for now
yea no timpressed with spark...
sure its fast but im getting more errors in my code than with normal codex
Nothing can be perfect. Faster work means more errors and inconsistencies, just like humans to be honest.
actually I quite enjoy the slow speed, because i can spend the time on other things
yea. all of these people that want these models to go super fast shows they have no idea what they are doing
is codex spark as good as normal 5.3 codex but just faster ?
it is not better
@simple star How about now? I've pushed an update to the skill :p
Darn, from the video in the X post in #codex-updates I was getting the vibe it was better
But I guess not
me? what?
lower performance but faster
^^^^
\
ok ahah yeah f it
coooking with spark
That looks promising ! Is this an homemade skill by you ?
bascially its codex 5.3 mini with a new word attached to it
Genuinely works weel
better than my typing anyway
so with good frontend skill codex is good at frontend
gpt 5.3 codex x-high built me a pokemon game.... that works
this looks cleannn nicee
it should
it isn't for me. i keep having to compact myself lo
cli? or app?
cli
it auto compacts.
It's worse. But it's much faster than anything you've ever seen
Apologizing to OpenAI for saying there were no release notes for Codex.
https://developers.openai.com/codex/changelog
I've been having a lot of issues with fast updates from 0.98/99/100/101. (mostly with stale thread entries)
I hope things level off.
ahhh i changed my model auto compact in the config in the past, so my threshold was higher than the new context limit lol
Yeah and the new model is context hungry. Bet that was frustrating
Oh it was lmao
making a bootable os with ui from scratch ate 3% of my weekly quota, outrageous
Im compacting context like a madman
thats good, although codex is insane at memory so its not as needed as with everything else
so this -spark... one only gets it if is a special snowflake? Or how does it work? π
Since it is not available in the /model command even on 0.101
its compressed 5.3 thats not MUCH dumber while WAAAY faster
yeah, i saw that, but... still, its only on twitter, not in the real world for me
Pro users atm
And apparently some API customers
Ah I see.
Anyway, does not seem very promising anyway lol - fast but more nothing useful I guess.
It's being rolled out to users on the pro plan. If you're not on that, you won't get it until it's ready for release.
You can just build thingsβfaster... sometimes... sometimes I just fail at doing anything at all π£
Any word from the OAI team on 5.3 for API users?
Iβm desperately wanting this, not necessarily faster inference (5.3 was already 25%!!!)
Depends how you use it. Create a comprehensive plan using 5.3-codex xhigh. Make sure you work through it properly.
Hand off to Spark for quick execution. It's not a model you'd use without proper instruction
Guess what, look at the background... this app is totally unrelated, monitors a btc node, made with guess what. It clearly is not creative lolol
In a few months we will not only be able to tell AI code by reading the code but by looking at the UI
So in other words that skill is meaningless, and GPT 5.3 has a very strong opionion on design ahahah
currently taking about 10 times to create a PR... and it's denying letting me add anything to it either. Just fun good times is all.
What was your prompt?
Also how about not talking to people like an utter cockwomble?
"create an app for monitoring my BTC node, design is up to you, framework too"
1 prompt btw
No idea what you mean by the utter cockwomble - literally in this channel you cant talk bad, so, no idea what the f you mean π
base 5.3 isn't great a creativity, why would the smaller version be better?
bootable os
use kimi for ui
kimi as well
also opus cant 1-prompt a bootable OS like codex just did
minimax 2.5 looking solid too
honestly
they released all kinds of stuff today and the algo wont even let you see it all
I was gonna say the Codex people must be doing drugs, to be able to release this fast. But I am starting to believe they are all bots.
This speed of development is just incredible
it's neither. They just use Codex to develop (Codex). And thus they're much faster than any other SH
When you see how projects like Grafana have AGENTS.md in the git repo, I am not surprised if the entire codex is created with codex/gpt
@torpid trout Mine was quite different. This skill isn't just about "creative olololol" - it ensures WCAG 2.2 AA standards are respected. That there is always a mobile-friendly version.
I made it to support someone else. If it doesn't work for you, that's absolutely fine. But you don't need to be so caustic
lets be real, its all slop
Do you guys think spark will eventually come to the Plus plan?
doesn't make it bad though
Until you see devs that make this style on their own as it looks nice
6 months ago it couldn't do any of this
what is?
Nah, you do not understand what I mean, go back and read the message, I guess. Or, let it be - no one is attacking you here.
all of it lol
The next big barrier for models is getting good at UI
they already are
you can always tell. some better than others
@elfin summit they are "meh"
but not for vibe coders
Or at least they'll have an API that's cheaper than $200 a month?
You can use gemini3pro to plan your UI visually, then turn them into assets, and implement the UI with codex5.3
@scenic umbra Eventually, yes
These are great, just need to give them the right tools
nanobananapro is amazing
nice
Gemini3 is better at UI, but also not that great. A true understanding of aesthetic and UX is not quite there
@torpid trout
The entire way you phrased that was meant to belittle and put down. For literally no reason. Someone was trying to help someone that was struggling with Codex's UI skills. And you first thought was "fc this guy - I'll try to make feel as little as possible"
ahahah lolol guess what
OK master, if you say so, it must be so!
That's completely different to "I don't think your skill.md is doing what you're intending. I got the exact same background"
Hope you can see the difference and that you have a lovely day π
Let's not be Debbie Downers @torpid trout - let's be friends
it does understand aesthetics well
but you gotta be the judge
Yeah, I see, no one likes to hear how their creative creation is not so creative after all.
But... all good man. No probs, was just a background being 1:1 identical - what a coincidence π
what a π€‘ he never said it was creative. he was trying to help the guy get better outcomes.
which is true
you can guide the model to better outcomes.
does that bother you?
his UI looks WAY better than yours
maybe its a skill issue
It's absolutely not that at all. My creative creation was literally just prompting gpt-5.3 a few times.
What's not nice is the fact that anonymity on the internet sometimes causes people to be absolute not-very-nice-people for no reason
π€·ββοΈ
jesus effing c
Anonymity? You are the guys hiding behind cryptic names lol
one of you was trying to help someone
Yes, my real name is Montgomery
And Montgomery alone
Cool. Do you want to go on on the topic which for me was closed about 20 minutes ago?
More than happy to
DAMN IT MONTGOMERY
anyway its all love
codex released soemthing else today too
new sandbox toys
Anyone try spark with playwright? is it any good at browser?
Yeah thats the variant i use⦠im on windows though so it always gives me trouble tbh even with the strong models
what trouble
so can i use this on Windows? or do I still need to WSL + bubblewrap?
I gave it its own chrome profile and had it starting in non-headless mode and connecting via CDP, but the daemons always down and the agents like always trying to reinstall chromium. Itβs a nightmare.
i wish i knew bro. i'm on linux.
it should
is it just me or does spark end up in an infinite loop of read until it fills context, compact, read until it fills context, compact, ...
did you ask the agent to help you troubleshoot?
yes lol. i have like a 2000 line md from it testing things and documenting its learnings on all the quirks... still sucks.
that sucks. there's another one you can try
okay well 2
first of all, google's computer use model is badass.
and then there's anotehr called browser use https://t.co/qBFmPUsw72
like the anti gravity browser? i do use that sometimes
it runs on 2.5 flash lmao
and?
I even sometimes have codex drive claude extension and do it that way i just only have $20 claude plan
i dont think its even flash. its its own model, and its very good
a bit token hungry and slower
it does everything step by step by step
i use it to automate all kinds of things that dont have apis
lol its not selectable but its usable?
it wont work
Good Afternoon I currently Host my Website on my Game Servers Dedicated Machine and I am updating it manually by Sudo Nano into it from my Terminal. I know there has to be an easier way using codex > Vscode > Github to update the website Can anyone advise me on how I can do this?
through anti gravity or do you have some code going around it?
what if spark is a myth
LOL GIVING GEMINI ACCESS TO PC
just ask codex to help you build a script
oh interesting. ill give it a try!
yeah try it. hopefully playwright behaves a little better this way
if it works it works... all i know is im not having very much luck with agent-browser... i dont think who ever is building it is very focused on windows quirks.
oh
you can use browserbase too
if playwright doesn't work for you
You wanna prompt it just like you do a normal LLM. I'll show you a sample.
He's just changed and manipulated time. He now gets 21 days in a week.
lol no the model itself is from oct 25 but the tool i shared was released relatively recently
it doesn't matter, its not rocket science
the model doesn't need to be SOTA to click mouse button and scroll
Why people donβt just use chrome devtools mcp is beyond me
ill give them all a try lol. i know playwright cli came out a few days ago too i havent tried that yet either... they're likely all the same base code just different ways of doing the commands. i see gemini in that blog post about computer use links to browser base: https://gemini.browserbase.com/ so they're likely all very similar in approach.
got em.
I have a script that automates ad scheduling for me, it iterates over a list of dates.
People still use mcp??? jkjk
I prompt it just like this
Objective: Check for login, find the first 'earn' ad for the current month, and reserve it.\n\nVariables:\n* [CURRENT_MONTH]: The full name of the current month (e.g., "November")\n* Task Steps ---\n\n1. Check Login & Navigate\nOnce the initial page loads, check if it is the login screen.\n* IF it is the login screen: Pause for user email input, then pause for user password input. Once the user submits and the main landing page loads, proceed to Step 2.\n* ELSE (if already logged in): Proceed directly to Step 2.\n\n2. Main Landing Page: Find Placement\nOn the main landing page:\n1. Type 'earn' into the top-most search textbox and submit.
test your workflows to make sure they work right and they are pretty much set and forget from there
Heck yeah! I been doing something similar with Flutter. It lets you render widgets without a GUI using "pump & settle", and it's really nice having Codex lock in behaviors with unit tests. Or if it's like "when I load this page, the backend gets spammed with get requests in perpetuity. Use integration tests to troubleshoot page load and controller lifecycle."
oh yeah dude. i'm experimenting with some stuff now
i just had codex run through an entire checklist of tests
it failed miserably
π
but claude did a pretty okay job. just doing UI/ UX stuff
login, listing, delete post, etc. Just all the tedious stuff you gotta do before you push to prod
glm sucks js tested parsing a pdf into html, glm didnt even make a proper table
codex nailed 1st try
Yeah stuff like "when I log out and log back in it doesn't clear the token". Or my favorite is "now that we have this domain locked down, and synchronization for everything here works perfectly and proven with tests, let's sweep through the other domains and match controller and view behavior, confirm with tests" like it's doing right now. And then you fire up the app and it works flawlessly π€
If youβre looking for a good way to control browsers, Iβve spent a bunch of time trying to squeeze all the extra perf and capabilities out of Chrome here: https://github.com/btraut/browser-bridge
neat
Codex app + Zed. Sometimes Sublime instead of Zed because the language servers compete with Codex (locking cargo build dir)
Any chance your developing it from windows?
I havenβt tried, but I think it should work. Iβll give it a shot tonight and see.
for something like this, you could run this in wsl if you're unsure, but 99% of that has a windows version of it, so the only real issue you need to look out for is powershell vs bash differences. Tell codex to run it on windows and you'll be fine
I even got the raspberry pi
how lucky π
DeepSeek definitely about to drop a banger in a few days lol
Holy crap I got the codex merch email.
Me neither rip
ditto π
I'm more interested in that raspberry pi 5 codex kit
Have spent most of my time today trying to recover from Codex / Codex CLI updates. Lots of different errors reported in VSCode output window. Time in GH Issues, config files, logs. Real cluster.... of a time waster. πΏ
https://www.npmjs.com/package/codex-webstrapper I made a wrapper for the Codex app that 1:1's it in the browser just fyi put it on npm for quick install & run (requires a mac at the moment because it needs the electron app running for it to work) bun install -g codex-webstrapper@0.1.5
My spark usage like hasnt moved at all, anybody have an idea what the usage is like compared to regular 5.3?
grats ! how quickly did you sign up after the ad dropped?
I probably did it in about 15-20 minutes, but I also made it harder for myself because I thought it was specifically for the codex app, so I was trying it from the windows port I did. It still made the skill and got me there, but it was just a lot of effort that didn't need to be spent because I didn't pay enough attention
I got mine too, registered after like 50 minutes
is it too late to sign up?
Where did you registeR?
https://openai.com/codex/youcanjustbuildthings/
The link you got when you did it through the skill had your account email tied to the link, so you're not getting that link lol.
anyone else having issues with 5.3-codex being utterly incompetent compared to 5.2-codex?
Ah I musth ave missed that
Thanks! If that were the case, I'd probably be less annoyed π 5.2 actually worked very reliably compared to this
I JUST GOT THE MERCH EMAIL BROOO
SAMEEE!! thank you @chrome raven β€οΈ
β€οΈ u codex
I wonder if they're going to put up an easter egg and offer merch to Windows/Linux users. π
me 3
you didn't need to be on the mac version. I'm a windows and wsl user
i wonder how distribution of who gets what kit worked
π I got the email!
I got Core Kit, I donβt think the email you put matters, itβs random
if i took a super wild guess, or i designed it id given top gifts to power users
just didnt think of it at the time and didnt want to put in multiple under the same name and get removed XD
Do you think the order number is actually in order of who completed it fastest?
What does it actually do? lol
Say What? I thought they only added the skill into the app, and sincee the app is only available on Mac, then so is the merch.
Does it just launch the Codex app?
Oh wait - @high girder ported... nvm
I'll revise my statement - I wonder if they'll offer to Windows/Linux users ... who don't do a custom port of the app.
I assume youβll be able to program the macro key to do whatever
I'm pretty sure you could do it from the CLI or vsc extension. It was a skill setup
or ... new power button π
π Thatβs so smart
I canβt be bothered reaching down to turn on my PC lol
now only if i turned mine off XD
I wonder if it was really the first 1k people or there was more
my order number says I'm either one of the first 100 or they let a LOT more people in.
So I was right it is a smaller model. Lolol yea not gonna use this
anyone have a screenshot of what all the rare reward options were for the merch drop? & it was first 1k, and def more incase of bots/dupe address/name/etc
it's called elite kit but they didn't show the whole thing
this one
Lucky
The way they just giving away a single macro that probably is enter lol
https://www.amazon.com/Programmable-User-Defined-Button-Customized-Combination/dp/B08SQGWZN4
its programable (i assume its this product just retextured keycap)
Probably something similar to that yeah
lol π
Needa hop on the meme make it say lizard
lol
wait you're telling me a model that performs much better than the flagship is actually just smaller??? shocked
yuy shipment mail :D
makes no mistakes
why is codex so dumb now? two days ago it did everything i asked it to do, and now i told it to fix a problem in my python code, and it broke more things
πͺ
I can finally wear clothes again after I receive the kit
Its not betrer?? Who told you it was better??
oh god i need to comb my hair
I was sure i missed the first 1000 window, pretty surprised to see this now
so jealous i cant even spell apparently
how many digits is your order number
i suspect they ended up blessing a lot more than 1000 ppl
Apparently there was another hidden layer if you won that minigame, but i only gave it a half hearted try π
Lucky
skill
lol im just happy i got stickers and a macro XD
I remember i felt pretty smart for stopping and zooming in. Then after i submitted, i saw all the codex' team not so subtle tweets about it that i hadnt seen and lost my confidence π
Def
it shall go in the stack with the supreme stickers XD
Performance as in speed
lets freaking go
this is for the first 1k people right?
apparently so
im probably going to forget about it because it ships in 4-6 weeks
it'll like fall out of my mind
then when the package comes in i'll be confused asf LOL
watch it never get delivered lost in shipping is usally how these things always go
LMFAO i'll be so upset
im in the UK too and with the royal mail that damn well might happen
you might've just spoke it into existence
gonna pray that doesnt happen lol
I want to use codex with Minecraft modding just to see what it can do, can it recognize and use classes by itself or do you have to give it the name?
Codex reads and understands your codebase. The more help you give it, the better the result.
@urban falcon Give it links to the library that you want to use: Paper, Spigot, etc. Also tell it the versions you're using so that it uses the right libs for all the right classes. (as you might be able to tell from my picture here, I'm kinda into Minecraft too.)
Oh, you said modding and I was talking about plugins ... you know what I mean. π
Does anyone else have problems with codex(browser) with loading and lag? Any suggestions?
Is there something like a reject list in Codex? I don't want it to ever read my .env
Managed to get mine too
they i suspect they let in way more than 1000 people
Wait what how do you get that?
Easter egg from the Super Bowl commercial. Install a custom skill called Build Things, run it, generate custom url to claim
First 1000 got it
Ah well it's way too late for that xd
seems like way more than 1000 got it
hehe - I tried to install to Linux with v0.101.0, it's not in the curated list. π₯Ή
No - if it's in the scope of the current workspace then the assistant can/may/will read it. You may need to change permissions to a different user/group and only change permissions back when the app needs to read it.
I got the hours I didn't spend watching the Superbowl. π
Found a fun use case for Spark
Iβm having a hook output from my server to a endpoint on my Mac, which then summarizes Codexβs response for me using Spark and a local tts model plays it.
Actually, I just checked my spam trap, and I'm getting some minor swag: A cap, a keychain, and some stickers.
Oo when did they send out the email?
7:25pm US EST.
Dang not seeing it, I filled it out during the Super Bowl. Maybe Iβll get a suprise later lol
sadge, maybe i redeemed merch too late
is it just me or the TUI is super slow when typing now since 0.101 (if the terminal has sufficiently large chat text)
Windows or Mac?
oh i didnt watch either heh. i was coding lmao
i found out in here
i like basketball. football not for me
i did end up getting an email
More easter eggs
gpt-5.3-codex-spark is current down
a.k.a. the Cerebras special β’
windows
actually it's slow every with an empty terminal
thought i was going crazy lol
works for me
not for me
Cerebras cant keep anything online for more then 12 hrs
Yeah windows is terrible atp with cli's
just a few hrs ago
i don't think it being on windows has anything to do with how the back end is connecting it.
No but the front end definetly is and you asked about typing (which should be front end)
for the codex CLI ???
I was talking to Cleroth
my point was it was working blazingly fast before this update
About his slow typing issue
#keep4o π
Anyone else in an update loop trying to update codex cli?
Just runs over and over on startup of codex
just install it manually
Anybody find any good edge cases for spark yet?
asking questions
wasn't it only available to pro members at the moment? or was that something else?
It's removed from the app as well, and I had it earlier, it was just a preview model maybe it wasnt working as intended
PRO-sub: The 'gpt-5.3-codex-spark' model is not supported when using Codex with a ChatGPT
account
I am on a PRO subscription, using CLI on Windows > WSL updated to OpenAI Codex (v0.101.0), Can see 'gpt-5.3-codex-spark' in /model chooser. Tried testing it with a text only task and I get this error ' Error running remote compact task: {"detail":"The 'gpt-5.3-codex-spark' model is not supported when using Codex with a ChatGPT account."}' What am I doing incorrect, please advice.
someone said they got it working after /logout and logging back in
i am on the pro tier it's just a generic error because they've removed it from the list and i still had it selected previously.
yeah, same here
vibe coders trashing others for bugs is quite a sight
so codex cannot run agentic workflows like CC, hard to find /Skills, /hooks, agentic harness like CC, and now it is limiting access to latest model via CLI, am I understanding this correctly? Has anyone found a way to perform long running agents (+skills ) very much like CC while on Codex? (genuinely asking, as I am either looking to bring my CC workflow to Codex, or cancel the $200 OAI sub)
Honestly though. I just blew through a 5 hour limit in claude code in 8 minutes flat. One prompt. Granted, I'm working on a huge project, but Codex 5.3 in general, has been solid for a lot of the work. I say keep the codex sub, and ditch the claude sub. You can use Claude Code with a local model back end if you have the hardware for it. Also, you can use Claude Code agents to spawn Codex CLI instances. Go nuts.
unable to resume my large conversations now, just gets stuck in Booting MCP server: codex_apps
:/
welcome back spark
- Hooks donβt exist (yet)
- Agents use skills as applicable or you can just type $skill-name in your prompt as much as you want
- Not sure what you mean by hard to fine /skills
- The model is removed from the app as well Iβm assuming it wasnβt performing as they intended
I had the same questions as you when I started using Codex I was like really? This is it? But if you can, use the app, especially with their native work tree handling, pr/code review features, mcp and skill manager itβs a way better experience, and the 5.3-codex model performs way better than 4.6 imo
4.6 is like an abusive ex girlfriend i keep going back just cause i think maybe it changed and its what im used to but it just hurts me again
and them charging for /fast is irritating, especially since spark seems way faster and pretty much as capable
Does Codex CLI have an equivalent to Claude Code's /usage command for monitoring rate limits?
yeah
/status
took me a second to remember
Its also in the Codex App if you click settings
Yup, just found it! Thanks. I think I ignored several times. I assumed it was an uptime thing...
It wont let me edit my messege, so what can I do to get it to keep working? (for context I was asking it to use a block list and I gave it a blocklist for words and that was too much for it)
It wont let me move on the only thing is the retry button
is 5.3 better then 5.1 max?
love using Codex for local reviews. Would be nice to have local review sessions count toward the code review quota instead of the main quota. My code review quota is always sitting at 100% and feels a bit wasted, while my main quota gets drained.
IRL. I gave 10% of my workflow that I built with Claude to Codex, I gave Codex (heavy thinking) a good 3-4 iterations for codex to build and test (TDD etc etc), at the end of 3-4 hours of wrangling. I showed Codex the exact output Claude Code created with the same instructions . After performing a unbiased review Codex - resigned. IT gave me this message.
---- "Youβre right to call this out. Iβm recording these failures into AGENTS.md as new learned rules now (noise filtering, header context switching, diagnostics separation, speaker canonicalization), then Iβll
give you a clean handoff note you can give Claude Code."
What am I missing guys?
IRL. I gave 10% of my workflow that I built with Claude to Codex, I gave Codex (heavy thinking) a good 3-4 iterations for codex to build and test (TDD etc etc), at the end of 3-4 hours of wrangling. I showed Codex the exact output Claude Code created with the same instructions . After performing a unbiased review Codex - resigned. IT gave me this message.
---- "Youβre right to call this out. Iβm recording these failures into AGENTS.md as new learned rules now (noise filtering, header context switching, diagnostics separation, speaker canonicalization), then Iβll
give you a clean handoff note you can give Claude Code."
What am I missing please
Anyone know if you can use different models for sub-agents in Codex? Main = 5.3-codex, sub-agents = Spark? Doesn't look like it's supported but wondering if there's a hack for it.
There's a chance it might be secretely rerouting you to GPT-5.2 https://github.com/openai/codex/issues/11189#issuecomment-3880522742
its not supported yet
but here's what i recommend.
Step 1: RTFM
Step 2: Use $swarm-planner with 5.3 codex high/xhigh
Step 3: at the end of plan mode, esc out and "save the plan to file"
Step 4: Switch to Spark
Step 5: run $parallel-task
What you're describing would be preferrable because ideally you have base 5.3 acting as your orchestrator but we dont have much of a choice.
never post your order number anywhere
even tho this is free
bad habit
π
Sorry sensei
i was feeling the same thing, in order to tak advantage of it you need to roll out a pr work flow.
Noted π€
Dang crypto rats π€£
hahah yeah man
the best scammers are very smooth
'omg i'm so sorry, i put the wrong address'
'my autofilled used my old address'
you hope that the vendor will actually make you give more info but you never know
thats codex for you
you can try to use more explicit guidance in your agents to use apply patch
i noticed it typically prefers apply patch for small edits and cat for larger edits
but yeah that is annoying
skills work much better in codex, hooks are in the works, but honestly most of the stuff you use hooks for in claude code are because of short comings codex doesnt have. codex exec is the solution to agents afaik. But most of my work flow stuff in cc was trying to make sure it was doing the right thing. That just goes away with code.
oh nah there's great use and need for hooks for sure.
all kinds of fun utility
but i do agree that there's a lot of shortcomings in claude you dont have to worry about though
thems be facts
kinda goes back and forth because a lot of ppl want stop hooks so they can tell codex to continue π
where thats actually not really a big issue in claude
so kinda it goes back and forth
might need em for different reasons
yeah for sure , I'm just saying lots of the hooks stuff is trying to make sure claude is doing the right thing in CC - there are also legit cases. So many of the usages that are in CC just go away.
yeah i mean you can use them for pre commit checks, linting, blocking dangerous commands, pre compaction summaries, logging, and hte list goes on and on and on
i'm very excited for hooks
but re: the stop hook thing, i've gotten around that with a simple statement in agents.
if you use subagents you can try all of this.
the 'execution' part though is what i'm talking about.
ever since adding that, i dont get codex stopping ask me if it should continue
it just does
codex has subagents now or are talking about codex exec wrappers?
oh nice finally, is it in stable or experimental?
its stable
/experimental is more or less stable
if it was unstable they wont put it in there
unstable = 'in development'
thats when its just a flag, no /exp entry
like memory is rn
well i when i say stable i mean the stable release π
nice, that was one thing that really held me away for a while
i use swarms all day every day
I used claude heavily and codex i love codex now since going back to the cli
Claude's to quick to jump in and start a new feature not take time to check what's up if you dont specifically give code
once hooks 90
the one thing codex will be missing, well two. are teams and extensibility/customization
i dont think teams is THAT important, but will be cool
I havnt used cc in a few months but ive done alot with codex and its turned out real nice with the current updates π
I was switching back and forth, but i found 5.3 is slower, but i usually dont have to itterate on it, it just does it right the first time, claude is quick but then im like "move this, reposition this, fix that" and eventually its good
and the 2x limits are nice too
codex does more thorough work too
after using claude for probably close to a 2k hours the current state of it vs codex it isnt even close. In my domain anyway.
codex generally does better work across the board
ui not as much
writing definitely not
i'd like to see codex get better at convex as well
Yeah codex is a eng and claude is more of a vibe
they're both amazing
i was referring to codex the cli tool
the models to me there's no question i prefer codex for most tasks
but claude code is the gold standard for a cli
its so good
codex is really closing that gap tho
i'm building something pretty cool to take advantage of both.
its nothing special really its just a skill but
i built a swarm loop system that you plan with and it builds dependency graphs into your plan.
then you switch to code mode and implement with swarms, it basically launches parallel agents in waves based on unblocked tasks
so if there's 5 unblocked tasks, it'll launch 5 agents. if there's only 1, it'll launch only 1 agent
and it works in a loop until all tasks are done.
but claude is so much better at UI design
so i made another variant that calls Claude for all UI stuff
you can create a skill for convex with index and reference files and codex will be an expert in the lastest and greatest everytime.
yeah it works but claude is still just generally better
but yes, that's how you can make it better
but i'd rather it be good regardless because sometimes it might not call the skill
i think they will be better on the next model.
I have to scream at claude in orchestration just to use skills at the bare min level
codex hasnt missed a skill yet
claude is funny honestly. i have it calling skills without prompting somewhat regularly but i do hear this complaint ALL the time
so i know people arent just making it up
how many skills do you have?
probably about 18 now
yeah thats not too bad
watch i can almost guarantee it'll call this skill every time. brb
about 10 of them are topical
i can guarantee claude will call skills the first time everytime. But after it called a skill its 50/50 on the next turn that it calls another one. Then after 120k context, 0% chance
okay that makes sense
i love this about opus, i didnt have to tell it to launch in parallel
it just does.
i do have HIGHLY complex workflows i do in claude
Yeah its way in front on subagents and tooling in that area
but it usually doesn't exceed 50% of cw
but 1 workflow might call 10 skills, 4 subagents
nested 3 layers deep
but because they're subagents the orchestration agent isnt seeing creep
occasionally it'll screw up the formatting though
but your suspicions are true. codex doesnt experience drift like claude
opus
you can push the cw to max