#codex-discussions

1 messages ยท Page 58 of 1

meager bobcat
#

and I end up thinking like some janitor or engineer when its completely distracting

signal tapir
#

I wouldn't say so. I would say that we game devs are a different category.

uneven kayak
#

yeah in the future for sure, I see a future in which you only need a one line prompt to make a AAA quality game. It's just going to be very surprising, one line isn't much to go on when you're essentially asking an AI to write a novel, so are the users really getting what they want? Maybe, if what they want is vague enough.

meager bobcat
#

I spent 8 hours once adjusting font outlines and rotating text a bit to the left until realizing I didnt even need any of that and eventually I completely rewrote the Ui into way better one in less time anyway

#

I was thinking like a technician, not a gamer. Because gamer would never notice that font outline

#

they would just either get bored, or have so much fun they'd be tunnelfocused on gameplay (not on font, like me)

signal tapir
#

Although the gamer might have nocticed it unconciously, and preferred it that way without realising. As a game dev it is our job to figure such things out ahead of time.

uneven kayak
signal tapir
#

800 hours in some cases. ๐Ÿ˜›

uneven kayak
#

yeah maybe you didn't actually end up using it but that's just an example

uneven kayak
signal tapir
#

I've only spent like 2000 hours on this game so far.

meager bobcat
# signal tapir Although the gamer might have nocticed it unconciously, and preferred it that wa...

I came to my conclusions and concerns because I was playing lots of successful games (still made by traditional AAA companies) that consistently leave small bugs out there in releases

"damn the button text is slightly overflowing in English locale here"

I am sure at least 1 guy noticed this in that 5000 man company. But they purposely gave no fuks.
Which works out because I respect that particular game and it is widely succesfull too. They created value. They did not have to bother about overflowing font like I did.

uneven kayak
#

I love agents though, I'm doing stuff I never would have done before and faster and better than I could have done before, but that's coming from a place of sort of already knowing what I'm doing and being able to see beneath the surface

meager bobcat
#

In fact i am sure nobody noticed like I did

#

from players I mean

#

If you would rather add new features every week/month and release it to world. as company or as individual its smart af not to fix overflowing text if that gives you more time

#

I am sure thats a purposeful prioritization framework utilized by many game devs who ship product fast and effeciently

signal tapir
#

Not me. ๐Ÿ˜›

#

I am not close to efficient

#

I have only artistic vision as a controlling factor

meager bobcat
#

I mean like im perfect lol
but ofc its fun to work on important stuff and move on even when things arent perfect. and thinking like a gamer rather than engineer seemed to help me lately

signal tapir
meager bobcat
#

haha fair enough

#

I mean thats whole point of being a dev working on your vision

#

if overflowing text is not something you would rather ignore then you wont, ezpz

#

so yea

signal tapir
#

ya

uneven kayak
meager bobcat
#

yea

#

I am saying this from personal experience. I absolutely love that game. Its really well made. But I consistently notice "damn devs got REALLY lazy here" and yet... it doesnt even matter. Its almost like they mastered taking shortcuts in the ways that dont matter, yet some solo devs including me would probably spend 10x more time for barely any gain

#

I do wanna learn that.

uneven kayak
#

it's about seeing the forest for the trees, or in this case, taking as many shortcuts as possible to meet a production deadline? lol idk pick one

signal tapir
#

I was expecting to ship in december. ๐Ÿ˜›

#

Might still do that, though next one

meager bobcat
#

lolol

#

same, every year

signal tapir
#

We ended up creating a full language for our game. That idea is incredibly inefficient.

uneven kayak
signal tapir
#

indeed

meager bobcat
#

๐Ÿซฃ

uneven kayak
#

Updates are great but everyone remembers the launch lol

meager bobcat
#

unless they dont xD

signal tapir
#

This galaxy generator I'm working on. I don't think I gained anything by using AI. It's been taking so long I probably would have finished it faster on my own.

#

This is probably going to get more and more rare as time goes by

uneven kayak
# meager bobcat unless they dont xD

if you're talking about games, oh the hell they don't lol. KSP 2, Cities Skylines 2, Roller Coaster Tycoon World, SimCity (2013), half baked products that lost steam because no one could forgive their poor launch state

#

and the list goes on and on

meager bobcat
#

I think devs could not deliver the product so they gave up

#

Most of those games were developed as AAA 1.0s that were intended to meet all their financial goals at release date, same fiscal quarter. And if dev team couldnt deliver a stable valuable product and customers did not shell out enough money, spending more millions on completely overhauling the game would be a disaster

#

Meanwhile paying $20-200 for codex isnt some sinking disaster situation

uneven kayak
# meager bobcat I think devs could not deliver the product so they gave up

sometimes that might be true but it also has to do with being devastated by the real life reaction to something you care about, so why would you want to keep caring about it? That bad launch drives good devs out of bad companies like ants from fire and the business collapses so the publisher has to find another developer and the cycle continues

signal tapir
#

And I just crashed Unity...

meager bobcat
#

yeah ofc.
haha you are right.
for indies its emotional damage
for AAA its bank account gone

uneven kayak
meager bobcat
#

so two different issue, same effect

#

lmao

signal tapir
meager bobcat
#

lol

uneven kayak
#

now, that said, who wants to playtest my half baked barely working game? ๐Ÿคฃ ๐Ÿ˜ญ

#

lol jk, gimme a week

meager bobcat
#

i can but you gotta playtest mine ๐Ÿ˜ผ

uneven kayak
#

yeah definitely

signal tapir
#

Mine isn't even close to playtesting

meager bobcat
#

ezz

#

lolol

cinder plume
#

I've been working on my own game using Codex exclusively for about 6 months, what kind of games are you guys making?

meager bobcat
#

which stack?

signal tapir
#

4x spacegame

meager bobcat
#

me and andrew are using JS/pixi

#

I think Jim is using Unity?

uneven kayak
# signal tapir Mine isn't even close to playtesting

mine is closer than I ever thought possible, it's lacking content and the UI/UX needs polish but I've been mainly focused on underlying functionality and the client/server crap and it's SO close to the point where I can finally shift gears into content creation

cinder plume
#

Im using raw C++ using SDL

signal tapir
#

I started in Unity, switched over to Monogame, and now I am back in Unity.

uneven kayak
#

love Codex so much

meager bobcat
#

nice

cinder plume
#

I almost went with Monogame

signal tapir
#

I liked it, but it wasn't right for this game

uneven kayak
#

I'm using a JS stack for Android deployment, should be enough for a 2D game that's mostly about numbers

meager bobcat
#

yeah I like JS for its codex agentic magic

signal tapir
uneven kayak
slender plinth
#

BRO BE SO FOR REAL LIKE COME ON

meager bobcat
#

hollandianish?

slender plinth
cinder plume
meager bobcat
#

hahah guessing

slender plinth
#

but does anyone know how to fix this damm issue

signal tapir
meager bobcat
#

what did you ask for?

signal tapir
#

Basically asked it to find vulnerabilities in his software

meager bobcat
#

ooh so they limited that?

#

or only when you ask something specific

signal tapir
#

"It might be hacking!"

slender plinth
meager bobcat
#

like people were reverse engineering stuff?

slender plinth
uneven kayak
#

maybe he just needs to be more specific about the fact that he made and owns the software he's scanning

#

idk

slender plinth
#

now it works

meager bobcat
#

ooh

signal tapir
# slender plinth WHY

Their safety AI (probably a separate AI that reads stuff first, and is much much simpler) flagged it because it thinks maybe this is someone who is trying to hack someone elses software.

meager bobcat
#

yeah

slender plinth
#

@signal tapir how do you know what i asked

cedar skiff
#

how do you guys prevent this behaviour: I ripped a spec from a site and then got 5.5 to make a skill from it.
Part of the rip process is to take any vids and convert them into 5x5 frame captures and replace the links to target the frame captures instead. This is workflow history, doesnt need to be mentioned in the skill. 5.5 just goes and adds the history of the work flow to the skill. It doesn't understand that the history doesnt need to be in the document. It does this ALL the time.
WebP contact sheets replace MP4 animation examples in the bundled rip.

slender plinth
#

oh wait

#

youre dutch

#

makes sense

signal tapir
slender plinth
#

HOPPI ANNOYINH it looks like its working now

uneven kayak
slender plinth
meager bobcat
#

you guys speak English as native langauge or

slender plinth
#

dutch and morrocan.

#

and arabic

#

and french

uneven kayak
#

lol idk, I think we're all just citizens of the world. ๐ŸŒŽ

slender plinth
#

those are my natives

meager bobcat
#

my native language is only spoken by 3m people so

slender plinth
#

the helly

#

which languge?

meager bobcat
#

lolol

#

Georgianm

slender plinth
#

dont know what that is

meager bobcat
#

^^ case in point

slender plinth
#

YOU SPEAK THIS

signal tapir
#

I'm Swedish. We have Swedish and Swenglish as native tongues.
Swenglish is English with a really strong dialect, and a tendency to just anglify words we don't remember in English. ๐Ÿ˜›

meager bobcat
#

looool

slender plinth
#

THESE ARE JUST SOME DRAWINGS

#

THE HELLY

meager bobcat
#

yep

uneven kayak
meager bobcat
#

I mean arabic also looks different from most languages similarly

slender plinth
#

i need to learn how to speek greek

meager bobcat
#

written

meager bobcat
#

yeah but its not like latin/english

signal tapir
#

My game has a language where two voices speak at once. One carries the semantic meaning, and the other adds emotional context.

slender plinth
meager bobcat
#

yee

slender plinth
meager bobcat
#

lolol true

slender plinth
#

like i can speak it

#

but draw

#

no

#

just no

#

last time i wrote a cursed word instead the word i needed to write

small patrol
#

bro anybody has now issues with codex? like after last update i start it and then codex simply doesnt answer?

meager bobcat
#

you press enter and nothing happens?

uneven kayak
#

ok back to my questions and answers, I opened a can of Monster and brought my laptop outside for some fresh air and a fresh perspective lol

slender plinth
small patrol
meager bobcat
#

dayum

slender plinth
meager bobcat
#

thats russian

slender plinth
#

it looks like latin

meager bobcat
#

cyrlic

#

very similar

signal tapir
slender plinth
#

it prob says

#

codex he otbe4aet

meager bobcat
#

codes isnt responding

slender plinth
#

codex is beaten

small patrol
#

YES

#

yes

meager bobcat
#

lolol

slender plinth
#

LOLOLOLOLOL

meager bobcat
#

idk try reinstalling

#

theres lots of stability issues with windows

slender plinth
#

Ik praat nederlands, is dat toegestaan

meager bobcat
#

codex aspp

slender plinth
meager bobcat
slender plinth
#

RAAA

#

ana hadry swhea harbia

#

oh and thats allowed

meager bobcat
#

well it doesnt actually read content it reads letters

#

if one of them is not English maybe

#

what if its mostly english but

meager bobcat
#

idk I only have english keyboard...

small patrol
#

thats yex

signal tapir
meager bobcat
#

looool

meager bobcat
# small patrol thats yex

try updating from windows store, or unistalling/reinstalling. or use different PC with different windows ver

signal tapir
#

Try asking GPT

meager bobcat
#

codex app has lots of bugs that seems to be exclusive with windows users

#

ye

small patrol
cedar skiff
#

I dont face any bugs at all on mac

meager bobcat
#

I didnt realize how windows got out of fashion as of late lmao

small patrol
#

yes

meager bobcat
signal tapir
#

agree. be safe.

meager bobcat
#

its annoying how those blue blips on completed chats dissapear after restarting app or updating though. I keep forgetting unfinished work still sitting there or unexecuted plan.md ๐Ÿ˜„

balmy olive
cobalt junco
#

note to self never use /goal again

plush harbor
#

lawdy. I just have a $5 api bill

cobalt junco
#

funny thing is i still have 20% usage reserved

uneven kayak
small patrol
#

would mi files be safe

uneven kayak
uneven kayak
# small patrol would mi files be safe

My agent instructions are stored in the project directory, so the only thing I'd lose is settings in the actual Codex app which only include some minor PR draft instructions and a couple plugins. Would take like 2 minutes to get back up. But that could be different for you if you have more going on in the app.

#

I'm handling most authentication via command line and environment variables so I could use any agent and they could pick up where the last one left off

#

gonna try Claude Code at some point

#

I've been using 4.7 Opus via Cursor so that necessitated the interoperability and modularity of my agentic workflow

simple star
#

Fellas, I have been thinking about something... how about creating a Linux distro that is mostly AI driven? Complete access to everything the system has to offer... Would you use something like that?

uneven kayak
#

I mean Codex seems to have pretty comprehensive control over my Linux server via SSH

simple star
#

Basically having total powers, meaning it can use any app (terminal or GUI), read any file, make any sudo changes... basically full powers, comparable to those of a real user.

cedar skiff
#

Cant we do that already?

uneven kayak
#

but those already exist

simple star
#

I told you, complete root privileges, and able to use the graphic capabilities of the distro

uneven kayak
#

GUI is overrated lol

#

linux thrives on CLI

simple star
#

Im not talking about GUI

uneven kayak
#

Graphical is literally the first word in GUI but okay lol

simple star
#

im talking about anything you use. You use always a GUI, when you use a computer, no?

#

I am talking clicking on a page, watching a YouTube video, etc...

simple star
#

The current tools (like Codex) cannot use websites or things like Photoshop

cedar skiff
#

I think apps like photoshop are changing to include an api for llms, and eventually just an api for llms

uneven kayak
#

A lot of the stuff we use a GUI for or even Photoshop can be accomplished via CLI with Codex, also Codex actually can use websites via Computer Use with multiple browsers

simple star
#

"Computer use" <- MacOs only

That is why I said "Linux distro"

uneven kayak
#

but it's not on Linux so yeah

simple star
#

"Computer user" <- MacOs and Windows only

#

That's why I said "Linux distro"

#

Specially since OpenAI seems to have zero interest of even making a Codex App for Linux

uneven kayak
#

I still think it's a solution looking for a problem, and you're talking about something on a pretty massive scale, designing an agent that can use graphical tools isn't an indie project

#

imo

simple star
#

Im not trying to build the next Ubuntu. It's clearly an experiment. Not something where I would put sensitivie info.

meager dragon
#

My plus expired If I buy again will my usage instantly grow back to 100% or not?

meager dragon
#

cuz my weekly usage was alrady like 0%

uneven kayak
#

if your subscription expired, resubscribing will bring it back to 100% available. Same thing with upgrading.

meager dragon
#

cool

#

but why my plus was on like 10% weekly and when downgrade to free it shows only have 49%
remaining

#

oh prob because it expire an hour ago and I used in CIL and didn't notice it

small patrol
#

guys what ur opinion whats better claude code or codex?

velvet wren
small patrol
#

oh oki

cedar skiff
#

rip

uneven kayak
#

that doesn't make sense, if you had no subscription, subscribing obviously increases your usage. When I upgraded from free to Pro, it went back up to 100% right away

#

so yours and my definition of "resubscribe" might be different

meager dragon
#

if it barely 0% already I don't have to waste my 3 days money I can just wait it reset

meager dragon
uneven kayak
#

unplug it and plug it back in

reef haven
#

Yo guys wsp

uneven kayak
#

lol I mean, try logging out and logging back in, I think there is a known issue with caching

#

but hell idk, seems like any time you pay money, that resets your useage. So if that is your first subscription or if you're upgrading from Plus to Pro, any time you pay them for a subscription the usage should be reset.

#

but maybe I'm totally wrong

reef haven
#

Yo i am new to agentic coding and kinda broke any advice

uneven kayak
#

Kinda broke or flat broke "how are you even online right now?" kinda broke? lol if you can, the $20 Plus subscription has a lot of value

#

you can get some pretty decent beginner stuff fully made and working with less than $20

#

but my biggest advice is, try to understand what it is that the agent is doing, read along with it

reef haven
#

Yea but the condition of plus is bad because people use free trial and plus quota is also filled with people trying to abuse it

uneven kayak
#

oh I didn't know there was a queued system or overcrowding or anything, I'm living that Pro life lol

#

Until the end of May, they're offering 10x useage on Pro compared to Plus instead of the standard 5x.

reef haven
#

Claude is also making massive moves like they are partnering with spacex and aws just to fix overcrowding gpt has to do it also

plucky spindle
#

Hi guys im new

reef haven
plucky spindle
uneven kayak
# plucky spindle Hi guys im new

hey you're here just in time to witness the downfall of humans and the takeover of our new AI overlord. Welcome to the beginning of the end! lol

meager dragon
reef haven
#

Just use glm 5.1 guys it is also paid rn but atleast it is usable and better than codex for now only for now

uneven kayak
#

looks like GLM 5.1 falls significantly short of GPT 5.5 in agentic coding

#

but then GPT 5.5 is about as expensive as is gets

reef haven
#

Yea glm is like 1.40$ input per mil token only

#

Claude is more expensive than gpt it is hella expensive like 10$ man thats too much and they have an 20$ base subscription

#

Thats subscription used in no time

uneven kayak
#

Claude models only seem more expensive because they're less efficient and uses a lot more tokens, Claude is really falling behind these days imo

reef haven
#

Yea it uses alot of tokens but it also leads in coding and thats their main thing and they have more profit margins because of tokens from 3b to 30b in 1 year

uneven kayak
#

true, that's why I pay the money and use 4.7 Opus for high horizon tasks

#

but not for anything other than audits and planning

cedar skiff
meager dragon
#

AI support

uneven kayak
meager dragon
#

Copilot only give AI 50 lines codes, but CodeX give the whole files which spending more tokens, looks like I need make it start spending less.

reef haven
#

Yea but if had the chance i would make an community api for people it will have que system if anyone don for it then I'll mak it lol

meager dragon
#

Any methods make me save some tokens?

reef haven
#

But i will need contributers

reef haven
uneven kayak
#

that's just the very tip of the iceberg lol

reef haven
#

Yo andrew how many subscriptions do you have btw lol

uneven kayak
#

๐Ÿ’€

#

man I'm almost afraid to think about it

reef haven
#

Never ask a woman her age

Never ask a man his salary

And a dev his ai subscriptions lol

uneven kayak
#

$200 Cursor Ultra, $200 ChatGPT Pro, $115 Scenario (gonna cancel that though, probably) and soon I'll have a Claude Code sub as well

#

and I've been at this only 20 days so far, I first learned about agentic vibe coding and left the rest of my life behind lmao

reef haven
#

Fahh lol how much do you make or its just hobby projects

uneven kayak
#

I've been hyper focused on developing this game, a game I've always thought about but I assumed it would be too complex for me to actually accomplish and then I discovered agents

reef haven
#

Oo topic?/name? Of the game

lost drum
#

Yo they added live animation of loc beeing changed haha looks like its about to TP

uneven kayak
uneven kayak
#

the whole game is centered around the stamps, not postage stamps but rubber stamps with handles. You stamp paperwork. That's the game. lol there's a lot more too it but that could get long.

reef haven
#

Oo nice what any help because i dont have ai subscriptions but i specialize in marketing and helping you market this niche game will make me feel good so if you are down then tell me

uneven kayak
#

haha I appreciate the offer but my game isn't anywhere near ready for that, I will need playtesters in a couple weeks though

reef haven
#

Yea i can test it

cedar skiff
#

couple of weeks translation 12 months

#

I say 1 month , it's always 3 ahha

reef haven
#

If you need any help in development then tell me lol

uneven kayak
#

Codex is blowing my mind every day, always learning something new about what it can do

reef haven
uneven kayak
#

well that's a very simple answer

#

but I'm tired

reef haven
#

Lol but then your nightmare comes credits expired no more credits lol

cedar skiff
#

If youre ok having 100k lines that does 35k lines of work

lost drum
#

and it pushes 200kloc

uneven kayak
cedar skiff
jolly lily
#

Is it okay to have two actually paid plus subscriptions on two different accounts to work on a single project?

reef haven
lost drum
lunar crypt
#

Is the Codex app ever coming to Linux I wonder?

uneven kayak
jolly lily
#

Thanks!

reef haven
#

Honestly i came here to get a free ai subscription because i discovered how to edit vids using ai models but i got friends in return fair trade

#

Well its now time to sell my kidney and give all the money to sam altman

cedar skiff
# lost drum ye but it just needs to create a wrap around the code to know how to optimize it...

I think what you need is a good archtecture. The problem that is faced is the llms dont get that. They dont just understand architecture, they need guidance on it. Without guidances they patch things in place with out understanding the bigger picture even if you tell them to. Like a simple reorder and an async gate might solve a whole arm of null value gating in 20+ files, but it wont notice - it will optimise the null gating into an abstraction and make it dry, that means MORE code. It just doesnt see this sort of thing with out guidance.

#

Maybe one day soon it will, but atm it doesnt

#

it just brute forces its way through fragile solutions

lunar crypt
#

Improved the performance so much

cedar skiff
lunar crypt
#

Can you share your solution? I'm very interested to improve it on my side if you got any insights

#

Or at least method if not open-source, would love to learn

uneven kayak
# reef haven Optimization can come later is a lie lol because after you finish the game witho...

There are multiple approaches of course but I just have Opus run an audit between major phases of development. I ask it to "comprehensively evaluate the execution of X plan and PRs ###", and it can use the PR body that has been molded and shaped by agent workflow so it contains blast radius, change contract, migration notes, acceptance criteria, test results, agent handoff notes, content change class and a bunch of other stuff. So Opus has the context to understand exactly what changed and why. The purpose of that is primarily to make sure everything is in order after large changes but the same sort of approach can also be used to optimize the code as well.

velvet wren
uneven kayak
#

ah sorry about that

cedar skiff
# lunar crypt Can you share your solution? I'm very interested to improve it on my side if you...

I have instructions in the developer-instructions that dictate writing code based of local exemplars if its available. I have an architecture skill that is toical and has reference files that define the seams of the layers, the layer constraints and purposes with code examples etc.
Then for idiosyncrasies i encounter that slip through the system i have another topical skill that is project guard rails where i add generalised rules that cover those as they show up.
When the project is first spinning up i build this stuff as i go in the first stages.

#

In the end it just works.

#

I generate skills for packages as i need them. For example if i am using stripe i get chatgpt to do a deep research on the latest api and best practises and i build a topical skill for it.

#

AND i look at the code.

#

The only part i dont pay too much attention to is the front end

#

If it looks good and is responsive i dont care how it got done

lunar crypt
# cedar skiff I have instructions in the developer-instructions that dictate writing code base...

Oh that is actually pretty solid, I do have a baseline layer of exactly this but I've mostly limited it to rules, global constraints and an outline of 'where lives what' on a high level + a basic description of its purposes.

The core of my method lives in entirely removing any semblance of human-defined architecture baseline -- Basically a graph database that is aware of all code construct relationships (docs, functions, structs, enums, etc. etc. etc.) and creates semantic links based on references within code or docs, then use a accumulated spherical similarity index with subgraph recombination as a semantic injection into AI context based on current task / conversation context which always provides AI with all relevant documentation context and relationship between code and its function.

The graph embeddings rebuild with local AI for every batch of changes

cedar skiff
#

I see a lot of this sort of approach for delivering relevant context to the llm, haven't seen it for architecture yet

simple star
#

I might make it today over the 2h goal

#

The best part is that it is all compiling, so I am barely using tokens ๐Ÿ˜„

lunar crypt
lunar crypt
uneven kayak
robust quest
#

This happens everytime I try to resume a session. Any idea?

#

I get this caution after 5 re-tries

woven canyon
#

Guys, does Go plan give more codex usage than Free plan?

exotic cave
#

How is your experience with goals? i am running one now on a mechanical refactor, hoping for the best....

woven canyon
#

mechincal refactor?

exotic cave
#

as in no business logic changes, but the refactor is wide, say when changing terms/naming's both db and code.

lost drum
exotic cave
unique spade
steady vigil
#

usually I forget to use it though ๐Ÿ˜‰

exotic cave
#

My idea with the mechanical work is that it's seemingly easier to test, because logic stays the same but namings/terms change... I for one will never use /goals for any logic refactor/work, maybe when GPT-6 will be released it'll be strong enough for this things...

steady vigil
exotic cave
#

The goal i gave it has a very concrete plan, with tests, what the end goal etc...

steady vigil
#

or /goal the e2e test xyz must pass

orchid echo
#

crazy im on $100 Pro plan and run out of weekly usage in 3 days even with it being on x10 usage. When it goes to 5x, its gonna be completely un-useable for me..

exotic cave
orchid echo
orchid echo
steady vigil
orchid echo
#

i had Claude's $100 plan before and it would last me longer

steady vigil
orchid echo
#

but idk

#

maybe ive just used codex a lot more than when i used Claude before

steady vigil
#

claude token rate is very extreme on xhigh/max compared to codex

unique spade
orchid echo
#

yep

#

so u can use up the weekly usage more faster

steady vigil
#

no they increased opus limits also (not sure how much)

unique spade
orchid echo
unique spade
tired ermine
#

There needs to be better coordination between GPT and Codex. The pulse is awesome it gives good insight but a lot of times it is not on the same page or up to speed on current projects in codex. For example it will make a recomendation for improvment that is already been done in codex. It would be great if the two communicated better. Anybody have way to make them work better together? I think a easy fix would be the "pulse" channel could just be shared between both apps

unique spade
orchid echo
#

๐Ÿฅฒ

woven canyon
#

Guys how often do you refactor code?

unique spade
tired ermine
#

currentlyt using notion to manage my projects so having both chat and codex refer to that has helped

#

github a good idea

unique spade
#

i would probably wait till they release more features to connect the 2

you can make things like having your own custom connector in chatgpt that has a codex from the other side. so you can have chatgpt communicate with codex via that connector for example

#

but right now that's only one way, as in telll gpt to query codex. cause gpt doesn t accept incoming messages via connector

tired ermine
#

i like it. Sweet UX. I used to uses openclaw and stilll do and it built a pretty cool mission contol. But way to many headaches with all the openclaw updates and things breaking. Trying to bring all my workflow to gpt/codex

unique spade
#

like i'm using chatgpt to review specs drafted by codex. precisely because with chatgpt i brainstorm and it holds a broader view of the intent than codex

which is one reason i made that ux, cause i was tired to keep copy pasting in various windows

radiant lintel
#

Anyone having issues?

tired ermine
simple star
#

Has 5.3-spark disappeared from the /status ??

tired ermine
boreal holly
unique spade
# boreal holly Ayyy you integrated ChatGPT into your Codex wrapper? Sick!

yea it's a dedicated chrome tab just for gpt, and i link a specific codex thread id with a specific gpt chat so i can have both on at same time . plan is to also have various automations like one button to send new specs for review in gpt chat and so on.

elctron seems quite cool to use it as some kind of sandbox for your workflow

like it was annoying when i checked the opened PR's by codex to get a new page opened in my main chrome. So i made it so it opens a dedicated github right there inside the middle plane, whenever i click a link from inside codex chat

lost drum
boreal holly
#

Share context in a streamlined fashion

sacred minnow
#

dang i cant wait till my limits reset to test goal o.o

unique spade
signal tapir
sacred minnow
unique spade
lost drum
#

donte reset I have 60% man

unique spade
#

i'm experimenting separately with making my own backend to connect directly to oai server, but that's just gonna be a different option toggle

like i want vanilla codex or my own

cedar skiff
#

i changed my approach this week, i left 20% for the last 4 days for planning and start of next week ill blast 80% early again. That i way i dont get ripped off by a reset

boreal holly
cedar skiff
#

it also doesnt cost subscription tokens

lost drum
#

I just hope no reset

signal tapir
#

My bad. I had 1% left. When I got back to the computer it was 0%.

unique spade
boreal holly
cedar skiff
#

It would be nice, or they could make it opt in

unique spade
jolly lily
#

Thanks to CaptainStarbuck, robertmsale and smileBeda from yesterday for the recommendations on using seperate context files & detailed prompts and making it all clear before telling the agents to start to code, it has significantly reduced my token usage on both Claude Code and Codex

unique spade
#

reset uncertainty feels very much alike with trading uncertainty tbh

on one side you have a hard time over using qouta early on because IF they don t reset you ll have some days without enough usage

on the other side IF they reset you will miss out by not going more early on

jolly lily
#

It really helped me to focus on the quality of the project & product instead of worrying about the limits all the time

sacred minnow
#

id usually have heaps left but had to change to 5x this month ๐Ÿ™

jolly lily
boreal holly
velvet wren
jolly lily
velvet wren
boreal holly
cedar skiff
jolly lily
unique spade
#

let s hope they find a reason to extend the promo haha

solid lake
#

1 pro and 2 plus accounts seem to be the move

velvet wren
solid lake
lunar crypt
unique spade
#

these days i'm working to have my go with my custom workflow at this benchmark

seems a very good bench to me, especially that vanilla models are currently at 0% in terms of full completion of a task

https://programbench.com/

solid lake
#

Or make files

#

โ€œMakeโ€

mighty lion
#

did u guys notice gpt 5.5 got better?

boreal holly
orchid plume
boreal holly
mighty lion
#

yeah i think so aswell.

torpid trout
boreal holly
torpid trout
#

hmmmm

#

That would be bad lol
And I was sure only the pro 5x is doubled
Well, anyway, I guess I will just have it index my ~/ when I run out of credits, so far it has added 1% for each scan ๐Ÿคฃ

boreal holly
#

Also resets every 1m up to 10m subscribers. So Idk. They don't have a page dedicated to active promos

torpid trout
boreal holly
torpid trout
#

However, they probably have wrong faq, since of course, 200/20 == 10x

#

not 20x

torpid trout
#

on an ongoing basis
does that mean forever?

solid lake
boreal holly
#

OpenAI makin us do math n ish

solid lake
torpid trout
rocky fog
#

what to trust ๐Ÿ˜„

torpid trout
#

Classaction incoming ๐Ÿคฃ

uneven kayak
#

They covered their bases by using the term "ongoing basis", nothing to sue over. Companies change prices all the time. And yes I'm going to sleep @signal tapir lol

boreal holly
#

That's the thing... I think they had a "double usage" promo before the $100 plan, and so when they say "ongoing" they mean that they are keeping the $200 plan at double. Idk, it's all very ambiguous

boreal holly
#

They probably created this ambiguity on purpose ๐Ÿ˜‚ marketing master class

solid lake
#

Iโ€™m highkey glad for the usage

nocturne folio
#

maybe i do need the pro plan

torpid trout
#

Guys

#

pick a random number between 1 and 100

#

Try it.

#

I mean. 42 was long the one and only.
I guess gpt is highly religious?

jolly lily
boreal holly
solid lake
torpid trout
#

In default mode it will be 73

solid lake
#

Whatโ€™s default

torpid trout
torpid trout
solid lake
#

I donโ€™t see an auto option what

torpid trout
solid lake
rocky fog
#

what are you using ๐Ÿ˜„

torpid trout
#

its reliably giving the same answer

boreal holly
torpid trout
stable obsidian
#

Can anyone tell me why my codex isnt able to use the @Browser plugin? apparently its missing the "node_repl js" tool?

rocky fog
#

web chatgpt has the model picker inside the input box for a while now

#

and there is no auto

torpid trout
rocky fog
torpid trout
#

Its a mathematical easter egg so to say

solid lake
#

Iโ€™m in the app I donโ€™t see auto

boreal holly
torpid trout
#

and..

lilac thorn
# rocky fog what to trust ๐Ÿ˜„

My bet is the Pro plan will stay at 10X due to popular demand (and it fits sensibly in their pricing scheme). It's a marketing ploy to get people to try it out (double usage because we halved what was going to be in the plan).

torpid trout
#

sorry for the rabbit hole but it gets deeper

#

73 is very important in the three abrahamic religions (probably due to its mathematical uniqueness)

solid lake
#

It gave me 47 reliably

#

3 times

torpid trout
#

with thinking

#

I guess?

solid lake
#

No

#

Instant

#

And thinking

torpid trout
solid lake
#

If you talk to it about religion you might have the distribution point you to that number

torpid trout
#

Do you have any custom system prompts/

solid lake
#

No

solid lake
#

None that say give me 47

torpid trout
#

but you have

#

so it skews it

solid lake
#

I havenโ€™t

torpid trout
#

try pure gpt, no prompts, memory, etc

solid lake
#

New account

torpid trout
#

then you are unique

#

There's a fb post with hundreds of screenshots, all give 73 lol

solid lake
#

Bruh

torpid trout
#

And I confirmed it in now 4 chats lol

solid lake
#

Yall getting psyopd by probability

torpid trout
#

hows it probable that it always gives the same result? Lol

solid lake
#

Itโ€™s probable bongosort can sort something on the first try

torpid trout
#

whats bongosort

solid lake
#

Itโ€™s a joke sort algorithm that just shuffles the list and checks it it fell into place ๐Ÿ˜‚

#

No embed ;-;

cedar skiff
#

tell it to use a script and not its training

torpid trout
#

the point is its supposedly inferred answer is deterministic

cedar skiff
#

It is the point, it picks the same number everytime because of the training

boreal holly
# torpid trout

OK, so you capitalized the P that time, but in the model weights & vocab Pick and pick have practically the same logits. The entropy would be adding more words or punctuation

torpid trout
simple star
#

Still going...

torpid trout
#

wait, I am going to try pick any number over between n and n

simple star
#

I just told it to make a ai-centered linux distro ๐Ÿ˜„

solid lake
torpid trout
torpid trout
#

it will differ

solid lake
#

In a different chat yes?

torpid trout
#

sure

#

it will be different

solid lake
boreal holly
torpid trout
#

and pick any number = 7, reliably

#

so, theres some fishy stuff going on here lololol

solid lake
#

AYY I GOT 73 on third try too

torpid trout
#

it picks different colors on the same prompt, but same number on the same prompt... I dont know lol
funny easter egg

solid lake
torpid trout
#

you need new chat, fresh slate.

boreal holly
boreal holly
torpid trout
solid lake
boreal holly
#

This is fascinating

torpid trout
#

need to compare other ai

#

and, other repeat questions
Like, pick any city, pick any planet, pick any whatever

#

lol

unique spade
#

meh these are pre-saved results for question like these

they won't waste gpu cycles for it

#

๐Ÿ˜‚

torpid trout
#

And ultimately
ask it the reason, over and over again

I picked 73 because it feels โ€œrandom enoughโ€ to humans without being too round or too obviously patterned. People tend to avoid very low numbers, very high numbers, and multiples of 5 or 10 when trying to seem random, so 73 lands in that sweet spot of looking arbitrary.
[...]
torpid trout
unique spade
#

if you want actual rng

#

just prompt

True RNG (1, 100)

in a fresh chat

lost drum
#

my agi would pick 100

#

cause its commplete number he likes perfection

#

HAHA, HA

unique spade
torpid trout
#

all complete to 10. dont get me started now lol

unique spade
#

we know that like since 2 years ago

#

๐Ÿ™‚

boreal holly
#

I mean it's not all that computationally intensive to ask for "pick a number between 1 and 100". The ChatGPT system prompt is cached, you're inserting roughly 11 tokens, and getting back 2. The KV Cache is already prefilled with the most intensive part of the prompt. It's only when dealing with massive uncached inputs does it burn massive gpu cycles

torpid trout
#

but then why are color questions not cached?
That would make sense to do as well, right?

boreal holly
unique spade
#

guys try change the numbers a bit lol

#

you only have 73 acched for 1 to 100

#

if you change the sequence you ll get different numbers

vital hill
#

bro i thought it was a work day today. i hopped out of the bed like a mf

boreal holly
#

Ah ha, so changing the min affects the outcome

torpid trout
#

I tried llama - each chat a new answer
Served over ollama

So i guess yes this is cache

unique spade
#

but if you give it different intervals you don't get the same answer everytime even with instant

torpid trout
#

oh wait, llama picks 27 and 53 interchanged

torpid trout
#

once 27, then 53, and back

boreal holly
#

Something sequential about this

unique spade
#

๐Ÿ˜‚

cedar skiff
#

first time i have used the codex image gen, it's pretty good o.0

#

It can create production ready mock ups that you can loop on

unique spade
#

like content could change but if you asked for top 3 % you always got either 60 25 15 or another one foir same prompt with 0 temp and fixed random seed

torpid trout
boreal holly
torpid trout
#

I went from 1 to n (and 100 as max), and it responds with 73, up and until 59, then it started get different
weird cache ๐Ÿคฃ
Ok, stop the token waste now lol

cedar skiff
#

we should have a way to use tokens at the end of the week for some sort of a charitable cause, like folding proteins used to do.

torpid trout
#

Well you could always have GPT do some amazing stuff with the remaining tokens and then gift the product
Like, pep up the website of some org in your area, or so

cedar skiff
#

I guess that wouldnt work because it would likely bankrupt the providers, they would have to tighten plans

torpid trout
#

Or offer that as a free service

boreal holly
#

Giving access to those who otherwise can't afford it

solid lake
#

I did actually get to use codex on my free account too that I bought pro on eventually

#

Not a bad amount of usage plus it finished task despite me running out of usage on the first prompt

cedar skiff
boreal holly
boreal holly
#

Codex bar calculates reserves, so those reserves could be the throttling metric

solid lake
#

Hmm codex cli just stops but task still going on (5hr limit)

#

Is this a bug

lost drum
#

scam?

torpid trout
unique spade
lost drum
#
  • codex with full access
torpid trout
boreal holly
#

you've come to the right place

lost drum
# torpid trout He wont be here for long, already modmailed the user id

but it mad em wonder if its possible to create virtual box log in to codex and make codex some instructions like remove virus keep eveyrthing else or smthing or determine whether a certain thing is usable cause rn Idk whats the solution when zip is larger than 2gb or some for ex dowloading course from meganz or some

frozen shore
#

guys if 5h runs out while using codex and the prompt is still going, is it still using 5.5 or a lower model

lost drum
frozen shore
lost drum
boreal holly
#

Does it provide any more info than "Failed to create PR"? Like a specific reason it failed?

slim anchor
#

gm

simple star
#

Time to pray to Saint Tibo

#

I dont think I've ever been this low

hard drum
#

is there reset yet?

torpid trout
boreal holly
#

OK, what if you run the task headlessly? Like send the prompt, close the window, watch the PR activity on github, reopen the cloud thing when merged?

#

Seems weird to me that the browser has any effect on the outcome of a cloud agent working autonomously but idk any other option

simple star
boreal holly
#

Hmmm, maybe if you install codex locally, you can do codex cloud exec --env ENV_ID "Summarize open bugs". Basically clone the repo that cloud can see, run that command, and it should trigger a headless cloud task. No stupid browser bologna

meager wadi
# boreal holly plausible

would be interesting to test this with subagents, probably wouldn't consume a lot of tokens if done with 5.4 mini

chrome raven
#

when reset

lusty finch
#

I got an new feature needed to be implemented into our existing app, the UX team design several use flows and new UIs on the figma.

Am I able to let codex read the whole flows then design the code architecture of this new feature, based on my existing app architecture?

dense heron
#

guys with plus i should use 5.4 or 5.5?

small violet
hard drum
small violet
#

high if low/medium cant solve

#

but stick to medium as your default

dense heron
hard drum
#

also cost

dense heron
#

i mean

#

now 5.5 spend like old 5.4?

#

and now 5.4 spend like old 5.3?

signal tapir
#

My feeling is that you get more out of 5.4, unless you are doing very complex stuff

velvet wren
jolly lily
#

Thanks, found it anyway

civic viper
#

Anyone's using Claude Code inside of Codex app? Is the terminal randomly closing for you too?

bronze obsidian
#

Reconnecting....Reconnnecting... what's up with that

jolly lily
#

5.5 Extra High couldn't even script a proper ragdoll system on Roblox and it apparently consumed all of my 5-hourly limit, geniunely annoying

small violet
#

lol

#

xhigh is pretty useless ngl

#

use high

jolly lily
#

I've tried that as well, it didn't work

solid lake
#

Have it check the math in lean

dense heron
#

i use medium with 5.5 in roblox

#

i do crazy games

jolly lily
dense heron
#

ragdoll is a bit more hard bc you need to know what you do

#

you need know a bit of program or make a very specific prompt

jolly lily
#

Yeah I would have literally completed it in a shorter time if I scripted it myself

dense heron
#

search in roblox forum for help

jolly lily
#

My laziness costed me an hour + 100% of my 5-hourly quota

dense heron
#

and say codex the context

velvet wren
jolly lily
#

I do use seperate context files as well besides a master one, apparently an AGENTS.md file

velvet wren
jolly lily
lost drum
#

glad I switched form vs codex ext to normal codex app oh man it is so much easier

small violet
#

im still on app

#

what if you're making manual changes tho

lost drum
# small violet what if you're making manual changes tho

I dont cause I isntructed codex to make orchestration that registers every change setted up git propperly and eveyrhting is tracked I mean if I Were to change code manaully I would just open it thruht \wsl.localhost\Ubuntu\home\metalicpen\code\project1 in VS and change manaully files and just report back to codex app waht I changed manaully and he owuld run check waht I edited and sutff

#

was my answer related to your question or I missunderstood it?

solid lake
#

How is wsl

lost drum
#

with right setup its faster I guess like it feels native

#

had the other thread not use it propperly and the commands where erunning 3x slower

small violet
#

if u guys had to guess

#

what ai did they use to make this

#

im thinking gemini

lost drum
#

is it real tho?

wicked oriole
#

holy slop

small violet
#

what is

lost drum
#

I mena waht is thsi virus

small violet
small violet
#

i like it

#

been seeing it everywhere

#

why didnt i think of this

lost drum
#

lets just focus on codex

wicked oriole
#

wdym havent you seen a million covid virus tracker websites

small violet
#

umm

small violet
solid lake
#

The rats took Europe once btw

#

?

torpid trout
#

there no vaccine (yet) for this one.

wicked oriole
#

we not taking the vaccine

torpid trout
#

Also, an extreme panic-grab. Hanta virus is around and active since thousands of years
Also humanly transmittable - the variatn that ship got at least - is nothing new.
In argentina yearly 100 people die of it averagely

#

So now a swinger cruise ship spread it... and everyone outcries

wicked oriole
torpid trout
# wicked oriole

I like the "rodent control" prevention. Very useful against humans. ๐Ÿ™„ lol
Back to codex.

wicked oriole
#

yeah codex, what do you have to say about it

torpid trout
# wicked oriole yeah codex, what do you have to say about it

I actually did ask it because I have toddlers cleaning the porch with their all and everything every day, and the thing is, as long as you do not live in a trashbin, and have no dead animal there for dinner, the chances are almost non-existent, even with the andean version.
you need extreme close contact (ingestion, very close contact) of infeceted particles, in close rooms, or liek directly ingesting rat poop etc

#

The problem is, many people do live in trashbins, voluntarly or not, and many people do not understand what minimal hygiene is
Last but not least on cruiseships (regular breeding pool for stuff like noro virus and such)

I would not worry about this one other than normal common sense.

wicked oriole
#

totally agree

#

not worried about nun

torpid trout
#

the nuns i mean

lost drum
#

brother thank you all for giving tips I finally made codex unstuck the project that was made by codex and me prompting oh man

wicked oriole
#

"codex make an app to track rat excrements on my ship"

lost drum
#

"codex extract any info which is considered consiparcy and create a trading strattgei if youw ere to believe them"

lucid mason
#

This is the best situation to end up with: it means you can run one big cleanup (or similar) and it will finish no matter the token cost. But: do not send any follow-up prompt, else it will stop immediately and even mid-edit.

lean lark
#

Or you can take a break for a few minutes, say Hi to family and friends, get something nice to drink, then come back to your desk where you'll find a fresh supply of tokens to continue work...

lucid mason
#

I just came back from that when it ended at 1% from the work I started before leaving. ๐Ÿ˜›

#

And when you look at the reset time you'll notice that the family might be sleeping at that time already.

lean lark
#

I saw that. ๐Ÿ™‚

#

Seriously though, if you're running out of tokens, look at how you use the tooling and what's happening under the Thinking (CoT) process, and try to optimize.
Give better prompts. Document code and project. Ensure assistant isn't wasting its time looking at things it doesn't need. Use the right model and reasoning level for each task...
There are lots of ways to use the tools better and avoid the rate limits.

lucid mason
#

These tokens were well spent. I mostly only use Codex in VS Code for review, refactoring and in this instance a bigger restructuring. Everything else runs through ChatGPT.

dense heron
leaden heron
#

i have sent 3 messages even with 0 weekly quota

#

unlimted quota glitch

lean lark
lucid mason
#

Using ChatGPT for most of the grunt work has the drawback of not being integrated and gitless. For my two small apps that fine and my rollback repository is basically a folder full off the ZIP files I download from ChatGPT, but Codex in VS Code would handle that part differently. But then again I don't have to handle GIT to begin with.

What I don't like about Codex in VS Code is how it just stops mid-edit when you send a follow-up at 0% 5h/weekly left. And since the limit report in VS Code is mostly broken that can happen more often than you'd think. ChatGPT has a similar problem when you reach the maximum length of a conversation. It does finish the last prompt and even offers a download link, but the link is always dead then, so you have to do the last step in a new chat again.

#

It's always funny when you exchange Codex/ChatGPT output between both (or one chat to another). It's a bit like them holding a conversation where one partially disagrees/corrects the other.

lucid mason
#

Here is ChatGPT reacting to Codex':

Main correction to Codexโ€™s plan

The line:
Prefer persistent HWND trees for all non-modal managers/editors.
I would not generalize that yet.

lost drum
lucid mason
#

It literally answers in the form of: What I agree with. What I would do differently. What I disagree with.

lean lark
#

There are a few issues at play there: Context is the biggie. ChatGPT has access to your Custom Instructions, Memory, and maybe Project Instructions. Codex has no access to any of that. Codex does have access to the system-level AGENTS.md, project AGENTS.md file(s), maybe Skills, and project code and documentation ... all things that ChatGPT does not have.
So they're working with completely different information and it's no wonder there will be disagreements.

lucid mason
#

That's why I keep documentation (including architekture, file ownership, ai-handoff etc.) inside the package.

lost drum
#

idk why would you use chatgpt as planner if you can make codex do it better

lucid mason
#

Codex has limited tokens, ChatGPT has unlimited tokens.

lost drum
#

I just use chatgpt if my codex says to do deepresearch about certain topic

lean lark
#

Another issue is that these models are never happy with anything. Ask ChatGPT to do something. Ask it if there's anything else to do. If it says no, open a new thread, paste it in, and ask the same question : It'll find a lot wrong with it.
This is the telephone game, where doing this more than a few times makes the result unrecognizable. ๐Ÿ™

lost drum
#

depends what you choose

lean lark
#

The tools are different. Use them for what they do best. If you bang everything with a hammer then all problems start to look like a nail.

lucid mason
#

Sure. There are various culprit to overcome in conversations with GPT. It often helps to tell it to plan as if we were starting from scratch with no regard to keep existing implementations. That opens new idea and helps breaking out of various "running in circles" situations.

lean lark
#

And many people just keep using the same thread for everything. The context is polluted with things that were, that no longer exist, and that creates a ton of confusion. Use new threads for new issues so that the assistant gets a fresh look at the current state.
(It's like dating someone with relationship baggage...)

#

Anyone else seeing that VSCode Codex Extension takes a long time to load prior tasks - or can't load them? There was an update today, dunno if that's relevant.

lucid mason
#

Sometimes you have to flood GPT with data, like measurement/benchmark data and failing test-results of what it thinks to be the right solution just because it is the current implementation. Once it gathered enough data proving different it gets persuaded by sheer amount of new history. And once it is thinks that new/better solutions come from itself it's easier to make it apply real changes instead of just pretending to do so.

lost drum
#

I just used clief knoledge + Jaymin-West and some of mine ANTI_SLOP_DETECTION.md

lucid mason
#

A few days ago I had this answer after confronting it with throwing more smoke than actual solutions:

I did not intentionally lie. I was wrong, and I overstated what had been replaced.

What actually happened is this:

I did remove and rewrite a lot of code.
But I did not achieve the strict architectural reset you asked for.

lost drum
lean lark
#

I suggest it was asked to do a little too much in that one task. These tools are great but they have limits that we need to accommodate.

lucid mason
#

To be fair, even after starting the module in question from scratch (deleting the file) the problem persisted. And in the end it wasn't GPT's fault, but a limitation of the programming language I used that was badly/not documented. So we had to dig into the problem from top to bottom layer by layer until we found the culprit and then it was easier to make GPT implement better solutions.

lost drum
#

ye I am now on same stage haha

lean lark
#

Which language - curious?

lucid mason
#

It basically came up with the same mistake over and over again, because we didn't know what the mistake was and thought it was a correct implementation, aka we looked at the wrong place.

lucid mason
lean lark
#

Ahhh, love AHK.

lucid mason
#

GPT can handle AHK v1 rather well. In the beginning I tried to convert my old existing script into v2 before turning it into the proper app it is now, but that kept throwing errors with every run so many times that I had the impression that GPT doesn't handle v2 so well to begin with (likely because it is even more niche with less code examples on the web).

#

My main grief is that I have to make GPT build our own UI layout manager from scratch. Something like C# would need much less work in that department, especially because I only use UI for settings dialogs. But it's a learning experience.

lean lark
#

The docs have never been great, even though the tool is. The forum is a great source of info/community. I wonder if anyone has dumped AHK docs and the forum history into some KB for a model to consume.

lucid mason
#

Also doubled the memory footprint of that app for the whole settings UI part. But around 15 mb is still absolutely small enough.

#

And today we restructured exactly that part, which is what cost me the 5h token limit. ๐Ÿ˜„

"How would you improve the structure if starting from scratch (we don't have to hold on to existing implementations)?"

"Iโ€™d restructure around a declarative surface model, with feature modules providing data/actions and shared support owning almost all GUI mechanics."

#

Classic "make it think it was its own idea to change the implementation". ๐Ÿค“

lost drum
#

55% to spend in next 24h damn

lean lark
#

You asked it to actually do all of that work in one shot?!? ๐Ÿ˜†

#

Have you created a Custom GPT or just a ChatGPT project with a lot of AHK v1/v2 docs?

lucid mason
#

Implement this in one large restructure pass, you may even delete files and start from blank files to prevent editing existing ones. Also update all corresponding maintenance files, including the architecture files for future chat prompts to know the goal.
From the plan it looked like it was a large change, but most of the structure was already there. But the main goal was to make GPT get over itself and its often overcautious mini-steps to finally have the already implemented dynamic layout manager model be applied app-wide. Despite many former implementation steps there were still too many parts that kept clinging to the older pre DLM implementation. The real changes weren't really that big/many:
16 files changed +331 -133

#

And then in the following cleanup:
22 files changed +355 -252

lucid mason
lean lark
#

Hmm. Honestly I'm starting to get lost without context but I'm following that you didn't ask it for that much and it seemed to choke. It's possible that with each file it's struggling Very hard and consuming a lot of tokens in that process. Given the pattern I suggest much smaller tasks. And yeah, unfortunately this means babying it a bit cuz we can't expect it to do larger tasks with AHK v1 like it can with other libs.

lucid mason
#

And I did some experiments before deciding that GPT could handle AHK v1 well. Even though my files tell it to look at AHK documentation it really works rather well from its own inference without official docs already.

#

I'm fine with it burning tokens on going over larger structures. But both my little projects are also split into modules and GPT only looks at what it needs for each specific task.

#
  `STARTUP_COMPLIANCE: read baseline owner docs and active revision headers; active revision context <package/file revisions>; task type = prompt/bootstrap; extra files read = none.`
- Docs-only:
  `DOCS_COMPLIANCE: read baseline owner docs plus <affected owner docs>; active revision context <package/file revisions>; task type = docs/<topic>; source files read = none|<source groups>.`
- Focused implementation:
  `MODULE_COMPLIANCE: read baseline owner docs plus owner docs <owner docs> and source files <source groups>; active revision context <package/file revisions>; task type = <specific task>.`
- Build/profile:
  `BUILD_COMPLIANCE: read baseline owner docs plus build/profile files <files>; active revision context <package/file revisions>; task type = build-profile.`
- Broad integration:
  `FULL_STARTUP_COMPLIANCE: read baseline owner docs, all maintained source files, and owner docs <UX/DLM/support/build docs as applicable>; active revision context <package/file revisions>; task type = broad integration.```
lean lark
#

Gosh bud, it sounds like you're doing everything you need. Now I'm wondering if GPT 5.5 High will just chew up more tokens or if it will be more intelligent about doing that specific job for you.

#

I have no idea what those instructions are. Is that your AGENTS.md???

lucid mason
#

This is the current maintenance prompt for this one project:

`SysToys` in the current folder.

Start by reading the startup set required by `README.md`:
1. `README.md`
2. `docs/ai-handoff-notes.md`
3. `docs/module-map.md`
4. active revision headers across maintained `.ahk` files only

Use `README.md` for startup mode, compliance-line format, chat-title hint
wording, package rules, and when to upgrade from baseline startup. Use
`docs/module-map.md` for file ownership and task-file selection. Treat the
selected startup-mode files as active package rules, then give the applicable
short compliance line and the README-defined title hint only when README says it
is still useful for this chat.

Read the required startup and owner docs fully before analysis or edits. Treat
their instructions as mandatory for this task, not as background reference. Do
not rely on memory, older chats, or handoff summaries when the owner docs for
this scope apply.

When the concrete task requires more files, owner docs, or source bodies, load
them according to `README.md` and `docs/module-map.md` before analysis or edits.
When a task touches a scoped area, load that area's owner doc first and follow
it as the source of truth for that scope.

When AutoHotkey language/runtime behavior or Windows/platform behavior is part
of the task, consult the relevant primary documentation before relying on
inference. Use AutoHotkey v1.1 docs for AHK behavior, Microsoft docs for
Windows/platform behavior, and the smallest local evidence-backed test only
when the docs are insufficient or ambiguous.

Keep this prompt as a bootstrap pointer. Do not mirror owner-doc rules here.```
velvet wren
#

I have found GPT-5.5 Medium to be my sweet spot for SwiftUI/macOS development

lucid mason
#

I had GPT write this for me. ๐Ÿ˜‰

lean lark
#

Yeah Robert, but he's working with a different language (AHK scripting) so I'm considering how a change to the reasoning or model might help.

lost drum
#

"When the concrete task requires more files," how would he know when is when you cant assume he knows cause tehn its just him doing a coin flip

lean lark
#

When I look at that cryptic stuff, if I can't understand it, I can't imagine how the model can understand it. That's not some human hubris. I'm saying it's way too cryptic to make sense (I believe) even to a model that's opening a task for the first time.

plush harbor
#

installed the new app yesterday, used more tokens to talk about a project than I used on cli a couple weeks ago to develop, test and tweak a complete app on cli and its personality is annoying :\

#

also why can't you move chats into projects

lucid mason
#

Most of the code is written by ChatGPT (web app), so other than using "Thinking" and the model I have little control over it anyway. And then I mostly use Codex to review and cleanup or to try it on more complex code problems where I hope that a "different pair of eyes" can make a difference. In all these cases I usually prefer High. I might use Medium if I did more of the regular coding work there.

In ChatGPT I sometimes switch back to 5.4, because 5.5 regularly producing problems in its own backend/container is getting on my nerves, like Python timeouts. I don't want to watch it fighting its own problems instead of fighting mine. ๐Ÿ‘ท

lucid mason
hard drum
#

so, is the cli version just buggy in terms of usage?

#

should i be using the macOS App instead?

plush harbor
hard drum
#

i did last time have a multi-agent bug hog up my usage (i did not get compensated for a bug i did not introduce)

lean lark
#

It's a good practice to switch models depending on task too.
I need to back away for a while. I'm not helping anyway. ๐Ÿ™‚ I have some issues I'm working through with Codex too. ๐Ÿ™‚
Good Luck!! ๐Ÿ€

velvet wren
lucid mason
#

My original idea was to use sub-agents of lower reasoning effort to due menial tasks like reading data out of log files or scanning data out of screenshot images. But unfortunately it turned out that Codex in VS Code always creates agents at the same reasoning effort that the main chat/agent uses with no way to change it (it changes back when you do so manually).

plush harbor
#

I might get it to summarise and throw it at the grumpy cli

velvet wren
lucid mason
#

And in the end you had to spawn new sub-agents regularly, which meant that every agent had to first use tokens for starting and understanding its task. Agents help to speed things up and parallelize, but the idea to use less tokens was a dead end.

plush harbor
#

came up with a dumb idea last night that would recycle a lot of stuff from my main project, since this one is the dumping ground for overflow anyway so it has a lot of common elements

lucid mason
#

And I just used my free month of Plus and now kept Plus for the time being. So my Codex token limits are generally just too small to use it for everything.

trim rapids
#

anyone had issues with long-running tool calls causing login token invalidation? one of my projects has probably 1000 different unit tests and lately I find that after about half an hour of waiting for the test runner script to finish, my codex oauth token gets invalidated and the CLI session fails saying the token was revoked, at which point I have to log in with the whole 2fa code song and dance again

lean lark
#

Dunno if this helps anyone but ChatGPT with the @ GitHub app active can also read and write to a GH repo, just one file at a time and it prompts for everything. It also can't see the entire repo unless it's already indexed. So there are limits but it's zero tokens for the right use-case.

plush harbor
velvet wren
plush harbor
velvet wren
velvet wren
#

Codex has some standard Pet options but I created my own Koala custom pet

lucid mason
# lost drum "When the concrete task requires more files," how would he know when is when you...

Here is an example. I uploaded the complete current package as ZIP to ChatGPT and posted the maintenance prompt. Then I posted:

Discuss the following two alternatives: "Prefer persistent HWND trees for all non-modal managers/editors." versus "Persistent HWND reuse can improve open time, but it can also retain memory and AHK GUI state. For this project, the better rule is: Persist managers where reuse is common and state is stable. Keep modal editors lightweight unless first-open/reopen cost remains a problem. Do not prebuild hidden HWND trees at startup." plus "remove passive HWNDs only where no interaction is lost"

GPT's first sentence while thinking was:

Iโ€™ll ground this in the packageโ€™s GUI/DLM/support rules, then compare the wording as a maintenance-rule choice rather than as a generic GUI preference.
And then the final answer started with:
DOCS_COMPLIANCE: read baseline owner docs plus docs/dlm-architecture.md, docs/support-architecture.md, docs/ux-style-guide.md, and docs/goal.md

As you can see it knew very well which files to work out of the whole package.

lost drum
plush harbor
#

I'm getting it to make me a bucket I throw cats into at the moment. I suspect it is not going to take long

#

this used to be a drupal 6 site. It got hacked ... I made it static. Talk about archaeology

lost drum
#

XDD would be cool to have such icon above a live codex usage

velvet wren
lost drum
velvet wren
lucid mason
#

So while I also like to use Codex to clean up code its lack of own chat history can make it misinterpret some code intentions. That's the danger of hoping back and forth between Codex and ChatGPT.

plush harbor
#

The core php site that faces the front end though, I made that in 2010 and its been chugging along happily ever since

#

oh god its built me a beige site with 20px round corners, panels with drop shadows, tan buttons, and Georgia for a font. My earlier joke about codex's favouite beige stands

velvet wren
#

it loves those colours

plush harbor
magic moat
#

Wow, I had Codex make a spreadsheet for my Terraria appโ€™s update roadmap.

velvet wren
magic moat
#

Itโ€™s pretty good yeah

craggy jewel
# plush harbor oh god its built me a beige site with 20px round corners, panels with drop shado...

surprising that it went beige as there the code base instructions for at least 5.5 High are :'You do not make one-note palettes: avoid UIs dominated by variations of a single hue family, and limit dominant purple/purple-blue gradients, beige/cream/sand/tan, dark blue/slate, and brown/orange/espresso palettes; scan CSS colors before finalizing and revise if the page reads as one of these themes.' Did you have a prompt/spec that muddied/overrode that somewhat?

plush harbor
#

I am having far too much fun with the openai api. Its like chatgpt but .. deterministic

lucid mason
#

One problem with using Codex/GPT for personal projects is that its planing seems to default to "we still have to deliver working executable to end-users/customers". This makes it overcautious, trying to implements larger changes in smaller transition steps, as makes sense for more public/commercially oriented projects. Sometimes you have to convince it to "go for finish line".

Agreed. With that preference, my earlier phased framing is too conservative.

The corrected project direction should be:
Do the full surface/DLM ownership restructuring as a cohesive architecture pass, even if intermediate revisions are temporarily non-executable. Avoid leaving mixed old/new GUI ownership patterns in place. The goal is one finished GUI framework, not an incremental facade beside the old one.

plush harbor
craggy jewel
plush harbor
velvet wren
torpid trout
#

U can also go and remove the system prompt they have for design, itโ€™s horrible

plush harbor
#

heh. My sites are bright magenta/purple/blue/green kid's site and I had existing style and markup for a small part of the site so I just told it to match markup. The site I just finished was based on a gemini mockup and I had to get gemini to patch the css to get proper metallic gold. It works well off mockups

torpid trout
#

Physician detected

plush harbor
#

I use palleton to get colour sets quite regularly

craggy jewel
#

Here's the 5.5 High base_instructions re: web ui's and the styles/colors prompt portion. I often wonder what it takes for Codex to ignore this.

plush harbor
#

the site I just finished looks like this. I'm not sure I am motivated enough to align all the icons right now

craggy jewel
plush harbor
#

I'm stewing a project in the back of my head that's pure front end and getting codex to play nice with it is going to be interesting

craggy jewel
velvet wren
plush harbor
craggy jewel
velvet wren
lucid mason
plush harbor
#

people keep telling me I should start a business doing websites. I've been a frontend dev for [too long to admit] but not a designer. Then I compare my design skills to codex and yea.

craggy jewel
plush harbor
craggy jewel
#

I'm thinking one needs to point codex very strongly to a really accurate styleguide or it will wander. I'll be finding out this theory soon enough.

plush harbor
#

occurs to me I'm currently asking codex to write a single static site generator from json and this other project is really similar. So I guess I'm on the bridge now

craggy jewel
# plush harbor I was thinking of making a static site generator using design patterns in templa...

thing is if the generator calls a model ex 5.5 to create a site and you get the results just right, subsequent results over time with nerfed/buffed models, new models a year from now etc may give different results because of their weird biases. I'm trying to mitigate that so that when i get the model to create the perfect result, i have it document the path it too to get that result, then have it create a stand-alone script/binary that can replicate the result in an attempt to get some kind of determinism. tricky stuff indeed.

plush harbor
lucid mason
#

For app UX/GUI I pointed Codex/GPT to multiple styleguides and had it compile what these agreed on, then we coupled that with Microsoft's own fluid/fluent styleguide and added that as ux-style-guide in the source package to regularly reference.

#

Sometimes I take a screenshot of a dialog and ask GPT if that follow our ux-style-guide and then let it critique its own work. ๐Ÿง

plush harbor
#

I struggle with getting idea in head out so I tend to go part way and then fiddle with css and I'm having huge wins pointing something that's not right at gemini and describing why its not right and it fixes it. Chatgpt just goes "yeah that does look a bit off, I agree" and doesn't help. Grrrr

lucid mason
#

Overall I get the impression that GUI/frontend might not be GPT's strong side. And I mean to remember that I also read about that opinion.

plush harbor
#

designers are not losing their jobs just yet

craggy jewel
lucid mason
#

The main reason for me to build a whole GUI framework just for settings dialogs is that I got sick of going in circles with GPT trying to create them as static designs. While fixing problems in one area it would always (re)introduce problems in other areas. And it became a chore of telling it to move things left and right. Web design should be a bit easier to begin with, though, because AHK is really old-school Windows GUI based.

#

I can't say much about model differences, as I lack experience and history with that. And even the while I have GPT do frontend work as we type I don't even want to do this, I just have to. After a few days of working on that I specifically called it a day and went back to the actual backend work I was more interested it, even if just to get away from the frontend stuff for a bit.

#

But again, that's also a problem based on using the rather "classic" Autohotkey. There is nothing "fluent" or modern about it to begin with. It's really based on a "move this elements x pixels in that direction mindset".

craggy jewel
#

From the 5.5 High base_instructions, maybe a reference layout style image is required? 'must' is pretty strong guidance.

Visual assets required, with game-specific exception

Websites and games must use visual assets. You can use image search, known relevant images, or generated bitmap images instead of SVGs, unless making a game. Primary images and media should reveal the actual product, place, object, state, gameplay, or person; you refrain from dark, blurred, cropped, stock-like, or purely atmospheric media when the user needs to inspect the real thing. For highly specific game assets you use custom SVG/Three.js/etc.

plush harbor
#

wait, codex swears? I don't recall swearing at codex

lucid mason
#

"Strong" wording to implement "rules" instead of "guidelines" also seem to become softer and softer as chats go on. So you regularly have to remind GPT about those instructions.

craggy jewel
lucid mason
#

Does increasing the context window finally work in Codex 5.5? Didn't check for a few days.

craggy jewel
lucid mason
#

Codex 5.4 was funny. You could set any arbitrary window size, even something like 5M. Doing so didn't make any sense, but it would report that size in its status (and VS Code overlay).

#

Codex 5.4 worked well enough up to 500K, it could be stretched towards 700K, but then it really drifted into becoming very senile.

#

Once it repeated answers to older prompt inputs while completely ignoring what you just wrote you knew it became lobotomized.

craggy jewel
#

Maybe that's why they limited 5.5 to a quarter of that size. Super-long context's would degrade too much, and perhaps they don't have a fix or it might be just how it is with a llm.

#

Now that i think of it, i don't think the standard public model ex. SWE benchmarks we see ever mention the length of the context window when the result was measured, whether compaction occurred etc. Hmmmm....

lost drum
#

oh man I wwish my thread could send promts to another thread

#

I mean all I do is copy paste but would be cool if he could do it

lucid mason
#

I also do the copy & paste thing sometimes.

#

In ChatGPT you have memory, which relays information from one thread to another.

lost drum
#

I mean I am in codex and have just 2 seperate projects anc I need to send pormpt from first one to another one but I guess its the only way for now

lucid mason
#

Maybe you could add an agent to each that writes/reads from a shared file.

plush harbor
#

Told it I need to have a stern word about its use of beige and it said yes, beige is not a hill worth dying on

lost drum
#

I could make a pet for that but its more risky to autmaote such easy task than just make me do it

unique spade
# lost drum oh man I wwish my thread could send promts to another thread

codex threads are just some files on your disk

and on the dynamic side you send some text block over api to the model and get back some response, based on some predefined rules

so ofc you can have 2 threads talk to each other ๐Ÿ™‚

but you have to make it by yourself if you really want it

lost drum
unique spade
# lost drum make it myself? or isntuct codex to do it

codex is a tool. it answers the "how"

you can cut bread with your bare hands or with a knife ๐Ÿ™‚

my point was that what you want is not available in the vanilla implementation. so if you want that you need at least a codex fork of your own. or maybe look for some 3rd party app that provides that behavior

lost drum
unique spade
#

as opposed to chatgpt, where you can't make 2 threads talk to each other without automating some copy paste from one to another

with codex is different because you can program the interaction

#

but yea you have to mutate codex first ๐Ÿ˜„

zenith linden
#

Is codex genuinely good enough to code a sellable website. Obviously good prompting goes a long way. Iโ€™ve been using ChatGPT and codex side by side and playing around. So far Iโ€™m just 3 days Iโ€™ve built a pretty decent website. Itโ€™s not perfect but on the outside it looks just as good as a lot of higher end sites.

cedar skiff
oak trellis
#

i hope they do NOT reset the limit on monday

#

i have 38% left

plush harbor
#

imma see if I can get this entire site done within a 5 hour window, for funsies

uneven kayak
uneven kayak
cedar skiff
plush harbor
#

64 with less than 3 days left over here and was struggling to come up with something to do, hence the sideline into cats

uneven kayak
plush harbor
#

I'm on plus, its not as much as you think

uneven kayak
#

I already have a plan to build my website but I just haven't gotten around to doing it yet

uneven kayak
plush harbor
#

I have several websites, they all predate AI. I rebuilt a couple of the worst ones to very very simple static html last year, could have got codex to do that for me in hindsight

#

rebuilding a small one now from jekyll to an input form -> json -> static

uneven kayak
#

Nice, kicking it up a notch with AI, as it should be lol

#

The primary function of my website will be a player dashboard with lifetime stats and a web shop that integrates with the game

plush harbor
#

got a lot of manual steps for monday's deploy so don't want to do a big project switch before then. Took me far less time for the reskin and closing the backlog than I was thinking

nocturne folio
lost drum
#

I cant go back to plus plan it would feel like prison knowing that I have 20x usage of it and ye

plush harbor
nocturne folio
#

its like an hour of work on low

plush harbor
#

what do you mean an hour of work lol, can use it for quite some time. I'm having lunch now and letting it sleep

lost drum
#

I use 5.5 xhigh speed

#

cause it requires it rn

#

I mean the speed no but I have 52 % left till 12th

nocturne folio
craggy jewel
# lost drum oh man I wwish my thread could send promts to another thread

You could create a mcp server that each agent/thread can access, inform it that exists, get a list of other threads from any other event non codex agents. Develop a standard for interthread communication. After each thread prompt (detectable via skill or hook), tell it to update the mcp server with its status and check for any tasks assigned to it by other threads. Thereโ€™s your vendor-agnostic v pattern right there. Prob take a couple hours to code that with 5.5 high. End result is thread a can pass info to thread b and even wait for a response. Achievable and testable.

nocturne folio
#

15 cents omg

cedar skiff
#

more expensive than opus per token

lost drum
#

those your codex do it "Then Iโ€™ll patch rather than overwrite." haha

uneven kayak
#

Something I didn't really anticipate, hosting costs for my game server.

I just calculated it would cost about $150/month to host it on AWS with absolutely no players. Fortunately that cost stays about the same for the first 1,000 players or so, but that is still a lot of money to host a game that may or may not ever see a profit. $200 for AI services and $150 for hosting, I'm spending no less than $350 per month to make that game happen.

#

So I'll just push forward and do the best I can for like a year. If there's no demand and there are no players in 12 months, I'll reevaluate the future of the game and maybe move in another direction.

zenith linden
plush harbor
lost drum
plush harbor
#

I plan to automate something like that and yeah, needs reporting, seo etc under the hood with it. And automated domain mapping

lost drum
#

I dont thimk you can jsut build a website and tell you wont manage it at all

uneven kayak
uneven kayak
lost drum
uneven kayak
lost drum
plush harbor
lost drum
#

build a connection so they feel like beeing part of your project

meager bobcat
#

ppl in gamedev overemphasize marketing as procrastination justification lol

lost drum
#

would even make them feel special but not just menitoning "thanks to this user this feateure was revumped" not only it would bring more traction cause everyone would desire to feel as speial but as a bonus you would meniton taht "this user will get this and this in the game" idk I think its better to taltleast do smthing liket this than nothing

meager bobcat
plush harbor
#

emphasis low traffic. If you have a database backed site it can get swamped

meager bobcat
#

yeah

uneven kayak
plush harbor
#

putting the cart before the horse hey

lost drum
uneven kayak
lost drum
plush harbor
#

one of my sites has like 600 idioms on it ... a LOT of ai pictures in that. That one was really hard, AI likes to put horses before carts

uneven kayak
plush harbor
#

you'll lose an hour looking at the pictures ๐Ÿ˜› I still have just over 3000 images to upsize, and a heap more to source. Planning to do that over the next few months, get codex to write a script to help find them for me

uneven kayak
plush harbor
#

close to top of google search for photographic idioms for me at least

uneven kayak
plush harbor
#

heh, that's Gemini's text showing in the results

#

if I'm gonna finish this website in my 5 hour window I only have an hour left. Need more coffee