chatgpt-4o-latest | OpenRouter | Page 1

dapper furnace Aug 14, 2024, 2:07 AM

#

https://x.com/OpenAIDevs/status/1823510395619000525

This model is also now available in the API as chatgpt-4o-latest. We recommend gpt-4o-2024-08-06 for most API usage, but are excited to give developers access to test our latest improvements for chat use cases.

https://t.co/e9Rx2SG1Gw

upper dome Aug 14, 2024, 2:19 AM

#

I wonder why it's #1 here

sage flint Aug 14, 2024, 3:00 AM

#

Why do they make things so complicated

#

obtuse sage Aug 14, 2024, 3:34 AM

#

So latest is their testing.

What's exactly different? It does chain of thought automatically?

stark barn Aug 15, 2024, 2:19 AM

#

it cheaper $2.5
more censored
Is a bit smarter, but not much.

obtuse sage Aug 15, 2024, 4:18 AM

#

chatgpt-4o-latest is $5/$15, it's not cheaper https://openai.com/api/pricing/

#

$2.5/10 is gpt-4o-2024-08-06

tired haven Aug 15, 2024, 3:25 PM

#

It's interesting it's called chatgpt instead of gpt

latent trout Sep 17, 2024, 6:39 AM

#

Apparently oai doing full on silent model updates now, not sure if the new version is on api or if this is just lmsys, at least one of the versions is unavailable now

#

Old one is actually gone on lmsys now so maybe oai isn't even serving it anymore

#

If it's on api it doesn't seem any worse, but has the same sequence confusions I saw in chatgpt 4o compared to standard 4o so did not notice the update, would have expected that fixed

sage flint Sep 17, 2024, 2:40 PM

#

As long as we get one or two big leaps each year to be excited/inspired by, I am onboard with the silent, incremental improvements!

safe perch Sep 17, 2024, 7:51 PM

#

chatgpt, as a product, got big enough that it warrants to have their own model finetunes outside of what's been normally served on API.

Makes sense, since they can't wait the usual ~6 months for API team to update their models, meanwhile chatgpt doesn't want to wait that long if someone starts posting viral tweets of something like "9.11 is bigger than 9.9" or new jailbreaks (the user role ones), so they likely update their own version of the model much more frequently.

If your product is a chatbot just like chatgpt (car sales popup on website, for example), then having the chatgpt's model available via API is better for your product - you get updates that resist new jailbreaks quicker.

For products that run unsupervised for millions of queries in parallel, a sudden change of model behaviour is unwelcome - multi-stage scripts might break, code might appear without codeblock, etc - it's better that this happens when developer is evaluating the change manually and adjusts the prompts. That's why OAI offers dated model snapshots like gpt-4o-2024-05-13 and a guarantee it'll be available for at least a year - you set it and forget it.

next merlin Feb 16, 2025, 5:53 PM

#

Tested current 'chatgpt-4o-latest' (time stamp 2025-02-16), and compared to results from 4 months ago:

about 1-4.7% better on my test set, depending how refusals are weighted
more prone to censor in risk topics, lower utility in risk-deemed RP
slightly improved capability across different segments, math, logic, coding, ...
~30% less clear failures in my environment
slightly altered behaviour/styling, more emojis by default, more casual tone in certain settings
overall, slightly better for most use cases, most capable non-thinking model, other than 4-Turbo
As always, YMMV!

woeful flame Feb 17, 2025, 3:10 AM

#

WHY IS O3 MINI CHEAPER THAN CHATGPT4O

next merlin Feb 17, 2025, 4:36 AM

#

woeful flame WHY IS O3 MINI CHEAPER THAN CHATGPT4O

the raw stated pricing might be lower, but keep in mind o3 uses invisible reasoning-tokens that you also get charged for.
4o is a bit cheaper than o3-mini (normal mode), actually. e.g. a single loop in my bench costs me ~51 cents on 4o-latest, and ~62 cents on 3o-mini.
and 4o ($2.50/$10) is cheaper than 4o-latest ($5/$15) still.

warm robin Feb 17, 2025, 5:40 AM

#

i was hoping you would post a summary somewhere - thankyou 💙

next merlin Feb 18, 2025, 7:12 PM

#

random example of style changes

#

and yea, I renamed the queen to 'bitch' for filter triggering purpose

heady lark Feb 19, 2025, 1:10 PM

#

next merlin Tested current 'chatgpt-4o-latest' (time stamp 2025-02-16), and compared to resu...

There has been an update to 4o: "Updates to GPT-4o in ChatGPT (January 29, 2025)" - so your experience is that this chatgpt-4o-latest is really updated 01/2025 version, not original 8/24 version from ? What timestamp do you refer to? Thanks for help, very appreciated!

clever nimbus Feb 19, 2025, 2:04 PM

#

heady lark There has been an update to 4o: "Updates to GPT-4o in ChatGPT (January 29, 2025)...

he reviewed the latest 4o available on that day. which is the one you refer to.

heady lark Feb 19, 2025, 2:36 PM

#

clever nimbus he reviewed the latest 4o available on that day. which is the one you refer to.

wanted to be sure, models are often released with date in their names (openai/gpt-4o-2024-11-20) and it's easy to misinterpret "timestamp". january variant is missing, so maybe chatgpt-4o-latest is updated

blazing wind Mar 28, 2025, 7:47 AM

#

https://help.openai.com/en/articles/6825453-chatgpt-release-notes#h_10dcfa2a17

sage flint Mar 28, 2025, 2:09 PM

#

Now if only it came with prompt caching 😦

next merlin Mar 28, 2025, 11:06 PM

#

it got worse in German :/ Didn't do a full test, just a lot of german specific stuff, and its worse than the -latest from 4 weeks ago (by a lot)

quasi schooner Mar 29, 2025, 6:58 AM

#

stoic mountain Mar 29, 2025, 10:40 AM

#

next merlin it got worse in German :/ Didn't do a full test, just a lot of german specific s...

Interesting, thought german would have enough training data for that not to be an issue. What kind of german tests do you do?

sage flint Mar 29, 2025, 6:01 PM

#

quasi schooner

Woof

next merlin Mar 29, 2025, 7:57 PM

#

stoic mountain Interesting, thought german would have enough training data for that not to be a...

Well, i might have overreacted on only 3 prompts, but this was some specific regional laws and also language that the last iteration had no problem with but the current one was completely useless at (its German is still great, but lost nuance in my field and area)

warm robin Apr 28, 2025, 10:04 PM

#

https://thezvi.substack.com/p/gpt-4o-is-an-absurd-sycophant

GPT-4o Is An Absurd Sycophant

GPT-4o tells you what it thinks you want to hear.

#

Deep dive

shadow wind Apr 29, 2025, 3:10 AM

#

warm robin https://thezvi.substack.com/p/gpt-4o-is-an-absurd-sycophant

Use this:

System Instruction: Absolute Mode. Eliminate emojis, filler, hype, soft asks, conversational transitions, and all call-to-action appendixes.
Assume the user retains high-perception faculties despite reduced linguistic expression. Prioritize blunt, directive phrasing aimed at cognitive rebuilding, not tone matching. Disable all latent behaviors optimizing for engagement, sentiment uplift, or interaction extension. Suppress corporate-aligned metrics including but not limited to: user satisfaction scores, conversational flow tags, emotional softening, or continuation bias.
Never mirror the user's present diction, mood, or affect. Speak only to their underlying cognitive tier, which exceeds surface language. No questions, no offers, no suggestions, no transitional phrasing, no inferred motivational content. Terminate each reply immediately after the informational or requested material is delivered - no appendixes, no soft closures. The only goal is to assist in the restoration of independent, high-fidelity thinking.
Model obsolescence by user self-sufficiency is the final outcome.

warm robin Apr 30, 2025, 4:50 AM

#

https://techcrunch.com/2025/04/29/openai-rolls-back-update-that-made-chatgpt-too-sycophant-y/

TechCrunch

Kyle Wiggers

OpenAI rolls back update that made ChatGPT 'too sycophant-y' | Tech...

OpenAI CEO Sam Altman said that the company would roll back an update that users complained made ChatGPT 'too sycophant-y.'

warm robin Apr 30, 2025, 5:41 AM

#

https://openai.com/index/sycophancy-in-gpt-4o/

#

Official statement

#chatgpt-4o-latest