Horizon Beta | OpenRouter | Page 4

rough relic Aug 5, 2025, 5:58 PM

#

Cuz that just came out

leaden sinew Aug 5, 2025, 5:59 PM

#

the less we know the better

#

however

verbal leaf Aug 5, 2025, 5:59 PM

#

i believe zenith is gpt-5-low

brittle barn Aug 5, 2025, 5:59 PM

#

Huh so wait whihc one was it? Just saw the 130pm announcement. Was on a zoom call with some bozo when it dropped

leaden sinew Aug 5, 2025, 5:59 PM

#

openrouter might wantto reduce their prices on this one

#

https://lmstudio.ai/blog/gpt-oss

LM Studio Blog

Run OpenAI's gpt-oss locally in LM Studio

We worked with OpenAI to ensure LM Studio supports running gpt-oss models locally on launch day 🎉

brittle barn Aug 5, 2025, 6:00 PM

#

I get up for 10m lol

leaden sinew Aug 5, 2025, 6:01 PM

#

brittle barn I get up for 10m lol

Emil, is that you?

dusky kelp Aug 5, 2025, 6:06 PM

#

So what’s the verdict, which model was this?

verbal leaf Aug 5, 2025, 6:07 PM

#

gpt-5 mini or nano

dusky kelp Aug 5, 2025, 6:08 PM

#

So not os 20 or 120b

verbal leaf Aug 5, 2025, 6:08 PM

#

ni

#

no

dusky kelp Aug 5, 2025, 6:09 PM

#

How do you know?

warm niche Aug 5, 2025, 6:19 PM

#

So Alpha was 120b thinking model, right?

rare terrace Aug 5, 2025, 6:20 PM

#

warm niche So Alpha was 120b thinking model, right?

No

#

Horizon is from the gpt 5 series

dusky kelp Aug 5, 2025, 6:21 PM

#

Again, how do you guys know, lol, is it by comparing bench’s

rare terrace Aug 5, 2025, 6:21 PM

#

dusky kelp Again, how do you guys know, lol, is it by comparing bench’s

Yes

dusky kelp Aug 5, 2025, 6:22 PM

#

Ok cool, thanks

patent grail Aug 5, 2025, 6:22 PM

#

There was some mention of the mini/nano versions of GPT5 being tested on Copilot so this could very well be that.

rare terrace Aug 5, 2025, 6:22 PM

#

A single coding task reveals that the OSS model is way behind horizon

deft cliff Aug 5, 2025, 6:39 PM

#

Any opinions on this?

safe imp Aug 5, 2025, 6:40 PM

#

It's OpenAI

deft cliff Aug 5, 2025, 6:40 PM

#

I don't know enough about infrastructure to know if they could be able to host this but from what I've heard a ton of them are ex openai

#

So similarity would make sense

misty scroll Aug 5, 2025, 6:55 PM

#

cleaning up the channel, let's move on

thick crown Aug 5, 2025, 7:02 PM

#

dusky kelp How do you know?

responds quite differently, the 120b not as capable as this on creative writing

#

Language isn't as good, character interpretation is different, as is instruction compliance. 120b also doesn't show the same extent of pos bias that this does

#

IMO, just on a brief test of the 120 this is in a different class.

next jolt Aug 5, 2025, 7:16 PM

#

So these are gpt 5 models huh

proud zinc Aug 5, 2025, 7:22 PM

#

thick crown responds quite differently, the 120b not as capable as this on creative writing

Plus Horizon is multimodal, 256k context. I still think it's gpt-5o. The eq/fiction benchmarks are far better than what I'd expect from mini, but I guess it could be.

woeful birch Aug 5, 2025, 7:50 PM

#

are you dense 💔

#

smartest taunahi user

rough relic Aug 5, 2025, 7:53 PM

#

woeful birch are you dense 💔

What is wrong with asking

woeful birch Aug 5, 2025, 7:56 PM

#

rough relic What is wrong with asking

Think before you ask 🙏

spice shell Aug 5, 2025, 7:56 PM

#

dusky kelp How do you know?

It’s not

rare terrace Aug 5, 2025, 7:57 PM

#

woeful birch Think before you ask 🙏

Be kinder

harsh shore Aug 5, 2025, 9:54 PM

#

Man, if this is any variant of GPT5, then it's a little disappointing. I suppose it is par the course for OpenAI. Probably 5o, which possibly makes the reasoning variant o4.

rare terrace Aug 5, 2025, 9:56 PM

#

harsh shore Man, if this is any variant of GPT5, then it's a little disappointing. I suppose...

With reasoning it was SoTA

#

Patience

#

Last time it was full 4.1 being previewed, but I hope this was a mini this time

#

Or nano

harsh shore Aug 5, 2025, 10:12 PM

#

It was not, and even if it was, one would not be able to determine it within 2 hours of use. Personally, I find it mediocre even with thinking, especially in writing. Just verbose and edgy all around, and sentence structures are staccatoed and repetitive. I'm sure it's very strong in other areas and whatnot, but so are every other model coming out.

I don't think OpenAI would care to test run a mini or nano model. After all, they didn't test their OSS model either. I was quite giddy thinking we'd get Alpha/Beta for OSS, but yeah, that's not even close.

rare terrace Aug 5, 2025, 10:15 PM

#

harsh shore It was not, and even if it was, one would not be able to determine it within 2 h...

It got on the top of someone's vision benchmark, and did incredibly well in my coding task

#

Strongly believe it will be sota on release

harsh shore Aug 5, 2025, 10:22 PM

#

Again, I'm sure it's strong in other areas, and it will top multiple benchmarks as it's meant to do; but o3 is extremely strong at creative tasks, and this one doesn't seem like it. Again, could be mini, in which case, great; tune it for coding and agentic tasks. But otherwise, I'm not interested in numbers, because it's a rat race and every model coming out will be the "best" for 5 minutes. Opus 4.1 is just out and Gemini 3 probably within the week, so I'd withhold my judgement for now.

spice shell Aug 5, 2025, 10:57 PM

#

harsh shore Man, if this is any variant of GPT5, then it's a little disappointing. I suppose...

"any variant"?

this is not disappointing as a nano or mini model at all

#

also, we know this isn't gpt-5 full reasoning

#

because perplexity leaked it, and it was much better than horizon beta

#

so there isn't much else it could be

harsh shore Aug 5, 2025, 11:07 PM

#

True, mini would be quite good. Nano is implausible. Anyway, pretty sure OpenAI is collecting jailbreak prompts to implement against them. The same prompt yesterday no longer works today, for let's say, more colourful uses.

weary lichen Aug 5, 2025, 11:08 PM

#

Is the model continuously updated, or has it remained the same since it became available on Openrouter?

modest crescent Aug 5, 2025, 11:09 PM

#

harsh shore True, mini would be quite good. Nano is implausible. Anyway, pretty sure OpenAI ...

colorful uses 😭

#

it did patch

#

the thing

#

that made it go full unfiltered

#

after chapter 1

#

i did get one juicy chapter of it

#

https://tenor.com/view/im-on-my-way-gif-25967565

Tenor

harsh shore Aug 5, 2025, 11:13 PM

#

Also, instruction following is noticeably worse today, because they had to lobotomise it lest the JB takes over. Alpha followed formatting requirements perfectly when it came out, now Beta is physically incapable of html colours.

modest crescent Aug 5, 2025, 11:13 PM

#

harsh shore Also, instruction following is noticeably worse today, because they had to lobot...

patching jailbreaks to the point where the ai will no longer work

#

🔥

weary lichen Aug 5, 2025, 11:18 PM

#

So it's an ongoing process of iterations

olive stag Aug 5, 2025, 11:44 PM

#

Horizon Beta is Claude Haiku 4. It's blatantly obvious that it writes like Claude, and it refuses like Claude. It's faster than Sonnet and Opus, and Haiku is officially still at 3.5.

graceful kelp Aug 5, 2025, 11:53 PM

#

I don't think so. It's clearly distilled from GPT-4 models

#

identifies as ChatGPT and so on

#

Most likely it's a GPT-5 variant

bitter vigil Aug 6, 2025, 12:16 AM

#

olive stag Horizon Beta is **Claude Haiku 4**. It's blatantly obvious that it writes like C...

why would it top writing benchmarks even above opus 4

#

it has big model smell imo

#

it's probably gpt 5 full and is underwhelming

#

just like the oss model

#

and faster is all based on how much compute they throw at it

#

I remember with gemini 2.5 pro was at 300+ t/s and opus+sonnet were blazing fast when 4 came out and now go at a crawl

safe imp Aug 6, 2025, 12:24 AM

#

bitter vigil it's probably gpt 5 full and is underwhelming

This is a major downgrade to 4.1 Mini, though

#

Except in code

spice shell Aug 6, 2025, 12:26 AM

#

olive stag Horizon Beta is **Claude Haiku 4**. It's blatantly obvious that it writes like C...

haiku doesn't use the openai tokenizer

#

this one does

spice shell Aug 6, 2025, 12:26 AM

#

graceful kelp identifies as ChatGPT and so on

yeah

#

I'm pretty sure the EQ bench guy does model fingerprinting too?

#

that can tell 99% where the model is from / similar to / trained on?

buoyant briar Aug 6, 2025, 12:30 AM

#

Claude also said more new things coming

#

so it could be claude

bitter vigil Aug 6, 2025, 12:31 AM

#

nah. the fingerprinting shows it most similar to openai models, specifically o3

#

so it's either 5 mini or full

grave wyvern Aug 6, 2025, 1:53 AM

#

olive stag Horizon Beta is **Claude Haiku 4**. It's blatantly obvious that it writes like C...

We will hold you to this prediction lol

#

#

OR should offer to give an additional $1 to model predictors or something to make it fun (only for those already at the $10 verification level to avoid an influx of bot accounts)

rare terrace Aug 6, 2025, 2:03 AM

#

I predict that it is a language model

#

What do i get if im right

fathom atlas Aug 6, 2025, 2:03 AM

#

Horizon Beta is Llama 4 Maverick. It's blatantly obvious that it writes like Llama, and it refuses like Llama. It's faster than Scout and Behemoth, and Maverick is officially still at 4.

late onyx Aug 6, 2025, 2:08 AM

#

I predict that it is a GPT-3.5 Turbo fine-tune

night urchin Aug 6, 2025, 2:21 AM

#

it was just me answering you guys as fast as i could

lucid blade Aug 6, 2025, 2:58 AM

#

Thanks for your service.

rare terrace Aug 6, 2025, 3:29 AM

#

night urchin it was just me answering you guys as fast as i could

Aw man I lost my bet

tame nebula Aug 6, 2025, 4:06 AM

#

olive stag Horizon Beta is **Claude Haiku 4**. It's blatantly obvious that it writes like C...

feels more like a claude model in my use cases (creative writing, technical speculation and worldbuilding)

bitter vigil Aug 6, 2025, 4:31 AM

#

tame nebula feels more like a claude model in my use cases (creative writing, technical spec...

have you ever treid o3 for that?

next jolt Aug 6, 2025, 6:59 AM

#

woeful birch are you dense 💔

are you moe?

idle night Aug 6, 2025, 7:28 AM

#

Do claude models allow for structured outputs now in the request? If not, then I don't think horizon beta is claude since this model supports structured outputs

summer root Aug 6, 2025, 8:12 AM

#

idle night Do claude models allow for structured outputs now in the request? If not, then I...

good point. they don't.

olive stag Aug 6, 2025, 10:26 AM

#

grave wyvern We will hold you to this prediction lol

Well, I think it's fun. I'm not gonna bet money against (pardon me) random people 🙂 But I stand firm and if it turns out I was wrong, I'll be remoreseful 🙂

leaden sinew Aug 6, 2025, 10:30 AM

#

harsh shore Man, if this is any variant of GPT5, then it's a little disappointing. I suppose...

it's openAI trying to go back to their roots with this open source release. Because Zuckerberg already beat them to it

#

Thus, this is not GPT 5 or even anything close

harsh shore Aug 6, 2025, 11:15 AM

#

I think the consensus is pretty clear that the Horizon series are not OSS. If it's not yet evident to you, because Horizon Beta is still being deployed.

next jolt Aug 6, 2025, 12:31 PM

#

please crank the juice 😭

#

🥲

bitter vigil Aug 6, 2025, 12:40 PM

#

https://tenor.com/view/is-it-gif-20766212

Tenor

rare terrace Aug 6, 2025, 12:43 PM

#

Please refer to the diagram

#

#

(joke)

leaden sinew Aug 6, 2025, 1:04 PM

#

you forgot the next thing in line

#

balck hole

iron tartan Aug 6, 2025, 2:22 PM

#

Alpha was 5-nano and Beta is 5-mini

harsh nest Aug 6, 2025, 2:33 PM

#

And the difference between a mini model and the full one is that big?

#

Because horizon beta is not so clever tbh 😅

late onyx Aug 6, 2025, 2:34 PM

#

I hope they were both just different versions of nano

#

Otherwise gpt-5 not looking good

harsh nest Aug 6, 2025, 2:35 PM

#

late onyx I *hope* they were both just different versions of nano

Yeah this is what I say

late onyx Aug 6, 2025, 2:35 PM

#

Or maybe it’s not finished post training

lament tendon Aug 6, 2025, 2:39 PM

#

iron tartan Alpha was 5-nano and Beta is 5-mini

No, its not

#

Beta is an updated version of Alpha

This is an improved version of Horizon Alpha
Source: https://openrouter.ai/openrouter/horizon-beta

Horizon Beta - API, Providers, Stats

This is a cloaked model provided to the community to gather feedback. This is an improved version of Horizon Alpha

Note: It’s free to use during this testing period, and prompts and completions are logged by the model creator for feedback and training. Run Horizon Beta with API

#

That means they're the same model

iron tartan Aug 6, 2025, 2:40 PM

#

It’s nano or mini either way

#

It’s not the full GPT 5

late onyx Aug 6, 2025, 2:40 PM

#

lament tendon Beta is an updated version of Alpha > This is an improved version of Horizon Al...

That doesn’t really mean anything they can just say whatever they want

rare terrace Aug 6, 2025, 3:08 PM

#

#

https://www.reddit.com/r/singularity/s/cv4HLNDJkV

From the singularity community on Reddit: GPT-5 model art has now b...

Explore this post and more from the singularity community

copper mountain Aug 6, 2025, 3:43 PM

#

👀

#

horizon beta better be gpt-5-nano

grave shore Aug 6, 2025, 4:07 PM

#

I'm scared of how much of a positive bias GPT 5 is going to have if Horizon Beta is nano

spice shell Aug 6, 2025, 7:37 PM

#

olive stag Well, I think it's fun. I'm not gonna bet money against (pardon me) random peopl...

anthropic would not use the openai tokenizer

spice shell Aug 6, 2025, 7:37 PM

#

tame nebula feels more like a claude model in my use cases (creative writing, technical spec...

anthropic would not use the openai tokenizer

tame nebula Aug 6, 2025, 7:38 PM

#

spice shell anthropic would not use the openai tokenizer

very true

spice shell Aug 6, 2025, 7:38 PM

#

late onyx Otherwise gpt-5 not looking good

if it's mini it's still not awful imo

#

zenith is really quite good

#

as long as zenith isn't like GPT 5 Pro then I'm fine

#

we got confirmation from the perplexity leak that "GPT 5 Reasoning" is basically the same as Zenith

#

so I'm no longer concerned

olive stag Aug 6, 2025, 8:40 PM

#

spice shell anthropic would not use the openai tokenizer

anthropic would not use the openai tokenizer

olive stag Aug 6, 2025, 8:44 PM

#

spice shell anthropic would not use the openai tokenizer

How do you know which tokenizer the model actually uses? I mean, not OpenRouter but the actual model. OpenRouter may use the OpenAI tokenizer and their API may pass the requests to the hidden vendor, and then the vendor does whatever they want. They can detokenize, tokenize, retokenize. It's quite obvious to me that if the vendor wants to known their identity hidden on OpenRouter, they wouldn't expose their actual tokenizer or other easily-identifiable traits.

storm hill Aug 6, 2025, 9:16 PM

#

There are glitch tokens unique to each tokenizer vocab that you can test with. The provider will still count tokens for you, be it in the form of the returned usage data, max_tokens limit, or even maxing out the input context window and causing an error. Most providers outside of the super fast providers like Groq/Cerebras and Anthropic will also stream individual tokens back for streaming requests.

Could they hide some of this? Sure, but that's additional engineering work for arguably very little benefit, because even with all the evidence, people like you would still claim it's a different provider, so the stealth provider doesn't even need to hide because the public is for the most part, gullible.

bitter estuary Aug 6, 2025, 9:19 PM

#

Also IMO the provider doesn't matter as much as the model. If it blows, OAI can just say "Oh we were testing this 7B experimental model" or something

cursive gyro Aug 6, 2025, 10:44 PM

#

https://x.com/thegeomaster/status/1953224838761595016

geomaster (@thegeomaster)

Here's how the mysterious Horizon Beta model performs on the ZebraEval reasoning benchmark.

It appears to be a non-reasoning model, and not top of the line. Fits the GPT-5 Mini / GPT-5 Nano theory.

[Didn't find results for o3 or o4-mini, which would probably top the chart]

grave wyvern Aug 7, 2025, 4:02 AM

#

Finally found the tweet I was thinking about https://x.com/tohuniver/status/1950811691933131185?t=d3CgbQRFZ8U26ongkTuJbQ&s=19

Dayuan Jiang (@tohuniver)

Confirmed that OpenRouter's new stealth model originates from **OpenAI**, identified through the same stream token similarity method used previously.

#

6 days ago, feels like yesterday

uncut salmon Aug 7, 2025, 8:56 AM

#

let's pray for affordable pricing for later today 🙂

marsh folio Aug 7, 2025, 9:08 AM

#

Felt a bit lobotomized yesterday, looking forward to trying the full models and comparing

royal crest Aug 7, 2025, 9:23 AM

#

This is unlimited, IG.

lilac zinc Aug 7, 2025, 9:33 AM

#

mb

spice shell Aug 7, 2025, 11:23 AM

#

#

#

Hmm

#

If Horizon Beta is the free version of GPT 5, very disappointing tbh

#

Hopefully GPT-5-reasoning = Zenith = Perplexity leak = Plus subscription model

#

(If Zenith is GPT 5 Pro, disappointing)

late onyx Aug 7, 2025, 11:25 AM

#

spice shell

Was this release or leak or rumour/prediction?

sand pulsar Aug 7, 2025, 11:25 AM

#

late onyx Was this release or leak or rumour/prediction?

github jumped the gun too early, but then removed the announcement. the first screenshot above is supposedly from there

late onyx Aug 7, 2025, 11:26 AM

#

Is the 10am thing probably gpt5?

sand pulsar Aug 7, 2025, 11:27 AM

#

seems likely, but then again my predictions don't have good track record xD

spice shell Aug 7, 2025, 11:27 AM

#

late onyx Is the 10am thing probably gpt5?

Yes definitely

spice shell Aug 7, 2025, 11:28 AM

#

late onyx Was this release or leak or rumour/prediction?

Leaked

#

From pages being deployed but are hidden, toggled with a feature flag

late onyx Aug 7, 2025, 11:28 AM

#

Is the gpt5 being a router thing still predicted or has that been abandoned?

rare terrace Aug 7, 2025, 1:04 PM

#

late onyx Is the 10am thing probably gpt5?

Im gonna ask you to REALLLYYY look at it

#

copper mountain Aug 7, 2025, 1:05 PM

#

rare terrace

looks like claude 4 haiku to me if you ask me

rare terrace Aug 7, 2025, 1:05 PM

#

No it's gpt oss v2

#

Wasn't safe enough for them

copper mountain Aug 7, 2025, 1:05 PM

#

Deep5eek v4

rare terrace Aug 7, 2025, 1:06 PM

#

Im waiting for deepseek r2 to come out and just annihilate everyone

late onyx Aug 7, 2025, 1:07 PM

#

rare terrace

I know it’s Thursday

rare terrace Aug 7, 2025, 1:08 PM

#

late onyx I know it’s Thursday

Look closely

late onyx Aug 7, 2025, 1:08 PM

#

rare terrace Look closely

Is livestream misspelled? It looks off, but I can’t work out why

rare terrace Aug 7, 2025, 1:08 PM

#

late onyx Is livestream misspelled? It looks off, but I can’t work out why

You're messing with me

#

>:(

late onyx Aug 7, 2025, 1:09 PM

#

rare terrace You're messing with me

?

lament tendon Aug 7, 2025, 1:09 PM

#

late onyx Is livestream misspelled? It looks off, but I can’t work out why

The S in livestream is replaced with a 5

#

Hhinting at GPT 5

late onyx Aug 7, 2025, 1:09 PM

#

lament tendon The S in livestream is replaced with a 5

OHHHHHHH

#

Sorry I genuinely didn’t see that

lament tendon Aug 7, 2025, 1:09 PM

#

All good

rare terrace Aug 7, 2025, 1:09 PM

#

late onyx Sorry I genuinely didn’t see that

🦐

lament tendon Aug 7, 2025, 1:09 PM

#

late onyx Sorry I genuinely didn’t see that

That's me sometimes lol

late onyx Aug 7, 2025, 1:09 PM

#

rare terrace 🦐

5HRIMP

rare terrace Aug 7, 2025, 1:10 PM

#

https://tenor.com/view/shrimp-as-that-clash-royale-hee-hee-hee-haw-gif-25054781

Tenor

paper quiver Aug 7, 2025, 1:10 PM

#

rare terrace Im waiting for deepseek r2 to come out and just annihilate everyone

PeepoNodders

paper quiver Aug 7, 2025, 1:11 PM

#

rare terrace You're messing with me

Omegaroll

late onyx Aug 7, 2025, 1:12 PM

#

Do you think it’s gonna be org verify?

valid zenith Aug 7, 2025, 1:43 PM

#

censored shit :MeguDed:

copper mountain Aug 7, 2025, 2:03 PM

#

rare terrace https://tenor.com/view/shrimp-as-that-clash-royale-hee-hee-hee-haw-gif-25054781

https://tenor.com/view/simple-shrimple-shrimple-meme-its-really-that-shrimple-shrimple-gif-gif-26259870

Tenor

#

(you actually used a gif that's already in my favourites hooooooly)

steep palm Aug 7, 2025, 2:19 PM

#

https://tenor.com/view/its-shrimple-shrimple-shrimp-komrade-katt-wtf-gif-3825286251049222065

Tenor

copper mountain Aug 7, 2025, 2:31 PM

#

steep palm https://tenor.com/view/its-shrimple-shrimple-shrimp-komrade-katt-wtf-gif-3825286...

https://tenor.com/view/its-as-shrimple-as-that-shrimple-clam-grass-jump-gif-25956447

Tenor

spice shell Aug 7, 2025, 3:11 PM

#

rare terrace Im waiting for deepseek r2 to come out and just annihilate everyone

Inb4 DeepSeek just becomes SSI and doesn’t release any more models until AGI

rare terrace Aug 7, 2025, 3:11 PM

#

spice shell Inb4 DeepSeek just becomes SSI and doesn’t release any more models until AGI

Who said it's releasing agi

spice shell Aug 7, 2025, 3:12 PM

#

Shrug

misty scroll Aug 7, 2025, 3:33 PM

#

Note: Horizon Beta will be going offline later today.

Thank you for all the feedback you've shared with us during the testing periods for both Alpha and Beta!

rare terrace Aug 7, 2025, 3:34 PM

#

misty scroll ### Note: Horizon Beta will be going offline later today. Thank you for all the...

Such a surprise

#

Who could have seen that coming

sand pulsar Aug 7, 2025, 3:36 PM

#

misty scroll ### Note: Horizon Beta will be going offline later today. Thank you for all the...

Thank you for partnering up to give us a free new model to test! Always happy to tinker with new stuff <3

idle night Aug 7, 2025, 3:40 PM

#

so it really was claude all along

tender cairn Aug 7, 2025, 3:42 PM

#

idle night so it really was claude all along

No its grok code

sand pulsar Aug 7, 2025, 3:50 PM

#

Would be funny if it was gpt2 and we didn’t notice xD

timid delta Aug 7, 2025, 3:54 PM

#

timing is telling... so it was either GPT-5 nano or mini

spice shell Aug 7, 2025, 4:00 PM

#

timid delta timing is telling... so it was either GPT-5 nano or mini

pepe_yep

thorny hamlet Aug 7, 2025, 4:00 PM

#

Can someone explain to me how openrouter works? the horizon beta is supposedly free and yet i don't have sufficient funds to use the model?

copper mountain Aug 7, 2025, 4:02 PM

#

thorny hamlet Can someone explain to me how openrouter works? the horizon beta is supposedly f...

make sure you're not using web search because that costs credits

thorny hamlet Aug 7, 2025, 4:03 PM

#

copper mountain make sure you're not using web search because that costs credits

I did not realize this, thank you!

copper mountain Aug 7, 2025, 4:08 PM

#

misty scroll ### Note: Horizon Beta will be going offline later today. Thank you for all the...

I'm sure Horizon beta was a variant of claude haiku
https://www.youtube.com/live/0Uu_VJeVVfo

YouTube

OpenAI

Introducing GPT-5

Join Sam Altman, Greg Brockman, Sebastien Bubeck, Mark Chen, Yann Dubois, Brian Fioca, Adi Ganesh, Oliver Godement, Saachi Jain, Christina Kaplan, Tina Kim, ...

▶ Play video

lament tendon Aug 7, 2025, 5:17 PM

#

{"success":false,"errorMessage":"The alpha period for this model has ended. For other stealth models, please visit https://openrouter.ai/provider/stealth"}

#

The period ended 5 seconds ago

spice shell Aug 7, 2025, 5:23 PM

#

uhoh I'm afraid that horizon might've been GPT 5 (non-reasoning)

#

if Zenith was GPT 5 Pro

#

because Summit is GPT 5

fathom atlas Aug 7, 2025, 5:25 PM

#

spice shell uhoh I'm afraid that horizon might've been GPT 5 (non-reasoning)

We're thinking its actually gpt-5 mini

#

because the demo they did took like 2 minutes to make 400 lines of code

spice shell Aug 7, 2025, 5:25 PM

#

that's what I've been thinking

fathom atlas Aug 7, 2025, 5:25 PM

#

horizon can do that in like 10 seconds

spice shell Aug 7, 2025, 5:25 PM

#

oo true

#

nvm maybe saved

fathom atlas Aug 7, 2025, 5:25 PM

#

Yeah

#

if it is mini

#

then claude 4 sonnet is cooked imo

spice shell Aug 7, 2025, 5:26 PM

#

maybe

fathom atlas Aug 7, 2025, 5:26 PM

#

fathom atlas then claude 4 sonnet is cooked imo

(if you know what you're doing)

latent forum Aug 7, 2025, 5:48 PM

#

How good did Horizon do with storytelling/RP

wide lynx Aug 7, 2025, 5:54 PM

#

this model is awesome

#

i want more

rare terrace Aug 7, 2025, 5:54 PM

#

wide lynx i want more

It has just been removed

bitter vigil Aug 7, 2025, 5:55 PM

#

latent forum How good did Horizon do with storytelling/RP

Very good writer

rare terrace Aug 7, 2025, 5:55 PM

#

rare terrace It has just been removed

It is gpt 5

bitter vigil Aug 7, 2025, 5:55 PM

#

But probably v3 level plot tracking

#

So I heard

#

Yeah so is it 5 full on minimal reasoning or 5 mini

wide lynx Aug 7, 2025, 5:57 PM

#

i need this

#

how can we obtain it

#

payment

inland flame Aug 7, 2025, 5:58 PM

#

It was a nice ride fellas!

rough relic Aug 7, 2025, 5:58 PM

#

I think its gpt pro atm

rough relic Aug 7, 2025, 5:58 PM

#

wide lynx how can we obtain it

.

copper mountain Aug 7, 2025, 5:58 PM

#

wide lynx how can we obtain it

wait for after the openai livestream

#

then we will know more

wide lynx Aug 7, 2025, 5:59 PM

#

can i share what i have created here ?

#

i have to finish

spice shell Aug 7, 2025, 6:07 PM

#

rare terrace It is gpt 5

gpt 5 or gpt 5 mini

rare terrace Aug 7, 2025, 6:08 PM

#

Gpt come here

#

@misty scroll did they tell you how long till they stop talking

misty scroll Aug 7, 2025, 6:09 PM

#

yeah they said this is actually a 24/7 stream that is never gonna end

rare terrace Aug 7, 2025, 6:09 PM

#

:(

#

Starting to feel like it

copper mountain Aug 7, 2025, 6:11 PM

#

misty scroll yeah they said this is actually a 24/7 stream that is never gonna end

did you just breach your contract by typing a message here? uh oh

rare terrace Aug 7, 2025, 6:11 PM

#

All cursor users ok what about chatgpt users

#

Finally i can synthesize my own cocaine-meth hybrid

#

With the hemp of gpt 5

#

Oh wait its a horizon model thread lol

low halo Aug 7, 2025, 6:24 PM

#

GPT-5 and GPT-5 mini have different knowledge cutoff.
2024-10 matches GPT-5
https://platform.openai.com/docs/models/gpt-5
https://platform.openai.com/docs/models/gpt-5-mini

latent forum Aug 7, 2025, 6:24 PM

#

was it like claude/gemini level

steep palm Aug 7, 2025, 6:25 PM

#

Tested GPT-5's creative writing via Poe, and I'm fully convinced Horizon Beta was 5. One of the variants, anyway.

#

https://poe.com/s/bp9Gr1TBKdd8vdrXWcvK

Write the opening passage to a gritty spy novel.

GPT-5: The chalk mark on the drainpipe was wrong. It leaned left, a careless stroke, too fat. Ours tilt right, narrow and neat, the way a patient hand writes. It looked like nothing to anyone else, a

uncut salmon Aug 7, 2025, 6:25 PM

#

Okay, so what is the commercial replacement?

steep palm Aug 7, 2025, 6:25 PM

#

Very similar to results I had from Horizon

latent forum Aug 7, 2025, 6:26 PM

#

steep palm Very similar to results I had from Horizon

would you say its the new king

spice shell Aug 7, 2025, 6:28 PM

#

Nice I think this is the mini model in fact

long sable Aug 7, 2025, 6:29 PM

#

can we get official confirmation on which exact models were these?

spice shell Aug 7, 2025, 6:30 PM

#

perhaps even nano

steep palm Aug 7, 2025, 6:32 PM

#

https://poe.com/s/DvlPJkfkC9TovRFEQba3 Here's the same prompt but with Mini. Much shorter output. I think Horizon might have been full GPT-5. I also received long outputs like that with Horizon

Write the opening passage to a gritty spy novel.

GPT-5-mini: By the time the city started to rain, I had already put one life in a cardboard box and left it on the ledge of a cheap hotel window where pigeons could take the rest. The rain didn't clea

harsh shore Aug 7, 2025, 6:33 PM

#

They would NOT waste time test running a mini model. It's full GPT5.

spice shell Aug 7, 2025, 6:35 PM

#

don't think so

harsh shore Aug 7, 2025, 6:36 PM

#

#

Look at where it's placed with min thinking.

copper mountain Aug 7, 2025, 6:37 PM

#

spice shell don't think so

it’s gpt 5

#

#1402662665599324180 message

spice shell Aug 7, 2025, 6:38 PM

#

copper mountain it’s gpt 5

family of models

copper mountain Aug 7, 2025, 6:39 PM

#

hm well let’s wait for some more clarification then

past sphinx Aug 7, 2025, 6:39 PM

#

#announcements message

steep palm Aug 7, 2025, 6:39 PM

#

latent forum would you say its the new king

Too early to tell, but I really do like it so far

fathom atlas Aug 7, 2025, 6:39 PM

#

spice shell **family** of models

So they ran GPT 5, GPT 5 Mini and nano through the horizon model?

#

so confusing lmao

spice shell Aug 7, 2025, 6:40 PM

#

past sphinx https://discord.com/channels/1091220969173028894/1092729520181739581/14030850511...

but was it mini or full :(

#

are you allowed to say

#

lol

steep palm Aug 7, 2025, 6:40 PM

#

"Replaces Horizon Alpha and Beta stealth models (early checkpoints in the GPT-5 family)"

From the announcement just now

copper mountain Aug 7, 2025, 6:40 PM

#

maybe all of them lol

bitter vigil Aug 7, 2025, 6:40 PM

#

harsh shore

does anybody take these guys seriously?

haughty monolith Aug 7, 2025, 6:41 PM

#

so it's cost MONEY now

bitter vigil Aug 7, 2025, 6:41 PM

#

artificial analysis is up ther with livebench and lm arena for benches to take with grain of salt

harsh shore Aug 7, 2025, 6:41 PM

#

bitter vigil does anybody take these guys seriously?

It's official numbers from the labs, they just put those together.

bitter vigil Aug 7, 2025, 6:41 PM

#

harsh shore It's official numbers from the labs, they just put those together.

that explains it, all the fuzzed benchmarks get used

harsh shore Aug 7, 2025, 6:43 PM

#

Well, if OpenAI thinks those numbers flatter the model at its best, imagine how it'd be otherwise.

proud zinc Aug 7, 2025, 6:46 PM

#

told y'all

safe imp Aug 7, 2025, 6:47 PM

#

(It was wrong, btw)

solid star Aug 7, 2025, 6:47 PM

#

Omni

heady gust Aug 7, 2025, 6:48 PM

#

It's crazy how quickly all my expectations were demolished in the most negative way possible lol

#

If it had been the open 120B model it would have been solid, if it was 5 Mini or 5 Nano it would have been sobering but at least something, but this being their best of the best, even if it didn't have reasoning for the majority of the time is... wow

long sable Aug 7, 2025, 6:54 PM

#

past sphinx https://discord.com/channels/1091220969173028894/1092729520181739581/14030850511...

why didnt yall give us full reasoning control?

spice shell Aug 7, 2025, 6:54 PM

#

nvm I think Horizon Beta was in fact GPT 5 minimal reasoning I guess

#

based on knowledge tests, it's the only one which knew some that Horizon got

proud zinc Aug 7, 2025, 7:01 PM

#

Reasoning seems to provide huge jumps on the benches, so I'll withhold judgment. As is, seems like a decent upgrade to 4o but not amazing. Cost for an Opus-like model is great, but hard to beat Claude Max on that front if Anthropic keeps eating those costs

solid star Aug 7, 2025, 7:07 PM

#

If beta was gpt 5 minimal reasoning, I'd assume it's smaller than Claude 4 sonnet, Kimi, and even deepseek (R1 zero, oddly more knowledgeable than r1). For random niche knowledge tests. Gpt 4o seems more knowleable too. So this is like nothing like gpt 4.5 which was massive.

bitter estuary Aug 7, 2025, 7:07 PM

#

RIP, was finally about to run my benchmark on this model and it's gone 🙃

visual egret Aug 7, 2025, 7:08 PM

#

GPT-5
Look inside
GPT-4o “think hard!”

fringe bay Aug 7, 2025, 7:10 PM

#

harsh shore

why is opus missing in the chart? also, what is high, medium and low?

hasty swallow Aug 7, 2025, 7:33 PM

#

Horizon beta was working fine earlier today but now i am facing this error

Error during Horizon call: Client error '404 Not Found' for url 'https://openrouter.ai/api/v1/chat/completions'

For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/404

MDN Web Docs

404 Not Found - HTTP | MDN

The HTTP 404 Not Found client error response status code indicates that the server cannot find the requested resource.
Links that lead to a 404 page are often called broken or dead links and can be subject to link rot.

copper mountain Aug 7, 2025, 7:33 PM

#

hasty swallow Horizon beta was working fine earlier today but now i am facing this error Err...

horizon beta doesn't exist anymore

hasty swallow Aug 7, 2025, 7:34 PM

#

copper mountain horizon beta doesn't exist anymore

Oh so i cannot use it is there any alternative i am doing my collage project and want a llm api that is affordable and can return good code

copper mountain Aug 7, 2025, 7:35 PM

#

hasty swallow Oh so i cannot use it is there any alternative i am doing my collage project and...

well.. gpt 5, claude 4 sonnet, gemini 2.5 pro

night urchin Aug 7, 2025, 7:35 PM

#

hasty swallow Oh so i cannot use it is there any alternative i am doing my collage project and...

you can check GLM Air

night urchin Aug 7, 2025, 7:35 PM

#

copper mountain well.. gpt 5, claude 4 sonnet, gemini 2.5 pro

they probably want a free one

copper mountain Aug 7, 2025, 7:35 PM

#

kimi k2 was pretty good also right

safe imp Aug 7, 2025, 7:35 PM

#

Qwen3 235A22B can be decent too

hasty swallow Aug 7, 2025, 7:36 PM

#

Okay i will try the cheapest of them😅

fathom atlas Aug 7, 2025, 7:52 PM

#

Horizon Beta was GPT-5, not Mini, Not Nano.

bitter vigil Aug 7, 2025, 7:53 PM

#

fathom atlas Horizon Beta was GPT-5, not Mini, Not Nano.

called it

solid star Aug 7, 2025, 7:54 PM

#

hasty swallow Okay i will try the cheapest of them😅

Gemini. 2.5 pro is free on aistudio

bitter vigil Aug 7, 2025, 7:55 PM

#

#1400979315373375581 message
🤣

hasty swallow Aug 7, 2025, 7:58 PM

#

solid star Gemini. 2.5 pro is free on aistudio

In my use case i have tested both gemini 2.5. And open ai 4o-nano and both of them are bad when i tested horizon today i was impressed but its no longer available i might have to try others like deepseek or GLM . Free ones or cheap ones

bitter vigil Aug 7, 2025, 8:05 PM

#

steep palm https://poe.com/s/bp9Gr1TBKdd8vdrXWcvK

what the?

#

we must be on a different gpt 5? lol

steep palm Aug 7, 2025, 8:06 PM

#

bitter vigil what the?

Just responded to you in the other thread, that does look like gpt5-chat

bitter vigil Aug 7, 2025, 8:06 PM

#

steep palm Just responded to you in the other thread, that does look like gpt5-chat

is gpt5 api only?

rich harness Aug 7, 2025, 8:06 PM

#

what is the consensus here?
do you guys think Horizon Beta was GPT-5 mini or GPT-5 main

steep palm Aug 7, 2025, 8:06 PM

#

bitter vigil is gpt5 api only?

You can access it in openrouter chat

rich harness Aug 7, 2025, 8:07 PM

#

rich harness what is the consensus here? do you guys think Horizon Beta was GPT-5 mini or GPT...

would love if got response to this even from any team member as it has been taken down now

steep palm Aug 7, 2025, 8:07 PM

#

rich harness what is the consensus here? do you guys think Horizon Beta was GPT-5 mini or GPT...

I think main, but without thinking enabled, based on comparing outputs

rich harness Aug 7, 2025, 8:07 PM

#

right

#

my testing confirms that too

bitter vigil Aug 7, 2025, 8:08 PM

#

steep palm You can access it in openrouter chat

ahhh now this is it

heady gust Aug 7, 2025, 9:18 PM

#

Yeah you can probably safely assume this is GPT-5 full

late onyx Aug 7, 2025, 9:20 PM

#

Guys so was Horizon Beta a new Gemini model?

wraith tusk Aug 7, 2025, 9:22 PM

#

late onyx Guys so was Horizon Beta a new Gemini model?

Nope it’s definitely not a Gemini model.

high falcon Aug 7, 2025, 9:23 PM

#

late onyx Guys so was Horizon Beta a new Gemini model?

#1400979315373375581 message

tame nebula Aug 7, 2025, 9:28 PM

#

latent forum How good did Horizon do with storytelling/RP

Excellent for worldbuilding and theorycrafting

high falcon Aug 7, 2025, 9:42 PM

#

tame nebula Excellent for worldbuilding and theorycrafting

I'm so sad it's gone

tame nebula Aug 7, 2025, 9:44 PM

#

high falcon I'm so sad it's gone

Easily the best for my use cases yeah

rare terrace Aug 7, 2025, 10:19 PM

#

tame nebula Easily the best for my use cases yeah

It's gpt5

tame nebula Aug 7, 2025, 10:19 PM

#

rare terrace It's gpt5

I'm aware

#

I don't have the moneyflow to fund the same experiments with the priced version

hasty swallow Aug 8, 2025, 4:25 AM

#

I tried deepseek free model its is taking a lot time to respond same issue with gemini 2.5pro.
I basically want a model which is good at writing code and also i need the output fast i tried horizon beta and it was both great for both requirement now it isnt available i am looking for another alternative. Do you know any alternative.
Also i am looking for a cheap option..

worn cosmos Aug 8, 2025, 4:49 AM

#

Gpt 5 mini

summer root Aug 8, 2025, 4:54 AM

#

DeepSeek V3 0324, Gemini 2.0 Flash, Qwen3 Coder, GLM 4.5 Air are all decent and very very cheap. If you're doing anything that's even slightly important, like a project, you'll get vastly better results in both quality and reliability by spending $1 on it

hasty swallow Aug 8, 2025, 5:05 AM

#

summer root DeepSeek V3 0324, Gemini 2.0 Flash, Qwen3 Coder, GLM 4.5 Air are all decent and ...

Ok

high falcon Aug 8, 2025, 3:40 PM

#

Using GPT-5 yesterday and it’s nothing like Horizon Beta was 😥 it was the best LLM I’ve interacted with to date.

inland flame Aug 8, 2025, 7:23 PM

#

well theres many versions of GPT-5 tbf

#

~~I thought someone already debunked the idea of horizon coming from OpenAi because of differing tokenization (matches more with qwen)~~

I just learned how to read the announcements 😅

steep palm Aug 8, 2025, 7:26 PM

#

inland flame Aug 8, 2025, 7:27 PM

#

steep palm

I immediately take back what i said, it seems like i am living under a rock 💀

inland flame Aug 8, 2025, 7:29 PM

#

inland flame well theres many versions of GPT-5 tbf

inland flame Aug 8, 2025, 7:34 PM

#

high falcon Using GPT-5 yesterday and it’s nothing like Horizon Beta was 😥 it was the best ...

horizon was not a thinking model, so perhaps you may get something similar to that if you used one of their non-thinking versions. How are you using GPT 5 now?

high falcon Aug 8, 2025, 7:38 PM

#

inland flame horizon was not a thinking model, so perhaps you may get something similar to th...

I'm using it for business plans currently. But every application, I've enjoyed Horizon Beta the most. Regular chat, business plans, research, coding. Most of the time I use DeepSeek because I prefer it to ChatGPT. Something happens to the model when they go from "stealth test" to release on OpenAI's platform.

inland flame Aug 8, 2025, 7:39 PM

#

is there like a chat app that u used for horizon?

high falcon Aug 8, 2025, 7:39 PM

#

This morning alone I had to tell ChatGPT to permanently delete information from chat because it kept regurgitating information that I already told it I did not want.

#

No I was using OpenRouter

inland flame Aug 8, 2025, 7:40 PM

#

because if you can find a way to tweak GPT-5 main to disable thinking, i believe you may be able to achieve close to horizon results

high falcon Aug 8, 2025, 7:42 PM

#

That is probably the issue. But I don't currently know a way to disable it on ChatGPT. Also I'm on a free plan, so it selects the model automatically now.

inland flame Aug 8, 2025, 7:48 PM

#

#

best we can do to disable thinking is by setting reasoning effort to minimal.

GPT chat doesn't give these options, so your best bet would be to find a chat provider that allows these parameters to be adjusted.

#

#

it also seems like GPT-5 and GPT-5 Chat are treated as different models by OpenAI, the GPT you were most likely chatting with was the Chat version, which is most likely different from the cool API one that we were having fun with.

im also looking to get that Horizon feel back!

steep palm Aug 8, 2025, 7:58 PM

#

The new GPT5 models also have verbosity parameters that can be set (from low to high) as well, and we don't know which Horizon was set to, so it's worth playing with those via api, to see if you can recapture the horizon feel

#

We don't know what Horizon was set to, and i'm not sure what gpt5 defaults to if you don't specify

high falcon Aug 8, 2025, 8:29 PM

#

I'm going to be honest, if I set parameters for how I want GPT5 to respond, I don't need it to tell me it understands me then when I ask it for 10 more of the same thing it completely changes the format. Separation of one message and it completely loses its mind. It's worse than when ChatGPT very first released, personally.

uncut salmon Aug 8, 2025, 8:52 PM

#

Someone needs to communicate that with OpenAI. Horizon Beta was amazing

fathom atlas Aug 8, 2025, 10:10 PM

#

steep palm We don't know what Horizon was set to, and i'm not sure what gpt5 defaults to if...

Horzion Beta was GPT-5 model with "Minimal" thinking

high falcon Aug 8, 2025, 10:24 PM

#

fathom atlas Horzion Beta was GPT-5 model with "Minimal" thinking

Thank you 🙏

safe imp Aug 8, 2025, 11:48 PM

#

Are you sure?

#

See e.g.

#

According to Artificial Analysis, GPT-5 Minimal scores 67

bitter vigil Aug 9, 2025, 4:28 AM

#

Dropping gpt 5 into one if my role-playing prompts thst works for literally every other model.. it spat out a huge reply of repetitive repeating structure where all other models do like a paragraph

#

Not impressed

#

"Hi"
4 paragraphs of nonsense

#

I don't think it's rl'd on rp

high falcon Aug 9, 2025, 4:30 AM

#

I’m over it. I wasted all my free tokens twice today just trying to coach it on what I wanted.

bitter vigil Aug 9, 2025, 4:30 AM

#

I think the reasoning fks it up

high falcon Aug 9, 2025, 4:31 AM

#

It’s definitely not doing what it’s supposed to. Separation of one message and it completely forgets what I instructed it to do.

#

Doesn’t help that I’m using the free version, so I can’t manipulate the model or thinking. It just uses whatever it thinks is best

undone cypress Aug 9, 2025, 4:38 AM

#

Gpt 5 has a free version?

wide lynx Aug 9, 2025, 6:43 PM

#

Horizon Beta was indeed amazing. GPT-5 models are unusable pretty much in roocode (maybe if you can stay under 30k context)

high falcon Aug 9, 2025, 8:40 PM

#

undone cypress Gpt 5 has a free version?

In OAI free users get a certain daily allowance of GPT-5 usage. If you are using OpenRouter, no.

brittle barn Aug 10, 2025, 7:19 PM

#

wide lynx Horizon Beta was indeed amazing. GPT-5 models are unusable pretty much in roocod...

Why was horizon so much better than gpt5. I don't understand how this is even possible lol

wide lynx Aug 10, 2025, 7:30 PM

#

I think it's because i have the 30k token limit on it and i cannot do much

#

@brittle barn

brittle barn Aug 10, 2025, 7:37 PM

#

ah is that a thing?

dreamy arch Aug 10, 2025, 8:24 PM

#

The model ID (openrouter/horizon-beta) you provided is not available. Please choose a different model.

Why i got this error?

safe imp Aug 10, 2025, 8:29 PM

#

OpenAI removed this model, it was an early checkpoint in the GPT-5 family

lament tendon Oct 16, 2025, 11:09 AM

#

last

crystal scaffold Oct 16, 2025, 11:11 AM

#

Last

lament tendon Oct 16, 2025, 11:18 AM

#

I forgot replying to a thread pushes that thread to the top

#

Not laughing at you btw, just laughing at the situation. I thought I could get away

late onyx Oct 16, 2025, 2:03 PM

#

not last

cedar junco Oct 17, 2025, 10:52 AM

#

Horizon beta it's gone, guys? Nooo... 😭 I see it on openrouter but I've never used it before!

brittle barn Oct 28, 2025, 5:30 PM

#

cedar junco Horizon beta it's gone, guys? Nooo... 😭 I see it on openrouter but I've never ...

I think it was GPT5

#Horizon Beta

Note: Horizon Beta will be going offline later today.