o1-pro | OpenRouter | Page 1

dusk juniper Mar 19, 2025, 10:34 PM

#

https://platform.openai.com/docs/models/o1-pro

#

Input
$150.00/mtok

Output
$600.00/mtok

#

I am a little surprised though. I was under the impression that "more compute" just meant that it reasoned for a lot longer but maybe this means that it's doing some tree search/parallel generation?

distant smelt Mar 19, 2025, 10:47 PM

#

Lool, Idk why they even launch this stuff API-wise

rare crystal Mar 19, 2025, 10:50 PM

#

it's also Responses api only

frank seal Mar 20, 2025, 12:36 AM

#

dusk juniper I am a little surprised though. I was under the impression that "more compute" j...

It was the most discussed thing when they released Pro subscription
Apparently they are doing some best of N or consensus
Hence really high costs

I am just surprised they released this while o3 is supposed is just around the corner (late considering december announcement) and it beats this without using parallel generation

lethal path Mar 20, 2025, 4:47 AM

#

it is suprising

#

but why not?

jaunty echo Mar 20, 2025, 7:52 AM

#

rare crystal it's also Responses api only

Yeah what about that? https://discord.com/channels/1091220969173028894/1350796020556238939

silver anvil Mar 20, 2025, 7:56 AM

#

dusk juniper **Input** $150.00/mtok **Output** $600.00/mtok

cheap

limber cove Mar 20, 2025, 8:32 AM

#

dusk juniper I am a little surprised though. I was under the impression that "more compute" j...

yah - i'm pretty sure this is it - there was a great talk posted on the AI Engineer Youtube channel yesterday from Ramp, and he talked about for some of their workflows, they just run the completion 50x, and while most of the time for this particularly tough problem individual completions fail, if you run 50x in parallel, you almost always get the correct solution ...

clearly this only works with verifiable domains, but for many of our workflows that is the case, and if you could run search where the consensus isn't just based on LLM-as-Judge but say has access to a code execution tool to verify, that would exponentially increase success rates for many difficult problems

dreamy pawn Mar 20, 2025, 8:58 AM

#

Someone let me try a few question on o1-pro via the subscription, and honestly imo it's not different to the normal o1. But I tested only very few coding questions, so not sure how good of a picture I got from the model.

limber cove Mar 20, 2025, 9:17 AM

#

yes it's not any better for things that o1 or 3.7 reasoning can solve for example - but whenever i hit something that claude 3.7 can't solve, i've seen o1 pro mode be able to solve a good portion of them

of course YMMV, and in my specific use cases these have mostly been extremely complex typescript generics related issues where claude 3.7 would fix the issue, but create another issue, and it just keeps going in whackamole loops ... it's this type of problem that i've seen o1 pro do very well with

lethal path Mar 20, 2025, 3:50 PM

#

some weeks ago I was using 3.7, r1 and o3-mini for some complex C/C++ simd optimization problems. right now I don't have anything really difficult to throw at it

#

I want to be surprised by it

earnest pond Mar 20, 2025, 4:42 PM

#

would you pay that?

digital merlin Mar 20, 2025, 5:40 PM

#

can't wait to bench its chess capability /s

hexed gazelle Mar 20, 2025, 7:17 PM

#

No one can prove your models aren't getting better if they can't afford to benchmark? 🤔 🤔

lilac bolt Mar 20, 2025, 8:29 PM

#

Hello Guys!
I have problems implementing 01-pro, Has this model been dropped?

rare crystal Mar 20, 2025, 8:30 PM

#

lilac bolt Hello Guys! I have problems implementing 01-pro, Has this model been dropped?

We don't support it yet

#

working on it though

lilac bolt Mar 20, 2025, 9:05 PM

#

Does anyone know how to check O1 Pro availability? It seems to be down.

lunar helm Mar 22, 2025, 12:54 AM

#

has anyone done a search with this model yet, curious about the cost lol

barren oak Mar 22, 2025, 3:10 AM

#

it seems no-one want to use this model 🤣

placid siren Mar 22, 2025, 3:45 AM

#

good lord that is some eyewatering cost

#

amazing tech demo

hallow axle Mar 22, 2025, 1:18 PM

#

I got impressive result in brainstorming with GPT-4.5 a few days ago. Gonna try my luck with o1 Pro then

hallow axle Mar 22, 2025, 3:27 PM

#

After testing, I honestly think GPT-4.5 is better in brainstorming 😂

lunar python Mar 23, 2025, 3:34 PM

#

For that pricing I expected o3 at least

obsidian willow Jun 6, 2025, 9:38 PM

#

@rare crystal FYI I think that o1-pro is down.. if I send a message I get the attched.

Note: in my requests I leave the max_tokens blank, so I'm thinking it's somehow defaulting to 100000 and really not liking it, but it shouldn't be from my app, maybe something on your side that default that high? the uptime is showing 0%, so may well be that it's simply down

rare crystal Jun 6, 2025, 9:40 PM

#

obsidian willow <@165587622243074048> FYI I think that o1-pro is down.. if I send a message I ge...

N/A uptime means we don't have enough data to evaluate, not that it's down. for this error message, you either need to buy more credits or set a lower max_tokens. What's happening is that we are estimating the cost of a request, and if it does use all the max output (100k), then you can't afford it

#

so you do need to set a max_tokens on your end, or buy more credits.

#o1-pro