#general
Sure, I use both Gemini and Claude a lot
guess where gpt5 will be. I think 66%
78%
Nah guys not these rational benchmarks. I want your personal experiences
Its at 90% rn
90%up
70 percent
To not let people say they are ClosedAI they make the garbage open source
altman is shady guy... i was 100% sure that OSS models were going to be bad
No way
if you think gpt5 is getting 90% on simplebench u are delusional ngl
if you don't care about what you're typing then probably just using lmarena for unlimited use
*privacy
either high 50 or low 60 im guessing
No they dont
idk if lmarena is on thinking high
prob not
idk if its on any thinking tbh
yo anyone found a documentation page for gpt 5?
40.7%
Bro I 😭 that thing is so expensive literally 1 msg with o3 and you're out of tokens.
Guys, it is henceforth Declared that GPT-5 shall score 70% on Simple-bench (Decided by you)
it took more time to answer than gemini 2.5 pro, so probably?
extreme traffic
Gpt 5 thinking vs grok 4 thinking
next month
yeah we're not really sure about that
My friend tried it. It really overthinks about safety in its CoT
do we have any hexagon or any prompts you guys can give me to test? i have the highest gpt 5 model and no idea what to test
fake
I tried 20b locally and 80% of its CoT is about safety
GPT five is so close to winning against Gemini
mixtral and again mixtral...
gemini 3:
it's free, i'm using it now
Gemini three hasn’t released yet
reasoning effort high
Limited
Points
You have 3k points, each message costs 250~
don't know about points, have never used that site
2029
Its good to test it out for 10 messages, but the daily 3k points is too little to do anything
2035+
no good improvements since gemini 2.5 pro tbh
yeah
ah ok
Hopefully it'll replace the standard GPT-4.1 model in Github Copilot, standard models are unlimited under Pro plan.
Let's see what does it do
I like how no one says this year lol
ah another irodov follower
probably not in my lifetime, for true AGI
yeahh
It's important for my exam
apparently the gpt 5 high in the interface is better than the gpt 5 on arena
hehe same
Jee advanced
11th or 12th grader?
12th
same
The tangential acceleration equals the time derivative of the speed:
dv/dt = w_t.
Here w_t is the projection of the constant vector a (directed along +x) onto the unit tangent τ to the trajectory:
dv/dt = a · τ.
Since a = a i (i is the x-unit vector), a · τ = a τ_x, where τ_x = dx/ds. Also
dx/dt = v τ_x.
Hence
dv/dx = (dv/dt)/(dx/dt) = (a τ_x)/(v τ_x) = a/v.
Integrate:
v dv = a dx → (1/2) v^2 = a x + C.
With v ≈ 0 at x = 0, C = 0, so
v(x) = sqrt(2 a x).
Thus the speed depends only on x (not on the path shape): v^2 = 2 a x for motion in the +x direction.
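A quick numerical check of the result above, as a rough sketch only. The parabolic path y = x², the value a = 2, and the step size are arbitrary assumptions for illustration; none of them come from the problem.

```python
import math

def simulate(a=2.0, t_end=2.0, dt=1e-4, slope=lambda x: 2.0 * x):
    """Integrate dv/dt = a*tau_x and dx/dt = v*tau_x along y = g(x), with slope(x) = g'(x)."""
    x, v = 0.0, 0.0
    for _ in range(int(t_end / dt)):
        tau_x = 1.0 / math.sqrt(1.0 + slope(x) ** 2)  # x-component of the unit tangent
        v += a * tau_x * dt   # tangential acceleration a . tau
        x += v * tau_x * dt   # advance along x
    return x, v

x, v = simulate()
print(v ** 2, 2 * 2.0 * x)  # the two numbers should nearly agree
```

Swapping in a different `slope` (for example `lambda x: math.cos(x)`) should leave v² ≈ 2ax unchanged, which is the path-independence claimed above.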
how does gpt-5 perform on numerical math? like pi^(pi+e+1)?
is gpt5 available on lm arena?
Amazing because of tool use
yes
most models without a python or math module backup cannot do that
Yes
Nvm i see it now
so can we trust gpt-5 for college level and below math and hope it will never get basic algebra wrong?
when is it even gonna be available in chat.openai.com
I cannot trust gemini for that sht
GPT five is like Gemini 2.5 pro but faster can’t wait for Gemini three though
0.5+ 0.05+0.018
does it reach 0.6
?
I've given up on that no ai does that properly. The only ai I haven't tried are the 200$-250$ ones like gemini deepthink and o3 pro mode
holy crap, the output size is indeed solid
AIME scores indicate yes
my take is just ask it to solve with python script and run that script
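For reference, the kind of script meant here could be as small as the sketch below. It evaluates the pi^(pi+e+1) example from earlier in the chat with Python's standard `math` module, in ordinary double precision (roughly 15 significant digits), and cross-checks the direct power against the exp/log form.

```python
import math

# pi^(pi + e + 1), evaluated two ways as a sanity check
exponent = math.pi + math.e + 1
direct = math.pi ** exponent
via_log = math.exp(exponent * math.log(math.pi))
print(direct, via_log)
```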
For common-sense reasoning, here's one from Simple Bench all models get wrong:
A luxury sports-car is traveling north at 30km/h over a roadbridge, 250m long, which runs over a river that is flowing at 5km/h eastward. The wind is blowing at 1km/h westward, slow enough not to bother the pedestrians snapping photos of the car from both sides of the roadbridge as the car passes. A glove was stored in the trunk of the car, but slips out of a hole and drops out when the car is half-way over the bridge. Assume the car continues in the same direction at the same speed, and the wind and river continue to move as stated. 1 hour later, the water-proof glove is (relative to the center of the bridge) approximately?
A) 4 km eastward
B) < 1 km northward
C) > 30 km away north-westerly
D) 30 km northward
E) > 30 km away north-easterly
F) 5 km+ eastward
never seen an OAI output 1300 lines of code without resistance
nah 2.5 pro works fine for me
Ill use high reasoning for this no tools
The high version is a pure trash at coding
Send this a simple question so even you can understand the answer.
Gemini 2.5 pro crashed
In thinking
Ok, but the solution is actually very simple (for a human) and doesn't require much reasoning.
bro the lmarena gpt 5 says its gpt 4
how about chess? does gpt-5 still suck at 900 elo 🤣
is it quadrant 2
Guys, "OpenAI PRs (no browsing)" Has hit a wall. ALL the models are stuck at 44 percent including o3, ChatGPT agent and GPT-5
What is gpt-oss? Its on arena
No, GPT5 is cheaper than GPT4o
Will we see gpt-5 with reasoning on leaderboard ?
playing game with gpt 5 high vs opus 4.1
so far stockfish thinks opus is winning
ass
yeah samee
gpt 5 live on yupp ai
les goooooooo
Open source OpenAI model that just got released. There's 20B and 120B variant.
Api is live
what is the answer?
@hollow imp
the model seems to be blowing every other OAI model out of the water when it comes to UI though
or well, css
@hollow imp From 0 to 1: (2/√3)(arctan(√3) − arctan(1/√3)) = (2/√3)(π/3 − π/6) = π/(3√3).
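A numerical cross-check of that closed form, as a sketch. The integrand is not quoted in the chat; 1/(x² + x + 1) is an assumption inferred from the antiderivative (2/√3)·arctan((2x+1)/√3), which gives arctan(√3) at x = 1 and arctan(1/√3) at x = 0.

```python
import math

def simpson(f, lo, hi, n=1000):
    """Composite Simpson's rule with n (even) subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

# Assumed integrand, inferred from the arctan antiderivative: 1 / (x^2 + x + 1)
numeric = simpson(lambda x: 1.0 / (x * x + x + 1.0), 0.0, 1.0)
closed_form = math.pi / (3 * math.sqrt(3))
print(numeric, closed_form)
```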
Who doesn't love seeing this
Let's give it actual jee advanced pyq
yeah
bro jee adv maths is ezz
physics + chem too hard
has anyone benchmarked it yet?
in the process
@void elm If it can solve this then it passed
i have my q bank
i can solve this lol
i have some hard diagram geometry problems (creative thinking not complex formulae)
only grok has solved most
it didnt get 2-3
@keen beacon do you have telegram
is gpt 5 still not on the app?
This wrong btw
i can't see it on selection
This def wrong
is the 74.9% swe bench true? for gpt5
Answer: A) 4 km eastward
Reasoning:
The glove drops at the midpoint of the bridge and (being waterproof) floats on the river.
Once on the water, it’s carried mainly by the river’s current: 5 km/h east.
The light 1 km/h west wind may slow that drift slightly. A simple approximation is 5 − 1 = 4 km/h east.
Any northward motion from the car (30 km/h) is lost almost immediately upon hitting the water; the north offset would be only a few meters—negligible after an hour.
So after 1 hour, it’s roughly 4 km east of the bridge’s center.
this is what the lmarena one said
@keen beacon why is this question so hard that even AIs and aspirants both get it wrong
How is GPT-5 performing at maths and physics thus far?
selector broke again?
Worse than other models???
No
Quick question: can anyone confirm whether GPT-5 started rolling out before the livestream ended?
yes it did
Can you explain? Does it perform better than others but still bad
it was out even before it started
@keen beacon what do I say
it's better than others, it's only struggling on a few questions
cannot confirm, haven't seen it land on my apps
use VPN hehe
ye
works for me
not working
guys when will gpt 5 be available in chatgpt ?
🙄
BRUH
third and fourth quadrants ??
❌
guys when will gpt 5 be available in chatgpt ?
Why gemini becomes dumb these days?
guys when will gpt 5 be available in chatgpt ?
2035
its 2nd quadrant -x flips the function to the left side
Introducing GPT-5
Our best AI system yet, rolling out to all ChatGPT users and developers starting today.
It messed up this simple question
I used to get nice code, but for the past few days it's been making mistakes
No bru
3rd quadrant then? -x means left side right
pineapple says that the LMArena team made adjustments and soon (tm) it will be seen, though I don't know when this "soon" will happen
where is gpt 6
launching on monday
gpt 6 in 2077
Me too, I can't find it
LOL
In the arrangement shown in figure a weight A possesses mass m = 4 kg, a pulley B possesses mass M = 2 kg. Also known are the moment of inertia I = 2 kg·m² of the pulley relative to its axis and the radii of the pulley R = 1 m and 2R. The mass of the threads is negligible. Find the acceleration of the weight A after the system is set free, taking acceleration due to gravity equal to 9.81 m/s².
Looking into
The answer is: idk 🫡🫡🫡🫡
@keen beacon is this correct?
Asking for its architecture on LMArena gives GPT 4 while asking on Poe gives GPT 5
Idk who to trust
yeah its correct 3.5 (for g = 10)
lmarena has no system prompt (api based)
while poe may have a system prompt
We are never agi
Wrong again
Why can't I find gpt 5 on chatgpt web and app 💀💀
Cursor is doing well by offering gpt-5 for free for a bit
Not available
Damn
thinking?
Yeah looks like models are missing atm, team is aware and working on
It's a mystery
Idk, haven't tested
i think he meant chatgpt.com, not entirely sure
o3 high gets this question wrong most of the time:
A home aquarium partly filled with water slides down an inclined plane of inclination angle θ with respect to the horizontal. The surface of water in the aquarium
(a) remains horizontal
(b) remains parallel to the plane of the incline
(c) forms an angle α with the horizon where 0 < α < θ
(d) forms an angle α with horizon, where θ < α < 90
its b
still don't have access to gpt5 yet
Anyway I will try it in lm arena
Lmarena
bro lmarena is down
Every ai gets the quadrant question wrong
nah its working for me
How to get free access of claude?
😶
Lmarena
lmarena direct chat
I have GPT Plus, where's my upgrade

alpha or beta url still works
It's fully functional as pro version?
@echo aurora can you escalate please
alright how's gpt5 looking so far?
It deleted my chats like twice
better than o3 in STEM
how about versus Gemini 2.5 pro?
yeah (but no tools)
wdym we cant even use it because the website is down
Already have 
on par
the backend's down probably
thanks
for those who have the working url 😊
i am
Answer: (b) remains parallel to the plane of the incline
Why: In the frame of the sliding aquarium, there’s a pseudo-acceleration up the incline of magnitude a. For a frictionless slide, a = g sinθ. This exactly cancels gravity’s component along the plane, leaving only g cosθ normal to the plane. The free surface is perpendicular to the effective gravity, so it must be parallel to the plane.
Note:
If the tank were sliding at constant speed (a = 0), the surface would stay horizontal (a).
In general, the tilt α (from the horizontal) satisfies tanα = a cosθ / (g − a sinθ), so 0 ≤ α ≤ θ, with α = θ only for the frictionless case.
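The tilt formula in the note can be checked at its two limiting cases with a short sketch. The 30° incline and g = 9.81 m/s² are arbitrary assumptions chosen for illustration, and `surface_tilt` is a hypothetical helper name.

```python
import math

def surface_tilt(a, theta, g=9.81):
    """Tilt from horizontal (radians) of the free surface: tan(alpha) = a*cos(theta) / (g - a*sin(theta))."""
    return math.atan2(a * math.cos(theta), g - a * math.sin(theta))

theta = math.radians(30)
print(math.degrees(surface_tilt(0.0, theta)))                     # constant speed, a = 0
print(math.degrees(surface_tilt(9.81 * math.sin(theta), theta)))  # frictionless, a = g*sin(theta)
```

With a = 0 the surface stays horizontal, and with a = g sinθ the tilt equals θ, i.e. the surface is parallel to the incline, matching answer (b).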
how do u know?
need to compare token cost for marginal improvement
i am testing it on the questions in my q bank
@void elm what did it say about the quadrant question?
Personal opinion gemini is dumb
Will we get the mini/nano models on lmarena?
idk but they are on yupp ai
So guys, what exactly are tools? Does lmarena have them?
nope. tools r extra features that model can access - google search etc
@hollow imp
but lmarena does have google search !!!
Is thinking a tool?
nope.
thats separate right
Ofc not, lm arena is a testing site for benchmarks
lmarena is fixed
I have perplexity comet but hoping for android version because most of the time we search in phone not in laptop
Uhm @echo aurora (Sorry for ping)
its fixed
Same
Dem
Same
Cool site
I think lmarena gemini 2.5 pro have google search
(no need to apologize for ping) Team is aware of this and working on a fix
Thanks
looks back for some, but still working on a fix
Huh... I was wondering what model Summit was. I remember having some interesting interactions with that one.
Ok thanks and sorry again
Did you guys fix it with AI lol
Also all my conversations are deleted
yes
Me too
seems like they reset the whole website
Which books did you pick these from, bro? 💀
😶
what's this?
the frontend got disconnected from the backend. everything will be back to normal
Wait fr???
Short answer: You can’t tell from the given picture.
Why:
For x ≥ 0, f(|x|) = f(x). So the graph shown only tells you how f behaves for non‑negative inputs.
But f(-|x|) = f(t) with t ≤ 0. That uses only the values of f on the negative side, which the picture does not give.
What we can say:
y = f(-|x|) is always an even function (symmetric about the y‑axis).
Its graph is the y‑axis mirror of the left half (x ≤ 0) of y = f(x).
Without knowing f for x ≤ 0, the graph of f(-|x|) could lie in quadrants I–II, III–IV, or cross the x‑axis. Extra assumptions (e.g., f even ⇒ same as f(|x|); f odd ⇒ the negative of f(|x|)) would be needed to decide.
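The two claims in that answer (evenness, and dependence only on f at non-positive inputs) can be illustrated with a small sketch. The function f below is hypothetical, chosen only because it is neither even nor odd; it is not the f from the picture.

```python
def reflect_negative_side(f):
    """Return g(x) = f(-|x|), which only ever evaluates f at inputs <= 0."""
    return lambda x: f(-abs(x))

# Hypothetical f for illustration only (neither even nor odd).
f = lambda t: t ** 3 + 1
g = reflect_negative_side(f)

for x in (-2, -1, 0, 1, 2):
    print(x, g(x))  # g(x) equals g(-x), and both equal f(-|x|)
```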
gpt 5 on mc bench
"GPT-5 usage limits 👀
10 messages every 5 hours on Free
80 messages every 3 hours on Plus
Unlimited use on Teams and Pro
"
from exams haha: AIME, IMO, IOQM, JEE Advanced, Irodov
enjoy the 8k context for free users
Wait what 💀
WAHT
Huh
32 for non enterprise 💔 💔
It's very surprising how low Claude is in the coding category, considering that it's leading SWE-bench.
Are you joking? 😭🙏
Bro, this better include o3 level web search
I'm glad we have lmarena
Do I spend my time on them? Are they relevant? Like I'll save them if you share
🔥
I feel claude is better than gemini
@void elm @keen beacon the answer
Is the GPT-5 in LMArena with thinking?
Nah
Gpt5 can switch from thinking to just giving you the answer
In the future, someone should get an agentic AI to re-run all the code in LMArena's public dataset and verify the votes. I bet 25-40% of votes are inaccurate for the coding category.
Its fixed, atleast for me
nono i am prepping for IOQM too so thats why - Irodov is more than enough
FIXED
Are you in any coaching?
Yay
There is a reason why they don't include Gemini 2.5 Pro in the chart. Gemini is waaay better on long context
right
uhmm. I don't think chat history will get back... that's big rip...
The best one is Google jules but normal gemini app with 2.5 is bad
it will
Samey
aakash
if gpt5 and gemini are on par, why's the market acting like Gemini had won?
considering the context size differences, in a way, yes
i heard gemini is planning a new model
Hi! 👋 How can I help you today? Would you prefer to continue in Italian or another language? We are starting bad
Idk, is there any new Gemini news?
that people think will blow gpt 5 out of the water
Can I dm you and talk to you there? Are you frequently on discord? I have 11th prep doubts that can only be solved by personal experience
Nop
sure
gemini.google.com has a new guided learning mode which is pretty interesting
For some reason gemini is doing much better according to Polymarket
its fire
Which makes no sense to me
Where are you from btw
GPT‑5 is available to all Plus, Pro, Team, and Free users starting today with access for Enterprise and Edu coming in one week. It may take a few days to roll out to all Free users.
- Pro users get unlimited access to GPT-5 & access to GPT‑5 Pro, ideal for the most challenging,
not very frequently - i need to study too
In DM
Imagen 4 is best and veo 3
Yeah, basically the Socratic method of learning, haha.
For coding still lacks a lot
I mean like not disappearing and then coming back in 4 months and worse not even coming back
Yessir
not that much heehe (ill check twice a day)
I dno
That's more than good enough I don't even check weekly
💀
lmao
gpt-5 is an amazing model compared to gpt-oss
craig also said gpt oss was great
lmao
Can my conversations not disappear for 5 minutes?
Elon Musk seeing that gpt 5 is better than Grok 4: 🥀🥀
Gpt 5 available
you're late bro
I need advice on how to report the result of a prompt. I asked two AI's a question about doing something on Google Sheets. One was a complete disaster. The second gave me three options; one worked, one failed, and one was half-right. Do I say "both are bad", or "that one is better"?
I think he only tested llama
😶
open-ai 🤝 CPU / GPU companies: at naming production
Both are bad
How about coding in gpt 5
Are you trying to delete a chat? There is a known bug where deleting the chat then refreshing the page brings it back temporarily.
Trash trash ,..
it is MUCH better than their previous models at styling from what i can see
Thank you.
naming them o3 and 4o is actually so bad, normal people wouldn't even know which one is better. probably they want you to get confused
didn't test it that much to be able to tell anything else
No, i am just sending messages
🤔
@echo aurora hello sorry to ping but please can you tell me if lmarena is going to bring pdf attachment anytime soon? I really want it
it's about naming
it was good for its time 👍
the o models have a weird naming scheme
since the number jumped from 1 to 3 
r u the reall craig? (sorry)
@deep adder $1 method goated
o2 became sentient and turned into a company
We're currently having some issues that is likely what you're running into
is the gpt-5 on lm arena same as the gpt-5 on the official website?
most people probably thinking: "well, it has 4 and o, so it must be better than o3, right?"
I don’t think so lol
Can someone pls give me some prompts to see how good gpt5 is. Thx
Gpt5?
In the arrangement shown in figure a weight A possesses mass m = 4 kg, a pulley B possesses mass M = 2 kg. Also known are the moment of inertia I = 2 kg·m² of the pulley relative to its axis and the radii of the pulley R = 1 m and 2R. The mass of the threads is negligible. Find the acceleration of the weight A after the system is set free, taking acceleration due to gravity equal to 9.81 m/s²
(no need to apologize for pinging me)
is going to bring pdf attachment anytime soon?
we are aware this is a highly requested feature from our community, we'll be sure to make an announcement when something like this is added
Thx but what is the answer
@deep adder r u the real craig?
10.3
Btw, what is style control?
Thxxx
a = \frac{3g(M+3m)}{M+9m+\frac{I}{R^2}} = 10.3\ \mathrm{m/s^2}
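Plugging the given numbers into that formula is easy to double-check with a few lines. This is a sketch that only evaluates the expression as quoted in the chat; it does not re-derive it, and `weight_acceleration` is a hypothetical helper name.

```python
def weight_acceleration(m, M, I, R, g=9.81):
    """Evaluate a = 3g(M + 3m) / (M + 9m + I/R^2) as quoted in the chat."""
    return 3 * g * (M + 3 * m) / (M + 9 * m + I / R ** 2)

a = weight_acceleration(m=4, M=2, I=2, R=1)
print(round(a, 2))
```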
gpt-5 is out? im on plus plan
another q:
A uniform cylinder of radius R = 1 meter is spun about its axis to the angular velocity ω_0 = 250 rpm and then placed into a corner. The coefficient of friction between the corner walls and the cylinder is equal to k = 0.59. How many turns will the cylinder accomplish before it stops?
Bro have you tried deepseek v2 prover? That pure math model?
Thx but the answer is?
We should be thankful for the competition in this space. Imagine having to wait 2 years from now for GPT6 💀
For another 3% improvement to benchmarks lel
Nah, its coming out next year
4
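The "4" for the cylinder-in-a-corner question can be reproduced with a short sketch. Assumptions not stated in the question: g = 9.81 m/s², kinetic friction with the same k at both the floor and the wall, and a cylinder that stays in place while it spins down. `turns_before_stop` is a hypothetical helper name.

```python
import math

def turns_before_stop(omega0_rpm, R, k, g=9.81):
    """Turns made by a uniform cylinder spinning down in a corner, friction k on both surfaces."""
    omega0 = omega0_rpm * 2 * math.pi / 60          # rpm -> rad/s
    # Force balance: N_wall = k*N_floor and N_floor + k*N_wall = m*g.
    # Friction torque about the axis: k*R*(N_floor + N_wall), with I = m*R^2/2.
    alpha = 2 * k * g * (1 + k) / (R * (1 + k ** 2))  # angular deceleration, rad/s^2
    theta = omega0 ** 2 / (2 * alpha)                 # total angle turned, rad
    return theta / (2 * math.pi)

n = turns_before_stop(250, 1.0, 0.59)
print(n)
```

Under these assumptions the result comes out just under 4 turns, consistent with the answer given above.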
probably put it on pay wall with limited uses for sometime
yeah but multimodalityyy
Thx
unofficial websites rolled it out faster than openai
I would like to propose an interesting question.
What would be the state of ai in mid 2027? What would be the best model.etc.etc
close to AGI but not AGI
my prediction heh
Well I converted the non diagram questions into natural language through grok 4 and then fed it
hmm
Like in more detail bro
no one knows lol
anyone experiencing the models lagging a lot right now?
they said today for plus, but I dont have it yet
hi
LMAOOOOOO
You should try it. I had a hard time trying to find where to chat with it. It was on huggingface but with limits
i tried it, but its meh.... its not reasoning its a weird mess kinda thing
after this gpt-5 nano = zenith confirmed
Oh
Uhm is normal that I am still waiting a response from gpt 5 ? 💀
artificial analysis
Yes
It's impossible
goddamn
this feels more like gpt-4.75 to be fair
they are trippin
Thats odd
gpt-5: 10^(π^e) ≈ 2.878446 × 10^22
gemini 2.5 pro: 2.878335... x 10²²
correct answer: 2.878443560...x10^22
gpt-5 is pretty close. it's probably a rounding issue right?
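How close each quoted value is can be measured directly. A sketch using Python's `math` module in double precision (about 15 significant digits, enough to settle a comparison in the 6th or 7th digit); the two model values are copied from the messages above, and the gemini figure is only the leading digits that were quoted.

```python
import math

reference = 10 ** (math.pi ** math.e)  # double-precision evaluation of 10^(pi^e)
gpt5 = 2.878446e22     # value quoted for gpt-5
gemini = 2.878335e22   # leading digits quoted for gemini 2.5 pro

for name, value in (("gpt-5", gpt5), ("gemini 2.5 pro", gemini)):
    rel_err = abs(value - reference) / reference
    print(f"{name}: relative error {rel_err:.2e}")
```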
in my run it got it wrong
seems like its broken
yeah
wait till gemini 2.6 🙃
All the independent benchmarks are coming in and they are trash 💀
Gemini 2.5 said the correct answer
and gpt-5 auto is routing between those models
.
Gg
Artificial analysis tested and they have it as trash kek
Also gpt 5
I have gpt 5 in chatgpt
Why would they lie
bro gpt-5 sucks sorry
Damn you have premium?
i don't have it yet
New people coming to discuss about gpt 5 and new people getting in beef with you 😔
yeah
yeah, apparently, just ask it to “think hard about this” in the prompt, it will switch to thinking mode then, that's what's been said in the release docu 🤓
@keen beacon dm
Will Gemini release a new model this month?
13
21
1
Yes
Will gpt5 beat gemini in lmarena leaderboard for text?
gpt-5 400k context and they still limited chatgpt pro users with 100k, i'm gonna cancel this s****
@deep adder gpt 5 is good
I add this specific line in the prompt " Perform a deep granular assimilation of the question before proceeding".
what you have on other models?
anyone know when gpt5 will be available 
I mean... it kinda is? SOTA for function calling and instruction following. And challenging for the top spot everywhere else
yeah
good vibes, good overall model, js hope it's smart
gpt-5-thinking
they are rolling out the update to trillions of users
NOOOOOOOOOOOOOOOOOOOO
All previous models are getting deprecated
💀
Only at me gpt 5 is extremely slow?
its thinking
Only GPT5 will exist. I assume it is cheaper or smth
high usage rn
theyll have some time to deprecate
Bruh ok but it has been thinking for 2 minutes 💀
ALL of them?
Yes
Which model is the most generally intelligent model right now?
14
26
4
GPT-5
ALL
yeahh they nuked it. Old cache but still wouldn't let using it lol
so sad
never see you again
I think they try to reduce losses. Maybe GPT5 is actually smaller than o3 even
once i had o3-mini think 10.5 minutes for a physics problem
🤣🤣
this can't be real
10.5 o3 minutes is 7 hours for deepseek lmao
frrr
Yesss it's out
REAL
never doubt gpt-5 nano
it runs out of thinking tokens 🤣🤣
Vote plz
Will gpt5 overtake gemini on the leaderboard?
gemini 3 will be better for sure
Chat, be honest. You are in this group only to see if there are better AIs to use for homework 💀🤣
lots of traffic, that's the reason
already
And work man
Not everyone is a com kid like you
nuh uh
Nah not with style guide switched off
GPT-5 has 4 new chat personalities: Cynic, Robot, Listener, Nerd.
Find them in Customize ChatGPT in settings.
What does openAI even do now. The AGI illusion has been shattered kek
why gpt 5 says he is gpt 4
@keen beacon chatgpt 5 said wrong answer
They will never beat the Grok girl
@willow grail Told you it's to be same day release for EU. You wouldn't listen 
what should I try to give gpt 5 thinking
about to get out of college already. I'm worried whether my future IT company will allow using AI or not
exactlyyy
"gpt5 is good at frontend aesthetics"
gpt5:
ye but how long do you think it would take for other company to beat it?
pretty decent for one-shot
hello
the other one is made by gemini?
yeah 2.5 pro
Chat, will gpt5 beat gemini on lmarena?
and for a wrong answer (o4-mini)
Only with style control
I'll take left side over strawberry pizza
make a beautiful webpage for a classic pizzeria store (embed css)
The results are already out
On Simple-Bench what will GPT-5 score?
9
20
4
70%
im refreshing my GPT page every 30s. give it to me
What does style control do?
The Gemini bat logo will rule them all
ill see what i get from it lol
another
now @deep adder can start hyping gpt-6, gpt-6 for sure will be a different beast, insane model, sota
At this rate xAI gonna overtake openAI kek
also Google is bankrupt
Annotators on their way to destroy their own career:
@deep adder will gpt-6 be AGI
believe they said all future reasoning models will go into the autorouter (GPT5)
and we should be getting a big reasoner in ~December maybe
GPT5 is O4
grok 5
gpt5 is o3
yeah I think so too
o3 is o2
does anyone have gpt5 access on chatgpt.com yet
lol
yo guys why do some people have sound on their videos?
gpt5?
i have it on my phone app for SOME reason
because they got lucky
gemini ; (
@pearl kiln bcs of veo 3
not on the website though
yeah ik but is it random
Gpt 5 pretty laughably bad at translation compared to something like 2.5 pro, it writes like a textbook XD
it is random
hmm ig gemini gets lucky at times
yes
oh woww i have it too but only on my phone
lmfao
alright thx
OMG I WANT GROK 5
gpt-5 suck for writing btw
GPT 5 seems to suck at lotta things
reminds me of meta's scout and maverick releases
now i need to test it with code
hehe
they said that it's a bunch of phds in my pocket
lol
yall try generating music with web audio api with gpt5
It's strange how it basically is just openAI failing to progress at a fast rate. The Chinese LLMs don't seem to be having a slowdown
PHD from whatsapp?
After GPT-5, what are your AGI timelines?
8
21
2
2028-2030
exactly
well those phds must have bought their degree because they are dumb as hell
its really good lol
fr
claude is not slowing down
just look the paper they released about the llms personalities, that's insane
gpt-5 isn't agi...
but claude sonnet 4 is not that great of an improvement from 3.7
opus 4 is awesome tho
yeah
it is lol
maybe but not imo
According to sam it's even better. An entire team of PhD experts in your pocket
for me it is

hmm
lol where not true bro.
i dont have it on my openai account
agi is just human level. bro said ASI in your pocket
for me claude sonnet, not opus, is the best model actually
GPT5 vs Gemini 2.5 pro
openai playground
u and ur expensive api
You need to be paying API customer though. But still EU 😇
gpt 5 wins
not for math though
yeah gemini gets lucky at times
for math it was o3
yeah tho i use o4 mini its really fast n a math nerd
both are nice
gemini wins
but GPT OSS 120b on cerebras impressed me
hard questions, answers in < 5s
its amazing
which is gemini
right one
gpt5 looks better
design wise gpt one is ugly makes no sense.... pls look again at it..
the 2.5 pro version is clearly inspired from retro stuff.....
the gpt one... is... not.. its not inspired at all
they are both great
yeah hehe
yess
but folks hate css tho
you're right
I am more interested in day to day stuff since I am not a coder. People will use GPT 5 for more trivial stuff.
@blazing bison
but GPT OSS 120b on cerebras impressed me
hard questions, answers in < 5s
its amazing
even on math arena it tops (even on SMT whose q havent been released)
And decreased hallucinations help with correct info
@echo aurora hi, can you please provide an update on the feature request to allow disabling the sysprompt in direct chat? i requested this before the new lmarena even released and i've heard nothing since
all in on google
i really like claude sonnet for day to day stuff, i like how claude use less dashes, tables, etc

i need the updated 2.5 pro asap
tho the gpt one has more various elements
wait gpt5 is first image?
Actually I trust Google more than OpenAI
formatting is bad
yeah
not TOO bad
100 req / day for their best model
but if you aren't actually talking with it, just using it like a search engine, then gpt is the best right now i think
how can usee formatting
not all that bueno but still
gemini?
formatting means the look n placement of the elements
thats gpt 5
I like to use it for brainstorming stuff
ah I see
ideas etc.
zenith
it was grok 6
Hi
Hello
How are you?
gpt 5 design is disappointing. gemini does better
Doing alright, trynna test GPT-5 but it is so slow because everybody is smashing code into it
which ai do you think is the best right now? gpt 5?
nah
Needs more testing
What did you try it in?
when is gpt-5 actually coming out? I still don't have it anywhere
its out
yup
still not on chatgpt
It is on LMArena at least
ROFBLOX
eu?
yeah ik
no
check your phone bro
ok then thank god
yeah its on lm arena i meant
I am & I updated it
Will take time
oh im in usa
I dont have it either
I told them to recreate Deepl
GPT5
maybe usa ppl got it first
its probably progressively rolling out
but it could be because I am in the EU
vs gemini 2.5 pro
DeepL is based btw
Actually gemini is looking better ngl
better than google translate
i disagree but opinions opinions
gemini better
Looks more like the actual website
isnt that literally the deepl site?
Yeah looks neater
this is what deepl actually looks like
no
it's oblox ahh design
then its very good
design score goes to GPT-5
accuracy score goes to 2.5 Pro (HOW)
i am new in this server
really?
its pretty good, like it could have removed some of those shadows but its pretty good
booyah
if thats not the actual deepl website gemini did a great job
yeah
helloo
Gemini is very good at following image instructions
btw
oblox seems to be some peak UI benchmarking lol
can i use veo 3 inthis server?
yes
ofc
if you get lucky enough
SWEEET!!!!
That's right. Unlike chatgpt who sometimes does unwanted things that aren't in the prompt. But gpt 5 handles it very well.
i find that GPT-5 is far from just focusing on intelligence but also how pleasant it is to work with in general use (like, i don't know, ChatGPT)
what do u mean by that
How does gpt-5 behave on coding tasks?
@everyone Is there a way to generate videos with sound here, pls help I just got here today
the video model you will get is random
you cant choose it
brilliantly from my experience
@everyone!
like
Brilliantly
go to video arena channel bro
Yes! Just go to video arena and write a prompt that includes audio description and it will auto-use veo3.
lol
HOW MANY Veo3 videos can i GENERATE IN A DAY
kid
Guys I think there is smth actually extremely wrong with gpt5. Like they made a mistake somewhere
8
everything
depends on how many discord accounts you have😂
Look how it performs compared to horizons
Devours Sonnet 4
FREE?
IN THIS SERVER?
yes
Yes. Why you shouting to me?
default thinking or
NAH BRO I AM JUST TRYING TO HIGHLIGHT THE MSG JUST IN CASE U SEE IT
i find it a shame people now dumb it down to "thinking" instead of "reasoning"
HELLO
just had gpt-5 answer a question way worse than 4o lol
my bad i didn't mean it negatively
gemini is so much better lmao
horizon beta had a great design
Noice joke buddy
I'm pretty sure they screwed smth up in the release
wait horizon models already revealed?
hello
Like artificial analysis benchmark and everything is literally showing GPT5 worse than o3 it's smth wrong in the launch
ok we have some real improvements here
btw
In what time term?
where CAN i actually generate it?
Video Arena 1/2/3/4
HERE
Check out #1397655624103493813
thx bud appreciate u
**HOW ARE YOU **
Which version of GPT-5 do we have in the arena? GPT-5 pro or just the "normal" thinking model that Plus users have?
Can anyone give me a memory prompt for Chatgpt to not use the long dash and use emojis less?
standard version
im pretty sure there is a pro?
pro is not released I believe
After gpt 5 I am still waiting for deepseek R2
Thank you
deepseek is just a one hit wonder
i don't think so...
After all, we should all really appreciate LM Arena for letting us use their service for free.
Ohhhh they are showing benchmarks for GPT5 high internally but we only have GPT5 medium ofc. That's why everyone else's benchmarks are awful 😂
it just that deepseek don't hype, they release
They didn't release the high version yet
it's proven they can't catch up, they don't even have vision at this point
@echo aurora will we have GPT 5 pro on LMArena direct chat
no we have gpt 5 high
prolly soon or so
it's available already
Why they released two versions of GPT 5 😭
It's not high. There is no reasoning level in the API I have looked
Ohh but there's no such model on the API called GPT 5 PRO
daaaaawmn high
There's no setting
But the website does have
Everyone currently is using GPT5 standard
Maybe OpenAI will add it? How do you know?
Including LM arena
idk if you're talking about the playground, but it's working with python
Only three models are on api lol
Gpt 5, mini, nano
But no pro
I think that's a variant of gpt 5 not pro
What is it
but better
The horizon models don't match up to GPT5 then
All the benchmarks are looking like this and wondering wtf is going on
yeah prolly
Yea it's crazy
It's not just one person. Multiple benchmarks
Wtf did i just see
Have it like that
" Pro, Plus, and Team users can also start coding with GPT‑5 in the Codex CLI(opens in a new window) by signing in with ChatGPT."
honestly of all AI models released these days
Apple's OpenELM is still the gold
no one can top that
wow
There's smth wrong with gpt5 lol
Where's gpt 5 pro benches
Where's Codex CLI now
bro wth lmarena had the model on the second gpt 5 was announced but the chatgpt website is still the old one
Lies
who is lying?
I don't have it
still smarter than gpt oss
Oh it's codex
Oh nice I have it
It was updated an hour ago
Im just dumb
they did this bcs of claude code lmao
I certainly think that Gemini is better at coding than ChatGPT.
Codex is the best coding agent ngl
no
Yes
I use the web
when gemini 3
I work on open source stuff so idc lol
no one knows what zenith was
bro is gpt 5 improving?
I do Linux development
i feel like its better than an hour ago
Bro has 0 clue about anything kek
with AI, uugh
it doesn't just "improve" like a human
I use it with rust just fine
yeah bro, my colleagues that actually do something with rust disagree
claude opus is the best one for rust but it generates super verbose code, so no one uses it
WHATS BETTER IN LMARENA GPT 5 THAN OFFICIAL CHATGPT FREE TIER GPT 5?
it's not
how
Any performance related change?
OpenAI auto switches to mini after a few times, LM Arena doesn't.
yeah
Ofc, not because it is not there yet.
And it is not possible in general
it is for some people
oh
Do you guys know how to get a better version somewhere else? Like a more reasoning one
slow roll out
only if you pay for it
I'm talking about free
Like how you can use o3 and o3 search in lmarena and nowhere else for free
Same with grok 4 search
Same with opus 4 thinking 16k
@blazing bison something like that for a better gpt 5?
for search just click the globe icon at the prompt bar
If summit is gpt 5 then what is zenith?
free biscuits only on lmarena brothers
Chat what is zenith
Well sometimes thinking gets much slower results. (sometimes even worse results)
Especially my eye is on gemini 2.5 deepthink
Even if I get 1 prompt per week
Just free biscuit pls
🙏
@blazing bison
Nothing in SWE?
yeah they're gonna directly replace 4o-latest with gpt5-chat-latest
Does anybody know anything about zenith
And o3 with gpt5 medium reasoning
Right. I didn't relate to the analysis you sent.
though that's likely later
o3 lm arena level reasoning vs gpt 5 free tier reasoning
Is gpt5 reasoning not available at all for free on anywhere?
free tier (gpt5-mini-reasoning) is probably like marginally improved o4-mini
though I haven't seen direct comparisons yet
So you can't use gpt 5 reasoning?
What about poe.ai gpt 5 reasoning high?
Well it's probably a minor improvement. Reasoning models are mostly slower so it's probably only improved in the output, and not in the reasoning time.
wdym. I think free tier is gonna be mini
well limited use of the full model probably, but not with reasoning
Nope, after certain amount of prompting, it will automatically switch to the mini model
and then it will fall back to gpt5-mini with and without reasoning
yes
Tell me about this
Who is going to eat openAI first. Chinese LLMs or other American ai companies
If I stop and open the chat the next day, will this be fixed?
yeah that's what I meant. But it's unlikely that they will offer full gpt5 with reasoning to free users. They didn't get o3 either, only o4-mini
Gemini 3.0
Deepseek is cooking
Ah, okay
O4 full release when
it's gpt5
No o4?
and "o4-mini" is just marketing name lol
No, because GPT 5 is about combining reasoning into one model
no more O series
Why openai naming so twisted
No, they merged them now with gpt5
Why don't they do it like gemini, with reasoning forced on every response
But where can u access it?
that's how o3 was
can u access in this server?
LMArena at least but not on the gpt app
at least not for me
you don't need forced reasoning though
Glm5
Is glm a reasoning model?
Because it is hybrid. It uses reasoning when needed
Yes, but it's more of an AI agent than a model
sooo the gpt5 is in this server i mean LMArena?
Is it on par with o3 and gemini 2.5 pro and grok 4 math skills?
Russian 🙀
I didn't test it on math
Im confused. Is gpt5 overtaking gemini on the leaderboard and does this mean openAI won?
I hear lots of mixed views on this
Russian
Please don't send it here.
no
Gpt 5 is not up to the mark in coding
His pfp reminds me of him
Your role here always sending this pic 🤣🤣🤣🤣
