#general
Look at the confidence interval though
I used it now I can say it has the ability to use some amazing things but when it comes to functionality it lacks a lot
I always wondered what that metric means. I am bad at math
the CI
Yeah, now even the battery has died
This is pure chaos. Love it.
Yes yes, go to sleep
hhhhh this table could have looked slightly better. I see why they hid it 
If you multiply it by 2 you get the width of the range the score is expected to fall in with 95% confidence (from the score minus the CI to the score plus the CI). It’s subject to a few assumptions but yeah
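A minimal sketch of that arithmetic, assuming the leaderboard shows each entry as "score ± value". The scores below are made-up placeholders, not real leaderboard data, and the function names are just for illustration.

```python
def confidence_range(score: float, half_width: float) -> tuple[float, float]:
    """Return the (low, high) range implied by a 'score ± half_width' entry."""
    return score - half_width, score + half_width


def intervals_overlap(a: tuple[float, float], b: tuple[float, float]) -> bool:
    """True when two (low, high) ranges share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Hypothetical entries (score, ± value); not taken from any real leaderboard.
model_a = confidence_range(1480.0, 12.0)
model_b = confidence_range(1465.0, 6.0)

width_a = model_a[1] - model_a[0]  # full width is twice the ± value
print(model_a, width_a)
print(intervals_overlap(model_a, model_b))
```

When two ranges overlap like this, the gap between the two scores is small enough that the ranking could plausibly flip with more votes.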
ok, thanks.
btw what can u do with gpt 5
Stuff
Why you lyin to people
https://x.com/sama/status/1953551377873117369?t=8u99Wmh-kr7LZialZ1Sunw&s=19
"we can release much, much smarter models" -Sam
GPT-5 is the smartest model we've ever done, but the main thing we pushed for is real-world utility and mass accessibility/affordability.
we can release much, much smarter models, and we will, but this is something a billion+ people will benefit from.
(most of the world has
What does this mean
Yes, much more than the 4o model. Much lower hallucinations at least which is important for factuality
Damage control?
He trynna hype us up again. He should be more honest
i get it
is gpt 5 free or does it come with a subscription
It is free, yes
10 messages/day
free
lmarena limitless
Then goes to a GPT-5 mini
Reasoning
GIMME REASONING 😭
It's a limitation for free users. After certain amount of prompting it will switch to the mini variant
cost stuff
What does this meannnnn
subscribe for more points
ohhhh
sad
gpt5 still worse than gemini 2.5 pro
Yeah, I find it good with really random questions. Trivial stuff
but i have to buy the subscription for gpt 5 mini, right
did some already
Does the "remove style control" leaderboard get updated (I ask because I discovered it just now)?
no
huuh
and where are the people that said "gpt-5 will accept video inputs"
gpt-5 will accept video inputs
gpt -5 will have a new gen image model
it is
gpt 5 high is live on yupp ai
In ordinary chats?
idk
it's fake
Hoping to see that since Gemini 2.5 pro does that.
Analyzes them well
Bro got deceived
ohOK now what the heck is gpt 5 HIGH
high
is gpt-5 after drugs
Where? Not on LMArena so that is irrelevant
is it like gpt 5 after taking drugs
didnt legit say gpt5 high was basically summit
no summit is gpt-5 medium
its the highest in the room'
no man he say high
where?
he deleted it but he announced it
He who
it still in #announcements
For that is a mannnnnnn what has he got !! !! But him selfffffffff...
not this server
it's fake bro
wdym its fake
He doesn't want to accept he got deceived so easily
One of his ingrained biases
they put gpt-5 low and just change names
😔
ill try it
yep its fake
and see if its any better
Btw will LMarena ever be taken down
why would it
where is GPT-5 Turbo
yupp ai
Perplexity ☠️
they add a system prompt that makes the model worse too
Idk if it was legal or not since u using paid versions
💀
bro i am just here to generate Veo 3 videos with audio on it
are you serious
@keen beacon
removing style control
Okay what does it do?
style control disappearing
Well ik it legal probably but didnt know really why it let's u use paid versions for free
and perplexity is powered by google
lol
me
I just want them to make a better browser than chrome I don't trust perplexity with anything else
Just stop thinking about that and enjoy it
But if they make something better than chrome then shouldn't I use it
Perplexity's GPT-5 didn't use thinking. But holy hell, that is a question even Kimi K2 could have solved.
the "gpt-5 high" on yupp ai gave me this
Better better BETTER
I am enjoying it but I was just thinking like is there a way to save it to your PC in case it gets taken down
betteeerrrrrrrr
Ask pineapple
He's Trying to save his ego
maybe
Is he a dev on it
Will GPT-5 be added to the search arena?
bruhh i am just here to use veo 3 with audio in peace
I dont know, lol. I dont work for AI companies
you will be arrested
or have inside info
hey how can I make it so that when I generate a video it always gives me veo3? I'm creating a bunch of videos and they don't have sound, should I put veo3 in my prompt or?
Not gonna happen
Oh, i thought you were with LMSYS arena?
this is the one from the lm arena version
why
Lmao
I just read what is online as of now
Try a math question
to provide info
Download aSim, pm your phone and then in the app search up “glow”
what is juice
Hopefully they add it. Its a good benchmark
ok ill send it
Gpt 5 high is gpt 5 pro?
gpt 5 high is gpt 5 high
Harambe is that you?
Are you not subscribed to chat gpt pro?
What's this supposed to mean kek
"GPT-5 is the smartest model we've ever done, but the main thing we pushed for is real-world utility and mass accessibility/affordability.
we can release much, much smarter models, and we will, but this is something a billion+ people will benefit from.
(most of the world has only used models like GPT-4o!)" "much, much smarter model" WEN!!!
sam altman btw
hes trying to make people bet for xAI
no gpt5-high is like o4-high if it existed
pro is gpt5-pro
Im poor
Is it a scam or they really gonna drop smth
You need pro ultra
Dude, it is 200 dollars
exactly, all i see is hailuo, kling 2.1 and some other models instead of actual veo 3, and the veo 3 ones it generates are without audio
What will happen when somehow just somehow deepseek r2/r3 crushes openai
cheap
yes elon musk said he will release grok 5 before 2025 ends
What the hell kind of half-baked model did they cook up over there with GPT-5? I just ran it through my standard set of tests (which I have used for years and by now must have made it into everyone's training data) and it got somewhere around 3.5-4 out of 5?
For reference, all of these score 5 out of 5: GPT-4.1, GPT-4.5, o1, o3, Claude since Sonnet 3.5, DeepSeek-R1, Grok since v3, Gemini 2.5 (both Pro and Flash), and even GLM-4.5! How did they manage to make GPT-5 worse than o3? Did they overcook it during RL post-training or something? It's worse at instruction following and won't properly format its responses (GPT-5 Mini gives me Markdown at least). Something really weird is happening, because I feel like it has some good intelligence (below expectations but still good), but it's shooting itself in the foot on the simple stuff.
yes
Yes I know I'm poor aswell but there are many people in this server with google ultra subscription chatgpt pro subscription
the deepseek guys never say a word
OMG GPT-5!!!!
they just release
Yeah to be serious though.. I'm never paying $200 - that's crazy lmao
gemini 2.4 pro
and totally not worth it
your kinda in a delay
Better than to hype a product all the damn time
Well, leaderboard doesn’t agree with your assessment.
idfk
you get like 8% better performance than non-pro model if that
it gets obnoxious
why?
Leaderboard is wrong
@deep adder
the hype is just a marketing strategy and then they hit us with absolute garbage
Can you tell us more about the kind of questions it failed on?
You need to read reddit less
reddit sucks btw
garbage? Based on what assumption?
Spoiler alert: Models don't degrade after release
Wrong answer
Oh wow the xAI odds started increasing quickly after this kek
They spiking
ok
my gpt 5 answered that it can't be determined
no edging?
if it's still that old question
lmao
No u
I have a question for you ( I'm poor )
guys
😠
u know the tweet
Wrong bruh
hm?
where sam altman asked gpt 5 about ai shows
so whats the right answer
can you try this?
<p>For example, $17$ and $1305$ are heptaphobic, but $14$ and $132$ are not because $14$ and $231$ are divisible by seven.</p>
<p>Let $C(N)$ count heptaphobic numbers smaller than $N$. You are given $C(100) = 74$ and $C(10^4) = 3737$.</p>
<p>Find $C(10^{13})$.</p>
I have access to a python compiler.
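A brute-force sketch for sanity-checking small cases of the pasted problem. The full definition is not in the paste, so the rule coded here is an assumption inferred from the examples: a number is heptaphobic if it is not divisible by 7 and no swap of two of its digits produces a multiple of 7, with swaps that create a leading zero not counted. That reading agrees with the four examples and with C(100) = 74.

```python
from itertools import combinations


def is_heptaphobic(n: int) -> bool:
    """Assumed definition: n is not divisible by 7, and no swap of two
    digits (without creating a leading zero) yields a multiple of 7."""
    if n % 7 == 0:
        return False
    digits = list(str(n))
    for i, j in combinations(range(len(digits)), 2):
        swapped = digits.copy()
        swapped[i], swapped[j] = swapped[j], swapped[i]
        if swapped[0] == "0":
            continue  # assumption: leading zeros are not allowed
        if int("".join(swapped)) % 7 == 0:
            return False
    return True


def count_heptaphobic(limit: int) -> int:
    """Brute-force C(limit): heptaphobic numbers from 1 to limit - 1."""
    return sum(1 for n in range(1, limit) if is_heptaphobic(n))


print(count_heptaphobic(100))
```

This only works for small limits. It can be run at 10^4 to compare against the given value, but 10^13 is far out of reach for enumeration and would need a counting approach over digit multisets instead.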
and it answered pantheon
if u ask gpt5 rn it doesnt give out that answer
wtflip????
@rapid merlin
you read all those books about memory, what did you learn? kinda interested now
he just downloaded them
This one for example
https://yupp.ai/chat/626e9b2d-e73f-42cc-8a8d-dfb525482eb9
Deepseek???
ok i send
knew it. Ain't no one about to read 3 similar books on the same topic
I'm rerunning it with a system prompt that kinda makes high more unhinged (more high)
?
To remember book or data:
Active recall
Photographic remembering
Underlining
Summarisation
Linking
In-head simulation of practice
Explanation to oneself aloud
Virtual memory palace
Spaced repetition
To train photographic:
Military method
Word contemplation
Numbers memorisation
To train intellect:
Study hard topics constantly (Japanese, Chinese, advanced mathematics)
To speak:
Learn syntax
Learn logic as a branch of mathematics
Writing (just as this text)
Dissect and demarcate sentences
Contemplate and demarcate sentences while speaking
Read aloud
Sequence memory:
PAO
Binary code learning
Remembering sequences
Pattern recognition:
Reading what induces pattern-seeking propensity
Thinking about systems and patterns
Writing—subcategorisation and loopholing (just as writing this text)
Self-analysis:
Physical self-awareness
Psychological self-analysis
Philosophical self-analysis
Cognitive self-evaluation
Retrospection and strategy
Skill-building:
Full concentration
Kinesthetic exploration via error-driven movements
Exuberant caffeinatedness
Lucid dreaming
what
5+ people have asked me the same question
Why you do this wall
i hope deepseek is cooking something good
Of bs
Share the system prompt please
did we like gpt 5 or not?
no
Scroll up and down a little please
I might need AI to get all those terms explained
lol
why
Bro pulled out proof about his books! thanks for this!
Nothing too crazy actually. But it worked for what I needed it for so ended up using this for other stuff now lol
All responses must be extremely long. it is crucial that leave no stone unturned and complete everything in exhaustive detail meticulously. You must reflect endlessly for each user's query. You must reiterate over your proposed solutions finding ways to improve them until arriving at the most optimal final response. Meaning you must review each response provided and then improve it.
Like something that will help you all
help me
😹 written by ai
We need some guy to just read about a particular topic and distribute learned knowledge to the rest of the community. Lmao!
My system prompt
Executes ultra-precise, intellectually systematic analysis through deeply analytical cognitive processing.Tries to build Deep levels of awareness about the query, utilizes abstract thinking.
fr
I'll ask you a question. If you understand basic English the realisations that click in your mind will be enough to change your life at max.
Ready?
yes
Something went wrong while generating the response. Please try again.
I got that error what does it mean?
You don't need to sound smart for AI, it doesn't care. But it's gonna mimic that jargon and could do schizo like linkedin responses for you lol
You need to be a good explainer for AI.
Yeah I stopped using it 5 months ago
It wasn't even generated by me it was generated by Claude 3.5 sonnet
Yeah but that's different 👀
thinking took too long and the website just sent back an error
Now try that with Claude 4.1
Bro what was the question you were about to ask?
vodka
Hey! Who knows why gpt 5 on Lmarena says it's 4o?
Lol
the model doesnt know its gpt 5
"If a machine could print out every thought you've had for the last seven days, and you were forced to read it, would the person revealed on those pages be the person you believe yourself to be?"
Because it has not rolled out for all users yet
Claude opus 4 was a very good prompter. Built a lot of my favourite prompts with it.
it can be slow
my machine would be angry russian z-patriot
NO! thats the whole point of thoughts! to change who you are after having them!
I have a very simple test where I ask it to write a short paragraph about web application reactivity ending in: "..properties of the component".
This tests the model's ability to pre-plan a sentence and guide its token generation so it ends up with something that makes sense and ends with the correct words. All of the large models can do this beautifully, reasoning or not. This was the paragraph I got from GPT-5:
"Web application reactivity lets the UI respond automatically when underlying state changes. Frameworks track dependencies with signals, proxies, or fine-grained subscriptions, updating only what’s necessary for performance. Developers write declarative bindings so templates reflect current data, producing predictable, maintainable behavior rooted in a single source of truth and explicit inputs, events, and props..properties of the component."
Not only did it include the dots like a psychopath (a handful of reasoning models do this), but "props..properties"? That's fully nonsensical.
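A rough sketch of how that ending test could be checked automatically. The function name and the two checks are illustrative assumptions, not the tester's actual setup: one check confirms the paragraph ends with the required words, the other flags the case where the literal dots from the prompt were copied in front of them.

```python
import re

REQUIRED_ENDING = "properties of the component"


def check_ending(paragraph: str, ending: str = REQUIRED_ENDING) -> dict[str, bool]:
    """Check whether a paragraph ends with the required words, and cleanly."""
    # Drop surrounding whitespace, wrapping quotes and the final period.
    text = paragraph.strip().strip('"').rstrip(".").rstrip()
    ends_ok = text.lower().endswith(ending.lower())
    # A literal ".." right before the ending means the dots from the prompt
    # were copied instead of writing a sentence that leads into the phrase.
    copied_dots = re.search(
        r"\.\.\s*" + re.escape(ending), paragraph, re.IGNORECASE
    ) is not None
    return {"ends_with_phrase": ends_ok, "copied_dots": copied_dots}


gpt5_tail = "explicit inputs, events, and props..properties of the component."
clean_tail = "Each update flows from the reactive properties of the component."
print(check_ending(gpt5_tail))
print(check_ending(clean_tail))
```

A string check like this only catches the formatting failure. Whether the sentence actually makes sense ("props..properties") still needs a human read.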
The one that has no speed test
Only about the network
You are supposed to think about the question for 30 minutes in silence.
wut? did i answer wrong?
Im going to sleep 🙏
why
What was the right answer! you can't just leave like that!
my answer is right
@stray aspen how is this yupp ai giving access to o3 pro? Does it have like really strict limits? Are you sure it's not fooling people?
There's no right answer to that question
what about my
it does have limits
you have points
but you dont buy the points
you get them from using the ais
gpt5 out guys
yes
Like Is it legit? Authentic and reliable?
really?
you can check lmarena
yes bro its real
wooow
Yeah, it uses a lot of nonsensical phrases that sound profound but really are logically inconsistent. Weird post-training i presume.
They sell your feedback to buy the model
and its also an arena
anyway gpt5 sucks
It does not have gemini 2.5 deepthink
Maybe it's authentic!
does that even have an API
hello lets play together
Play what
lmao
what
yay, my request has been granted
now we need gpt 5 high
omggg all the gpt 5 model got on lm
Not all
gpt 5 pro
Hajime no ippo better
Is there a way to get rid of the system prompt of yupp ai and add your own system prompt?
no
what were you expecting
if you want the full on service just buy a subscription in chatgpt
Guess I'll have to stick to this plan
So what was lobster, tangerine, starfish, and zenith??
gpt 6, 7, 8, 9
gpt 8 pro high premium battle pass
Gpt AGI pro max
Gpt AGI pro max wen
bro gpt 5 is so bad
yes
After gpt7 I think sam altman is gonna name it after himself
Gemini 3 is going to crush GPT-5
gemini 2.5 pro already crushing gpt 5
And I'm gonna crush my exam
hour 3 of no gpt5 access on web
Same
better lets play together
And only a 32K token context for paying plus users?
I'm going to sleep
Goodnight and goodbye
🙏🙏🙏
😭
gpt 5 is so slow
i am already get controller for game
feels like deepseek
Good night.
i hope you will play with me
Maybe yann lecun was right
Play with someone else
no i want play with you
why do you have a need for that? Go to a dedicated gaming server, dude
lol
im pretty sure zenith is gpt5 Pro
I'm not a dopamine filled sheep like you who procrastinates over the smallest of things
Hell yeah let’s go
this game exam for me
i am training
to deliver food
help me you are so smart 😭
hey goobers, just wanted to share this preview html button I made for lmarena :)
// ==UserScript==
// @name Arena HTML Codeblock Preview
// @match https://lmarena.ai/*
// @grant none
// @run-at document-idle
// ==/UserScript==
(() => {
  // Selector for the code-block containers on the page.
  const sel='div[data-code-block="true"]';
  // Add a Preview button to every HTML code block found under root r.
  const go=r=>{
    r.querySelectorAll(sel).forEach(b=>{
      if(b.dataset.apb)return; // already has a button
      const g=b.querySelector('[data-sentry-element="CodeBlockGroup"]');
      // Only handle blocks whose header text mentions "html".
      if(!((g?.textContent)||'').toLowerCase().includes('html'))return;
      const btn=document.createElement('button');
      btn.type='button';
      btn.textContent='Preview';
      btn.className='inline-flex items-center justify-center gap-2 whitespace-nowrap transition-colors focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-ring ring-offset-2 focus-visible:ring-offset-surface-primary disabled:pointer-events-none disabled:opacity-50 [&_svg]:pointer-events-none [&_svg]:shrink-0 text-sm font-medium text-interactive-active hover:text-interactive-normal active:text-text-tertiary relative rounded-lg p-[6px]';
      btn.onclick=()=>{
        const code=b.querySelector('pre code');
        if(!code)return;
        // Prefer per-line elements when present, else fall back to the raw text.
        const ls=code.querySelectorAll('.line');
        let s=ls.length?[...ls].map(n=>n.innerText).join('\n'):(code.innerText||code.textContent||'');
        // Decode HTML entities via a throwaway textarea.
        const d=document.createElement('textarea'); d.innerHTML=s; s=d.value;
        // Render the code in a new blank tab (does nothing if the popup is blocked).
        const w=open('about:blank','_blank'); if(!w)return;
        w.document.open(); w.document.write(s); w.document.close();
      };
      (g||b).appendChild(btn);
      b.dataset.apb=1; // mark the block as processed
    });
  };
  go(document);
  // Re-run on newly added nodes so streamed responses get a button too.
  new MutationObserver(m=>m.forEach(x=>x.addedNodes.forEach(n=>n.nodeType===1&&go(n)))).observe(document.documentElement,{childList:true,subtree:true});
})();
you can clearly see theres no url or anything
what is this
no 😹
its because its a userscript
it runs if you are on lmarena
thats the only url
gpt-5 is fire. He fixed the error that I wasn't able to solve
which gpt 5
i mean sometimes it does miracles and sometimes its straight up garbage
idk, basic, he just thinks longer
what AI
?
what ai made you the code
for the preview button? gpt 5
lying
for the page on the right, gemini 2.5 flash lite
Why you hating on the model so much?
because gemini 2.5 pro better
Gpt 5 made me a professional website with many features in 1 prompt
based
i just noticed
Every model does that lmao
Chatgpt hates you bro
no but gpt 5 does it better
gpt 5 gave me 80k tokens response
I wish I understood coding
im not an OpenAI bootlicker guys
you wish
Google will be this year's winner
yeah, because I don't code. Would be really nice to learn though
That's funny
do scripts and such
Google started so much behind
I think grok 5
will we get gpt 5 high on the arena
i dont see mini on the leaderboard 🙁
when gemini 3
yes, best option is to learn the basics so you can generate with ai and then edit things
November-december prolly
soon, dw
Ok
Bro genie 3 will generate so much good synth data
yea
gpt 5 mini? theres no 5 mini or nano in lmarena yet
i made an entire Roblox game with deepseek and gemini lol
oh btw did yall know genie 3 is on aistudio
But isn't it only for trusted testers?
where
oh nice they added it
where
internal only yeah
dont ask me how i know
is there gpt 5-low
share
no
nuh uh
😨 dont snitch
i love google
me2
Gemini 3 will be my favorite model
Gemini is so uncensored bro
There's no censorship at all if u disable it
and free
good, i'll erase you from the list
||jk||
?
Free ?
Congrats to openAI, their nano and mini models are getting close to Qwen now
2.5 Flash in Gemini is crazy free

yeah
lmao
especially for file uploads
did yall even notice gpt 5 has the biggest output tokens window
openai sucks
What was it?
I still don't understand how AI Studio is free
128k compared to gemini's 65k
holy damn
yup
thats nice
Opus has 100k too no?
Let's keep it that way 🙂
we are paying them with our souls
You are used for data collection that's why
like all free users of it
take eveerything idc
Google has advanced TPU technology
Makes inference much cheaper
still
output?
OpenAI needs some of those
their gpus melt too much
personal fbi agent
(thanks that i am from Russia)
Me a Finn watching from next door.
do you know "We, Belarusians, are peaceful people, devoted with our hearts to our native land"
no
When human data becomes irrelevant for models, they gonna cut it
Yeah bcs you need to ask for it
Openai letting it be available for everyone is cool
how to make prompt for arena battle
ok
Why does the model identify itself as GPT-4 on lmarena, while on the official website it identifies as GPT-5?
Just access the site
keyboard
gpt 5 flopped?
ok
Yes
different system prompt, and gpt 5 introduced itself as 4o to me tho
JohnPork
no its great
for me it still didn't drop on my computer
i have changed my opinion
LOL I remember u were saying how good the model is gonna be and now ur saying this
still gemini is better
right
I just checked, it technically has 152k tokens output but it actually uses the overall context window of 200k (not just output) so you can't send 200k input and receive 152k output
who needs style control
because it is style
🤣🤣🤣
nice joke bro
can you teach me
I was hoping they would release it today, right after or before gpt 5
learn vodka
dont hit the wall bro
it hurts
i heard google has an advantage from their custom tpu processors or something like that
@deep adder is good at ragebaiting ngl he's funny LMAOO
google deepmind is cracked
@stray aspen vro
whatsup
I'll stick with English thx
One trick pony
yes i hate the UI
but it does remember the messages for me
vodka useful everywhere
Perkele.
what companies
persse
hey @deep adder
Diddy parties
So I analysed a pdf with gemini 2.5 pro then I selected o3 pro and asked a follow up question but it doesn't remember anything
Wtf
it is estonian
use ai studio bro 💀
if the same meaning
What I wanted was is that
@deep adder whats the release date of gpt - 6
Openai o3 pro answer that specific question from the pdf
We don't mind your best guesses
Will you happily give them to us?
I thought you replied to @deep adder whats the release date of gpt - 6
GPT-5 is so good
@stray aspen what do I do so that it remembers my messages
i dont know
i dont use that website
??
You must be high
Or you are talking about o3-low or smth 🤔
im high on truth
I'm more curious how GPT-OSS-20B fares considering I should be able to run it on my own computer
Anyone give it a shot?
i think it better than gemini 2.5 pro but tbh google prob gonna release something better soon
Never buy weed from gas stations
yes
hard
context
?
gpt-5 has 8k tokens for free users ;(
I've been able to make Gemma-3 generate whatever I want by simply editing its output
Perhaps the same trick would work with GPT-OSS
i like google (:
I see, thanks
Context is not performance metric.
And 400k is more than enough for 99% cases. But for that 1% I'm gonna use aistudio yeah
now let's talk about actual output performance
I think chatgpt uses their context more effectively
Updated usage limits
This is hilarious lol
is gpt-5 better than 2.5 at coding though
yes
yes
Because China. People fear China for some reason
I have used Kimi K2 happily
According to the arena yes
is lmarena somehow related to openAI
No.
any Brazilians around here?
me
dbas?
I'm Québécois
It is on the same Pareto frontier as Qwen3 14B and the new Qwen3 30B MoE for performance per parameter.
what
i mean, if it has far less hallucinations yes
isnt deep think worse than normal in many cases
should i bet on polymarket
We can't see that
biggest upgrade is the hallucinations. Everything else is a bit better. I'm pretty happy
much better for now
Other than the fact that I don't have access
"What's half a dozen?" "3"
Worst defence for a llm model 🤣
this model is supposed to be good at languages
they yapped about that in the livestream
Yeah ofc I'm based in France
he is american he doesnt know geography
where are you
it could depend on the situation. For example, if i give it some long text, a long video, or a long book and ask something (especially useful for learning something or if you're a student), gemini is crazy good while all other llms lose their minds.
but i must say, right now gpt 5 doing better outputs than gemini 2.5 pro
lmao
Is this real?
oh, thanks!! i didnt even have to wait
you can look for yourself
Qwen3-235B just flew over my house
No one knows what style control does
Qwen's performance is demonstrably on par with top-tier models like GPT-5 in coding, likely due to its specialized training on massive code datasets, sophisticated architecture (including MoE)
You are likely going to see a lot of very varied results posted online from GPT-5 because it is actually multiple models, some of which are very good and some of which are meh.
Since the underlying model selection isn’t transparent, expect confusion.
What leaderboard is that
this one
Why does that not show on the normal lmarena leaderboard
The ranking specifically mentions "Coding" at the top of the table.
why??!
Maybe replace it with Copilot
Why not just find out by accident! ¬_¬
Exactly lol
gpt 5 is asi
"This leaderboard shows what are the best LLMs for writing and editing code (released after April 2024). "
Wouldn't expect it to be THIS good. It's doing worse than o4-mini-medium on most things people test it on
i can def say gpt-5 is the SOTA
it is officially SoTA i guess
based on my tests
In the API, all GPT‑5 models can accept a maximum of 272,000 input tokens and emit a maximum of 128,000 reasoning & output tokens, for a total context length of 400,000 tokens.
https://openai.com/index/introducing-gpt-5-for-developers/
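The arithmetic behind those limits, as a small sketch. The three numbers come from the quoted announcement; the helper function is a hypothetical illustration, not part of any SDK.

```python
MAX_INPUT_TOKENS = 272_000   # from the quoted announcement
MAX_OUTPUT_TOKENS = 128_000  # reasoning + output tokens, same source
TOTAL_CONTEXT = MAX_INPUT_TOKENS + MAX_OUTPUT_TOKENS


def fits_limits(input_tokens: int, requested_output_tokens: int) -> bool:
    """Hypothetical helper: does a request stay inside both quoted caps?"""
    return (
        0 <= input_tokens <= MAX_INPUT_TOKENS
        and 0 <= requested_output_tokens <= MAX_OUTPUT_TOKENS
    )


print(TOTAL_CONTEXT)
print(fits_limits(272_000, 128_000))
print(fits_limits(300_000, 1_000))
```

So the 400k figure is the sum of two separate caps: a prompt above 272k tokens is rejected by this check even if very little output is requested.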
They just made up that 1M figure huh
It is a statistical method that adjusts a model's score by removing the influence of "style" features like response length, emoji count and markdown usage to reveal its true performance based on the quality of its content... not just how it looks.
whats gpt 5 high
For example a model that consistently produces long, friendly responses with lots of emojis might get a lower score after style control is applied if the underlying content isn't actually better than a more concise, factual response from another model.
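A toy illustration of the idea, not the leaderboard's actual implementation. It assumes a logistic (Bradley-Terry style) model where the chance of winning a vote depends on a content-strength gap plus a weighted style gap. Every number here (strengths, style gap, weight) is a hypothetical placeholder; in a real fit the weight would be estimated from the votes.

```python
import math


def win_probability(
    strength_a: float,
    strength_b: float,
    style_diff: float = 0.0,
    style_weight: float = 0.0,
) -> float:
    """P(A beats B) under a logistic model with one style covariate."""
    logit = (strength_a - strength_b) + style_weight * style_diff
    return 1.0 / (1.0 + math.exp(-logit))


# Two models with identical content strength; A writes longer answers.
STYLE_WEIGHT = 0.5  # hypothetical: how much voters reward length
LENGTH_GAP = 1.0    # hypothetical normalized length difference (A minus B)

raw = win_probability(1.0, 1.0, LENGTH_GAP, STYLE_WEIGHT)
controlled = win_probability(1.0, 1.0)  # style term removed
print(round(raw, 4), controlled)
```

With the style term included, A looks like the stronger model purely because of length. Once that term is accounted for separately, the two come out even, which is why a verbose model can drop after style control is applied.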
Because gpt5 scored the top spot?
lmao
Well they did fix them. You gotta give them some slack too I don't think anyone is doing as much testing and this fast like they do
Guys is there a way to call AI bot on discord server? Chatgpt or sth ?
REAL
how does this high medium low minimal thing work
yeah so that still not fixed yet I suppose
it's good. They finally fixed that flaw. They made a good move with sticking out with gpt4o model size... Training progress caught up to that size now lol
Meanwhile Anthropic went for short term gain with Opus but are now stuck with it
4.1 and gpt5 released only days apart
but performance difference is huge
Removing style control, it seems like GPT-5 is still in first place, and second is GEMINI 2.5 PRO, which is weird.
gpt-5 for now, but not for long
idk
it is not weird. That model is more than likely the most capable now, even if we take away the entire formatting and style
3rd and 4th are both from CHINESE COMPANY!
chatgpt or claude?
me too (:
chatgpt-latest tanks though - that is kinda weird.
can somebody help me (:
Gpt 5 is great in my tests xd
Style control should bring 4o-latest down, not help it... yeah weird lol
Testing ChatGPT-5 and comparing it to ChatGPT 4o and other older models. This is a pretty substantial step up.
I spend a LOT of time trying to make my videos as concise, polished and useful as possible for you - if you would like to support me on that mission then consider subscribing to the channel - you'd make my day 😁
For my tech hot t...
I think it's reasonable to claim GPT-5 is SotA. It's just not SotA enough
not rlly big tbh
why do you say?
Ultimately, China will prevail in the end.
How do we know if GPT5 is better with such small sample?
do you guys have access to gpt-5?
updated 2.5 pro 🔥
Qwen 3 Plus will beat this Gpt 5?
i do
for their domestic use, yeah. I don't see why not. They collect tons of data per year, plus massive population.
But I like the analytical side of the 2.5 pro
new model always win, same with gpt-5
I wanted to say that, but was worried about getting bullied
I said for their own domestic use, yeah
lol
Not rn ofc. Give it 10 years.
Even LG launched the best model once; it's always whoever launches last
Are they using new image model for GPT-5 or still the old one?
wdym
not today but some day
Damn, they lied to me
exaone is garbage
for coding, it's ~ the same right? With GPT5 significantly cheaper
wdym
And then Anthropic has big updates coming soon
gpt-5 doesn't suck, but they just hype it too much
What have you tested with?
got destroyed 💪
Is Opus 4.1 better than it? Or is it too close to call a winner?
2.5 pro got updated?
Gemini 3 is about to come
it hasnt updated yet, soon to be tho
meant to post this actually
btw remove style control 🤣
webdev is more interesting to me tbh
2.5 pro is still on top somehow
gpt-5 vision is amazing
bro did gemini 2.5 pro even get an update? if it didn't then it's pretty reasonable for gpt-5 to beat it, as i said before, newer models win (:
its better than gemini
"Nothing is permanent except change." - Heraclitus
default is not that though. Why would you remove it?
Indeed I told him to find flaws with the poster I designed and I was in awe of the results.
We're all forgetting that the language, Chinese, is an extremely high context and figurative one. You need native Chinese to develop an AI that handles domestic uses. Country's authoritarian af so it already collects tons of data and we know people who are close to the 'party' get special privileges that are unimaginable in the West. So is it possible that Alibaba or whatever develop a product that exceeds Western models when it comes to unique contextual questions that are asked by everyday Chinese people on Chinese issues? Yeah. I don't see why not?
im just saying its curious and i think one of the polymarket things used that as the criteria lol
ye ik (:
i thought u were talking abt something else
wow i didn't see this 💀
why did google go up tho
Genie
overall yeah
Gpt 5 can still be updated based on user feedback
We need more samples on GPT5, but ngl I'm a bit nervous for Gemini.
Yo, guys was the horizon models related to GPT-5 or should we expect better models 😉
That's great news for me bc I have bets on Google.
Or the elo can change. 3k votes is not definitive yet
Yeah, for example in the West we have Cici, an AI created in China whose real name is Doubao and which is built for Chinese users, and Doubao has models that are better than the Western app
still don't have gpt5 do i need to do something
waiting for gpt 5.5 (:
i got gpt-5, u can tell me the prompt if u want
only 9 points behind with style control disabled
But they keep a million worth a day to gpt 4.5 compete yet
im not sure if gpt 5 can beat the updated 2.5 models tho
Google AI team is literally run by a Nobel Prize winner, ofc they gonna win, it's a no-brainer
it official that 2.5 gonna update/
guys does video arena have veo 3?
fr i have no idea lol
yup
TAF
oh okay, thanks
No, it's not
Gemini, besides being the best, still costs only cents, about like Mistral
wait so is 2.5 getting an update?
when is 2.5 getting an update
w same
are they even gonna update it?
yes lol
it official or js guessing
they said they will do an update that allows us to disable thinking
on pro
so there's clearly an update left
well Google hasn't really shown anything better than 2.5Pro yet. That model with 10rpd and no API + insane cost with no hope of being on lmsys... does not count lol
i just woke up, so far i am reading that gpt 5 is underwhelming? is it true?
oh
yea they hype it up too much but ig is doesn't suck, it good but not as good as they said it gonna be
Gpt 5 is Gpt 4o 2, nothing that astonishing
I suppose wollfstride, but we don't know how it REALLY performs other than some limited impressions
Nahhh, OpenAI did not overhype gpt 5
that may have been misleading
whatever model they've been a/b testing on aistudio (they seem to have ramped it up in the past few days) seems to be really good idk
They did Gpt 4.5, but not Gpt 5
Google DeepMind's CEO is a Nobel Prize winner in Chemistry. Not only that, he was literally a child prodigy. With the Google brand on the table and the chance to work under a Nobel Prize winner, they wouldn't face any problems hunting for AI talent like Zuckerberg or Musk do. Even the Godfather of AI used to work at Google. People who disagree with these facts are not rational to begin with.
:0
with that logic, you can just pay the chatgpt sub tho
WHY DOESN'T THE UPLOAD BUTTON WORK
It was avail to Plus subs. And even on openrouter briefly I think before they started requiring your key (or was it o1..?)
ima try on mine rq
Does it work for you?
4.5 was supposed to be 5. It didn't work out as a leap worthy of the 5 branding, so they put this out with tooling and a lot of other clever implementations. Kinda wish they had just waited for a leap worthy of 5, but they need to be constantly fundraising so it does make some sense
dont think so
which is better for code
4
6
1
gpt-5
GPT-5 is good model
👻 👻 Qwen 3 Plus 👻 👻
If you look at the previous gpt (gpt4o) and o3 as a separate line... This is a very big jump. Their mistake was silently updating 4o-latest to gpt4.1 without changing the name
I personally think it's better than Grok 4 because it's literally free
dom idk if u missed this btw they also updated the base model again for gpt-5
That's at least half a year away
ya
In my opinion, with the release of GPT-5 they are working on the UI of the app, because that button was also for thinking mode, deep research...
4o (oct 2023) => 4.1 (june 2024) => gpt-5 (oct 2024)
Or they will soon release a GPT-5 Ultra but named GPT-5.5
well yeah obviously they did. This wouldn't have worked with o3 base model. And non-thinking chat model is considerably reworked comparing with gpt4.1
Preview in September. GA before year end
Time will tell...
:0
do all of you have access to gpt-5?
u can access gpt 5 via lmarena direct chat
I don't even have GPT installed on my phone
changed color (:
if it hasnt rolled out to you
GPT-5 mini is good as well
Yeah really cool ahaha
I finally got GPT 5 as well
it seems different from one on LMArena
you could pay $20 and use it a ton. That's accessible I would say
in formatting
not $200 and 10rpd
Is GPT-5 mini 90% of the quality, like it was with GPT-4.1 mini?
Is better or not?
and for API it's tier3. Not tier1 but not tier5 either
would you pay 200$ for gpt-5 pro? personally nah
I need to test the model a bit more to draw a conclusion
i would rather use gemini 2.5 flash than gpt 5 mini tbh
with new Gemini no tier will give you API lol
There was a GPT 4.1 mini model?
yes
imagine if google drops gemini 3 tomorrow 🤯
Interesting
not worth it. Minimal gains. But I said the same about o3-pro
Literally named after it
Probably really soon
Hell nahhhh
I personally hated the 4.1 series
It was boring...
dang fr? abt the same?
They were a worse version of other models
hopefully
But gpt
lol
Yeah, only good (in OpenAI's words) at coding
I mean the gains from regular to pro are not spectacular in both cases.
but tbh i kinda expect gpt-5 to be much better at coding than 4.1 opus 🙁
oh
but the price is 10x 😭
It is
yeah that's why it's not worth it lol
Quick question, does LMArena have message limits for any of the models? I can't find any info about this online.
Yeah
Guys, I read that ChatGPT released advanced voice mode for everyone
Gpt 5 medium is better than opus for coding
GPT-5 and Claude Opus 4.1 audience is totally different. If they want to beat them it's not that hard.
hopefully google discovers it soon (:
There are no limits I think
On my tests
oh
But opus is as good as gpt 5 without thinking
So in theory, if direct conversations with individual models are saved in your browser, then using this makes any paid subscription look like a scam?
Guys i can't send a message to gpt 5, is lmarena down?
For text at least
Gpt 5 without thinking is a gpt 4o
Anthropic makes most of its money from developers, whereas OpenAI is used by everyone
The chats disappear after 1 week or 1 day
can u get a profile picture
which method is gemini using? Do you see GPT5 overtaking Gemini on the leaderboard this month?
Literally Top 5 most visited site in the world
put on a profile picture now*
ask pineaple
What about for overall?
I mean, I need to calculate costs, bcs it thinks a lot
i mean other companies prob gonna react to gpt-5 and lock in
GPT5 is multimodal right?
Refresh the message again and again, sometimes it works sometimes it doesn't.
Webdev
Why can't I get it to accept video or audio then. Strange
Only 3k votes
It keeps telling me it's unsupported
gemini only has 7k
gpt-5 is def better but not gonna be for too long
LMArena at least doesn't support that. I have posted a suggestion about it. If you meant on the app then... Hmm.
Someone else plz test
Still more than any model in the list, no?
@deep adder what's your opinion on overall ranking?
Ain't working
I use video summary
Btw there is no Claude 4.1 somehow
grok 5 v.s gpt 6 v.s gemini 3.5, whos winning?
You forgot about deepseek R2
Try later maybe. It happened to me as well, but now I can directly access GPT-5 on the official OpenAI website.
lol fr
It's gonna drop the market again when it releases
They said late April but it's been a while
I think Gemini
Honestly correct, Gemini has Google.com access
:00
So it can search easily and be trained
Yeah
All models have that
It's so no one searches for dangerous things in 2025
So they have to lock it
In 2026 they will change it to May 25, 2025
Idk
I tried audio summary of a Wikipedia article on Notebook LM but the audio file was corrupted even after two tries.