#general

1 messages · Page 217 of 1

astral blaze
#

gemini 3 looks good

spare rune
#

Gemini for the gui and gpt for backend

hoary elbow
#

Gemini 3.0

zealous sparrow
#

sure gemini 3 pro takes 18$ for output, but it doesnt make sh like OpenAI

astral blaze
#

im not trying your game though

devout vault
hoary elbow
#

Gemini 3.0 sucks when I use it, but how did you get that good of a result?

odd geyser
spare rune
rugged abyss
#

tbf id say Gemini, its only issue really is delta time I assume. At least i find it visually more appelaing

torn mantle
#

gemini 3 is unplayable

hoary elbow
#

So it’s deep think

torn mantle
#

i dont like gemini 3 version

#

although UI wise gemini 3 was better

zealous sparrow
grave plaza
#

guys is kat coder in?

hoary elbow
devout vault
#

asura is literally a gpt glazer

hoary elbow
#

Or are you using mobile?

#

Because Gemini needs a mouse to play it

rugged abyss
hoary elbow
#

Oh

#

So you’re saying it’s hard

#

Got it

zealous sparrow
#

from my tests

torn mantle
hoary elbow
# torn mantle yes

I thought you’re having glitches but you’re saying it’s hard. I got it.

sharp mirage
#

i think gemini better cuz it looks like game not gpt is looks like a modren page isnt a game

astral blaze
#

Also gemini 3 is quanted right now and 5.2 is full precision because it just released

#

(just speculating)

zealous sparrow
hoary elbow
#

GPT 5.2 looks more like a website other than a game

zealous sparrow
#

are we deada-

devout vault
#

GPT been releasing the worst models ever

compact sleet
#

I have no idea how to attack on the gpt one lol fun game though, both of them

golden ocean
#

openai is done for

compact sleet
#

Gemini 3 could attack at least with space

torn mantle
modest prism
#

Have we finally achieved AGI with gpt-5.2 ?

devout vault
#

gemini 3 pro is literally in preview rn still

#

gemini 3 pro will dominate

torn mantle
#

one thing they've been talking about in this new model is the world knowledge

devout vault
#

gpt-5.2

torn mantle
#

it has a better understanding of the world

devout vault
hoary elbow
#

No, AGI hasn’t been achieved yet. It would’ve been big news.

devout vault
#

😭

#

ASURA STOP GLAZING CHATGPT

hoary elbow
#

But it’s a good AI

spare rune
rugged abyss
#

I think Asura is ragebaiting

hoary elbow
#

But Gemini three pro is better

astral blaze
zealous sparrow
#

google stock is down 2% but just you wait till they learn what gpt 5.2 really is

spare rune
#

And everyone Gemini keeps making a new stealth model it keeps getting worse

astral blaze
#

it's terrible at world knowledge

grave plaza
#

GPT 5.2 IS GOOOOOODDDDD

torn mantle
#

check their blog

#
  • that new benchmark just proves it further
astral blaze
empty stump
#

i wonder how it does on geobench

zealous sparrow
#

google stock is coming back up

#

OpenAi flopped ggs

modest prism
#

Should I cancel my Gemini pro?

torn mantle
hoary elbow
#

Are there any text 5.2 users in here? If so, how good is it with text

astral blaze
rugged abyss
fleet lintel
#

what is HLE score?

hoary elbow
#

Do you want to use VEO3?

unborn ocean
hoary elbow
#

If so, then you can go on flow

zealous sparrow
empty stump
#

i need gpt 4.5 size model

unborn ocean
fleet lintel
# unborn ocean

that's a good question... I think its not on a new base... but not sure

cloud zinc
#

can u access xhigh through app?

hoary elbow
#

I’m gonna tell 5.2 to stop making website you guys when I try making a game

heavy smelt
#

Why are the chances of the video arena using veo,kling or hailuo always much higher than other models? Like Is it because these are like the top models so the randomizer prefers them over the others or something? Like I've not seen a ray-3 video generated in the arena for example.

torn mantle
#

oh

zealous sparrow
#

no the style is the same ass as before

torn mantle
#

need to ask it about glassmorphism

spare rune
#

Just changed the color blue to other colors

cloud zinc
spare rune
#

The gui layout is exactly the same

torn mantle
torn mantle
rugged abyss
cloud zinc
zealous sparrow
astral blaze
neon idol
#

Is gpt 5.2 flop or slop?

spare rune
#

Apple should hire gpt

zealous sparrow
spare rune
rugged abyss
hoary elbow
#

I believe human’s last exam Gemini three was better than it before it got quantized

torn mantle
#

depends on your use case

sharp mirage
#

guys what did i mised ?

spare rune
#

I think gpt5.2 is thinking infinitely again

zealous sparrow
#

because they are already quantized to the max

astral blaze
neon idol
#

Worse or better than Gemini 3.0?

astral blaze
#

With this they have proven themselves to be not capable of beating google

torn mantle
astral blaze
#

or actually using the moedl

rugged abyss
#

So now that we bashed GPT 5.2
Are you all excited for Gemini 3 Flash when it comes out

sharp mirage
torn mantle
spare rune
#

“Game idea: Wind-Up City (Co-op Survival Maintenance)
Premise:
Players are tiny “clockwork caretakers” liv”

torn mantle
#

try it

spare rune
#

Yeah no

#

I’m gonna cut it off

#

Ai slop

rugged abyss
modest prism
#

The knowledge cutoff is august 2025. They've done a completely new pre training

astral blaze
#

lol

#

It just uses the same UI for anything

sharp mirage
astral blaze
#

let me pull up the one I did with gemini 3

zealous sparrow
sharp mirage
#

yea

#

i alr send it

#

but

torn mantle
#

WAIT

spare rune
#

and arabic is spelled backwards

#

haha

#

get it

torn mantle
#

what happens if we use claude frontend prompt for fixing UI slop

zealous sparrow
astral blaze
#

This is gemini 3

#

it has actual 3d

#

oh it also has procedual audio

spare rune
#

Lowk wait for gpt 6.0 xpro

astral blaze
#

wow very cool GPT 5.2 is this the best you can come up with

fleet lintel
#

i reluctantly opened my office account to test Gpt 5.2 ...
I am sorry, i am not switching from gemini 3.0 pro

zealous sparrow
torn mantle
stray aspen
#

It

rich panther
#

why does the LMArena website say that there is a Gemini 3 Pro when it is actually a Gemini 1.5?

astral blaze
#

Gemini 3 can also do a 2d one

#

and it's vastly superior

astral blaze
stray aspen
astral blaze
stray aspen
sharp mirage
#

studio

spare rune
#

AI slop detected

astral blaze
#

Even gemini 2.5 is better

#

LOL

zealous sparrow
#

that is a year old

#

skill issue

modest prism
#

Can gpt 5.2 generate image too?

zealous sparrow
#

close to a year old

zealous sparrow
#

OpenAi is working

#

on gpt image 2

bleak lake
#

How is 5.2?

astral blaze
rich panther
unborn ocean
#

and that is likely what happened

zealous sparrow
modest prism
#

It says it can output images in API page

astral blaze
zealous sparrow
proud bobcat
#

gpt 5.2 is benchmaxxed

#

how much you wanna bet

zealous sparrow
stray aspen
astral blaze
spare rune
#

sans.. is that you..

warm zodiac
zealous sparrow
spare rune
#

idk

zealous sparrow
spare rune
#

instead of making it by code maybe it wanted to make it by text

proud bobcat
compact sleet
#

People die when they are killed moment

echo aurora
# rich panther then why does he think it's 2024?

say that there is a Gemini 3 Pro when it is actually a Gemini 1.5
This is likely a hallucination. When models are being trained they're unable to perceive it's own existence, as it doesn't yet exist.

vivid coral
#

Where is 5.2 gpt in search arena? Pleeeese 🙏🙏

astral blaze
#

which one is better again

proud bobcat
#

these benchmarks are so bull

zealous sparrow
proud bobcat
#

no way it had a leap of over 30% on arc agi in just one submodel update

astral blaze
#

The problem with these benchmaxxed models is that they are literally benchmaxxed and is terrible at everything else like actually using the model

zealous sparrow
stray aspen
astral blaze
stray aspen
#

It's gemini 3 pro

#

Stop worrying about that

modest prism
#

If gpt 5.2 was actually that good they would name it gpt 6 or at least gpt 5.5

zealous sparrow
#

a huge + of gemini 3 pro is that it knows when you joke

astral blaze
#

Copium.jpg

zealous sparrow
#

and knows it a lot

proud bobcat
#

oh wow... no way...

zealous sparrow
#

it also knowns references

proud bobcat
#

its peak...

astral blaze
proud bobcat
#

im not a gemini fan

#

and also

#

there is no way this one model update jumped that high

stray aspen
#

Nice bullmarks

proud bobcat
#

if gpt 5.2 is peak i will switch to it

zealous sparrow
unborn ocean
#

ngl it really feels like they just wrote down the things that gemini 3 was outperforming them at and (tired to fix) them:

world knowledge (supposedly)
front-end (somewhat)
science tasks
competition-level math
GDPval (anthropic outperformed them here)
ARC-agi 2
vision tasks
and some more algorithmic coding benches

echo aurora
astral blaze
#

Gemini 3 is funny

unborn ocean
#

all things that gemini was leading in compared to openai

zealous sparrow
#

we need a new benchmark

#

Comedy and story benchmark

stray aspen
unborn ocean
#

ye

#

i think i read a paper about that at some point

astral blaze
#

I think only 4.5 ever had a chance of beating gemini but we all know what happened there

proud bobcat
#

prose benchmark

unborn ocean
astral blaze
cloud zinc
#

image model next week

stray aspen
#

It better be greater than nano banana 2

cloud zinc
cloud zinc
unborn ocean
#

well the benchmarks it had the largest gains in are literally build by openai (gdpeval and mrcr), so it is not that impressive

astral blaze
#

honestly I don't know how an OAI image model can beat google

#

especially when google obviously have every photo ever taken on their servers

unborn ocean
#

but they are clearly focusing on it + plus they had it for longer than the other labs

astral blaze
#

it's impossible to beat their dataset

cloud zinc
cloud zinc
#

gemini 3 flash image is coming next week

sharp mirage
#

@zealous sparrow did you get any good game with this prompt ?

cloud zinc
#

which will be way faster than any openai models

zealous sparrow
astral blaze
#

IT'S OVER FOR GEMINI 3 OPENAI HAS THE AGI

sharp mirage
#

Oaky

#

N0

#

Np

zealous sparrow
sharp mirage
#

at least better than mine :/

cloud zinc
#

adult mode delayed

compact flame
#

Damn gpt 5.2 is out?

zealous sparrow
compact flame
#

Is it good?

astral blaze
#

they can just rerelease gpt-3 before they censored it and call it a day

#

honestly

zealous sparrow
zealous sparrow
astral blaze
zealous sparrow
compact flame
zealous sparrow
#

i forgot

zealous sparrow
compact flame
astral blaze
zealous sparrow
#

but playground autocomplete did

#

which probably ran gpt 2

astral blaze
compact flame
#

So how good is gpt 5.2?

zealous sparrow
astral blaze
modest prism
#

Can someone test this prompt? The correct answer is 96:

1,7,18,45,?

compact flame
sick sky
#

They're rushing to the release making the same mistake Google made at the start

astral blaze
#

MODS

#

MOOOOODS

#

<@&1349916362595635286>

compact flame
echo aurora
astral blaze
compact flame
sharp mirage
#

mooooooooooooods

#

wait i have to meow

#

i am cat

#

alr

gusty helm
#

god damn it larry

sharp mirage
#

MMMMMMMMEEEEEEEEEEEOOOOOOOOOOWWWWWW 🐱

gusty helm
#

no snackies

sharp mirage
#

i will get you james

#

james is alywas here

#

but you cant see it

#

😱

sharp mirage
#

alr thats so damatic

astral blaze
modest prism
echo aurora
astral blaze
astral blaze
compact flame
#

I checked chatgpt website but there no 5.2

#

I got scammed by the official chatgpt bruh

astral blaze
#

Did they already unrelease the model

#

it would be the funniest timeline

proud bobcat
modest prism
# astral blaze why is it 96 though

Sequence: 1, 7, 18, 45

Differences: 6, 11, 27

Pattern in differences:
6 = 2² + 2
11 = 3² + 2
27 = 5² + 2

Squared numbers (2, 3, 5) are primes. Next prime is 7.

Next difference: 7² + 2 = 51

Next term: 45 + 51 = 96

Answer: 96

proud bobcat
#

theyre doomed

compact flame
#

I gotta give gpt 5.2 the basic finger test

#

If it fails definitely rushed model

compact flame
#

Nvm you can't even send images to gpt 5.2

#

They be hiding chatgpt failing basic finger test

modest prism
proud bobcat
#

"sota"

spare rune
astral blaze
#

maybe I am just a brainlet

#

I will go back to benchmarking comedy

cloud zinc
compact flame
#

Where's chatgpt 5.2 on the official website damn there only 5.1

astral blaze
# astral blaze

it suddenly got it right maybe they did pull the high precision models back out because 5.2 came out lol

proud bobcat
modest prism
compact flame
proud bobcat
#

this model is defo benchmaxxed

#

even deepseek can do this p sure

astral blaze
#

at least gemini 3 has some genuinely vast world knowledge

cloud zinc
compact flame
#

Gpt 5.2 is def buns

modest prism
# cloud zinc

"Assuming the 3rd difference stays 11"

Lol math doesn't work based on assuming.

cloud zinc
proud bobcat
#

gpt 5.2 mid

spare rune
#

I have no faith in gpt

weary galleon
compact flame
#

Def worse

glass arch
#

is 5.2 better than 3 pro for now

compact flame
#

Very rushed model

astral blaze
#

We need a sneedbench

modest prism
compact flame
#

Atp it's better to buy Gemini ultra instead of gpt pro

astral blaze
#

Oh wow GPT 5.2 actually got this wrong

proud bobcat
#

gemini is goated

astral blaze
#

LMAO

compact flame
#

Google is like cooking rn

proud bobcat
#

deepseek, claude, and gemini are the goats

#

(truth nuke)

astral blaze
#

I literally wasn't expecting this, I think even gpt 5 got this right

#

or 4

#

or 3 for that matter

compact flame
#

Or who owns Gemini I forgot

proud bobcat
compact flame
#

Oh alr

#

Well makes sense google is larger than chatgpt

cloud zinc
compact flame
#

They spend more money on gemini

astral blaze
#

Why it’s funny: the sign is comically specific and redundant—if it’s “Sneed’s,” why do you still need “formerly Chuck’s”? It reads like a small-town business trying too hard to keep old customers.

The joke is a subtle, dirty pun based on the rhyming scheme established by the current owner's name.

Current Owner: Sneed sells Feed and Seed.
Former Owner: Chuck (implies he sold F___ and S___).

sour spindle
#

GPT models always work worse on lmarena than when I use them on app/website for me.

proud bobcat
modest prism
#

Still the goat 🐐

astral blaze
#

are you telling me gpt 5.2 got a pun wrong

#

like, a language based thing

compact flame
proud bobcat
#

gemini uses high quality data

astral blaze
#

it got it wrong?

proud bobcat
#

gpt is bloated

astral blaze
#

what's the full name for LLM again

#

large... something

proud bobcat
#

large language model

astral blaze
#

I forgor

astral blaze
cloud zinc
compact flame
proud bobcat
#

"Extra High" im DYING

spare rune
# cloud zinc

It’s so sad it has to be extra high while Gemini flash can do it in 3 seconds

limber crag
#

What are the benchmarks

astral blaze
#

gemini remains undefeated

polar niche
#

Is it good

spare rune
#

Gpt?

weary galleon
#

This guy 👉@sour spindle voted "big improvement" in my poll🤣

cloud zinc
astral blaze
sleek crow
astral blaze
#

I'M DYING

#

AAAAAAAAAAAAAA

#

SAM ALTMAN YOU DID IT AGAIN

compact flame
#

Gpt x high? That even exists?

proud bobcat
modest prism
proud bobcat
#

EXTRA HIGH NEEDED TO SOLVE A MATH PROBLEM THAT FLASH CAN DO IN 3 SECS

spare rune
proud bobcat
#

sam altman lock IN

astral blaze
#

I hear coping

proud bobcat
#

it was a new olympian problem from what i know

cloud zinc
#

gemini flash wrong

compact flame
#

What Extra high means anyways

astral blaze
#

fax

compact flame
ornate mesa
#

Gpt 5.2 will be better than gemini 3 if i don't care about thinkung time ?

cloud zinc
#

it will take longer, but it is better

compact flame
#

What is even pro medium how to access that model bruh

cloud zinc
astral blaze
#

GPT 5.2 PRO MAX ULTRA XL

compact flame
#

There no like switch or smth

void elm
#

with which proofs

weary galleon
modest prism
# cloud zinc

This question does not require insane math knowledge. It just needs a little bit of creativity. LLMs suck at creativity.

rich panther
astral blaze
#

LLMs don't have self awareness

#

asking it what version it is will never be right

#

I am still amazed at how much gemini actually knows

#

you look at 5.2 and it's just sad

weary galleon
#

X X X X X HIGH ULTRA PLUS

astral blaze
#

and no I don't care about coding stop asking

#

🤓

proud bobcat
astral blaze
#

Like, this reference is actually so niche

#

I didn't think it could get it

proud bobcat
#

it used to be so fun

astral blaze
#

but I guess google could just use all of their index as datasets so it's bound to be better

astral blaze
#

now gemini is the UNDISPUTED KING

#

depending on what you use it for I guess

#

But I firmly believe that a large language model should be good at language first and formost

grave plaza
#

i send a 800 lines Code and it just show me a error 🥲

zealous sparrow
#

June 2025 meme btw

astral blaze
#

Bros I love gemini 3...

unborn ocean
#

ok so guys from my experience the "larger world knowledge" from gpt5.2 is just non existent

astral blaze
#

look at gemini 3

proud bobcat
unborn ocean
#

like even 2.5 pro smokes it on some (but not all) world knowledge questions

astral blaze
proud bobcat
#

it is

#

rightfully so

astral blaze
#

but you guys are free to subscribe to gpt-5.2, more compute for me

unborn ocean
#

gpt5.2 is recent cpt + maybe less agressive quant, but still fundamentally less world knowledge than the others

astral blaze
proud bobcat
#

REAL

astral blaze
#

NOW WITNESS GPT 5.2

proud bobcat
#

5.2 on its way to be the most mid release ever

astral blaze
proud bobcat
proud bobcat
#

technically

keen beacon
#

Gemini is boring

proud bobcat
#

already

#

nah gemini is peak

keen beacon
#

No flavor

astral blaze
#

actually gemini is more right because asia northeast 1 b is actually in tokyo

keen beacon
#

No taste

unborn ocean
#

nah gpt5.1 release was more mid

astral blaze
#

so gemini 3 have the right stuff

unborn ocean
#

gpt5.2 is still a pleasant surprise, expected nothing, got something

weary galleon
astral blaze
celest orchid
#

gpt 5.2 is disappointment

astral blaze
#

but gpt 5.2 is not better

#

lol

celest orchid
astral blaze
#

now this

#

this is funny

weary galleon
proud bobcat
#

opus peak

weary galleon
#

1,7,18,45,? Right answer is 99 or 96?

astral blaze
ivory quest
#

prompt = "A high-quality 10-second shot of a futuristic robot confidently walking through a neon-lit cyber-city street at night."

astral blaze
#

but I personally hate anthropic so I am biased

queen veldt
weary galleon
queen veldt
#

I think they're the most hardworking company for coding

astral blaze
queen veldt
#

They're honest "yes you can use our models but there's limits"

#

They don't have as much money as google and openai imo

astral blaze
#

I think sometimes people forget that LLMs are supposed to be a chatbot

weary galleon
astral blaze
#

but uhhh whatever

#

I just grade models depending on if they are funny

weary galleon
#

1,7,18,45,? Right answer is 99 or 96?

queen veldt
#

Well most of money comes from API's so programmers are burning the API and making the company 🤑

astral blaze
weary galleon
queen veldt
weary galleon
cloud zinc
#

96

queen veldt
#

Look at this cost of claude opus on MAX mode (max context windows basically using as much code as possible)

keen beacon
cloud zinc
keen beacon
#

Yeah just like Suno

keen beacon
#

Yup

queen veldt
#

Hahaha

keen beacon
#

Just like I said

queen veldt
#

They make deal with OAI and sue the Microsoft

#

They picked their side

cloud zinc
#

microsoft?

cloud zinc
#

u mean google?

queen veldt
#

Imagine if Netflix signs with Microsoft

astral blaze
#

if I'm google I'd just kill their SEO reach

keen beacon
#

Microsoft?

#

Open and Microsoft are partners

cloud zinc
#

microsoft?

queen veldt
#

Google*

astral blaze
#

also google has been literally scanning books and putting them on their servers FOR YEARS

blazing bison
#

gpt 5.2 is different

astral blaze
#

they never cared about legal CDs

limber crag
#

The dirty games have started

fair wadi
#

when could one expect the text-arena leaderboard to update for 5.2?

keen beacon
#

Dirty games been here lol

#

Get ready to pay 4.99 for 10 exclusive video gens lol

#

Micro transactions

astral blaze
#

honestly openAI doing this just means that every IP holder will be out for blood for them now

keen beacon
#

What?

#

lol

astral blaze
#

expect Japanese animuu makers knocking on their doors

compact sleet
#

Personalized movie will be a thing with Gen AI, it's far at the future, but I think... it's pretty much there.

keen beacon
#

The opposite means more gonna join

#

Look

compact sleet
#

Choose your own actor, scenario, etc etc, preferences and many other things.

astral blaze
zealous sparrow
astral blaze
#

if they pay disney, that means that every major IP holder's gonna want that too

zealous sparrow
#

after they signed a contract

keen beacon
#

Y’all are crazy

cloud zinc
keen beacon
#

Sora is just good

#

Google has nothing to offer

#

Veo 3 sucks lol

cloud zinc
#

wait for veo 4

keen beacon
#

Can’t wait

astral blaze
keen beacon
#

I hope it’s good

cloud zinc
#

veo 3 is 6 month old

zealous sparrow
#

and veo 3.1 is uh

keen beacon
#

So is Sora

zealous sparrow
#

old

astral blaze
zealous sparrow
keen beacon
#

Yes it is

cloud zinc
#

we are talking about release date

astral blaze
#

OpenAI gets IP usage and Disney gets their API calls

cloud zinc
#

release date is what matters

compact sleet
#

I remember that Google had that world model teased right?

#

what is it called again?

cloud zinc
#

google had veo 3 cooked up way before 6 month, anyone can play that game

rich panther
astral blaze
compact sleet
#

ah yeah, Genie

whole swallow
#

Is gpt 5.2 censored?

astral blaze
#

also crazy expensive

astral blaze
keen beacon
whole swallow
compact sleet
#

Judging by the uhh trailers of it, it's at Will Smith Spaghetti moment right now iirc

#

iunno in 3 years in the future

#

might be the actual winner

zealous sparrow
astral blaze
torn mantle
#

no but honestly gpt 5.2 high has the best translation ever

#

its so accurate and flows better

cloud zinc
#

research demo is not same as product

whole swallow
compact sleet
#

yeah thats why it's far in the future

whole swallow
#

Thanks bro

modest prism
#

Gemini 3 pro is free. Gpt 5.2 high won't be free

compact sleet
#

but the foundation is laid

fickle venture
#

Is gpt 5.2 good?

keen beacon
astral blaze
slim gorge
#

#2 and #6 is crazy for a "SOTA" model 😭

fickle venture
keen beacon
#

Idk

zealous sparrow
#

apparently this is not a lie
what in the f-

whole swallow
fickle venture
zealous sparrow
whole swallow
#

More than opus 4.5 is crazy work

keen beacon
#

Cuz open ai is a premium

fickle venture
#

OpenAi is just boring

slim gorge
#

they dont care about cost anymore they're just desperate to not fall behind the competition in benchmarks

whole swallow
#

OpenBalls

#

🤌 🤌

keen beacon
#

Crazy thing is people are willing to pay it

weary galleon
#

Gemini 3 Pro is completely DESTROYED!!!!🔥🔥🔥

zealous sparrow
#

nvm the cost is actually just 15$ apparently

#

according to ARC-AGI

fickle venture
#

And just waste it

keen beacon
#

Ui design is subjective

#

Not a metic

#

How to use this bot to create video

slim gorge
celest orchid
weary galleon
zealous sparrow
full tangle
#

Is there anyone knowledgeable about music? not just someone who listens, but someone who understands it? I want to send them a specific track and ask what a particular instrument in it is called

slim gorge
#

just ask ai

queen veldt
#

Oooh

fickle venture
fickle venture
queen veldt
#

I canceled subscription 1 month ago tho

weary galleon
queen veldt
#

Now they're trying to bring me back

astral blaze
keen beacon
weary galleon
#
poll_question_text

Will OpanAI drop GPT-5.2 today?

victor_answer_votes

5

total_votes

6

victor_answer_id

2

victor_answer_text

No

fickle venture
#

Dawg these 83% people

compact flame
fair wadi
#

Would Google's Deep Research Agent fall under a new entry for the LMArena?

#

Or is it the same because the underlying model is Gemini 3?

blazing bison
#

for general use i think that gpt 5.2 is the best model

#

bcs the instruction following of this model is the best of all

#

very aligned

compact flame
blazing bison
#

opus still the king

compact flame
#

Ok

blazing bison
#

but gpt 5.2 is not behind, it just slow

#

opus do in 30 seconds what gpt 5.2 takes 5 minutes

queen veldt
#

Making you inefficient

blazing bison
#

yeah

queen veldt
#

I remember i used to waste whole day to create 2 pages on my app

#

Then i found cursor and it's life-changing with claude

blazing bison
#

but the model has the capability

#

they just need to make it faster

queen veldt
#

I used to copy the whole code and paste into chatgpt and than copy what it gave me etc

#

The thinking time + the hussle of it doing the // Rest of the code goes here comments

blazing bison
#

and in the end gpt 5.2 use so many tokens to solve problems that opus still cheap

compact flame
#

Also what is this Gpt 5.2 extra high at yupp

#

Like where did they find Extra

cloud zinc
blazing bison
#

extra is for agentic use

#

useless in chat

compact flame
tawny canyon
#

guys should i pay for chatgpt or just use the gpt 5.2 in lmarena?

blazing bison
#

pay gpt, lmarena is 0 privacy

cloud zinc
#

pay

#

lmarena 0 privacy

compact flame
blazing bison
cloud zinc
blazing bison
blazing bison
#

😆

cloud zinc
tawny canyon
#

is the gpt 5.2 on lmarena same as the 1 in chatgpt?

compact flame
#

I wonder why LMarena didn't add this extra though or it's just expensive

compact flame
cloud zinc
#

extra-high is better than high, thats all it matters

blazing bison
golden ocean
#

is openai cooked

blazing bison
#

openai still the king

#

guys this jump is insane xD

compact flame
#

I mean OpenAi was the first one to give us AI to save some people from homework

cloud zinc
fiery gull
#

20$ per prompt, I'm so anxious 🙂

blazing bison
cloud zinc
fiery gull
cloud zinc
#

its just reasoning effort

#

lmarena could add it if they want

fiery gull
#

docs and text use

compact flame
#

Anyways imagine just buying chatgpt pro for purpose to do only homework

neon idol
#

Gemini 3.0 pro is better

cloud zinc
#

just trust me bro

blazing bison
compact flame
cloud zinc
compact flame
cloud zinc
#

api

fiery gull
compact flame
#

Is extra high on yupp is technically xhigh or nah

compact flame
fiery gull
fiery gull
swift oyster
#

Xhigh loaded, anyone want me to test a prompt?

compact flame
#

I only got 18k

fiery gull
fiery gull
#

later I'll see later how I get this point in money

compact flame
#

What's the highest amount of credits you got per prompt btw

fiery gull
compact flame
#

I think its based on how hard the prompt is

fiery gull
#

but we are in lmarena server calm down 🤣

compact flame
#

Oh yeah fair

#

Kinda got carried away

gentle pasture
#

i neve rused that

fiery gull
gentle pasture
#

bruuuh

#

1$ per message

#

or as a whole

fiery gull
#

14$ the output? and xhigh? 💀

#

the OpenAi is hungry

compact flame
#

Unless this extra high means it's on drugs

hushed gyro
#

Chat, tbh, I didn't find GPT 5.2 that good at reasoning and creative writing

native yarrow
proud bobcat
#

HOLY BLOATED MODEL

cloud zinc
#

gemini 3 still king on vision

native yarrow
#

gpt new model i bet isn't good i assume

balmy mist
#

like is it better than g3 or pus 4.5?

native yarrow
#

i haven't tested

balmy mist
#

me either, after g3 came out i stopped caring fr lol

zealous sparrow
swift oyster
compact flame
#

I mean all I can say OpenAi is good at video generating that all

swift oyster
#

for 5.2 pro

gentle pasture
#

god bless gemini 3

#

i cheat a lot in exams

zealous sparrow
fleet lintel
compact flame
fleet lintel
#

That plus user get 5.2 medium ?

gentle pasture
compact flame
#

Though I feel like

cloud zinc
swift oyster
# gentle pasture god bless gemini 3

OpenAI seems to have rushed the launch of this update, complete with steep pricing, in a bid to temporarily reclaim benchmark dominance. In my view, the core model is essentially GPT-5.1 with targeted post-training refinements focused on areas where Gemini 3 excelled and generated significant buzz on X.

fleet lintel
hardy lion
zealous sparrow
#

Im so hoping this doesn't happen

gentle pasture
compact flame
cloud zinc
gentle pasture
#

whats swe?

cloud zinc
#

coding benchmark

#

google model is most efficient

#

openai model are slow and clunky

swift oyster
hardy lion
#

I did a few of my favorite mathy rating systems questions side by side to gpt-5.1-high and gpt-5.2-high and they were either the same or a couple 5.1 was better. I was suprised

swift oyster
zealous sparrow
blazing bison
#

the base is gpt 4o

#

😆

swift oyster
zealous sparrow
sour spear
# cloud zinc openai model are slow and clunky

This. Gemini 3 is fast. GPT-5 is slow af and often stops mid-generation. Let alone the small context window. I preferred Gemini 2.5 Pro over GPT-5 already, simply because I could iterate so quickly.

blazing bison
#

i don't like gemini 3 bcs it just ignores your instructions

zealous sparrow
blazing bison
#

it ignores for me

zealous sparrow
#

Not for me

blazing bison
#

well, so keep using it

#

if it works for you, i'm happy

cloud zinc
#

not for me

blazing bison
#

each model is good at different use cases

cloud zinc
#

sure.. vibes

sour spear
# blazing bison it ignores for me

It does what I tell it to, but if my instructions are bad, it also tells me so quite bluntly, instead of hallucinating bs just to please me.

swift oyster
echo aurora
blazing bison
#

in my opinion openai still the best $20 deal

swift oyster
swift oyster
blazing bison
#

i think the model is better, my results are better with chatgpt idk, just a feeling

#

and i have all ai plans

swift oyster
blazing bison
#

nop

swift oyster
blazing bison
#

🤔

sour spear
blazing bison
#

i use my own

blazing bison
#

and yes i created a whole storage system in my home

#

😆

#

we have things like groq so, maybe

#

🤔

#

groq was not that some years ago?

#

but yeah, i don't trust random ppl in disc

keen beacon
#

Why trust any at all

#

Matter fact why even share

cloud zinc
#

i dont trust openai

echo aurora
#

@nocturne heath you'll want to review the information in #1397655624103493813 on how to properly use the Video Arena bot

cloud zinc
#

2TB cloud storage wow google

sweet tinsel
#

The MovementlabsAI stuff again, the cycle repeats itself.

#

@swift oyster I am still waiting on that e-mail, by the way.

swift oyster
#

There has been a major shift in our company, we are updating team page, we also have a new CEO. Please bare with us while we check the backlog. Thank you.

queen veldt
#

Yeah movementlabs might be legit company

#

We talked with them few days ago

#

I've had my doubts too

#

Okay why?

#

They aren't selling anything you can go and try the model yourself

#

You don't have to buy to try it

#

Proof for false advertisement?

vivid coral
#

Where is GPT 5.2 search? Is Sammy A. scared for us to test it? 🤔

queen veldt
#

Okay 😂🤣🤣👍

sweet tinsel
echo aurora
lusty tinsel
compact flame
#

Why do people keep requesting gpt pro models

balmy mist
compact flame
#

These don't have API

#

As I remember

whole sundial
#

they do, it's just very expensive

echo aurora
queen veldt
#

Not worth the price too

compact flame
queen veldt
#

Sharing this again

compact flame
#

I mean atp people just gotta buy pro subscription

#

Like there no way any free services gonna provide gpt pro models

queen veldt
#

It's crazy how the o3 was best back in the day

#

I remember that it was expensive and super powerful at that time now when i look at it at this chart it's funny

compact flame
#

But though damn gpt pro costs like a cheap laptop

#

That just gonna work for only a month

queen veldt
#

$10 for a task 😂😂

#

I'd hire some guy to do it instead

#

With plus sub on Chatgpt

compact flame
#

OpenAi be getting greedy these days

#

I mean I know they provide high quality models but still

queen veldt
#

THIS is greed

compact flame
spark python
queen veldt
#

Cached input whayever that is

compact flame
#

Also I check

#

Uhh free users don't have 5.2 in chatgpt

#

Even team sub

queen veldt
#

Not yet maybe

compact flame
#

Only like I've seen it in pro

#

OpenAi wants us to buy pro just to Access 5.2 frfr

queen veldt
#

It'll probably come in few days maybe they want their pro subscribers to have it first

#

So they can potentially make someone who's hyped to buy pro

compact flame
queen veldt
#

Well i guess for teams they want it to work with no issues since it's for businesses

balmy mist
#

just tried gpt-5.2 with my pokemon test and it was horrible, imma get it another chance cause wtf lol

queen veldt
#

Where people are not that smart

balmy mist
queen veldt
balmy mist
#

ik lmaoo like wtf

#

gemini 3 and opus 4.5 blows it out the park in that test so maybe it just not good with UI?

granite tide
#

try inputing "good ui" lol

neon idol
#

Result?

atomic lagoon
torn mantle
#

terrible

atomic lagoon
#

What

leaden meteor
#

So, all those gains in ARC-AGI-2 does not mean much in arena's vibe testing for gpt-5.2-high? I would have thought it atleast competes well with gemini 3 pro but everyone saying it is only as good as 5.1 on text arena?!

empty stump
#

I dont understand the upgrade

atomic lagoon
#

OpenAI likes doing updates on their models to upgrade like one thing at a time and its gonna kill them

leaden meteor
#

Yes, I am wondering more about its non-coding ability...

atomic lagoon
#

Okay cool, it would make sense if its still like 5.1

#

Because 5.1 was an AGI update

empty stump
#

ok its always coding and math but what about general use

atomic lagoon
empty stump
#

5.1 was disappointing

atomic lagoon
#

Sucks to suck then man, I don't know

leaden meteor
#

Yeah, but 5.2 ARC-AGI-2 is much higher than 5.1. And that does not seem to translate much to non-coding performance based on people opinion about 5.2/robin high here. Lot of peope here test on arena with puzzles and abstraction ability. So, I am surprised that even with such a high ARC-AGI-2, robin/5.2 is not sota on text arena....

atomic lagoon
#

They're testing their coding capabilities

#

It happens with every model

empty stump
#

like claude can be natural and coherent, and doesn't always tell you what you want to hear and its good and fast without reasoning even though it may struggle with deep reasoning tasks

queen veldt
#

Openai making confusions not with models only but also with versions too 😂

#

Guess what version is coming next

#

5.21o

#

Codex max super ultra x high

atomic lagoon
queen veldt
#

No i mean they made the models be confusing at first with the names

#

Nano mini regular low thinking extended thinking pro thinking

atomic lagoon
#

5.2?

#

Or that specific models hes talking about

#

5.2 is on lmarena

sour spear
atomic lagoon
#

Oh nvm

#

You said text arena mb lol

#

Yes its still on there

cloud zinc
#

source?

#

show us

torn mantle
#

idc anymore

oak granite
#

Hi

proud bobcat
#

It needed to think

#

I’m crying

pseudo summit
#

i dont think this is Google either

proud bobcat
#

Hot take

#

GPT 5.2 and 5.2 high are different models

#

Like completely different

#

I think 5.2 high is that garlic thing they were preparing

#

Also I’m not surprised with 5.2 honestly

#

It’s mediocre

cedar lotus
#

How to creat video

echo aurora
golden ocean
sour spear
#

Gemini 2.5 still going strong 💪👴

proud bobcat
#

GPT 5.2 is hella expensive as well for a gpt model

#

I don’t know what the use case will be

#

Cause coding and roleplaying are becoming favored for api usage

#

I think this model will mostly be a minor release regardless of the benchmarks

#

People would still rather pay for Gemini 3 and Opus

burnt sinew
#

is 5.2 good or nah

burnt sinew
#

with data sharing

sour spear
fiery gull
sour spear
fiery gull
#

Today I'll use 100% hard work in text and docs 🥱 and I'll talk who is better for me

#

Still the opus 4.5

sour spear
#

"Create a 3D voxel animation in JavaScript, depicting a pelican riding a bicycle."
Gemini 3 did a really neat job. And most importantly, it was fast.

burnt sinew
#

much more time is saved from quality

#

when you are coding

fiery gull
#

Imagine 2026 💀

#

Anxious...

burnt sinew
sour spear
fiery gull
echo aurora
fiery gull
#

Gta san andreas in first prompt 💀

proud bobcat
#

Not great

fiery gull
#

In 2 minutes lol

burnt sinew
#

and assets

proud bobcat
#

It would only have 50 errors

#

Dwdw

#

Ultimately in my eyes gpt is not a profitable model

fiery gull
burnt sinew
fiery gull
burnt sinew
#

let me check

fiery gull
burnt sinew
fiery gull
#

For me

burnt sinew
#

well its flappy bird 1:1 clone

neon idol
fiery gull
#

2026 will be fun, see a model creating a gta 3 in first prompt and you think it is bad because the gemini 4.0 ultra create a gta 5

proud bobcat
jade egret
#

gpt 5.2 good?

proud bobcat
#

Most likely benchmaxxed on some things

#

Some not

jade egret
#

oh

proud bobcat
#

It’s a fine model but it doesn’t really like excel at anything

#

So I think opus and gemini still rule

torn mantle
#

hmm